I am a Research Scientist at Allen Institute for Artificial Intelligence (AI2) working on Natural Langauge Processing with focus on scientific documents. I received my PhD in Computer Science from the University of Texas at Austin working with Ray Mooney and Katrin Erk.
Longformer - a BERT-like model for long documents. Code and Pretrained model
SPECTER - a citation-informed embedding model for scintific documents. Code, Data and Pretrained Model
SciSpacy - a Spacy pipeline for scientific documents. Code
SciBERT - a BERT model for scientific documents. Code, Data, and Pretrained model