Biography
Associate Professor specializing in NLP and large language models. PhD from Carnegie Mellon 2015. 90+ publications, 12,000+ citations.
Publications (3)
Transformer Architecture Scaling Laws for Scientific Text Classification
Journal of Artificial Intelligence Research
- Volume 13, Issue 1 — January 2024 (2024)
- Research Article
Author
Corresponding Author
We investigate scaling laws for transformer-based language models applied to scientific literature classification. Our experiments across 12 benchmark datasets demonstrate that model performance scales predictably with parameter count and training…
Research Article
DOI
Open Access
Peer Reviewed
Reinforcement Learning from Human Feedback: A Systematic Review
Journal of Artificial Intelligence Research
- Volume 12, Issue 4 — October 2023 (Special: LLMs) (2023)
- Review Article
Author
This systematic review examines 147 studies on reinforcement learning from human feedback (RLHF) published between 2017 and 2024. We identify key methodological trends, open challenges in reward modeling, and propose…
Review Article
DOI
Open Access
Peer Reviewed
Emergent Reasoning in Large Language Models: A Case Study on STEM Problem Solving
Journal of Artificial Intelligence Research
- Volume 13, Issue 1 — January 2024 (2024)
- Case Study
Author
We present a detailed case study examining emergent reasoning capabilities in GPT-4 class models on multi-step STEM problems. Using a novel evaluation framework comprising 2,400 problems across physics, chemistry, and…
Case Study
DOI
Open Access
Peer Reviewed
Statistics
3
Publications
0
Journals
0
Version Contributions
Contact Information
Email:
ORCID:
Affiliation:
MIT Laboratory for Artificial Intelligence
Website: