| Jan, 2026 | Collaborative Gym: A Framework for Enabling and Evaluating Human-Agent Collaboration accepted at ICLR 2026. |
| Nov, 2025 | Towards Data Contamination Detection for Modern Large Language Models: Limitations, Inconsistencies, and Oracle Challenges accepted at COLING 2025. |
| Sep, 2025 | Started at UMD College Park. |
| Aug, 2025 | PersonaGym: Evaluating Persona Agents and LLMs accepted at Findings of EMNLP 2025. |
| Aug, 2025 | CIE: Controlling Language Model Text Generations Using Continuous Signals accepted at EMNLP 2025. |
| Jul, 2025 | NOVELTYBENCH: Evaluating Language Models for Humanlike Diversity accepted at COLM 2025. |
| Jul, 2024 | Can LLMs Augment Low-Resource Reading Comprehension Datasets? Opportunities and Challenges accepted at ACL SRW 2024. |
| Jun, 2024 | ImplicitAVE: An Open-Source Dataset and Multimodal LLMs Benchmark for Implicit Attribute Value Extraction accepted at Findings of ACL 2024. |