Thursday, October 9, 2025 2:30 pm
-
3:30 pm
EDT (GMT -04:00)
Please note: This PhD seminar will take place online.
Ronak Pradeep, PhD candidate
David R. Cheriton School of Computer Science
Supervisor: Professor Jimmy Lin
Large Language Models have greatly improved information access systems, especially with retrieval-augmented generation (RAG). However, evaluating RAG systems remains a challenge. We address this by introducing AutoNuggetizer, an automatic evaluation framework that uses LLMs to generate and assign “nuggets”, factual information units, to system answers. Our experiments show strong agreement between automatic scores and human-based evaluations, especially when specific components like nugget assignment are automated. These results pave the path for more reliable and scalable RAG evaluation and beyond.