Please note: This master’s thesis presentation will take place online.
Amir David, Master’s candidate
David R. Cheriton School of Computer Science
Supervisor: Professor Florian Kerschbaum
As large language models (LLMs) become ubiquitous, reliably distinguishing their outputs from human writing is critical for academic integrity, content moderation, and preventing model collapse from synthetic training data. This thesis examines the generalizability of LLM-text detectors across evolving model families and domains. We compiled a comprehensive evaluation dataset from commonly used human corpora and generated corresponding samples using recent OpenAI and Anthropic models spanning multiple generations. Comparing the state-of-the-art zero-shot detector (Binoculars) against supervised RoBERTa/DeBERTa classifiers, we arrive at four main findings. First, zero-shot detection fails on newer models. Second, supervised detectors maintain a high true positive rate (TPR) in distribution but exhibit asymmetric cross-generation transfer. Third, commonly reported metrics such as the area under the ROC curve (AUROC) can obscure poor performance at deployment-relevant thresholds: detectors achieving high AUROC yield near-zero TPR at low false positive rates (FPR), and existing low-FPR evaluations often lack statistical reliability due to small sample sizes. Fourth, through tail-focused training and calibration, we reduce FPR by up to 4× (from ∼1% to ∼0.25%) while maintaining 90% TPR. Our results suggest that robust detection requires continually recalibrated, model-aware pipelines rather than static universal detectors.
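As a minimal illustration of the low-FPR evaluation point (not taken from the thesis), the Python sketch below shows how TPR at a fixed false positive rate can be read off a detector's ROC curve, and how a detector with high AUROC can still achieve very low TPR at 1% FPR. The score distributions, sample sizes, and the `tpr_at_fpr` helper are hypothetical and only assume binary labels with real-valued detector scores.

```python
# Illustrative sketch, not from the thesis: evaluating a detector at a fixed low
# false positive rate (FPR), the deployment-relevant operating point that the
# abstract argues AUROC can obscure. Labels: 1 = LLM-generated, 0 = human.
import numpy as np
from sklearn.metrics import roc_auc_score, roc_curve

def tpr_at_fpr(labels, scores, target_fpr=0.01):
    """Highest true positive rate achievable while keeping FPR <= target_fpr."""
    fpr, tpr, _ = roc_curve(labels, scores)
    feasible = fpr <= target_fpr
    return float(tpr[feasible].max()) if feasible.any() else 0.0

# Synthetic scores: human texts mostly score low, but a small heavy tail overlaps
# the LLM distribution, so a high-AUROC detector can still have low TPR at 1% FPR.
rng = np.random.default_rng(0)
human = np.where(rng.random(50_000) < 0.02,
                 rng.normal(5.0, 1.0, 50_000),   # hard-to-classify human tail
                 rng.normal(0.0, 1.0, 50_000))   # typical human texts
llm = rng.normal(3.0, 1.0, 50_000)               # LLM-generated texts
labels = np.concatenate([np.zeros_like(human), np.ones_like(llm)])
scores = np.concatenate([human, llm])

print(f"AUROC:        {roc_auc_score(labels, scores):.3f}")    # ~0.96
print(f"TPR @ 1% FPR: {tpr_at_fpr(labels, scores, 0.01):.3f}")  # ~0.02
```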