PhD Seminar • Cryptography, Security, and Privacy (CrySP) | Artificial Intelligence • SynQP: A Framework and Metrics for Evaluating the Quality and Privacy Risk of Synthetic Data

Monday, October 20, 2025 1:00 pm - 2:00 pm EDT (GMT -04:00)

Please note: This PhD seminar will take place in DC 3317 and online.

Bing Hu, PhD candidate
David R. Cheriton School of Computer Science

Supervisors: Professors Helen Chen, Anita Layton

The use of synthetic data in health applications raises privacy concerns, yet the lack of open frameworks for privacy evaluations has slowed its adoption. A major challenge is the absence of accessible benchmark datasets for evaluating privacy risks, due to difficulties in acquiring sensitive data.

To address this, we introduce SynQP, an open framework for benchmarking privacy in synthetic data generation (SDG) using simulated sensitive data, ensuring that original data remains confidential. We also highlight the need for privacy metrics that fairly account for the probabilistic nature of machine learning models. As a demonstration, we use SynQP to benchmark CTGAN and propose a new identity disclosure risk metric that offers a more accurate estimation of privacy risks compared to existing approaches. Our work provides a critical tool for improving the transparency and reliability of privacy evaluations, enabling safer use of synthetic data in health-related applications.


To attend this PhD seminar in person, please go to DC 3317. You can also attend virtually on MS Teams.