Seminar • Bioinformatics • Scalable Representation Learning for Multi-View, Multi-Condition Single-Cell Data

Monday, March 2, 2026 10:30 am - 11:30 am EST (GMT -05:00)

Please note: This seminar will take place online.

Ziqi Zhang, PhD
School of Computational Science and Engineering, Georgia Institute of Technology

Single-cell sequencing technology produces large-scale, high-dimensional, multi-view datasets that measure molecules (e.g., mRNA, proteins) across each target cell within the bio-sample of interest. These data are noisy, sparse, and heterogeneous, presenting core data-science challenges in representation learning and knowledge transfer across domains and conditions.

In this talk I present my machine-learning solutions to address these computational challenges. First, I introduce my approach for learning robust cell representations by integrating information across multiple data domains. Next, I describe my method for modeling multi-conditional data and predicting cellular responses to external stimuli. In the end, I extend these ideas to large-scale data atlases and introduce an efficient single-cell foundation model that leverages pretraining on massive single-cell datasets to achieve improved generalization and strong performance across a wide range of downstream tasks.

I close by outlining future directions toward clinical impact: modeling heterogeneity at the levels of cells, cell types, and patients, and using pretrained, interpretable models to predict patient-level responses to treatments. More broadly, this work contributes to core machine learning challenges in multi-view representation learning, conditional generative modeling, and structured knowledge retrieval from high-dimensional, multimodal data.


Bio: Ziqi Zhang received his Ph.D. from the School of Computational Science and Engineering at the Georgia Institute of Technology, where he was advised by Xiuwei Zhang. His research focuses on developing machine-learning methods to study cellular mechanisms using high-throughput, multi-modal single-cell data.

His work has been published in leading computational biology journals, including Nature Communications, Nature Methods, and Genome Biology, and has been supported by the U.S. National Science Foundation (NSF) and the National Institutes of Health (NIH). Ziqi has also collaborated closely with wet-lab researchers at Emory University and gained industry experience at Genentech, Inc., where he conducted computational cancer data analysis and translated methodological advances into real-world clinical applications.