Publications

(2025). Stochastic Forward-Backward Deconvolution: Training Diffusion Models with Finite Noisy Datasets. International Conference on Machine Learning (ICML).
(2025). Leveraging Variable Sparsity to Refine Pareto Stationarity in Multi-Objective Optimization. International Conference on Learning Representations (ICLR).
(2025). Diffusion Models under Group Transformations. International Conference on Artificial Intelligence and Statistics (AISTATS).
(2025). Last-iterate Convergence in Regularized Graphon Mean Field Game. Association for the Advancement of Artificial Intelligence (AAAI).
(2024). Disguised Copyright Infringement of Latent Diffusion Models. International Conference on Machine Learning (ICML).
(2024). Noise-Aware Aggregation for Heterogeneous Differentially Private Federated Learning. International Conference on Machine Learning (ICML).
(2024). Indiscriminate Data Poisoning Attacks on Pre-trained Feature Extractors. 2nd IEEE Conference on Secure and Trustworthy Machine Learning (SaTML).
(2024). Convergence to Nash Equilibrium and No-regret Guarantee in (Markov) Potential Games. International Conference on Artificial Intelligence and Statistics (AISTATS).
(2024). Faster Approximation of Probabilistic and Distributional Values via Least Squares. International Conference on Learning Representations (ICLR).
(2024). One Sample Fits All: Approximating All Probabilistic Values Simultaneously and Efficiently. Advances in Neural Information Processing Systems (NeurIPS).
(2023). Robust Data Valuation with Weighted Banzhaf Values. Advances in Neural Information Processing Systems (NeurIPS).
(2023). Understanding Neural Network Binarization with Forward and Backward Proximal Quantizers. Advances in Neural Information Processing Systems (NeurIPS).
(2023). Batchnorm Allows Unsupervised Radial Attacks. Advances in Neural Information Processing Systems (NeurIPS).
(2023). $f$-MICL: Understanding and Generalizing InfoNCE-based Contrastive Learning. Transactions on Machine Learning Research.
(2023). CM-GAN: Stabilizing GAN Training with Consistency Models. ICML Workshop on Structured Probabilistic Inference & Generative Modeling.
(2023). Exploring the Limits of Model-Targeted Indiscriminate Data Poisoning Attacks. International Conference on Machine Learning (ICML).
(2023). Functional Rényi Differential Privacy for Generative Modeling. Advances in Neural Information Processing Systems (NeurIPS).
(2023). Operator Selection and Ordering in a Pipeline Approach to Efficiency Optimizations for Transformers. Findings of the Association for Computational Linguistics (ACL).
(2023). Distilling the Knowledge in Diffusion Models. CVPR workshop on Generative Models for Computer Vision.
(2023). Multi-Objective Reinforcement Learning: Convexity, Stationarity and Pareto Optimality. International Conference on Learning Representations (ICLR).
(2023). Proportional Fairness in Federated Learning. Transactions on Machine Learning Research.
(2022). Indiscriminate Data Poisoning Attacks on Neural Networks. Transactions on Machine Learning Research.
(2022). A Unifying Framework for Federated Learning. Federated and Transfer Learning.
(2022). Network Comparison with Interpretable Contrastive Network Representation Learning. Journal of Data Science, Statistics, and Visualisation.
(2022). FedMGDA+: Federated Learning meets Multi-objective Optimization. IEEE Transactions on Network Science and Engineering.
(2022). Revisiting flow generative models for Out-of-distribution detection. International Conference on Learning Representations (ICLR).
(2022). Conditional Generative Quantile Networks via Optimal Transport. ICLR Workshop on Deep Generative Models for Highly Structured Data.
(2022). Optimality and Stability in Non-Convex Smooth Games. Journal of Machine Learning Research.
(2021). S$^3$: Sign-Sparse-Shift Reparametrization for Effective Training of Low-bit Shift Networks. Advances in Neural Information Processing Systems (NeurIPS).
(2021). Quantifying and Improving Transferability in Domain Generalization. Advances in Neural Information Processing Systems (NeurIPS).
(2021). Demystifying and Generalizing BinaryConnect. Advances in Neural Information Processing Systems (NeurIPS).
(2021). Are My Deep Learning Systems Fair? An Empirical Study of Fixed-Seed Training. Advances in Neural Information Processing Systems (NeurIPS).
(2021). The Art of Abstention: Selective Prediction and Error Regularization for Natural Language Processing. The Joint Conference of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (ACL-IJCNLP).
(2021). Newton-type Methods for Minimax Optimization. ICML Workshop on Beyond First-Order Methods in ML Systems.
(2021). Posterior Differential Regularization with $f$-divergence for Improving Model Robustness. Proceedings of the North American Chapter of the Association for Computational Linguistics (NAACL).
(2021). BERxiT: Better-fine-tuned and Wider-applicable Early Exit for *BERT. The 16th Conference of the European Chapter of the Association for Computational Linguistics (EACL).
(2021). Problems and Opportunities in Training Deep-Learning Software Systems: An Analysis of Variance. 35th IEEE/ACM International Conference on Automated Software Engineering (ASE).
(2021). Unsupervised Multilingual Alignment using Wasserstein Barycenters. International Joint Conference on Artificial Intelligence (IJCAI).
(2020). Early Exiting BERT for Efficient Document Ranking. Proceedings of the First Workshop on Simple and Efficient Natural Language Processing (SustaiNLP 2020).
(2020). DeepAntigen: a novel method for neoantigen prioritization via 3D genome and deep sparse learning. Bioinformatics.
(2020). A novel neoantigen discovery approach based on chromatin high order conformation. BMC Med Genomics.
(2020). Density Deconvolution with Normalizing Flows. ICML Workshop on Invertible Neural Networks, Normalizing Flows, and Explicit Likelihood Models.
(2020). Showing Your Work Doesn't Always Work. Proceedings of the Association for Computational Linguistics (ACL).
(2020). DeeBERT: Dynamic Early Exiting for Accelerating BERT Inference. Proceedings of the Association for Computational Linguistics (ACL).
(2020). Tails of Lipschitz Triangular Flows. International Conference on Machine Learning (ICML).
(2020). On Minimax Optimality of GANs for Robust Mean Estimation. International Conference on Artificial Intelligence and Statistics (AISTATS).
(2020). Convex Representation Learning for Generalized Invariance in Semi-Inner-Product Space. International Conference on Machine Learning (ICML).
(2020). Stronger and Faster Wasserstein Adversarial Attacks. International Conference on Machine Learning (ICML).
(2019). Convergence of Gradient Methods on Bilinear Zero-Sum Games. International Conference on Learning Representations (ICLR).
(2019). Least-Squares Estimation of Weakly Convex Functions. International Conference on Artificial Intelligence and Statistics (AISTATS).
(2019). A Penalized Regression Model for the Joint Estimation of eQTL Associations and Gene Network Structure. Annals of Applied Statistics.
(2019). What Part of the Neural Network Does This? Understanding LSTMs by Measuring and Dissecting Neurons. Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP).
(2019). Sum-of-squares Polynomial Flow. International Conference on Machine Learning (ICML).
(2019). Multivariate Triangular Quantile Maps for Novelty Detection. Advances in Neural Information Processing Systems (NeurIPS).
(2018). Orpheus: Efficient Distributed Machine Learning via System and Algorithm Co-design. ACM Symposium on Cloud Computing (SoCC).
(2018). Inductive Two-Layer Modeling with Parametric Bregman Transfer. International Conference on Machine Learning (ICML).
(2018). Distributed Proximal Gradient Algorithm for Partially Asynchronous Computer Clusters. Journal of Machine Learning Research.
(2018). Deep Homogeneous Mixture Models: Representation, Separation and Approximation. Advances in Neural Information Processing Systems (NeurIPS).
(2017). Bregman Divergence for Stochastic Variance Reduction Methods: Adversarial Prediction and Saddle-Point Problems. Advances in Neural Information Processing Systems (NeurIPS).
(2017). Efficient Multiple Instance Metric Learning using Weakly Supervised Data. IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
(2017). Convex-constrained Sparse Additive Modeling and Its Extensions. Conference on Uncertainty in Artificial Intelligence (UAI).
(2017). Robust Top-$k$ Multiclass SVM for Visual Category Recognition. ACM Conference on Knowledge Discovery and Data Mining (KDD).
(2017). Learning Latent Space Models with Angular Constraints. International Conference on Machine Learning (ICML).
(2017). Dropout with Expectation-Linear Regularization. International Conference on Learning Representations (ICLR).
(2017). Generalized Conditional Gradient for Sparse Estimation. Journal of Machine Learning Research (JMLR).
(2016). They Are Not Equally Reliable: Semantic Event Search using Differentiated Concept Classifiers. IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
(2016). Closed-Form Training of Mahalanobis Distance for Supervised Clustering. IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
(2016). Convex Two-Layer Modeling with Latent Structure. Advances in Neural Information Processing Systems (NeurIPS).
(2016). Semantic Pooling for Complex Event Analysis in Untrimmed Videos. IEEE Transactions on Pattern Analysis and Machine Intelligence.
(2016). Scalable and Sound Low-Rank Tensor Learning. International Conference on Artificial Intelligence and Statistics (AISTATS).
(2016). On Convergence of Model Parallel Proximal Gradient Algorithm for Stale Synchronous Parallel System. International Conference on Artificial Intelligence and Statistics (AISTATS).
(2016). Additive Approximations in High Dimensional Nonparametric Regression via the SALSA. International Conference on Machine Learning (ICML).
(2016). Lighter-Communication Distributed Machine Learning via Sufficient Factor Broadcasting. Conference on Uncertainty in Artificial Intelligence (UAI).
(2016). Online Learning and Optimization. Encyclopedia of Algorithms.
(2015). Exact Algorithms for Isotonic Regression and Related. Journal of Physics: Conference Series.
(2015). Searching Persuasively: Joint Event Detection and Evidence Recounting with Limited Supervision. ACM Conference on Multimedia (MM).
(2015). Petuum: A New Platform for Distributed Machine Learning on Big Data. IEEE Transactions on Big Data.
(2015). Linear Time Samplers for Supervised Topic Models using Compositional Proposals. ACM Conference on Knowledge Discovery and Data Mining (KDD).
(2015). Semantic Concept Discovery for Large-Scale Zero-Shot Event Detection. International Joint Conference on Artificial Intelligence (IJCAI).
(2015). Complex Event Detection using Semantic Saliency and Nearly-Isotonic SVM. International Conference on Machine Learning (ICML).
(2015). Minimizing Nonconvex Non-Separable Functions. International Conference on Artificial Intelligence and Statistics (AISTATS).
(2014). Efficient Structured Matrix Rank Minimization. Advances in Neural Information Processing Systems (NeurIPS).
(2013). Polar Operators for Structured Sparse Estimation. Advances in Neural Information Processing Systems (NeurIPS).
(2013). Better Approximation and Faster Algorithm Using the Proximal Average. Advances in Neural Information Processing Systems (NeurIPS).
(2013). On Decomposing the Proximal Map. Advances in Neural Information Processing Systems (NeurIPS).
(2013). Characterizing the Representer Theorem. International Conference on Machine Learning (ICML).
(2012). Convex Multi-view Subspace Learning. Advances in Neural Information Processing Systems (NeurIPS).
(2012). Accelerated Training for Matrix-Norm Regularization: A Boosting Approach. Advances in Neural Information Processing Systems (NeurIPS).
(2012). A Polynomial-time Form of Robust Regression. Advances in Neural Information Processing Systems (NeurIPS).
(2012). Regularizers versus Losses for Nonlinear Dimensionality Reduction. International Conference on Machine Learning (ICML).
(2012). Analysis of Kernel Mean Matching under Covariate Shift. International Conference on Machine Learning (ICML).
(2011). Convex Sparse Coding, Subspace Learning, and Semi-Supervised Extensions. Association for the Advancement of Artificial Intelligence (AAAI).
(2011). Distance Metric Learning by Minimal Distance Maximization. Pattern Recognition.
(2011). Rank/Norm Regularization with Closed-Form Solutions: Application to Subspace Clustering. Conference on Uncertainty in Artificial Intelligence (UAI).
(2010). Relaxed Clipping: A Global Training Method for Robust Regression and Classification. Advances in Neural Information Processing Systems (NeurIPS).
(2009). Online TD(1) Meets Offline Monte Carlo. Multidisciplinary Symposium on Reinforcement Learning.
(2009). A General Projection Property for Distribution Families. Advances in Neural Information Processing Systems (NeurIPS).
(2009). A Conditional Value-at-Risk Approach for Uncertain Markov Decision Processes. Multidisciplinary Symposium on Reinforcement Learning.