Publications

(2025). Last-iterate Convergence in Regularized Graphon Mean Field Game. Association for the Advancement of Artificial Intelligence (AAAI).
(2024). One Sample Fits All: Approximating All Probabilistic Values Simultaneously and Efficiently. Advances in Neural Information Processing Systems (NeurIPS).
(2024). Noise-Aware Aggregation for Heterogeneous Differentially Private Federated Learning. International Conference on Machine Learning (ICML).
(2024). Indiscriminate Data Poisoning Attacks on Pre-trained Feature Extractors. 2nd IEEE Conference on Secure and Trustworthy Machine Learning (SaTML).
(2024). Faster Approximation of Probabilistic and Distributional Values via Least Squares. International Conference on Learning Representations (ICLR).
(2024). Disguised Copyright Infringement of Latent Diffusion Models. International Conference on Machine Learning (ICML).
(2024). Convergence to Nash Equilibrium and No-regret Guarantee in (Markov) Potential Games. International Conference on Artificial Intelligence and Statistics (AISTATS).
(2023). Understanding Neural Network Binarization with Forward and Backward Proximal Quantizers. Advances in Neural Information Processing Systems (NeurIPS).
(2023). Robust Data Valuation with Weighted Banzhaf Values. Advances in Neural Information Processing Systems (NeurIPS).
(2023). Proportional Fairness in Federated Learning. Transactions on Machine Learning Research.
(2023). Operator Selection and Ordering in a Pipeline Approach to Efficiency Optimizations for Transformers. Findings of the Association for Computational Linguistics (ACL).
(2023). Multi-Objective Reinforcement Learning: Convexity, Stationarity and Pareto Optimality. International Conference on Learning Representations (ICLR).
(2023). Functional Rényi Differential Privacy for Generative Modeling. Advances in Neural Information Processing Systems (NeurIPS).
(2023). Exploring the Limits of Model-Targeted Indiscriminate Data Poisoning Attacks. International Conference on Machine Learning (ICML).
(2023). Distilling the Knowledge in Diffusion Models. CVPR workshop on Generative Models for Computer Vision.
(2023). CM-GAN: Stabilizing GAN Training with Consistency Models. ICML Workshop on Structured Probabilistic Inference & Generative Modeling.
(2023). Batchnorm Allows Unsupervised Radial Attacks. Advances in Neural Information Processing Systems (NeurIPS).
(2023). A Unifying Framework for Federated Learning. Federated and Transfer Learning.
(2023). $f$-MICL: Understanding and Generalizing InfoNCE-based Contrastive Learning. Transactions on Machine Learning Research.
(2022). Revisiting flow generative models for Out-of-distribution detection. International Conference on Learning Representations (ICLR).
(2022). Optimality and Stability in Non-Convex Smooth Games. Journal of Machine Learning Research.
(2022). Network Comparison with Interpretable Contrastive Network Representation Learning. Journal of Data Science, Statistics, and Visualisation.
(2022). Indiscriminate Data Poisoning Attacks on Neural Networks. Transactions on Machine Learning Research.
(2022). FedMGDA+: Federated Learning meets Multi-objective Optimization. IEEE Transactions on Network Science and Engineering.
(2022). Conditional Generative Quantile Networks via Optimal Transport. ICLR Workshop on Deep Generative Models for Highly Structured Data.
(2021). The Art of Abstention: Selective Prediction and Error Regularization for Natural Language Processing. The Joint Conference of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (ACL-IJCNLP).
(2021). S$^3$: Sign-Sparse-Shift Reparametrization for Effective Training of Low-bit Shift Networks. Advances in Neural Information Processing Systems (NeurIPS).
(2021). Quantifying and Improving Transferability in Domain Generalization. Advances in Neural Information Processing Systems (NeurIPS).
(2021). Posterior Differential Regularization with $f$-divergence for Improving Model Robustness. Proceedings of the North American Chapter of the Association for Computational Linguistics (NAACL).
(2021). Newton-type Methods for Minimax Optimization. ICML Workshop on Beyond First-Order Methods in ML Systems.
(2021). Demystifying and Generalizing BinaryConnect. Advances in Neural Information Processing Systems (NeurIPS).
(2021). BERxiT: Better-fine-tuned and Wider-applicable Early Exit for *BERT. The 16th Conference of the European Chapter of the Association for Computational Linguistics (EACL).
(2021). Are My Deep Learning Systems Fair? An Empirical Study of Fixed-Seed Training. Advances in Neural Information Processing Systems (NeurIPS).
(2020). Unsupervised Multilingual Alignment using Wasserstein Barycenters. International Joint Conference on Artificial Intelligence (IJCAI).
(2020). Tails of Lipschitz Triangular Flows. International Conference on Machine Learning (ICML).
(2020). Stronger and Faster Wasserstein Adversarial Attacks. International Conference on Machine Learning (ICML).
(2020). Showing Your Work Doesn't Always Work. Proceedings of the Association for Computational Linguistics (ACL).
(2020). Problems and Opportunities in Training Deep-Learning Software Systems: An Analysis of Variance. 35th IEEE/ACM International Conference on Automated Software Engineering (ASE).
(2020). On Minimax Optimality of GANs for Robust Mean Estimation. International Conference on Artificial Intelligence and Statistics (AISTATS).
(2020). Early Exiting BERT for Efficient Document Ranking. Proceedings of the First Workshop on Simple and Efficient Natural Language Processing (SustaiNLP 2020).
(2020). Density Deconvolution with Normalizing Flows. ICML Workshop on Invertible Neural Networks, Normalizing Flows, and Explicit Likelihood Models.
(2020). DeepAntigen: a novel method for neoantigen prioritization via 3D genome and deep sparse learning. Bioinformatics.
(2020). DeeBERT: Dynamic Early Exiting for Accelerating BERT Inference. Proceedings of the Association for Computational Linguistics (ACL).
(2020). Convex Representation Learning for Generalized Invariance in Semi-Inner-Product Space. International Conference on Machine Learning (ICML).
(2020). Convergence of Gradient Methods on Bilinear Zero-Sum Games. International Conference on Learning Representations (ICLR).
(2020). A novel neoantigen discovery approach based on chromatin high order conformation. BMC Med Genomics.
(2019). What Part of the Neural Network Does This? Understanding LSTMs by Measuring and Dissecting Neurons. Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP).
(2019). Understanding Adversarial Robustness: The Trade-off between Minimum and Average Margin. NeurIPS Workshop on Maching Learning with Guarantees.
(2019). Sum-of-squares Polynomial Flow. International Conference on Machine Learning (ICML).
(2019). Multivariate Triangular Quantile Maps for Novelty Detection. Advances in Neural Information Processing Systems (NeurIPS).
(2019). Least-Squares Estimation of Weakly Convex Functions. International Conference on Artificial Intelligence and Statistics (AISTATS).
(2019). A Penalized Regression Model for the Joint Estimation of eQTL Associations and Gene Network Structure. Annals of Applied Statistics.
(2018). Orpheus: Efficient Distributed Machine Learning via System and Algorithm Co-design. ACM Symposium on Cloud Computing (SoCC).
(2018). Inductive Two-Layer Modeling with Parametric Bregman Transfer. International Conference on Machine Learning (ICML).
(2018). Distributed Proximal Gradient Algorithm for Partially Asynchronous Computer Clusters. Journal of Machine Learning Research.
(2018). Deep Homogeneous Mixture Models: Representation, Separation and Approximation. Advances in Neural Information Processing Systems (NeurIPS).
(2017). Semantic Pooling for Complex Event Analysis in Untrimmed Videos. IEEE Transactions on Pattern Analysis and Machine Intelligence.
(2017). Robust Top-$k$ Multiclass SVM for Visual Category Recognition. ACM Conference on Knowledge Discovery and Data Mining (KDD).
(2017). Generalized Conditional Gradient for Sparse Estimation. Journal of Machine Learning Research.
(2017). Efficient Multiple Instance Metric Learning using Weakly Supervised Data. IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
(2017). Dropout with Expectation-Linear Regularization. International Conference on Learning Representations (ICLR).
(2017). Convex-constrained Sparse Additive Modeling and Its Extensions. Conference on Uncertainty in Artificial Intelligence (UAI).
(2017). Bregman Divergence for Stochastic Variance Reduction Methods: Adversarial Prediction and Saddle-Point Problems. Advances in Neural Information Processing Systems (NeurIPS).
(2017). Analyzable Diversity-Promoting Latent Space Models. International Conference on Machine Learning (ICML).
(2016). They Are Not Equally Reliable: Semantic Event Search using Differentiated Concept Classifiers. IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
(2016). Scalable and Sound Low-Rank Tensor Learning. International Conference on Artificial Intelligence and Statistics (AISTATS).
(2016). On Convergence of Model Parallel Proximal Gradient Algorithm for Stale Synchronous Parallel System. International Conference on Artificial Intelligence and Statistics (AISTATS).
(2016). Lighter-Communication Distributed Machine Learning via Sufficient Factor Broadcasting. Conference on Uncertainty in Artificial Intelligence (UAI).
(2016). Exact Algorithms for Isotonic Regression and Related. Journal of Physics: Conference Series.
(2016). Convex Two-Layer Modeling with Latent Structure. Advances in Neural Information Processing Systems (NeurIPS).
(2016). Closed-Form Training of Mahalanobis Distance for Supervised Clustering. IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
(2016). Additive Approximations in High Dimensional Nonparametric Regression via the SALSA. International Conference on Machine Learning (ICML).
(2015). Semantic Concept Discovery for Large-Scale Zero-Shot Event Detection. International Joint Conference on Artificial Intelligence (IJCAI).
(2015). Searching Persuasively: Joint Event Detection and Evidence Recounting with Limited Supervision. ACM Conference on Multimedia (MM).
(2015). Petuum: A New Platform for Distributed Machine Learning on Big Data. IEEE Transactions on Big Data.
(2015). Online Learning and Optimization. Encyclopedia of Algorithms.
(2015). Minimizing Nonconvex Non-Separable Functions. International Conference on Artificial Intelligence and Statistics (AISTATS).
(2015). Linear Time Samplers for Supervised Topic Models using Compositional Proposals. ACM Conference on Knowledge Discovery and Data Mining (KDD).
(2015). Complex Event Detection using Semantic Saliency and Nearly-Isotonic SVM. International Conference on Machine Learning (ICML).
(2014). Efficient Structured Matrix Rank Minimization. Advances in Neural Information Processing Systems (NeurIPS).
(2013). Polar Operators for Structured Sparse Estimation. Advances in Neural Information Processing Systems (NeurIPS).
(2013). On Decomposing the Proximal Map. Advances in Neural Information Processing Systems (NeurIPS).
(2013). Characterizing the Representer Theorem. International Conference on Machine Learning (ICML).
(2013). Better Approximation and Faster Algorithm Using the Proximal Average. Advances in Neural Information Processing Systems (NeurIPS).
(2012). Regularizers versus Losses for Nonlinear Dimensionality Reduction. International Conference on Machine Learning (ICML).
(2012). Convex Multi-view Subspace Learning. Advances in Neural Information Processing Systems (NeurIPS).
(2012). Analysis of Kernel Mean Matching under Covariate Shift. International Conference on Machine Learning (ICML).
(2012). Accelerated Training for Matrix-Norm Regularization: A Boosting Approach. Advances in Neural Information Processing Systems (NeurIPS).
(2012). A Polynomial-time Form of Robust Regression. Advances in Neural Information Processing Systems (NeurIPS).
(2011). Rank/Norm Regularization with Closed-Form Solutions: Application to Subspace Clustering. Conference on Uncertainty in Artificial Intelligence (UAI).
(2011). Distance Metric Learning by Minimal Distance Maximization. Pattern Recognition.
(2011). Convex Sparse Coding, Subspace Learning, and Semi-Supervised Extensions. Association for the Advancement of Artificial Intelligence (AAAI).
(2010). Relaxed Clipping: A Global Training Method for Robust Regression and Classification. Advances in Neural Information Processing Systems (NeurIPS).
(2009). Online TD(1) Meets Offline Monte Carlo. Multidisciplinary Symposium on Reinforcement Learning.
(2009). A General Projection Property for Distribution Families. Advances in Neural Information Processing Systems (NeurIPS).
(2009). A Conditional Value-at-Risk Approach for Uncertain Markov Decision Processes. Multidisciplinary Symposium on Reinforcement Learning.