I got PhD from the Information Retrieval group and Data System group of the David R. Cheriton School of Computer Science at the University of Waterloo. My supervisors are Mark D. Smucker and Gordon V. Cormack. I am also working closely with Jimmy Lin, Maura Grossman, and Charlie Clarke.
My broad research interests include Information Retrieval, NLP, and Machine Learning (especially active learning and deep learning). My core research is building a High-Recall-Information-Retrieval (HiCAL) system to help users find all or nearly all relevant information more efficiently and effectively[4,5,6,11,12,14,15,16,18,19]. Another part of my research is understanding user behavior to improve the quality of ranking. I am also working on deep neural network and trying to apply it on solving Question Answering and Ad-Hoc search problems [3,8,9,10,17].
Before studying at University of Waterloo, I studied at Harbin Institute of Technology, China. I have studied and/or worked in Canada, USA, France, and Italy.
Machine Learning Engineer at Wish, Toronto. May.2019 - Present
Software Engineer Intern at Wish, Toronto. May.2018 - Aug.2018Product Boost - Ads prediction and ranking. Model design and evaluation on recommendation system. Spam products detection.
Research Intern at Oracle Lab, Boston, USA. Jun.2016 - Aug.2016Learning to rank for eCommerce Search.
Software Engineer Intern at Adobe Systems (Beijing). Oct.2011 - July.2012
Successfully pass PhD defence. Thanks to my advisors and PhD committee members. April 10, 2019
Presented our high recall work  at CIKM 2018, Italy. Oct, 2018.
The code for High-Recall Information Retrieval system (HiCAL) is now public: HiCAL. July, 2018.
Join Wish as a software engineer intern. Working on machine learning models for solving eCommerce problems.Toronto, Apirl, 2018.
Pass the PhD Comp2 examination [proposal], now a PhD candidate. April, 2018.
Present user requery behaviour work  at CHIIR 2018, New Jersey. Mar, 2018.
Present High-Recall-Information-Retrieval work with Google Cloud Team [Mountain View]. Dec, 2017.
 "Increasing the Efficiency of High-Recall Information Retrieval", PhD Thesis
 "Dynamic Sampling Meets Pooling", SIGIR 2019
 "Simple Applications of BERT for Ad Hoc Document Retrieval", 2019
 "Effective User Interaction for High-Recall Retrieval: Less is More", CIKM 2018
Haotian Zhang, Mustafa Abualsaud, Nimesh Ghelani, Mark Smucker, Gordon Cormack and Maura Grossman
 "Evaluating Sentence-Level Relevance Feedback for High-Recall Information Retrieval", IRJ
Haotian Zhang, Gordon Cormack, Maura Grossman and Mark Smucker
 "UWaterlooMDS at the TREC 2017 Common Core Track", TREC 2017
Haotian Zhang, Mustafa Abualsaud, Nimesh Ghelani, Angshuman Ghosh, Mark Smucker, Gordon Cormack and Maura Grossman
 "Integrating Lexical and Temporal Signals in Neural Ranking Models for Searching Social Media Streams", SIGIR Neu-IR 2017
Jinfeng Rao, Hua He, Haotian Zhang, Ferhan Ture, Royal Sequiera, Salman Mohammed, and Jimmy Lin
 "Exploring the Effectiveness of Convolutional Neural Networks for Answer Selection in End-to-End Question Answering", SIGIR Neu-IR 2017
Royal Sequiera, Gaurav Baruah, Zhucheng Tu, Salman Mohammed, Jinfeng Rao, Haotian Zhang, and Jimmy Lin.
Robin Cohen, Alan Tsang, Krishna Vaidyanathan, and Haotian Zhang
Gaurav Baruah, Haotian Zhang, Rakesh Guttikonda, Jimmy Lin, Mark D. Smucker and Olga Vechtomova
Haotian Zhang, Jimmy Lin, Gordon Cormack, Mark Smucker
Haotian Zhang and Shu Liu
A System for Efficient High-Recall Retrieval.Code for the HiCAL
Castorini: Deep Neural Network Frameworks for Question AnsweringCode for the Castorini
Continous Active Learning for TREC Total Recall (BMI) 2015Code for the local version of BMI implementation