Master’s Thesis Presentation • Data Systems — Iterative Edit-based Unsupervised Sentence Simplification

Wednesday, July 22, 2020 1:30 pm - 1:30 pm EDT (GMT -04:00)

Please note: This master’s thesis presentation will be given online.

Dhruv Kumar, Master’s candidate
David R. Cheriton School of Computer Science

We present a new iterative approach towards unsupervised edit-based sentence simplification. Our approach is guided by a scoring function to select simplified sentences generated after iteratively performing word and phrase-level edits on the complex sentence. The scoring function measures different aspects of simplification: fluency, simplicity, and preservation of meaning. As a result, unlike past approaches, our method is controllable and interpretable and does not require a parallel training set since it is unsupervised. At the same time, using the Newsela and WikiLarge datasets, we experimentally show that our solution is nearly as effective as state-of-the-art supervised approaches.