DBRank 2009
Location: VIP meeting room

14:00 - 15:00 Keynote

"Weighted Set Similarity: Queries and Updates"

Dr. Divesh Srivastava
AT&T Labs-Research

Abstract: Consider a universe of items, each of which is associated with a weight, and a database consisting of subsets of these items. Given a query set, a weighted set similarity query identifies either (i) all sets in the database whose cosine similarity to the query set is above a pre-specified threshold, or (ii) the sets in the database with the k highest similarity values to the query set. Weighted set similarity queries are useful in applications like data cleaning and integration for finding approximate matches in the presence of typographical mistakes, multiple formatting conventions, transformation errors, etc. We show that this problem has semantic properties that can be exploited to design index structures that support efficient algorithms for answering queries; these algorithms can achieve arbitrarily stronger pruning than the family of Threshold Algorithms. We describe how these index structures can be efficiently updated using lazy propagation in a way that gives strict guarantees on the quality of subsequent query answers. Finally, we illustrate that our proposed ideas work well in practice for real datasets.
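To make the two query types in the abstract concrete, here is a minimal brute-force sketch in Python. It is not the index-based method presented in the talk: it scans every set in the database. It also assumes one common reading of weighted cosine similarity, in which each set is treated as a vector holding the item's weight for every item it contains. The names `weighted_cosine`, `threshold_query`, `top_k_query`, and the sample data are hypothetical and chosen only for illustration.

```python
import heapq
import math


def weighted_cosine(a, b, weight):
    """Cosine similarity of two item sets (assumed definition).

    Each set is viewed as a vector whose entry for item i is weight[i]
    if i is in the set and 0 otherwise.
    """
    dot = sum(weight[i] ** 2 for i in a & b)
    norm_a = math.sqrt(sum(weight[i] ** 2 for i in a))
    norm_b = math.sqrt(sum(weight[i] ** 2 for i in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # empty or zero-weight sets match nothing
    return dot / (norm_a * norm_b)


def threshold_query(database, query, weight, tau):
    """Query type (i): names of all sets whose similarity exceeds tau."""
    return [
        name
        for name, items in database.items()
        if weighted_cosine(items, query, weight) > tau
    ]


def top_k_query(database, query, weight, k):
    """Query type (ii): the k sets most similar to the query, best first."""
    scored = (
        (weighted_cosine(items, query, weight), name)
        for name, items in database.items()
    )
    return [(name, sim) for sim, name in heapq.nlargest(k, scored)]


if __name__ == "__main__":
    # Hypothetical token weights: rarer tokens carry more weight.
    weight = {"main": 1.0, "st": 0.5, "street": 0.5,
              "springfield": 2.0, "shelbyville": 2.0}
    database = {
        "r1": {"main", "st", "springfield"},
        "r2": {"main", "street", "springfield"},
        "r3": {"main", "st", "shelbyville"},
    }
    query = {"main", "st", "springfield"}
    print(threshold_query(database, query, weight, tau=0.9))
    print(top_k_query(database, query, weight, k=2))
```

In this toy data the records "r1" and "r2" differ only in a low-weight token ("st" versus "street"), which is the kind of formatting variation the abstract mentions for data cleaning. A linear scan like this costs one similarity computation per database set, which is the cost that index structures with pruning aim to avoid.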

Speaker's Bio: Divesh Srivastava is the head of Database Research at AT&T Labs-Research. He received his Ph.D. from the University of Wisconsin, Madison, and his Bachelor of Technology from the Indian Institute of Technology, Bombay, India. His current research interests include data quality, data streams, and data privacy.

15:00 - 15:30 Coffee break
15:30 - 17:30 Research Session

Regular papers (30 min each)

  • Incremental Reverse Nearest Neighbor Ranking
    Hans-Peter Kriegel (LMU Munich), Peer Kröger (LMU Munich), Matthias Renz (LMU Munich), Andreas Züfle (LMU Munich), Alexander Katzdobler (LMU Munich)

  • Continuous Skylining on Volatile Moving Data
    Mu-Woong Lee (Pohang University of Science and Technology, Korea), Seung-won Hwang (Pohang University of Science and Technology, Korea)

Short papers (15 min each)

  • Ranking of Object Summaries
    Georgios Fakas (Manchester Metropolitan University), Zhi Cai (Manchester Metropolitan University)

  • Improving the Effectiveness of XML Retrieval with User Navigation Models
    Sadek Ali (University of Toronto), Mariano Consens (University of Toronto), Bassam Helou (University of Toronto)

  • Visualized Elucidations of Ranking by Exploiting Object Relations
    Xinpeng Zhang (Kyoto University), Yasuhito Asano (Kyoto University), Masatoshi Yoshikawa (Kyoto University)