CS 856 - Web Data Management
M. Tamer Özsu
DC 3121, ext. 6847
tozsu@db.uwaterloo.ca
Course Objectives
The objective of the course is to study the new data management problems (and
solutions) that emerge in the Web environment. The course will quickly cover the
classical distribution issues and then it will focus on Web data management
Prerequisites
Good knowledge of database internals at least at the level of CS 448/648 and of
distributed computing/computer networks at the level of CS454 and 456
is required. Of course,
students are expected to have the maturity to deal with data structures
and algorithms.
Course Outline
The following is not the complete list and will be updated. Note that I will
only lecture on the first two topics, and then it will be the students who will
read and present material. Each of the following can and will be subdivided.
-
Introduction to distributed database management issues
-
Internet and the Web
-
Web caching
-
Web querying
-
Web data integration (and XML, of course)
-
Web mining
-
Peer-to-peer systems
-
Data distribution alternatives (broadcasting, publish/subscribe, etc)
-
Streaming data management
Textbooks and References
Principal reference:
-
There are no required textbooks, but the following can be helpful in
understanding the fundamental distributed database problems:
M.T. Özsu and P. Valduriez, Principles of Distributed Database
Systems, 2nd edition, Prentice-Hall, 1999.
Other useful references:
The following is not a complete list, but a starting point.
Workload and Evaluation
-
(20%) Two paper critiques to be chosen from conferences or journals
-
(20%) Presentation of one lecture or being a discussant on one topic. See the
guidelines.
-
(20%) Position papers. See the guidelines.
-
(40%) A term project that is an in-depth study of one problem. See the
guidelines.
There will be no tests or exams.