Introduction and architectural issues
- Chapter 1 of the textbook
- Lecture notes 0
- Lecture notes 1
Data distribution/distributed query processing
- Chapters 3, 6, and 7 of the textbook
- Lecture notes 2
- Lecture notes 3
Distributed query optimization
- Chapter 8 of the textbook
- Lecture notes 4
Transactions and Distributed Concurrency Control
- Chapters 10 and 11 of the textbook
- Lecture notes 5
Distributed Database Reliability
- Chapter 12 of the textbook
- Lecture notes 6
Database Replication
- Chapter 13 of the textbook
- Lecture notes 7
Database Replication
- Chapter 13 of the textbook
Parallel data management
- Chapter 14 of the textbook
- Lecture notes 8
Database integration
- Chapter 4 of the textbook
- Lecture notes 9a
Presentations
- Alexander Chow: J. C. Corbett et al., Spanner: Google’s Globally-Distributed Database. Proc. 10th USENIX Symposium on Operating Systems Design and Implementation, 2012. Presentation slides
- Nian Ke: J. Zhou, N. Bruno, and W. Lin, Advanced partitioning techniques for massively distributed computation, Proc. ACM SIGMOD International Conference on Management of Data, pages 13-24, 2012. Presentation slides
Multidatabase Query Processing
- Chapter 9 of the textbook
- Lecture notes 9b
Presentations
- Taras Kinash: H. Elmeleegy, A. Elmagarmid, and J. Lee, Leveraging query logs for schema mapping generation in U-MAP, Proc. ACM SIGMOD Int. Conf. on Management of Data, pages 121-132, 2011. Presentation slides
- Bairong Lei: H. Herodotou, N. Borisov, and S. Babu, Query optimization techniques for partitioned tables, Proc. ACM SIGMOD International Conference on Management of Data, pages 49-60, 2011. Presentation slides
Peer-to-Peer Data Management
- Chapter 16 of the textbook
- Lecture notes 10
Presentations
- Aaron Moss: C. Olston, B. Reed, U. Srivastava, R. Kumar, A. Tomkins, Pig latin: a not-so-foreign language for data processing. Proc. ACM SIGMOD Int. Conf. on Management of Data, pages 1099-1110, 2008. Presentation slides
- Yiyao Liu: B. F. Cooper, R. Ramakrishnan, U. Srivastava, A. Silberstein, P. Bohannon, H.A. Jacobsen, N. Puz, D. Weaver, R. Yerneni, PNUTS: Yahoo!'s hosted data serving platform. Proc. 34th Int. Conf. on Very Large Data Bases, pages 1277-1288, 2008. Presentation slides
We won't have class this week since I will be out of town.
Stream Data Management
- Chapter 18 (Section 18.1) of the textbook
- Lecture notes 11
Presentations
- Alireza Mortezai: A. Crespo, H. Garcia-Molina, Routing Indices For Peer-to-Peer Systems, Proc. International Conference on Distributed Computing Systems, 2002. Presentation slides
- Zeynep Korkmaz: C. Curino, Y. Zhang, E. P. C. Jones, S. Madden, Schism: a Workload-Driven Approach to Database Replication and Partitioning. Proc. VLDB Endowment, 3(1): 48-57, 2010. Presentation slides
- Yogi Yoshi: G. DeCandia, D. Hastorun, M. Jampani, G. Kakulapati, A. Lakshman, A. Pilchin, S. Sivasubramanian, P. Vosshall, and W. Vogels. Dynamo: Amazon's Highly Available Key-Value Store. Proc. 21st ACM SIGOPS Symposium on Operating Systems Principles, pages 205-220, 2007. Presentation slides
MapReduce-based Distributed Data Management
- Chapter 18 (Section 18.2) of the textbook
- I will also distribute a survey paper
- Lecture notes 12
Presentations
- Xiang Gao: F. Chang, J. Dean, S. Ghemawat, W. C. Hsieh, D. A. Wallach, M. Burrows, T. Chandra, A. Fikes, R. E. Gruber, Bigtable: A Distributed Storage System for Structured Data, ACM Trans. Comput. Syst., 26(2): Article 4, 2008. Presentation slides
- Qiming Hu: K. Shvachko, H. Kuang, S. Radia, R. Chansler, The Hadoop Distributed File System, Proc. IEEE 26th Symposium on Mass Storage Systems and Technologies, 2010. Presentation slides
Final exam