CS 848 (Winter 2010) Schedule

Note: You will need a userid and password to access the locally-cached copies of some of these papers. The userid and password will be given out in class. If you've forgotten them, ask a classmate or send e-mail to the instructor.
Week Date Topic Presenter Summaries
1 04 Jan Organizational Meeting Salem
2 11 Jan Introduction and Background Salem PDF
[lago04] MTCache: Transparent mid-tier database caching in SQL Server Salem PDF Salem
3 18 Jan [pesp97] Flexible Update Propagation for Weakly Consistent Replication Khan PDF
[burr06] The Chubby Lock Service for Loosely-Coupled Distributed Systems Zhan PPT Zangooei
[gama08] Scalable Query Result Caching for Web Applications. Rauf PDF Faghihekhorasani
4 25 Jan [yuva00] Design and Evaluation of a Continuous Consistency Model for Replicated Services. VanSchyndel PDF Zangooei
[ghgo03] The Google File System. Reidmeister PDF Robinson
[degh04] MapReduce: Simplified Data Processing on Large Clusters. Khoshdel Nikkhoo PDF Rauf
5 01 Feb [mamu04] Boxwood: Abstractions as the Foundation for Storage Infrastructure. Robinson PDF Karyakin
[chde06] Bigtable: A Distributed Storage System for Structured Data Avram PDF Khan
[deha07] Dynamo: Amazon's Highly Available Key-Value Store Oshikoji PPT Aluc
6 08 Feb [cora08] PNUTS: Yahoo!'s Hosted Data Serving Platform Farid PPTX Avram
[olre08] Pig Latin: A Not-So-Foreign Language for Data Processing Welch PDF Khoshdel Nikkhoo
[gada03] Application Specific Data Replication for Edge Services Faghihekhorasani PDF Eflov
7 15 Feb Reading Week - no class
8 22 Feb [agme07] Sinfonia: A New Paradigm for Building Scalable Distributed Systems Zangooei PDF Robinson
[pido05] Interpreting the Data: Parallel Analysis with Sawzall Scientific Programming Karyakin PDF Avram
[isbu07] Dryad: Distributed Data-Parallel Programs from Sequential Building Blocks. Huang Khan
9 01 Mar No Class
10 08 Mar [aggo08] A Practical Scalable Distributed B-Tree Eflov PDF
[chje08] SCOPE: Easy and Efficient Parallel Processing of Massive Data Sets Aluc PDF
[abba09] HadoopDB: An Architectural Hybrid of MapReduce and DBMS Technologies for Analytical Workloads. Chalamalla PDF
11 15 Mar [papa09] A Comparison of Approaches to Large-Scale Data Analysis Isaacs PDF
[brfl08] Building a Database on S3 Ho PDF
[auja09] A Comparison of Flexible Schemas for Software as a Service Sheikh PDF
12 22 Mar Project Presentations
Distributed Extraction of Relations from Unstructured Documents Farid and Chalamalla PDF
Formalizing Repair-Driven Monitoring of Cloud-Based Web Services Reidmeister
Duplicate Document Detection Using Map-Reduce Khoshdel Nikkhoo PDF
MapReduce-based Analysis of Random Number Sequences Avram PDF
Distributed Model Verification Using Map-Reduce Faghihekhorasani PDF
Join and Update Processing in a Distributed RDF Database Aluc PDF
13 29 Mar Project Presentations
Cloud Security and Privacy group presentation PDF
Serving Website Snapshots Rauf and Shiekh
Real-time Analysis of Streaming Data with MapReduce Online Zhan PDF
Survey: QoS Guarantees for Cloud Services Karyakin PDF
Modifications to MapReduce Eflov PDF
Cloud Archiving File System Robinson PDF