CS 854 Advanced Topics in Computer Systems: Cloud Computing & Management

Information

Instructor: Bernard Wong
Email: bernard@uwaterloo.ca
Time: Every Tuesday and Thursday at 2:00-3:20 Fall 2012
Office hours: TBD
Location: DC 3313
URL : http://www.cs.uwaterloo.ca/~bernard/courses/cs854

Description

CS854 is a graduate course on current research topics in cloud computing and management. It is offered as a "paper-chase" course: we read and discuss important papers on different subtopics in cloud computing and management each week. Students must also individually complete a final project that explores a new idea in one of the sub-topics covered in the course. The goal is that by the end of the course, the final project can serve as a starting point for a workshop paper submission.

Grading

The following is the tentative course grades breakdown.

Paper Reviews

Before each class, each student must submit a review for each paper to be discussed that day. A review should include a paragraph summarizing the core contributions of the paper, and a second paragraph that identifies weaknesses in the paper and areas in which the paper can be expanded upon.

Please inform me via email the circumstances for each late or missed submission.

Paper Presentations

Each student is expected to present the papers to be discussed in class for at least one class. The paper presentations should follow the same format as conference talks. The presenter should be prepared with sufficient background knowledge of the related works in the area to answer broad questions and lead the class discussion.

Presenters: Please send me a copy of your slides at least two days before your presentation. I will provide feedback on the slides the next day. Do not just re-use slides provided by the paper authors. You may borrow, with attribution, figures and animations, but your slides should be created independently.

Reading

Date Presenter Topic
9/11 Bernard Introduction: General course description. How to read papers and give talks.
9/13 Bernard Network Architectures (1)

VL2: A Scalable and Flexible Data Center Network

PortLand: A Scalable Fault-Tolerant Layer 2 Data Center Network Fabric

A Scalable, Commodity Data Center Network Architecture

9/18 Tyler Network Architectures (2)

SPAIN: COTS Data-Center Ethernet for Multipathing over Arbitrary Topologies

Generic and Automatic Address Configuration for Data Auto Addressing

9/20 Elnaz Network Architectures (3)

DCell: A Scalable and Fault-Tolerant Network Structure for Data Centers

BCube: A High Performance, Server-centric Network Architecture for Modular Data Centers

A Cost Comparison of Datacenter Network Architectures (Optional)

9/25 Ben Network Architectures (4)

Small-World Datacenters

Jellyfish: Networking Data Centers Randomly

Symbiotic Routing in Future Data Centers (Optional)

9/27 Xiao Alternative Datacenter Designs

c-Through: Part-time Optics in Data Centers

Augmenting Data Center Networks with Multi-Gigabit Wireless Links

10/2 Duo Flow Scheduling

Hedera: Dynamic Flow Scheduling for Data Center Networks

Improving Datacenter Performance and Robustness with Multipath TCP

10/4 Priyaa Transport Protocols for Datacenters

Data Center TCP (DCTCP)

Safe and Effective Fine-grained TCP Retransmissions for Datacenter Communication

ICTCP: Incast Congestion Control for TCP in Data Center Networks (Optional)

10/9 Thang Performance Isolation in Datacenter Networks

Towards Predictable Datacenter Networks

FairCloud: Sharing the Network in Cloud Computing

Sharing the Datacenter Network (Optional)

10/11 Xianda Distributed and Parallel Processing

MapReduce: Simplified Data Processing on Large Clusters

DryadLINQ: A System for General-Purpose Distributed Data-Parallel Computing Using a High-Level Language

10/16 Kareem Resource and Performance Management (1):

Modeling and Synthesizing Task Placement Constraints in Google Compute Clusters

Declarative Automated Cloud Resource Orchestration

10/18 Casey Virtual Infrastructure (1)

Xen and the Art of Virtualization

SnowFlock: Rapid Virtual Machine Cloning for Cloud Computing

10/23 Bernard Pricing

Exertion-based Billing for Cloud Storage Access

Using a Market Economy to Provision Compute Resources Across Planet-wide Clusters

To Move or Not to Move: The Economics of Cloud Computing (Optional)

10/25 Sukhbir Virtual Infrastructure (2)

Black-box and Gray-box Strategies for Virtual Machine Migration

The Turtles Project: Design and Implementation of Nested Virtualization

10/30 Kyle Green Datacenter (1)

Don't Lose Sleep Over Availability: The GreenUp Decentralized Wakeup Service

It's Not Easy Being Green

11/1 Bairong Green Datacenter (2)

Robust And Flexible Power-Proportional Storage

Sierra: Practical Power-proportionality for Data Center Storage

SRCMap: Energy Proportional Storage using Dynamic Consolidation (Optional)

11/6 Chang and Jack Software Support

The Chubby lock service for loosely-coupled distributed systems

ZooKeeper: Wait-free coordination for Internet-scale systems

Paxos Made Simple (Optional)

Vertical Paxos and Primary-Backup Replication (Optional)

11/8 Xin and Xi Cloud Storage (1)

Dynamo: Amazon's Highly Available Key-value Store (Moved to 11/1)

Bigtable: A Distributed Storage System for Structured Data

Cassandra - A Decentralized Structured Storage System (Moved to 11/1)

Object Storage on CRAQ: High-throughput chain replication for read-mostly workloads

11/13 Frank Cloud Storage (2)

Spanner: Google’s Globally-Distributed Database

HyperDex: A Distributed, Searchable Key-Value Store

Megastore: Providing Scalable, Highly Available Storage for Interactive Services (Optional)

11/15 Adriel High Availability and Reliability

Characterizing Cloud Computing Hardware Reliability

A Self-Organized, Fault-Tolerant and Scalable Replication Scheme for Cloud Storage

11/20 James Measurement and Characteristics (1)

Network Traffic Characteristics of Data Centers in the Wild

The Nature of Datacenter Traffic: Measurements & Analysis

11/22 Tzu-Yang Measurement and Characteristics (2)

OpenFlow: Enabling Innovation in Campus Networks

Understanding Network Failures in Data Centers: Measurement, Analysis, and Implications

NOX: Towards an Operating System for Networks (Optional)

11/27 Danish Security

Hey, You, Get Off of My Cloud: Exploring Information Leakage in Third-Party Compute Clouds

Depot: Cloud storage with minimal trust

11/29 Everyone Final Project Presentations

Last modified: September 18, 2012