Data Systems Reading Group
Where: Big Meeting Room in the DSG Lab, When: Mondays, 1-2 pm.

We are a reading group for technical papers that focus on the broad area of data systems.
We will read and discuss system architectures, concurrency control for data processing, data storage techniques, and fault tolerance as it relates to both new research papers and historical work.

We meet every other week for discussion.

Instructions for coordinators.
Instructions for participants.

Winter 2020

Paper Name Coordinator Meeting Date
Umbra: A Disk-Based System with In-Memory Performance Brad Feb 10
Neo: A Learned Query Optimizer Benson Feb 24
Make the most out of your SIMD investments Amine March 9
BlazeIt: Optimizing Declarative Aggregation and Limit Queries for Neural Network-Based Video Analytics Anil March 21
Opportunities for Optimism in Contended Main-Memory Multicore Transactions Michael April 6

Fall 2019

Paper Name Coordinator Meeting Date
Autoscaling Tiered Cloud Storage in Anna Brad Glasbergen October 7
Plan-Structured Deep Neural Network Models for Query Performance Prediction Benson November 4
LegoOS: A Disseminated, Distributed OS for Hardware Resource Disaggregation Omar November 18
Calvin: fast distributed transactions for partitioned database systems Michael December 2
Rya: a scalable RDF triple store for the clouds Brad December 16

Winter 2019

Paper Name Coordinator Meeting Date
The Case for Network-Accelerated Query Processing Brad Glasbergen January 29
Controlled Lock Violation Michael Abebe February 12
SageDB: A Learned Database System Kyle Langendoen February 26
Database Learning: Towards a Database That Becomes Smarter Every Time Brad Glasbergen March 12
SnappyData: A Unified Cluster for Streaming,Transactions, and Interactive Analytics Siddhartha Sahu March 26
Scalable Analytics on Fast Data. Anil Pacaci April 24

Fall 2018

Paper Name Coordinator Meeting Date
The Case for Network-Accelerated Query Processing Brad Glasbergen January 29
TAO: Facebook's Distributed Data Store for the Social Graph Siddhartha Sahu October 2
Flux: an adaptive partitioning operator for continuous query systems Anil Pacaci October 16
The End of an Architectural Era: It's Time for a Complete Rewrite Michael Abebe November 6
Noria: dynamic, partially-stateful data-flow for high-performance web applications Brad Glasbergen November 20
HyPer: A Hybrid OLTP & OLAP Main Memory Database System Siddhartha Sahu December 4