Introduction
Introduction continued
NewSQL
- Presentation 1: Prateek Gulati: C. Diaconu, C. Freedman, E. Ismert, P.A. Larson, P. Mittal, R. Stonecipher, N. Verma, M. Zwilling. Hekaton: SQL Server’s Memory-Optimized OLTP Engine, Proc. ACM SIGMOD International Conference on Management of Data, pages 1243-1254, 2013.
- Presentation 2: Rania Ibrahim: P.A. Larson, C. Clinciu, C. Fraser, E. N. Hanson, M. Mokhtar, M. Nowakiewicz, V. Papadimos, S. L. Price, S. Rangarajan, R. Rusanu, and M. Saubhasik. Enhancements to SQL server column stores. Proc. ACM SIGMOD International Conference on Management of Data, pages 1159-1168, 2013.
NoSQL
- Presentation 1: Jeff Avery: G. DeCandia, D. Hastorun, M. Jampani, G. Kakulapati, A. Lakshman, A. Pilchin, S. Sivasubramanian, P. Vosshall and W. Vogels. Dynamo: Amazon's Highly Available Key-Value Store, Proc. 21st ACM Symposium on Operating Systems Principles, pages 205-,220, 2007.
- Presentation 2: Besat Kassaie: Cassandra: a decentralized structured storage system by A. Lakshman and P. Malik. SIGOPS Oper. Syst. Rev., 44(2): 35-40, 2010.
- Presentation 3: Robina Bhatia: L. Qiao, et al., On Brewing Fresh Espresso: Linkedin's Distributed Data Serving Platform, Proc. ACM SIGMOD International Conference on Management of Data, pages 1135-1146, 2013.
MapReduce-based data management
- Presentation 1: Xiao Meng: F. Li, M. T. Özsu, G. Chen, B.C. Ooi. R-Store: A scalable distributed system for supporting real-time analytics. Proc. IEEE 30th International Conference on Data Engineering, pages 40-51, 2014.
- Presentation 2: Guoyao Feng: L. Chang, Z. Wang, T. Ma, L. Jian, L. Ma, A. Goldshuv, L. Lonergan, J. Cohen, C. Welton, G. Sherry, and M. Bhandarkar. HAWQ: a massively parallel processing SQL engine in Hadoop, Proc. ACM SIGMOD Int. Conf. on Management of Data, pages 1223-1234, 2014.
- Presentation 3: Ahmed Elbagoury: I. Elghandour, A. Aboulnaga. ReStore. Reusing Results of MapReduce Jobs. Proc. VLDB Endow., 5(6): 586-597, 2012.
Main memory & column-store systems
- Presentation 1: Cong Guo: V. Sikka, F. Färber, W. Lehner, S. K. Cha, T. Peh, and C. Bornhövd. Efficient transaction processing in SAP HANA database: the end of a column store myth. Proc. ACM SIGMOD Int. Conf. on Management of Data, pages 731-742, 2012.
- Presentation 2: Pragnya Addala: D. J. Abadi, S. R. Madden, and N. Hachem. Column-stores vs. row-stores: how different are they really?. Proc. ACM SIGMOD Int. Conf. on Management of Data, pages 967-980, 2008.
Stream processing systems
- Presentation 1: Robina Bhatia: L. Abraham, J. Allen, O. Barykin, V. Borkar, B. Chopra, C. Gerea, D. Merl, J. Metzler, D. Reiss, S. Subramanian, J. L. Wiener, and O. Zed. Scuba: diving into data at facebook. Proc. VLDB Endow., 6(11): 1057-1067, 2013.
- Presentation 2: Rania Ibrahim: G. Mishne, J. Dalton, Z. Li, A. Sharma, and J. Lin. 2013. Fast data in the era of big data: Twitter's real-time related query suggestion architecture. Proc. ACM SIGMOD Int. Conf. on Management of Data, pages 1147-1158, 2013.
Graph data processing - focusing on graph analytics
- Presentation 1: Cong Guo: G. Malewicz, M. H. Austern, A. J. C. Bik, J. C. Dehnert, I. Horn, N. Leiser, and G. Czajkowski. Pregel: A System for Large-Scale Graph Processing, Proc. ACM SIGMOD Int. Conf. on Management of Data, pages 135-145, 2010.
- Presentation 2: Ahmed Elbagoury: V. Satuluri, S. Parthasarathy, and Y. Ruan. Local graph sparsification for scalable clustering. Proc. ACM SIGMOD International Conference on Management of Data, pages 721-732, 2011.
- Presentation 3: Guoyao Feng: N. Satish, N. Sundaram, M.A. Patwary, J. Seo, J. Park, M. A. Hassaan, S. Sengupta, Z. Yin, P. Dubey. Navigating the maze of graph analytics frameworks using massive graph datasets, Proc. ACM SIGMOD International Conference on Management of Data, pages 979-990, 2014.
Graph data processing - focusing on online queries
- Presentation 1: Jeff Avery: B. Shao, H. Wang, Y. Li. Trinity: a distributed graph engine on a memory cloud, Proc. ACM SIGMOD International Conference on Management of Data, pages 505-516, 2013.
- Presentation 2: Xiao Meng: S. Yang, X. Yan, B. Zong, and A. Khan. Towards effective partition management for large graphs. Proc. ACM SIGMOD International Conference on Management of Data, pages 517-528, 2012.
RDF data processing
- Presentation 1: Besat Kassaie: P. Cudré-Mauroux, I. Enchev, S. Fundatureanu, P. Groth, A. Haque, A. Harth, F. L. Keppmann, D. Miranker, J. F. Sequeda, M. Wylot. NoSQL Databases for RDF: An Empirical Evaluation. Proc. International Semantic Web Conference, pages 310-325, 2013.
- Presentation 2: Pragnya Addala: D. J. Abadi, A. Marcus, S. R. Madden, and K. Hollenbach. Scalable semantic web data management using vertical partitioning. Proc. 33rd International Conference on Very Large Data Bases, pages 411-422, 2007.
No class
No class
Project Presentations
- Cong Guo and Jeff Avery
- Robina Bhatia and Pragnya Addala
- Besat Kassale
Project Presentations
- Rania Ibrahim and Ahmed Elbagoury
- Xiao Meng and Guoyao Feng