CS 848 (Winter 2010) Additional Readings
- [agsi09]
P. Agrawal, A. Silberstein, B. F. Cooper, U. Srivastava and R. Ramakrishnan.
Asynchronous view maintenance for VLSD databases. Proc. SIGMOD 2009.
- [gana09]
A. Gates et al.
Building a HighLevel Dataflow System on top of MapReduce: The Pig Experience.
Proc. VLDB 2009. (industrial track paper)
- [frpa09]
John Cieslewicz, Peter Pawlowski, and Eric Friedman.
SQL/MapReduce: A practical approach to self-describing, polymorphic, and parallelizable user-defined functions.
Proc. VLDB 2009. (industrial paper)
- [webo09]
C. D. Weissman and S. Bobrowski.
The Design of the Force.com Multitenant Internet Application
Development Platform.
Proc. ACM SIGMOD Int'l Conference on Management of Data (SIGMOD), 2009,
pp. 889-896.
- [augr08]
S. Aulbach, T. Grust, D. Jacobs, A. Kemper and J. Rittinger.
Multi-tenant databases for software as a service: schema-mapping
techniques.
Proc. ACM SIGMOD Int'l Conference on Management of Data,
2008. pp. 1195-1206
- [isyu09]
Michael Isard and Yuan Yu.
Distributed data-parallel computing using a high-level programming
language.
Proc. ACM SIGMOD Int'l Conference on Management of
Data. 2009. pp. 987-994.
- [grbr00]
S. Gribble, E. Brewer, J. Hellerstein, and D. Culler.
Scalable, distributed data structures for Internet
service construction.
In Proc. OSDI 2000. pp. 319-332. Oct. 2000.
- Brewer's CAP PODC keynote
- [gily02]
S. Gilbert and N. Lynch.
Brewer's conjecture and the feasibility of consistent, available,
partition-tolerant web services.
SIGACT News 33(2), 2002. pp. 51-59.
- T. D. Chandra, R. Griesemer, and J. Redstone.
Paxos made live:
an engineering perspective.
In PODC, pages 398-407, 2007.
- P. Helland.
Life beyond distributed transactions: an apostate's opinion.
In CIDR, pages 132-141, 2007.
- [degr92]
David J. DeWitt and Jim Gray.
Parallel Database Systems: The Future of High Performance Database
Systems.
Communications of the ACM 35(6), 1992.
- [xigo05]
Man Xiong, Brian Goldstein and Chris Auger.
Scaling Out SQL Server with Data-Dependent Routing.
Dell Power Solutions, August 2005.
- [orac07]
Scalability and performance with Oracle 11g database.
- [fitz04]
B. Fitzpatrick.
Distributed caching with memcached.
Linux Journal, August 2004.
- [daag09]
Sudipto Das, Divyakant Agrawal, and Amr El Abbadi.
ElasTraS: An Elastic Transactional Data Store in the Cloud.
In Proc. HotCloud09
- M. Cafarella et al. Data management projects at google. ACM
SIGMOD Record, 37(1):34-38, 2008
- [beda06]
P. Bernstein, N. Dani, B. Khessib, R. Manne, and D. Shutt.
Data management issues in supporting large-scale web services.
IEEE Data Engineering Bulletin, December 2006.
- [sico08]
Adam Silberstein, Brian F. Cooper, Utkarsh Srivastava, Erik
Vee, Ramana Yerneni, and Raghu Ramakrishnan.
Efficient bulk insertion into a distributed ordered table.
Proceedings of the 2008 ACM SIGMOD Int'l Conf. on Management of
Data. June 09-12, 2008.
- Avinash Lakshman, Prashant Malik: Cassandra: structured storage
system on a P2P network. PODC 2009: 5 (keynote presentation)
-
Antony I. T. Rowstron and Peter Druschel.
Pastry: Scalable,
Decentralized Object Location, and Routing for Large-Scale
Peer-to-Peer Systems.
Proceedings of the IFIP/ACM International
Conference on Distributed Systems Platforms Heidelberg, p.329-350,
November 12-16, 2001
- Ion Stoica, Robert Morris, David Karger, M. Frans Kaashoek,
and Hari Balakrishnan.
Chord: A scalable peer-to-peer lookup service for
internet applications.
Proceedings of the 2001 conference on
Applications, technologies, architectures, and protocols for
computer communications, p.149-160, August 2001, San Diego,
California, United States.
- Sage A. Weil , Scott A. Brandt , Ethan L. Miller , Carlos
Maltzahn. CRUSH: controlled, scalable, decentralized placement of
replicated data, Proceedings of the 2006 ACM/IEEE conference on
Supercomputing, November 11-17, 2006, Tampa, Florida
- Sage A. Weil , Scott A. Brandt , Ethan L. Miller , Darrell
D. E. Long , Carlos Maltzahn. Ceph: a scalable, high-performance
distributed file system. Proceedings of the 7th USENIX Symposium on
Operating Systems Design and Implementation, p.22-22, November
06-08, 2006, Seattle, WA.
- CouchDB, MongoDB, SimpleDB, Voldemort, Scalaris - distributed
object caches over multiple machines
- [depa08]
David DeWitt, Eric Robinson, Srinath Shankar, Erik Paulson, Jeffrey
Naughton, Andrew Krioukov, and Joshua Royalty.
Clustera: An Integrated Computation and Data Management System.
In Proc. VLDB 2008.
- [thsa09]
Ashish Thusoo, Joydeep Sen Sarma, Namit Jain, Zheng Shao, Prasad
Chakka, Suresh Anthony, Hao Liu, Pete Wyckoff, and Raghotham Murthy.
Hive - A Warehousing Solution Over a Map-Reduce Framework.
In Proc. VLDB 2009 (demo).
- [coco09]
Jeffrey Cohen, Brian Dolan, Mark Dunlap, Joseph Hellerstein,
Caleb Welton.
MAD Skills: New Analysis Practices for Big Data.
Greenplum white paper, 2009.