[brch07] Nicolas Bruno and Surajit Chaudhuri. An online approach to physical design tuning. In Proc. International Conference on Data Engineering (ICDE'07), pages 826-835, April 2007. [ bib | .pdf ]
[agch06] Sanjay Agrawal, Eric Chu, and Vivek Narasayya. Automatic physical design tuning: workload as a sequence. In Proc. ACM SIGMOD International Conference on Management of Data (SIGMOD'06), pages 683-694, 2006. [ bib | .pdf ]
Considers a version of the database physical design problem in which the input is a sequence of queries and updates. The goal is to recommend a target physical design for each query or update in the sequence, taking into account both the effect of the physical design on the cost of executing the query or update and the cost of changing the physical design.
[brch06] Nicolas Bruno and Surajit Chaudhuri. To tune or not to tune? a lightweight physical design alerter. In Proc. International Conference on Very Large Data Bases (VLDB'06), pages 499-510, 2006. [ bib | .pdf | .pdf ]
[brch06b] Nicolas Bruno and Surajit Chaudhuri. Physical design refinement: The merge-reduce approach. In Proc. International Conference on Extending Database Technology (EDBT'06), number 3896 in Lecture Notes in Computer Science, pages 386-404. Springer-Verlag, 2006. [ bib | .pdf ]
[brch05] Nicolas Bruno and Surajit Chaudhuri. Automatic physical database tuning: A relaxation-based approach. In Proceedings of the 2005 ACM SIGMOD International Conference on Management of Data (SIGMOD'05), 2005. [ bib ]
Assumes that the optimizer requests indexes that it thinks might be useful for a particular query. These requested indexes form the initial configuration, which is then modified to meet a space constraint.
[coba05] Mariano P. Consens, Denilson Barbosa, Adrian M. Teisanu, and Laurent Mignet. Goals and benchmarks for autonomic configuration recommenders. In Proc. ACM SIGMOD International Conference on Management of Data (SIGMOD'05), 2005. [ bib | .pdf ]
Uses random queries that are generated from templates. Templates constrain generated queries to ensure that they are reasonable and that they can benefit from indexing.
[abha04] Ashraf Aboulnaga, Peter J. Haas, Mokhtar Kandil, Sam Lightstone, Guy M. Lohman, Volker Markl, Ivan Popivanov, and Vijayshankar Raman. Automated statistics collection in DB2UDB. In International Conference on Very Large Data Bases (VLDB '04), pages 1146-1157, August 2004. [ bib | .pdf | .pdf ]
[agch04] Sanjay Agrawal, Surajit Chaudhuri, Lubor Kollór, Arunprasad P. Marathe, Vivek R. Narasayya, and Manoj Syamala. Database tuning advisor for Microsoft SQL server. In International Conference on Very Large Data Bases (VLDB '04), pages 1110-1121, 2004. [ bib | .pdf ]
[zizu04] Daniel C. Zilio, Calisto Zuzarte, Sam Lightstone, Wenbin Ma, Guy M. Lohman, Roberta Cochrane, Hamid Pirahesh, Latha S. Colby, Jarek Gryz, Eric Alton, Dongming Liang, and Gary Valentin. Recommending materialized views and indexes with IBM DB2 design advisor. In IEEE International Conference on Autonomic Computing (ICAC'04), pages 180-188, 2004. [ bib | .pdf ]
General approach is to generate candidate MVs and indexes based on the workload, and then filter to meet a space constraint. Multiquery optimization is used when generating candidate MVs.
[agch03] Sanjay Agrawal, Surajit Chaudhuri, Abhinandan Das, and Vivek Narasayya. Automating layout of relational databases. In International Conference on Data Engineering (ICDE'03), pages 607-618, 2003. [ bib | .pdf ]
[razh02] Jun Rao, Chun Zhang, Guy M. Lohman, and Nimrod Megiddo. Automating physical database design in a parallel database. In Proc. ACM SIGMOD International Conference on Management of Data, pages 558-569, 2002. [ bib | .pdf ]
Automatic hash-based relation partitioning in shared nothing systems.
[agch01] Sanjay Agrawal, Surajit Chaudhuri, and Vivek R. Narasayya. Materialized view and index selection tool for Microsoft SQL Server 2000. In Proc. ACM SIGMOD International Conference on Management of Data, page 608, 2001. [ bib ]
This is a one-page description of a SIGMOD demo.
[chna01] Surajit Chaudhuri and Vivek Narasayya. Automating statistics management for query optimizers. IEEE Transactions on Knowledge and Data Engineering, 13(1):7-20, 2001. [ bib | .pdf ]
Hardcopy on file. This is the journal version of [chna00]. A variety of heuristic techniques for choosing minimal sets of heuristics in such a way that the quality of plans produced by the optimizer is not reduced.
[agch00] Sanjay Agrawal, Surajit Chaudhuri, and Vivek R. Narasayya. Automated selection of materialized views and indexes in SQL databases. In Proc. International Conference on Very Large Data Bases, pages 496-505, 2000. [ bib | .pdf ]
[chna00] Surajit Chaudhuri and Vivek Narasayya. Automating statistics management for query optimizers. In 16th International Conference on Data Engineering, pages 339-348, 2000. [ bib ]
The journal version of this paper is [chna01].
[leki00] Mong Li Lee, Masaru Kitsuregawa, Beng Chin Ooi, Kian-Lee Tan, and Anirban Mondal. Towards self-tuning data placement in parallel database systems. In Proceedings of the 2000 ACM SIGMOD International Conference on Management of Data, pages 225-236, 2000. [ bib | .pdf ]
Adaptive declustering in shared-nothing systems using a two-level, tree-structured index.
[vazu00] Gary Valentin, Michael Zuliani, Daniel C. Zilio, Guy M. Lohman, and Alan Skelley. DB2 Advisor: An optimizer smart enough to recommend its own indexes. In 16th International Conference on Data Engineering, pages 101-110, 2000. [ bib | .pdf ]
General recommendation method is to add virtual indexes to the schema, optimize the query, and check whether any virtual statistics are used in the optimal plan. Statistics for virtual indexes are inferred from existing column statistics. To recommend indexes for a workload, recommend for each query in the workload in sequence and then greedily select a subset of the recommended indexes.
[chna98] Surajit Chaudhuri and Vivek R. Narasayya. Autoadmin 'what-if' index analysis utility. In Proc. ACM SIGMOD International Conference on Management of Data, pages 367-378, 1998. [ bib | .pdf ]
How to implement hypothetic database configurations, so that workload costs can be estimated under those configurations. Configuration includes hypothetic indexes and statistics that allow the optimizer to decide whether such an index should be used. Proposes that sampling be used to collect the statistics. Allows specification of scale factors so configurations with larger/smaller databases can be simulated. Presents an analysis interface that supports workload analysis and configuration analysis for current and hypothetical configurations.
[chna97] Surajit Chaudhuri and Vivek R. Narasayya. An efficient cost-driven index selection tool for Microsoft SQL Server. In Proc. International Conference on Very Large Data Bases, pages 146-155, 1997. [ bib | .pdf ]
Assumes that an upper bound is given on the number of indexes. Workload is specified as a set of SQL DML statements, including insert, delete and update. Search space includes both single and multi-attribute indexes. Index configurations are evaluated by the DBMS optimizer, and several techniques are used to reduce the number of configurations for which optimizer evaluation is required. To generate a set of candidate indexes, this method determines an optimal index configuration independently for each query in the workload. The initial candidate set is then taken as the union of the indexes in the single-query optimal configurations. A hybrid exhaustive/greedy approach is used to control search. To find a k-index configuration, first find the optimal m-index configuration (m <= k) using exhaustive search, then add k-m indexes greedily. Multi-column indexes are handled by first finding an good configuration with single-column indexes, then generating and adding a set of candidate two-column indexes, and then rerunning the optimizer on the new candidate set. This is repeated to handle indexes with more than two columns.
[logh94] David Lomet and Shahram Ghandeharizadeh, editors. Bulletin of the IEEE Technical Committee on Data Engineering, volume 17(3), September 1994. [ bib | .pdf | .pdf ]
Special Issue on Data Placement for Parallelism
[fisc88] Sheldon J. Finkelstein, Mario Schkolnick, and Paolo Tiberio. Physical database design for relational databases. ACM Transactions on Database Systems, 13(1):91-128, 1988. [ bib ]
[come78] Douglas Comer. The difficulty of optimum index selection. ACM Transactions on Database Systems, 3(4):440-445, 1978. [ bib ]