CS848 Paper Review Form - Fall 2006

Paper Title: Continuous resource monitoring for self-predicting DBMS

Author(s): Dushyanth Narayanan, Eno Thereska, and Anastassia Ailamaki. 

1) Is the paper technically correct?
 [x] Yes
 [ ] Mostly (minor flaws, but mostly solid)
 [ ] No

2) Originality
 [ ] Very good (very novel, trailblazing work)
 [x] Good 
 [ ] Marginal (very incremental)
 [ ] Poor (little or nothing that is new)

3) Technical Depth
 [ ] Very good (comparable to best conference papers)
 [x] Good (comparable to typical conference papers)
 [ ] Marginal depth
 [ ] Little or no depth

4) Impact/Significance
 [ ] Very significant
 [x] Significant
 [ ] Marginal significance.
 [ ] Little or no significance.

5) Presentation
 [ ] Very well written
 [x] Generally well written
 [ ] Readable
 [ ] Needs considerable work
 [ ] Unacceptably bad

6) Overall Rating
 [ ] Strong accept (very high quality)
 [x] Accept (high quality - would argue for acceptance)
 [ ] Weak Accept (marginal, willing to accept but wouldn't argue for it)
 [ ] Weak Reject (marginal, probably reject)
 [ ] Reject (would argue for rejection)

7) Summary of the paper's main contribution and rationale
   for your recommendation. (1-2 paragraphs)

The paper proposes a framework for quantitatively predicting performance
of a DBMS for a hardware resource upgrade, like memory, CPU, disks, and etc.
The proposed system can estimate new throughput, response time and where the 
bottleneck would be in a "what if" scenario of a particular hardware upgrade.
The main contribution of the paper is for using code traces from the database 
during an actual workload in order to create a model that can be used to predict 
troughput and response time for hypothetical resource configurations.

There is novelty in the paper's approach and its proof-of-concept prototype shows 
promise. The paper does a good job in illustrating its main ideas as well.

8) List 1-3 strengths of the paper.  (1-2 sentences each,
identified as S1, S2, S3.)

S1. Using code tracing during live execution of actual workload and the
    ability to create accurate hardware independent models to predict
    performance.
S2. The proposed architecture allows for analysis and modeling components 
    to be easily replaced.
S3. Trace data can be analyzed offline thus minimizing performance impact on
    the working system.

9) List 1-3 weaknesses of the paper (1-2 sentences each,
identified as W1, W2, W3.)

W1. Only analyzed OLTP workloads, not even a prediction on DSS ones.
W2. Have to modify database engine code to add those traces, which 
    even when not in use could affect performance or cause increase code complexity.

10) Detailed comments for authors.

Buffer pools are not the only memory consumer to benefit from increased
system memory size. Such extra memory can be distributed to sort, locks, 
query parsing/compiling, different utilities, application agents, and etc. 
Such changes can significantly affect performance as well,
allow for a greater number of concurrent connections and even affect 
query plans suggested by the optimizer. Thus changes to more than one
memory consumer may have to be taken into account to predict the future
performance.

Larger disk space can affect database design produced by database advisor
tools like Database Engine Tuning Advisor from Microsoft or DB2 Design Advisor
from IBM. This can significantly alter workload performance as well and
effectively make another resource as the bottleneck.