Meeting 27 January 2016
Attended: drallen (project manager), a2brenna fhgunn ldpaniak.
Database hardware status:
- hardware racked, powered, network connectivity? hostnames? inventory? Yes. Anthony reports:
- ubuntu14.04; 3.16+ kernel
mc-3015-411.cloud.cs.uwaterloo.ca
dc-3558-411.cloud.cs.uwaterloo.ca
m3-3101-411.cloud.cs.uwaterloo.ca
These are in inventory (eg. mc-3015-441.cloud.cs ) but not in Machine Notes.
- networking isn't working quite as expected- but is visible via linux.cscf
- active/passive replication works - via Clayton's instructions. manual switchover from passive to active.
- "Install Cluster Mysql container and with mysql aplication and ppa's" - is done.
- Anthony is working on "Manage Mysql configuration (N-node cluster versus single instance)" - ongoing issue with connecting to the systems via ssh under salt. This was the Feb 1. item... he will have this done for Wed Feb 3.
- Manual failover seems prone to excess downtime. Can we do some sort of "active-active" setup instead of "Active-passive"? Anthony cautions this might be incompatible with three servers- he'll look into "active-active-active."
- Where is the documentation for yubikey? ST#98955
- the digital ocean page links to the instructions Anthony used; and there is also info about active-active.
- Anthony ran into issues with networking on the salted system; he will need followup with Dave. Lori will deliver example ubuntu stanzas to Anthony.
- Anthony's work on "Manage Mysql configuration (N-node cluster versus single instance)" ran into an issue with mysql auth ; Fraser offered mysql tips. Anthony will approach him later when he's tried a few things.
- Benchmark current mysql.cs performance - Fraser and Lori to do this- initial low-level tests before we next meet. (mysql service log has details; Lori will ask Fraser)
- can we track end-user usage stats? - can we instrument inventory, marmoset to make charts? ST already has this per-request... not averages.
- however, speedups are not as critical a part of the project as HA. We could still do "cheap" benchmarks without taking too much staff time.
We don't yet have timing info on the migration and further process. So we can't estimate how far from done we are.
Migrating data:
- will move inventory using the active/passive process described in the cluster documentation- Anthony to do for inventory; we will need to be able to teach someone in TOP- probably Nick. (Daniel to confirm with Omar).
Testing failover:
- Can we do "active active active"? Anthony investigating.
- Can we set a goal of 5 9s of uptime?
Question for management: asynchronous sync has a chance of data-loss if a master goes down before sync. Good enough, or synchronous sync a requirement? Do we want to build out synchronous for the second cluster (August)? Daniel and/or management to ask Ken.
-- DanielAllen - 2016-01-27