- These scripts installs Cuda, Tensorflow, Keras, Pytorch, Pycuda and related packages
--
Mike Gore - 2020-02-14
Location of master repository
This Document was created on Tue Feb 18 13:07:48 EST 2020 by /home/gpu-setup/create_documents
- asimov:/home/gpu-setup - which is symlinked to asimov.uwaterloo.ca:/cscf-adm/src/gpu-setup
- scripts push_servers contains a script that updates all of these machines currently only working from cs-tech1.cs.uwaterloo.ca
Hosts using these scripts
- ai-vector tml tml2 tml3 ming-gpu-1 ming-gpu-2 ming-gpu-3 ming-gpu-4 ming-gpu-p40 ming-gpu-v100 honeydew beaker cabernet
GPU Installation scripts /home/gpu-setup
- We now have cuda 10, cuDNN 7.31, tensorflow, pycuda, pytorch and keras installed
Updating or Creating this document
* Note: the script /home/gpu-setup/create_documents contains the TWIKI source for this document
* Please update /home/gpu-setup/create_documents and rerun it to update the notes
Anaconda environment ml
- I created an anaconda environment called ml
- "ml" stands for math learning
- FYI: tensorflow, pycuda, pytorch and tensorflow use "ml" for their installation.
- These python packages are very dependent on a specific cuda versions
- WARNING to prevent destroying the system python invironment
- You must always use anaconda "ml" environment*
Using the "ml" python environment
- source /home/gpu-setup/install_env
- This sets search paths and library paths
- source activate ml
- This makes sure that you are in the ml workspace!
Linux system group called ml
- I created a new Linux system group "ml"
- This group permits sharing code between users
Adding users to the ml system group
- Run /home/gpu-setup/update_ml_users as root any time
Manually adding the ml group to files or directories
- Example ml group sharing
- chgrp -R ml /home/share
- chmod -R g+w /home/share
directories alrwady added to the ml group
-
- /usr/local/cuda
- /usr/local/anaconda3
- /home/gpu-setup/cudnn_samples_v7
Installation scripts located in /home/gpu-setup
- Installed in the following order
- install_first
- Installs all required Ubuntu packages for support installation
- And these shell libraries used by the scripts
- install_env
- Set environment paths for all users
- create_documents
- Creates CUDA_README.txt file when run and updates motd
- updates the login message by updating /etc/motd.tail
- install_anaconda
- Creates anaconda ml environment
* Aside: these next scripts can also be run anytime to fix broken installation
ml system group update script
- update_ml_users: GPU scripts
- This script adds all users to the ml system group
Tests
Cleanup Scripts
- purge_anaconda
- This file DELETES ALL ANACONDA USER ENVIRONMENTS
- ONLY USE THIS IF YOU HAVE TO START OVER FROM SCRATCH
Comments