7.6

1 Outline

This course is designed for Data Science MMath students who do not have an undergraduate degree in Computer Science. It provides these students with the background needed to take Data Science graduate courses with a strong computational flavour.

1. Processor internals

CPU, registers, memory. The fetch-execute cycle. Instruction sets. Compilers, interpreters, virtual machines. Exceptions. Input/output.

2. Memory hierarchies

Caches. Disks. Remote storage.

3. Operating systems

Virtual memory and paging. Virtual address spaces. Processes and scheduling. Dynamic memory allocation. Garbage collection. Graphical user interfaces. Command-line interfaces.

4. Concurrency and parallelism

Interprocess communication. Exception handling within processes. Threads. Shared memory vs message passing. Synchronization. Critical sections and locking. Hyperthreading. Superscalar processors. SIMD vs MIMD. Multiple cores.

5. Networks

Layered architecture. Packets and routing. Latency and throughput. Client-server model. Sockets.

6. Expressing computation

Programming languages. Paradigms of computing: imperative, object-oriented, functional, declarative, relational, dataflow. Examples and comparisons.

7. Algorithm design and analysis

O-notation. Sorting. Sequences. Sets. Finite maps. Hashing. Trees. Graphs. Collections libraries. Foreign function interfaces, APIs, and bindings.

A. Using a command-line interface

Shell windows. Commands. Navigation. Remote login. Files and directories. Permissions. Redirection. Text editors: Nano, Vim, Emacs. Shell scripts. Version control systems.