---
layout: default
---
A new version of this course is being offered in Fall 2019
- When: Mondays and Wednesdays from 9:30 to 11:00
- Where: Soda 405
- Instructors: Ion Stoica and Joseph E. Gonzalez
- Announcements: Piazza
- Sign-up to Present: Google Spreadsheet
- Project Ideas: Google Spreadsheet
- If you have reading suggestions, please send a pull request to this course website on GitHub by modifying the index.md file.
The recent success of AI has been due in large part to advances in hardware and software systems. These systems have enabled training increasingly complex models on ever larger datasets. In the process, they have also simplified model development, enabling the rapid growth of the machine learning community. These new hardware and software systems include a new generation of GPUs and hardware accelerators (e.g., TPU and Nervana), open-source frameworks such as Theano, TensorFlow, PyTorch, MXNet, Apache Spark, Clipper, Horovod, and Ray, and a myriad of systems deployed internally at companies, to name a few. At the same time, we are witnessing a flurry of ML/RL applications to improve hardware and system designs, job scheduling, program synthesis, and circuit layouts.
In this course, we will describe the latest trends in system designs to better support the next generation of AI applications, and applications of AI to optimize the architecture and performance of systems. The format of this course will be a mix of lectures, seminar-style discussions, and student presentations. Students will be responsible for paper readings and for completing a hands-on project. Readings will be selected from recent conference proceedings and journals. For projects, we will strongly encourage teams that contain both AI and systems students.
{% capture dates %} 1/23/19 1/28/19 1/30/19 2/4/19 2/6/19 2/11/19 2/13/19 2/18/19 2/20/19 2/25/19 2/27/19 3/4/19 3/6/19 3/11/19 3/13/19 3/18/19 3/20/19 3/25/19 3/27/19 4/1/19 4/3/19 4/8/19 4/10/19 4/15/19 4/17/19 4/22/19 4/24/19 4/29/19 5/1/19 5/6/19 5/8/19 5/13/19 {% endcapture %} {% assign dates = dates | split: " " %}
This is a tentative schedule. Specific readings are subject to change as new material is published.
{% include syllabus_entry %}
This lecture will be an overview of the class, requirements, and an introduction to what makes great AI-Systems research.
{% include syllabus_entry %}
Minor Update: We have moved the reading on auto-encoders to Wednesday.
Reading notes for the two required readings below must be submitted using this Google Form by Monday the 28th at 9:30 AM. For each reading, we ask that you answer the following questions:
- What is the problem that is being solved?
- What are the metrics of success?
- What are the key innovations over prior work?
- What are the key results?
- What are some of the limitations and how might this work be improved?
- How might this work have long term impact?
If you find some of the readings confusing and want a gentler introduction, the optional reading contains some useful explanatory blog posts that may help.
- Reading Quiz due before class.
- Intro Lecture + AlexNet [pdf, pptx]
- Classic Neural Architectures and Inception-v4 [pdf, pptx]
- The AlexNet paper, which both helped launch deep learning and made the case for combining systems and ML. Take a look at how system constraints affected the model.
- Inception-v4, Inception-ResNet and the Impact of Residual Connections on Learning. In retrospect, the paper Rethinking the Inception Architecture for Computer Vision provides a better overview of the ideas and motivations behind the latest inception models.
- For a quick introduction to convolutional networks take a look at CS231 Intro to Convolutional Networks and Chris Olah's illustrated posts.
- Much of contemporary computer vision can be traced back to the original LeNet paper and its corresponding '90s-era website.
- There is a line of work that builds on residual networks starting with Highway Networks, then Densely Connected Convolutional Networks, and then more recently Deep Layer Aggregation. This blog post provides a nice overview.
{% include syllabus_entry %}
- Reading Quiz due before class.
- Intro [pdf, pptx]
- Autoencoders [pdf, pptx]
- Graph Neural Networks [pdf, pptx]
- We had originally assigned Autoencoders, Unsupervised Learning, and Deep Architectures. However, this paper is a bit too theoretical for the goals of this class. Instead, you may alternatively read this overview paper and use it when filling in the reading form.
- Graph Neural Networks: A Review of Methods and Applications
- An excellent Survey on Autoencoders
- A tutorial on variational auto-encoders (and another tutorial)
- The original work on auto-encoders, Learning Internal Representations by Error Propagation, by Rumelhart and McClelland.
- The paper "Relational inductive biases, deep learning, and graph networks" provides some background and motivations behind deep learning on relational objects and introduces a general Graph Network framework.
- The paper "Semi-Supervised Classification with Graph Convolutional Networks" introduces graph convolutional networks.
{% include syllabus_entry %}
- Reading Quiz due before class.
- Intro Lecture [pdf, pptx]
- TensorFlow Presentation [pdf, pptx]
{% include syllabus_entry %}
- Reading Quiz due before class.
- RLlib [pdf]
- A3C [pdf]
{% include syllabus_entry %}
- Reading Quiz due before class.
- Learned Indexes [pdf, pptx]
- Learning to Optimize Join Queries [pdf]
{% include syllabus_entry %}
- Reading Quiz due before class.
- Learned Cardinalities [pdf]
{% include syllabus_entry %}
{% include syllabus_entry %}
- Reading Quiz due before class. There was a mix-up in updating the reading and the wrong paper was swapped in. You may either read the Hyperband paper (preferred) or the Vizier paper (see optional reading) for the second reading.
- A Generalized Framework for Population Based Training [pdf]
{% include syllabus_entry %}
- Reading Quiz due before class.
- AutoML Overview [pdf, pptx]
- Designing Neural Networks with RL [pdf, pptx]
{% include syllabus_entry %}
- Reading Quiz due before class.
- Semantic Segmentation AutoML slides [pdf]
{% include syllabus_entry %}
- Reading Quiz due before class.
- Autonomous Vehicles Overview [pdf, pptx]
- Presentation: The Architectural Implications of Autonomous Driving [pdf]
{% include syllabus_entry %}
- Reading Quiz due before class.
- DL Compiler Overview [pdf, pptx]
- Presentation PDF
- Learning to Optimize Tensor Programs: The TVM story is twofold. There is a Systems-for-ML story (the paper above), and this paper is their ML-for-Systems story.
{% include syllabus_entry %}
{% include syllabus_entry %}
- Reading Quiz due before class.
{% include syllabus_entry %}
- Reading Quiz due before class.
- Overview [pdf, pptx]
{% include syllabus_entry %}
- Reading Quiz due before class.
{% include syllabus_entry %}
{% include syllabus_entry %}
{% include syllabus_entry %}
- Reading Quiz due before class.
- Introduction [pdf, pptx]
- AI Applications in Network Congestion Control [pdf, pptx]
{% include syllabus_entry %}
- Reading Quiz due before class.
- Introduction [pdf, pptx]
{% include syllabus_entry %}
- Reading Quiz due before class.
- Introduction [pdf, pptx]
- MobileNetV2: Inverted Residuals and Linear Bottlenecks
- ShuffleNet V2: Practical Guidelines for Efficient CNN Architecture Design
- Blog Post Comparing MobileNet and ShuffleNet
- SqueezeNet: AlexNet-level accuracy with 50x fewer parameters and less than 0.5MB model size
- EffNet: An Efficient Structure for Convolutional Neural Networks
- Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding
- Ternary Weight Networks
{% include syllabus_entry %}
- Reading Quiz due before class.
- Helen [pdf, pptx]
{% include syllabus_entry %}
- Reading Quiz due before class.
{% include syllabus_entry %}
- Reading Quiz due before class.
- Introduction [pdf, pptx]
{% include syllabus_entry %}
- Reading Quiz due before class.
- Introduction [pdf, pptx]
{% include syllabus_entry %}
- Reading Quiz due before class.
- DL Scheduling slides [pdf]
- Dominant Resource Fairness (DRF) slides [pdf]
{% include syllabus_entry %}
- Reading Quiz due before class.
{% include syllabus_entry %}
- Reading Quiz due before class.
- Neural Modular Networks Slides [pdf, pptx]
- Gonzalez Course Summary (Reflections on the Field of AI-Systems) [pdf, pptx]
{% include syllabus_entry %}
{% include syllabus_entry %}
{% include syllabus_entry %}
- Due at 11:59 PM
- Format: 8 pages (Google Doc)
- Email link to jegonzal@berkeley.edu and istoica@berkeley.edu
Week | Date (Lec.) | Topic
---|---|---
Detailed candidate project descriptions will be posted shortly. However, students are encouraged to find projects that relate to their ongoing research.
Grades will be largely based on class participation and projects. In addition, we will require weekly paper summaries submitted before class.
- Projects: 60%
- Weekly Summaries: 20%
- Class Participation: 20%