Learn to Learn to Learn

AI Day - Singapore 2018

Martin.Andrews @ reddragon.ai

29 April 2018

About Me

Machine Intelligence / Startups / Finance
- Moved from NYC to Singapore in Sep-2013
2014 = 'fun' :
- Machine Learning, Deep Learning, NLP
- Robots, drones
Since 2015 = 'serious' :: NLP + deep learning
- & Papers...
- & Dev Course...

About Red Dragon AI

Deep Learning Consulting & Prototyping
Education / Training
Products :
- Conversational Computing
- Natural Voice Generation - multiple languages
- Knowledgebase interaction & reasoning

Learning to
Learn to Learn

The basic ideas of Learning
Learning from a lot of data
Learning from some data
Learning from a little data

TensorFlow Playground

http://playground.tensorflow.org/

Things to Understand

Hands-on :
- Goal : learning to predict regions
- Input features
- What a single neuron can learn
- The blame game
- How deep networks 'create' features

Image Competition

ImageNet aka ILSVRC
over 15 million labeled high-resolution images...
... in over 22,000 categories

More Complex Networks

GoogLeNet (2014)

... Even More Complex

Google Inception-v3 (2015)

... and Deeper

Microsoft ResNet (2015)

ImageNet Classification

Trained from Zero

Uses vast numbers of images
Huge computational resources
Does exactly what we told it to do

Transfer Learning

Based on existing model

Use pretrained (ImageNet) model
Leverage it to classify new classes
Much less training data required

Demo : ImageNet

2-CNN/5-TransferLearning/
5-ImageClassifier-keras.ipynb

Next Level Learning

Previous methods learn from large amounts of data
- But humans can learn from very little data
- We need models that do the same
Ideally, we want the models to Learn how to Learn

Meta-Learning

Two main types of meta-learning :
1. Learn how to build the best model
  - Structure meta-learning
2. Build a model that learns quickly
  - One-Shot Learning / Few-Shot Learning

Structure Meta-Learning

Problem : It's difficult for humans to build models
Solution :
- Enable computers to build models
- By searching model architectures efficiently
- By predicting which architectures might work well
Already seen the results...

NASNET Cells

The NASNet structure is created
- ... by searching over architectures
Better accuracy / speed / power tradeoffs than humans

( not the focus of this talk )

One-Shot Learning

Humans can learn from few examples
- eg : "Hammer-Whisk"

One-Shot Learning

Want a model that can also learn tasks quickly :
- Model should be trained on many tasks
- Each task will only have small amounts of data

( backtrack a little )

Regular-Learning

Training set:
- A bunch of different classes
- Each class has sample images to learn
Test set:
- Can the model classify a previously unseen image?

Meta-Learning

Training set:
- A bunch of different tasks
- Each task is a different problem to learn
- - Each of those problems has small amounts of data
Test set:
- Can the model learn a previously unseen task quickly?

Reptile Model Search

OpenAI : Reptile: a Scalable Metalearning Algorithm

Initialize Φ. the initial parameter vector
for iteration 1,2,3,…  do
  Randomly sample a task T
  Perform k>1 steps of SGD on task T, 
    starting with parameters Φ, resulting in parameters W
  Update: Φ ← Φ + ϵ(W−Φ)
end for
Return Φ

Reptile-Sines

Learn to learn tasks quickly
Each task in the meta-training set :
- Creates a 'sine wave' with random amp. and phase
- - Training data is just a few points
  - Test whether new points can be predicted
The meta-test sees whether a single task learns quickly

Demo : Meta-Learning

8-MetaLearning/
2-Reptile-Sines-PyTorch.ipynb

meta-MNIST

Goal : Learn to do character classification
- ... without knowing the character set
So : Want to have a model that can learn...
- ... from just a few examples
- ... or from just one few example (!)

Omniglot Dataset

1623 different handwritten characters
- from 50 different alphabets

Omniglot Dataset

Each of the 1623 characters
- was drawn by 20 different people

Compare : MNIST
- 10 characters, drawn ~5000 times each

One-shot Classification

Each task trains on 1 example each for 3 classes :
- Model pre-meta-trained on Omniglot
- Actual model running in Javascript
The user provides the meta-test task
... works pretty well

Hands-On : One-Shot Learning

http://RedCatLabs.com/meta

TensorFlow Content

Recent TensorFlow Summit (pays the bills)
~~"Eager Mode" is PyTorch for TensorFlow~~
TensorFlow Eager mode gives you :
- Benefits of quick development ~~(PyTorch)~~
- AND Tensorflow == production-ready
- AND Tensorflow == mobile+server+TPU

TF Eager Mode

TF Eager Mode

8-MetaLearning/
2-Reptile-Sines-TFEager.ipynb

You have Learned to
Learn to Learn

You have just seen how to make machines :
- Learn (at all)
- Learn from a lot of data
- Learn from some data
- Learn from a little data
- - ⇒ Learn to Learn

Wrap-up

Field is advancing very rapidly
Still within grasp of individuals
Open source applies to research too

* Please add a star... *

Deep Learning
MeetUp Group

Next Meeting = ~17-May-2018
- Hosted by Google in Singapore
Typical Contents :
- Talk for people starting out
- Something from the bleeding-edge
- Lightning Talks
MeetUp.com / TensorFlow-and-Deep-Learning-Singapore

Deep Learning : Jump-Start Workshop

Dates + Cost : ~S$600 (funding available for SG/PR) :
- Full day (week-end)
- Play with real models
- Get inspired!
- Pick-a-Project to do at home
- Regroup on subsequent week-nights
See SGInnovate Event Page for details

8-week Deep Learning
Developer Course

25 September - 25-November 2017
Twice-Weekly 3-hour sessions included :
- Instruction
- Individual Projects
- Support by WSG
Location : SGInnovate
Status : FINISHED!

Next Deep Learning
Developer Course

Plan : Start in a few months
Sessions will include :
- Instruction
- Individual Projects
- Support by SG govt (planned)
Location : SGInnovate
Status : TBA

Learn to Learn to Learn

AI Day - Singapore 2018

About Me

About Red Dragon AI

Learning to Learn to Learn

TensorFlow Playground

Things to Understand

Image Competition

More Complex Networks

... Even More Complex

... and Deeper

ImageNet Classification

Trained from Zero

Transfer Learning

Based on existing model

Demo : ImageNet

Next Level Learning

Meta-Learning

Structure Meta-Learning

NASNET Cells

One-Shot Learning

One-Shot Learning

Regular-Learning

Meta-Learning

Reptile Model Search

Reptile-Sines

Demo : Meta-Learning

meta-MNIST

Omniglot Dataset

Omniglot Dataset

One-shot Classification

Hands-On : One-Shot Learning

TensorFlow Content

TF Eager Mode

TF Eager Mode

You have Learned to Learn to Learn

Wrap-up

Deep LearningMeetUp Group

Deep Learning : Jump-Start Workshop

8-week Deep LearningDeveloper Course

Next Deep LearningDeveloper Course

- QUESTIONS -

Martin.Andrews @ RedCatLabs.com

Martin.Andrews @ RedDragon.AI

Learning to
Learn to Learn

You have Learned to
Learn to Learn

Deep Learning
MeetUp Group

8-week Deep Learning
Developer Course

Next Deep Learning
Developer Course

Martin.Andrews @
RedCatLabs.com

Martin.Andrews @
RedDragon.AI