Learning with Limited Labeled Data (Fall 2020)
Course Description
As machine learning is deployed more widely, researchers and practitioners
keep running into a fundamental problem: how do we get enough labeled data?
This seminar course will survey research on learning when only limited
labeled data is available. Topics covered include weak supervision,
semi-supervised learning, active learning, transfer learning, and few-shot
learning. Students will lead discussions on classic and recent research
papers, and work in teams on final research projects.
Essential Info
Class Meetings: Tuesdays and Thursdays, 1-2:20 pm, synchronously on Zoom. Connection information available on
Canvas.
Office Hours by appointment. Email anytime to schedule!
Textbook: None
Prerequisites: Previous machine learning experience is required, via CSCI 1420 or equivalent research experience.
Important Links
Canvas
for discussions, assignment guidelines, and additional class resources
Past years
for previous reading lists (project ideas, etc.)
Contact
For questions, discussion, and other course-related posts, use
Canvas.
If you have an atypical question that you are certain does not belong on
Canvas, email the instructor.
Course Schedule
Introduction
Sep 10
Introductions, an overview of the research
topics we will cover during the semester, and how to read a research paper.
Supplemental reading:
- Introduction to Semi-Supervised Learning. Olivier Chapelle, Bernhard
Schölkopf, and Alexander Zien. In Semi-Supervised Learning, MIT Press, 2006.
[PDF]
[Online, requires Brown login]
- Incidental Supervision: Moving beyond Supervised Learning. Dan Roth.
AAAI 2017.
[PDF]
- How to Read a CS Research Paper? Philip W. L. Fong.
[PDF]
- How to Read a Technical Paper. Jason Eisner.
[Online]
- How to Read a Paper. S. Keshav.
[PDF]
Semi-Supervised Learning
Sep 15
Billion-scale semi-supervised learning for image classification.
I. Zeki Yalniz, Hervé Jégou, Kan Chen, Manohar Paluri, and Dhruv Mahajan.
ArXiv 1905.00546 2019.
[PDF]
Supplemental reading:
- Big Self-Supervised Models are Strong Semi-Supervised Learners.
Ting Chen, Simon Kornblith, Kevin Swersky, Mohammad Norouzi, and Geoffrey Hinton.
ArXiv 2006.10029 2020.
[PDF]
[Code]
Sep 17
S4L: Self-Supervised Semi-Supervised Learning.
Xiaohua Zhai, Avital Oliver, Alexander Kolesnikov, and Lucas Beyer.
IEEE/CVF International Conference on Computer Vision (ICCV) 2019.
[PDF]
[Supplemental]
[Video]
Supplemental reading:
- Self-Supervised Learning of Pretext-Invariant Representations.
Ishan Misra and Laurens van der Maaten.
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2020.
[PDF]
Sep 22
Temporal Ensembling for Semi-Supervised Learning.
Samuli Laine and Timo Aila.
International Conference on Learning Representations (ICLR) 2017.
[PDF]
Supplemental reading:
- Mean teachers are better role models: Weight-averaged consistency targets improve semi-supervised deep learning results.
Antti Tarvainen and Harri Valpola.
Neural Information Processing Systems (NeurIPS) 2017.
[PDF]
[Supplemental (Zip)]
[Reviews]
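To make the Sep 22 technique concrete before discussion, here is a minimal sketch of the temporal-ensembling idea: keep an exponential moving average (EMA) of each unlabeled example's past predictions and penalize disagreement with the current prediction. This is an illustrative reimplementation, not the authors' code; the decay rate and toy data are made up.

```python
import numpy as np

def update_ensemble(history, current_probs, epoch, alpha=0.6):
    """EMA over per-example predictions, with a startup bias correction as in the paper."""
    history = alpha * history + (1 - alpha) * current_probs
    targets = history / (1 - alpha ** (epoch + 1))
    return history, targets

def consistency_loss(current_probs, targets):
    """Mean squared error between current predictions and the ensembled targets."""
    return float(np.mean((current_probs - targets) ** 2))

# Toy run: 4 unlabeled examples, 3 classes; Dirichlet draws stand in for model outputs.
rng = np.random.default_rng(0)
history = np.zeros((4, 3))
for epoch in range(5):
    probs = rng.dirichlet(np.ones(3), size=4)
    history, targets = update_ensemble(history, probs, epoch)
    print(epoch, round(consistency_loss(probs, targets), 4))
```

The mean-teacher supplemental reading applies the same EMA idea to model weights rather than predictions.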
Sep 24
Start-of-course survey due
Self-training with Noisy Student improves ImageNet classification.
Qizhe Xie, Minh-Thang Luong, Eduard Hovy, and Quoc V. Le.
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2020.
[PDF]
[Supplemental]
Supplemental reading:
- Pseudo-Labeling and Confirmation Bias in Deep Semi-Supervised Learning.
Eric Arazo, Diego Ortego, Paul Albert, Noel E. O'Connor, and Kevin McGuinness.
ArXiv 1908.02983 2019.
[PDF]
[Reviews]
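As a reference point for the self-training and pseudo-labeling discussion, here is a minimal sketch of the basic loop these papers build on: fit a model on the labeled set, keep only confident predictions on unlabeled data as pseudo-labels, and refit. The scikit-learn model, 0.9 threshold, and round count are illustrative choices, not details from the papers; Noisy Student departs from this baseline by injecting noise into the student and iterating with equal-or-larger student models.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression

def self_train(X_lab, y_lab, X_unlab, threshold=0.9, rounds=3):
    X, y = X_lab, y_lab
    model = LogisticRegression(max_iter=1000).fit(X, y)
    for _ in range(rounds):
        probs = model.predict_proba(X_unlab)
        confident = probs.max(axis=1) >= threshold   # keep only high-confidence predictions
        if not confident.any():
            break
        # Grow the training set with confident pseudo-labels and refit.
        X = np.vstack([X_lab, X_unlab[confident]])
        y = np.concatenate([y_lab, model.classes_[probs[confident].argmax(axis=1)]])
        model = LogisticRegression(max_iter=1000).fit(X, y)
    return model
```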
Transfer/Representation Learning
Sep 29
How transferable are features in deep neural networks?
Jason Yosinski, Jeff Clune, Yoshua Bengio, and Hod Lipson.
Neural Information Processing Systems (NeurIPS) 2014.
[PDF]
[Supplemental (Zip)]
[Reviews]
Supplemental reading:
- Learning Transferable Features with Deep Adaptation Networks.
Mingsheng Long, Yue Cao, Jianmin Wang, and Michael Jordan.
International Conference on Machine Learning (ICML) 2015.
[PDF]
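The Sep 29 paper studies how transfer performance changes as layers copied from a pretrained network are frozen or fine-tuned. Here is a minimal PyTorch sketch of that recipe; the layer sizes, frozen split, and toy data are arbitrary stand-ins, not the paper's setup.

```python
import torch
import torch.nn as nn

backbone = nn.Sequential(                 # stand-in for a pretrained network
    nn.Linear(128, 64), nn.ReLU(),
    nn.Linear(64, 32), nn.ReLU(),
)
head = nn.Linear(32, 10)                  # new classifier for the target task

for param in backbone[:2].parameters():   # freeze the first layer (copied features)
    param.requires_grad = False

model = nn.Sequential(backbone, head)
optimizer = torch.optim.SGD(
    (p for p in model.parameters() if p.requires_grad), lr=0.01)

# One illustrative fine-tuning step on random data.
x, y = torch.randn(8, 128), torch.randint(0, 10, (8,))
optimizer.zero_grad()
loss = nn.functional.cross_entropy(model(x), y)
loss.backward()
optimizer.step()
```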
Oct 1
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding.
Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova.
Conference of the North American Chapter of the Association for Computational Linguistics (NAACL) 2019.
[PDF]
Supplemental reading:
- How to Fine-Tune BERT for Text Classification?
Chi Sun, Xipeng Qiu, Yige Xu, and Xuanjing Huang.
ArXiv 1905.05583 2019.
[PDF]
Oct 6
Unsupervised Feature Learning via Non-Parametric Instance Discrimination.
Zhirong Wu, Yuanjun Xiong, Stella X. Yu, and Dahua Lin.
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2018.
[PDF]
[Video]
Supplemental reading:
- Momentum Contrast for Unsupervised Visual Representation Learning.
Kaiming He, Haoqi Fan, Yuxin Wu, Saining Xie, and Ross Girshick.
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2020.
[PDF]
[Supplemental]
[Code]
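Both the Oct 6 paper and MoCo train encoders with a contrastive objective. A minimal InfoNCE-style loss is sketched below; the dimensions, temperature, and random vectors are illustrative, and real systems draw the negatives from a memory bank or queue rather than fresh noise.

```python
import torch
import torch.nn.functional as F

def info_nce(query, positive_key, negative_keys, temperature=0.07):
    """query: (d,), positive_key: (d,), negative_keys: (n, d)."""
    query = F.normalize(query, dim=0)
    keys = F.normalize(
        torch.vstack([positive_key.unsqueeze(0), negative_keys]), dim=1)
    logits = keys @ query / temperature   # similarity to 1 positive + n negatives
    # The positive key sits at index 0, so this is cross-entropy with target class 0.
    return F.cross_entropy(logits.unsqueeze(0), torch.tensor([0]))

q, pos, negs = torch.randn(64), torch.randn(64), torch.randn(128, 64)
print(info_nce(q, pos, negs).item())
```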
Oct 8
Rethinking Pre-training and Self-training.
Barret Zoph, Golnaz Ghiasi, Tsung-Yi Lin, Yin Cui, Hanxiao Liu, Ekin D. Cubuk, and Quoc V. Le.
ArXiv 2006.06882 2020.
[PDF]
Supplemental reading:
- On Learning Invariant Representations for Domain Adaptation.
Han Zhao, Remi Tachet Des Combes, Kun Zhang, and Geoffrey Gordon.
International Conference on Machine Learning (ICML) 2019.
[PDF]
[Supplemental]
Weakly Supervised Learning
Oct 13
Snorkel: Rapid Training Data Creation with Weak Supervision. Alexander Ratner,
Stephen H. Bach, Henry Ehrenberg, Jason Fries, Sen Wu, and Christopher Ré.
Proceedings of the VLDB Endowment, 11(3):269-282, 2017.
[PDF]
[Code]
Supplemental reading:
- Maximum Likelihood Estimation of Observer Error-Rates Using the EM Algorithm.
A. P. Dawid and A. M. Skene.
Journal of the Royal Statistical Society. Series
C (Applied Statistics), 28(1):20-28, 1979.
[PDF]
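For orientation before the Snorkel and Dawid-Skene readings, here is the simplest label-aggregation baseline they improve on: a majority vote over noisy labeling sources, ignoring abstentions. The encoding (-1 for abstain) and the toy votes are illustrative; both papers replace the vote with a learned model of source accuracies.

```python
import numpy as np

def majority_vote(votes, n_classes):
    """votes: (n_examples, n_sources) with entries in {-1 (abstain), 0..n_classes-1}."""
    labels = np.full(votes.shape[0], -1)
    for i, row in enumerate(votes):
        counts = np.bincount(row[row >= 0], minlength=n_classes)
        # Assign a label only if someone voted and there is a unique winner.
        if counts.sum() and (counts == counts.max()).sum() == 1:
            labels[i] = counts.argmax()
    return labels

votes = np.array([[1, 1, -1], [0, 1, 0], [-1, -1, -1], [0, 1, -1]])
print(majority_vote(votes, n_classes=2))   # -> [1, 0, -1, -1]
```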
Oct 15
Weakly Supervised Sequence Tagging from Noisy Rules.
Esteban Safranchik, Shiying Luo, and Stephen H. Bach.
AAAI Conference on Artificial Intelligence (AAAI) 2020.
[PDF]
[Code]
Supplemental reading:
- TriggerNER: Learning with Entity Triggers as Explanation for Named Entity Recognition.
Bill Yuchen Lin, Dongho Lee, Ming Shen, Xiao Huang, Ryan Moreno, Prashant Shiralkar, and Xiang Ren.
Meeting of the Association for Computational Linguistics (ACL) 2020.
[PDF]
[Code]
Oct 20
Pretrained Encyclopedia: Weakly Supervised Knowledge-Pretrained Language Model.
Wenhan Xiong, Jingfei Du, William Yang Wang, and Veselin Stoyanov.
International Conference on Learning Representations (ICLR) 2020.
[PDF]
Supplemental reading:
- K-Adapter: Infusing Knowledge into Pre-Trained Models with Adapters.
Ruize Wang, Duyu Tang, Nan Duan, Zhongyu Wei, Xuanjing Huang, Jianshu Ji, Guihong Cao, Daxin Jiang, and Ming Zhou.
ArXiv 2002.01808 2020.
[PDF]
Oct 22
Project proposal due
Exploring the Limits of Weakly Supervised Pretraining.
Dhruv Mahajan, Ross Girshick, Vignesh Ramanathan, Kaiming He, Manohar Paluri, Yixuan Li, Ashwin Bharambe, and Laurens van der Maaten.
European Conference on Computer Vision (ECCV) 2018.
[PDF]
Supplemental reading:
- Large-Scale Weakly-Supervised Pre-Training for Video Action Recognition.
Deepti Ghadiyaram, Du Tran, and Dhruv Mahajan.
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2019.
[PDF]
Data Generation and Augmentation
Oct 27
Semi-Supervised Learning with Deep Generative Models.
Diederik P. Kingma, Danilo J. Rezende, Shakir Mohamed, and Max Welling.
Neural Information Processing Systems (NeurIPS) 2014.
[PDF]
[Supplemental (Zip)]
[Reviews]
Supplemental reading:
- Learning from Simulated and Unsupervised Images through Adversarial Training.
Ashish Shrivastava, Tomas Pfister, Oncel Tuzel, Josh Susskind, Wenda Wang, and Russ Webb.
IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2017.
[PDF]
Oct 29
Unsupervised Data Augmentation for Consistency Training.
Qizhe Xie, Zihang Dai, Eduard Hovy, Minh-Thang Luong, and Quoc V. Le.
ArXiv 1904.12848 2019.
[PDF]
[Reviews]
Supplemental reading:
- MixMatch: A Holistic Approach to Semi-Supervised Learning.
David Berthelot, Nicholas Carlini, Ian Goodfellow, Nicolas Papernot, Avital Oliver, and Colin Raffel.
Neural Information Processing Systems (NeurIPS) 2019.
[PDF]
[Supplemental (Zip)]
[Reviews]
[Code]
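One concrete ingredient from the MixMatch supplemental reading is the mixup operation: training on convex combinations of example pairs and their (soft) labels. A minimal sketch follows; the Beta(0.75, 0.75) mixing distribution is an illustrative hyperparameter choice.

```python
import numpy as np

def mixup(x1, y1, x2, y2, alpha=0.75):
    lam = np.random.default_rng().beta(alpha, alpha)
    lam = max(lam, 1 - lam)   # MixMatch keeps the mix biased toward the first input
    return lam * x1 + (1 - lam) * x2, lam * y1 + (1 - lam) * y2

x1, y1 = np.ones(4), np.array([1.0, 0.0])   # example with a one-hot label
x2, y2 = np.zeros(4), np.array([0.0, 1.0])
print(mixup(x1, y1, x2, y2))
```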
Nov 5
MixNMatch: Multifactor Disentanglement and Encoding for Conditional Image Generation.
Yuheng Li, Krishna Kumar Singh, Utkarsh Ojha, and Yong Jae Lee.
IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2020.
[PDF]
[Talk]
[Code]
[Video]
Supplemental reading:
- Generating Object Stamps.
Youssef Alami Mejjati, Zejiang Shen, Michael Snower, Aaron Gokaslan, Oliver Wang, James Tompkin, and Kwang In Kim.
ArXiv 2001.02595 2020.
[PDF]
Active Learning
Nov 10
Re-active Learning: Active Learning with Relabeling.
Christopher H. Lin, Mausam, and Daniel S. Weld.
AAAI Conference on Artificial Intelligence (AAAI) 2016.
[PDF]
Supplemental reading:
- Variational Adversarial Active Learning.
Samarth Sinha, Sayna Ebrahimi, and Trevor Darrell.
IEEE/CVF International Conference on Computer Vision (ICCV) 2019.
[PDF]
[Supplemental]
[Code]
[Video]
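As background for the active learning readings, the standard baseline both papers measure against is uncertainty sampling: query the unlabeled examples whose predicted class distribution has the highest entropy. A minimal sketch, with illustrative array shapes and batch size:

```python
import numpy as np

def entropy_query(probs, k=5):
    """probs: (n_unlabeled, n_classes) model predictions; returns the top-k indices."""
    entropy = -np.sum(probs * np.log(probs + 1e-12), axis=1)
    return np.argsort(entropy)[-k:][::-1]   # most uncertain examples first

probs = np.array([[0.98, 0.02], [0.55, 0.45], [0.70, 0.30]])
print(entropy_query(probs, k=2))   # the near-50/50 example comes first
```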
Few-Shot Learning
Nov 12
Prototypical Networks for Few-shot Learning.
Jake Snell, Kevin Swersky, and Richard Zemel.
Neural Information Processing Systems (NeurIPS) 2017.
[PDF]
[Supplemental (Zip)]
[Reviews]
Supplemental reading:
- Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks.
Chelsea Finn, Pieter Abbeel, and Sergey Levine.
International Conference on Machine Learning (ICML) 2017.
[PDF]
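The core computation of the Nov 12 paper fits in a few lines: each class prototype is the mean embedding of its support examples, and queries are classified by (negative squared) Euclidean distance to the prototypes. A minimal sketch, with random vectors standing in for an encoder's embeddings:

```python
import torch

def proto_classify(support, support_labels, query, n_classes):
    """support: (n_s, d) and query: (n_q, d) embeddings from any encoder."""
    prototypes = torch.stack(
        [support[support_labels == c].mean(dim=0) for c in range(n_classes)])
    logits = -torch.cdist(query, prototypes) ** 2   # negative squared Euclidean distance
    return logits.argmax(dim=1)

support = torch.randn(10, 16)
labels = torch.tensor([0] * 5 + [1] * 5)            # a 2-way, 5-shot support set
query = torch.randn(3, 16)
print(proto_classify(support, labels, query, n_classes=2))
```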
Nov 17
Project status report due
Rethinking Few-Shot Image Classification: A Good Embedding Is All You Need?
Yonglong Tian, Yue Wang, Dilip Krishnan, Joshua B. Tenenbaum, and Phillip Isola.
ArXiv 2003.11539 2020.
[PDF]
[Project]
[Code]
Supplemental reading:
- A New Meta-Baseline for Few-Shot Learning.
Yinbo Chen, Xiaolong Wang, Zhuang Liu, Huijuan Xu, and Trevor Darrell.
ArXiv 2003.04390 2020.
[PDF]
[Code]
Zero-Shot Learning
Nov 19
DeViSE: A Deep Visual-Semantic Embedding Model.
Andrea Frome, Greg S. Corrado, Jon Shlens, Samy Bengio, Jeff Dean, Marc'Aurelio Ranzato, and Tomas Mikolov.
Neural Information Processing Systems (NeurIPS) 2013.
[PDF]
[Supplemental (Zip)]
[Reviews]
Supplemental reading:
- Zero-Shot Learning through Cross-Modal Transfer.
Richard Socher, Milind Ganjoo, Christopher D. Manning, and Andrew Y. Ng.
Neural Information Processing Systems (NeurIPS) 2013.
[PDF]
[Reviews]
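The common thread in these zero-shot papers is classification by nearest class embedding in a shared semantic space. A minimal sketch of that decision rule; the random vectors stand in for learned word embeddings and the output of a visual-to-semantic mapping.

```python
import numpy as np

def zero_shot_label(image_embedding, class_embeddings, class_names):
    """Label an image embedding with the most cosine-similar class-name embedding."""
    sims = class_embeddings @ image_embedding
    sims /= (np.linalg.norm(class_embeddings, axis=1)
             * np.linalg.norm(image_embedding))
    return class_names[int(np.argmax(sims))]

rng = np.random.default_rng(0)
classes = ["cat", "dog", "zebra"]             # imagine "zebra" was unseen in training
class_vecs = rng.standard_normal((3, 50))     # stand-in word vectors
image_vec = class_vecs[2] + 0.1 * rng.standard_normal(50)
print(zero_shot_label(image_vec, class_vecs, classes))   # -> "zebra"
```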
Nov 24
Zero-shot Recognition via Semantic Embeddings and Knowledge Graphs.
Xiaolong Wang, Yufei Ye, and Abhinav Gupta.
IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2018.
[PDF]
Supplemental reading:
- Rethinking Knowledge Graph Propagation for Zero-Shot Learning.
Michael Kampffmeyer, Yinbo Chen, Xiaodan Liang, Hao Wang, Yujia Zhang, and Eric P. Xing.
IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2019.
[PDF]
Dec 11
Final project report due
(No class)
Learning Goals
Students who complete this course will:
- Acquire a working knowledge of the landscape of research on machine
learning with limited labeled data.
- Practice identifying and critically assessing the claims,
contributions, and supporting evidence in machine learning research papers.
- Develop their ability to share scientific ideas via writing and discussion.
- Gain practical experience with the course's subject matter by applying and
extending it to their own research interests through an open-ended project.
Grading
The following standards will be used to assign grades.
To Earn an A
- Participate actively in class discussions by asking questions, sharing
opinions, and listening carefully to others.
- Meet all deadlines in the course schedule related to the research project.
- For each assigned reading, submit two discussion questions to Canvas by 6 PM the
evening before class, missing no more than 3 readings.
- Attend class meetings, missing no more than 3 meetings.
- Fulfill the requirements below to earn a B.
To Earn a B
- Conduct an original research project related to course materials and submit
a written report meeting the assignment guidelines.
- Lead the assigned class discussion in a way that demonstrates preparation and inclusive facilitation.
- For each assigned reading, submit two discussion questions to Canvas by 6 PM the
evening before class, missing no more than 6 readings.
- Attend class meetings, missing no more than 6 meetings.
Estimated Time Commitment
Activity | Hours |
Class Meetings | 28 |
Readings | 65 |
Submitting Discussion Questions | 10 |
Preparing to Lead Discussion | 2 |
Project Research | 60+ |
Project Proposal / Status Report | 10 |
Project Final Report | 5 |
Total | 180+ |
General Course Policies
Diversity & Inclusion
The Brown computer science department has made it its mission to create and
sustain a diverse and inclusive environment in which all students, faculty, and
staff can thrive. In this course, that responsibility falls on us all, students and
teaching staff alike. In particular,
Brown's Discrimination and Harassment Policy
applies to all participants.
If you feel you have not been treated in an inclusive
manner by any of the course members, please contact either me (Stephen) or the
department chair (Prof. Cetintemel). Laura Dobler is also available as a resource
for members of underrepresented groups.
Additional resources
are listed on the department's website. We, the computer science department, take all complaints about discrimination, harassment, and other
unprofessional behavior seriously.
In addition, Brown welcomes students from all around the country and the world, and their
unique perspectives enrich our learning community. To empower students whose first
language is not English, an array of support is available on campus, including
language and culture workshops and individual appointments. For more information,
contact the English Language Learning Specialists at
ellwriting@brown.edu.
Academic Integrity
Academic dishonesty will not be tolerated. This includes cheating, lying about
course matters, plagiarism, or helping others commit a violation. Plagiarism
includes reproducing the words of others without both the use of quotation marks
and citation. Students are reminded of the obligations and expectations associated
with the
Brown Academic and Student Conduct Codes.
For project work, feel free to build on third-party software, datasets, or other
resources, as long as you credit them in your report(s) and clearly state what
work is solely your own. As a general policy (for this course and for the rest of
your academic career): if you use any idea, text, code, or data that you did not
create, then cite it.
Accommodations
Brown University is committed to full inclusion of all students. Please inform
me if you have a disability or other condition that might require accommodations
or modification of any of these course procedures. You may email me, come to office
hours, or speak with me after class, and your confidentiality is respected. I
will do whatever I can to support accommodations recommended by SEAS. For more
information contact Student and Employee Accessibility Services
(SEAS)
at 401-863-9588 or SEAS@brown.edu.
Mental Health
Being a student can be very stressful. If you feel you are under too much
pressure or there are psychological issues that are keeping you from performing
well at Brown, I encourage you to contact Brown’s Counseling and Psychological Services
CAPS.
They offer confidential counseling and can provide notes supporting accommodations
for health reasons.
Incomplete Policy
I expect everyone to complete the course on time. However, I understand that
there may be factors beyond your control, such as health problems and family
crises, that prevent you from finishing the course on time. If you feel you cannot
complete the course on time, please discuss with me (Stephen) the possibility
of being given a grade of Incomplete for the course and setting a schedule for
completing the course in the upcoming year.
Thanks to Tom Doeppner, Laura Dobler, and Daniel
Ritchie for borrowed text.