Research: Project Highlights & References

Papers & code for selected research projects. See my complete publication list for more detail.

Memoized Variational Inference & Bayesian Nonparametrics


Hierarchical Dirichlet processes (HDPs) lead to Bayesian nonparametric mixture models, topic models, temporal models, and relational models. We develop a scalable family of variational inference algorithms that allows the number of clusters/topics/states/communities to be adapted online as data is observed.

  • arXiv 2016: Sparse posteriors for scalable learning of high-dimensional models.
  • NIPS 2015: Memoized variational inference for HDP hidden Markov models.
  • AIStats 2015: Memoized variational inference for HDP topic models.
  • NIPS 2013: Memoized variational inference for DP mixture models.
  • NIPS 2013: Stochastic variational inference for HDP mixed membership relational models.
  • NIPS 2012: Stochastic variational inference for HDP topic models.
  • BNPy: Bayesian nonparametric machine learning software in Python.

3D Object Detection & Scene Understanding

We use RGB-D images to learn contextual relationships between object categories and the 3D layout of indoor scenes. Our cloud of oriented gradient (COG) descriptor links the 2D appearance and 3D pose of object categories, accounting for perspective projection to produce state-of-the-art object detectors.

  • CVPR 2016: Contextual cascades for indoor scene understanding from COG descriptors.

Diverse Particle Max-Product

A fusion of max-product belief propagation and particle filters which reliably finds modes of continuous posterior distributions. A submodular optimization algorithm replaces the fragile stochastic resampling of standard particle methods. Applications include human pose estimation and protein structure prediction.

  • ICML 2015: Diverse PMP for loopy graphs, applied to optical flow & protein side-chain prediction.
  • ICML 2014: Diverse PMP for tree-structured graphs, applied to articulated human pose estimation.
  • Software: J. Pacheco's Matlab code for black-box inference via diverse particle max-product.

Vertically Integrated Global Seismic Monitoring

2014 ISBA Mitchell Prize for Bayesian analysis of an important applied problem

The automated processing of multiple seismic signals to detect and localize seismic events is a central tool in both geophysics and nuclear treaty verification. Our Bayesian seismic monitoring system, NET-VISA, is learned from historical data provided by the UN preparatory commission for the comprehensive nuclear-test-ban treaty organization (CTBTO). We reduce the number of missed events by 60%.

Layered Video Segmentation & Motion Estimation

Layered models simultaneously segment scenes into regions of coherent structure and estimate dense motion (optical flow) fields. By explicitly modeling occlusion relationships, and designing sophisticated CRF priors, we achieve state-of-the-art motion estimates and interpretable segmentations.

  • CVPR 2015: Layered estimation of 3D motion (scene flow) from RGB-D videos.
  • CVPR 2013: A densely-connected layered prior for segmenting foreground & background motion.
  • CVPR 2012: Discrete optimization for effective inference of multiple flow layers.
  • NIPS 2010: Original formulation of layered optical flow model for image sequences.
  • CVPR 2012 & NIPS 2008: Layered Pitman-Yor processes for static image segmentation.
  • Software: D. Sun's Matlab code for dense motion estimation and layered segmentation.

Distance-Dependent Hierarchical Clustering

The distance dependent Chinese restaurant process (ddCRP) is a flexible nonparametric prior for data clustering. Using a hierarchical generalization of the ddCRP and spatio-temporal distance measures, we use MCMC learning to segment text, image, video, and 3D mesh data.

  • UAI 2014: A hierarchical ddCRP for grouped data is applied to video & discourse segmentation.
  • NIPS 2012: Deformation-based 3D mesh segmentation applied to human motion analysis.
  • NIPS 2011: Natural image segmentation with spatial ddCRP models.
  • Software: S. Ghosh's Matlab code for deformation-based segmentation of 3D mesh data.

Doubly Correlated Nonparametric Topic & Relational Models

A stick-breaking representation allows Bayesian nonparametric learning of an unbounded number of topics from text data, or communities from network data. We learn correlations in topic/community membership, and better predict relationships by exploiting document/entity metadata.

  • ICML 2012: Nonparametric, metadata-dependent stochastic block model of relational data.
  • NIPS 2011: Nonparametric topic model capturing metadata and correlations in topic usage.
  • Software: D. Kim's Matlab code for MCMC learning of doubly correlated topic models.

Beta Process Hidden Markov Models

Using infinite feature representations derived from the beta process, this Bayesian nonparametric model discovers a set of latent dynamical behaviors, and uses them to segment a library of observed time series. Applications include human activity understanding from video or motion capture data.

Hierarchical Dirichlet Process Hidden Markov Models

The sticky hierarchical Dirichlet process HMM allows an unbounded number of latent states to be learned from unlabeled sequential data. By capturing the "sticky" temporal persistence of real dynamical states we learn improved models of financial indices, human speech, and honeybee dances.

Transformed Dirichlet Process Models of Visual Scenes

Hierarchical probabilistic models for objects, the parts composing them, and the visual scenes surrounding them. We capture geometric context via spatial transformations, and use hierarchical Dirichlet processes to learn new objects and parts from partially labeled images or stereo pairs.

Nonparametric Belief Propagation

Nonparametric belief propagation (NBP) generalizes discrete BP to graphical models with non-Gaussian continuous variables, using sample-based marginal approximations inspired by particle filters. Applications include kinematic tracking of visual motion and distributed localization in sensor networks.

  • Communications of the ACM 2010: A CACM research highlight introduced by Weiss & Pearl.
  • CVPR 2003: Defines the NBP algorithm and applies to a part-based model of facial appearance. CVPR oral presentation.
  • NIPS 2003: Efficient NBP message updates via multiscale, KD-tree representations.
  • GMBV 2004 & NIPS 2004: Occlusion-sensitive tracking of articulated hand motion from videos. NIPS oral presentation.
  • MIT PhD Thesis, May 2006: Additional technical details on 3D orientation estimation for tracking.
  • IROS 2009: Tracking networks of mobile robots using noisy distance measurements.
  • Software: A. Ihler's Matlab and C++ code for multiscale kernel density estimation and sampling.

Embedded Trees & Gaussian Graphical Models

An inference algorithm for Gaussian graphical models that exploits embedded, tree-structured graphs. Generalizing loopy belief propagation, our embedded trees algorithm not only rapidly computes posterior means, but also correctly estimates posterior variances (uncertainties).