Disentangling factors of variation in deep representation using adversarial training

Improved Techniques for Training GANs

An Online Sequence-to-Sequence Model Using Partial Conditioning

Professor Forcing: A New Algorithm for Training Recurrent Networks

Can Active Memory Replace Attention?

On Multiplicative Integration with Recurrent Neural Networks

Architectural Complexity Measures of Recurrent Neural Networks

Reward Augmented Maximum Likelihood for Neural Structured Prediction

Swapout: Learning an ensemble of deep architectures

Deep ADMM-Net for Compressive Sensing MRI

Scan Order in Gibbs Sampling: Models in Which it Matters and Bounds on How Much

Exploring Models and Data for Image Question Answering

Winner-Take-All Autoencoders

End-To-End Memory Networks

Stop Wasting My Gradients: Practical SVRG

Spatial Transformer Networks

Inverse Reinforcement Learning with Locally Consistent Reward Functions

Teaching Machines to Read and Comprehend

Bandits with Unobserved Confounders: A Causal Approach

Multi-Task Bayesian Optimization

Predicting Parameters in Deep Learning

Memory Limited, Streaming PCA

Analyzing the Harmonic Structure in Graph-Based Learning

Distributed representations of words and phrases and their compositionality

The Fast Convergence of Incremental PCA

Matrix factorization with binary components

Learning to Pass Expectation Propagation Messages

Robust Low Rank Kernel Embeddings of Multivariate Distributions

Fast Algorithms for Gaussian Noise Invariant Independent Component Analysis

Multi-Prediction Deep Boltzmann Machines

Stochastic Ratio Matching of RBMs for Sparse High-Dimensional Inputs

Fast Convergence of Regularized Learning in Games

Competitive Distribution Estimation: Why is Good-Turing Good

A* Sampling

Asymmetric LSH (ALSH) for Sublinear Time Maximum Inner Product Search (MIPS)

Scalable Influence Estimation in Continuous-Time Diffusion Networks

Submodular Optimization with Submodular Cover and Submodular Knapsack Constraints

A memory frontier for complex synapses

Optimal Neural Population Codes for High-dimensional Stimulus Variables

Correlations strike back (again): the case of associative memory retrieval

Variational Inference for Mahalanobis Distance Metrics in Gaussian Process Regression

One-shot learning and big data with n=2

Summary Statistics for Partitionings and Feature Allocations

Actor-Critic Algorithms for Risk-Sensitive MDPs

What Are the Invariant Occlusive Components of Image Patches? A Probabilistic Generative Approach

Decision Jungles: Compact and Rich Models for Classification

Density estimation from unweighted k-nearest neighbor graphs: a roadmap

Variational Policy Search via Trajectory Optimization

A simple example of Dirichlet process mixture inconsistency for the number of components

Training and Analysing Deep Recurrent Neural Networks

Variance Reduction for Stochastic Gradient Optimization

Sparse Additive Text Models with Low Rank Background

Deep Fisher Networks for Large-Scale Image Classification

Causal Inference on Time Series using Restricted Structural Equation Models

More data speeds up training time in learning halfspaces over sparse vectors

Transportability from Multiple Environments with Limited Experiments: Completeness Results

Robust Multimodal Graph Matching: Sparse Coding Meets Graph Matching

Modeling Clutter Perception using Parametric Proto-object Partitioning

PAC-Bayes-Empirical-Bernstein Inequality

Point Based Value Iteration with Optimal Belief Compression for Dec-POMDPs

On Decomposing the Proximal Map

Polar Operators for Structured Sparse Estimation

Generalized Random Utility Models with Multiple Types

Provable Subspace Clustering: When LRR meets SSC

Bayesian optimization explains human active search

Transfer Learning in a Transductive Setting

Data-driven Distributionally Robust Polynomial Optimization

Latent Maximum Margin Clustering

Reciprocally Coupled Local Estimators Implement Bayesian Information Integration Distributively

Documents as multiple overlapping windows into grids of counts

The Randomized Dependence Coefficient

Bayesian Active Model Selection with an Application to Automated Audiometry

Training Very Deep Networks

Particle Gibbs for Infinite Hidden Markov Models

A Bayesian Framework for Modeling Confidence in Perceptual Decision Making

Path-SGD: Path-Normalized Optimization in Deep Neural Networks

DeViSE: A Deep Visual-Semantic Embedding Model

Generalized Denoising Auto-Encoders as Generative Models

Deep Convolutional Neural Network for Image Deconvolution

Exploiting Linear Structure Within Convolutional Networks for Efficient Evaluation

Two-Stream Convolutional Networks for Action Recognition in Videos

Communication Efficient Distributed Machine Learning with the Parameter Server

Semi-Separable Hamiltonian Monte Carlo for Inference in Bayesian Hierarchical Models

Kernel Mean Estimation via Spectral Filtering

Inferring Algorithmic Patterns with Stack-Augmented Recurrent Nets

Probabilistic Line Searches for Stochastic Optimization

Fast and Accurate Inference of Plackett-Luce Models

Color Constancy by Learning to Predict Chromaticity from Luminance

Bayesian Manifold Learning: The Locally Linear Latent Variable Model (LL-LVM)

Unlocking neural population non-stationarities using hierarchical dynamics models

On the Pseudo-Dimension of Nearly Optimal Auctions

Galileo: Perceiving Physical Object Properties by Integrating a Physics Engine with Deep Learning

Smooth Interactive Submodular Set Cover

A Convergent Gradient Descent Algorithm for Rank Minimization and Semidefinite Programming from Random Linear Measurements

Space-Time Local Embeddings

Parallel Correlation Clustering on Big Graphs

Expressing an Image Stream with a Sequence of Natural Sentences

Planar Ultrametrics for Image Segmentation

Logarithmic Time Online Multiclass prediction

Robust Portfolio Optimization

