Papers - PaperGrep

Language Modeling with Gated Convolutional Networks

Yann N. Dauphin, Angela Fan, Michael Auli, David Grangier

2016

2 references

The predominant approach to language modeling to date is based on recurrent neural networks. Their success on this task is often linked to their ability to capture unbounded context. In this paper we develop a finite context approach through stacked...


Wav2Letter: an End-to-End ConvNet-based Speech Recognition System

Ronan Collobert, Christian Puhrsch, Gabriel Synnaeve

2016

2 references

This paper presents a simple end-to-end model for speech recognition, combining a convolutional network based acoustic model and a graph decoding. It is trained to output letters, with transcribed speech, without the need for force alignment of phone...


Cyclical Learning Rates for Training Neural Networks

Leslie N. Smith

2015

2 references

It is known that the learning rate is the most important hyper-parameter to tune for training deep neural networks. This paper describes a new method for setting the learning rate, named cyclical learning rates, which practically eliminates the need ...


Design of the 2015 ChaLearn AutoML challenge

Isabelle Guyon, Kristin P. Bennett, Gavin C. Cawley, Hugo Jair Escalante, Sérgio Escalera, Tin Kam H...

2015

2 references

ChaLearn is organizing the Automatic Machine Learning (AutoML) contest for IJCNN 2015, which challenges participants to solve classification and regression problems without any human intervention. Participants' code is automatically run on the contes...


ExSTraCS 2.0: description and evaluation of a scalable learning classifier system

Ryan J. Urbanowicz, Jason H. Moore

2015

2 references


Fast R-CNN

Ross Girshick

2015

2 references

This paper proposes a Fast Region-based Convolutional Network method (Fast R-CNN) for object detection. Fast R-CNN builds on previous work to efficiently classify object proposals using deep convolutional networks. Compared to previous work, Fast R-C...


Gorilla: A Fast, Scalable, In-Memory Time Series Database

Tuomas Pelkonen, Scott Franklin, Paul Cavallaro, Qi Huang, Justin Meza, J. Teller, K. Veeraraghavan

2015

2 references

Large-scale internet services aim to remain highly available and responsive in the presence of unexpected failures. Providing this service often requires monitoring and analyzing tens of millions of measurements per second across a large number of sy...


Gradient Estimation Using Stochastic Computation Graphs

John Schulman, Nicolas Heess, Theophane Weber, Pieter Abbeel

2015

2 references

In a variety of problems originating in supervised, unsupervised, and reinforcement learning, the loss function is defined by an expectation over a collection of random variables, which might be part of a probabilistic model or the external world. Es...


Multi-Scale Context Aggregation by Dilated Convolutions

Fisher Yu, Vladlen Koltun

2015

2 references

State-of-the-art models for semantic segmentation are based on adaptations of convolutional networks that had originally been designed for image classification. However, dense prediction and image classification are structurally different. In this wo...


Self-Organizing Feature Maps Identify Proteins Critical to Learning in a Mouse Model of Down Syndrome

C. Higuera, K. Gardiner, K. Cios

2015

2 references

Down syndrome (DS) is a chromosomal abnormality (trisomy of human chromosome 21) associated with intellectual disability and affecting approximately one in 1000 live births worldwide. The overexpression of genes encoded by the extra copy of a normal ...


SSD: Single Shot MultiBox Detector

W. Liu, Dragomir Anguelov, D. Erhan, Christian Szegedy, Scott E. Reed, Cheng-Yang Fu, A. Berg

2015

2 references

We present a method for detecting objects in images using a single deep neural network. Our approach, named SSD, discretizes the output space of bounding boxes into a set of default boxes over different aspect ratios and scales per feature map locati...


A Fast, Minimal Memory, Consistent Hash Algorithm

John Lamping, Eric Veach

2014

2 references

We present jump consistent hash, a fast, minimal memory, consistent hash algorithm that can be expressed in about 5 lines of code. In comparison to the algorithm of Karger et al., jump consistent hash requires no storage, is faster, and does a better...


A modified ziggurat algorithm for generating exponentially- and normally-distributed pseudorandom numbers

Christopher D McFarland

2014

2 references

The Ziggurat Algorithm is a very fast rejection sampling method for generating PseudoRandom Numbers (PRNs) from common statistical distributions. The algorithm divides a distribution into rectangular layers that stack on top of each other (resembling...


An implementation of a randomized algorithm for principal component analysis

Mehdi Soufifar

2014

2 references

This thesis addresses the language recognition problem with a special focus on phonotactic language recognition. A full description of different steps in a language recognition system is provided. We study state-of-the-art speech modeling techniques ...


Dropout: a simple way to prevent neural networks from overfitting

Nitish Srivastava, Geoffrey E. Hinton, A. Krizhevsky, I. Sutskever, R. Salakhutdinov

2014

2 references


Fast splittable pseudorandom number generators

G. Steele, D. Lea, Christine H. Flood

2014

2 references

We describe a new algorithm SplitMix for an object-oriented and splittable pseudorandom number generator (PRNG) that is quite fast: 9 64-bit arithmetic/logical operations per 64 bits generated. A conventional linear PRNG object provides a generate me...


More scalable ordered set for ETS using adaptation

Konstantinos Sagonas, Kjell Winblad

2014

2 references

The Erlang Term Storage (ETS) is a key component of the runtime system and standard library of Erlang/OTP. In particular, on big multicores, the performance of many applications that use ETS as a shared key-value store heavily depends on the scalabil...


Random Walk Initialization for Training Very Deep Feedforward Networks

David Sussillo, L. F. Abbott

2014

2 references

Training very deep networks is an important open problem in machine learning. One of many difficulties is that the norm of the back-propagated error gradient can grow or decay exponentially. Here we show that training very deep feed-forward networks ...


Recurrent Neural Network Regularization

Wojciech Zaremba, Ilya Sutskever, Oriol Vinyals

2014

2 references

We present a simple regularization technique for Recurrent Neural Networks (RNNs) with Long Short-Term Memory (LSTM) units. Dropout, the most successful technique for regularizing neural networks, does not work well with RNNs and LSTMs. In this paper...


SAX-PAC (Scalable And eXpressive PAcket Classification)

Kirill Kogan, Sergey Nikolenko, Ori Rottenstreich, William Culhane, Patrick Eugster

2014

2 references

Efficient packet classification is a core concern for network services. Traditional multi-field classification approaches, in both software and ternary content-addressable memory (TCAMs), entail tradeoffs between (memory) space and (lookup) time. T...
