Papers - PaperGrep

TVM: An Automated End-to-End Optimizing Compiler for Deep Learning

Tianqi Chen, Thierry Moreau, Ziheng Jiang, Lianmin Zheng, Eddie Yan, Meghan Cowan, Haichen Shen, Ley...

2018

3 references

There is an increasing need to bring machine learning to a wide diversity of hardware devices. Current frameworks rely on vendor-specific operator libraries and optimize for a narrow range of server-class GPUs. Deploying workloads to new platforms --...

View Paper PDF

Accurate, Large Minibatch SGD: Training ImageNet in 1 Hour

Priya Goyal, Piotr Dollár, Ross Girshick, Pieter Noordhuis, Lukasz Wesolowski, Aapo Kyrola, Andrew T...

2017

3 references

Deep learning thrives with large neural networks and large datasets. However, larger networks and larger datasets result in longer training times that impede research and development progress. Distributed synchronous SGD offers a potential solution t...

View Paper PDF DOI

MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications

Dmitriy Anisimov, Tatiana Khanova

2017

3 references

We present a class of efficient models called MobileNets for mobile and embedded vision applications. MobileNets are based on a streamlined architecture that uses depth-wise separable convolutions to build light weight deep neural networks. We introd...

View Paper PDF DOI

Rainbow: Combining Improvements in Deep Reinforcement Learning

Matteo Hessel, Joseph Modayil, Hado van Hasselt, Tom Schaul, Georg Ostrovski, Will Dabney, Dan Horga...

2017

3 references

The deep reinforcement learning community has made several independent improvements to the DQN algorithm. However, it is unclear which of these extensions are complementary and can be fruitfully combined. This paper examines six extensions to the DQN...

View Paper PDF

RLlib: Abstractions for Distributed Reinforcement Learning

Eric Liang, Richard Liaw, Philipp Moritz, Robert Nishihara, Roy Fox, Ken Goldberg, Joseph E. Gonzale...

2017

3 references

Reinforcement learning (RL) algorithms involve the deep nesting of highly irregular computation patterns, each of which typically exhibits opportunities for distributed computation. We argue for distributing RL components in a composable way by adapt...

View Paper PDF

A Comparative Analysis of Community Detection Algorithms on Artificial Networks

Zhao Yang, René Algesheimer, Claudio J. Tessone

2016

3 references

Many community detection algorithms have been developed to uncover the mesoscopic properties of complex networks. However how good an algorithm is, in terms of accuracy and computing time, remains still open. Testing algorithms on real-world network ...

View Paper PDF DOI

On Multiplicative Integration with Recurrent Neural Networks

Yuhuai Wu, Saizheng Zhang, Ying Zhang, Yoshua Bengio, Ruslan Salakhutdinov

2016

3 references

We introduce a general and simple structural design called Multiplicative Integration (MI) to improve recurrent neural networks (RNNs). MI changes the way in which information from difference sources flows and is integrated in the computational build...

View Paper PDF DOI

Neural Machine Translation by Jointly Learning to Align and Translate

Dzmitry Bahdanau, Kyunghyun Cho, Yoshua Bengio

2014

3 references

Neural machine translation is a recently proposed approach to machine translation. Unlike the traditional statistical machine translation, the neural machine translation aims at building a single neural network that can be jointly tuned to maximize t...

View Paper PDF

On Using Very Large Target Vocabulary for Neural Machine Translation

Sébastien Jean, Kyunghyun Cho, Roland Memisevic, Yoshua Bengio

2014

3 references

Neural machine translation, a recently proposed approach to machine translation based purely on neural networks, has shown promising results compared to the existing approaches such as phrase-based statistical machine translation. Despite its recent ...

View Paper PDF DOI

Calibration of Machine Learning Models

Antonio Bella, Cèsar Ferri, José Hernández‐Orallo, Marïa José Ramírez-Quintana

2012

3 references

The evaluation of machine learning models is a crucial step before their application because it is essential to assess how well a model will behave for every single case. In many real applications, not only is it important to know the “total” or the ...

View Paper PDF DOI

Macros that Work Together

Matthew Flatt, Ryan Culpepper, David Darais, Robert Bruce Findler

2012

3 references

Abstract Racket is a large language that is built mostly within itself. Unlike the usual approach taken by non-Lisp languages, the self-hosting of Racket is not a matter of bootstrapping one implementation through a previous implementation, but inste...

View Paper PDF DOI

Algorithms for geodesics

Charles F. F. Karney

2011

3 references

Algorithms for the computation of geodesics on an ellipsoid of revolution are given. These provide accurate, robust, and fast solutions to the direct and inverse geodesic problems and they allow differential and integral properties of geodesics to be...

View Paper PDF DOI

Reordering columns for smaller indexes.

Daniel Lemire, Owen Kaser

2011

3 references

View Paper PDF DOI

Information Theoretic Measures for Clusterings Comparison: Variants, Properties, Normalization and Correction for Chance.

Nguyen Xuan Vinh, Julien Epps

2010

3 references

View Paper DOI

On classification, ranking, and probability estimation.

H. Andersson

2007

3 references

The aim of this thesis was the study of respiration in ocean margin \nsediments and the assessments of tools needed for this purpose.\n \n \nThe first study was on the biological pump and global respiration \npatterns in the deep ocean using an empir...

View Paper PDF DOI

On the “degrees of freedom” of the lasso

2007

3 references

We study the effective degrees of freedom of the lasso in the framework of Stein’s unbiased risk estimation (SURE). We show that the number of nonzero coefficients is an unbiased estimate for the degrees of freedom of the lasso—a conclusion that requ...

View Paper DOI

Random Features for Large-Scale Kernel Machines

A. Rahimi, B. Recht

2007

3 references

In this paper, we contributed a stereo face recognition formulation which combines appearance and disparity/depth at feature level. We showed that the present-day passive stereovision in combination with 2D appearance images can match up to other met...

View Paper PDF DOI

The relationship between Precision-Recall and ROC curves.

Jesse Davis, Mark Goadrich

2006

3 references

Receiver Operator Characteristic (ROC) curves are commonly used to present results for binary decision problems in machine learning. However, when dealing with highly skewed datasets, Precision-Recall (PR) curves give a more informative picture of an...

View Paper PDF DOI