Papers - PaperGrep

Permutation Tests for Studying Classifier Performance

2010

1 reference

We explore the framework of permutation-based p-values for assessing the performance of classifiers. In this paper we study two simple permutation tests. The first test assess whether the classifier has found a real class structure in the data; the c...

View Paper DOI

Printing floating-point numbers quickly and accurately with integers

Florian Loitsch

2010

1 reference

We present algorithms for accurately converting floating-point numbers to decimal representation. They are fast (up to 4 times faster than commonly used algorithms that use high-precision integers) and correct: any printed number will evaluate to the...

View Paper DOI

Three layer cake for shared-memory programming

A. Robison, Ralph E. Johnson

2010

1 reference

There are many different styles of parallel programming for shared-memory hardware. Each style has strengths, but can conflict with other styles. How can we use a variety of these styles in one program and minimize their conflict and maximize perform...

View Paper DOI

An experimental comparison of performance measures for classification

Cèsar Ferri, José Hernández‐Orallo, R. Modroiu

2009

1 reference

View Paper DOI

Feature hashing for large scale multitask learning.

Kilian Q. Weinberger, Anirban Dasgupta, John Langford, Alex Smola, Josh Attenberg

2009

1 reference

Empirical evidence suggests that hashing is an effective strategy for dimensionality reduction and practical nonparametric estimation. In this paper we provide exponential tail bounds for feature hashing and show that the interaction between random s...

View Paper DOI

Five Balltree Construction Algorithms

S. Omohundro

2009

1 reference

View Paper

Performance Evaluation of RANSAC Family.

Sunglok Choi, Taemin Kim, Wonpil Yu

2009

1 reference

RANSAC (Random Sample Consensus) has been popular in regression problem with samples contaminated with outliers. It has been a milestone of many researches on robust estimators, but there are a few survey and performance analysis on them. This paper ...

View Paper DOI

Rank-Balanced Trees

Bernhard Haeupler, Siddhartha Sen, Robert E. Tarjan

2009

1 reference

Since the invention of AVL trees in 1962, many kinds of binary search trees have been proposed. Notable are red-black trees, in which bottom-up rebalancing after an insertion or deletion takes O(1) amortized time and O(1) rotations worst-case. But th...

View Paper DOI

Shrinkage Algorithms for MMSE Covariance Estimation

Yilun Chen, Ami Wiesel, Yonina C. Eldar, Alfred O. Hero

2009

1 reference

We address covariance estimation in the sense of minimum mean-squared error (MMSE) for Gaussian samples. Specifically, we consider shrinkage methods which are suitable for high dimensional problems with a small number of samples (large p small n). Fi...

View Paper PDF DOI

Work-first and help-first scheduling policies for async-finish task parallelism

Yi Guo, Rajkishore Barik, Raghavan Raman, Vivek Sarkar

2009

1 reference

Multiple programming models are emerging to address an increased need for dynamic task parallelism in applications for multicore processors and shared-address-space parallel computing. Examples include OpenMP 3.0, Java Concurrency Utilities, Microsof...

View Paper DOI

Dynamic programming strikes back

G. Moerkotte, Thomas Neumann

2008

1 reference

Two highly efficient algorithms are known for optimally ordering joins while\navoiding cross products:\nDPccp, which is based on dynamic programming, and Top-Down Partition Search, \nbased on memoization.\nBoth have two severe limitations:\nThey hand...

View Paper PDF DOI

Emulation of a FMA and Correctly Rounded Sums: Proved Algorithms Using Rounding to Odd

S. Boldo, G. Melquiond

2008

1 reference

International audience

View Paper PDF DOI

Incremental Learning for Robust Visual Tracking

David A. Ross, Jongwoo Lim, Ruei-Sung Lin, Ming–Hsuan Yang

2008

1 reference

View Paper DOI

Inter-Coder Agreement for Computational Linguistics

2008

1 reference

This article is a survey of methods for measuring agreement among corpus annotators. It exposes the mathematics and underlying assumptions of agreement coefficients, covering Krippendorff's alpha as well as Scott's pi and Cohen's kappa; discusses the...

View Paper DOI

MapReduce: simplified data processing on large clusters

Jay B. Dean, Sanjay Ghemawat

2008

1 reference

MapReduce is a programming model and an associated implementation for processing and generating large datasets that is amenable to a broad variety of real-world tasks. Users specify the computation in terms of a map and a reduce function, and the und...

View Paper PDF DOI

Packrat parsers can support left recursion.

Alessandro Warth, J. R. Douglass, Todd Millstein

2008

1 reference

Packrat parsing offers several advantages over other parsing techniques, such as the guarantee of linear parse times while supporting backtracking and unlimited look-ahead. Unfortunately, the limited support for left recursion in packrat parser imple...

View Paper DOI

Sparse inverse covariance estimation with the graphical lasso.

Jerome H. Friedman, Trevor Hastie, Robert Tibshirani

2008

1 reference

Abstract We consider the problem of estimating sparse graphs by a lasso penalty applied to the inverse covariance matrix. Using a coordinate descent procedure for the lasso, we develop a simple algorithm—the graphical lasso—that is remarkably fast: I...

View Paper PDF DOI

The keyed-hash message authentication code (HMAC)

2008

1 reference

This Standard describes a keyed-hash message authentication code (HMAC), a mechanism for message authentication using cryptographic hash functions. HMAC can be used with any iterative Approved cryptographic hash function, in combination with a shared...

View Paper DOI

Visualizing Data using t-SNE

L. Maaten, Geoffrey E. Hinton

2008

1 reference

Tese de mestrado integrado. Engenharia Electrotécnica e de Computadores. Faculdade de Engenharia. Universidade do Porto. 2008

View Paper PDF DOI

Gaussian random number generators

David B. Thomas, W. Luk, P. Leong, J. Villasenor

2007

1 reference

Rapid generation of high quality Gaussian random numbers is a key capability for simulations across a wide range of disciplines. Advances in computing have brought the power to conduct simulations with very large numbers of random numbers and with it...

View Paper PDF DOI