🤖

Machine Learning

Machine learning frameworks, algorithms, and training systems

Repositories

(7)

Papers

(373)

Showing 20 of 373 papers

Accelerated Hierarchical Density Based Clustering

Leland McInnes, John Healy

2017

2 references

We present an accelerated algorithm for hierarchical density based clustering. Our new algorithm improves upon HDBSCAN*, which itself provided a significant qualitative improvement over the popular DBSCAN algorithm. The accelerated HDBSCAN* algorithm...

View Paper PDF DOI

A Comparative Analysis of Community Detection Algorithms on Artificial Networks

Zhao Yang, René Algesheimer, Claudio J. Tessone

2016

3 references

Many community detection algorithms have been developed to uncover the mesoscopic properties of complex networks. However how good an algorithm is, in terms of accuracy and computing time, remains still open. Testing algorithms on real-world network ...

View Paper PDF DOI

A comparison of event models for naive bayes text classification

A. McCallum, K. Nigam

1998

2 references

Article Free Access Share on Distributional clustering of words for text classification Authors: L. Douglas Baker School of Computer Science, Carnegie Mellon University, Pittsburgh, PA and Just Research 4616 Henry Street, Pittsburgh, PA School of Com...

View Paper DOI

A New Vector Partition of the Probability Score

A. H. Murphy

1973

1 reference

A new vector partition of the probability, or Brier, score (PS) is formulated and the nature and properties of this partition are described. The relationships between the terms in this partition and the terms in the original vector partition of the P...

View Paper PDF DOI

Automated optimized parameters for T-distributed stochastic neighbor embedding improve visualization and analysis of large datasets

Anna C. Belkina, Chris Ciccolella, R. Anno, Richard L. Halpert, Josef Spidlen, J. Snyder-Cappione

2019

1 reference

Accurate and comprehensive extraction of information from high-dimensional single cell datasets necessitates faithful visualizations to assess biological populations. A state-of-the-art algorithm for non-linear dimension reduction, t-SNE, requires mu...

View Paper PDF DOI

Automatic model construction with Gaussian processes

David Duvenaud

2014

4 references

This thesis develops a method for automatically constructing, visualizing and describing a large class of models, useful for forecasting and finding structure in domains such as time series, geological formations, and physical dynamics. These models,...

View Paper PDF DOI

Cluster Ensembles --- A Knowledge Reuse Framework for Combining Multiple Partitions.

Alexander Vasil’ev

2002

2 references

View Paper DOI

Density-Based Clustering Based on Hierarchical Density Estimates

Ricardo J. G. B. Campello, Davoud Moulavi, Jörg Sander

2013

2 references

View Paper DOI

Finding frequent items in data streams

M. Charikar, Kevin C. Chen, Martín Farach-Colton

2002

2 references

View Paper PDF DOI

Information theoretic measures for clusterings comparison: is a correction for chance necessary?

X. Nguyen, J. Epps, J. Bailey

2009

2 references

Information theoretic based measures form a fundamental class of similarity measures for comparing clusterings, beside the class of pair-counting based and set-matching based measures. In this paper, we discuss the necessity of correction for chance ...

View Paper DOI

Information Theoretic Measures for Clusterings Comparison: Variants, Properties, Normalization and Correction for Chance.

Nguyễn Xuân Vinh, Julien Epps, James Bailey

2010

3 references

Information theoretic measures form a fundamental class of measures for comparing clusterings, and have recently received increasing interest. Nevertheless, a number of questions concerning their properties and inter-relationships remain unresolved. ...

View Paper DOI

k-means++: the advantages of careful seeding

David Arthur, Sergei Vassilvitskii

2007

1 reference

View Paper PDF DOI

LIBLINEAR: A Library for Large Linear Classification

Jacob Fish, Rong Fan

2008

4 references

AbstractWe present a generalization of the classical mathematical homogenization theory aimed at accounting for finite unit cell distortions, which gives rise to a nonperiodic asymptotic expansion. We introduce an auxiliary macro‐deformed configurati...

View Paper DOI

Modern multidimensional scaling : theory and applications

Ingwer Borg, Patrick J. F. Groenen

1997

5 references

View Paper PDF DOI

More on Multidimensional Scaling and Unfolding in R: smacof Version 2.

Patrick Mair, Patrick J. F. Groenen, Jan de Leeuw

2022

1 reference

The smacof package offers a comprehensive implementation of multidimensional scaling (MDS) techniques in R. Since its first publication (De Leeuw and Mair 2009b) the functionality of the package has been enhanced, and several additional methods, feat...

View Paper PDF DOI

Sparse inverse covariance estimation with the graphical lasso.

Jerome H. Friedman, Trevor Hastie, Robert Tibshirani

2008

1 reference

Abstract We consider the problem of estimating sparse graphs by a lasso penalty applied to the inverse covariance matrix. Using a coordinate descent procedure for the lasso, we develop a simple algorithm—the graphical lasso—that is remarkably fast: I...

View Paper PDF DOI

Statistical Foundations of Actuarial Learning and its Applications

Mario V. Wuthrich, M. Merz

2021

1 reference

The aim of this manuscript is to provide the mathematical and statistical foundations of actuarial learning. This is key to most actuarial tasks like insurance pricing, product development, claims reserving and risk management. The basic approach to ...

View Paper PDF DOI

The Optimality of Naive Bayes.

Harry Zhang

2004

1 reference

View Paper

Visualizing Data using t-SNE

L. Maaten, Geoffrey E. Hinton

2008

1 reference

Tese de mestrado integrado. Engenharia Electrotécnica e de Computadores. Faculdade de Engenharia. Universidade do Porto. 2008

View Paper PDF DOI

V-Measure: A Conditional Entropy-Based External Cluster Evaluation Measure

N.H. Bergboer

2007

1 reference

As it is not known a priori which size of the context region around the object yields to most useful information, we pose a second research question.Research question 2 (RQ2): What size of the context region is best suited to lower the false-detectio...

View Paper PDF DOI

Previous Page 15 of 19 Next

Machine Learning

Repositories

huggingface/transformers

microsoft/onnxruntime

mlflow/mlflow

pytorch/pytorch

ray-project/ray

scikit-learn/scikit-learn

tensorflow/tensorflow

Papers