Unsupervised Word Sense Disambiguation Rivaling Supervised Methods

David Yarowsky

1995

1 reference

Abstract

This paper presents an unsupervised learning algorithm for sense disambiguation that, when trained on unannotated English text, rivals the performance of supervised techniques that require time-consuming hand annotations. The algorithm is based on two powerful constraints---that words tend to have one sense per discourse and one sense per collocation---exploited in an iterative bootstrapping procedure. Tested accuracy exceeds 96%.

View Paper PDF DOI

🤖 Machine Learning

1 repository

1 reference

Code References

▶ scikit-learn/scikit-learn

1 file

▶ examples/release_highlights/plot_release_highlights_0_24_0.py

L115

# <https://doi.org/10.3115/981658.981684>`_ can now be used with any

Link copied to clipboard!