Unsupervised Word Sense Disambiguation Rivaling Supervised Methods

David Yarowsky
1995
1 reference

Abstract

This paper presents an unsupervised learning algorithm for sense disambiguation that, when trained on unannotated English text, rivals the performance of supervised techniques that require time-consuming hand annotations. The algorithm is based on two powerful constraints---that words tend to have one sense per discourse and one sense per collocation---exploited in an iterative bootstrapping procedure. Tested accuracy exceeds 96%.

1 repository
1 reference

Code References

â–¶ scikit-learn/scikit-learn
1 file
â–¶ examples/release_highlights/plot_release_highlights_0_24_0.py
1
# <https://doi.org/10.3115/981658.981684>`_ can now be used with any
Link copied to clipboard!