|
Main Page
Literature Survey
Online Thesis
Online
Code-base
WSD Tools
WSD Related
Links
|
- "Introduction to the special issue on word sense
disambiguation: The state of the art.",
Ide, N. and
Véronis, J., 1998. (pdf)
In this paper, history of WSD is presented starting from the 1950's.
It covers the major areas of work and sketches the broad lines of
development in this field.
-
"Five papers onWordNet.", Miller, G.,
Leacock, C., Tengi, R., Bunker, R., and Miller, K.,1990. (ps)
In these 5 papers, information about WordNet structure and details
about the relations between synsets are presented.
-
"Using Syntactic Dependency as Local Context
to Resolve Word Sense Ambiguity", Dekang Lin, 1997. (pdf)
The intuition "Two different words are likely to have similar
meanings if they occur in identical local contexts" is adopted in
this paper. Disambiguation is done based on syntactic dependency and
sense similarity.
-
"Automatic Retrieval and Clustering of Similar Words", Dekang
Lin, 1998. (pdf)
A new methodology for constructing a thesaurus using parsed corpus
is presented in this paper.
-
"Discovering word senses from text",
Patrick Pantel and Dekang Lin, 2002. (pdf)
A clustering algorithm called CBC (Clustering By Committee) that
automatically discovers word senses from text is presented in this
paper. Each cluster that a word belongs to represents one of its
senses. An evaluation methodology for automatically measuring the
precision and recall of discovered senses is also presented.
- "A simple approach to building ensembles of naive bayesian
classifiers for word sense disambiguation." , Ted Pedersen,
2000. (pdf)
This paper presents a corpus-based approach to word sense
disambiguation that builds an ensemble of Naive Bayesian
classifiers. Each Bayesian classifier is based on lexical features
that represent co-occurring words in varying sized windows of
context.
-
"Using measures of semantic relatedness for
word sense disambiguation", S. Patwardhan, S. Banerjee and T.
Pedersen, 2003. (pdf)
This paper generalizes an adapted Lesk algorithm to a method of WSD
based on semantic relatedness.
The work presented in this paper depends on the use of a thesaurus
acquired from raw textual corpora and the WordNet similarity package
to find predominant noun senses automatically.
- "A method for disambiguating word senses in a large corpus.",
B. Gale, K. Church, and D. Yarowsky, 1992, Computers and the
Humanities, 26:415-439.
In this paper
a
variant of Bayes ratio on six ambiguous nouns is presented and the
paper reports 92% accuracy.
- "One sense per discourse." B. Gale, K. Church, and
D. Yarowsky, 1992. (pdf)
Based on the observation that it is extremely unusual to find two or
more senses of a polysemous word in the same discourse, "one sense
per discourse" heuristic is proposed in this paper.
This paper presents an unsupervised learning algorithm for sense
disambiguation. The algorithm is based on two powerful constraints
-one sense per discourse and one sense per collocation- exploited in
an iterative bootstrapping procedure. Tested accuracy exceeds 96%.
This paper describes a supervised algorithm for word
sense disambiguation based on hierarchies of decision lists. This
algorithm supports a useful degree of conditional branching while
minimizing the training data fragmentation typical of decision
trees.
This article presents a comprehensive empirical
exploration and evaluation of a diverse range of data
characteristics which infuence word sense disambiguation
performance. It focuses on a set of 6 core supervised algorithms,
including 3 variants of Bayesian classifiers, a cosine model,
non-hierarchical decision lists, and an extension of the
transformation-based learning model.
- "Comparative experiments on disambiguating word senses: An
illustration of the role of bias in machine learning"
R. Mooney, 1996. (pdf)
This paper describes an experimental comparison of seven different
learning algorithms on the problem of learning to disambiguate the
meaning of a word from context. The algorithms tested include
statistical, neural-network, decision-tree, rule-based, and
case-based classification techniques.
- "Combining classifiers for word sense disambiguation", R.
Florian,S. Cucerzan, C. Schafer, and D. Yarowsky, 2002. (pdf)
R. Florian combined four classifiers namely feature-enhanced Naïve
Bayes, Cosine, bag-of-words Naïve Bayes and non-hierarchical
decision lists, and obtained good results which were shown in this
paper.
- "Exploring automatic word sense
disambiguation with decision lists and the web", E. Agirre, and
D.Martinez, 2000. (pdf)
Supervised learning seems to be stuck because of the knowledge
acquisition bottleneck. In this paper, an in-depth study of the
performance of decision lists on two publicly available corpora and
an additional corpus automatically acquired from the Web is
presented.
-
"Integrating multiple knowledge sources to
disambiguate word sense: An exemplar-based approach", Ng, H. T.
and Lee, H. B., 1996. (pdf)
In this paper, a new approach for word sense disambiguation using an
exemplar-based learning algorithm is presented. This approach
integrates a diverse set of knowledge sources to disambiguate word
sense, including part of speech of neighboring words, morphological
form, the unordered set of surrounding words, local collocations,
and verb-object syntactic relation.
-
"Word
sense disambiguation using statistical methods", Brown, Peter
F., Stephen, D.P., Vincent, J.D.P., and Robert, L.M., 1991. (pdf)
A statistical technique for assigning senses to words is presented
in this paper. An instance of a word is assigned a sense by asking a
question about the context in which the word appears.
The algorithm presented in this paper checks the context surrounding
the target noun against that of previously observed instances and
chooses the sense for which the most evidence is found, where
evidence consists of a set of orthographic, syntactic, and lexical
features.
This paper presents a novel approach to automatically generate
arbitrarily large corpora for word senses. The method is based on
the information provided in WordNet and the information gathered
from the Internet using existing search engines.
In this paper an iterative algorithm which combines WordNet and a
semantic tagged corpus is presented.
-
"A highly accurate
bootstrapping algorithm for word sense disambiguation",
Mihalcea, R. and Moldovan, D., 2001. (pdf)
A bootstrapping algorithm for Word Sense Disambiguation which uses
WordNet and a semantic tagged corpus is presented in this paper.
This paper proposes a generation algorithm that may be used to
automatically create large sense tagged corpora.
The method presented in this paper is based on word similarity and
context similarity measures. Words are considered similar if they
appear in similar contexts; contexts are similar if they contain
similar words.
In this paper the likelihood estimation used
in the thesis is explained.
|