dc.rights.license | In Copyright | en_US |
dc.creator | Davis, Elizabeth Elwyn | |
dc.date.accessioned | 2023-10-20T17:40:23Z | |
dc.date.available | 2023-10-20T17:40:23Z | |
dc.date.created | 2006 | |
dc.identifier | WLURG038_Davis_thesis_2006 | |
dc.identifier.uri | https://dspace.wlu.edu/handle/11021/36365 | |
dc.description.abstract | This paper examines a possible solution to the problem of disambiguating polysemous nouns
in machine translation. Latent Semantic Analysis (LSA) , a statistical method of finding and
representing word sense, is used to differentiate between the different meanings of ambiguous
words according to the given context. A collection of training texts are sorted according
to polysemous word and meaning. A word-by-text matrix is created from this data and
transformed by the LSA method, creating vectors for each text defining it in terms of the
(non-polysemous) words that appear in it. These representations of textual meanings are
compared to the context of an ambiguous word to determine the most similar meaning.
The viability of this LSA model is compared with a simple Bayesian probability model. | en_US |
dc.format.extent | 39 pages | en_US |
dc.language.iso | en_US | en_US |
dc.rights | This material is made available for use in research, teaching, and private study, pursuant to U.S. Copyright law. The user assumes full responsibility for any use of the materials, including but not limited to, infringement of copyright and publication rights of reproduced materials. Any materials used should be fully credited with the source. | en_US |
dc.rights.uri | http://rightsstatements.org/vocab/InC/1.0/ | en_US |
dc.subject.other | Washington and Lee University -- Honors in Computer Science | en_US |
dc.title | Lexical Disambiguation in Machine Translation with Latent Semantic Analysis | en_US |
dc.type | Text | en_US |
dcterms.isPartOf | WLURG038 - Student Papers | en_US |
dc.rights.holder | Davis, Elizabeth Elwyn | en_US |
dc.subject.fast | Latent semantic indexing | en_US |
dc.subject.fast | Semantics -- Data processing | en_US |
dc.subject.fast | Discourse analysis -- Data processing | en_US |
dc.subject.fast | Machine translating | en_US |
local.department | Computer Science | en_US |