32,99 €
inkl. MwSt.
Versandkostenfrei*
Versandfertig in über 4 Wochen
  • Broschiertes Buch

Most common search engines have serious problems returning all the documents which are important to a given user query because they can not disambiguate ambiguous terms or find documents which only include synonyms of the query terms. A promising approach to overcome these shortcomings gives Latent Semantic Indexing (LSI). This indexing scheme uses Singular Value Decomposition (SVD) to reveal the underlying latent semantic structure of documents. The implementation described in this book is a local search engine called Bosse for Wikipedia articles. Four different search types were implemented…mehr

Produktbeschreibung
Most common search engines have serious problems returning all the documents which are important to a given user query because they can not disambiguate ambiguous terms or find documents which only include synonyms of the query terms. A promising approach to overcome these shortcomings gives Latent Semantic Indexing (LSI). This indexing scheme uses Singular Value Decomposition (SVD) to reveal the underlying latent semantic structure of documents. The implementation described in this book is a local search engine called Bosse for Wikipedia articles. Four different search types were implemented which allow to search for documents or terms similar to a given term, query or document. These search types are evaluated and the importance of term weighting, exclusion of non content words and the optimal number of remaining dimension (k) during SVD are discussed. Furthermore, an introduction to Latent Semantic Indexing (LSI) and an explanation of the Singular Value Decomposition (SVD) is given.
Autorenporträt
Geiß, Johanna§studied Computational Linguistics, Near Eastern Archaeology and Jewish Studies at the Ruprecht-Karls-Universität Heidelberg and at the College of Jewish Studies Heidelberg. She worked as Software Developer at SAP AG, Walldorf and is now a PhD student at the University of Cambridge, UK.