  • Paperback

Product description
With the enormous growth of digital information, advanced ways of processing it are needed. The goal is to improve information retrieval, information extraction and natural language processing. One of the most complicated of these processes is text mining, which deals with extracting high-quality information from text. This book presents a statistical part-of-speech tagging model for Albanian. Training, testing and evaluation are carried out with the Apache OpenNLP toolkit. Tagging is performed with both a basic and a large tagset. The experiments use a tagger model trained on a corpus of standard Albanian text written by Albanian authors. The model is tested using cross-validation and a sample text. The results show that the accuracy of the trained tagger model was about 70% in real testing environments, and up to 98% when the environment settings were optimized for the best accuracy. The overall accuracy of the model was also found to depend on the number of training tokens, the level of grammatical and morphological complexity of the text, and special cases in language expressions.
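The evaluation idea mentioned in the description, token-level accuracy estimated by cross-validation, can be illustrated with a minimal Python sketch. This is not the book's method and not the Apache OpenNLP tagger: it swaps in a simple most-frequent-tag baseline, and the function names (`train_baseline`, `token_accuracy`, `cross_validate`), the tag labels and the toy corpus are all hypothetical, chosen only for illustration.

```python
from collections import Counter, defaultdict


def train_baseline(sentences):
    """Learn the most frequent tag per word plus a global fallback tag.

    Assumption: a most-frequent-tag baseline, NOT the OpenNLP model.
    `sentences` is a non-empty list of sentences, each a list of
    (word, tag) pairs.
    """
    per_word = defaultdict(Counter)
    all_tags = Counter()
    for sentence in sentences:
        for word, tag in sentence:
            per_word[word.lower()][tag] += 1
            all_tags[tag] += 1
    fallback = all_tags.most_common(1)[0][0]  # used for unseen words
    lexicon = {w: c.most_common(1)[0][0] for w, c in per_word.items()}
    return lexicon, fallback


def token_accuracy(model, sentences):
    """Fraction of tokens whose predicted tag equals the gold tag."""
    lexicon, fallback = model
    correct = total = 0
    for sentence in sentences:
        for word, gold in sentence:
            predicted = lexicon.get(word.lower(), fallback)
            correct += predicted == gold
            total += 1
    return correct / total if total else 0.0


def cross_validate(sentences, k):
    """Average token accuracy over k folds (every k-th sentence is held out)."""
    if not 2 <= k <= len(sentences):
        raise ValueError("k must be between 2 and the number of sentences")
    scores = []
    for i in range(k):
        test = sentences[i::k]
        train = [s for j, s in enumerate(sentences) if j % k != i]
        scores.append(token_accuracy(train_baseline(train), test))
    return sum(scores) / k


if __name__ == "__main__":
    # Hypothetical toy corpus; not taken from the book's Albanian corpus.
    corpus = [
        [("the", "DET"), ("dog", "NOUN"), ("runs", "VERB")],
        [("the", "DET"), ("cat", "NOUN"), ("sleeps", "VERB")],
        [("a", "DET"), ("dog", "NOUN"), ("sleeps", "VERB")],
        [("a", "DET"), ("cat", "NOUN"), ("runs", "VERB")],
    ]
    print(f"mean accuracy over 2 folds: {cross_validate(corpus, 2):.2f}")
```

Even in this toy setting, words that never appear in the training folds fall back to a single default tag, which is one way the number of training tokens can limit accuracy.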
About the author
The author graduated in 2009 from the Polytechnic University in Tirana with a degree in Electronic Engineering. He completed a Master of Science in Computer Science at the University of New York Tirana & University of Greenwich in 2011. He works at the Performance & Quality Directorate of Albtelecom & EagleMobile, the biggest telecommunications company in Albania.