Statistical Properties of Turkish Words

Contemporary Printed Turkish Word Characteristics and Smoothing Techniques

Gökhan Dalkilic

Paperback

Other customers were also interested in

Fer T. Örücü
Characteristics of Contemporary Printed Turkish

33,99 €
Antara Chowdhury
Classification of Isolated Assamese Words by Male and Female Speakers

27,99 €
Prashant K. Gupta
COMPUTING WITH WORDS

65,99 €
Anuj Sharma
A Report on Neural Network working in Words Images Understanding

27,99 €
Ahmet Arslan
Evaluation of Turkish Text Information Retrieval

25,99 €
Zeliha Görmez
A CONCATENATIVE TURKISH TEXT-TO-SPEECH SYSTEM

33,99 €
Mehdi Rohaninezhad
Ontological Development For Computing With Words Based Systems

33,99 €

Product description

Determining the structural properties of a natural language is essential for applications such as speech recognition and OCR. These properties can be analyzed in two ways: morphologically and statistically. Statistical analysis requires a corpus that is a representative sample of the language. The word n-gram frequencies of that corpus can be computed with suitable algorithms, and missing n-grams can be estimated with smoothing techniques. In this study, a corpus named TurCo was created in order to apply smoothing techniques to Turkish and compare them. Different algorithms for computing word n-grams were tested, and the characteristics of the resulting n-gram word lists were analyzed. For generalization, Zipf's law was applied; to improve its accuracy, Mandelbrot's law was applied by finding the appropriate Mandelbrot constants. Because the corpus could not be large enough to represent the whole language, smoothing techniques were used to estimate the unseen word n-grams. This study can help professionals working on speech recognition, cryptanalysis, and author recognition in Turkish.
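
As a rough illustration of the ideas named in the description, the sketch below counts word bigrams in a toy token list, estimates an unseen bigram with add-one (Laplace) smoothing, and evaluates the Zipf-Mandelbrot rank-frequency form f(r) = c / (r + b)^a. It is a minimal sketch under stated assumptions: the token list is invented and not taken from TurCo, the function names are hypothetical, and add-one smoothing is used only because it is the simplest technique, not because the book is known to use it.

```python
from collections import Counter


def ngrams(tokens, n):
    """Return the word n-grams (as tuples) found in a token sequence."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def laplace_bigram_prob(w1, w2, unigram_counts, bigram_counts, vocab_size):
    """Add-one estimate of P(w2 | w1); unseen bigrams get a nonzero value.

    Simplification: the unigram count of w1 stands in for its context count.
    """
    return (bigram_counts[(w1, w2)] + 1) / (unigram_counts[w1] + vocab_size)


def zipf_mandelbrot(rank, c, b, a):
    """Predicted frequency at a rank: f(r) = c / (r + b) ** a.

    With b = 0 and a = 1 this reduces to the plain Zipf form c / r.
    """
    return c / (rank + b) ** a


# Toy token list standing in for a real corpus (an assumption for illustration).
tokens = "bir elma bir armut bir elma".split()
unigram_counts = Counter(tokens)
bigram_counts = Counter(ngrams(tokens, 2))
vocab_size = len(unigram_counts)

# A bigram that occurs in the toy data and one that never occurs.
seen = laplace_bigram_prob("bir", "elma", unigram_counts, bigram_counts, vocab_size)
unseen = laplace_bigram_prob("elma", "elma", unigram_counts, bigram_counts, vocab_size)
print(seen, unseen)
```

In practice the constants c, b, and a would be fitted to the observed rank-frequency list of a corpus, and add-one smoothing would usually be replaced by a more refined technique, since it shifts too much probability mass to unseen n-grams when the vocabulary is large.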

Product details
Publisher: LAP Lambert Academic Publishing
Pages: 140
Publication date: 17 March 2010
Language: English
Dimensions: 220mm x 150mm x 9mm
Weight: 204g
ISBN-13: 9783838351582
ISBN-10: 3838351584
Item no.: 29430487

Manufacturer information
Books on Demand GmbH
In de Tarpen 42
22848 Norderstedt
info@bod.de
040 53433511

About the authors

Feriştah Örücü: She holds B.S. and M.S. degrees in Computer Engineering from DEU, Turkey. She is a Ph.D. student and a research assistant in the Department of Computer Engineering at DEU. Gökhan Dalkılıç: He holds M.S. degrees in Computer Science from USC and from Ege Univ. CI, and a Ph.D. in Computer Engineering from DEU. He was an assistant professor in the Department of Computer Engineering at DEU.