Language Corpora Annotation and Processing

Fotogalerie

Zur Bildergalerie

Niladri Sekhar Dash

Language Corpora Annotation and Processing

Gebundenes Buch

Jetzt bewerten Jetzt bewerten

This book addresses the research, analysis, and description of the methods and processes that are used in the annotation and processing of language corpora in advanced, semi-advanced, and non-advanced languages. It provides the background information and empirical data needed to understand the nature and depth of problems related to corpus annotation and text processing and shows readers how the linguistic elements found in texts are analyzed and applied to develop language technology systems and devices. As such, it offers valuable insights for researchers, educators, and students of linguistics and language technology.…mehr

Andere Kunden interessierten sich auch für

Niladri Sekhar Dash
Language Corpora Annotation and Processing

119,99 €
Kristina Barabashova
Detection of Sigmatism with the aid of Machine Learning

24,99 €
Natural Language Processing Using Very Large Corpora

112,99 €
S. Armstrong / Kenneth W. Church / Pierre Isabelle / Sandra Manzi / Evelyne Tzoukermann / David Yarowsky (Hgg.)
Natural Language Processing Using Very Large Corpora

115,99 €
Language in Cognition and Affect

75,99 €
Cross-linguistic Influences in Multilingual Language Acquisition

112,99 €
Stefano Rastelli
Plain Language

174,99 €

Produktbeschreibung

Produktdetails

Produktdetails
Verlag: Springer / Springer Nature Singapore / Springer, Berlin
Artikelnr. des Verlages: 978-981-16-2959-4
1st edition 2021
Seitenzahl: 304
Erscheinungstermin: 8. Juli 2021
Englisch
Abmessung: 241mm x 160mm x 22mm
Gewicht: 623g
ISBN-13: 9789811629594
ISBN-10: 9811629595
Artikelnr.: 61767745

Herstellerkennzeichnung
Springer-Verlag GmbH
Tiergartenstr. 17
69121 Heidelberg
ProductSafety@springernature.com

Produktdetails

Verlag: Springer / Springer Nature Singapore / Springer, Berlin
Artikelnr. des Verlages: 978-981-16-2959-4
1st edition 2021
Seitenzahl: 304
Erscheinungstermin: 8. Juli 2021
Englisch
Abmessung: 241mm x 160mm x 22mm
Gewicht: 623g
ISBN-13: 9789811629594
ISBN-10: 9811629595
Artikelnr.: 61767745

Herstellerkennzeichnung
Springer-Verlag GmbH
Tiergartenstr. 17
69121 Heidelberg
ProductSafety@springernature.com

Autorenporträt

Dr. Niladri Sekhar Dash is Professor and Head, Linguistic Research Unit, Indian Statistical Institute, Kolkata (The Institute of National Importance, Govt. of India). For the last 28 years, he is working in corpus linguistics, language technology, computational lexicography, computer-assisted language teaching, language documentation, translation, clinical linguistics, and digital ethnography. To his credit, he has published 18 research monographs and more than 285 research papers in indexed and peer-reviewed research journals, anthologies, and conference proceedings. As an invited speaker, he has delivered lectures at more than 50 universities and institutes in India and abroad. He acts as a Research Advisor for several multinational organizations that work on language technology, artificial intelligence, lexicography, digital humanities, and language resource development. He acts as Principal Investigator for several LangTech projects funded by the Govt. of India and corporate houses. He is the Chief Editor of the Journal of Advanced Linguistic Studies-a reviewed international journal of linguistics. He is an Editorial Board Member for several international journals. He is also a member of several linguistic associations across the world. He is a British Academy International Visiting Fellow (2018), Visiting Research Fellow of School of Psychology & Clinical Language Sciences, University of Reading, UK (2018-2021), and Visiting Scholar of Language and Brain Laboratory, University of Oxford, UK (2019). At present, he is heading 5 projects: (a) 'Upgradation of Bengali WordNet' funded by the Ministry of Statistics and Programme Implementation (MoSPI), Govt. of India; (b) 'Sound Imitative Words in Bengali" in collaboration with the Dept. of British and American Studies, Faculty of Arts, P.J. afárik University, Slovakia; (c) 'Bilingual Dementia of Patients with Broca's Aphasia' in collaboration with the School of Psychology and Clinical Language Sciences, University of Reading, UK; (d) 'Public Announcement System at Airports and Railway Stations in Indian Sign Language with Animation' in a consortium-mode project headed by the Dept. of Computer Science, Punjabi University, Patiala, India, and (e) 'Dictionary for Sabar Speech Community' - an endangered tribe of West Bengal, India.

Inhaltsangabe

Introduction.- Chapter 1. Corpora Annotation: Definition and Types.- Chapter 2. Maxims, Principles, & Rules of Text Annotation.- Chapter 3. Extratextual Documentative Annotation.- Chapter 4. Etymological Annotation.- Chapter 5. Concordance, KWIC, LWG and Collocation.- Chapter 6. Morphological Processing of Words.- Chapter 7. Part-of-Speech Tagging.- Chapter 8. Lemmatization of Inflected Nouns.- Chapter 9. Decomposition of Inflected Verbs.- Chapter 10. Parsing Sentences in a Text.

Inhaltsangabe

Rezensionen

"The present book is written to make issues of text annotation and processing useful to those people who can reap a good harvest from the areas related to text annotation and processing. ... Readers will get better insight into the linguistic challenges involved in text annotation and processing. ... People working in different areas of linguistics, information technology, machine learning, data processing, grammar content development, dictionary compilation, translation, corpus linguistics, and others will be directly benefited by this book." (Selvaraj Arulmozi, Sociolinguistic Studies, Vol. 17 (1-3), 2023)