Text, Speech, and Dialogue

27th International Conference, TSD 2024, Brno, Czech Republic, September 9-13, 2024, Proceedings, Part II
Herausgegeben:Nöth, Elmar; Horák, Ales; Sojka, Petr

Fotogalerie

Text, Speech, and Dialogue

27th International Conference, TSD 2024, Brno, Czech Republic, September 9-13, 2024, Proceedings, Part II
Herausgegeben:Nöth, Elmar; Horák, Ales; Sojka, Petr

Broschiertes Buch

Jetzt bewerten Jetzt bewerten

Weitere Ausgabe:
eBook, PDF

The two-volume set LNAI 15048 and 15049 constitutes the refereed proceedings of the 27th International Conference on Text, Speech, and Dialogue, TSD 2024, held in Brno, Czech Republic, during September 9-13, 2024. The 50 revised full papers presented in these deadline proceedings were carefully reviewed and selected from 103 submissions.
The papers are organized in the following topical sections:
Part I: Text
Part II: Speech, Dialogue

Inhaltsangabe

Andere Kunden interessierten sich auch für

Text, Speech, and Dialogue

44,99 €
Text, Speech, and Dialogue

38,99 €
Worldwide Language Service Infrastructure

38,99 €
Intelligent Information and Database Systems

50,99 €
Intelligent Information and Database Systems

50,99 €
Artificial Intelligence Applications and Innovations

75,99 €
Artificial Intelligence Applications and Innovations

75,99 €

Produktbeschreibung

The two-volume set LNAI 15048 and 15049 constitutes the refereed proceedings of the 27th International Conference on Text, Speech, and Dialogue, TSD 2024, held in Brno, Czech Republic, during September 9-13, 2024.
The 50 revised full papers presented in these deadline proceedings were carefully reviewed and selected from 103 submissions.

The papers are organized in the following topical sections:

Part I: Text

Part II: Speech, Dialogue

Produktdetails

Produktdetails
Lecture Notes in Computer Science 15049
Verlag: Springer / Springer Nature Switzerland / Springer, Berlin
Artikelnr. des Verlages: 978-3-031-70565-6
2024
Seitenzahl: 344
Erscheinungstermin: 1. September 2024
Englisch
Abmessung: 235mm x 155mm x 19mm
Gewicht: 523g
ISBN-13: 9783031705656
ISBN-10: 3031705653
Artikelnr.: 71269944

Herstellerkennzeichnung

Produktdetails

Lecture Notes in Computer Science 15049
Verlag: Springer / Springer Nature Switzerland / Springer, Berlin
Artikelnr. des Verlages: 978-3-031-70565-6
2024
Seitenzahl: 344
Erscheinungstermin: 1. September 2024
Englisch
Abmessung: 235mm x 155mm x 19mm
Gewicht: 523g
ISBN-13: 9783031705656
ISBN-10: 3031705653
Artikelnr.: 71269944

Herstellerkennzeichnung

Inhaltsangabe

.- Speech.
.- Retrieval Augmented Spoken Language Generation for Transport Domain.
.- Adapting Audiovisual Speech Synthesis to Estonian.
.- Dysphonia Diagnosis Using Self-Supervised Speech Models in Mono- and Cross-Lingual Settings.
.- Sentences vs Phrases in Neural Speech Synthesis.
.- Zero-Shot vs. Few-Shot Multi-Speaker TTS Using Pre-trained Czech SpeechT5 Model.
.- Deep Speaker Embeddings for Speaker Verification of Children.
.- Improved Alignment for Score Combination of RNN-T and CTC Decoder for Online Decoding.
.- Attention to Phonetics: A Visually Informed Explanation of Speech Transformers.
.- Effects of Training Strategies and the Amount of Speech Data on the Quality of Speech Synthesis.
.- Stream-Based Active Learning for Speech Emotion Recognition via Hybrid Data Selection and Continuous Learning.
.- Data Alignment and Duration Modelling in VITS.
.- Multiword Expressions Resources for Italian: Presenting a Manually Annotated Spoken Corpus.
.- Generating High-Quality F0 Embeddings Using the Vector-Quantized Variational Autoencoder.
.- Anonymizing Dysarthric Speech: Investigating the Effects of Voice Conversion on Pathological Information Preservation.
.- X-vector-based Speaker Diarization Using Bi-LSTM and Interim Voting-driven Post-processing.
.- A Paradigm for Interpreting Metrics and Measuring Error Severity in Automatic Speech Recognition.
.- Enhancing Speech Emotion Recognition Using Transfer Learning From Speaker Embeddings.
.- Dialogue.
.- Investigating Low-Cost LLM Annotation for Spoken Dialogue Understanding Datasets.
.- PiCo-VITS: Leveraging Pitch Contours for Fine-grained Emotional Speech Synthesis.
.- Improving and Understanding Clarifying Question Generation in Conversational Search.
.- Explainable Multimodal Fusion for Dementia Detection From Text and Speech.
.- Robust Classification of Parkinson's Speech: an Approximation to a Scenario With Non-controlled Acoustic Conditions.
.- Leveraging Conceptual Similarities to Enhance Modeling of Factors Affecting Adolescents' Well-Being.
.- Joint-Average Mean and Variance Feature Matching (JAMVFM) Semi-supervised GAN with Additional-Objective Training Function for Intent Detection.
.- Capturing Task-Related Information for Text-Based Grasp Classification Using Fine-Tuned Embeddings.
.- StepDP: A Step Towards Expressive and Pervasive Dialogue Platforms .
.- Automatic Classification of Parkinson's Disease Using Wav2vec Embeddings at Phoneme, Syllable, and Word Levels.

Inhaltsangabe

Text, Speech, and Dialogue

27th International Conference, TSD 2024, Brno, Czech Republic, September 9-13, 2024, Proceedings, Part IIHerausgegeben:Nöth, Elmar; Horák, Ales; Sojka, Petr

Text, Speech, and Dialogue

27th International Conference, TSD 2024, Brno, Czech Republic, September 9-13, 2024, Proceedings, Part IIHerausgegeben:Nöth, Elmar; Horák, Ales; Sojka, Petr

27th International Conference, TSD 2024, Brno, Czech Republic, September 9-13, 2024, Proceedings, Part II
Herausgegeben:Nöth, Elmar; Horák, Ales; Sojka, Petr

27th International Conference, TSD 2024, Brno, Czech Republic, September 9-13, 2024, Proceedings, Part II
Herausgegeben:Nöth, Elmar; Horák, Ales; Sojka, Petr