automatic speech processing - latin american spanish varieties - consonant lenition - multi-dialectal pronunciation modeling - dialect-specific extended dataset https://ertim.inalco.fr/ fr From large-scale phonetic studies to speech recognition of Spanish varieties https://ertim.inalco.fr/node/604 <span class="field field--name-title field--type-string field--label-hidden">From large-scale phonetic studies to speech recognition of Spanish varieties</span> <span class="field field--name-uid field--type-entity-reference field--label-hidden"><span>Anonyme (non vérifié)</span></span> <span class="field field--name-created field--type-created field--label-hidden">ven 06/11/2020 - 00:00</span> <div class="field field--name-field-annee field--type-integer field--label-above"> <div class="field__label">Année</div> <div class="field__item">2017</div> </div> <div class="field field--name-field-abstract field--type-string-long field--label-above"> <div class="field__label">Résumé</div> <div class="field__item">Dialectal variation represents a major challenge for automatic speech procesing. The purpose of this research is to improve the performance of a broadcast news transcription system for Latin American Spanish. Automatic speech processing tools were employed to estimate the impact of intervocalic /b/ /d/ /g/ and coda /s/ lenition across Spanish dialects. These findings have been applied to the acoustic model training together with modifications of both the phonemic inventory and lexicon. The effect of dialect-specific extended train data was also studied. Two acoustic model training configurations were developed: an initial set with Peninsular data exclusively and an extended dataset adding Latin American data. The best performing model for Latin American speech includes expert corrections, consonant merge and lenition with the extended dataset. This model obtains 7% relative gain in WER for Latin American data and remains robust to other Spanish dialects.</div> </div> <div class="field field--name-field-tags field--type-entity-reference field--label-above"> <div class="field__label">Mots-clés</div> <div class="field__items"> <div class="field__item"><a href="/taxonomy/term/2440" hreflang="fr">automatic speech processing - latin american spanish varieties - consonant lenition - multi-dialectal pronunciation modeling - dialect-specific extended dataset</a></div> </div> </div> <div class="field field--name-field-document field--type-file field--label-above"> <div class="field__label">Fichier</div> <div class="field__item"> <span class="file file--mime-application-pdf file--application-pdf"> <a href="/sites/default/files/hernandez-nidia-memoire-resume.v3.pdf" type="application/pdf">hernandez-nidia-memoire-resume.v3.pdf</a></span> </div> </div> Thu, 05 Nov 2020 23:00:00 +0000 Anonyme 604 at https://ertim.inalco.fr