JEP poster session - P6

Thursday 12 June, 10:30-12:30

Paper 1557: L'évolution du phonétisme arabe et la résistance coarticulatoire

Mohamed Embarki (Praxiling UMR5267 CNRS-Montpellier III)

Abstract: This study draws on the articulatory descriptions of Classical Arabic (CA) consonants given by the early Arab grammarians (8th-12th centuries). Four consonants were selected from the Arabic inventory: three are described in the commentary of Al-Khalil (died 786) as post-palatal, and the fourth as pre-palatal. The stability of the four graphemes makes it possible to compare their articulation in CA and in Modern Arabic (MA). The results show that the phonetic values of these consonants have evolved: all four articulations have been fronted, shifting from a back to a more front place of articulation. This fronting is undoubtedly linked to better control of the main articulator, the tongue, and to good optimisation of the coarticulatory process.


Paper 1627: Bilan et perspectives de quinze ans d'évaluation vocale par méthodes instrumentales et perceptives

Alain Ghio (Laboratoire Parole et Langage, Aix-Marseille Université)

Antoine Giovanni (Laboratoire Parole et Langage, Aix-Marseille Université)

Bernard Teston (Laboratoire Parole et Langage, Aix-Marseille Université)

Joana Révis (Laboratoire Parole et Langage, Aix-Marseille Université)

Ping Yu (Laboratoire Parole et Langage, Aix-Marseille Université)

Abstract: For fifteen years, we have developed and studied techniques and methodologies for assessing voice quality in a clinical context. This paper presents recent results obtained with complementary approaches. A total of 449 speakers (including 391 dysphonic patients) took part in an experiment in which voice quality was evaluated by (1) perceptual assessment performed by a jury and (2) instrumental assessment using acoustic and aerodynamic data. A combination of 7 instrumental measures classified 82% of the voice samples in the same grade as the jury. We review the methodological situation and discuss some theoretical aspects that are often forgotten in the race for performance.


Paper 1565: Unités Prosodiques et Grammaire de l'Intonation : vers une nouvelle approche

Elisabeth Delais-Roussarie (CNRS, UMR 7110 / Laboratoire de Linguistique Formelle, Paris 7)

Brechtje Post (RCEAL, University of Cambridge)

Abstract: Most work on French uses two levels of prosodic structure: the accentual group (GA, groupe accentuel) and the intonational group (GI, groupe intonatif). While there is fairly broad agreement on the definition and realisation of the GA, this is not the case for the GI. This contribution focuses on the definition and realisation of GIs. Our proposal differs from earlier work in two respects. First, we distinguish two types of GI on the basis of (i) the inventory of melodic contours realised at their right boundary and (ii) their relations with syntax and semantics. Second, these constituents are built at an underlying phonological level; whether or not their boundaries are realised by pauses or major melodic movements depends on choices made at other levels (such as performance).


Paper 1598: Transformation de la prosodie par adaptation MLLR de GMM

Damien Lolive (IRISA / Université de Rennes 1)

Nelly Barbot (IRISA / Université de Rennes 1)

Olivier Boëffard (IRISA / Université de Rennes 1)

Abstract: In voice transformation, transforming prosody from parallel corpora is rather unrealistic, since such corpora are difficult and expensive to build. Based on this observation, we propose an approach that transforms prosody using non-parallel corpora through the MLLR adaptation strategy. The method is applied to the joint transformation of duration and F0 at the syllable level. The source data are modelled by a GMM, which is adapted to the target by applying a linear transformation to the mean vectors of the Gaussian mixture. The method is used to convert duration and F0 between two French speakers and is evaluated by cross-validation between the models and the test datasets.
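
As an illustration of the mean-vector transformation mentioned in this abstract, the sketch below applies one shared affine transform to every mean of a two-dimensional GMM (syllable duration, F0). It is a minimal sketch under stated assumptions, not the authors' implementation: the function name `adapt_means`, the matrix `A`, the bias `b` and all numeric values are hypothetical, and the transform is given by hand here, whereas MLLR would estimate it from target data by maximum likelihood.

```python
# Hypothetical sketch: MLLR-style adaptation moves only the GMM means,
# mu_k' = A * mu_k + b; weights and covariances are left unchanged.

def adapt_means(means, A, b):
    """Apply the shared affine transform (A, b) to every mean vector."""
    adapted = []
    for mu in means:
        new_mu = [
            sum(a_ij * m_j for a_ij, m_j in zip(row, mu)) + b_i
            for row, b_i in zip(A, b)
        ]
        adapted.append(new_mu)
    return adapted

# Illustrative values only: each mean is [syllable duration (s), F0 (Hz)].
source_means = [[0.25, 100.0], [0.5, 200.0]]
A = [[0.5, 0.0], [0.0, 1.5]]   # assumed transform matrix (not estimated)
b = [0.0, 10.0]                # assumed bias vector (not estimated)

target_means = adapt_means(source_means, A, b)
```

Because a single transform is shared by all mixture components, far fewer parameters have to be estimated than if each mean were re-trained separately, which is why this kind of adaptation does not need aligned source and target utterances.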


Paper 1602: Identification de l'origine des locuteurs non natifs en utilisant des paramètres prosodiques

Marina Piat (LORIA)

Dominique Fohr (LORIA)

Irina Illina (LORIA)

Abstract: In this paper we propose an automatic approach to foreign accent identification. Knowing a speaker's origin could make it possible to adapt the acoustic models used for non-native speech recognition. In this study, we use a statistical approach based on prosodic parameters. It relies on the fact that prosody differs between languages, and therefore between accents. This work was carried out within the HIWIRE (Human Input that Works In Real Environment) European project. The corpus consists of English sentences spoken by French, Italian, Greek and Spanish speakers. The results obtained with duration and energy are promising for foreign accent identification: 68.6% with energy and 67.1% with duration. Combined with MFCCs, these two parameters achieve an 87.1% correct foreign accent identification rate.


Paper 1609: L'association avec le focus en question : seulement et son associé

Cristel Portes (Laboratoire Parole & Langage (CNRS & Université de Provence))

Jean-Marie Marandin (Laboratoire de linguistique formelle (CNRS & Université de Paris 7))

Claire Beyssade (Institut Jean Nicod (CNRS-EHESS-ENS))

Abstract: This paper presents the prosodic constraints on the associate of the French restrictive adverb "seulement" ("only"). The associate carries two types of marking: (i) a terminal or non-terminal nuclear accent on its right boundary (which does not specify whether the associate is broad or narrow) and (ii) prosodic highlighting when the associate is narrow. The marking of association and the marking of informational focus are distinct phenomena in French.


Paper 1645: Découpage prosodique sur différents types de segmentations phonémiques

Natalia Segal (France Télécom R&D)

Katarina Bartkova (France Télécom R&D)

Abstract: This paper assesses prosodic boundary detection algorithms, which represent prosodic structure in the form of trees, on different types of phonemic segmentation. Two types of algorithm were studied: the first uses linguistic and prosodic information, the second only prosodic information [5, 6]. The algorithms were applied to different kinds of phonemic segmentation in order to determine the limits of their applicability to various automatic speech processing tasks. We analyzed how performance degrades with phonetic segmentation quality, comparing manually verified segmentation, automatic alignment and phonemic decoding using a phoneme trigram. We also evaluated and compared the two algorithms on a larger spontaneous speech database with automatic alignment.


Paper 1665: Un modèle de durée des syllabes fondé sur les propriétés syllabiques intrinsèques et les variations locales de débit

Nicolas Obin (IRCAM)

Xavier Rodet (IRCAM)

Anne Lacheret-Dujour (Université Paris X et IUF)

Abstract: Local speech rate is an emerging topic in prosody research. This paper introduces a syllable duration model based on intrinsic syllable duration properties and local speech rate variations. The proposed model is compared with the observed syllable durations and with a standard model in which durations are normalized by local speech rate only. The comparison shows that our model is (i) robust: it significantly reduces the observed dispersion of syllable durations, and (ii) consistent: this reduction comes with a reduction of the duration dispersion due to intrinsic syllable properties as well as to prominence phenomena.


Paper 1684: Utilisation des grammaires probabilistes dans les tâches de segmentation et d'annotation prosodique

Irina Nesterenko (Université Blaise Pascal, Clermont-Ferrand II)

Stéphane Rauzy (Laboratoire Parole et Langage, Université de Provence)

Abstract: The aim of our study is to model how probabilistic information in the tonal space can be used to segment the speech continuum, both by humans and by algorithms for semi-automatic corpus annotation. We also test whether implementing a minimal hierarchical structure improves the algorithm's performance. We rely on the mathematical framework of probabilistic grammars, and we describe and evaluate the steps involved in building the probabilistic models and testing their predictions.
