JEP poster session - P3

Tuesday 10 June - 14:00 to 16:00

Paper 1564 Bilingual and multimodal language practices of young deaf adults

Agnès Millet (Université Stendhal - Grenoble 3)

Isabelle Esteve (Université Stendhal - Grenoble 3)

Abstract: This paper describes how, in everyday communication, deaf 20-year-olds use two languages (LSF and French) and two modalities (voice and gesture). Individual and situational variation shows that speakers are constantly adapting. Language combinations are analyzed qualitatively.

article

Paper 1573 Analysis of tip-of-the-tongue words from ages 19 to 79

Christelle Gillioz (Université de Lausanne)

Brigitte Zellner-Keller (Université de Lausanne)

Abstract: It is commonly reported that lexical access becomes less efficient with aging and that most tip-of-the-tongue states (TOTs) involve proper names. In this paper, we analyse questionnaire data from 55 French-speaking subjects aged between 19 and 79 who reported TOTs from their daily lives over two weeks. The data suggest that the increase in TOTs is not linearly related to age, that common nouns are good candidates for TOTs, and that men and women do not experience TOTs in the same contexts and do not behave in entirely the same way.

article

Paper 1617 Prosody in infants at the pre-linguistic stage: first stable forms

Christelle Dodane (Laboratoire Dipralang, Université Paul Valéry Montpellier 3)

Karine Martel (Laboratoire PALM, Université de Caen Basse-Normandie)

Abstract: This study examines the role of prosody in children's pre-linguistic development. It is based on a case study of two monolingual French infants who were followed longitudinally and observed in spontaneous interaction with their parents. The children were video- and audio-recorded between the ages of 0;10.00 and 1;00.00. Each utterance was coded according to its melodic contours and its interaction type ("mono" vs "dialo"). The data show similarities between the two subjects in the "mono" condition, with a shared inventory of intonation contours. In the second context ("dialo"), the children have different profiles. Selecting and using prosodic features to serve interaction appears to require more time.

article

Paper 1589 The locus equation as an index of coarticulation in the articulation of voiceless stops in children with cleft palate

Marion Bechet (Institut de Phonétique de Strasbourg)

Fabrice Hirsch (Institut de Phonétique de Strasbourg)

Véronique Ferbach-Hecker (Institut de Phonétique de Strasbourg)

Béatrice Vaxelaire (Institut de Phonétique de Strasbourg)

Rudolph Sock (Institut de Phonétique de Strasbourg)

Abstract: The aim of the present work is to examine coarticulatory strategies in children with a cleft palate, in comparison with those used by unimpaired children serving as control subjects. Locus equations were measured in order to quantify the degree of coarticulation for 4 children: 2 pathological children and 2 non-pathological ones. Results show that regression slope values are lower for children with a cleft palate. This finding suggests that the degree of coarticulation is less pronounced in children with a cleft palate, even though they may use the same strategies as non-pathological children. Furthermore, it is shown that variability (measured using R²) is high in all the children's productions, and higher still in the pathological subjects.

article
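
As an illustration of the measure named in the abstract of paper 1589 above, a locus equation is commonly fitted as a straight-line regression of F2 at vowel onset on F2 at the vowel midpoint, with the slope read as an index of coarticulation and R² as the goodness of fit. The sketch below is a generic least-squares fit, not the authors' procedure; the function name and the formant values are invented for the example.

```python
def locus_equation(f2_mid, f2_onset):
    """Fit f2_onset = slope * f2_mid + intercept by ordinary least squares.

    Returns (slope, intercept, r_squared). Inputs are F2 values in Hz,
    one pair per consonant-vowel token.
    """
    n = len(f2_mid)
    mean_x = sum(f2_mid) / n
    mean_y = sum(f2_onset) / n
    sxx = sum((x - mean_x) ** 2 for x in f2_mid)
    syy = sum((y - mean_y) ** 2 for y in f2_onset)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(f2_mid, f2_onset))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    r_squared = (sxy ** 2) / (sxx * syy)
    return slope, intercept, r_squared


# Made-up, perfectly linear values (Hz), chosen only to show the call.
f2_mid = [1000.0, 1500.0, 2000.0, 2500.0]
f2_onset = [1300.0, 1600.0, 1900.0, 2200.0]
slope, intercept, r_squared = locus_equation(f2_mid, f2_onset)
print(slope, intercept, r_squared)
```

A slope closer to 1 means F2 at onset follows the vowel more closely; a slope closer to 0 means the onset stays near a fixed locus.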

Paper 1683 Temporal processing of voicing and place-of-articulation features: a comparative study of dyslexic and normal-reading adults

Caroline Jacquier (UMR 5596 CNRS - Université Lumière Lyon 2)

Fanny Meunier (UMR 5596 CNRS - Université Lumière Lyon 2)

Abstract: The hypothesis of a general auditory deficit in temporal processing is one of the main views still debated to explain the nature and origin of dyslexia. In our study, we investigated auditory temporal processing in expert readers and in dyslexic adults. By time-compressing rapid acoustic features, we explored their ability to extract and analyse these cues (Experiment 1: voice onset time; Experiment 2: second formant transition). Compared with controls, dyslexics exhibit a deficit in temporal processing, and the impairment is stronger for voicing than for formant transition. Distinct temporal processing of the two acoustic features, as well as compensation mechanisms, were observed in dyslexics.

article

Paper 1674 Analysis of the production of a deaf LPC cuer

Pablo Sacher (Grenoble Images Parole Signal Automatique - Dpt. Parole et Cognition)

Denis Beautemps (Grenoble Images Parole Signal Automatique - Dpt. Parole et Cognition)

Marie-Agnès Cathiard (Centre de Recherche sur l'Imaginaire - Université Stendhal Grenoble III)

Noureddine Aboutabit (Grenoble Images Parole Signal Automatique - Dpt. Parole et Cognition)

Abstract: This article focuses on the analysis of the production of LPC (Langue française Parlée Complétée, the French version of Cued Speech) by a deaf person. Analyses of this kind are rare for normal-hearing cuers and almost nonexistent for code production by a deaf person. By analysis we mean the temporal coordination of the labial and manual gestures of LPC. We will see that the anticipation of the manual gesture over the labial gesture (discovered by Attina et al. (2004)) is preserved for this cuer.

article

Paper 1603 Combining different acoustic feature sets for speech recognition

Loïc Barrault (Laboratoire Informatique d'Avignon)

Driss Matrouf (Laboratoire Informatique d'Avignon)

Georges Linarès (Laboratoire Informatique d'Avignon)

Renato De-Mori (Laboratoire Informatique d'Avignon)

Abstract: With the purpose of improving the performance of Automatic Speech Recognition (ASR) systems, many approaches to combining them have been widely studied. In this paper, a combination of state a posteriori probabilities given by different feature sets is proposed. In order to perform a coherent combination of state posterior probabilities, the acoustic models trained on different feature sets must have the same topology (i.e. the same set of states). For this purpose, a fast and efficient twin model training protocol is proposed. Two different strategies for combining probabilities are presented: linear and log-linear interpolation. Using log-linear interpolation, relative Word Error Rate (WER) reductions of about 15% and 14% were observed on the MEDIA and ESTER corpora, respectively.

article
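
The two combination strategies named in the abstract of paper 1603 above can be sketched generically as follows: linear interpolation is a weighted arithmetic mean of two posterior distributions over the same set of states, and log-linear interpolation is a weighted geometric mean followed by renormalization. This is a minimal sketch under assumptions, not the authors' implementation: the function names, the single `weight` parameter, and the `eps` floor are all hypothetical, and the paper's actual weights are not given in the abstract.

```python
import math


def linear_interpolation(p_a, p_b, weight=0.5):
    """Weighted arithmetic mean of two posteriors over the same states."""
    return [weight * a + (1.0 - weight) * b for a, b in zip(p_a, p_b)]


def log_linear_interpolation(p_a, p_b, weight=0.5, eps=1e-12):
    """Weighted geometric mean of two posteriors, renormalized to sum to 1.

    eps is an assumed floor that avoids log(0) for zero-probability states.
    """
    log_p = [
        weight * math.log(a + eps) + (1.0 - weight) * math.log(b + eps)
        for a, b in zip(p_a, p_b)
    ]
    peak = max(log_p)  # subtract the maximum for numerical stability
    unnormalized = [math.exp(v - peak) for v in log_p]
    total = sum(unnormalized)
    return [u / total for u in unnormalized]


# Invented posteriors over three shared states, for illustration only.
post_a = [0.7, 0.2, 0.1]
post_b = [0.5, 0.4, 0.1]
lin = linear_interpolation(post_a, post_b)
loglin = log_linear_interpolation(post_a, post_b)
print(lin, loglin)
```

Both functions require the two posterior vectors to be indexed by the same states, which is why the abstract insists on models with identical topology.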

Paper 1622 System combination by driven decoding

Benjamin Lecouteux (LIA, Avignon)

Georges Linarès (LIA, Avignon)

Yannick Estève (LIUM, Le Mans)

Guillaume Gravier (IRISA, Rennes)

Abstract: In this paper, we propose an integrated approach to system combination named the Driven Decoding Algorithm (DDA). It consists of guiding the search algorithm of a primary ASR system with the outputs of an auxiliary system. We first evaluate this method in a simple configuration in which the primary search is driven by the one-best hypothesis of a single auxiliary system. Then, we generalize DDA to confusion-network driven decoding and propose a general scheme for multiple-system combination. The proposed extended DDA is evaluated using 3 ASR systems from different labs. Results show that generalized DDA significantly outperforms the ROVER method: we obtain a 15.7% relative word error rate improvement with respect to the best single system, as opposed to 8.5% with the ROVER combination.

article
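
The "relative" word error rate improvements quoted in the abstracts above are usually computed as the drop in WER divided by the baseline WER. The helper below is a small sketch of that standard arithmetic; the function name and the input values are hypothetical and are not taken from any of the papers.

```python
def relative_wer_reduction(baseline_wer, new_wer):
    """Relative WER reduction in percent: (baseline - new) / baseline * 100."""
    return (baseline_wer - new_wer) / baseline_wer * 100.0


# Hypothetical absolute WERs (in percent), for illustration only.
reduction = relative_wer_reduction(20.0, 17.0)
print(round(reduction, 1))
```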

Paper 1623 Towards unsupervised topic adaptation of language models: using the Internet as an open corpus

Gwénolé Lecorvé (Irisa, INSA de Rennes)

Guillaume Gravier (Irisa, CNRS)

Pascale Sébillot (Irisa, INSA de Rennes)

Abstract: Since the language models (LMs) of automatic speech recognition systems are usually trained on multi-topic corpora, topic adaptation has been shown to be an effective way to improve recognition accuracy, especially for broadcast news. This paper presents a new, complete, and unsupervised technique that uses information retrieval methods and the Internet to retrieve thematically coherent corpora from which adapted LMs are trained. Experimental results demonstrate the validity of the proposed adaptation method, with significant perplexity and word error rate reductions, and also show that topic adaptation should be included early in the recognition process.

article

Paper 1625 System combination for the automatic phonetization of proper names

Antoine Laurent (LIUM (Laboratoire d'Informatique de l'Université du Maine), Spécinov)

Sylvain Meignier (LIUM (Laboratoire d'Informatique de l'Université du Maine))

Yannick Estève (LIUM (Laboratoire d'Informatique de l'Université du Maine))

Paul Deléglise (LIUM (Laboratoire d'Informatique de l'Université du Maine))

Abstract: Large-vocabulary speech recognition systems perform adequately in known, controlled usage contexts. However, recognizing proper names is generally considered a difficult task. The automatic phonetization of proper names is hard to obtain, even though it is one of the most important resources needed by the decoding system. In this article, we propose an automatic phonetization method applied to proper names. The method is based on combining the rule-based automatic phonetization system LIA_PHON with an acoustic-phonetic decoding system. On the ESTER corpus, we observed that the combined system obtains better results than our reference system (LIA_PHON).

article

Paper 1628 Using the structure of personalized passwords for embedded speaker recognition

Anthony Larcher (Université d'Avignon, LIA)

Jean-François Bonastre (Université d'Avignon, LIA)

John S. D. Mason (Speech and Image group, Swansea University)

Abstract: Embedded speaker recognition on mobile devices has only limited computing resources available. Both enrolment and testing have to be done using short audio sequences. Although they have proved effective in more classical situations, GMM/UBM-based systems show their limits in this context. This paper addresses this problem and proposes to take into account the linguistic nature of the speech material within the GMM/UBM framework. The proposed solution mixes the text-independent aspects of the GMM/UBM with a semi-continuous-like approach in order to handle the text-dependent information. This system respects both the resource and the ergonomic constraints of the application field considered. Preliminary experiments on the MyIdea database show the potential of the proposed approach.

article

Paper 1657 Fast adaptation of compact acoustic models

Christophe Lévy (Université d'Avignon et des Pays de Vaucluse)

Georges Linarès (Université d'Avignon et des Pays de Vaucluse)

Jean-François Bonastre (Université d'Avignon et des Pays de Vaucluse)

Abstract: In previous work we presented a new architecture dedicated to embedded speech recognition. It relies on a general GMM, which represents the whole acoustic space, associated with a set of HMM state-dependent probability functions modeled as transformations of this GMM. This work takes advantage of this architecture to propose a fast and efficient way to adapt the acoustic models. The adaptation is performed only on the general GMM and does not require state-dependent adaptation data. It is also very efficient in terms of computational cost. We evaluate our approach on a voice-command task. This adaptation method achieved a relative error-rate decrease of about 10%, even when little adaptation data is available.

article