JEP poster session - P2

Monday 9 June - 16:00 to 18:00

paper 1640 Annotation dynamique dans le corpus italien de dialogues spontanés LUNA (Dynamic annotation in the LUNA Italian corpus of spontaneous dialogues)

Christian Raymond (Université d'Avignon)

Kepa-Joseba Rodriguez (Piedmont Consortium for Information Systems)

Abstract: The aim of the LUNA project is to investigate the problem of spontaneous speech understanding in the context of conversational systems engaged in complex tasks such as the problem-solving paradigm. Three steps are considered for the Spoken Language Understanding (SLU) process: generation of semantic concept tags, semantic composition into conceptual structures, and context-sensitive validation. The SLU modules will be trained and evaluated on the LUNA corpus and applied to different conversational systems in Italian, French and Polish. In this paper, we present the semantic annotation procedure we are following on an Italian corpus. This corpus consists of human-human spontaneous dialogues recorded in the call center of the help desk facility of the Consortium for Information Systems of the Piedmont. The aim of our semantic annotation procedure is to speed up the manual annotation of the corpus and make it more reliable by following an active learning paradigm. The active learning procedure is coupled with annotation error detection to ensure more reliable annotation.

article

paper 1642 Evaluation de méthodes de réduction de corpus linguistiques (Evaluation of methods for reducing linguistic corpora)

Nelly Barbot (IRISA / Université de Rennes 1 - ENSSAT, Lannion)

Pierre Alain (IRISA / Université de Rennes 1 - ENSSAT, Lannion)

Olivier Boeffard (IRISA / Université de Rennes 1 - ENSSAT, Lannion)

Jonathan Chevelu (IRISA / Université de Rennes 1 - ENSSAT, Lannion)

Arnaud Delhay (IRISA / Université de Rennes 1 - ENSSAT)

Abstract: This article deals with covering methodologies in the context of automatic speech processing technologies. More precisely, we are interested in covering the phonological attributes of a linguistic corpus under the constraint of minimal duration. This goal is classically achieved by greedy algorithms, which however do not guarantee the optimality of the solutions. We compare the results of a new algorithm, the LamSCP, which relies on the principles of Lagrangian relaxation, with those of an agglomeration-splitting greedy algorithm for achieving an optimal covering. We conducted experiments on the Gutenberg corpus, considering phone, diphone and triphone optimal covering. The LamSCP provides better solutions than the greedy algorithm and makes it possible to assess their quality by providing a lower bound for the optimization problem.

article
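
As an illustration of the covering problem described in the abstract of paper 1642, here is a minimal sketch of a plain agglomerative greedy cover. It is not the authors' agglomeration-splitting algorithm and not the LamSCP; the function name `greedy_cover`, the sentence identifiers, the unit labels and the durations are all hypothetical. At each step it picks the sentence that adds the most still-uncovered units per second of speech, which gives a valid cover but, as the abstract notes for greedy methods in general, no optimality guarantee.

```python
def greedy_cover(sentences, durations):
    """Select sentences until every unit seen in the corpus is covered.

    sentences: dict mapping a sentence id to the set of units it contains
               (for example phones, diphones or triphones).
    durations: dict mapping a sentence id to its duration in seconds.
    Returns the list of selected sentence ids, in selection order.
    """
    required = set().union(*sentences.values())  # every unit to cover
    covered = set()
    chosen = []
    while covered != required:
        # Only sentences that still contribute a new unit are candidates.
        candidates = [s for s in sentences if sentences[s] - covered]
        # Greedy criterion: newly covered units per second of speech.
        best = max(
            candidates,
            key=lambda s: len(sentences[s] - covered) / durations[s],
        )
        chosen.append(best)
        covered |= sentences[best]
    return chosen


# Hypothetical toy corpus: three sentences covering the units a, b, c.
toy_sentences = {"s1": {"a", "b"}, "s2": {"b", "c"}, "s3": {"a", "b", "c"}}
toy_durations = {"s1": 1.0, "s2": 1.0, "s3": 3.0}
selection = greedy_cover(toy_sentences, toy_durations)
total_duration = sum(toy_durations[s] for s in selection)
```

On this toy corpus the sketch selects two short sentences rather than the single long one, because they cover the same units in less total time.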

paper 1644 Surveillance vocale de réseaux de communication professionnels par la reconnaissance du locuteur (Voice monitoring of professional communication networks through speaker recognition)

Alexandre Preti (Thales Communications / LIA)

Bertrand Ravera (Thales Communications)

François Capman (Thales Communications)

Jean-François Bonastre (Université d'Avignon / Laboratoire d'Informatique d'Avignon)

Abstract: Although the speaker recognition field is very active, few studies address the constraints of using a speaker recognition system inside a professional telecommunication network. This paper addresses this problem and proposes adaptations of such a system for a real-world network monitoring application. Both real-time constraints and distributed architectures are investigated. We propose frame-by-frame on-line processing for feature extraction, frame selection and normalization. The links between the network speech coder and the speaker recognition system are also investigated, for both the ETSI TETRA speech codec (at 4600 bit/s) and the NATO STANAG 4591 codec (at 2400 bit/s). The proposed solutions are compared with a classical unconstrained front-end (off-line processing).

article

paper 1654 Mistral : Plate-forme open source d'authentification biométrique (Mistral: an open-source platform for biometric authentication)

Eric Charton (LIA - Université d'Avignon)

Teva Merlin (LIUM - Université du Maine)

Christophe Lévy (LIA - Université d'Avignon)

Anthony Larcher (LIA - Université d'Avignon)

Sylvain Meignier (LIUM - Université du Maine)

Abstract: Mistral is an open-source biometric recognition platform. It supports speaker verification, language recognition, speaker segmentation and clustering, as well as face and fingerprint recognition. In this article, we present the use of Mistral as a free and open scientific instrument and as a teaching aid.

article

paper 1664 Comparaison de trois outils de détection automatique de proéminence en français parlé (Comparison of three tools for automatic prominence detection in spoken French)

Nicolas Obin (IRCAM)

Matthieu Avanzi (Université de Neuchâtel)

Jean-Philippe Goldman (Université de Genève)

Anne Lacheret-Dujour (Université de Paris X Nanterre and IUF)

Abstract: This paper presents the inner details of three different algorithms for prominence detection. On the basis of a 50-minute corpus covering 5 speaking styles and manually annotated for prominence, a quantitative evaluation compares the three approaches.

article

paper 1679 Modélisation Articulatoire de la Main en Langue Française Parlée Complétée : le cas de la clé digitale (Articulatory modelling of the hand in French Cued Speech: the case of the finger key)

Pablo Sacher (Grenoble Image Parole Signal Automatique, département Parole & Cognition)

Denis Beautemps (Grenoble Image Parole Signal Automatique, département Parole & Cognition)

Coriandre Vilain (Grenoble Image Parole Signal Automatique, département Parole & Cognition)

Abstract: In the context of audiovisual synthesis of LPC (Langue française Parlée Complétée, French Cued Speech) code, modelling the shape of the hand is a major challenge. This contribution presents several models for predicting hand shape. A discussion presents the cross-results of the two models and raises questions about the experimental setup.

article

paper 1582 Evolution du débit de parole chez l'enfant francophone dans des tâches narrative et conversationnelle (Development of speech rate in French-speaking children in narrative and conversational tasks)

Jean-Marc Colletta (Laboratoire Lidilem)

Catherine Pellenq (Laboratoire des Sciences de l'Education)

Isabelle Rousset (Laboratoire Lidilem)

Abstract: This study examines age-related changes in the oral narrative and expository discourse of 67 French schoolchildren aged 3-10 years who narrate a story based on a short animated film and answer why-questions. Length of utterances, number of clauses, words, speech segments, syllables, and speech rate were analysed using ELAN as an annotation tool. Our results show that the quantity and density of informational content increase with age in both narratives and explanations. Older children score significantly higher than younger children on all measures of duration and informational content. Our results also show a gradual, but not statistically significant, increase in speech rate. We discuss these results in the light of cognitive and linguistic development.

article

paper 1592 Voyelles brèves en parole conversationnelle (Short vowels in conversational speech)

Christine Meunier (CNRS-Université de Provence)

Yohann Meynadier (Université de Provence)

Robert Espesser (CNRS-Université de Provence)

Abstract: This work deals with the phenomenon of speech reduction in conversational speech. An automatic and a manual analysis have been conducted. The automatic analysis shows a strong reduction in the vocalic system and very short durations for a large proportion of vowels in the corpus. The manual analysis highlights the specific realisations of extra-short vowels (30 ms) according to voicing, formants, identification and lexicon.

article

paper 1620 Analyse des erreurs d'une stratégie de sondage automatique d'opinions (Error analysis of an automatic opinion survey strategy)

Nathalie Camelin (LIA - Université d'Avignon)

Frédéric Béchet (LIA - Université d'Avignon)

Géraldine Damnati (France Télécom R&D)

Renato De-Mori (LIA - Université d'Avignon)

Abstract: The automatic opinion survey strategy presented here extracts the distributions of opinions expressed by the users of a telephone service. It selects, from a large corpus, the messages likely to be processed correctly by the Automatic Speech Recognition (ASR) module and the classification module. For this reason, it is very important to verify the representativeness of the sub-corpus of messages selected by the rejection strategy. Several measures based on the Kullback-Leibler divergence are proposed to assess the validity of our opinion extraction strategy by analysing the different types of errors it entails.

article
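
To make the representativeness check mentioned in the abstract of paper 1620 concrete, here is a minimal sketch of the Kullback-Leibler divergence between two discrete opinion distributions. The function name `kl_divergence`, the opinion labels and the probabilities are hypothetical, and the abstract does not say in which direction or on which distributions the authors compute their measures; this only shows the standard definition, D_KL(p || q) = sum over x of p(x) log(p(x) / q(x)).

```python
import math


def kl_divergence(p, q):
    """Return D_KL(p || q) in nats for two discrete distributions.

    p, q: dicts mapping a label to its probability (each summing to 1).
    Returns math.inf when q gives zero probability to a label that p uses.
    """
    total = 0.0
    for label, p_x in p.items():
        if p_x == 0.0:
            continue  # a zero-probability term contributes nothing
        q_x = q.get(label, 0.0)
        if q_x == 0.0:
            return math.inf
        total += p_x * math.log(p_x / q_x)
    return total


# Hypothetical opinion distributions: full corpus vs. selected sub-corpus.
full_corpus = {"positive": 0.5, "negative": 0.5}
selected = {"positive": 0.25, "negative": 0.75}
divergence = kl_divergence(full_corpus, selected)
```

A divergence of zero means the two distributions are identical; larger values indicate that the selected sub-corpus departs further from the full corpus.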

paper 1641 Transcription manuelle vs assistée de la parole préparée et spontanée (Manual vs. assisted transcription of prepared and spontaneous speech)

Thierry Bazillon (Université du Maine)

Yannick Estève (Université du Maine)

Daniel Luzzati (Université du Maine)

Abstract: Our study concerns the time that can be saved when transcribing prepared and spontaneous speech by using an automatic speech recognition system, compared with fully manual transcription. Several tasks were timed (text transcription, speaker assignment, spelling correction...), and we present here an analysis of the main results.

article