Lexical-phonetic automata for spoken utterance indexing and retrieval

* Corresponding author 1 TEXMEX - Multimedia content-based indexing IRISA - Institut de Recherche en Informatique et Systèmes Aléatoires, Inria Rennes – Bretagne Atlantique 2 BUSIM Speech Group Department of Electrical and Electronic Engineering Istanbul

Abstract: This paper presents a method for indexing spoken utterances that combines lexical and phonetic hypotheses in a hybrid index built from automata. Retrieval is performed by a lexical-phonetic, semi-imperfect matching whose aim is to improve recall. A feature vector containing edit distance scores and a confidence measure weights each transition, which helps filter the candidate utterance list for a more precise search. Experimental results show that the lexical and phonetic representations are complementary, and the hybrid search is compared with the state-of-the-art cascaded search for retrieving named entity queries.
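As a rough illustration of the kind of feature vector the abstract mentions, the sketch below computes a standard Levenshtein edit distance between a query phone sequence and a hypothesised phone sequence, and pairs it with a confidence score. This is not the authors' implementation: the function names, the unit edit costs, the length normalisation, and the layout of the feature list are all assumptions made for this example.

```python
def edit_distance(a, b):
    """Levenshtein distance between two sequences, with unit costs (assumed)."""
    prev = list(range(len(b) + 1))  # distances from the empty prefix of a
    for i, x in enumerate(a, start=1):
        curr = [i]  # deleting the first i symbols of a
        for j, y in enumerate(b, start=1):
            cost = 0 if x == y else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def match_features(query_phones, hyp_phones, confidence):
    """Hypothetical feature vector: [raw distance, normalised distance, confidence]."""
    dist = edit_distance(query_phones, hyp_phones)
    norm = dist / max(len(query_phones), len(hyp_phones), 1)
    return [dist, norm, confidence]
```

A vector of this kind could then be passed to a supervised classifier that decides whether a candidate utterance is kept or filtered out; how the paper actually does this is not detailed in the abstract.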

Keywords: information retrieval, speech indexing, lexical-phonetic automata, confidence measures, edit distances, supervised learning

Authors: Julien Fayolle - Murat Saraclar - Fabienne Moreau - Christian Raymond - Guillaume Gravier

Source: https://hal.archives-ouvertes.fr/

