An efficient mechanism for searching arabic audio libraries

Document Type

Conference Proceeding

Source of Publication

Proceedings - IEEE Sixth International Symposium on Multimedia Software Engineering, MSE 2004

Publication Date

12-1-2004

Abstract

In this paper we propose an approach that allows the user to query an Arabic audio library using voice. We use a combination of class-based language models and robust interpretation to recognize and identify the spoken keywords. The mechanism uses a Large Vocabulary Recognition System (LVCSR) to implement the functionality of an Arabic authority control system. A series of experiments were performed to assess the accuracy and the robustness of the proposed approach: restricted grammar recognition with semantic interpretation, class-based statistical language models (CB_SLM) with robust interpretation, and generalized CB-SLM. The results have shown that the combination of CB-SLM and robust interpretation provides better accuracy and robustness than the traditional grammar-based parsing. © 2004 IEEE.

ISBN

0769522173

Publisher

IEEE

First Page

579

Last Page

582

Disciplines

Computer Sciences

Keywords

Bibliographic information, Class-based language models, n-gram language models (LM), Sematic classes, Control systems, Database systems, Indexing (of information), Libraries, Program interpreters, Query languages, Semantics, Vocabulary control, Radio studios

Scopus ID

20844445131

Indexed in Scopus

yes

Open Access

no

Share

COinS