An efficient mechanism for searching arabic audio libraries
Source of Publication
Proceedings - IEEE Sixth International Symposium on Multimedia Software Engineering, MSE 2004
In this paper we propose an approach that allows the user to query an Arabic audio library using voice. We use a combination of class-based language models and robust interpretation to recognize and identify the spoken keywords. The mechanism uses a Large Vocabulary Recognition System (LVCSR) to implement the functionality of an Arabic authority control system. A series of experiments were performed to assess the accuracy and the robustness of the proposed approach: restricted grammar recognition with semantic interpretation, class-based statistical language models (CB_SLM) with robust interpretation, and generalized CB-SLM. The results have shown that the combination of CB-SLM and robust interpretation provides better accuracy and robustness than the traditional grammar-based parsing. © 2004 IEEE.
Bibliographic information; Class-based language models; n-gram language models (LM); Sematic classes; Control systems; Database systems; Indexing (of information); Libraries; Program interpreters; Query languages; Semantics; Vocabulary control; Radio studios
Kamel, Ibrahim and Talhami, Habib, "An efficient mechanism for searching arabic audio libraries" (2004). All Works. 434.
Indexed in Scopus