Search Engine for Multilingual Audiovisual Contents
Authors: José David Pérez, Antonio Bonafonte, Antonio Cardenal, Marta Ruiz, José A. R. Fonollosa, Asunción Moreno, Eva Navas, Eduardo R. Banga
Abstract: This paper describes the BUCEADOR search engine, a web server that retrieves multimedia documents (text, audio, video) in different languages. All documents are translated into the user's language and presented either as text (for instance, subtitles in video documents) or as dubbed audio. The user query consists of a sequence of keywords and can be typed or spoken. Several Spoken Language Technology (SLT) servers have been implemented, including speech recognition, speech machine translation and text-to-speech conversion. The platform can be used in the four official languages of Spain (Spanish, Basque, Catalan and Galician) and in English.
Keywords: multimedia search, multilingual search, speech recognition, machine translation, speech synthesis, speech-to-speech translation.