Khan, Wasiq ORCID: 0000-0002-7511-3873 and Kuru, Kaya ORCID: 0000-0002-4279-4166 (2017) Intelligent system for spoken term detection using the belief combination. IEEE Intelligent Systems, 32 (1). pp. 70-79. ISSN 1541-1672
PDF (Author Accepted Manuscript), Accepted Version, 1MB
Available under License Creative Commons Attribution Non-commercial No Derivatives.
Official URL: http://doi.org/10.1109/MIS.2017.13
Abstract
Spoken Term Detection (STD) can be considered a sub-task of automatic speech recognition that aims to extract partial information from speech signals in the form of query utterances. Many STD techniques in the literature rely on a single source of evidence to decide whether a query utterance matches. In this manuscript, we develop an acoustic signal processing based approach for STD that incorporates techniques for silence removal, dynamic noise filtration, and evidence combination using Dempster-Shafer Theory (DST). A ‘spectral-temporal features based voiced segment detection’ and an ‘energy and zero-crossing rate based unvoiced segment detection’ are built to remove the silence segments in the speech signal. Comprehensive experiments have been performed on large speech datasets, and satisfactory results have been achieved with the proposed approach. Our approach improves on existing speaker-dependent STD approaches, specifically in the reliability of query utterance spotting, by combining evidence from multiple belief sources.
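As an illustration of the evidence-combination idea mentioned in the abstract, the following is a minimal sketch of Dempster's rule of combination for two belief sources over a match/mismatch frame. It is not the authors' implementation: the function name `combine`, the two sources, and all mass values are hypothetical and chosen only to show how agreement between sources strengthens the combined belief.

```python
from itertools import product


def combine(m1, m2):
    """Dempster's rule of combination for two mass functions.

    Each mass function maps a frozenset of hypotheses to a mass,
    and the masses of one function are assumed to sum to 1.
    """
    combined = {}
    conflict = 0.0
    for (a, mass_a), (b, mass_b) in product(m1.items(), m2.items()):
        intersection = a & b
        if intersection:
            # Compatible evidence: the product supports the intersection.
            combined[intersection] = combined.get(intersection, 0.0) + mass_a * mass_b
        else:
            # Contradictory evidence accumulates as conflict.
            conflict += mass_a * mass_b
    if conflict >= 1.0:
        raise ValueError("sources are in total conflict; combination is undefined")
    # Renormalise by the non-conflicting mass.
    return {k: v / (1.0 - conflict) for k, v in combined.items()}


# Frame of discernment for a single query utterance decision.
MATCH = frozenset({"match"})
MISMATCH = frozenset({"mismatch"})
EITHER = MATCH | MISMATCH  # mass left uncommitted ("don't know")

# Hypothetical masses from two independent belief sources.
source_a = {MATCH: 0.6, MISMATCH: 0.1, EITHER: 0.3}
source_b = {MATCH: 0.5, MISMATCH: 0.2, EITHER: 0.3}

fused = combine(source_a, source_b)
for hypothesis, mass in fused.items():
    print(sorted(hypothesis), round(mass, 3))
```

With these illustrative inputs, both sources lean towards a match, so the fused mass on a match is higher than either source assigns on its own, while the uncommitted mass shrinks.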