A computationally efficient mel-filter bank VAD algorithm for distributed speech recognition systems (Q2570389)

From MaRDI portal
MaRDI profile type: MaRDI publication profile
Full work available at URL: https://doi.org/10.1155/asp.2005.487
OpenAlex ID: W2117468249
 

Latest revision as of 10:38, 30 July 2024

Language: English
Label: A computationally efficient mel-filter bank VAD algorithm for distributed speech recognition systems
Description: scientific article

    Statements

    A computationally efficient mel-filter bank VAD algorithm for distributed speech recognition systems (English)
    28 October 2005
    Summary: This paper presents a novel, computationally efficient voice activity detection (VAD) algorithm and emphasizes the importance of such algorithms in distributed speech recognition (DSR) systems. When VAD algorithms are used in telecommunication systems, the required capacity of the speech transmission channel can be reduced if only the speech parts of the signal are transmitted. A similar objective can be adopted in DSR systems, where the nonspeech parameters are not sent over the transmission channel. A novel approach is proposed in which VAD decisions are based on mel-filter bank (MFB) outputs combined with a so-called hangover criterion. The proposed MFB VAD algorithm is compared with the three VAD algorithms used in the G.729, G.723.1, and DSR (advanced front-end) standards. The tests were made on the Aurora 2 database at different signal-to-noise ratios (SNRs). In the speech recognition tests, the proposed MFB VAD outperformed all three VAD algorithms used in the standards: by \(14.19\%\) relative (G.723.1 VAD), by \(12.84\%\) relative (G.729 VAD), and by \(4.17\%\) relative (DSR VAD), across all SNRs.
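    The summary does not give the paper's actual decision rule, so the following is only a minimal sketch of the general idea of a VAD decision on mel-filter bank outputs with a hangover. It assumes a simple energy threshold above a noise floor estimated from the first few frames. The function name `vad_with_hangover` and the parameters `noise_frames`, `margin_db`, and `hangover` are hypothetical and are not taken from the paper or from any standard.

```python
import math


def vad_with_hangover(mfb_frames, noise_frames=3, margin_db=6.0, hangover=2):
    """Label each frame as speech (True) or non-speech (False).

    mfb_frames: list of frames, each a list of non-negative mel-filter
    bank output energies. All parameter values are illustrative
    assumptions, not values from the paper.
    """
    if not mfb_frames:
        return []

    # Frame-level log energy in dB, summed over all mel bands.
    energies_db = [10.0 * math.log10(sum(frame) + 1e-10) for frame in mfb_frames]

    # Assumption: the first few frames contain only noise, so their mean
    # log energy serves as the noise floor.
    head = energies_db[:noise_frames]
    threshold = sum(head) / len(head) + margin_db

    decisions = []
    remaining = 0  # hangover frames still to be labeled as speech
    for energy in energies_db:
        if energy > threshold:
            # Frame is clearly above the noise floor: speech.
            decisions.append(True)
            remaining = hangover
        elif remaining > 0:
            # Hangover: keep labeling speech for a few frames after the
            # last detection, so weak word endings are not cut off.
            decisions.append(True)
            remaining -= 1
        else:
            decisions.append(False)
    return decisions


noise = [1.0, 1.0]
speech = [100.0, 100.0]
frames = [noise, noise, noise, speech, noise, noise, noise, noise]
labels = vad_with_hangover(frames)
```

    In a DSR front end, only the feature vectors of frames labeled as speech would be sent over the transmission channel. The hangover trades a little extra transmission for fewer clipped speech segments.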
    voice activity detection
    distributed speech recognition
    telecommunication systems
