
Speech file compression by eliminating unvoiced/silence components
Author(s) -
Arda Sahin,
Mehmet Z. Unlu
Publication year - 2021
Publication title -
sustainable engineering and innovation
Language(s) - English
Resource type - Journals
ISSN - 2712-0562
DOI - 10.37868/sei.v3i1.119
Subject(s) - computer science , silence , speech recognition , energy (signal processing) , component (thermodynamics) , compression (physics) , signal (programming language) , upload , matlab , noise (video) , voice activity detection , computer hardware , speech processing , acoustics , artificial intelligence , statistics , physics , materials science , mathematics , image (mathematics) , composite material , thermodynamics , programming language , operating system
The main objective of this study is to have the noise component of a speech signal eliminated and compressed by storing the locations and durations of silence regions. The separation between voiced, unvoiced, and silence regions is done by using the Short-Time Energy (STE) and Zero Crossing Rate (ZCR) methodologies. All operations in this study have been performed by using the User Interface (UI) developed on MATLAB®. These operations include voice recording, playing the recording, eliminating the unwanted regions, playing the modified recording, saving original and compressed files, and loading the recording compressed.