Speech file compression by eliminating unvoiced/silence components
Author(s) -
Arda Sahin,
Mehmet Zübeyir Ünlü
Publication year - 2021
Publication title -
Sustainable Engineering and Innovation
Language(s) - English
Resource type - Journals
ISSN - 2712-0562
DOI - 10.37868/sei.v3i1.119
Subject(s) - computer science , silence , speech recognition , energy (signal processing) , component (thermodynamics) , compression (physics) , signal (programming language) , upload , matlab , noise (video) , voice activity detection , computer hardware , speech processing , acoustics , artificial intelligence , statistics , physics , materials science , mathematics , image (mathematics) , composite material , thermodynamics , programming language , operating system
The main objective of this study is to eliminate the noise component of a speech signal and to compress the signal by storing the locations and durations of its silence regions. Voiced, unvoiced, and silence regions are separated using the Short-Time Energy (STE) and Zero Crossing Rate (ZCR) methods. All operations in this study were performed through a User Interface (UI) developed in MATLAB®. These operations include recording voice, playing the recording, eliminating the unwanted regions, playing the modified recording, saving the original and compressed files, and loading a compressed recording.
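The abstract describes the approach only at a high level, so the following is a minimal sketch of how STE/ZCR frame classification and gap-based compression could look, not the authors' MATLAB implementation. It is written in Python for illustration. The function names, the non-overlapping 160-sample frames, the threshold values, and the rule that maps STE and ZCR to a label are all assumptions chosen for the example; a real system would tune them to the recording conditions.

```python
def frame_signal(samples, frame_len):
    """Split samples into non-overlapping frames; a trailing partial frame is dropped."""
    last_start = len(samples) - frame_len
    return [samples[i:i + frame_len] for i in range(0, last_start + 1, frame_len)]


def short_time_energy(frame):
    """Mean squared amplitude of one frame."""
    return sum(x * x for x in frame) / len(frame)


def zero_crossing_rate(frame):
    """Fraction of adjacent sample pairs whose signs differ."""
    crossings = sum(1 for a, b in zip(frame, frame[1:]) if (a >= 0) != (b >= 0))
    return crossings / max(len(frame) - 1, 1)


def classify_frame(frame, energy_thresh=1e-3, zcr_thresh=0.3):
    """Label a frame using assumed (hypothetical) thresholds.

    Assumed rule: a high ZCR marks the frame as unvoiced; otherwise high
    energy marks it as voiced; everything else is treated as silence.
    """
    if zero_crossing_rate(frame) >= zcr_thresh:
        return "unvoiced"
    if short_time_energy(frame) >= energy_thresh:
        return "voiced"
    return "silence"


def compress(samples, frame_len=160):
    """Keep voiced frames; record removed regions as (start_sample, duration)."""
    kept, gaps = [], []
    for idx, frame in enumerate(frame_signal(samples, frame_len)):
        if classify_frame(frame) == "voiced":
            kept.extend(frame)
            continue
        start = idx * frame_len
        if gaps and gaps[-1][0] + gaps[-1][1] == start:
            # Extend the previous gap when removed frames are adjacent.
            gaps[-1] = (gaps[-1][0], gaps[-1][1] + frame_len)
        else:
            gaps.append((start, frame_len))
    return kept, gaps


def decompress(kept, gaps):
    """Rebuild the timeline, filling each removed region with zeros."""
    out, pos = [], 0
    for start, duration in gaps:
        n_voiced = start - len(out)  # voiced samples that precede this gap
        out.extend(kept[pos:pos + n_voiced])
        pos += n_voiced
        out.extend([0.0] * duration)
    out.extend(kept[pos:])
    return out
```

In this sketch the compressed form is the pair `(kept, gaps)`: the voiced samples plus a short list of locations and durations. Decompression restores the original timing but replaces the removed regions with zeros, so the scheme is lossy for unvoiced and silence content.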