Open Access
Adaptive Speech Streaming Based on Packet Loss Prediction Using Support Vector Machine for Software‐Based Multipoint Control Unit over IP Networks
Author(s) -
Kang Jin Ah,
Han Mikyong,
Jang JongHyun,
Kim Hong Kook
Publication year - 2016
Publication title -
ETRI Journal
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.295
H-Index - 46
eISSN - 2233-7326
pISSN - 1225-6463
DOI - 10.4218/etrij.16.2716.0013
Subject(s) - computer science, network packet, speech coding, packet loss, speech recognition, voice activity detection, linear predictive coding, psqm, processing delay, real time computing, transmission delay, speech processing, computer network
An adaptive speech streaming method is proposed to improve the perceived speech quality of a software‐based multipoint control unit (SW‐based MCU) over IP networks. First, the proposed method predicts whether the speech packet to be transmitted will be lost. To this end, it learns the pattern of packet losses in the IP network and then predicts the loss of the packet to be transmitted over that network. Second, it classifies each speech frame as silence, unvoiced, speech onset, or voiced. Based on the results of packet loss prediction and speech classification, the method determines the proper amount and bitrate of redundant speech data (RSD) sent with the primary speech data (PSD) to help the speech decoder restore the speech signals of lost packets. Specifically, when a packet is predicted to be lost, the amount and bitrate of the RSD are increased by reducing the bitrate of the PSD. The effectiveness of the proposed method is then demonstrated using a support vector machine to learn the packet loss pattern and the adaptive multirate‐narrowband codec to assign different speech coding rates. The results show that, compared with conventional methods that restore lost speech signals, the proposed method remarkably improves the perceived speech quality of an SW‐based MCU under various packet loss conditions in an IP network.
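
The rate decision described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. The eight bitrates are the standard adaptive multirate‐narrowband (AMR-NB) speech modes, but the policy table, the names `FrameClass`, `LOSS_POLICY`, and `allocate_rates`, and the choice to send no RSD when no loss is predicted are all assumptions made for illustration. The loss prediction itself (a support vector machine in the paper) is represented here only by a boolean input.

```python
from enum import Enum

# The eight standard AMR-NB speech codec modes, in kbit/s.
AMR_NB_MODES_KBPS = (4.75, 5.15, 5.90, 6.70, 7.40, 7.95, 10.2, 12.2)

# Assumed per-frame budget: the highest AMR-NB mode.
BUDGET_KBPS = 12.2


class FrameClass(Enum):
    """Speech frame classes named in the abstract."""
    SILENCE = "silence"
    UNVOICED = "unvoiced"
    ONSET = "speech onset"
    VOICED = "voiced"


# Hypothetical policy: (PSD kbit/s, RSD kbit/s) used when a loss is predicted.
# The values are illustrative assumptions, not the paper's settings.
LOSS_POLICY = {
    FrameClass.SILENCE: (12.2, 0.0),    # assume silence needs no redundancy
    FrameClass.UNVOICED: (7.40, 4.75),
    FrameClass.VOICED: (6.70, 5.15),
    FrameClass.ONSET: (5.90, 5.90),     # assume onsets get the most protection
}


def allocate_rates(frame_class, loss_predicted):
    """Return (psd_kbps, rsd_kbps) for one speech frame.

    loss_predicted stands in for the output of the packet loss predictor.
    """
    if not loss_predicted:
        # Assumption: spend the whole budget on primary data.
        return (BUDGET_KBPS, 0.0)
    # A predicted loss lowers the PSD bitrate to make room for RSD.
    return LOSS_POLICY[frame_class]


if __name__ == "__main__":
    for cls in FrameClass:
        print(cls.value, allocate_rates(cls, loss_predicted=True))
```

In this sketch, every pair in the table stays within the assumed 12.2 kbit/s budget, so added redundancy is always paid for by a lower primary bitrate, which mirrors the trade-off the abstract describes.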
