
QLAW: An Improved Quantization-Based Local Audio Watermarking Scheme Using Inter-Frame Correlation
Author(s) -
Qiutong Li,
Zheng Xing,
Ju Wang,
Guoheng Huang,
Xiaochen Yuan
Publication year - 2025
Publication title -
ieee access
Language(s) - English
Resource type - Magazines
SCImago Journal Rank - 0.587
H-Index - 127
eISSN - 2169-3536
DOI - 10.1109/access.2025.3573838
Subject(s) - aerospace , bioengineering , communication, networking and broadcast technologies , components, circuits, devices and systems , computing and processing , engineered materials, dielectrics and plasmas , engineering profession , fields, waves and electromagnetics , general topics for engineers , geoscience , nuclear engineering , photonics and electrooptics , power, energy and industry applications , robotics and control systems , signal processing and analysis , transportation
With the rapid development of the Internet, audio distribution has become more convenient with increasing copyright infringement. To address this problem, this paper proposes a quantization-based local audio watermarking scheme using inter-frame correlation, integrating machine learning techniques and traditional methods. To obtain the time-frequency spectrogram of the audio signal, a Short-Time Fourier Transform (STFT) is first applied to the audio signal. Then, Main Energy Region Extractor (MERE) is proposed to extract the main energy region of the spectrogram. Based on the main energy region, the Stable Frequency and Energy Region Extractor is conducted to find the local feature region for embedding. After segmenting the local feature embedding region into several frames, Adjacent Frame Extraction Process (AFEP) is conducted to select the adjacent frame. Then, Discrete Cosine Transform (DCT) is applied to each embedding frame and its adjacent frame to extract their corresponding frequency domain coefficients. To improve robustness, mid-frequency DCT coefficients are alternately selected to embed the watermark. By adjusting the difference between the embedding frame and its corresponding adjacent frame in a predefined range, the local watermark is embedded. Experimental results show that the proposed scheme outperforms existing schemes in inaudibility and robustness, achieving an average Signal-to-Noise Ratio (SNR) above 25 dB and a lower Bit Error Rate (BER) under various attacks.
Empowering knowledge with every search
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom