Dual-domain audio watermarking algorithm based on flexible segmentation and adaptive embedding

Luo, Yifan, Peng, Dezhong, Sang, Yongsheng and Xiang, Yong 2019, Dual-domain audio watermarking algorithm based on flexible segmentation and adaptive embedding, IEEE access, vol. 7, pp. 10533-10545, doi: 10.1109/ACCESS.2019.2890972.

Attached Files
Name Description MIMEType Size Downloads

Title Dual-domain audio watermarking algorithm based on flexible segmentation and adaptive embedding
Author(s) Luo, Yifan
Peng, Dezhong
Sang, Yongsheng
Xiang, YongORCID iD for Xiang, Yong orcid.org/0000-0003-3545-7863
Journal name IEEE access
Volume number 7
Start page 10533
End page 10545
Total pages 13
Publisher Institute of Electrical and Electronics Engineers
Place of publication Piscataway, N.J.
Publication date 2019
ISSN 2169-3536
Keyword(s) Audio beats
psychoacoustic model
Science & Technology
Computer Science, Information Systems
Engineering, Electrical & Electronic
Computer Science
Summary This paper proposes a novel dual-domain audio watermarking approach based on flexible segmentation and adaptive embedding aimed to improve robustness and imperceptibility. Compared with conventional watermarking strategies, the proposed approach has two advantages. First, a novel audio beat detection approach is designed to flexibly segment the audio, which provides stronger robustness to synchronization attacks. The audio is decomposed by the discrete wavelet packet transform. Then, the covariance relationships of the decomposition coefficients at different time instants are calculated to determine the locations of the beats and to establish a flexible segmentation model. Second, a dual-domain embedding approach is proposed to realize better robustness to compression attacks while maintaining imperceptibility. In each segment, the psychoacoustic model is used to calculate the audio masking threshold, which divides the signals into the masking signal domain and masked signal domain. The signals in the masking signal domain are robust to compression attacks, and the signals in the masked signal domain have better imperceptibility. To combine these advantages, we embed the watermark into the two domains simultaneously by using the distortion-compensated dither modulation quantization approach. To reduce the impact of the watermark on the original audio, the frequency band with the lowest mask-to-noise ratio is selected as the embedding position for each domain. Moreover, the adaptive quantization steps are calculated to control the embedding strength according to the masking effect. The adaptive embedding will improve the robustness to compression attacks without significantly affecting the original audio quality. The effectiveness of our approach is verified through simulation experiments.
Language eng
DOI 10.1109/ACCESS.2019.2890972
HERDC Research category C1 Refereed article in a scholarly journal
Copyright notice ©2019, IEEE
Persistent URL http://hdl.handle.net/10536/DRO/DU:30119136

Connect to link resolver
Unless expressly stated otherwise, the copyright for items in DRO is owned by the author, with all rights reserved.

Version Filter Type
Citation counts: TR Web of Science Citation Count  Cited 0 times in TR Web of Science
Scopus Citation Count Cited 0 times in Scopus
Google Scholar Search Google Scholar
Access Statistics: 65 Abstract Views, 1 File Downloads  -  Detailed Statistics
Created: Thu, 21 Mar 2019, 09:25:28 EST

Every reasonable effort has been made to ensure that permission has been obtained for items included in DRO. If you believe that your rights have been infringed by this repository, please contact drosupport@deakin.edu.au.