Robust Audio Watermarking Based on Empirical Mode Decomposition and Group Differential Relations
×
Cite This
Citation & Abstract
W. Lai, T. Chou, M. Chou, and BJ. W.. Schuller, "Robust Audio Watermarking Based on Empirical Mode Decomposition and Group Differential Relations," J. Audio Eng. Soc., vol. 71, no. 3, pp. 100-117, (2023 March.). doi: https://doi.org/10.17743/jaes.2022.0067
W. Lai, T. Chou, M. Chou, and BJ. W.. Schuller, "Robust Audio Watermarking Based on Empirical Mode Decomposition and Group Differential Relations," J. Audio Eng. Soc., vol. 71 Issue 3 pp. 100-117, (2023 March.). doi: https://doi.org/10.17743/jaes.2022.0067
Abstract: An audio watermarking technique using Complementary Ensemble Empirical Mode Decomposition and group differential relations of average absolute amplitudes of the last Intrinsic Mode Function (IMF) is proposed. By using group differential relations, the relationship with neighboring samples in the last IMF is well preserved, and near-imperceptibility can be achieved. Placing a watermark on low-frequency components, the last IMF, which is perceptually significant, therefore makes the watermark difficult to be removed. The embedding watermark, which is a logo image in our experiment, is processed by Arnold transformation, secret key encryption, and Bose--Chaudhuri--Hocquenghem coding to enhance robustness and security. Experimental results of the signal-to-noise ratio fit the recommendations of imperceptibility of the International Federation of the Phonographic Industry. The average Objective Difference Grade (an objective measure that correlates very well with subjective assessment) and subjective quality assessment were performed to evaluate the imperceptibility. Furthermore, our method accomplishes robustness under 13 different categories of attacks, including noise corruption, amplitude scaling, echo addition, resampling, re-quantization, low-pass filtering, MPEG-1 Audio Layer III compression, Digital-to-Analog/Analog-to-Digital conversion, cropping, time shift, zero thresholding, jittering, and combined attacks.
@article{lai2023robust,
author={lai, wen-hsing and chou, tsung-yuan and chou, meng-chen and schuller, björn w.},
journal={journal of the audio engineering society},
title={robust audio watermarking based on empirical mode decomposition and group differential relations},
year={2023},
volume={71},
number={3},
pages={100-117},
doi={https://doi.org/10.17743/jaes.2022.0067},
month={march},}
@article{lai2023robust,
author={lai, wen-hsing and chou, tsung-yuan and chou, meng-chen and schuller, björn w.},
journal={journal of the audio engineering society},
title={robust audio watermarking based on empirical mode decomposition and group differential relations},
year={2023},
volume={71},
number={3},
pages={100-117},
doi={https://doi.org/10.17743/jaes.2022.0067},
month={march},
abstract={an audio watermarking technique using complementary ensemble empirical mode decomposition and group differential relations of average absolute amplitudes of the last intrinsic mode function (imf) is proposed. by using group differential relations, the relationship with neighboring samples in the last imf is well preserved, and near-imperceptibility can be achieved. placing a watermark on low-frequency components, the last imf, which is perceptually significant, therefore makes the watermark difficult to be removed. the embedding watermark, which is a logo image in our experiment, is processed by arnold transformation, secret key encryption, and bose--chaudhuri--hocquenghem coding to enhance robustness and security. experimental results of the signal-to-noise ratio fit the recommendations of imperceptibility of the international federation of the phonographic industry. the average objective difference grade (an objective measure that correlates very well with subjective assessment) and subjective quality assessment were performed to evaluate the imperceptibility. furthermore, our method accomplishes robustness under 13 different categories of attacks, including noise corruption, amplitude scaling, echo addition, resampling, re-quantization, low-pass filtering, mpeg-1 audio layer iii compression, digital-to-analog/analog-to-digital conversion, cropping, time shift, zero thresholding, jittering, and combined attacks.},}
TY - paper
TI - Robust Audio Watermarking Based on Empirical Mode Decomposition and Group Differential Relations
SP - 100
EP - 117
AU - Lai, Wen-Hsing
AU - Chou, Tsung-Yuan
AU - Chou, Meng-Chen
AU - Schuller, Björn W.
PY - 2023
JO - Journal of the Audio Engineering Society
IS - 3
VO - 71
VL - 71
Y1 - March 2023
TY - paper
TI - Robust Audio Watermarking Based on Empirical Mode Decomposition and Group Differential Relations
SP - 100
EP - 117
AU - Lai, Wen-Hsing
AU - Chou, Tsung-Yuan
AU - Chou, Meng-Chen
AU - Schuller, Björn W.
PY - 2023
JO - Journal of the Audio Engineering Society
IS - 3
VO - 71
VL - 71
Y1 - March 2023
AB - An audio watermarking technique using Complementary Ensemble Empirical Mode Decomposition and group differential relations of average absolute amplitudes of the last Intrinsic Mode Function (IMF) is proposed. By using group differential relations, the relationship with neighboring samples in the last IMF is well preserved, and near-imperceptibility can be achieved. Placing a watermark on low-frequency components, the last IMF, which is perceptually significant, therefore makes the watermark difficult to be removed. The embedding watermark, which is a logo image in our experiment, is processed by Arnold transformation, secret key encryption, and Bose--Chaudhuri--Hocquenghem coding to enhance robustness and security. Experimental results of the signal-to-noise ratio fit the recommendations of imperceptibility of the International Federation of the Phonographic Industry. The average Objective Difference Grade (an objective measure that correlates very well with subjective assessment) and subjective quality assessment were performed to evaluate the imperceptibility. Furthermore, our method accomplishes robustness under 13 different categories of attacks, including noise corruption, amplitude scaling, echo addition, resampling, re-quantization, low-pass filtering, MPEG-1 Audio Layer III compression, Digital-to-Analog/Analog-to-Digital conversion, cropping, time shift, zero thresholding, jittering, and combined attacks.
An audio watermarking technique using Complementary Ensemble Empirical Mode Decomposition and group differential relations of average absolute amplitudes of the last Intrinsic Mode Function (IMF) is proposed. By using group differential relations, the relationship with neighboring samples in the last IMF is well preserved, and near-imperceptibility can be achieved. Placing a watermark on low-frequency components, the last IMF, which is perceptually significant, therefore makes the watermark difficult to be removed. The embedding watermark, which is a logo image in our experiment, is processed by Arnold transformation, secret key encryption, and Bose--Chaudhuri--Hocquenghem coding to enhance robustness and security. Experimental results of the signal-to-noise ratio fit the recommendations of imperceptibility of the International Federation of the Phonographic Industry. The average Objective Difference Grade (an objective measure that correlates very well with subjective assessment) and subjective quality assessment were performed to evaluate the imperceptibility. Furthermore, our method accomplishes robustness under 13 different categories of attacks, including noise corruption, amplitude scaling, echo addition, resampling, re-quantization, low-pass filtering, MPEG-1 Audio Layer III compression, Digital-to-Analog/Analog-to-Digital conversion, cropping, time shift, zero thresholding, jittering, and combined attacks.
Authors:
Lai, Wen-Hsing; Chou, Tsung-Yuan; Chou, Meng-Chen; Schuller, Björn W.
Affiliations:
Department of Computer and Communication Engineering, National Kaohsiung University of Science and Technology, No. 1, University Rd., Yanchao Dist., Kaohsiung City 82445, Taiwan; ZD.B Chair of Embedded Intelligence for Health Care and Wellbeing, University of Augsburg, Germany; Ph.D. Program in Engineering Science and Technology, College of Engineering, National Kaohsiung University of Science and Technology, Taiwan; Department of Computer and Communication Engineering, National Kaohsiung University of Science and Technology, No. 1, University Rd., Yanchao Dist., Kaohsiung City 82445, Taiwan; ZD.B Chair of Embedded Intelligence for Health Care and Wellbeing, University of Augsburg, Germany; GLAM – Group on Language, Audio & Music, Imperial College London, U.K.(See document for exact affiliation information.) JAES Volume 71 Issue 3 pp. 100-117; March 2023
Publication Date:
March 7, 2023Import into BibTeX
Permalink:
http://www.aes.org/e-lib/browse.cfm?elib=22034