Audio Watermarking through Deterministic plus Stochastic Signal Decomposition

Liu, Yi-Wen; Smith, Julius O.

doi:10.1155/2007/75961

Research Article
Open access
Published: 15 November 2007

Audio Watermarking through Deterministic plus Stochastic Signal Decomposition

Yi-Wen Liu^1,2 &
Julius O. Smith¹

EURASIP Journal on Information Security volume 2007, Article number: 075961 (2007) Cite this article

2486 Accesses
4 Citations
3 Altmetric
Metrics details

Abstract

This paper describes an audio watermarking scheme based on sinusoidal signal modeling. To embed a watermark in an original signal (referred to as a cover signal hereafter), the following steps are taken. (a) A short-time Fourier transform is applied to the cover signal. (b) Prominent spectral peaks are identified and removed. (c) Their frequencies are subjected to quantization index modulation. (d) Quantized spectral peaks are added back to the spectrum. (e) Inverse Fourier transform and overlap-adding produce a watermarked signal. To decode the watermark, frequencies of prominent spectral peaks are estimated by quadratic interpolation on the magnitude spectrum. Afterwards, a maximum-likelihood procedure determines the binary value embedded in each frame. Results of testing against lossy compression, low- and highpass filtering, reverberation, and stereo-to-mono reduction are reported. A Hamming code is adopted to reduce the bit error rate (BER), and ways to improve sound quality are suggested as future research directions.

[1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41]

References

Kirovski D, Malvar HS: Spread-spectrum watermarking of audio signals. IEEE Transactions on Signal Processing 2003, 51(4):1020-1033. 10.1109/TSP.2003.809384
Article MathSciNet Google Scholar
Swanson MD, Zhu B, Tewfik AH, Boney L: Robust audio watermarking using perceptual masking. Signal Processing 1998, 66(3):337-355. 10.1016/S0165-1684(98)00014-0
Article MATH Google Scholar
Chou J, Ramchandran K, Ortega A: Next generation techniques for robust and imperceptible audio data hiding. Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP '01), May 2001, Salt Lake City, Utah, USA 3: 1349-1352.
Google Scholar
Vercoe BL, Gardner WG, Scheirer ED: Structured audio: creation, transmission, and rendering of parametric sound representations. Proceedings of the IEEE 1998, 86(5):922-939. 10.1109/5.664280
Article Google Scholar
Liu Y-W, Smith JO: Watermarking parametric representations for synthetic audio. Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP '03), April 2003, Hong Kong 5: 660-663.
Google Scholar
Markel JD, Gray AH: Linear Prediction of Speech. Springer, New York, NY, USA; 1976.
Book MATH Google Scholar
Schroeder MR, Atal BS: Code-excited linear prediction (CELP): high-quality speech at very low bit rates. Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP '85), April 1985, Tampa, Fla, USA 10: 937-940.
Article Google Scholar
McAulay RJ, Quatieri TF: Speech analysis/synthesis based on a sinusoidal representation. IEEE Transaction Acoustics, Speech, Signal Processing 1986, 34(4):744-754. 10.1109/TASSP.1986.1164910
Article Google Scholar
Smith JO, Serra X: PARSHL: an analysis/synthesis program for non-harmonic sounds based on a sinusoidal representation. Proceedings of the International Computer Music Conference (ICMC '87), 1987, Tokyo, Japan 290-297.
Google Scholar
Serra X, Smith JO: Spectral modeling synthesis: a sound analysis/synthesis system based on a deterministic plus stochastic decomposition. Computer Music Journal 1990, 14(4):12-24. 10.2307/3680788
Article Google Scholar
S. N. Levine, “Audio representations for data compression and compressed domain processing,” Ph.D. dissertation, Stanford University, Stanford, Calif, USA, 1998.
Google Scholar
Purnhagen H, Meine N: HILN-the MPEG-4 parametric audio coding tools. Proceedings of the IEEE International Symposium on Circuits and Systems (ISCAS '00), May 2000, Geneva, Switzerland 3: 201-204.
Google Scholar
Wu C-P, Su P-C, Kuo C-CJ: Robust and efficient digital audio watermarking using audio content analysis. Proceedings of Security and Watermarking of Multimedia Contents II: Audio Watermarking, January 2000, San Jose, Calif, USA, Proceedings of SPIE 3971: 382-392.
Article Google Scholar
M. Ali, “Adaptive signal representation with application in audio coding,” Ph.D. dissertation, University ofMinnesota,Minneapolis, Minn, USA, 1996.
Google Scholar
Mansour MF, Tewfik AH: Time-scale invariant audio data embedding. EURASIP Journal on Applied Signal Processing 2003, 2003(10):993-1000. 10.1155/S1110865703304135
Article Google Scholar
Bender W, Gruhl D, Morimoto N, Lu A: Techniques for data hiding. IBM Systems Journal 1996, 35(3-4):313-336.
Article Google Scholar
Dong X, Bocko MF, Ignjatovic Z: Data hiding via phase manipulation of audio signals. Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP '04), May 2004, Montreal, QC, Canada 5: 377-380.
Google Scholar
Chen B, Wornell GW: Quantization index modulation: a class of provably good methods for digital watermarking and information embedding. IEEE Transactions on Information Theory 2001, 47(4):1423-1443. 10.1109/18.923725
Article MATH MathSciNet Google Scholar
Petrovic R: Audio signal watermarking based on replica modulation. Proceedings of the 5th International Conference on Telecommunications in Modern Satellite, Cable and Broadcasting Service (TELSIKS '01), September 2001, Nis, Yugoslavia 1: 227-234.
Google Scholar
Shin S, Kim O, Kim J, Choil J: A robust audio watermarking algorithm using pitch scaling. Proceedings of the 14th International Conference on Digital Signal Processing (DSP '02), October 2002, Pine Mountain, GA, USA 701-704.
Google Scholar
Girin L, Marchand S: Watermarking of speech signals using the sinusoidal model and frequency modulation of the partials. Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP '04), May 2004, Montreal, QC, Canada 1: 633-636.
Google Scholar
Liu Y-W, Smith JO: Watermarking sinusoidal audio representations by quantization index modulation in multiple frequencies. Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP '04), May 2004, Montreal, QC, Canada 5: 373-376.
Google Scholar
Harris FJ: On the use of windows for harmonic analysis with the discrete Fourier transform. Proceedings of the IEEE 1978, 66(1):51-83.
Article Google Scholar
Bosi M: Perceptual audio coding. IEEE Signal Processing Magazine 1997, 14(5):43-49.
Google Scholar
Zwicker E, Fastl H: Psychoacoustics, Facts and Models. Springer, Berlin, Germany; 1990.
Google Scholar
Jayant N, Johnston J, Safranek R: Signal compression based on models of human perception. Proceedings of the IEEE 1993, 81(10):1385-1422. 10.1109/5.241504
Article Google Scholar
Bosi M, Goldberg RE: Introduction to Digital Audio Coding and Standards. Kluwer Academic Publishers, Boston, Mass, USA; 2003.
Book Google Scholar
Cox IJ, Miller ML, Bloom JA: Digital Watermarking. Morgan Kaufmann, San Francisco, Calif, USA; 2002.
Google Scholar
Terhardt E: Calculating virtual pitch. Hearing Research 1979, 1(2):155-182. 10.1016/0378-5955(79)90025-X
Article Google Scholar
Abe M, Smith JO: Design criteria for simple sinusoidal parameter estimation based on quadratic interpolation of FFT magnitude peaks. Proceedings of the 117th Audio Engineering Society Conventions and Conferences (AES '04), October 2004, San Francisco, Calif, USA 6256.
Google Scholar
Shower EG, Biddulph R: Differential pitch sensitivity of the ear. Journal of the Acoustical Society of America 1931, 3(1A):275-287.
Article Google Scholar
Wier CC, Jesteadt W, Green DM: Frequency discrimination as a function of frequency and sensation level. Journal of the Acoustical Society of America 1977, 61(1):178-184. 10.1121/1.381251
Article Google Scholar
Zeng F-G, Kong Y-Y, Michalewski HJ, Starr A: Perceptual consequences of disrupted auditory nerve activity. Journal of Neurophysiology 2005, 93(6):3050-3063. 10.1152/jn.00985.2004
Article Google Scholar
Liu Y-W: Audio watermarking through parametric synthesis models. In Digital Audio Watermarking Techniques and Technologies: Applications and Benchmarking. Edited by: Cvejic N. Idea Group, Hershey, Pa, USA; 2007.
Google Scholar
Scharf LL, McWhorter LT: Geometry of the Cramer-Rao bound. Proceedings of the 6th IEEE SP Workshop on Statistical Signal and Array Processing, October 1992, Victoria, BC, Canada 31(3):301-311.
Google Scholar
Wolters M, Kjörling K, Homm D, Purnhagen H: A closer look into MPEG-4 high efficiency AAC. Proceedings of the 115th Audio Engineering Society Conventions and Conferences (AES '03), October 2003, New York, NY, USA
Google Scholar
Allen JB, Berkley DA: Image method for efficiently simulating small-room acoustics. Journal of the Acoustical Society of America 1979, 65(4):943-950. 10.1121/1.382599
Article Google Scholar
Kabal P: An examination and interpretation of ITU-R BS.1387: perceptual evaluation of audio quality. Department of Electrical & Computer Engineering, McGill University, Montreal, Canada; 2003.http://www-mmsp.ece.mcgill.ca/Documents/Software/http://www-mmsp.ece.mcgill.ca/Documents/Software/
Google Scholar
Pless V: Introduction to the Theory of Error-Correcting Codes. 3rd edition. Wiley-Interscience, New York, NY, USA; 1998.
Book MATH Google Scholar
Eggers JJ, Bäuml R, Tzschoppe R, Girod B: Scalar Costa scheme for information embedding. IEEE Transactions on Signal Processing 2003, 51(4):1003-1019. 10.1109/TSP.2003.809366
Article MathSciNet Google Scholar
Moulin P, Koetter R: Data-hiding codes. Proceedings of the IEEE 2005, 93(12):2083-2126.
Article Google Scholar

Download references

Author information

Authors and Affiliations

Center for Computer Research in Music and Acoustics (CCRMA), Stanford University, Palo Alto, CA, 94305, USA
Yi-Wen Liu & Julius O. Smith
Boys Town National Research Hospital, 555 North 30th Street, Omaha, NE, 68131, USA
Yi-Wen Liu

Authors

Yi-Wen Liu
View author publications
You can also search for this author in PubMed Google Scholar
Julius O. Smith
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Yi-Wen Liu.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 2.0 International License (https://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Liu, YW., Smith, J.O. Audio Watermarking through Deterministic plus Stochastic Signal Decomposition. EURASIP J. on Info. Security 2007, 075961 (2007). https://doi.org/10.1155/2007/75961

Download citation

Received: 01 May 2007
Revised: 10 August 2007
Accepted: 01 October 2007
Published: 15 November 2007
DOI: https://doi.org/10.1155/2007/75961

Audio Watermarking through Deterministic plus Stochastic Signal Decomposition

Abstract

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords