Extraction and matching of characteristic fingerprints from audio signals
Filing Information
- Patent Number: US7516074
- Application Number: US11219385
- Filing date: 09/01/2005
- Issue date: 04/07/2009
- Prior Publication Data:
- US20070055500 - 03/08/2007
- Predicted expiration date: 10/16/2027
- Patent term adjustment: 775 days
- U.S. Classifications: 704/270
- International Classifications: G10L2100
Abstract
An audio fingerprint is extracted from an audio sample, where the fingerprint contains information that is characteristic of the content in the sample. The fingerprint may be generated by computing an energy spectrum for the audio sample, resampling the energy spectrum logarithmically in the time dimension, transforming the resampled energy spectrum to produce a series of feature vectors, and computing the fingerprint using differential coding of the feature vectors. The generated fingerprint can be compared to a set of reference fingerprints in a database to identify the original audio content.
References Cited
U.S. Patent Documents
| Document Number | Assignees | Inventors | Issue/Pub Date |
|---|---|---|---|
| US5918223 | Muscle Fish | Blum et al. | Jun 1999 |
| US6266003 | Sigma Audio Research Limited | Hoek | Jul 2001 |
| US6453252 | Creative Technology Ltd. | Laroche | Sep 2002 |
| US6990453* | Landmark Digital Services LLC | Wang et al. | Jan 2006 |
| US7328153* | Gracenote, Inc. | Wells et al. | Feb 2008 |
| US7346512* | Landmark Digital Services, LLC | Wang et al. | Mar 2008 |
| US20020072982 | Shazam Entertainment Ltd. | Barton et al. | Jun 2002 |
| US20020083060 | | Wang et al. | Jun 2002 |
| US20020161741 | Shazam Entertainment Ltd. | Wang et al. | Oct 2002 |
| US20030086341 | GRACENOTE, INC. | Wells et al. | May 2003 |
| US20060072786 | Kaleidescape, Inc. | Watson et al. | Apr 2006 |
| US20060075237 | Koninklijke Philips Electronics N.V. | Seo et al. | Apr 2006 |
| US20060080356 | Microsoft Corporation | Burges et al. | Apr 2006 |
Foreign Patent Documents
| Document Number | Assignees | Inventors | Issue/Pub Date |
|---|---|---|---|
| WO200120483 | CANTAMETRIX, INC. | | Mar 2001 |
| WO200120609 | CANTAMETRIX, INC. | | Mar 2001 |
| WO200162004 | AUDIBLE MAGIC CORPORATION | | Aug 2001 |
| WO200176290 | MOBIQUID | | Oct 2001 |
| WO200227600 | SHAZAM ENTERTAINMENT LTD. | | Apr 2002 |
| WO200237316 | AUDIBLE MAGIC CORPORATION | | May 2002 |
| WO200211123 | SHAZAM ENTERTAINMENT LIMITED | | Jul 2002 |
Other Publications
| Foote, J. “Content-based Retrieval of Music and Audio”, in Multimedia Storage and Archiving Systems II, Proc. Of SPIE, C.C. J. Kuo et al., Eds., 1997, vol. 3229, pp. 138-147. |
| Lin, K. et al.: The TV-tree: An Index Structure for High Dimensional Data. VLDB Journal, pp. 517-541 (1992). |
| Zhang, T. et al.: Content-based Classification and Retrieval of Audio. In: SPIE's 43rd Annual Meeting—Conf. On Advanced Signal Processing Algorithms, Architectures, and Implementations VIII, SPIE vol. 3461, San Diego, Jul. (1998) 432-443. |
| TRM Advanced Audio Fingerprinting from Relatable. Retrieved from internet at http://relatable.com/tech/trm.html (2002). |
| Content-Based Classification, Search, and Retrieval of Audio: Free article by Muscle Fish in IEEE MultiMedia, retrieved from internet at: http://audiblemagic.com/solutions/ieee-1.htm (2002). |
| Sukittanon, Somsak et al., Modulation Scale Analysis for Content Identification, UWEE Technical Report No. UWEETR-2003-0025, Department of EE, University of Washington, Nov. 1, 2003, 11 pages. |
| Haitsma, Jaap et al., A Highly Robust Audio Fingerprinting System, Proceedings of ISMIR 2002. Paris, France, Oct. 2002, 9 pages. |
| Sukittanon, Somsak, Modulation-Scale Analysis for Content Identification, IEEE Transactions on Signal Processing. vol. 52, No. 10, Oct. 2004, pp. 3023-3035. |
| Seo, Jin Soo, Linear Speed-Change Resilient Audio Fingerprinting, Proc. 1st, IEEE Benelux Workshop on Model Based Processing and Coding of audio (MPCA-2002), Leuven, Belgium, Nov. 15, 2002, pp. 45-48. |
| Haitsma, J.A., A new Technology to Identify Music, Nat. Lab. Unclassified Report 2002/824, Aug. 13, 2002, 30 pages. |
Referenced By
The current document is not referenced by other documents.
Patent Family
The current document is not in a family.
Independent Claims (48 claims in total)
- 1. A method for extracting an audio fingerprint from an audio frame, the method comprising: filtering the audio frame into a plurality of frequency bands to produce a corresponding plurality of filtered audio signals; resampling the filtered audio signals at a nonlinear timescale; transforming the resampled audio signals for each frequency band to produce a feature vector for the frequency band; and computing the audio fingerprint based on the set of feature vectors.
- 26. A method for extracting an audio fingerprint from an audio frame, the method comprising: a step for computing a spectrogram for the audio frame; sampling the spectrogram at a nonlinear time scale for a plurality of frequency bands in the spectrogram; a step for extracting a long-term feature vector using the samples from each of the sampled frequency bands; and a step for generating the audio fingerprint based on the feature vectors.
- 28. A computer program product for extracting an audio fingerprint from an audio frame, the computer program product comprising a computer-readable medium containing computer program code for performing the method comprising: filtering the audio frame into a plurality of frequency bands to produce a corresponding plurality of filtered audio signals; resampling the filtered audio signals at a nonlinear timescale; transforming the resampled audio signals for each frequency band to produce a feature vector for the frequency band; and computing the audio fingerprint based on the set of feature vectors.
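
The steps recited in the independent claims (band filtering, nonlinear-timescale resampling, a per-band transform, and fingerprint computation), together with the differential coding and database comparison mentioned in the abstract, can be illustrated with a rough Python sketch. This is not the patented implementation. Every function name and parameter value below is hypothetical, and several choices are assumptions made only for illustration: a naive DFT with equal-width bands stands in for the filter bank, the logarithmic resampling uses linear interpolation, the differential coding is taken to be the sign of the difference between adjacent bands' feature vectors, and matching uses a plain bit error rate.

```python
import cmath
import math


def band_energies(samples, window=64, hop=32, n_bands=4):
    """Short-time energy per frequency band; returns bands[b][t]."""
    frames = []
    for start in range(0, len(samples) - window + 1, hop):
        chunk = samples[start:start + window]
        # Naive DFT power for bins 1 .. window/2 - 1 (assumption: no windowing).
        power = []
        for k in range(1, window // 2):
            acc = sum(x * cmath.exp(-2j * math.pi * k * n / window)
                      for n, x in enumerate(chunk))
            power.append(abs(acc) ** 2)
        per_band = len(power) // n_bands
        frames.append([sum(power[b * per_band:(b + 1) * per_band])
                       for b in range(n_bands)])
    return [list(column) for column in zip(*frames)]


def log_time_resample(series, n_out):
    """Sample a time series at logarithmically spaced time positions."""
    last = len(series) - 1
    out = []
    for i in range(n_out):
        # Positions run from 0 to last, densely at first, sparsely later.
        pos = min(math.exp(i / (n_out - 1) * math.log(last + 1)) - 1, last)
        lo = int(pos)
        hi = min(lo + 1, last)
        frac = pos - lo
        out.append(series[lo] * (1 - frac) + series[hi] * frac)
    return out


def dft_magnitudes(series, n_coeffs):
    """Feature vector for one band: magnitudes of DFT coefficients 1..n_coeffs."""
    n = len(series)
    return [abs(sum(x * cmath.exp(-2j * math.pi * k * t / n)
                    for t, x in enumerate(series)))
            for k in range(1, n_coeffs + 1)]


def fingerprint(samples, n_bands=4, n_resampled=16, n_coeffs=8):
    """Bit list of length (n_bands - 1) * n_coeffs for one audio frame."""
    bands = band_energies(samples, n_bands=n_bands)
    features = [dft_magnitudes(log_time_resample(band, n_resampled), n_coeffs)
                for band in bands]
    bits = []
    # Assumed differential coding: compare each band with the next one.
    for current, following in zip(features, features[1:]):
        bits.extend(1 if c > f else 0 for c, f in zip(current, following))
    return bits


def bit_error_rate(a, b):
    """Fraction of positions at which two equal-length fingerprints differ."""
    if len(a) != len(b):
        raise ValueError("fingerprints must have the same length")
    return sum(x != y for x, y in zip(a, b)) / len(a)


def best_match(query, references):
    """Name of the reference fingerprint with the lowest bit error rate."""
    return min(references, key=lambda name: bit_error_rate(query, references[name]))


if __name__ == "__main__":
    clip = [math.sin(2 * math.pi * 0.05 * n) for n in range(1024)]
    print(fingerprint(clip))
```

In this sketch `references` is a plain dictionary mapping a label to a stored fingerprint; a real reference database would need an index rather than the linear scan that `best_match` performs.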