ROBUST AND INVARIANT AUDIO PATTERN MATCHING
Filing Information
- Publication Number: WO2003091990
- Application Number: US312126
- Filing date: 04/18/2003
- Publication date: 11/06/2003
- U.S. Classifications: --
- International Classifications: 7G 10L 21/00 A ·
- Foreign Priority: US37605502 - 04/25/2002 ·
Abstract
The present invention provides an innovative technique for rapidly and accurately determining whether two audio samples match, as well as being immune to various kinds of transformations, such as playback speed variation. The relationship between the two audio samples is characterized by first matching certain fingerprint objects derived from the respective samples. A set (230) of fingerprint objects (231, 232), each occurring at a particular location (242), is generated for each audio sample (210). Each location is determined in dependence upon the content of respective audio sample (210) and each fingerprint object (232) characterizes one or more local features (222) at or near the respective particular location (242). A relative value is next determined for each pair of matched fingerprint objects. A histogram of the relative values is then generated. If a statistically significant peak is found, the two audio samples can be characterized as substantially matching.References Cited
The current document has no citations.Referenced By
| Document Number | Assignee | Inventors | Issue/Pub Date |
|---|---|---|---|
| US7565327 | Audible Magic Corporation | Richard A. Schmelzer | Jul 2009 |
| US7562012 | Audible Magic Corporation | Erling H. Wold et al. | Jul 2009 |
| EP1704695 | Advestigo | Hassane ESSAFI et al. | Feb 2008 |
| EP1864279 | Fraunhofer-Gesellschaft zur Frderung der angewandten Forschung e.V. | Wolfgang FIESEL et al. | Jun 2009 |
| US7500007 | Audible Magic Corporation | Vance E. Ikezoye et al. | Mar 2009 |
| US7623823 | Integrated Media Measurement, Inc. | Tom Zito et al. | Nov 2009 |
| US7363278 | Audible Magic Corporation | Richard A. Schmelzer et al. | Apr 2008 |
| US7529659 | Audible Magic Corporation | Erling H. Wold | May 2009 |
Patent Family
| Document Number | Assignee | Inventors | Issue/Pub Date |
|---|---|---|---|
| WO2003091990 | SHAZAM ENTERTAINMENT, LTD. | Avery Li-Chun WANG et al. | Nov 2003 |
| CA2483104 | SHAZAM ENTERTAINMENT LTD | WANG AVERY LI-CHUN et al. | Nov 2003 |
| AU2003230993 | SHAZAM ENTERTAINMENT LTD | WANG AVERY LI-CHUN et al. | Nov 2003 |
| BR0309598 | SHAZAM ENTERTAINMENT LTD | WANG AVERY LI-CHUN et al. | Feb 2005 |
| CN1647160 | SHAZAM ENTERTAINMENT LTD | LI-CHUN WANG AVERY et al. | Jul 2005 |
| JP2005524108 | Aug 2005 | ||
| EP1504445 | Landmark Digital Services LLC | Avery Li-Chun WANG et al. | Aug 2008 |
| AT405924 | LANDMARK DIGITAL SERVICES LLC | WANG AVERY LI-CHUN et al. | Sep 2008 |
| DE60323086 | LANDMARK DIGITAL SERVICES LLC | WANG AVERY LI-CHUN et al. | Oct 2008 |
| DK1504445 | LANDMARK DIGITAL SERVICES LLC | WANG AVERY LI-CHUN et al. | Dec 2008 |
Independent Claims | See all claims (27)
- What is claimed is: 1. A method of characterizing a relationship between a first and a second audio samples, comprising the steps of: generating a first set of fingerprint objects for the first audio sample, each fingerprint object occurring at a respective location within the first audio sample, the respective location being determined in dependence upon the content of the first audio sample, and each fingerprint object characterising one or more features of the first audio sample at or near each respective location;
- generating a second set of fingerprint objects for the second audio sample, each fingerprint object occurring at a respective location within the second audio sample, the respective location being determined in dependence upon the content of the second audio sample, and each fingerprint object characterising one or more features of the second audio sample at or near each respective location;
- pairing fingerprint objects by matching a first fingerprint object from the first audio sample with a second fingerprint object from the second audio sample that is substantially similar to the first fingerprint object;
- generating, based on the pairing step, a list of pairs of matched fingerprint objects;
- determining a relative value for each pair of matched fingerprint objects;
- generating a histogram of the relative values; and
- searching for a statistically significant peak in the histogram, the peak characterizing the relationship between the first and second audio samples.
- detecting if the relative pitch and a reciprocal of the relative playback speed are substantially different, in which case the relationship between the first and second audio samples is characterized as nonlinear.
- for each pair of matched fingerprint objects in the list, determining a compensated relative time offset value, t-R*t, where t and t are locations in time with respect to the first and second fingerprint objects;
- generating a second histogram of the compensated relative time offset values; and
- searching for a statistically significant peak in the second histogram of the compensated relative time offset values, the peak further characterizing the relationship between the first and second audio samples.
- 17. A computer program product for performing a method according to any preceding • claim.





View assignee updates
analyzing 100 million+ documents to uncover your network...