Scalable, Content-Based Audio Identification by Multiple Independent Psychoacoustic Matching

Geoff R Schmidt
Matthew K Belmonte


Journal of the Audio Engineering Society 52(4):366-377 (April 2004).


Abstract: Tuneprint is a software system for content-based identification of audio recordings. Tuneprint transforms its input using a perceptual model of the human auditory system, making its output robust to lossy compression and to other distortions. In order to make use both of the instantaneous pattern of a recording's perceptual features and the information contained in the evolution of these features over time, Tuneprint first matches fragments of the input against a database of fragments of known recordings. In a subsequent step, these matches at the fragment level are assembled in order to identify a single recording that matches consistently over time. In a small-scale test, Tuneprint has successfully matched all queries against a database of 100,000 commercially released recordings.

Profound thanks are due to our colleagues Daren Gill, Martin Stiaszny, Amittai Axelrod, Josh Pollack, Jennifer Chung, and Lex Nemzer, without whose many day and night hours the Tuneprint system could never have been implemented. In addition, we wish to acknowledge Sage Hill Partners, who funded the development of Tuneprint.


Colleagues who have been given a password may download a reprint. Other interested persons may obtain a password by entering their email address here:

CITED IN: