A spectrogram-based audio fingerprinting system for content-based copy detection

Research output: Contribution to journalJournal Articlepeer-review

8 Citations (Scopus)

Abstract

This paper presents a novel audio fingerprinting method that is highly robust to a variety of audio distortions. It is based on an unconventional audio fingerprint generation scheme. The robustness is achieved by generating different versions of the spectrogram matrix of the audio signal by using a threshold based on the average of the spectral values to prune this matrix. We transform each version of this pruned spectrogram matrix into a 2-D binary image. Multiple versions of these 2-D images suppress noise to a varying degree. This varying degree of noise suppression improves likelihood of one of the images matching a reference image. To speed up matching, we convert each image into an n-dimensional vector, and perform a nearest neighbor search based on this n-dimensional vector. We give results with two different feature parameters and their combination. We test this method on TRECVID 2010 content-based copy detection evaluation dataset, and we validate the performance on TRECVID 2009 dataset also. Experimental results show the effectiveness of these features even when the audio is distorted. We compare the proposed method to two state-of-the-art audio copy detection systems, namely NN-based and Shazam systems. Our method by far outperforms Shazam system for all audio transformations (or distortions) in terms of detection performance, number of missed queries and localization accuracy. Compared to NN-based system, our approach reduces minimal Normalized Detection Cost Rate (min NDCR) by 23 % and improves localization accuracy by 24 %.

Original languageEnglish
Pages (from-to)9145-9165
Number of pages21
JournalMultimedia Tools and Applications
Volume75
Issue number15
DOIs
Publication statusPublished - 1 Aug 2016

!!!Keywords

  • Audio fingerprints
  • Content-based copy detection
  • Feature parameters
  • Spectrogram
  • TRECVID

Fingerprint

Dive into the research topics of 'A spectrogram-based audio fingerprinting system for content-based copy detection'. These topics are generated from the title and abstract of the publication. Together, they form a unique fingerprint.

Cite this