Ircam-Centre Pompidou

Recherche

Recherche simple

Recherche avancée

Panier électronique

Votre panier ne contient aucune notice

Connexion à la base

(Identifiez-vous pour accéder aux fonctions de mise à jour. Utilisez votre login-password de courrier électronique)

Entrepôt OAI-PMH

Soumettre une requête

	Consulter la notice détaillée
	Version complète en ligne
	Version complète en ligne accessible uniquement depuis l'Ircam
	Ajouter la notice au panier
	Retirer la notice du panier

English version

(full translation not yet available)

Liste complète des articles

Consultation des notices

Vue détaillée

Catégorie de document	Contribution à un colloque ou à un congrès
Titre	Audio Identification based on spectral modeling of bark-bands energy and synchronization through onset detection
Auteur principal	Mathieu Ramona
Co-auteur	Geoffroy Peeters
Colloque / congrès	ICASSP. Prague : Mai 2011
Comité de lecture	Oui
Année	2011
Statut éditorial	Accepté - publication en cours
Résumé	In this paper, we present for the first time the fingerprint IRCAM system for audio identification in streams. The baseline system relies on a double-nested Short Time Fourier Transform. The first STFT computes the energies of a filter-bank, that are then modelled over 2 s, using a second STFT. We then present recent improvements of our system: first the inclusion of perceptual scales for amplitude and frequency (Bark bands), then the synchronization of stream and database frames using an onset detection system. The performance of these improvements is tested on a large set of real audio streams. We compare our results with the results of re-implementations of the two state-of-the-art systems of Philips and Shazam.
Equipe	Analyse et synthèse sonores
Cote	Ramona11c
Adresse de la version en ligne	http://articles.ircam.fr/textes/Ramona11c/index.pdf

© Ircam - Centre Pompidou 2005.