Deep Scattering Spectrum

Publication:4579374

DOI: 10.1109/TSP.2014.2326991; zbMATH Open: 1394.94040; arXiv: 1304.6763; MaRDI QID: Q4579374; FDO: Q4579374


Authors: Joakim Andén, Stéphane Mallat


Publication date: 22 August 2018

Published in: IEEE Transactions on Signal Processing

Abstract: A scattering transform defines a locally translation invariant representation which is stable to time-warping deformations. It extends MFCC representations by computing modulation spectrum coefficients of multiple orders, through cascades of wavelet convolutions and modulus operators. Second-order scattering coefficients characterize transient phenomena such as attacks and amplitude modulation. A frequency transposition invariant representation is obtained by applying a scattering transform along log-frequency. State-of-the-art classification results are obtained for musical genre and phone classification on the GTZAN and TIMIT databases, respectively.
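
The cascade described in the abstract (wavelet convolution, modulus, then local averaging, repeated for a second order) can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the Gaussian (Gabor-like) filters, the one-filter-per-octave spacing, the bandwidths, and the names `gabor_bank` and `scattering` are all assumptions chosen for illustration. The test signal is an amplitude-modulated tone, the kind of structure the abstract says second-order coefficients characterize.

```python
import numpy as np


def gabor_bank(n, num_filters):
    """Hypothetical band-pass filter bank, sampled in the Fourier domain.

    Assumption: Gaussian bumps centered on positive frequencies, one per
    octave, standing in for the wavelets of a real scattering transform.
    They are only approximately analytic and approximately zero-mean.
    """
    freqs = np.fft.fftfreq(n)                           # cycles per sample
    centers = 0.4 * 2.0 ** (-np.arange(num_filters))    # 0.4, 0.2, 0.1, ...
    sigmas = centers / 4.0                              # bandwidth scales with center
    bank = np.exp(-0.5 * ((freqs[None, :] - centers[:, None]) / sigmas[:, None]) ** 2)
    return centers, bank


def scattering(x, num_filters=6, lowpass_width=0.01):
    """Zeroth-, first-, and second-order scattering coefficients of a 1-D signal."""
    n = len(x)
    centers, psi_hat = gabor_bank(n, num_filters)
    # Gaussian low-pass filter that performs the local time averaging.
    phi_hat = np.exp(-0.5 * (np.fft.fftfreq(n) / lowpass_width) ** 2)

    def conv(signal, filter_hat):
        # Circular convolution computed as a product in the Fourier domain.
        return np.fft.ifft(np.fft.fft(signal) * filter_hat)

    s0 = conv(x, phi_hat).real
    s1 = []
    s2 = {}
    for i in range(num_filters):
        u1 = np.abs(conv(x, psi_hat[i]))        # wavelet convolution + modulus
        s1.append(conv(u1, phi_hat).real)       # local average: first order
        # Second order: analyze the envelope u1 with lower-frequency filters only.
        for j in range(i + 1, num_filters):
            u2 = np.abs(conv(u1, psi_hat[j]))
            s2[(i, j)] = conv(u2, phi_hat).real
    return centers, s0, np.array(s1), s2


# Amplitude-modulated tone: carrier at 0.2 cycles/sample, modulation at 0.025.
n = 1000
t = np.arange(n)
x = (1 + 0.5 * np.cos(2 * np.pi * 0.025 * t)) * np.cos(2 * np.pi * 0.2 * t)
centers, s0, s1, s2 = scattering(x)

# Index of the first-order band with the most energy, and of the strongest
# second-order path that starts from that band.
carrier_band = int(np.argmax(s1.mean(axis=1)))
modulation_path = max((k for k in s2 if k[0] == carrier_band),
                      key=lambda k: s2[k].mean())
print(centers[carrier_band], centers[modulation_path[1]])
```

In this sketch the first-order coefficients locate the carrier, while the averaging discards most of the modulation; the second-order path recovers the modulation frequency from the envelope of the carrier band. A maintained implementation would use proper Morlet wavelets, several filters per octave, and subsampling.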


Full work available at URL: https://arxiv.org/abs/1304.6763







Cited In (19)





