Ricard Marxer

I'm a Research Fellow in Speech Technology at the University of Sheffield working on unsupervised and self-supervised learning approaches for speech and audio. I previously did research at Universitat Pompeu Fabra and University of Toulon working with various groups on music information retrieval, bioacoustics and speech processing. My work focuses on developing novel machine learning methods for processing speech and audio signals. I'm particularly interested in self-supervised representation learning and how we can extract meaningful features from raw audio without relying on labeled data. Some of my recent projects involve studying the scaling properties of speech language models, improving speaker diarization through joint optimization with speech separation, and developing models for predicting speech intelligibility.

I collaborate extensively with researchers in speech, music, and marine bioacoustics. Recent work includes developing systems for underwater audio processing and marine mammal monitoring, as well as applications of deep learning to hearing aid technology and speech enhancement. I'm also interested in the intersections between speech technology and cognitive science, studying how computational models can help us understand human speech perception.

Publications

TalTech-IRIT-LIS Speaker and Language Diarization Systems for DISPLACE 2024

TalTech-IRIT-LIS Speaker and Language Diarization Systems for DISPLACE 2024

Joonas Kalda, Tanel Alumäe, Martin Lebourdais, Hervé Bredin, Séverin Baroudi, R. Marxer

Interspeech 2024

Transfer Learning from Whisper for Microscopic Intelligibility Prediction

Transfer Learning from Whisper for Microscopic Intelligibility Prediction

Paul Best, Santiago Cuervo, R. Marxer

Interspeech 2024

Scaling Properties of Speech Language Models

Scaling Properties of Speech Language Models

Santiago Cuervo, R. Marxer

Conference on Empirical Methods in Natural Language Processing 2024

PixIT: Joint Training of Speaker Diarization and Speech Separation from Real-world Multi-speaker Recordings

PixIT: Joint Training of Speaker Diarization and Speech Separation from Real-world Multi-speaker Recordings

Joonas Kalda, Clément Pagés, R. Marxer, Tanel Alumäe, Hervé Bredin

The Speaker and Language Recognition Workshop 2024

Speech Foundation Models on Intelligibility Prediction for Hearing-Impaired Listeners

Speech Foundation Models on Intelligibility Prediction for Hearing-Impaired Listeners

Santiago Cuervo, R. Marxer

IEEE International Conference on Acoustics, Speech, and Signal Processing 2024

Vocal interactivity in-and-between humans, animals and robots

M. Chetouani, E. Briefer, Angela Dassow, R. Marxer, Roger K. Moore, Nicolas Obin, D. Stowell

Interaction Studies 2023

Progress and Prospects for Spoken Language Technology: Results from Five Sexennial Surveys

Progress and Prospects for Spoken Language Technology: Results from Five Sexennial Surveys

Roger K. Moore, R. Marxer

Interspeech 2023

On the Benefits of Self-supervised Learned Speech Representations for Predicting Human Phonetic Misperceptions

On the Benefits of Self-supervised Learned Speech Representations for Predicting Human Phonetic Misperceptions

Santiago Cuervo, R. Marxer

Interspeech 2023

1st Year of running MIR at UJI

1st Year of running MIR at UJI

P. J. Sanz, R. Marín, Salvador López-Barajas, A. Solis, R. Marxer, Vincent Hugel

Oceans 2023

Eiffel Tower: A deep-sea underwater dataset for long-term visual localization

Eiffel Tower: A deep-sea underwater dataset for long-term visual localization

Clémentin Boittiaux, C. Dune, Maxime Ferrera, A. Arnaubec, R. Marxer, M. Matabos, Loïc Van Audenhaege, Vincent Hugel

Int. J. Robotics Res. 2023

Deep audio embeddings for vocalisation clustering

Paul Best, R. Marxer, Sébastien Paris, H. Glotin

bioRxiv 2023

SUCRe: Leveraging Scene Structure for Underwater Color Restoration

SUCRe: Leveraging Scene Structure for Underwater Color Restoration

Clémentin Boittiaux, R. Marxer, C. Dune, A. Arnaubec, Maxime Ferrera, Vincent Hugel

International Conference on 3D Vision 2022

Author Correction: Temporal evolution of the Mediterranean fin whale song

Paul Best, R. Marxer, Sébastien Paris, H. Glotin

Scientific Reports 2022

Blind Speech Separation Through Direction of Arrival Estimation Using Deep Neural Networks with a Flexibility on the Number of Speakers

Blind Speech Separation Through Direction of Arrival Estimation Using Deep Neural Networks with a Flexibility on the Number of Speakers

Mohammed Hafsati, Kamil Bentounes, R. Marxer

IEEE International Workshop on Multimedia Signal Processing 2022

Temporal evolution of the Mediterranean fin whale song

Paul Best, R. Marxer, Sébastien Paris, H. Glotin

Scientific Reports 2022

Variable-rate hierarchical CPC leads to acoustic unit discovery in speech

Variable-rate hierarchical CPC leads to acoustic unit discovery in speech

Santiago Cuervo, Adrian La'ncucki, R. Marxer, Paweł Rychlikowski, J. Chorowski

Neural Information Processing Systems 2022

Homography-Based Loss Function for Camera Pose Regression

Homography-Based Loss Function for Camera Pose Regression

Clémentin Boittiaux, R. Marxer, C. Dune, A. Arnaubec, V. Hugel

IEEE Robotics and Automation Letters 2022

Contrastive Prediction Strategies for Unsupervised Segmentation and Categorization of Phonemes and Words

Contrastive Prediction Strategies for Unsupervised Segmentation and Categorization of Phonemes and Words

Santiago Cuervo, Maciej Grabias, J. Chorowski, Grzegorz Ciesielski, Adrian La'ncucki, Paweł Rychlikowski, R. Marxer

IEEE International Conference on Acoustics, Speech, and Signal Processing 2021

Marine and Maritime Intelligent Robotics (MIR)

Marine and Maritime Intelligent Robotics (MIR)

R. Marxer, V. Hugel, Kalliopi Pediaditi Prud’Homme, P. Batista, José Vicente Martí Avilés, A. Pascoal, P. Sanz, I. Schjølberg

Oceans 2021

Information Retrieval for ZeroSpeech 2021: The Submission by University of Wroclaw

Information Retrieval for ZeroSpeech 2021: The Submission by University of Wroclaw

J. Chorowski, Grzegorz Ciesielski, Jaroslaw Dzikowski, Adrian La'ncucki, R. Marxer, Mateusz Opala, P. Pusz, Paweł Rychlikowski, Michal Stypulkowski

Interspeech 2021

Aligned Contrastive Predictive Coding

Aligned Contrastive Predictive Coding

J. Chorowski, Grzegorz Ciesielski, Jaroslaw Dzikowski, A. Lancucki, R. Marxer, Mateusz Opala, P. Pusz, Paweł Rychlikowski, Michal Stypulkowski

Interspeech 2021

Voice Restoration with Silent Speech Interfaces (ReSSInt)

Voice Restoration with Silent Speech Interfaces (ReSSInt)

I. Hernáez, José Andrés González López, E. Navas, J. L. Pérez-Córdoba, I. Saratxaga, Gonzalo Olivares, Jon Sánchez de la Fuente, A. Galdón, Víctor García Romillo, Míriam González-Atienza, T. Schultz, P. Green, Michael Wand, R. Marxer, Lorenz Diener

IberSPEECH Conference 2021

Stereo to five-channels bombyx sonobuoys: from four years cetacean monitoring to real-time whale-ship anti-collision system

Paul Best, Sebastián Marzetti, Marion Poupard, Maxence Ferrari, Sébastien Paris, R. Marxer, O. Philippe, V. Gies, V. Barchasz, H. Glotin

The “ScribbleLens” Dutch Historical Handwriting Corpus

The “ScribbleLens” Dutch Historical Handwriting Corpus

Hans J. G. A. Dolfing, J. Bellegarda, J. Chorowski, R. Marxer, Antoine Laurent

International Conference on Frontiers in Handwriting Recognition 2020

DOCC10: Open access dataset of marine mammal transient studies and end-to-end CNN classification

DOCC10: Open access dataset of marine mammal transient studies and end-to-end CNN classification

Maxence Ferrari, H. Glotin, R. Marxer, M. Asch

IEEE International Joint Conference on Neural Network 2020

Deep Learning and Domain Transfer for Orca Vocalization Detection

Deep Learning and Domain Transfer for Orca Vocalization Detection

Paul Best, Maxence Ferrari, Marion Poupard, Sébastien Paris, R. Marxer, H. Symonds, P. Spong, H. Glotin

IEEE International Joint Conference on Neural Network 2020

A Convolutional Deep Markov Model for Unsupervised Speech Representation Learning

A Convolutional Deep Markov Model for Unsupervised Speech Representation Learning

Sameer Khurana, Antoine Laurent, Wei-Ning Hsu, J. Chorowski, A. Lancucki, R. Marxer, James R. Glass

Interspeech 2020

Robust Training of Vector Quantized Bottleneck Models

Robust Training of Vector Quantized Bottleneck Models

A. Lancucki, J. Chorowski, Guillaume Sanchez, R. Marxer, Nanxin Chen, Hans J. G. A. Dolfing, Sameer Khurana, Tanel Alumäe, Antoine Laurent

IEEE International Joint Conference on Neural Network 2020

Deep Learning Classification with Noisy Labels

Deep Learning Classification with Noisy Labels

Guillaume Sanchez, V. Guis, R. Marxer, F. Bouchara

2020 IEEE International Conference on Multimedia & Expo Workshops (ICMEW) 2020

Unsupervised Neural Segmentation and Clustering for Unit Discovery in Sequential Data

Unsupervised Neural Segmentation and Clustering for Unit Discovery in Sequential Data

J. Chorowski, Nanxin Chen, R. Marxer, Hans J. G. A. Dolfing, Adrian Łańcucki, Guillaume Sanchez, Tanel Alumäe, Antoine Laurent

Wave propagation in the biosonar organ of sperm whales using a finite difference time domain method

Wave propagation in the biosonar organ of sperm whales using a finite difference time domain method

Maxence Ferrari, R. Marxer, M. Asch, H. Glotin

High-frequency Near-field Physeter macrocephalus Monitoring by Stereo-Autoencoder and 3D Model of Sonar Organ

High-frequency Near-field Physeter macrocephalus Monitoring by Stereo-Autoencoder and 3D Model of Sonar Organ

Maxence Ferrari, H. Glotin, R. Marxer, V. Barchasz, Véronique Sarano, V. Gies, M. Asch, F. Sarano

Oceans 2019

Efficient artifacts filter by density-based clustering in long term 3D whale passive acoustic monitoring with five hydrophones fixed under an Autonomous Surface Vehicle

Efficient artifacts filter by density-based clustering in long term 3D whale passive acoustic monitoring with five hydrophones fixed under an Autonomous Surface Vehicle

Maxence Ferrari, Marion Poupard, Pascale Giraudet, R. Marxer, Jean-Marc Prevot, Thierry Soriano, H. Glotin

Oceans 2019

Real-time Passive Acoustic 3D Tracking of Deep Diving Cetacean by Small Non-uniform Mobile Surface Antenna

Real-time Passive Acoustic 3D Tracking of Deep Diving Cetacean by Small Non-uniform Mobile Surface Antenna

Marion Poupard, Maxence Ferrari, Jan Schlüter, R. Marxer, Pascale Giraudet, V. Barchasz, V. Gies, G. Pavan, H. Glotin

IEEE International Conference on Acoustics, Speech, and Signal Processing 2019

Lexical frequency effects in English and Spanish word misperceptions.

M. Cooke, M. L. García Lecumberri, J. Barker, R. Marxer

Journal of the Acoustical Society of America 2019

Deep learning for ethoacoustical mapping: Application to a single Cachalot long term recording on joint observatories in Vancouver Island

H. Glotin, P. Spong, H. Symonds, Vincent Roger, Randall Balestriero, Maxence Ferrari, Marion Poupard, J. Towers, Scott Veirs, R. Marxer, Pascale Giraudet, James C Pilkinton, V. Veirs, J. Wood, J. K. Ford, Tom Dakin

Journal of the Acoustical Society of America 2018

Towards the topology of autoencoder of calls versus clicks of marine mammal

Vincent Roger, Maxence Ferrari, R. Marxer, Faicel Chamroukhi, H. Glotin

Journal of the Acoustical Society of America 2018

DNN driven Speaker Independent Audio-Visual Mask Estimation for Speech Separation

DNN driven Speaker Independent Audio-Visual Mask Estimation for Speech Separation

M. Gogate, A. Adeel, R. Marxer, J. Barker, A. Hussain

Interspeech 2018

Sperm whales ultra high frequency near field multichannel analysis

Maxence Ferrari, R. Marxer, Vincent Roger, V. Gies, F. Sarano, M. Asch, Hugues Vitry, Axel Preud' Homme, René Heuzey, Véronique Sarano, H. Glotin

A corpus of audio-visual Lombard speech with frontal and profile views.

A corpus of audio-visual Lombard speech with frontal and profile views.

Najwa Alghamdi, Steve C. Maddock, R. Marxer, J. Barker, Guy J. Brown

Journal of the Acoustical Society of America 2018

The impact of the Lombard effect on audio and visual speech recognition systems

R. Marxer, J. Barker, Najwa Alghamdi, S. Maddock

Speech Communication 2018

The CHiME Challenges: Robust Speech Recognition in Everyday Environments

J. Barker, R. Marxer, E. Vincent, Shinji Watanabe

New Era for Robust Speech Recognition, Exploiting Deep Learning 2017

An analysis of environment, microphone and data simulation mismatches in robust speech recognition

An analysis of environment, microphone and data simulation mismatches in robust speech recognition

E. Vincent, Shinji Watanabe, Aditya Arie Nugraha, J. Barker, R. Marxer

Computer Speech and Language 2017

The third 'CHiME' speech separation and recognition challenge: Analysis and outcomes

The third 'CHiME' speech separation and recognition challenge: Analysis and outcomes

J. Barker, R. Marxer, E. Vincent, Shinji Watanabe

Computer Speech and Language 2017

Binary Mask Estimation Strategies for Constrained Imputation-Based Speech Enhancement

Binary Mask Estimation Strategies for Constrained Imputation-Based Speech Enhancement

R. Marxer, J. Barker

Interspeech 2017

Multi-microphone speech recognition in everyday environments

J. Barker, R. Marxer, E. Vincent, Shinji Watanabe

Computer Speech and Language 2017

Guest Editorial for the special issue on Multi-Microphone Speech Recognition in Everyday Environments

J. Barker, R. Marxer, E. Vincent, Shinji Watanabe

A Data Driven Approach to Audiovisual Speech Mapping

Andrew Abel, R. Marxer, J. Barker, R. Watt, Bill Whitmer, Peter Derleth, A. Hussain

International Conference on Advances in Brain Inspired Cognitive Systems 2016

Vocal Interactivity in-and-between Humans, Animals, and Robots

Vocal Interactivity in-and-between Humans, Animals, and Robots

Roger K. Moore, R. Marxer, Serge Thill

Frontiers in Robotics and AI 2016

CloudCAST - Remote Speech Technology for Speech Professionals

P. Green, R. Marxer, S. Cunningham, H. Christensen, Frank Rudzicz, Maria Yancheva, André Coy, Massimiliano Malavasi, L. Desideri, F. Tamburini

Interspeech 2016

Progress and Prospects for Spoken Language Technology: Results from Four Sexennial Surveys

Progress and Prospects for Spoken Language Technology: Results from Four Sexennial Surveys

Roger K. Moore, R. Marxer

Interspeech 2016

Language Effects in Noise-Induced Word Misperceptions

Language Effects in Noise-Induced Word Misperceptions

M. L. G. Lecumberri, J. Barker, R. Marxer, M. Cooke

Interspeech 2016

An Innovative Speech-Based Interface to Control AAL and IoT Solutions to Help People with Speech and Motor Disability

Massimiliano Malavasi, E. Turri, Maria Rosaria Motolese, R. Marxer, Jochen Farwer, H. Christensen, L. Desideri, F. Tamburini, P. Green

Italian Forum on Active and Assisted Living 2016

Evaluation and combination of pitch estimation methods for melody extraction in symphonic classical music

Juan J. Bosch, R. Marxer, E. Gómez

The third ‘CHiME’ speech separation and recognition challenge: Dataset, task and baselines

The third ‘CHiME’ speech separation and recognition challenge: Dataset, task and baselines

Jon Barker, R. Marxer, Emmanuel Vincent, Shinji Watanabe

Automatic Speech Recognition & Understanding 2015

Exploiting synchrony spectra and deep neural networks for noise-robust automatic speech recognition

Exploiting synchrony spectra and deep neural networks for noise-robust automatic speech recognition

Ning Ma, R. Marxer, J. Barker, Guy J. Brown

Automatic Speech Recognition & Understanding 2015

Knowledge transfer between speakers for personalised dialogue management

Knowledge transfer between speakers for personalised dialogue management

I. Casanueva, Thomas Hain, H. Christensen, R. Marxer, P. Green

SIGDIAL Conference 2015

Remote Speech Technology for Speech Professionals - the CloudCAST initiative

Remote Speech Technology for Speech Professionals - the CloudCAST initiative

P. Green, R. Marxer, S. Cunningham, H. Christensen, Frank Rudzicz, Maria Yancheva, André Coy, Massimiliano Malavasi, L. Desideri

SLPAT@Interspeech 2015

Automatic dysfluency detection in dysarthric speech using deep belief networks

Automatic dysfluency detection in dysarthric speech using deep belief networks

Stacey Oue, R. Marxer, Frank Rudzicz

SLPAT@Interspeech 2015

Unsupervised Incremental Online Learning and Prediction of Musical Audio Signals

Unsupervised Incremental Online Learning and Prediction of Musical Audio Signals

R. Marxer, Hendrik Purwins

IEEE/ACM Transactions on Audio Speech and Language Processing 2015

Unsupervised Incremental Learning and Prediction of Music Signals

R. Marxer, Hendrik Purwins

Score-informed and timbre independent lead instrument separation in real-world scenarios

Score-informed and timbre independent lead instrument separation in real-world scenarios

Juan J. Bosch, Kazunobu Kondo, R. Marxer, J. Janer

European Signal Processing Conference 2012

Combining a harmonic-based NMF decomposition with transient analysis for instantaneous percussion separation

Combining a harmonic-based NMF decomposition with transient analysis for instantaneous percussion separation

J. Janer, R. Marxer, Keita Arimoto

IEEE International Conference on Acoustics, Speech, and Signal Processing 2012

A Tikhonov regularization method for spectrum decomposition in low latency audio source separation

A Tikhonov regularization method for spectrum decomposition in low latency audio source separation

R. Marxer, J. Janer

IEEE International Conference on Acoustics, Speech, and Signal Processing 2012

Low-Latency Instrument Separation in Polyphonic Audio Using Timbre Models

R. Marxer, J. Janer, J. Bonada

Latent Variable Analysis and Signal Separation 2012

What/when causal expectation modelling applied to audio signals

Amaury Hazan, R. Marxer, Paul Brossier, Hendrik Purwins, P. Herrera, Xavier Serra

Connection science 2009

Computational models of music perception and cognition I: the perceptual and cognitive processing chain

Computational models of music perception and cognition I: the perceptual and cognitive processing chain

Hendrik Purwins, P. Herrera, M. Grachten, Amaury Hazan, R. Marxer, Xavier Serra

Computational models of music perception and cognition II: Domain-specific music processing

Computational models of music perception and cognition II: Domain-specific music processing

Hendrik Purwins, M. Grachten, P. Herrera, Amaury Hazan, R. Marxer, Xavier Serra

Dynamical hierarchical self‐organization of harmonic and motivic musical categories

R. Marxer, Piotr Holonowicz, Amaury Hazan, Hendrik Purwins

What/when causal expectation modelling applied to percussive audio

Amaury Hazan, Paul Brossier, R. Marxer, Hendrik Purwins

Model-based language-instructed reinforcement learning

Model-based language-instructed reinforcement learning

Vedant Misra, Kevin Robinson, L. Fedus, Denny, Daphne Zhou, David Ippolito, Hyeontaek Luan, Lim, David Dohan, Shivani Agrawal, Mark Omernick, M. Dai, Thanumalayan Sankaranarayana, Pil-693 lai, Marie Pellat, Aitor Lewkowycz, Erica Moreira, Zongwei Zhou, Xuezhi Wang, Brennan Saeta, Orhan Diaz, Michele Firat, Jason Catasta, Kathy Wei, J. Devlin, Ming-Wei Chang, Kenton Lee, Danijar Hafner, T. Lillicrap, Jimmy Ba, Ian Fischer, Mohammad Norouzi, Austin W. Hanjie, Victor Zhong, Heinrich Küttler, Nantas Nardelli, Alexander Miller, Roberta Raileanu, Marco Selvatici, Edward Grefen-723, A. Lancucki, J. Chorowski, Guillaume Sanchez, R. Marxer, Nanxin Chen, Hans J. G. A. Dolf-728, Sameer Khurana, Tanel Alumäe, Karthik Narasimhan, R. Barzilay, Sherjil Ozair, Yazhe Li, Ali Razavi, Ioannis Antonoglou

Proceedings of the 3rd International Workshop on Vocal Interactivity in-and-between Humans, Animals and Robots VIHAR 2021

R. Marxer, Mohamed Chetouani, Elodie Mandel-Briefer, Angela Dassow, Roger K. Moore, Nicolas Obin, Yann Caradec, Sabrina Engesser, Julie Oswald, Paul Best, Sébastien Paris, Hervé Glotin, Kevin El Haddad, Gabriel Meunier, Chiara Mazzocconi, Abdellah Fourtassi, V. Lostanlen, Pierrick Arnaud, Marc du Gardin, Laurent Godet, Mathieu Lagrange, Vlad Demartsev, Mara Thomas, Baptiste Averly, M. Manser, Ariana Strandburg-Peshkin, Franck Malige, J. Patris, M. Hauray, Pascale Giraudet, Rébecca Kleinberger, Janelle Sands, Sareen Harpreet, Janet M. Baker, Jennifer M. Cunha, Silvia Pagliarini, Ian T. Coldren, Beáta Korcsok, T. Faragó, Bence Ferdinandy, Á. Miklósi, Péter Korondi, M. Gácsi, Philip Scales, Véronique Aubergé, O. Aycard, Kate A. Hardy, Denise Hart, M. J. Rosen, Pierre Klintefors, Simon Grendeus, Edvin Boyner, Alexander Pettersson, Eric Bolo, Muhammad Samoul, N. Seichepine, Ana Mamede, Navjeevan Dadwal, Dinesh Bhatt, Vinay Kumar

An Innovative Speech-Based User Interface for Smarthomes and IoT Solutions to Help People with Speech and Motor Disabilities

An Innovative Speech-Based User Interface for Smarthomes and IoT Solutions to Help People with Speech and Motor Disabilities

Massimiliano Malavasi, E. Turri, J. J. Atria, H. Christensen, R. Marxer, L. Desideri, André Coy, F. Tamburini, P. Green

AAATE Conf. 2017

Towards Multi-modal Hearing Aid Design and Evaluation in Realistic Audio-Visual Settings : Challenges and Opportunities

A. Hussain, J. Barker, R. Marxer, A. Adeel, W. Whitmer, R. Watt, Peter Derleth

A corpus of noise-induced word misperceptions for English.

A corpus of noise-induced word misperceptions for English.

R. Marxer, J. Barker, M. Cooke, M. L. García Lecumberri

Journal of the Acoustical Society of America 2016

Aalborg Universitet Unsupervised Learning of Structural Representation of Percussive Audio Using a Hierarchical Dirichlet Process Hidden Markov Model Antich,

Aalborg Universitet Unsupervised Learning of Structural Representation of Percussive Audio Using a Hierarchical Dirichlet Process Hidden Markov Model Antich,

J. Antich, M. Paterna, R. Marxer, Hendrik Purwins

“ Are we playing like Music-Stars ? ” Placing Emerging Artists on the Italian Music Scene

“ Are we playing like Music-Stars ? ” Placing Emerging Artists on the Italian Music Scene

M. Paterna, R. Marxer, Hendrik Purwins, J. Stevens

Vocal Interactivity in-and-between Humans, Animals and Robots (VIHAR) (Dagstuhl Seminar 16442)

R. Moore, Serge Thill, R. Marxer

Dagstuhl Reports 2016

A framework for the evaluation of microscopic intelligibility models

A framework for the evaluation of microscopic intelligibility models

R. Marxer, M. Cooke, J. Barker

Interspeech 2015

Study of regularizations and constraints in NMF-based drums monaural separation

Study of regularizations and constraints in NMF-based drums monaural separation

R. Marxer, J. Janer

MODELLING AND SEPARATION OF SINGING VOICE BREATHINESS IN POLYPHONIC MIXTURES

MODELLING AND SEPARATION OF SINGING VOICE BREATHINESS IN POLYPHONIC MIXTURES

R. Marxer, J. Janer

Separation of Unvoiced Fricatives in Singing Voice Mixtures with Semi-Supervised NMF

Separation of Unvoiced Fricatives in Singing Voice Mixtures with Semi-Supervised NMF

J. Janer, R. Marxer

Low-latency Bass Separation using Harmonic-Percussion Decomposition

Low-latency Bass Separation using Harmonic-Percussion Decomposition

R. Marxer, J. Janer

Music classification using high-level models

N. Wack, C. Laurier, O. Meyers, R. Marxer, D. Bogdanov, J. Serrà, E. Gómez, P. Herrera

CLASSIFICATION USING HIGH-LEVEL MODELS

N. Wack, C. Laurier, O. Meyers, R. Marxer, D. Bogdanov, J. Serrà, E. Gómez, P. Herrera

MUSIC TYPE GROUPERS (MTG): GENERIC MUSIC CLASSIFICATION ALGORITHMS

MUSIC TYPE GROUPERS (MTG): GENERIC MUSIC CLASSIFICATION ALGORITHMS

N. Wack, E. Guaus, C. Laurier, O. Meyers, R. Marxer, D. Bogdanov, J. Serrà, P. Herrera

RESEARCH ARTICLE What/when causal expectation modeling applied to audio signals

Amaury Hazan, R. Marxer, Paul Brossier, Hendrik Purwins, P. Herrera, Xavier Serra

An F-Measure for Evaluation of Unsupervised Clustering with Non-Determined Number of Clusters

R. Marxer, Hendrik Purwins

Attention as musical interplay of bottom-up accents and expectation

Attention as musical interplay of bottom-up accents and expectation

M. Grachten, R. Marxer, Amaury Hazan, P. Purwins

What/when causal expectation modelling in monophonic pitched and percussive audio

What/when causal expectation modelling in monophonic pitched and percussive audio

Amaury Hazan, Paul Brossier, R. Marxer, Hendrik Purwins

Neural Information Processing Systems 2007

Dynamical Hierarchical Self-Organization of Harmonic, Motivic, and Pitch Categories

Dynamical Hierarchical Self-Organization of Harmonic, Motivic, and Pitch Categories

R. Marxer, Piotr Holonowicz, Hendrik Purwins, Amaury Hazan

Neural Information Processing Systems 2007

Computational Modeling of Statistical Learning of Tone Sequences (Poster)

Amaury Hazan, P. Herrera, R. Marxer, M. Grachten, P. Purwins

A Comparative Study of Dimensionality Reduction Methods: The Case of Music Similarity

A Comparative Study of Dimensionality Reduction Methods: The Case of Music Similarity

N. Wack, P. Cano, B. D. Jong, R. Marxer