RESEARCH [T - Z]

Scroll down to find direct links to publicly posted research papers, presentations, and related material on sound source separation and related topics. Entries are listed alphabetically by title for ease of browsing, and the links point to material already available on the web. For papers available only for a fee, I have provided permalinks. Titles without a link can be found through a web search. Please use the CONTACT page to report corrections or dead links, or to suggest additional pertinent links. Thanks!

 

TACKLING THE COCKTAIL FORK PROBLEM FOR SEPARATION AND TRANSCRIPTION OF REAL-WORLD SOUNDTRACKS [PDF]

Darius Petermann, Department of Intelligent Systems Engineering, Indiana University, Bloomington, Indiana, Gordon Wichern, Mitsubishi Electric Research Laboratories (MERL), Cambridge, Massachusetts, Aswin Shanmugam Subramanian, Mitsubishi Electric Research Laboratories (MERL), Cambridge, Massachusetts, Zhong-Qiu Wang, Language Technologies Institute, Carnegie Mellon University, Pittsburgh, Pennsylvania and Jonathan Le Roux, Mitsubishi Electric Research Laboratories (MERL), Cambridge, Massachusetts (2022)

TANDEM ALGORITHM WITH SUPERVISED CLASSIFIER FOR PITCH ESTIMATION AND VOICE SEPARATION FROM MUSIC ACCOMPANIMENTS: SURVEY [PDF]

Vikas Nichal1, Mane V.A.2, Gadhave D.D.3, 1Shivaji University, Kolhapur, ADCET Ashta, 2Anasaheb Dange College of Engg., Shivaji University, Kolhapur, 3Shivaji University, Kolhapur, ADCET Ashta (2015)

 

TECHNIQUES STATISTIQUES POUR LA SÉPARATION DE SOURCES AUDIO DANS UN MÉLANGE MONOCAPTEUR (in French) [PDF direct download]

Emmanuel Vincent, École Normale Supérieure, Université Paris XI, IRCAM (2002)

 

TELPC BASED RE-SYNTHESIS METHOD FOR ISOLATED NOTES OF POLYPHONIC INSTRUMENTAL MUSIC RECORDINGS [PDF]

Ya-Han Kuo, Wei-Chen Chang, Tien-Ming Wang, Alvin W.Y. Su, SCREAM Lab., Department of CSIE, National Cheng-Kung University Tainan, Taiwan (2013)

 

TEMPORAL ANNOTATION-BASED AUDIO SOURCE SEPARATION USING WEIGHTED NONNEGATIVE MATRIX FACTORIZATION [permalink]

Ngoc Duong, Alexey Ozerov, Louis Chevallier, Technicolor, 4th IEEE International Conference on Consumer Electronics - Berlin (ICCE-Berlin 2014), Berlin, Germany (2014)

 

TENSOR FUSION: LEARNING IN HETEROGENEOUS AND DISTRIBUTED DATA [PDF direct download]

Umut Șimșekli, Graduate Program in Computer Engineering, Boğaziçi University, Istanbul, Turkey (2015)

TENSORIZATION AND APPLICATIONS IN BLIND SOURCE SEPARATION [PDF direct download]

Otto Debals, KU Leuven Arenberg Doctoral School, Leuven, Belgium (2017)

TF-ATTENTION-NET: AN END TO END NEURAL NETWORK FOR SINGING VOICE SEPARATION [PDF]

Tingle Li* 1,2, Jiawei Chen2, Haowen Hou3, Ming Li1, 1Data Science Research Center, Duke Kunshan University, Kunshan, China, 2School of Computer Science and Technology, Tianjin Polytechnic University, Tianjin, China, 3AI Platform Department, Tencent Inc., Shenzhen, China (2019)

THE COCKTAIL PARTY PROBLEM [PDF]

Josh H. McDermott (2010)

 

THE DESAM TOOLBOX: SPECTRAL ANALYSIS OF MUSICAL AUDIO [PDF]

M. Lagrange1, R. Badeau2, B. David2, N. Bertin3, J. Echeveste2, O. Derrien4, S. Marchand5, L. Daudet6, 1 Ircam, CNRS STMS, Paris, France, 2 Institut Telecom, Telecom ParisTech, CNRS LTCI, Paris, France, 3 METISS Project, IRISA-INRIA, Rennes, France, 4 Laboratoire de Mécanique et d’Acoustique (LMA), CNRS - UPR 7051, Marseille, France, 5 LaBRI CNRS, University of Bordeaux 1, Talence, France, 6 Université Paris Diderot, Institut Langevin, CNRS UMR 7587, Paris, France <hal-00523319> (2010)

 

THE DIMENSIONS OF PERCEPTUAL QUALITY OF SOUND SOURCE SEPARATION [permalink]

Estefanía Cano, Judith Liebetrau, Fraunhofer IDMT, Germany, Derry Fitzgerald, Cork Institute of Technology, Ireland, Karlheinz Brandenburg, Fraunhofer IDMT, Germany (2018)

THE DYNAMIC REDISTRIBUTION OF SPECTRAL ENERGIES FOR UPMIXING AND RE-ANIMATION OF RECORDED AUDIO [permalink]

Christopher J. Keyes, Hong Kong Baptist University, Kowloon, Hong Kong (2012)

 

THE EXPECTED AMPLITUDE OF OVERLAPPING PARTIALS OF HARMONIC SOUNDS [permalink]

Chunghsin Yeh ; STMS Anal./Synthesis team, CNRS, Paris ; Roebel, A. (2009)

 

THE FLEXIBLE AUDIO SOURCE SEPARATION TOOLBOX VERSION 2.0 [PDF]

Yann Salaün, Emmanuel Vincent, Nancy Bertin, Nathan Souviraà-Labastie, Xabier Jaureguiberry, Dung T. Tran, and Frédéric Bimbot (2014)

 

THE GOOD VIBRATIONS PROBLEM [permalink]

Derry FitzGerald, Audio Research Group, Dublin Institute of Technology, Dublin, Ireland (2013)

 

THE LISTENING MACHINE PROJECT

Dan Ellis, LabROSA, Columbia University, New York, New York (2009)

THE MULTI-SCALE SHORT-TIME FOURIER TRANSFORM (examples)

Nicolas Juillerat, Pervasive and AI Research Group, University of Fribourg, Switzerland, Stefan Müller Arisona, Media Arts and Technology, University of California, Santa Barbara, Simon Schubiger-Banz, Computer Systems Institute, ETH Zürich, Switzerland (2008)

 

THE NORTHWESTERN UNIVERSITY SOURCE SEPARATION LIBRARY [PDF]

Ethan Manilow, Prem Seetharaman, Bryan Pardo, Northwestern University (2018)

THE NUTS AND BOLTS OF MUSIC SOURCE SEPARATION (slide show)

Stipe Kabić (2022)

THE PHASEBOOK: BUILDING COMPLEX MASKS VIA DISCRETE REPRESENTATIONS FOR SOURCE SEPARATION [PDF]

Jonathan Le Roux, Gordon Wichern, Shinji Watanabe, Andy Sarroff, John R. Hershey, Mitsubishi Electric Research Laboratories (MERL), Cambridge, MA, USA (2019)

THE ROBUSTNESS AND APPLICABILITY OF AUDIO SOURCE SEPARATION FROM SINGLE MIXTURES [PDF]

Md. Khademul Islam Molla, Keikichi Hirose, Nobuaki Minematsu, The University of Tokyo, Bunkyo-ku, Tokyo, Japan (2008)

 

THE SEPARATION SYSTEM OF THE MONOPHONIC MIXTURE BASED ON THE BLIND DECOMPOSITION METHOD (in Japanese) [PDF]

Takuya Murayama, Shuji Hashimoto (2006) 

THE SOUND OF PIXELS

Hang Zhao, Chuang Gan, Andrew Rouditchenko, Carl Vondrick, Josh McDermott, and Antonio Torralba, Massachusetts Institute of Technology (2018)

THE SOUND OF PIXELS [PDF]

Hang Zhao, Chuang Gan, Andrew Rouditchenko, Carl Vondrick, Josh McDermott, and Antonio Torralba, Massachusetts Institute of Technology (2018)

THE SPARSE DECOMPOSITION OF SOUND IN THE TIME DOMAIN USING NON-NEGATIVE QUADRATIC PROGRAMMING [PDF]

Conor Houghton, School of Mathematics, Trinity College Dublin, Dublin, Ireland (2009)

 

THE WHOLE IS GREATER THAN THE SUM OF ITS PARTS: IMPROVING DNN-BASED MUSIC SOURCE SEPARATION [PDF]

Ryosuke Sawata, Member, IEEE, Naoya Takahashi, Member, IEEE, Stefan Uhlich, Member, IEEE, Shusuke Takahashi, Member, IEEE, and Yuki Mitsufuji, Member, IEEE (2023)

THE 2018 SIGNAL SEPARATION EVALUATION CAMPAIGN [PDF]

Fabian-Robert Stöter1, Antoine Liutkus1, and Nobutaka Ito2, 1 Inria and LIRMM, University of Montpellier, France, 2 NTT Communication Science Laboratories, NTT Corporation, Japan (2018)

TIMBRE-CONSTRAINED RECURSIVE TIME-VARYING ANALYSIS FOR MUSICAL NOTE SEPARATION [PDF]

Yiju Lin, Wei-Chen Chang, Tien-Ming Wang, Alvin W.Y. Su, SCREAM Lab., Department of CSIE, National Cheng-Kung University, Tainan, Taiwan, Wei-Hsiang Liao, Analysis/Synthesis Group, IRCAM, Paris, France (2013)

 

TIME-DEPENDENT PARAMETRIC AND HARMONIC TEMPLATES IN NON-NEGATIVE MATRIX FACTORIZATION [PDF]

Romain Hennequin, Roland Badeau and Bertrand David, Institut Telecom, Telecom ParisTech, Paris, France (2010)

 

TIME-DEPENDENT PARAMETRIC AND HARMONIC TEMPLATES IN NON-NEGATIVE MATRIX FACTORIZATION (slides) [PDF]

Romain Hennequin, Roland Badeau and Bertrand David, Institut Telecom, Telecom ParisTech, Paris, France (2010)

 

TIME-DEPENDENT RECURSIVE REGULARIZATION FOR SOUND SOURCE SEPARATION [permalink]

Tien-Ming Wang ; Dept. of Comput. Sci. & Inf. Eng., Nat. Cheng-Kung Univ., Tainan, Taiwan ; Ta-Chun Chen ; Yin-Lin Chen ; Su, A.W.Y. (2012)

 

TIME-DOMAIN AUDIO SOURCE SEPARATION BASED ON WAVE-U-NET COMBINED WITH DISCRETE WAVELET TRANSFORM [PDF]

Tomohiko Nakamura, Hiroshi Saruwatari, Graduate School of Information Science and Technology, The University of Tokyo, Tokyo, Japan (2020)

TIME-DOMAIN AUDIO SOURCE SEPARATION WITH NEURAL NETWORKS BASED ON MULTIRESOLUTION ANALYSIS [PDF]

Tomohiko Nakamura, Member, IEEE, Shihori Kozuka, and Hiroshi Saruwatari, Member, IEEE (2021)

TIME DOMAIN EXTRACTION OF VIBRATO FROM MONOPHONIC INSTRUMENTS [PDF]

Daniel Bendor, Undergraduate School of Electrical Engineering, University of Maryland at College Park, Mark Sandler, Department of Electronic Engineering, King’s College London (2000)

 

TIME-DOMAIN MUSIC SOURCE SEPARATION FOR CHOIRS AND ENSEMBLES [PDF]

Saurjya Sarkar, Queen Mary University of London, School of Electronic Engineering and Computer Science (2024)

TIME-FREQUENCY FILTER BANK: A SIMPLE APPROACH FOR AUDIO AND MUSIC SEPARATION [permalink]

Ning Yang, College of Automation, Northwestern Polytechnical University, China, Muhammad Usman and Xiangjian He, School of Electrical and Data Engineering, University of Technology Sydney, Australia (2017)

TIME-FREQUENCY REPRESENTATIONS FOR SINGLE-CHANNEL MUSIC SOURCE SEPARATION [permalink]

Vanessa H. Tan and Franz de Leon, Electrical and Electronics Engineering Institute, University of the Philippines, Diliman (2019)

TIME-FREQUENCY TRADE-OFFS FOR AUDIO SOURCE SEPARATION WITH BINARY MASKS [PDF]

Andrew J.R. Simpson, Centre for Vision, Speech and Signal Processing, University of Surrey, Guildford, UK (2015)

 

TOWARD DEEP DRUM SOURCE SEPARATION [PDF]

Alessandro Ilic Mezza, Riccardo Giampiccolo, Alberto Bernardini, Augusto Sarti, Dipartimento di Elettronica, Informazione e Bioingegneria Politecnico di Milano, Milan, Italy (2023)

TOWARD FINDING OPTIMAL SOURCE DICTIONARIES FOR SINGLE CHANNEL MUSIC SOURCE SEPARATION USING NONNEGATIVE MATRIX FACTORIZATION [permalink]

Bhathiya Rathnayake, K.M.K. Weerakoon, G.M.R.I. Godaliyadda, M.P.B. Ekanayake, Office of Research and Innovation Services (ORIS), Sri Lanka Technological Campus, CO, Sri Lanka (2018)

TOWARDS AUTOMATED SINGLE CHANNEL SOURCE SEPARATION USING NEURAL NETWORKS [PDF]

Arpita Gang 1, Pravesh Biyani 1, Akshay Soni 2, 1 IIIT Delhi, India, 2 Oath Research, USA (2018)

TOWARDS COMPLEX MATRIX DECOMPOSITION OF SPECTROGRAMS BASED ON THE RELATIVE PHASE OFFSETS OF HARMONIC SOUNDS [PDF]

Holger Kirchhoff ∗1, Roland Badeau†2, Simon Dixon1, 1 Centre for Digital Music, Queen Mary University of London, UK, 2 Institut Mines-Télécom, Télécom ParisTech, Paris, France (2014)

TOWARDS EFFICIENT AUDIO-VISUAL SOURCE SEPARATION AND SYNTHESIS [PDF]

Juan Felipe Montesinos, Universitat Pompeu Fabra, Departament de Tecnologies de la Informació i les Comunicacions (2023)

TOWARDS LIVE DRUM SEPARATION USING PROBABILISTIC SPECTRAL CLUSTERING BASED ON THE ITAKURA-SAITO DIVERGENCE [permalink]

Eric Battenberg1,2, Victor Huang1, David Wessel1,2, 1University of California, Berkeley, California, USA, 2Center for New Music and Audio Technologies, Berkeley, California, USA (2012)

 

TOWARDS MUSICAL INSTRUMENT SEPARATION USING MULTIPLE-CAUSE NEURAL NETWORKS [PDF]

J Klingseisen, M D Plumbley, Department of Electronic Engineering, King's College London, Strand, London, UK (2000)

 

TOWARDS REAL-TIME SINGLE-CHANNEL SINGING-VOICE SEPARATION WITH PRUNED MULTI-SCALED DENSENETS [permalink]

Markus Huber1, Günther Schindler2, Christian Schörkhuber3, Wolfgang Roth1, Franz Pernkopf1, Holger Fröning2, 1Graz University of Technology, Signal Processing and Speech Communication Lab, 2Heidelberg University, Institute of Computer Engineering, 3sonible GmbH (2020) 

TOWARDS ROBUST MUSIC SOURCE SEPARATION ON LOUD COMMERCIAL MUSIC [PDF]

Chang-Bin Jeon and Kyogu Lee, Department of Intelligence and Information, Music and Audio Research Group (MARG), Seoul National University (2022)

TOWARDS SHIFTED NMF FOR IMPROVED MONAURAL SEPARATION [permalink]

R. Jaiswal ; D. Fitzgerald ; E. Coyle ; S. Rickard (2013)

 

TOWARDS SINGLE-CHANNEL UNSUPERVISED SOURCE SEPARATION OF SPEECH MIXTURES: THE LAYERED HARMONICS/FORMANTS SEPARATION-TRACKING MODEL [PDF]

Manuel Reyes-Gomez1, Nebojsa Jojic2, Daniel P.W. Ellis1, 1 LabROSA, Department of Electrical Engineering, Columbia University, 2 Microsoft Research (2004)

 

TOWARDS SOLVING THE BOTTLENECK OF PITCH-BASED SINGING VOICE SEPARATION [permalink]

Bilei Zhu, Wei Li, Linwei Li, Fudan University, Shanghai, China (2015)

 

TOWARDS TRANSIENT RESTORATION IN SCORE-INFORMED AUDIO DECOMPOSITION [PDF]

Christian Dittmar, Meinard Müller, International Audio Laboratories Erlangen, Erlangen, Germany (2015)

 

TRACING SOUND OBJECTS IN AUDIO TEXTURES [PDF]

Monika Dörfler, Ewa Matusiak, University of Vienna, Faculty of Mathematics, NuHAG (2013)

TRANSCRIPTION AND SEPARATION OF DRUM SIGNALS FROM POLYPHONIC MUSIC [permalink]

Olivier Gillet, Associate Member, IEEE, and Gaël Richard, Senior Member, IEEE (2008)

 

TRANSCRIPTION IS ALL YOU NEED: LEARNING TO SEPARATE MUSICAL MIXTURES WITH SCORE AS SUPERVISION [PDF]

Yun-Ning Hung1,2, Gordon Wichern1, Jonathan Le Roux1, 1Mitsubishi Electric Research Laboratories (MERL), Cambridge, MA, USA, 2Center for Music Technology, Georgia Institute of Technology, Atlanta, GA, USA (2020)

TRANSCRIPTION IS ALL YOU NEED: LEARNING TO SEPARATE MUSICAL MIXTURES WITH SCORE AS SUPERVISION (poster) [PDF]

Yun-Ning Hung1,2, Gordon Wichern1, Jonathan Le Roux1, 1Mitsubishi Electric Research Laboratories (MERL), Cambridge, MA, USA, 2Center for Music Technology, Georgia Institute of Technology, Atlanta, GA, USA (2021)

TRANSFER LEARNING WITH JUKEBOX FOR MUSIC SOURCE SEPARATION [PDF]

Wadhah Zai El Amri, Oliver Tautz, Helge Ritter, Andrew Melnik, University of Bielefeld, Germany (2021)

TRANSIENT AND STEADY-STATE COMPONENT EXTRACTION USING NONLINEAR FILTERING [PDF]

Ignacio Irigaray, Universidad de la República IIE/FING, Montevideo, Uruguay, Luiz W. P. Biscainho, Universidade Federal do Rio de Janeiro DEL/Poli & PEE/COPPE, Rio de Janeiro, Brazil (2013)

 

TWO MULTIMODAL APPROACHES FOR SINGLE MICROPHONE SOURCE SEPARATION [permalink]

Farnaz Sedighin and Massoud Babaie-Zadeh, School of Electrical Engineering, Sharif University of Technology, Tehran, Iran, Bertrand Rivet and Christian Jutten, GIPSA Lab, Univ. Grenoble Alpes, Grenoble, France (2016)

 

TWO NONEXCLUSIVE NEURO-FUZZY CLASSIFIERS FOR RECOGNITION OF MUSICAL INSTRUMENTS [PDF]

G. Costantini1, F.M. Frattale Mascioli2, P.Antici1, 1Department of Electronics Engineering, University of Rome “Tor Vergata”, Roma, Italy, 2INFO-COM Department, University of Rome “La Sapienza”, Roma, Italy (1999)

 

TWO STAGE SINGLE CHANNEL AUDIO SOURCE SEPARATION USING DEEP NEURAL NETWORKS [PDF]

Emad M. Grais, Gerard Roma, Andrew J. R. Simpson, and Mark D. Plumbley, Fellow, IEEE, Centre for Vision, Speech and Signal Processing, University of Surrey, Guildford, U.K. (2017)

TWO-STEP SOUND SOURCE SEPARATION: TRAINING ON LEARNED LATENT TARGETS [PDF]

Efthymios Tzinis♮, Shrikant Venkataramani♮, Zhepei Wang♮, Cem Subakan♭, Paris Smaragdis♮,♯, ♮ University of Illinois at Urbana-Champaign, ♭ Mila–Quebec Artificial Intelligence Institute, ♯ Adobe Research (2019)

TWO-STEP SOUND SOURCE SEPARATION: TRAINING ON LEARNED LATENT TARGETS (ICASSP 2020 presentation)

Efthymios Tzinis♮, Shrikant Venkataramani♮, Zhepei Wang♮, Cem Subakan♭, Paris Smaragdis♮,♯, ♮ University of Illinois at Urbana-Champaign, ♭ Mila–Quebec Artificial Intelligence Institute, ♯ Adobe Research (2020)

UGOSA: USER-GUIDED ONE-SHOT DEEP MODEL ADAPTATION FOR MUSIC SOURCE SEPARATION (video)

Giorgia Cantisani1, Alexey Ozerov2, Slim Essid1, Gaël Richard1, 1 LTCI Telecom Paris, Institut Polytechnique de Paris, Palaiseau, France, 2 InterDigital R&D France, Cesson Sevigne, France (2021)

UNDERDETERMINED BLIND AUDIO SOURCE SEPARATION USING MODAL DECOMPOSITION [PDF]

Abdeldjalil Aïssa-El-Bey, Karim Abed-Meraim, and Yves Grenier, Départment TSI, École Nationale Supérieure des Télécommunications (ENST), Paris, France (2007)

 

UNDERDETERMINED BLIND SOURCE SEPARATION BASED ON FUZZY C-MEANS AND SEMI-NONNEGATIVE MATRIX FACTORIZATION [permalink]

Ossama. S. Alshabrawy, M. E. Ghoneim, W. A. Awad, Aboul Ella Hassanien, Math. Dept., Mansoura University, Damietta, Egypt (2012)

UNDERDETERMINED SOURCE SEPARATION USING SPEAKER SUBSPACE MODELS [PDF]

Ron J. Weiss, Columbia University (2009)

 

UNDERDETERMINED SOURCE SEPARATION USING SPEAKER SUBSPACE MODELS (slides) [PDF]

Ron J. Weiss, Columbia University (2009)

 

UNDERSTANDING BY UNMIXING

Sebastian Ewert, Nicola Montecchio, Rachel Bittner, Spotify R&D (2020)

U-NET: A SUPERVISED APPROACH FOR MONAURAL SOURCE SEPARATION [permalink]

Samiul Basir, Md. Nahid Hossain, Md. Shakhawat Hosen, Department of Computer Science and Engineering, Islamic University, Kushtia, Bangladesh, Md. Sadek Ali, Department of Computer Science and Engineering, Islamic University, Kushtia, Bangladesh, Hong Kong Centre for Cerebro-Cardiovascular Health Engineering (COCHE), Hong Kong, China, Zainab Riaz, Hong Kong Centre for Cerebro-Cardiovascular Health Engineering (COCHE), Hong Kong, China, Md. Shohidul Islam, Department of Computer Science and Engineering, Islamic University, Kushtia, Bangladesh, Hong Kong Centre for Cerebro-Cardiovascular Health Engineering (COCHE) (2024)

UNIFIED GRADIENT REWEIGHTING FOR MODEL BIASING WITH APPLICATIONS TO SOURCE SEPARATION [PDF]

Efthymios Tzinis1, Dimitrios Bralios1, 2, Paris Smaragdis1, 3, 1University of Illinois at Urbana-Champaign, 2National Technical University of Athens, 3Adobe Research (2020)

UNIFYING LOCAL AND GLOBAL METHODS FOR HARMONIC-PERCUSSIVE SOURCE SEPARATION

International Audio Laboratories Erlangen (2018)

UNISON SOURCE SEPARATION [PDF]

Fabian-Robert Stöter, Stefan Bayer, Bernd Edler, International Audio Laboratories Erlangen, Erlangen, Germany (2014)

 

UNIVERSAL SOUND SEPARATION [PDF]

Ilya Kavalerov1,2∗, Scott Wisdom1, Hakan Erdogan1, Brian Patton1, Kevin Wilson1, Jonathan Le Roux3, John R. Hershey1, 1Google Research, Cambridge Massachusetts, 2Department of Electrical and Computer Engineering, UMD, 3Mitsubishi Electric Research Laboratories (MERL), Cambridge, Massachusetts (2019) 

UNIVERSAL SOURCE SEPARATION WITH WEAKLY LABELLED DATA [PDF]

Qiuqiang Kong1, Ke Chen1, Haohe Liu2, Xingjian Du1, Taylor Berg-Kirkpatrick3, Shlomo Dubnov3, Mark D. Plumbley2, 1 ByteDance, Shanghai, China, 2 University of Surrey, Guildford, UK, 3 University of California San Diego, San Diego, USA (2023)

UNIVERSAL SPEECH MODELS FOR SPEAKER INDEPENDENT SINGLE CHANNEL SOURCE SEPARATION [PDF]

Dennis L. Sun, Department of Statistics, Stanford University, Gautham J. Mysore, Adobe Research (2013)

 

UNSUPERVISED APPROACH TO MUSIC SOURCE SEPARATION USING GENERALIZED DIRICHLET PRIOR [PDF]

Park Jung Soo, Seoul National University, Seoul, South Korea (2018)

UNSUPERVISED AUDIO SOURCE SEPARATION USING DIFFERENTIABLE PARAMETRIC SOURCE MODELS [PDF]

Kilian Schulze-Forster1, Clement S. J. Doire2, Gaël Richard1, and Roland Badeau1, 1 Laboratoire de Traitement et Communication de l’Information (LTCI), Télécom Paris, Institut Polytechnique de Paris, Palaiseau, France, 2 Sonos Inc., Paris, France (2022)

UNSUPERVISED AUDIO SOURCE SEPARATION USING GENERATIVE PRIORS [PDF]

Vivek Narayanaswamy1, Jayaraman J. Thiagarajan2, Rushil Anirudh2 and Andreas Spanias1, 1SenSIP Center, School of ECEE, Arizona State University, Tempe, Arizona, USA, 2Lawrence Livermore National Labs, Livermore, California, USA (2020)

UNSUPERVISED AUDIO SOURCE SEPARATION VIA SPECTRUM ENERGY PRESERVED WASSERSTEIN LEARNING [PDF]

Ning Zhang, Junchi Yan, IBM, China (2017)

UNSUPERVISED DEEP CLUSTERING FOR SOURCE SEPARATION: DIRECT LEARNING FROM MIXTURES USING SPATIAL INFORMATION [PDF]

Efthymios Tzinis♯, Shrikant Venkataramani♯, Paris Smaragdis♯♭, ♯University of Illinois at Urbana-Champaign, Department of Computer Science, ♭Adobe Research (2018)

UNSUPERVISED INTERPRETABLE REPRESENTATION LEARNING FOR SINGING VOICE SEPARATION [PDF]

Stylianos I. Mimilakis, Semantic Music Techn. Group, Fraunhofer-IDMT, Ilmenau, Germany, Konstantinos Drossos, Audio Research Group, Tampere University, Tampere, Finland, Gerald Schuller, Applied Media Systems Group, Technical University of Ilmenau, Ilmenau, Germany (2020)

UNSUPERVISED LEARNING FOR MONAURAL SOURCE SEPARATION USING MAXIMIZATION–MINIMIZATION ALGORITHM WITH TIME–FREQUENCY DECONVOLUTION [PDF direct download]

Wai Lok Woo 1, Bin Gao 2, Ahmed Bouridane 3, Bingo Wing-Kuen Ling 4 and Cheng Siong Chin 5, 1 School of Electrical and Electronic Engineering, Newcastle University, Newcastle upon Tyne, UK, 2 School of Automation Engineering, University of Electronic Science and Technology of China, Chengdu, China, 3 Department of Computer and Information Sciences, Northumbria University, Newcastle upon Tyne, UK, 4 Faculty of Information Engineering, Guangdong University of Technology, Guangzhou, China, 5 Faculty of Science Agriculture and Engineering, Newcastle University, Singapore, Singapore (2018)

UNSUPERVISED LEARNING METHODS FOR SOURCE SEPARATION IN MONAURAL MUSIC SIGNALS [PDF]

Tuomas Virtanen, Institute of Signal Processing, Tampere University of Technology, Tampere, Finland (2006)

 

UNSUPERVISED MONAURAL MUSIC SOURCE SEPARATION BY AVERAGE HARMONIC STRUCTURE MODELING (separation results)

Zhiyao Duan, Yungang Zhang, Changshui Zhang, Member, IEEE, Zhenwei Shi (2008)

UNSUPERVISED MUSIC SOURCE SEPARATION USING DIFFERENTIABLE PARAMETRIC SOURCE MODELS [PDF]

Kilian Schulze-Forster, Gaël Richard, Fellow, IEEE, Liam Kelley, Clement S. J. Doire, and Roland Badeau, Senior Member, IEEE (2023)

UNSUPERVISED SINGING VOICE SEPARATION BASED ON ROBUST PRINCIPAL COMPONENT ANALYSIS EXPLOITING RANK-1 CONSTRAINT [PDF]

Feng Li and Masato Akagi, Japan Advanced Institute of Science and Technology, Nomi, Ishikawa, Japan (2018)

UNSUPERVISED SINGING VOICE SEPARATION USING GAMMATONE AUDITORY FILTERBANK AND CONSTRAINT ROBUST PRINCIPAL COMPONENT ANALYSIS [PDF]

Feng Li and Masato Akagi, Japan Advanced Institute of Science and Technology, Ishikawa, Japan (2018)

UNSUPERVISED SINGLE-CHANNEL MUSIC SOURCE SEPARATION BY AVERAGE HARMONIC STRUCTURE MODELING [permalink]

Zhiyao Duan, Yungang Zhang, Changshui Zhang, Member, IEEE, Zhenwei Shi (2008)

 

UNSUPERVISED SINGLE-CHANNEL SINGING VOICE SEPARATION WITH WEIGHTED ROBUST PRINCIPAL COMPONENT ANALYSIS BASED ON GAMMATONE AUDITORY FILTERBANK AND VOCAL ACTIVITY DETECTION [PDF direct download]

Feng Li 1,2, Yujun Hu 1, and Lingling Wang 1, 1 Department of Computer Science and Technology, Anhui University of Finance and Economics, Bengbu, China, 2 School of Information Science and Technology, University of Science and Technology of China, Hefei, China (2023)

UNSUPERVISED SINGLE-CHANNEL SOURCE SEPARATION USING BAYESIAN NMF [permalink]

Onur Dikmen, A. Taylan Cemgil, Boğaziçi University, Computer Engineering Department, Istanbul, Turkey (2009)

 

UNSUPERVISED SINGLE CHANNEL SOURCE SEPARATION WITH NONNEGATIVE MATRIX FACTORIZATION [PDF]

A.M. Darsono, Shakir Saat, N.M.Z. Hashim, A.A.M. Isa, Faculty of Electronics & Computer Engineering, Universiti Teknikal Malaysia Melaka, Melaka, Malaysia (2015)

 

UNSUPERVISED SOUND SEPARATION USING MIXTURE INVARIANT TRAINING [PDF]

Scott Wisdom1, Efthymios Tzinis2, Hakan Erdogan1, Ron J. Weiss1, Kevin Wilson1, John R. Hershey1, 1 Google Research, 2 UIUC (2020)

UNSUPERVISED SOUND SEPARATION USING MIXTURES OF MIXTURES [PDF]

Scott Wisdom, Hakan Erdogan, Ron J. Weiss, Kevin Wilson, John R. Hershey, Google Research, Efthymios Tzinis, University of Illinois at Urbana-Champaign (2020)

UNSUPERVISED SOURCE SEPARATION BY STEERING PRETRAINED MUSIC MODELS [PDF]

Ethan Manilow1, Patrick O’Reilly1, Prem Seetharaman2, Bryan Pardo1, 1Northwestern University, 2Descript, Inc. (2021)

UNSUPERVISED SOURCE SEPARATION VIA BAYESIAN INFERENCE IN THE LATENT DOMAIN [PDF]

Michele Mancusi∗1, Emilian Postolache∗1, Giorgio Mariani1, Marco Fumero1, Andrea Santilli1, Luca Cosmo2,3, Emanuele Rodolà1, 1Sapienza University of Rome, 2Ca’ Foscari University of Venice, 3University of Lugano (2022)

UNSUPERVISED SOURCE SEPARATION VIA SELF-SUPERVISED TRAINING [PDF]

Ertuğ Karamatlı, Department of Computer Engineering, Boğaziçi University, İstanbul, Turkey, Serap Kırbız, Department of Electrical and Electronics Engineering, MEF University, İstanbul, Turkey (2022)

UNTANGLING PHASE AND TIME IN MONOPHONIC SOUNDS [PDF direct download]

Henning Thielemann, Institut für Informatik, Martin-Luther-Universität Halle-Wittenberg, Halle, Germany (2010)

 

UNTWIST: A NEW TOOLBOX FOR AUDIO SOURCE SEPARATION [PDF]

Gerard Roma, Emad M. Grais, Andrew J.R. Simpson, Iwona Sobieraj, Mark D. Plumbley, Centre for Vision, Speech and Signal Processing, University of Surrey, UK (2016)

UPMIXING FROM MONO - A SOURCE SEPARATION APPROACH [PDF]

Derry FitzGerald, Audio Research Group, Dublin Institute of Technology, 17th International Conference on Digital Signal Processing, Corfu, Greece (2011)

UPSAMPLING LAYERS FOR MUSIC SOURCE SEPARATION [PDF]

Jordi Pons, Joan Serrà, Santiago Pascual, Giulio Cengarle, Daniel Arteaga, Davide Scaini, Dolby Laboratories (2021)

 

USER ASSISTED SEPARATION OF REPEATING PATTERNS IN TIME AND FREQUENCY USING MAGNITUDE PROJECTIONS [PDF]

Derry FitzGerald1, Zafar Rafii2, Antoine Liutkus3, 1 School of Music, Cork Institute of Technology, Cork, Ireland, 2 Gracenote, Applied Research, Emeryville, California, USA, 3 Inria, Villers-lès-Nancy, France (2017)

USER ASSISTED SEPARATION OF REPEATING PATTERNS IN TIME AND FREQUENCY USING MAGNITUDE PROJECTIONS (poster) [PDF]

Derry FitzGerald1, Zafar Rafii2, Antoine Liutkus3, 1 School of Music, Cork Institute of Technology, Cork, Ireland, 2 Gracenote, Applied Research, Emeryville, California, USA, 3 Inria, Villers-lès-Nancy, France (2017)

USER ASSISTED SEPARATION USING TENSOR FACTORISATIONS [PDF]

Derry FitzGerald, Audio Research Group, School of Electrical Engineering Systems, Dublin Institute of Technology, Ireland (2012)

 

USER ASSISTED SOURCE SEPARATION USING NON-NEGATIVE MATRIX FACTORISATION [PDF]

Derry FitzGerald, Dublin Institute of Technology (2011)

USER GUIDED AUDIO SELECTION FROM COMPLEX SOUND MIXTURES [PDF]

Paris Smaragdis, Adobe Systems Inc. (2009)

USER GUIDED AUDIO SELECTION FROM COMPLEX SOUND MIXTURES (demo video)

Paris Smaragdis, Creative Technologies Lab, Adobe Systems Inc. (2012)

 

USER-GUIDED ONE-SHOT DEEP MODEL ADAPTATION FOR MUSIC SOURCE SEPARATION [PDF]

Giorgia Cantisani1,2∗, Alexey Ozerov2, Slim Essid1, Gaël Richard1, 1LTCI, Télécom Paris, Institut Polytechnique de Paris, Palaiseau, France (2021)

USER-GUIDED SOURCE SEPARATION, LVA/ICA 2012

J.-L. Durrieu, J.-Ph. Thiran (2012)

 

USING AI FOR MUSIC SOURCE SEPARATION [permalink]

Jasline Jie Yu Lee, Nanyang Technological University, Singapore (2021)

USING BRIEF GLIMPSES TO DECOMPOSE MIXTURES [permalink]

A. S. Bregman 

 

USING DEEP LEARNING FOR SOURCE SEPARATION (MUSIC REMOVAL)

Mohammad Nauman (2021)

USING SCORE-INFORMED CONSTRAINTS FOR NMF-BASED SOURCE SEPARATION [permalink]

Ewert, S. ; Univ. of Bonn, Bonn, Germany ; Müller, M. (2012)

 

USING SCORE-INFORMED CONSTRAINTS FOR NMF-BASED SOURCE SEPARATION (results)

Ewert, S. ; Univ. of Bonn, Bonn, Germany ; Müller, M. (2012)

 

USING TENSOR FACTORISATION MODELS TO SEPARATE DRUMS FROM POLYPHONIC MUSIC [PDF]

Derry FitzGerald, Dublin Institute of Technology, Matt Cranitch, Cork Institute of Technology, Eugene Coyle, Dublin Institute of Technology (2009)

 

VARIATIONAL BAYESIAN INFERENCE FOR SOURCE SEPARATION AND ROBUST FEATURE EXTRACTION [PDF]

Kamil Adiloğlu, HörTech gGmbH, Oldenburg, Germany, Emmanuel Vincent, Inria, Villers-lès-Nancy, France <hal-00726146v2> (2016)

VARIATIONAL BAYESIAN MODEL AVERAGING FOR AUDIO SOURCE SEPARATION [PDF]

Xabier Jaureguiberry 1∗, Emmanuel Vincent 2, Gaël Richard 1, 1 Institut Mines-Télécom, Télécom ParisTech, Paris, France, 2 Inria, Villers-lès-Nancy, France (2014)

 

VARIATIONAL INFERENCE IN NON-NEGATIVE FACTORIAL HIDDEN MARKOV MODELS FOR EFFICIENT AUDIO SOURCE SEPARATION [PDF]

Gautham J. Mysore, Advanced Technology Labs, Adobe Systems Inc., San Francisco, California, USA, Maneesh Sahani, Gatsby Computational Neuroscience Unit, University College, London, UK (2012)

 

VAT-SNET: A CONVOLUTIONAL MUSIC-SEPARATION NETWORK BASED ON VOCAL AND ACCOMPANIMENT TIME-DOMAIN FEATURES [PDF direct download]

Xiaoman Qiao 1, Min Luo 2, Fengjing Shao 1, Yi Sui 1, Xiaowei Yin 1 and Rencheng Sun 1, 1School of Computer Science and Technology, Qingdao University, China, 2Conservatory of Music, Qingdao University, China (2022)

VISION-GUIDED MUSIC SOURCE SEPARATION VIA A FINE-GRAINED CYCLE-SEPARATION NETWORK [permalink]

Ma Shuo, Yanli Ji, Xing Xu, Xiaofeng Zhu, University of Electronic Science and Technology of China, Chengdu, China (2021)

VISUAL AUDIO: AN INTERACTIVE TOOL FOR ANALYZING AND EDITING OF AUDIO IN THE SPECTROGRAM [PDF direct download]

C. G. v. d. Boogaart, R. Lienhart, Institut für Informatik, Universität Augsburg, Augsburg, Germany (2005)

 

VISUALLY GUIDED AUDIO SOURCE SEPARATION WITH META CONSISTENCY LEARNING [PDF]

Md Amirul Islam1, Seyed Shahabeddin Nabavi1, Irina Kezele1, Yang Wang2, Yuanhao Yu1, Jin Tang1, 1Huawei Noah’s Ark Lab, 2Concordia University (2024)

 

VISUALLY GUIDED AUDIO SOURCE SEPARATION WITH META CONSISTENCY LEARNING (presentation)

Md Amirul Islam1, Seyed Shahabeddin Nabavi1, Irina Kezele1, Yang Wang2, Yuanhao Yu1, Jin Tang1, 1Huawei Noah’s Ark Lab, 2Concordia University (2024)

VISUALLY GUIDED SOUND SOURCE SEPARATION USING CASCADED OPPONENT FILTER NETWORK [PDF]

Lingyu Zhu and Esa Rahtu, Tampere University, Tampere, Finland (2020)

VISUAL SCENE GRAPHS FOR AUDIO SOURCE SEPARATION [PDF]

Moitreya Chatterjee1, Jonathan Le Roux2, Narendra Ahuja1, Anoop Cherian2*, 1University of Illinois at Urbana-Champaign, Champaign, Illinois, 2Mitsubishi Electric Research Laboratories, Cambridge, Massachusetts (2021)

VOCAL ACTIVITY INFORMED SINGING VOICE SEPARATION WITH THE IKALA DATASET [PDF]

Tak-Shing Chan1, Tzu-Chun Yeh2, Zhe-Cheng Fan2, Hung-Wei Chen3, Li Su1, Yi-Hsuan Yang1, Roger Jang2, 1Research Center for Information Technology Innovation, Academia Sinica, Taiwan, 2Department of Computer Science and Information Engineering, National Taiwan University, Taiwan, 3 iKala Interactive Media Inc., Taiwan (2015)

VOCAL DETECTION IN MONAURAL MIXTURES [PDF]

Anders Elowsson, Ragnar Schön, Matts Höglund, Elias Zea, Anders Friberg, KTH, Royal Institute of Technology, Stockholm, Sweden (2014)

VOCAL EXTRACTION FROM MUSIC USING RPCA DECOMPOSITION [PDF]

Samuel Frank, Shahriar Mokhtari-Sharghi, Joaquín Ruales, Nicholas Ursa, Columbia University (2014)

VOCAL HARMONY SEPARATION USING TIME-DOMAIN NEURAL NETWORKS [PDF]

Saurjya Sarkar, Emmanouil Benetos, Mark Sandler, Centre for Digital Music, Queen Mary University of London, United Kingdom (2021)

VOCAL-INSTRUMENT SEPARATION PROGRAM [PDF]

Dahyun Chung and Thomas Downey, University of Rochester, Audio and Music Engineering, Rochester, New York (2017)

VOCAL-INSTRUMENT SEPARATION PROGRAM (poster) [PDF]

Dahyun Chung and Thomas Downey, University of Rochester, Audio and Music Engineering, Rochester, New York (2017)

VOCAL ISOLATION

Benjamin Hurd, Aaron Jennis (2020)

VOCAL MELODY EXTRACTION IN THE PRESENCE OF PITCHED ACCOMPANIMENT IN POLYPHONIC MUSIC [permalink]

Vishweshwara Rao, Preeti Rao, Department of Electrical Engineering, Indian Institute of Technology Bombay, Powai, Mumbai, India (2010)

 

VOCAL SEPARATION

librosa-gallery (2016-2017)

VOCAL SEPARATION BY CONSTRAINED NON-NEGATIVE MATRIX FACTORIZATION [PDF]

Eri Ochiai, Takanori Fujisawa, Masaaki Ikehara, EEE Dept., Keio Univ., Yokohama, Kanagawa, Japan (2015)

 

VOCAL SEPARATION FROM MONAURAL MUSIC USING ADAPTIVE AUDITORY FILTERING BASED ON KERNEL BACK-FITTING [PDF]

Jun-Yong Lee, Hye-Seung Cho, Hyoung-Gook Kim, Kwangwoon University, Seoul, Rep. of Korea (2015)

VOCAL SEPARATION FROM MONAURAL MUSIC USING TEMPORAL/SPECTRAL CONTINUITY AND SPARSITY CONSTRAINTS [permalink]

Il-Young Jeong, Music & Audio Res. Group, Seoul Nat. Univ., Seoul, South Korea, Kyogu Lee (2014)

 

VOCAL SEPARATION FROM MONAURAL MUSIC USING TEMPORAL/SPECTRAL CONTINUITY AND SPARSITY CONSTRAINTS (demos)

Il-Young Jeong, Kyogu Lee, Music and Audio Research Group, Seoul National University, Seoul, South Korea (2014)

 

VOCAL SEPARATION FROM MONAURAL MUSIC USING TEMPORAL/SPECTRAL CONTINUITY AND SPARSITY CONSTRAINTS (poster) [PDF]

Il-Young Jeong, Kyogu Lee, Music and Audio Research Group, Seoul National University, Seoul, South Korea (2015)

VOCAL SEPARATION METHOD USING WEIGHTED β-ORDER MINIMUM MEAN SQUARE ERROR ESTIMATION BASED ON KERNEL BACK-FITTING (in Korean) [PDF direct download]

Hye-Seung Cho and Hyoung-Gook Kim†, †Department of Electronics Convergence Engineering, Kwangwoon University, Seoul, Republic of Korea (2015)

VOCAL SEPARATION USING EXTENDED ROBUST PRINCIPAL COMPONENT ANALYSIS WITH SCHATTEN P/LP-NORM AND SCALE COMPRESSION [permalink]

Il-Young Jeong1 and Kyogu Lee1,2, 1Music and Audio Research Group, Graduate School of Convergence Science and Technology, Seoul National University, Seoul, Korea, 2Advanced Institutes of Convergence Technology, Suwon, Korea (2014)

 

VOCAL SEPARATION USING EXTENDED ROBUST PRINCIPAL COMPONENT ANALYSIS WITH SCHATTEN P/LP-NORM AND SCALE COMPRESSION (poster) [PDF]

Il-Young Jeong1 and Kyogu Lee1,2, 1Music and Audio Research Group, Graduate School of Convergence Science and Technology, Seoul National University, Seoul, Korea, 2Advanced Institutes of Convergence Technology, Suwon, Korea (2014)

 

VOCAL SEPARATION USING KARAOKE U-NET [permalink]

Vipul Dube, Rutwik Patel, Vrushali Sule, Ninad Mehendale, K. J. Somaiya College of Engineering (2021)

VOCAL SEPARATION USING NEAREST NEIGHBOURS AND MEDIAN FILTERING [PDF]

Derry FitzGerald, Dublin Institute of Technology, 23rd IET Irish Signals and Systems Conference, Maynooth (2012)

 

VOCAL SEPARATION USING SINGER-VOWEL PRIORS OBTAINED FROM POLYPHONIC AUDIO [PDF]

Shrikant Venkataramani1, Nagesh Nayak2, Preeti Rao1, and Rajbabu Velmurugan1, 1Department of Electrical Engineering, IIT Bombay, Mumbai, 2Sensibol Audio Technologies Pvt. Ltd. (2014)

 

VOCAL SINGING AND MUSIC SEPARATION OF MIZO FOLK SONGS [permalink]

Nikhil Das, Esther Ramdinmawii, Ajit Kumar, Sanghamitra Nath, Department of Computer Science & Engineering, Tezpur University, Napaam, Sonitpur, Assam, India (2023)

VOICE AND ACCOMPANIMENT SEPARATION IN MUSIC USING SELF-ATTENTION CONVOLUTIONAL NEURAL NETWORKS [PDF]

Yuzhou Liu1, Balaji Thoshkahna2, Ali Milani3, Trausti Kristjansson3, 1Ohio State University; 2Amazon Music, Bangalore; 3Amazon Lab126, CA (2020)

VOICE & MUSIC PATTERN EXTRACTION: A REVIEW [PDF]

Pooja Gautam1 and B S Kaushik2, 1Electronics & Telecommunication Department, RCET, Bhilai (C.G.), India, 2Electrical & Instrumentation Department, Bhilai (C.G.), India (2016)

VOICE AND STREAM: PERCEPTUAL AND COMPUTATIONAL MODELING OF VOICE SEPARATION [permalink]

Emilios Cambouropoulos, Aristotle University of Thessaloniki, Greece (2008)

VOICE/MUSIC SEPARATION RESEARCH (sound samples)

H. Deif, L. Gan, W. Wang and S. Alhashmi

VOICE SEPARATION IN POLYPHONIC MUSIC: A DATA-DRIVEN APPROACH [PDF]

Anna Jordanous, Music Informatics, University of Sussex (2008)

 

‘VOICE’ SEPARATION: THEORETICAL, PERCEPTUAL AND COMPUTATIONAL PERSPECTIVES [PDF]

Emilios Cambouropoulos, Department of Music Studies, Aristotle University of Thessaloniki, Thessaloniki, Greece (2006)

 

VQ-BASED SINGLE-CHANNEL AUDIO SEPARATION FOR MUSIC AND SPEECH MIXTURES [PDF]

Pejman Mowlaee, Abolghasem Sayadiyan, Hamid Sheikhzadeh Nadjar, Electrical Engineering Department, Amirkabir University of Technology, Tehran, Iran (2010)

 

WAVE-U-NET: A MULTI-SCALE NEURAL NETWORK FOR END-TO-END AUDIO SOURCE SEPARATION [PDF]

Daniel Stoller, Queen Mary University of London, Sebastian Ewert, Spotify, Simon Dixon, Queen Mary University of London (2018)

WEAK LABEL SUPERVISION FOR MONAURAL SOURCE SEPARATION USING NON-NEGATIVE DENOISING VARIATIONAL AUTOENCODERS [PDF]

Ertuğ Karamatlı1,2, Ali Taylan Cemgil1, Serap Kırbız3, 1 Department of Computer Engineering, Boğaziçi University, İstanbul, Turkey, 2 sahibinden.com, İstanbul, Turkey, 3 Department of Electrical and Electronics Engineering, MEF University, İstanbul, Turkey (2018)

WEAKLY INFORMED AUDIO SOURCE SEPARATION [PDF]

Kilian Schulze-Forster1, Clément Doire2, Gaël Richard1, Roland Badeau1, 1 Télécom Paris, Institut Polytechnique de Paris, France, 2 Audionamix, Paris, France (2019)

WEAKLY SUPERVISED AUDIO SOURCE SEPARATION VIA SPECTRUM ENERGY PRESERVED WASSERSTEIN LEARNING [PDF]

Ning Zhang1, Junchi Yan2, Yuchen Zhou1, 1 IBM Research – China, Beijing, P.R. China, 2 Shanghai Jiao Tong University, Shanghai, P.R. China (2018)

WEAKLY-SUPERVISED AUDIO-VISUAL SOUND SOURCE DETECTION AND SEPARATION [PDF]

Tanzila Rahman1,2 and Leonid Sigal1,2,3, 1University of British Columbia, 2Vector Institute for AI, 3Canada CIFAR AI Chair (2021)

WILDMIX DATASET AND SPECTRO-TEMPORAL TRANSFORMER MODEL FOR MONOAURAL AUDIO SOURCE SEPARATION [PDF]

Amir Zadeh†, Tianjun Ma†, Soujanya Poria∗, Louis-Philippe Morency†, † Language Technologies Institute, School of Computer Science, Carnegie Mellon University, ∗ Singapore University of Technology and Design (2019)

WHAT IS NMF (slide show)

Sayan Patra (2014)

 

WHY DOES MUSIC SOURCE SEPARATION BENEFIT FROM CACOPHONY? [PDF]

Chang-Bin Jeon1,2, Gordon Wichern1, François G. Germain1, Jonathan Le Roux1, 1Mitsubishi Electric Research Laboratories (MERL), Cambridge, MA, USA, 2Department of Intelligence and Information, Seoul National University, Seoul, South Korea (2024)

WIENER BASED SOURCE SEPARATION WITH HMM/GMM USING A SINGLE SENSOR [PDF]

Laurent Benaroya, Frédéric Bimbot, IRISA (CNRS & INRIA), METISS, Rennes Cedex, France (2003)

WIMP2: MUSIC REMIXING AND UPMIXING USING SOURCE SEPARATION (video)

Gerard Roma, 2nd AES Workshop on Intelligent Music Production at the Centre for Digital Music at Queen Mary University of London (2016)

WINDOW SELECTION FOR ACCURATE MUSIC SOURCE SEPARATION USING REPET

Shivam Sharma and V. K. Mittal, Indian Institute of Information Technology Chittoor, Sri City, A.P., India (2016)

WORKSHOP "TEACHING AI TO HEAR LIKE WE DO: PSYCHOACOUSTICS IN MACHINE LEARNING" [PDF]

Gerald Schuller, Ilmenau University of Technology, Ilmenau, Germany (2022)

ZERO-SHOT AUDIO SOURCE SEPARATION THROUGH QUERY-BASED LEARNING FROM WEAKLY-LABELED DATA [PDF]

Ke Chen1*, Xingjian Du2*, Bilei Zhu2, Zejun Ma2, Taylor Berg-Kirkpatrick1, Shlomo Dubnov1, 1 University of California San Diego, CA, USA, 2 Bytedance AI Lab, Shanghai, China (2021)
