Audio ML

Researchers in Machine Learning and Signal processing for Audio, Speech, Music, and Language Processing

Created by

Jonathan Le Roux

@jonathanleroux.bsky.social

View in Bluesky

ethan manilow

@ethanmanilow.bsky.social

universal musical approximator. research scientist at gorgle derpmind, magenta team. https://ethman.github.io

Shoichi Koyama

@sh01k.bsky.social

Researcher in Audio Signal Processing and Machine Learning ))))

@ehabets.bsky.social

@inakodrasi.bsky.social

Arijit Biswas

@pa9501460.bsky.social

R&D expert in Speech & Audio Processing and Coding (Compression) | Researcher @Dolby utilizing Deep Learning http://linkedin.com/in/arijitbiswas

@ritheshkumar.bsky.social

Researcher in audio and speech generative models (SampleRNN, MelGAN, DAC, …) Research Scientist @AdobeResearch. Ex @DescriptApp, @Mila_Quebec https://ritheshkumar.com

Michele Mancusi

@michelemancusi.bsky.social

PhD, Senior Research Scientist @Sony Former @Microsoft, @Musixmatch Working on #DeepLearning, #SignalProcessing, #GenerativeModels.

Emilian Postolache

@emilianpos.bsky.social

Senior AI Research Scientist @irisaudiotech | PhD in CS @SapienzaRoma | Former @CaFoscari, @SonyCSL, @Dolby and @c4dm

@gerkmann.bsky.social

@psmaragdis.bsky.social

IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA) WASPAA 2025 will be held Oct. 12-15, 2025 at Granlibakken Tahoe, Tahoe City, CA, USA. Abstract deadline: April 23, 2025 (23:59 AOE) Paper deadline: April 30, 2025 (23:59

Marco Comunità

@mcomunita.bsky.social

PhD researcher in AI & Music at C4DM | QMUL. Previously: Sony CSL Paris, Sony Tokyo, AXD Imperial College, Blackstar Amps

Salah Zaiem

@salahzaiem.bsky.social

Research Scientist at Google Deepmind working on audio/speech generation.

Hideki Kawahara

@hidekikawahara.bsky.social

Auditory signal processing researcher.

Tuomas Virtanen

@tuomasvirtanen.bsky.social

@yusufziyaisik.bsky.social

@drjohnhershey.bsky.social

Kyutai

@kyutai-labs.bsky.social

https://kyutai.org/ Open-Science AI Research Lab based in Paris

Soham Deshmukh

@soham97.bsky.social

PhD candidate at Carnegie Mellon University Senior Applied Scientist at Microsoft 🌐 https://soham97.github.io 🐙 https://github.com/soham97 🎓 https://scholar.google.com/citations?user=MasiEogAAAAJ&hl=en

Julien Hauret

@jhauret.bsky.social

PhD Student - Deep Learning & Speech Processing @LeCnam GitHub: github.com/jhauret

András Barják

@forevian.bsky.social

Multi-instrumentalist musician, sound engineer, audio software dev, ML engineer

Anugunj Naman

@tanked-bozo.bsky.social

I watch football and I code sometimes! https://anugunjnaman.github.io

John Martinsson

@johnmartinsson.org

I study machine listening methods for bioacoustics and automated sensing of natural environments. And I enjoy natural environments. https://johnmartinsson.org/ Core member of Climate AI Nordics | ML researcher at RISE

Titouan "SpeechBrain" Parcollet

@tparcollet.bsky.social

Research Scientist at the Samsung AI Center in Cambridge. Ex. Assoc. Prof. — Powering the best open-source speech toolkit.

Aditya Arie Nugraha

@aanugraha.bsky.social

Research Scientist @ Sound Scene Understanding Team, RIKEN-AIP, Japan

Changhong Wang

@changhongwang.bsky.social

Postdoc researcher @telecomparis. Previously @CNRS/LS2N @c4dm. Machine learning for audio. https://changhongw.github.io/

Junghyun (Tony) Koo

@tonykoo.bsky.social

Research Scientist @SonyAI PhD from Seoul National University Previous intern @MERL, @Sony, and @Supertone

julia wilkins 👩🏽‍💻🎵

@juliawilkins.bsky.social

Now: Audio & Multimodal ML PhD in the Music and Audio Research Lab @ NYU Prev: Data Developer at Sonos and Northwestern, Research Intern at Adobe + Bosch Research

Johannes Imort

@joimort.bsky.social

Machine Learning for Music/Speech | Senior Research Engineer at Native Instruments/iZotope | Previously Intern @Microsoft, @Sony, @AudioshakeAI

UCSD-MUSAIC

@ucsd-musaic.bsky.social

We're the Music Understanding Synthesis and AI Creativity Group at UCSD! (PIs: Julian McAuley, Taylor Berg-Kirkpatrick, Shlomo Dubnov) https://ucsd-musaic.github.io

@topel.bsky.social

Samuele Cornell

@popcornell.bsky.social

William Chen

@wanchichen.bsky.social

PhD Student @ltiatcmu.bsky.social I work in speech processing. wanchichen.github.io

Malcolm Slaney

@malcolm.slaney.org

California based auditory researcher and sports photographer.

Karn Watcharasupat

@kwatcharasupat.bsky.social

unsound lab cat @ georgia tech

@danstowell.bsky.social

Xiaoyu Bie

@xiaoyubie.bsky.social

Postdoc Researcher @Télécom Paris, Institut Polytechnique de Paris Prev PhD @INRIA Prev intern @Meta @Baidu I work on generative models and audio applications https://xiaoyubie1994.github.io/

Ashvala Vinay

@ashva.la

Current: Co-founder, NoneType (https://nonety.pe). Prev: PhD and Masters (music tech) @ Georgia Tech, Bachelors (EPD) @ Berklee, Keyo and LiveAds. Knower of guitars.

Hugo Malard

@hugomlrd.bsky.social

PhD student in multimodal learning for audio understanding at telecom-paris

@geoffroypeeters.bsky.social

Gautham Mysore

@gauthamjmysore.bsky.social

Head of Audio and Video AI Research at Adobe Research

Josh McDermott

@joshhmcdermott.bsky.social

Working to understand how humans and machines hear. Prof at MIT; director of Lab for Computational Audition. https://mcdermottlab.mit.edu/

Mathieu Fontaine

@mathfontaine.bsky.social

Associate professor at Télécom Paris in machine listening and audio applied to extended reality

Constantinos Dimitriou

@cnstntns.bsky.social

audio & ml research at antarestech.com. previously audioshake.ai, gracenote.com. also interested in photography, bicycles, and beer.

Daniele Giacobello

@dgiacobello.bsky.social

Milanese-Californian Digital Speech and Audio Processing Technologist @ Apple

Speech and Audio in the Northeast (SANE)

@saneworkshop.org

Official account for the SANE series of workshops. The one-day events annually gather researchers and students in speech and audio from the Northeast of the American continent, alternately in Boston and NYC. 🌐 saneworkshop.org

Zhongweiyang Xu

@zhongweiyangxu.bsky.social

I’m a PhD student in University of Illinois Urbana-Champaign working on audio inverse problems. My website: https://xzwy.github.io/alanweiyang.github.io/

Vivek Kumar

@v1vekkumar.bsky.social

Senior Manager, Foundational Research , @GoogleDeepMind Googler, Ex @Dolby & @Broadcom Talks and Investments 👉🏽 http://portfolio.v1vek.com

Shinji Watanabe

@shinjiw.bsky.social

I'm working at CMU (2021-). I was working at NTT (2001-2011), MERL (2012-2017), and JHU (2017-2020). Speech and Audio Processing is my main research topic.

DJ 🐳

@djain.bsky.social

HCI Assistant Professor at UMich researching accessibility, audio AI, sound interaction, XR, and health. Director, Soundability Lab. Previously, Google, Apple, Microsoft, UW, and MIT Media Lab. https://dhruv-jain.com

Minje Kim

@minjekim.bsky.social

Audio and AI researcher. Faculty in Siebel School at UIUC and Visiting Academic at Amazon Lab126. A working dad. Some obsolete hobbies: music, photography, drawing, and writing. Still active interests: cooking. 🏠 https://minjekim.com

Saurjya Sarkar

@dinosaurjya.bsky.social

Ph.D. in Artificial Intelligence and Music, C4DM https://saurjya.github.io/

Haven Kim

@havenkim.bsky.social

1st-year CS PhD student at UCSD I work on music and ML. havenpersona.github.io

Carl Thomé

@carlthome.bsky.social

Music machine learning, MIR, ML, DSP

Faro Stöter

@faroit.bsky.social

AudioML research scientist at https://audioshake.ai, before: post-doc @inria@social.numerique.gouv.fr, Editor at https://bsky.app/profile/joss-openjournals.bsky.social All in 17.68% of grey, located in Frankfurt (Germany)

Matthias Mauch

@matthiasmauch.bsky.social

I lead music ML research for Music. Flexitalian.

Yoshiaki Bando

@yoshipon0520.bsky.social

Hao-Wen (Herman) Dong 董皓文

@hermandong.bsky.social

Assistant Professor at University of Michigan | PhD from UC San Diego | Human-Centered Generative AI for Content Creation

Jordi Pons

@jordiponsdotme.bsky.social

Music, audio, and deep learning research at Stability AI ~ Building bridges between audio signal processing wisdom and deep learning. artintech.substack.com www.jordipons.me

Zhaoheng Ni

@nateanl.bsky.social

Researcher@Meta Reality Labs, working on generative models, speech enhancement, speech recognition, TTS, etc. https://nateanl.github.io/

robinsch

@fakufaku.bsky.social

Farming chili peppers for fun and hot sauce 🌶️

Albert Zeyer

@albertzeyer.bsky.social

Deep Learning, speech recognition, language modeling, https://scholar.google.com/citations?user=qrh5CBEAAAAJ&hl=en Open source, https://github.com/albertz/

hugofloresgarcía

@hugofloresgarcia.bsky.social

human computer musical instruments https://hugofloresgarcia.art/ phd candidate @northwestern research intern @adobe prev @spotify, @descript chicago // honduras

Bernardo Torres

@bernardo-torres.bsky.social

PhD Student @ Telecom Paris, ADASP team. Previously intern at Sony CSL (Music Team). AI/ML for audio and music signal processing and synthesis.

DCASE Challenge

@dcase-challenge.bsky.social

Challenge on Detection and Classification of Acoustic Scenes and Events. https://dcase.community/

INTERSPEECH 2025

@interspeech.bsky.social

Welcome to the 26th Interspeech Conference, the premier global event on spoken language processing technology, held in August 17-21, 2025, in Rotterdam, NL.

Martijn Bartelds

@mbartelds.bsky.social

Postdoctoral Scholar Stanford NLP

Kyle Kastner

@kastnerkyle.bsky.social

computers and music are (still) fun

@kzmolikova.bsky.social

Simon Leglaive

@sleglaive.bsky.social

Tenured Assistant Professor at CentraleSupélec. Signal processing and machine learning for speech and audio. sleglaive.github.io

Luca Comanducci

@lucacomanducci.bsky.social

Fixed-term researcher (RTDA) @polimi working on audio signal processing, music informatics, spatial audio and generative models (https://lucacoma.github.io/)

Francesco Paissan

@fpaissan.bsky.social

research in ML at MERL and Mila francescopaissan.it

@gtzan.bsky.social

@yoshiki-masuyama.bsky.social

Lancelot

@lancelotblanchard.bsky.social

Musician, Engineer, AI Researcher - @mitofficial.bsky.social @medialab.bsky.social

Zachary Novack

@zacknovack.bsky.social

Efficient+Controllable Audio Generation @ UCSD | Interning Stability AI, Adobe | Teaching drums @ POW Percussion

SeungHeon Doh

@seungheon-doh.bsky.social

research on llm + music (https://seungheondoh.github.io/). PhD Candidate @ Music and Audio Computing Lab, KAIST. Previously an intern @Adobe, @BytedanceTalk, @Naver, @Chartmetric.

Fernando Espinosa Iñiguez

@neuralvocoder.bsky.social

Audio ML Research @ Auto-Tune 🎤🎵 Prev: Audio+NLP for Startups, Auditory Neuro, Applied Math Love to talk Cognitive Science, Linguistics, Bio-inspired Learning, Topological Signal Processing & TDA

Justin Salamon

@justinsalamon.bsky.social

Head of Sound Design AI Research at Adobe. Machine learning and signal processing for audio & video. Musician. He/him. www.justinsalamon.com

Hao Tang

@larryniven4.bsky.social

Lecturer at the University of Edinburgh. Member of Centre of Speech Technology Research (CSTR).

Oriol (Uri) Nieto

@urinieto.bsky.social

Researcher at Adobe Research. Machine learning on audio. General Chair of ISMIR24. Screamer. Oaklander born in Barcelona. Titan. He/they 🌈 www.urinieto.com

Julius Richter

@julius-richter.bsky.social

Postdoctoral researcher at Meta

Joan Serrà

@serrjoa.bsky.social

Does research on machine learning at Sony AI, Barcelona. Works on audio analysis, synthesis, and retrieval. Likes tennis, music, and wine. https://serrjoa.github.io/

Julian Lenz

@jlenzyy.bsky.social

Audio AI research engineer w/ Lemonaide. prev. Neutone, Okio. MSc in Audio Computation at UPF. I also fly planes and play the cello sometimes!

Antoine Deleforge

@adeleforge.bsky.social

Research scientist at #Inria. Audio signal processing, Acoustics, Machine Learning, Bicycle Riding, Lindy Hop Dancing.

@naoyukikandaslp.bsky.social

@keunwoochoi.bsky.social

AI researcher in music, audio, LLMs.

Andrew Owens

@andrewowens.bsky.social

Assistant professor @ UMich EECS

Romain Serizel

@rserizel.bsky.social

Associate professor at Université de Lorraine. Doing research is speech and audio processing.