Audio ML
Researchers in Machine Learning and Signal processing for Audio, Speech, Music, and Language Processing
Created by
@jonathanleroux.bsky.social
@dcase-workshop.bsky.social
@ethanmanilow.bsky.social
universal musical approximator. research scientist at gorgle derpmind, magenta team. https://ethman.github.io
@pa9501460.bsky.social
R&D expert in Speech & Audio Processing and Coding (Compression) | Researcher @Dolby utilizing Deep Learning http://linkedin.com/in/arijitbiswas
@michelemancusi.bsky.social
PhD, Senior Research Scientist @Sony Former @Microsoft, @Musixmatch Working on #DeepLearning, #SignalProcessing, #GenerativeModels.
@emilianpos.bsky.social
Senior AI Research Scientist @irisaudiotech | PhD in CS @SapienzaRoma | Former @CaFoscari, @SonyCSL, @Dolby and @c4dm
@waspaa.com
IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA) WASPAA 2025 will be held Oct. 12-15, 2025 at Granlibakken Tahoe, Tahoe City, CA, USA. Abstract deadline: April 23, 2025 (23:59 AOE) Paper deadline: April 30, 2025 (23:59
@mcomunita.bsky.social
PhD researcher in AI & Music at C4DM | QMUL. Previously: Sony CSL Paris, Sony Tokyo, AXD Imperial College, Blackstar Amps
@salahzaiem.bsky.social
Research Scientist at Google Deepmind working on audio/speech generation.
@tuomasvirtanen.bsky.social
@soham97.bsky.social
PhD candidate at Carnegie Mellon University Senior Applied Scientist at Microsoft 🌐 https://soham97.github.io 🐙 https://github.com/soham97 🎓 https://scholar.google.com/citations?user=MasiEogAAAAJ&hl=en
@jhauret.bsky.social
PhD Student - Deep Learning & Speech Processing @LeCnam GitHub: github.com/jhauret
@forevian.bsky.social
Multi-instrumentalist musician, sound engineer, audio software dev, ML engineer
@tanked-bozo.bsky.social
I watch football and I code sometimes! https://anugunjnaman.github.io
@johnmartinsson.org
I study machine listening methods for bioacoustics and automated sensing of natural environments. And I enjoy natural environments. https://johnmartinsson.org/ Core member of Climate AI Nordics | ML researcher at RISE
@tparcollet.bsky.social
Research Scientist at the Samsung AI Center in Cambridge. Ex. Assoc. Prof. — Powering the best open-source speech toolkit.
@aanugraha.bsky.social
Research Scientist @ Sound Scene Understanding Team, RIKEN-AIP, Japan
@changhongwang.bsky.social
Postdoc researcher @telecomparis. Previously @CNRS/LS2N @c4dm. Machine learning for audio. https://changhongw.github.io/
@tonykoo.bsky.social
Research Scientist @SonyAI PhD from Seoul National University Previous intern @MERL, @Sony, and @Supertone
@juliawilkins.bsky.social
Now: Audio & Multimodal ML PhD in the Music and Audio Research Lab @ NYU Prev: Data Developer at Sonos and Northwestern, Research Intern at Adobe + Bosch Research
@joimort.bsky.social
Machine Learning for Music/Speech | Senior Research Engineer at Native Instruments/iZotope | Previously Intern @Microsoft, @Sony, @AudioshakeAI
@ucsd-musaic.bsky.social
We're the Music Understanding Synthesis and AI Creativity Group at UCSD! (PIs: Julian McAuley, Taylor Berg-Kirkpatrick, Shlomo Dubnov) https://ucsd-musaic.github.io
@popcornell.bsky.social
@wanchichen.bsky.social
PhD Student @ltiatcmu.bsky.social I work in speech processing. wanchichen.github.io
@xiaoyubie.bsky.social
Postdoc Researcher @Télécom Paris, Institut Polytechnique de Paris Prev PhD @INRIA Prev intern @Meta @Baidu I work on generative models and audio applications https://xiaoyubie1994.github.io/
@ashva.la
Current: Co-founder, NoneType (https://nonety.pe). Prev: PhD and Masters (music tech) @ Georgia Tech, Bachelors (EPD) @ Berklee, Keyo and LiveAds. Knower of guitars.
@hugomlrd.bsky.social
PhD student in multimodal learning for audio understanding at telecom-paris
@joshhmcdermott.bsky.social
Working to understand how humans and machines hear. Prof at MIT; director of Lab for Computational Audition. https://mcdermottlab.mit.edu/
@mathfontaine.bsky.social
Associate professor at Télécom Paris in machine listening and audio applied to extended reality
@cnstntns.bsky.social
audio & ml research at antarestech.com. previously audioshake.ai, gracenote.com. also interested in photography, bicycles, and beer.
@dgiacobello.bsky.social
Milanese-Californian Digital Speech and Audio Processing Technologist @ Apple
@saneworkshop.org
Official account for the SANE series of workshops. The one-day events annually gather researchers and students in speech and audio from the Northeast of the American continent, alternately in Boston and NYC. 🌐 saneworkshop.org
@zhongweiyangxu.bsky.social
I’m a PhD student in University of Illinois Urbana-Champaign working on audio inverse problems. My website: https://xzwy.github.io/alanweiyang.github.io/
@v1vekkumar.bsky.social
Senior Manager, Foundational Research , @GoogleDeepMind Googler, Ex @Dolby & @Broadcom Talks and Investments 👉🏽 http://portfolio.v1vek.com
@shinjiw.bsky.social
I'm working at CMU (2021-). I was working at NTT (2001-2011), MERL (2012-2017), and JHU (2017-2020). Speech and Audio Processing is my main research topic.
@djain.bsky.social
HCI Assistant Professor at UMich researching accessibility, audio AI, sound interaction, XR, and health. Director, Soundability Lab. Previously, Google, Apple, Microsoft, UW, and MIT Media Lab. https://dhruv-jain.com
@minjekim.bsky.social
Audio and AI researcher. Faculty in Siebel School at UIUC and Visiting Academic at Amazon Lab126. A working dad. Some obsolete hobbies: music, photography, drawing, and writing. Still active interests: cooking. 🏠 https://minjekim.com
@dinosaurjya.bsky.social
Ph.D. in Artificial Intelligence and Music, C4DM https://saurjya.github.io/
@havenkim.bsky.social
1st-year CS PhD student at UCSD I work on music and ML. havenpersona.github.io
@faroit.bsky.social
AudioML research scientist at https://audioshake.ai, before: post-doc @inria@social.numerique.gouv.fr, Editor at https://bsky.app/profile/joss-openjournals.bsky.social All in 17.68% of grey, located in Frankfurt (Germany)
@yoshipon0520.bsky.social
@hermandong.bsky.social
Assistant Professor at University of Michigan | PhD from UC San Diego | Human-Centered Generative AI for Content Creation
@jordiponsdotme.bsky.social
Music, audio, and deep learning research at Stability AI ~ Building bridges between audio signal processing wisdom and deep learning. artintech.substack.com www.jordipons.me
@nateanl.bsky.social
Researcher@Meta Reality Labs, working on generative models, speech enhancement, speech recognition, TTS, etc. https://nateanl.github.io/
@albertzeyer.bsky.social
Deep Learning, speech recognition, language modeling, https://scholar.google.com/citations?user=qrh5CBEAAAAJ&hl=en Open source, https://github.com/albertz/
@hugofloresgarcia.bsky.social
human computer musical instruments https://hugofloresgarcia.art/ phd candidate @northwestern research intern @adobe prev @spotify, @descript chicago // honduras
@bernardo-torres.bsky.social
PhD Student @ Telecom Paris, ADASP team. Previously intern at Sony CSL (Music Team). AI/ML for audio and music signal processing and synthesis.
@dcase-challenge.bsky.social
Challenge on Detection and Classification of Acoustic Scenes and Events. https://dcase.community/
@interspeech.bsky.social
Welcome to the 26th Interspeech Conference, the premier global event on spoken language processing technology, held in August 17-21, 2025, in Rotterdam, NL.
@sleglaive.bsky.social
Tenured Assistant Professor at CentraleSupélec. Signal processing and machine learning for speech and audio. sleglaive.github.io
@lucacomanducci.bsky.social
Fixed-term researcher (RTDA) @polimi working on audio signal processing, music informatics, spatial audio and generative models (https://lucacoma.github.io/)
@lancelotblanchard.bsky.social
Musician, Engineer, AI Researcher - @mitofficial.bsky.social @medialab.bsky.social
@zacknovack.bsky.social
Efficient+Controllable Audio Generation @ UCSD | Interning Stability AI, Adobe | Teaching drums @ POW Percussion
@seungheon-doh.bsky.social
research on llm + music (https://seungheondoh.github.io/). PhD Candidate @ Music and Audio Computing Lab, KAIST. Previously an intern @Adobe, @BytedanceTalk, @Naver, @Chartmetric.
@neuralvocoder.bsky.social
Audio ML Research @ Auto-Tune 🎤🎵 Prev: Audio+NLP for Startups, Auditory Neuro, Applied Math Love to talk Cognitive Science, Linguistics, Bio-inspired Learning, Topological Signal Processing & TDA
@justinsalamon.bsky.social
Head of Sound Design AI Research at Adobe. Machine learning and signal processing for audio & video. Musician. He/him. www.justinsalamon.com
@larryniven4.bsky.social
Lecturer at the University of Edinburgh. Member of Centre of Speech Technology Research (CSTR).
@urinieto.bsky.social
Researcher at Adobe Research. Machine learning on audio. General Chair of ISMIR24. Screamer. Oaklander born in Barcelona. Titan. He/they 🌈 www.urinieto.com
@serrjoa.bsky.social
Does research on machine learning at Sony AI, Barcelona. Works on audio analysis, synthesis, and retrieval. Likes tennis, music, and wine. https://serrjoa.github.io/
@jlenzyy.bsky.social
Audio AI research engineer w/ Lemonaide. prev. Neutone, Okio. MSc in Audio Computation at UPF. I also fly planes and play the cello sometimes!
@adeleforge.bsky.social
Research scientist at #Inria. Audio signal processing, Acoustics, Machine Learning, Bicycle Riding, Lindy Hop Dancing.
@francoisgrondin.bsky.social
Assistant professor at USherbrooke. Creator of the ODAS framework. Research in speech, multichannel audio processing, robot audition, embedded AI. francoisgrondin.com
@keshet.bsky.social
Speech, language, and deep learning at the Technion. But also psychology, philosophy, and history. And Jazz improv.
@maureendeseyssel.bsky.social
machine learning researcher @Apple | PhD from @CoML_ENS | speech, ml and cognition.
@catlai.bsky.social
Lecturer in speech and language technology, CSTR, University of Edinburgh. https://homepages.inf.ed.ac.uk/clai/
@daanvanesch.nl
I work on speech and language technologies at Google. I like languages, history, maps, traveling, cycling, and buying way too many books.
@begus.bsky.social
Assoc. Professor at UC Berkeley Artificial and biological intelligence and language Linguistics Lead at Project CETI 🐳 PI Berkeley SC Lab 🗣️ College Principal of Bowles Hall 🏰 https://www.gasperbegus.com
@grzegorz.chrupala.me
Speech • Language • Learning https://grzegorz.chrupala.me @ Tilburg University
@emmanouilb.bsky.social
Reader in Machine Listening, @qmuleecs.bsky.social Queen Mary University of London - research on machine listening / audio analysis. Website: https://www.eecs.qmul.ac.uk/~emmanouilb/
@rdesh26.bsky.social
Research Scientist @ Meta GenAI in NYC. Working on audio/speech for LLaMA. Previously: PhD @ JHU CLSP desh2608.github.io
@rserizel.bsky.social
Associate professor at Université de Lorraine. Doing research is speech and audio processing.