Interpretability & Mechanistic Interpretability
People who work on interpretability for LLMs / VLMs! Message @alessiodevoto.bsky.social to be added :)
Created by
@alessiodevoto.bsky.social
@neelrajani.bsky.social
PhD student in Responsible NLP at the University of Edinburgh, passionate about MechInterp
@sscardapane.bsky.social
I fall in love with a new #machinelearning topic every month | Ass. Prof. Sapienza (Rome) | Author: Alice in a differentiable wonderland (https://www.sscardapane.it/alice-book/)
@busycalibrating.bsky.social
PhD in ML @Mila/UdeM | LLM robustness, safety, interpretability
@variint.bsky.social
Lost in translation | Interpretability of modular convnets | she/her | variint.github.io
@swetakar.bsky.social
Machine learning PhD student @ Blei Lab in Columbia University. Working in mechanistic interpretability, nlp, causal inference, and probabilistic modeling! Previously at Meta for ~3 years on the Bayesian Modeling & Generative AI teams. www.sweta.dev
@wordscompute.bsky.social
nlp/ml phding @ usc, interpretability & reasoning & pretraining & emergence | american, she, iglee.me, likes = bookmarks
@francescortu.bsky.social
NLP & Interpretability | PhD Student @ University of Trieste & Laboratory of Data Engineering of Area Science Park | Prev MPI-IS
@amuuueller.bsky.social
Postdoc at Northeastern and incoming Asst. Prof. at Boston U. Working on NLP, interpretability, causality. Previously: JHU, Meta, AWS
@zacharylipton.bsky.social
CTO & Chief Scientific Officer @ Abridge, CMU ML prof, occasional writer, relapsing 🎷, creator of d2l.ai & approximatelycorrect.com
@sarahooker.bsky.social
I lead Cohere For AI. Formerly Research Google Brain. ML Efficiency, LLMs, @trustworthy_ml.
@yoavgo.bsky.social
@yungsung.bsky.social
PhD student #MIT_CSAIL | Intern #MetaAI #Microsoft #MITIBMLab | BS #NTU in #Taiwan
@yoavartzi.com
LM/NLP/ML researcher ¯\_(ツ)_/¯ yoavartzi.com / associate professor @ Cornell CS + Cornell Tech campus @ NYC / nlp.cornell.edu / associate faculty director @ arXiv.org / researcher @ ASAPP / starting @colmweb.org / building RecNet.io
@sarah-nlp.bsky.social
Research in LM explainability & interpretability since 2017. sarahwie.github.io | Postdoc @ai2.bsky.social & @uwnlp.bsky.social | PhD from Georgia Tech | Views my own, not my employer's.
@nsaphra.bsky.social
Waiting on a robot body. All opinions are universal and held by both employers and family. Current fellow at Harvard Kempner, incoming faculty at Boston University, recruiting students! ML/NLP/they/she.
@gsarti.com
PhD Student at @gronlp.bsky.social, core dev @inseq.org. Interpretability ∩ HCI ∩ #NLProc. gsarti.com
@colah.bsky.social
Reverse engineering neural networks at Anthropic. Previously Distill, OpenAI, Google Brain. Personal account.
@soniajoseph.bsky.social
AI researcher at Mila, visiting researcher at Meta. Also on X: @soniajoseph_
@yuzhaouoe.bsky.social
https://yuzhaouoe.github.io/ | PhD Student @ University of Edinburgh | Opening the Black Box for Efficient Training/Inference
@neuralnoise.com
Researcher in ML/NLP at the University of Edinburgh (faculty at Informatics and EdinburghNLP), Co-Founder/CTO at www.miniml.ai, ELLIS (@ELLIS.eu) Scholar, Generative AI Lab (GAIL, https://gail.ed.ac.uk/) Fellow -- www.neuralnoise.com, he/they
@alessiodevoto.bsky.social
PhD in ML/AI | Researching Efficient ML/AI (vision & language) & Interpretability | @SapienzaRoma @EdinburghNLP | https://alessiodevoto.github.io/