Data-centric AI
Created by
@mmhamdy.bsky.social
@agcrnz.bsky.social
@loubnabnl.hf.co
SmolLMs & Data @huggingface. Training SmolLMs and curating high-quality web and synthetic datasets ✨ https://loubnabnl.github.io/
@mziizm.bsky.social
seeks to understand language. Staff Research Scientist @Cohere_Labs @Cohere. PhD from @UvA_Amsterdam. https://marziehf.github.io/
@stellaathena.bsky.social
I make sure that OpenAI et al. aren't the only people who are able to study large scale AI systems.
@nataliaelv.hf.co
Building Argilla @ Hugging Face 🤗. Linguist at heart. En ocasiones escribo en castellano.
@sarahooker.bsky.social
I lead Cohere For AI. Formerly Research, Google Brain. ML Efficiency, LLMs, @trustworthy_ml.
@soldaini.net
I like tokens! Lead for OLMo data at @ai2.bsky.social (Dolma 🍇) w @kylelo.bsky.social. Open source is fun 🤖☕️🍕🏳️‍🌈 Opinions are sampled from my own stochastic parrot. More at https://soldaini.net
@simonwillison.net
Independent AI researcher, creator of datasette.io and llm.datasette.io, building open source tools for data journalism, writing about a lot of stuff at https://simonwillison.net/
@ameeelie.bsky.social
Working at HF. I love simple things and making them even simpler. I create both digital and physical products. I co-created Argilla, an open-source app for all who care about doing AI projects responsibly by caring about their data.
@datologyai.com
AI models are what they eat. Optimize training efficiency, maximize performance, and reduce compute costs with our expert curation.
@arielnlee.bsky.social
Data quality & post training magic. Data Provenance Initiative. Platypus. Prev: Founding Research Scientist (Multimodal) @ Raive. MS @ BU ECE, BS @ UCLA.
@pratyushmaini.bsky.social
Data Quality x Privacy PhD student @ CMU with Zico Kolter and Zack Lipton | Founding Member @datologyai.com | Prev. Comp Sc @iitdelhi http://pratyushmaini.github.io/
@llm360.bsky.social
Working on fully open-source LLMs and training data. We believe in community-owned AI. https://www.llm360.ai
@gabrielmb.com
ML Engineer @hf.co 🤗 Building tools for you to take care of your datasets like Argilla or distilabel!
@benburtenshaw.bsky.social
Building tools for AI datasets. 😽 Looking in AI datasets. 🙀 Sharing clean open AI datasets. 😻 at https://bsky.app/profile/hf.co
@leavittron.bsky.social
Chief Science Officer, Co-Founder @datologyai. Former: Head of Data Research @MosaicML; FAIR. Views are from nowhere.
@arimorcos.bsky.social
CEO and Co-founder @ DatologyAI working to make it easy for anyone to make the most of their data. Former: RS FAIR, RS DeepMind, Harvard Neuroscience PhD. www.datologyai.com
@dvilasuero.hf.co
Everything datasets and human feedback for AI at Hugging Face. Prev: co-founder and CEO of Argilla (acquired by Hugging Face)
@shaynelongpre.bsky.social
PhD @ MIT. Prev: Google DeepMind, Apple, Stanford. 🇨🇦 Interests: AI/ML/NLP, Data-centric AI, transparency & societal impact