Journal of Data-centric Machine Learning Research @DMLRJournal
A member of the @JmlrOrg family, the top archival venue for articles focused on the data aspect of machine learning research
Follow @DMLRWorkshop for workshopsdata.mlr.pressJoined September 2023
Synthetic Datasets for Machine Learning on Spatio-Temporal Graphs using PDEs
by Jost Arndt, Utku Isil, Michael Detzel, Wojciech Samek, Jackie Ma
Action Editor: Yi Liu
data.mlr.press/assets/pdf/v02…
Chronicling Germany: An Annotated Historical Newspaper Dataset
by Christian Schultze, Niklas Kerkfeld, Kara Kuebart, Princilia Weber, Moritz Wolter, Felix Selgert
Action Editor: Hugo Jair Escalante
data.mlr.press/assets/pdf/v02…
MONSTER: Monash Scalable Time Series Evaluation Repository
by Angus Dempster, Navid Mohammadi Foumani, Chang Wei Tan, Lynn Miller, Amish Mishra, Mahsa Salehi, Charlotte Pelletier, Daniel F. Schmidt, Geoffrey I. Webb
Action Editor: Hugo Jair Escalante
data.mlr.press/assets/pdf/v02…
FlowBench: A Large Scale Benchmark for Flow Simulation over Complex Geometries
by Ronak Tali, et al
Action Editor: Sergio Escalera
data.mlr.press/assets/pdf/v02…
Text Quality-Based Pruning for Efficient Training of Language Models
by Vasu Sharma, Karthik Padthe, Newsha Ardalani, Kushal Tirumala, Russell Howes, Hu Xu, Po-Yao Huang, Daniel Li Chen, Armen Aghajanyan, Gargi Ghosh, Luke Zettlemoyer
AE: Yang Liu
data.mlr.press/assets/pdf/v02…
Deep Learning for Accurate Diagnosis of Viral Infections through scRNA-seq Analysis: A Comprehensive Benchmark Study
by Ziwei Yang, Xuxi Chen, Biqing Zhu, Tianlong Chen, Zhangyang Wang
Action Editor: Sergio Escalera
data.mlr.press/assets/pdf/v02…
Data Acquisition: A New Frontier in Data-centric AI
by Lingjiao Chen, Bilge Acun, Newsha Ardalani, Yifan Sun, Feiyang Kang, Hanrui Lyu, Yongchan Kwon, Ruoxi Jia, Carole-Jean Wu, Matei Zaharia, James Zou
Action Editor: Remi Denton
data.mlr.press/assets/pdf/v02…
Challenge design roadmap
by Hugo Jair Escalante, Isabelle Guyon, Addison Howard, Walter Reade, Sébastien Treguer
Action Editor: Sebastian Schelter
data.mlr.press/assets/pdf/v02…
V-LoL😂: A Diagnostic Dataset for Visual Logical Learning
by Lukas Helff, Wolfgang Stammer, Hikaru Shindo, Devendra Singh Dhami, Kristian Kersting
Action Editor: Christopher De Sa
data.mlr.press/assets/pdf/v02…
SuperBench: A Super-Resolution Benchmark Dataset for Scientific Machine Learning
by Pu Ren, N. Benjamin Erichson, Junyi Guo, Shashank Subramanian, Omer San, Zarija Lukić, Michael W. Mahoney
Action Editor: Holger Caesar
data.mlr.press/assets/pdf/v02…
Towards impactful challenges: post-challenge paper, benchmarks and other dissemination actions
by Antoine Marot, David Rousseau, Zhen (Zach) Xu
Action Editor: Sebastian Schelter
data.mlr.press/assets/pdf/v02…
Constructing Confidence Intervals for "the" Generalization Error – a Comprehensive Benchmark Study
by Hannah Schulz-Kümpel, Sebastian Fischer, Roman Hornung, Anne-Laure Boulesteix, Thomas Nagler, Bernd Bischl
Action Editor: Yue Zhao
data.mlr.press/assets/pdf/v02…
ATCO2 corpus: A Large-Scale Dataset for Research on Automatic Speech Recognition and Natural Language Understanding of Air Traffic Control Communications
by Juan Zuluaga-Gomez, et al.
Action Editor: Peter Mattson
data.mlr.press/assets/pdf/v02…
Evaluating Durability: Benchmark Insights into Image and Text Watermarking
by Jielin Qiu, William Han, Xuandong Zhao, Shangbang Long, Christos Faloutsos, Lei Li
Action Editor: Hongyang Zhang
data.mlr.press/assets/pdf/v02…
1 Followers 123 FollowingDistributed systems engineer. Curiosity about data infrastructure, scaling laws and data-centric ML. @kurtiscwright on Substack. greenjollygiant on BlueSky
7 Followers 263 Following💻AI Expert | 💰 Tech & Finance Buff | Exploring the intersection of AI, business, and investments | On a mission to innovate and grow wealth.
6 Followers 398 FollowingA collection of dreams, memories, and fleeting moments ☯️ Shitposts in search of the most interesting questions in Science, Philosophy, and Culture
6 Followers 278 Following2nd International Conference on Artificial Intelligence and Big Data Analytics | October 15-16, 2026 | Singapore City, Singapore
4 Followers 226 FollowingPhD in AI | Freelance ML/Data Science engineer | Building and sharing hands-on AI tools, insights, and entrepreneurial experiments.
13 Followers 242 FollowingI work at the intersection of AI and data science, transforming challenges into intelligent solutions | Always learning, always innovating.
9 Followers 250 FollowingBuild orchestration logic that merges e-bike bots, delivery pods, and aerial units into one chain. Race folding bikes for fun.
163 Followers 1 FollowingWorkshops Series on Data-centric Machine Learning Research Next workshop will take place at ICML 2024. Check out @DMLRJournal for the journal's account.
157K Followers 41 FollowingSydney Dec 6-12, 26, Paris and Atlanta. Tweets to this account are not monitored. Please send feedback to [email protected].
1K Followers 393 FollowingFaculty @TUDelft, Prev. @ETH_en @Stanford @EPFL_en
Research in ML robustness, reliability and reasoning
Exec. @DMLRJournal, Some chair @iclr_conf & @NeurIPSConf
38K Followers 13 FollowingProfessor of machine learning at the University of Cambridge. Opinions are my own. Author of "The Atomic Human"
Mainly found on @lawrennd.bsky.social
3K Followers 1K FollowingAI Researcher, @open_ml founder, research lead @TUeindhoven. Building AI systems that learn how to learn, grow and adapt continuously & push humanity forward.