Abitha @abitha___

PhD student at @SCSatCMU abitha-thankaraj.github.io Joined May 2019

Tweets

40
Followers

228
Following

1K
Likes

5K

Lawrence Feng @lawrencefeng17

3 weeks ago

1/ To retain post-training capabilities after further fine-tuning, mix that data into pretraining. The effect can be invisible until fine-tuning begins; early exposure may not help post-training performance, but it changes what persists. How a model learns a task matters.

6 24 86 27K 56

View Details

Amanda Bertsch @abertsch72

a month ago

New paper! allenai.org/papers/olmpool This tackles a puzzle we found during the training of Olmo 3: how could two models with nearly identical short-context performance (and trained on the same data!) behave completely differently after long context extension?

Ai2 @allen_ai

a month ago

Recipes for teaching language models to handle long inputs don't work equally well across model families. We wanted to know why—is it the architecture, the training data, or both? 🧵

5 15 84 25K 62

3 28 111 15K 50

View Details

Abitha @abitha___

2 months ago

@pratyushmaini time to parallelise. multiple subagents should write multiple SKILL.md s.

0 0 3 167 0

View Details

Yash Jangir @off_jangir

3 months ago

🤖 What would LMArena for robotics look like? Introducing RobotArena ∞ We turn real videos into simulated environments and evaluate robot policies at scale using VLM scoring + human preferences A scalable benchmark for robot generalists 🔗 robotarenainf.github.io Details 🧵👇

5 28 126 22K 85

View Details

Christina Baek @_christinabaek

3 months ago

Models are typically specialized to new domains by finetuning on small, high-quality datasets. We find that repeating the same dataset 10–50× starting from pretraining leads to substantially better downstream performance, in some cases outperforming larger models. 🧵

19 80 618 95K 521

View Details

Aldo Gael Carranza @agcrnz

4 months ago

1/ We’ve released a report on our work on multilingual data curation @datologyai. tl;dr: We shift the performance–compute Pareto frontier for multilingual models. Entirely by improving data quality and composition. arxiv: arxiv.org/abs/2602.15210 blog: datologyai.com/blog/berweb-in…

Ricardo Monti @RicardoMonti9

4 months ago

1/ People often think better multilingual models must come at the cost of English performance. Not true. The constraint isn’t capacity, it’s data quality, and we can fix it. Today @datologyai shares ÜberWeb: a year of multilingual curation lessons, scaled to 20T+ tokens.

7 30 153 39K 67

2 9 35 3K 11

View Details

Kaleigh Mentzer @KaleighMentzer

4 months ago

🌎Making your model multilingual doesn't have to sacrifice English performance—you just need better data. @agcrnz, @RicardoMonti9, and I have been working on curating the best possible multilingual data with the team @datologyai, and it works! Check out the results 👇

Ricardo Monti @RicardoMonti9

4 months ago

7 30 153 39K 67

0 13 31 3K 2

View Details

Ricardo Monti @RicardoMonti9

4 months ago

7 30 153 39K 67

View Details

Amanda Bertsch @abertsch72

7 months ago

Can LLMs accurately aggregate information over long, information-dense texts? Not yet… We introduce Oolong, a dataset of simple-to-verify information aggregation questions over long inputs. No model achieves >50% accuracy at 128K on Oolong!

13 68 358 81K 218

View Details

Abitha @abitha___

7 months ago

@universeinanegg @yoavgo chatgpt.com/share/6900f033… ^ seems to fix some of this behavior

0 0 1 35 0

View Details

Abitha @abitha___

7 months ago

@universeinanegg @yoavgo Training objective mismatch in post training : Language models being unable to output ‘I don’t know’- arxiv.org/abs/2506.09038; Very vaguely - the model just picks the closest embedding. This explains the repetition and retrying until the token budget runs out.

1 0 0 57 0

View Details

Abitha @abitha___

8 months ago

Homanga is an incredible researcher and mentor. If you value thoughtful insights and exciting research problems, apply to work with him at JHU!

Homanga Bharadhwaj @mangahomanga

8 months ago

I'll be joining the faculty @JohnsHopkins late next year as a tenure-track assistant professor in @JHUCompSci Looking for PhD students to join me tackling fun problems in robot manipulation, learning from human data, understanding+predicting physical interactions, and beyond!

87 115 852 132K 161

11 months ago

@abitha___ will be presenting our work on training language models to predict further into the future beyond the next token and the benefits this objective brings. x.com/gm8xx8/status/…

𝚐𝔪𝟾𝚡𝚡𝟾 @gm8xx8

a year ago

Looking beyond the next token TRELAWNEY inserts future tokens <T>...</T> during training to teach models to plan ahead—boosting reasoning, coherence, and control. Highlights: - NO ARCHITECTURE CHANGES. JUST SMARTER DATA. - works with standard decoding - enables controllable

10 54 287 24K 199

0 5 18 2K 3

View Details

Yiding Jiang @yidingjiang

11 months ago

I will talk about how to train agents with decision making capabilities that generalize to completely new environments: x.com/FahimTajwar10/…

Fahim Tajwar @FahimTajwar10

a year ago

Interacting with the external world and reacting based on outcomes are crucial capabilities of agentic systems, but existing LLMs’ ability to do so is limited. Introducing Paprika 🌶️, our work on making LLMs general decision makers than can solve new tasks zero-shot. 🧵 1/n

5 94 458 57K 336

2 4 19 4K 8

View Details

Gokul Swamy @g_k_swamy

a year ago

Say ahoy to 𝚂𝙰𝙸𝙻𝙾𝚁⛵: a new paradigm of *learning to search* from demonstrations, enabling test-time reasoning about how to recover from mistakes w/o any additional human feedback! 𝚂𝙰𝙸𝙻𝙾𝚁 ⛵ out-performs Diffusion Policies trained via behavioral cloning on 5-10x data!

10 79 266 90K 133

View Details

Yutong (Kelly) He @electronickale

a year ago

✨ Love 4o-style image generation but prefer to use Midjourney? Tired of manual prompt crafting from inspo images? PRISM to the rescue! 🖼️→📝→🖼️ We automate black-box prompt engineering—no training, no embeddings, just accurate, readable prompts from your inspo images! 1/🧵