Dan Deutsch @_danieldeutsch

Research Scientist at Google Translate working on text generation evaluation danieldeutsch.github.io San Francisco Joined September 2012

Tweets

92
Followers

601
Following

92
Likes

132

Vilém Zouhar @zouharvi

3 months ago

Machine translation is tough to evaluate, partly because most of what you throw at is too easy. That doesn't at all mean that translation is solved; we're just not doing a good job finding interesting inputs.

1 1 16 828 2

View Details

John Hewitt @johnhewtt

7 months ago

Come do a PhD with me at Columbia! My lab tackles basic problems in alignment, interpretability, safety, and capabilities of language systems. If you love adventuring in model internals and behaviors---to understand and improve---let's do it together! pic: a run in central park

13 128 949 79K 322

View Details

Eleftheria Briakou @ebriakou

7 months ago

🗺️ Are we making our #LLMs multilingual, or anglocentric? Much work brings languages closer to English, but that comes at the cost of crucial #cultural nuance. @h__j___han tackles this trade-off with surgical steering, adapting LLMs to cultural contexts at inference time.

HyoJung Han @hj_han

7 months ago

Lots of work on cross-lingual alignment encourages multilingual LLMs to generalize knowledge across languages. But this push for uniformity creates a tension: what happens to knowledge that should remain local? We look into this trade-off of transfer and cultural erasure:🧵

3 19 61 18K 28

0 11 50 9K 18

View Details

Markus Freitag @markuseful

10 months ago

Our Google Translate team is bringing a strong presence to #ACL2025 in Vienna this week! 🇦🇹 My group is excited to present several of our latest papers. 👇 Don't miss them!

1 5 53 3K 2

View Details

Markus Freitag @markuseful

a year ago

Two new datasets from Google Translate targeting high and low resource languages! WMT24++: 46 new en->xx languages to WMT24, bringing the total to 55 SMOL: 6M tokens for 115 very low-resource languages WMT24++: huggingface.co/datasets/googl… SMOL: huggingface.co/datasets/googl…

2 24 84 16K 51

View Details

iseeaswell꩜bʂky @iseeaswell

a year ago

😼SMOL DATA ALERT! 😼Anouncing SMOL, a professionally-translated dataset for 115 very low-resource languages! Paper: arxiv.org/pdf/2502.12301 Huggingface: huggingface.co/datasets/googl…

3 12 35 4K 11

View Details

Dan Deutsch @_danieldeutsch

a year ago

@shrutirij @prk_riley @esalesk @FirasTr88060642 Stephanie Winkler @BZhangGo @markuseful #nlproc #nlp #ai

1 0 1 243 0

View Details

Dan Deutsch @_danieldeutsch

a year ago

This project was a highly collaborative effort with many people contributing translations, evaluations, analyses, etc., so I want to thank all of my co-authors! @ebriakou @iseeaswell @marafinkels Rebecca Galor @JurikJuraska @gezakovacs Alison Lui @RicardoRei7 @jasonriesa

1 0 2 221 0

View Details

Dan Deutsch @_danieldeutsch

a year ago

🚨New machine translation dataset alert! 🚨We expanded the language coverage of WMT24 from 9 to 55 en->xx language pairs by collecting new reference translations for 46 languages in a dataset called WMT24++ Paper: arxiv.org/abs/2502.12404… Data: huggingface.co/datasets/googl…

3 24 88 7K 26

View Details

Yusuf Kocyigit @mykocyigit

a year ago

Thrilled to share our latest findings on data contamination, from my internship at @Google! We trained almost 90 Models on 1B and 8B scales with various contamination types using machine translation as our task and analyze the impact of contamination. arxiv.org/abs/2501.18771

3 19 85 12K 32

View Details

Dan Deutsch @_danieldeutsch

a year ago

@srush_nlp Sent you an email about tennis!

0 0 1 678 0

View Details

Jurik Juraska @JurikJuraska

a year ago

🚀 We have just released bfloat16 variants of all 3 MetricX-24 models, offering nearly identical performance to their float32 counterparts, but with a 50% smaller memory footprint. ✨ We hope this makes the XL and XXL models more accessible! 🔗 GitHub: github.com/google-researc…

Jurik Juraska @JurikJuraska

2 years ago

🌐 Meet MetricX-24, our SOTA machine translation evaluation metric and a successor to the successful MetricX-23. 🚀 Now open-source in PyTorch/Transformers! 🎉 Ready to take this top performer in the WMT24 Metrics Shared Task for a spin? 🔗 Code: github.com/google-researc…

1 5 17 2K 7

0 2 2 360 0

View Details

Jurik Juraska @JurikJuraska

2 years ago

1 5 17 2K 7

View Details

Dan Deutsch @_danieldeutsch

2 years ago

Super simple and effective way of significantly increasing the performance of your evaluation metric!

Mara Finkelstein @marafinkels

2 years ago

LLMs are typically evaluated w/ automatic metrics on standard test sets, but metrics + test sets are developed independently. This raises a crucial question: Can we design automatic metrics specifically to excel on the test sets we prioritize? Answer: Yes! arxiv.org/abs/2411.15387

4 11 49 12K 40

0 0 8 896 2

View Details

Dan Deutsch @_danieldeutsch

2 years ago

@psingh522 Unfortunately this role requires that you are enrolled in a PhD program. But there are plenty of roles at Google for Master's students that you can find on the Google Careers page buildyourfuture.withgoogle.com/internships

0 0 0 234 0

View Details

Dan Deutsch @_danieldeutsch

2 years ago

New application link! google.com/about/careers/… I am at EMNLP/WMT this week. Please come find me if you want to learn more about this role!

Dan Deutsch @_danieldeutsch

2 years ago

Interested in doing research on Google Translate and Gemini? Good news! I’m hiring for full-time roles on the Google Translate Research Team! Apply here: google.com/about/careers/…