Language model scaling laws appear to be sensitive to data complexity (modulated by the syntactic properties of a PCFG), and gzip compressibility is an effective predictor of these dataset-specific scaling properties.
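To make the claim concrete, here is a minimal sketch of one way to measure "gzip compressibility" of a text corpus as a complexity proxy. The function name `gzip_ratio` and the toy corpora are illustrative assumptions, not taken from the thread or the underlying paper; lower ratios indicate more redundant, lower-complexity data.

```python
import gzip
import random
import string

def gzip_ratio(text: str) -> float:
    """Compressed size / raw size; lower = more compressible (less complex)."""
    raw = text.encode("utf-8")
    return len(gzip.compress(raw)) / len(raw)

random.seed(0)
# Hypothetical corpora at opposite ends of the complexity spectrum:
low_complexity = "the cat sat on the mat. " * 400  # highly redundant text
high_complexity = "".join(
    random.choices(string.printable, k=len(low_complexity))
)  # near-random characters, hard to compress

print(f"low-complexity corpus:  {gzip_ratio(low_complexity):.3f}")
print(f"high-complexity corpus: {gzip_ratio(high_complexity):.3f}")
```

Under the thread's claim, datasets with higher ratios (less compressible) would scale differently from more redundant ones, so this single number could stand in for data complexity when fitting scaling laws.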
everyone wants the visuals and no one wants the data huh? classic
@khoomeik Wait, please correct me if I'm wrong - are you implying that a model's quality is directly proportional to the gzip compressibility of its training dataset, relative to models and datasets in the same size class?
@khoomeik @Yampeleg @BrandesNadav Yet another point in favor of our use of Uniref50 in #Proteinbert (and for others following the same trick) :D