Large nnet is all you need - no matter which architecture. I've pitched something similar to @jekbradbury: Just scale up our QRNNs and you'd likely get similar performance to transformers.
@RichardSocher @jekbradbury if only it were that easy
@RichardSocher @jekbradbury throwback
@RichardSocher @jekbradbury This is an interesting opinion and I share the same views on size. But I cannot understand the unreasonable effectiveness of decoder-only models vs. less performant encoder-decoder systems with regard to generation capabilities. Has anyone investigated that? Tips/pointers 😉
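For readers unfamiliar with the distinction behind that question, a minimal sketch of the structural difference may help. This is purely illustrative (no learned weights, just attention masks): a decoder-only model applies one causal mask over the concatenated source-plus-target sequence, while an encoder-decoder model attends bidirectionally over the source and adds cross-attention. The function names and shapes here are assumptions for the sake of the example, not any particular library's API.

```python
import numpy as np

def decoder_only_mask(src_len, tgt_len):
    """Causal mask over the concatenated [src; tgt] sequence.
    Entry (i, j) is True if position i may attend to position j."""
    n = src_len + tgt_len
    return np.tril(np.ones((n, n), dtype=bool))

def encoder_decoder_masks(src_len, tgt_len):
    """Encoder self-attention: fully bidirectional over the source.
    Decoder self-attention: causal over the target.
    Cross-attention: every target position sees the whole source."""
    enc = np.ones((src_len, src_len), dtype=bool)
    dec = np.tril(np.ones((tgt_len, tgt_len), dtype=bool))
    cross = np.ones((tgt_len, src_len), dtype=bool)
    return enc, dec, cross

m = decoder_only_mask(3, 2)
enc, dec, cross = encoder_decoder_masks(3, 2)
# In the decoder-only case the source tokens themselves are causally
# masked: the first source position sees only itself, while the
# encoder-decoder's encoder lets it see the full source.
print(m[0].sum(), enc[0].sum())  # 1 vs 3 visible source positions
```

One concrete hypothesis people raise about the question above: despite this masking asymmetry on the input, decoder-only models generate surprisingly well at scale, which is exactly the puzzle the tweet is asking about.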