Daisuke Okanohara / 岡野原大輔 @hillbig

Co-founder and CER of Preferred Networks (PFN). CEO of PFCC. CEO of PFE. Interested in AI, science, and business. hillbig.github.io Japan Tokyo Joined January 2008

Tweets 5K
Followers 30K
Following 619
Likes 539

Daisuke Okanohara / 岡野原大輔 @hillbig

5 days ago

They propose training an LLM (M2) on text compressed by another NN (M1). They found that this works only when the coder state and the M1 state are reset at each window boundary, so that M2 can learn to decode (understand) the compressed text during training. arxiv.org/abs/2404.03626…

0 1 7 5K 4

Daisuke Okanohara / 岡野原大輔 @hillbig

5 days ago

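An LLM (M2) is trained on text that has been arithmetic-coded according to the probability distribution predicted by an NN (M1). When the text is split so that each window has a fixed compressed size, and the states of M1 and the compressor are reset at each window, M2 can infer M1's compression/decompression process and training becomes feasible. It does not yet beat SentencePiece, but it is promising for the future. arxiv.org/abs/2404.03626…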

0 18 103 15K 41
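
The windowing idea in the two tweets above can be sketched roughly as follows. This is a minimal illustration, not the paper's implementation: a toy unigram model stands in for M1, the ideal code length (-log2 p) stands in for the arithmetic coder's output size, and the names `token_bits` and `equal_info_windows` are hypothetical.

```python
import math
from collections import Counter

def token_bits(tokens):
    """Ideal code length (-log2 p) of each token under a toy unigram model (stand-in for M1)."""
    counts = Counter(tokens)
    total = len(tokens)
    return [-math.log2(counts[t] / total) for t in tokens]

def equal_info_windows(tokens, bits_per_window):
    """Greedily cut the stream so that each window needs at most bits_per_window bits."""
    bits = token_bits(tokens)
    windows, current, used = [], [], 0.0
    for tok, b in zip(tokens, bits):
        if current and used + b > bits_per_window:
            windows.append(current)   # boundary: M1 and the coder state would be reset here
            current, used = [], 0.0
        current.append(tok)
        used += b
    if current:
        windows.append(current)
    return windows

text = list("abracadabra abracadabra")
windows = equal_info_windows(text, bits_per_window=16)
```

Each window then carries roughly the same amount of information, so a fixed-size compressed block corresponds to a variable-length span of the original text.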

Daisuke Okanohara / 岡野原大輔 @hillbig

6 days ago

FlowMap takes a video and uses optical flow and a point cloud tracker to estimate camera poses, intrinsics, and dense depth by gradient descent, achieving COLMAP-level quality. A depth model is fitted for each scene, while the other quantities are computed differentiably from the inputs and the depth. arxiv.org/abs/2404.15259

0 1 5 5K 0

Daisuke Okanohara / 岡野原大輔 @hillbig

6 days ago

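FlowMap takes a video plus existing optical flow and a point cloud tracker as input and estimates camera poses, intrinsic parameters, and dense per-frame depth. A depth model is trained per scene; everything else is expressed as a differentiable function of the inputs and the depth, so the whole pipeline is differentiable. It matches COLMAP, the long-standing de facto standard, in accuracy and speed. arxiv.org/abs/2404.15259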

0 17 98 14K 59

Daisuke Okanohara / 岡野原大輔 @hillbig

a week ago

Dual Propagation is a non-BP learning method based on the contrastive Hebbian rule, in which each neuron holds a pair of states, positively and negatively nudged, from which the activation and the error are obtained (other formulations are proposed in arxiv.org/pdf/2402.08573). arxiv.org/abs/2302.01228

1 0 6 5K 1

Daisuke Okanohara / 岡野原大輔 @hillbig

a week ago

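Dual Propagation is a learning method based on the contrastive Hebbian rule. Each neuron holds a pair of states, one nudged toward the target and one nudged away from it; their mean represents the forward activation and their difference the backward error. Because each layer can be solved analytically, it is as fast as BP and its updates are local (various formulations are also given in arxiv.org/pdf/2402.08573). arxiv.org/abs/2302.01228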

0 24 101 14K 47

Daisuke Okanohara / 岡野原大輔 @hillbig

a week ago

Many non-BP methods, such as equilibrium propagation, require symmetric weights. They showed that asymmetric weights can be used for learning as long as the norm of the asymmetric part of the Jacobian, estimated with Hutchinson's estimator, is suppressed. openreview.net/forum?id=kUveo…

0 3 5 4K 5

Daisuke Okanohara / 岡野原大輔 @hillbig

a week ago

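Many non-BP methods, such as equilibrium propagation, require symmetric weights, which has been a problem for implementation on dedicated hardware and for plausibility in the brain. They prove that the bias introduced by asymmetric weights can be expressed by the norm of the asymmetric component of the Jacobian, and show that learning is possible even with asymmetric weights as long as this norm, estimated with the Hutchinson estimator, is kept small. openreview.net/forum?id=kUveo…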

0 17 57 10K 25
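
As a rough illustration of the Hutchinson-style estimate mentioned above: for random probes v with identity covariance, E[||A v||^2] equals the squared Frobenius norm of A. The sketch below applies this to A = (J - J^T) / 2. It is an assumption-laden toy, not the paper's code: the Jacobian is an explicit NumPy matrix, whereas a real network would supply `jvp` and `vjp` through automatic differentiation, and the function name is hypothetical.

```python
import numpy as np

def hutchinson_asym_norm_sq(jvp, vjp, dim, num_probes, rng):
    """Estimate ||(J - J^T) / 2||_F^2 using only products J v and J^T v."""
    total = 0.0
    for _ in range(num_probes):
        v = rng.choice([-1.0, 1.0], size=dim)   # Rademacher probe, E[v v^T] = I
        a_v = 0.5 * (jvp(v) - vjp(v))           # asymmetric part applied to v
        total += a_v @ a_v
    return total / num_probes

rng = np.random.default_rng(0)
J = rng.normal(size=(8, 8))                      # toy Jacobian
estimate = hutchinson_asym_norm_sq(lambda v: J @ v, lambda v: J.T @ v, 8, 20000, rng)
exact = np.linalg.norm(0.5 * (J - J.T), "fro") ** 2
```

Because the estimate needs only matrix-vector products, it could in principle be added to a training loss as a penalty without ever forming the Jacobian.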

Daisuke Okanohara / 岡野原大輔 @hillbig

a week ago

At the Generative AI Conference on Wednesday, May 8, I will be speaking about the frontier of LLMs and the outlook ahead. If you are interested, please join.

【5/8現地参加残りわずか！】生成AI Conf実行委員会 @gen_ai_conf

a week ago

1 8 26 32K 8

1 16 100 27K 33

Matlantis™ @matlantis_pfcc

2 weeks ago

We have published a case study on how AGC Inc. uses #Matlantis. Three researchers responsible for materials R&D told us how they use #Matlantis and about their related initiatives. Details here: matlantis.com/ja/case-study/… #computationalchemistry #materialsscience #materialsinformatics #MI

0 6 11 5K 1

Daisuke Okanohara / 岡野原大輔 @hillbig

2 weeks ago

Multi-head MoE splits each token into sub-tokens and applies MoE to each sub-token (linear projections are inserted before and after). This increases expert utilization and achieves better performance than the original MoE. arxiv.org/abs/2404.15045

0 2 9 5K 1

Daisuke Okanohara / 岡野原大輔 @hillbig

2 weeks ago

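Multi-head MoE applies a linear projection to each token, splits it into h (2 to 6) heads, applies MoE to each head, then concatenates the results and maps them back with a linear projection. This multiplies the number of experts used per token by h, lets the model use experts from more perspectives than the original MoE, and outperforms MoE at the same parameter count. arxiv.org/abs/2404.15045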

0 16 88 13K 40
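
The project, split, route, merge, project sequence described above might look like the following NumPy sketch. It makes several simplifying assumptions that are not from the paper: top-1 routing without gate weighting, toy one-layer tanh experts, and hypothetical parameter names (`w_in`, `w_out`, `router`, `experts`).

```python
import numpy as np

def multi_head_moe(x, w_in, w_out, router, experts, num_heads):
    """x: (tokens, d). Returns the output (tokens, d) and the expert chosen per sub-token."""
    tokens, d = x.shape
    d_head = d // num_heads
    # 1. Linear projection, then split each token into num_heads sub-tokens.
    sub = (x @ w_in).reshape(tokens * num_heads, d_head)
    # 2. Top-1 routing: every sub-token picks its own expert.
    choice = np.argmax(sub @ router, axis=1)
    out = np.empty_like(sub)
    for e, w_e in enumerate(experts):
        mask = choice == e
        out[mask] = np.tanh(sub[mask] @ w_e)     # toy expert
    # 3. Merge sub-tokens back into tokens and project to the original space.
    return out.reshape(tokens, d) @ w_out, choice.reshape(tokens, num_heads)

rng = np.random.default_rng(0)
tokens, d, h, n_exp = 5, 8, 4, 3
x = rng.normal(size=(tokens, d))
w_in = rng.normal(size=(d, d))
w_out = rng.normal(size=(d, d))
router = rng.normal(size=(d // h, n_exp))
experts = rng.normal(size=(n_exp, d // h, d // h))
y, choice = multi_head_moe(x, w_in, w_out, router, experts, h)
```

With h heads, a single token can touch up to h different experts in one layer, which is the "h times more experts per token" effect the tweet describes.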

Matlantis™ @matlantis_pfcc

2 weeks ago

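We are holding a free #Matlantis webinar and look forward to your participation. Date: Thursday, May 30, 2024, 16:00-17:30. Title: The Evolution of Matlantis and an Introduction to the New Feature LightPFP. Details here ↓ matlantis.com/ja/news/webina… #computationalchemistry #materialsscience #materialsinformatics #MI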

0 4 4 4K 2

Daisuke Okanohara / 岡野原大輔 @hillbig

2 weeks ago

Phi-3 (3B, 7B) is trained on textbook-like, high-quality data generated by an LLM (first on general data to acquire reasoning, then on high-quality data to acquire knowledge). It can run on a smartphone while achieving MMLU scores comparable to GPT-3.5. arxiv.org/abs/2404.14219 I've…

0 3 16 6K 6

Daisuke Okanohara / 岡野原大輔 @hillbig

2 weeks ago

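Phi-3 3B and 7B are trained on high-quality data built by using an LLM to filter training data and to generate textbook-like data (they are first trained on general data to acquire reasoning ability, then on high-quality data to bring in knowledge). They can run on a smartphone and match models nearly 10 times larger, such as GPT-3.5, on MMLU and other scores. arxiv.org/abs/2404.14219…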

0 57 281 35K 114

Daisuke Okanohara / 岡野原大輔 @hillbig

2 weeks ago

DPO actually solves token-level inverse Q learning, estimates the optimal advantage function, and solves the token-level credit assignment problem. It can identify the token that led to the outcome of a dialogue, and justify the likelihood-based search. arxiv.org/abs/2404.12358

0 1 13 5K 5

Daisuke Okanohara / 岡野原大輔 @hillbig

2 weeks ago

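DPO, an alignment method for LLMs, actually implements token-level inverse Q-learning, estimating the optimal advantage function and solving the token-level credit assignment problem. For example, it can identify the tokens that caused a dialogue's outcome, and likelihood-maximizing beam search can be viewed directly as return maximization. arxiv.org/abs/2404.12358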

1 31 165 18K 92
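
To make the token-level reading concrete, here is a small sketch of the standard DPO loss written as a sum of per-token implicit rewards, beta * log(pi / pi_ref). The log-probabilities are made-up illustrative numbers, and the helper names are hypothetical; this is not code from the paper.

```python
import math

def token_rewards(policy_logps, ref_logps, beta):
    """Implicit per-token reward: beta * (log pi - log pi_ref)."""
    return [beta * (p - r) for p, r in zip(policy_logps, ref_logps)]

def dpo_loss(chosen_pi, chosen_ref, rejected_pi, rejected_ref, beta=0.1):
    """-log sigmoid(sum of chosen rewards - sum of rejected rewards)."""
    margin = (sum(token_rewards(chosen_pi, chosen_ref, beta))
              - sum(token_rewards(rejected_pi, rejected_ref, beta)))
    return math.log1p(math.exp(-margin))

# Illustrative per-token log-probabilities (not real model outputs).
chosen_pi, chosen_ref = [-1.0, -0.5, -2.0], [-1.0, -1.5, -2.0]
rejected_pi, rejected_ref = [-1.0, -3.0], [-1.0, -2.0]

loss = dpo_loss(chosen_pi, chosen_ref, rejected_pi, rejected_ref)
rewards = token_rewards(chosen_pi, chosen_ref, beta=0.1)
most_credited = max(range(len(rewards)), key=lambda i: rewards[i])
```

Here `most_credited` picks out the token in the chosen response where the policy moved furthest from the reference, which is the kind of per-token credit assignment the tweet refers to.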

Daisuke Okanohara / 岡野原大輔 @hillbig

3 weeks ago

Llama 3 8B/70B is trained on 15T tokens with 16k GPUs, achieving the highest performance at this scale. The architecture is the same as before. Multimodal and multilingual support will come in the future. The 400B+ model still in training is on par with the top LLMs. ai.meta.com/blog/meta-llam…

0 4 9 6K 3

Daisuke Okanohara / 岡野原大輔 @hillbig

3 weeks ago

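Llama 3 8B and 70B were trained on 15 trillion tokens using 16k GPUs. They are the best-performing models of their size. Coding, previously a weak point, is much improved. The architecture is almost the same as before, and safety mechanisms are included. Long context, multimodality, and multilingual support will be released later. The 400B+ model still in training is on par with the current top LLMs. ai.meta.com/blog/meta-llam…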

1 48 175 24K 58

Daisuke Okanohara / 岡野原大輔 @hillbig

3 weeks ago

They re-analyzed the data behind a graph in the "Chinchilla scaling law" paper and found that the originally estimated law does not fit the data, whereas their re-derived scaling law fits well. It is unclear whether the discrepancy arises from the figure-creation process or from the analysis method. arxiv.org/abs/2404.10102

1 1 5 5K 1

小猫遊りょう（.. @jaguring1

29K Followers 248 Following I do mathematics every day. I like highly abstract mathematics and am interested in axiomatic set theory, mathematical logic, and category theory, but I also like more concrete, practical mathematics. I often think about AI technology and its social impact, and I am basically interested in the latest trends in technology in general. When I find a good lecture video, I try to tweet it.

今井翔太 / Shota .. @ImAI_Eruel

47K Followers 830 Following AI researcher / Ph.D. in Engineering (The University of Tokyo) / formerly Matsuo Lab, The University of Tokyo / preparing for a new challenge! / reinforcement learning, multi-agent, generative AI, LLM, game AI / Author of 『生成AIで世界はこう変わる』, 『G検定公式テキスト』, 『AI白書』 / Translator of 『強化学習』 / from Kanazawa, Ishikawa

goto @goto_yuta_

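13K Followers 2K Following I work with LLMs a lot. Mostly tweets about the latest AI (generative AI). Still exploring. Ogiri / closet YouTuber / Kyoto University informatics graduate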

yu4u @yu4u

8K Followers 1K Following General Manager at GO Inc. / Ph.D. in Eng. / Kaggle Competitions Grandmaster https://t.co/UEPcVAxE1B / https://t.co/iTjqtfhbAa…

ばんくし王 @vaaaaanquish

31K Followers 26K Following VPoE at M3, Inc., Google Cloud Champion Innovator (AI/ML)

ステート・オブ.. @stateofai_ja

14K Followers 155 Following We read papers and other sources to explain the latest developments and trends in artificial intelligence (AI) and machine learning (ML) in an accessible way. The person behind the account is @mhagiwara, who draws on experience in NLP and ML research and development as a researcher and engineer at many companies and research institutions in Japan and the US. Currently Senior AI Researcher @earthspecies

Seitaro Shinagawa @sei_shinagawa

8K Followers 3K Following Someone who wants to converse with neural networks / SB Intuitions ← NAIST 知能コミュニケーション研究室 / cvpaper.challenge

Kazunori Sato @kazunori_279

19K Followers 2K Following Developer Advocate, Google Cloud (The opinions expressed here by myself are my own, not those of my employer)

須山敦志 Suyama A.. @sammy_suyama

25K Followers 255 Following I write a blog about statistics, machine learning, and AI. I mainly play around with Bayesian learning. Author of 『ベイズ推論による機械学習入門』 and 『ベイズ深層学習』. Probabilistic models, deep learning, reinforcement learning, data science, artificial intelligence, soupless dandan noodles, cats. * For speaking or writing requests, please DM me.

いもす @imos

24K Followers 2K Following Software engineer @PreferredNet (PFN) ← @Google ← Kyoto University (Intelligence Science and Technology) & founding of @AtCoder. Currently researching financial time series (AAAI acceptance, etc.). Master's thesis on the imos method; bachelor's thesis on ないんたん's weather forecast. ICPC World Finals, Japan representative at the International Olympiad in Informatics, and more. The いもすちゃん icon was drawn by Mel Kishida. My kids are 6 and 2.

カレーちゃん.. @currypurin

13K Followers 887 Following 『面倒なことはChatGPTにやらせよう』 went on sale on 1/29. Kaggle Grandmaster. Requests, questions, anything: https://t.co/bFBeCtAKiZ. I wrote Kaggleスタートブック and the 6th edition of Kaggleのチュートリアル.

Mamoru B Komachi @mamoruk

13K Followers 6K Following Professor (Graduate School of Social Data Science) at Hitotsubashi University. Please note that I may block locked accounts that do not accept mutual follows and accounts with zero posts.

akira @AkiraTOSEI

7K Followers 5K Following Ph.D. candidate, the University of Tokyo ← Tech Lead for the Computer Vision team at ExaWizards Inc. ← Manufacturing Engineer

Shion Honda @shion_honda

7K Followers 272 Following 本田志温 / Software Engineer at @avec_alan / MSc in Computer Science / Tweets are my own

HELLO CYBERNETICS @ML_deep

7K Followers 623 Following I tweet about machine learning, signal processing, and control. Personal views. I can write a bit of C++/Python/MATLAB. https://t.co/sT2SRYoTow

しゅんけー @shunk031

4K Followers 699 Following Ph.D. in Engineering. ex-Iyatomi Lab, JSPS DC2 | Course on image generation AI with diffusion models now available: https://t.co/NwVMnmIUxY | ENTP 🐉

kota small matsui @matsui_kota

6K Followers 675 Following N大学I学系研究科S物TK学分野. Interested in statistical and machine learning problems in medicine and materials science.

Ryota Kanai / 金井.. @kanair_jp

13K Followers 572 Following

Yusuke Hayashi 林祐.. @hayashiyus

10K Followers 671 Following I work on data utilization support and AI research and development (independent researcher, corporate researcher, formerly at the Bank of Japan), physics lover, Japan Digital Design Senior Researcher, Independent Scholar, EX-Bank of Japan Economist, Physics enthusiast

Takami Sato @tkm2261

10K Followers 828 Following ML Security Researcher / Kaggle Grandmaster / CS Ph.D. candidate at UC Irvine / I will be in the job market this Fall. Please feel free to contact me via DM

アフィリエイト.. @gfc4VBz9sQ91907

319 Followers 702 Following アフィリエイトチャンネルですアニメ、動画を中心に商品やサービスを幅広く紹介するチャンネルですアダルトアフィリエイトはこちらから⏬

くぁwせdrftgyふ.. @h1maj1ndao

0 Followers 111 Following

朝妬け @bTkNlawcXM64225

0 Followers 19 Following

斉藤潤也 @vk9vk1

5 Followers 193 Following Tokyo / working adult / information gathering

Hoa Mai @MaiHoa210

0 Followers 16 Following

メンズエステENE.. @ENEL230763

1K Followers 3K Following 20歳以上の経験者、未経験者、学生、主婦歓迎！給与日給30,000円以上全額日払い（歩合率60%～70%）勤務地広島市中区・呉（同時募集）勤務時間 11時～翌5時あなたのスケジュールに合わせて働ける曜日、時間帯のみでＯＫです！勤務日完全自由出勤

マイペースで生.. @sdg1336925dc

1K Followers 5K Following 自分らしく周りとの方々と摩擦を起こさないで地道に生きていきたい。来月24日はおたんたん(誕生日✨👍️🎵）ADOさん、木村カエラさん、及川光博さん、キンタロー。さん此方の人々はみんな同じおたんたんでーす！攻撃的な書き込み、ケンカ口調な書き込みは即ブロックします！バックの写真は東秀の中華丼でちゅ❗

Takafumi Hayashi @TakafumiHayash4

2 Followers 16 Following

まさを @Nn5YA0aXxRKHb0N

4 Followers 23 Following

つぼうつぼTsuboU.. @SAKANAfishfry

14 Followers 64 Following Studying Processing

Ru / MLOps @rabbit_x86

65 Followers 180 Following Graduate student, M2

yusuke @sochitarou

20 Followers 404 Following

まこと @micro4559

297 Followers 271 Following

アダルト動画・.. @fu64762

417 Followers 641 Following アダルト動画とライブチャットを中心にアフィリエイトを利用して紹介しています 18歳未満の閲覧は禁止です❌ ＃斉藤帆夏　＃ライブチャット　＃AV

TH @tetsuya_hayashi

28 Followers 503 Following Developing web apps that use AI.

Breakthrough AI @AiBreakthrough

68 Followers 1K Following

愛 @ArtifiIntelli10

1 Followers 31 Following 2002

ibu @ibuki__0224

50 Followers 197 Following

福間　拓郎／Tak.. @takuro_fukuma

22 Followers 105 Following ▶︎1990/福岡在住 ▶︎映像クリエイター ▶︎モーションデザイナーオフライン編集／モーショングラフィックス／カラーグレーディングを主に行なっています。ご連絡はDMまでお願いします。

MIRENA @happylifemie

139 Followers 2K Following

オクさん @OTVAXKdEFz4CIFp

523 Followers 643 Following Nice to meet you, hello. Hollow. This is only tentative, though. Added 2023/02/05: I write various things as part of my hobbies and daily life. Added 2023/07/10: I am mainly in Japan. Added 2024/01/17: I participate in Community Notes. Added 2024/01/26: There may be scammers among my followers and follows.

ヒヨッ子🧬 @Apocrine200

2K Followers 1K Following D2/JSTっ子/博士の受精卵/脳ミソの研究/薬剤師免許使ってバイトさせていただいております/質問箱･DM･リプ大歓迎( ^)o(^ )b/趣味はオカルトと薬学と谷郷元昭と推し活🌸/ファンアート募集/なるべくポジティブなことをツイートしたいと考えております/ ＃ヒヨ勉強

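I develop equity trading systems for funds at （株）オークン. Good at speeding up Python and at GPU computing. Studying generative AI. Osaka University → Osaka University graduate school → Panasonic → switched careers to engineering, now in my current job. Japanese stocks | US stocks | quant | machine learning | pandas | cuDF | We are looking for people to work with 👇👇

Nobu@金融データ.. @nobu_tk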

かつどん @bonbonbon6458

11 Followers 88 Following

ツァン中ヨブト @ideron_harita

127 Followers 168 Following ボンデモル石井、着木田ボグ (チャッキー)、モシャブ、オトロヨ・ズマンドウ、グボイハラ・ムタ。針田家長女, 19-t。著書「スムーザーみたい」「1300円で買えるレーザープリンタ大全」「気が狂わない限り正気」「埼玉は鳥取はシェフ」「ゲホザ海峡」「スカイデーデルヤ」「モーショナルサイコラテン」「メガモンゴル」絶賛発売中

ゲーム開発初心.. @ue51068748

19 Followers 409 Following

あぼしれ @avsl3000

30 Followers 49 Following

K @K3393207570705

57 Followers 88 Following Individual investor, company owner

そら@AWSクラウ.. @mediciEngineers

3K Followers 5K Following Network engineer since 3/1/2023. From 3/1/2024 I will be an AWS cloud engineer. I also dabble in development and production as a hobby and want to study more. ◾️Certifications obtained: IT Passport, AWS CLF, AWS SAA ◾️Certifications in progress: LPIC 1, CCNA ◾️

オモト @qTfmqmEPFe13294

0 Followers 39 Following

amu3 @a2ym_u2

0 Followers 2K Following

Hirotaka Tamura @htktamura

0 Followers 32 Following

Shoei @N_S10230401

32 Followers 350 Following 19

ふぇに @Fusshi2441

8 Followers 18 Following Master's (mathematical sciences) → corporate research position (deep learning). Mathematics / differential geometry / complex geometry / representation theory / deep learning / geometric deep learning / deep generative models / PIML / information geometry / Bayesian statistics. I'm interested in mathematics and deep learning.

Chidoriashi1990 @Chidori0991

5 Followers 161 Following Please call me, ちどりあし fullstack web developer, database engineer, data engineer etc...

天音鈴 @KjXwYegLQAHvoXh

2K Followers 4K Following 小説書き　人生は敗戦処理

すずき @mthijiri

5 Followers 31 Following

Muhammad Daniel @MuhammadDa96

14 Followers 127 Following

obayahi_shinya @obssi

102 Followers 149 Following On leave

南奎佑 @ iOSエ.. @ti_wts

335 Followers 328 Following Kyoto University student / KYO LAB. / full member of 鉄血会 / Web & iOS engineer / JavaScript / TypeScript / Python / Swift / Firebase / GCP / YouTube and TikTok operations / video editing, director / one video with over 3 million views

大吉丼 @Daikich16263373

148 Followers 58 Following S株高配当株投資2022年冬⛄🔰から 🤗📝👓️📚️💻️旅好き ○おばさんが独りでつぶやいてます～宜しくお願いしまーす🤗🤗🤗

pin_wanted @wanted_pink

8 Followers 73 Following Data scientist and data analyst at a JTC manufacturer. Also DX and marketing. Ph.D. in engineering, information science, E資格, 統計検定 Grade Pre-1. Around thirty.

toshi✟hide🛸 @492816357

36 Followers 122 Following 前の表示名は：ﾃﾄ　みなさんよろしく今日も素敵にね

Elizabeth @Elizabe00174030

2 Followers 16 Following Throwback to my favorite travel destination. Can't wait to go back someday!

Shogo Kikuchi @Shogo_kkk

3 Followers 36 Following

FallingCat @dCezwT1yiLiErNU

11 Followers 94 Following

ENEL（エネル）�.. @ENEL23367450

563 Followers 478 Following ENEL（080 8244 9297）はじめまして！至らぬ点もあるかと思いますが、わたしに会いに来てくださったこと後悔させません♡ 一緒に楽しい時間を過ごしましょう♡ 得意技はしっかりマッサージときわきわ鼠蹊部です❤︎

あたっく@魂 @atackpoker

3 Followers 41 Following 02 / University of Tokyo / poker beginner /

春大 @hitto8979

0 Followers 2 Following

Zihua Liu @magicboom2

7 Followers 116 Following Tokyo Institute of Technology. / Ph.D. Candidate of Okutomi&Tanaka lab. ACM/IEEE/CVF Student Member. Japan.

小猫遊りょう（.. @jaguring1

いもす @imos

Mamoru B Komachi @mamoruk

akira @AkiraTOSEI

7K Followers 5K Following Ph.D. candidate, the University of Tokyo ← Tech Lead for the Computer Vision team at ExaWizards Inc. ← Manufacturing Engineer

smly @smly

8K Followers 3K Following Fellow at Rist | Kaggle Grandmaster https://t.co/lBVa8oGfCW | Mahjong AI Competition https://t.co/8b1Ytq5G54

横山トモヤス｜.. @yoko_materialDX

5K Followers 693 Following Lead Researcher at Panasonic Holdings Corporation | R&D on clean energy materials through materials × DX (#マテリアルズインフォマティクス) | completed a doctoral program while working | computational materials science, first-principles calculations, machine learning | see my personal website for more | started this because of a company training program | views expressed are my own

Preferred Networks @PreferredNetJP

19K Followers 137 Following Official Japanese account of Preferred Networks (#PFN). - English Account: @PreferredNet - Tech Account: @preferred_jp

Takuya Akiba @iwiwi

20K Followers 1K Following Research Scientist @SakanaAILabs

PFN Tech @preferred_jp

14K Followers 979 Following Posting research and development news from Preferred Networks, Inc. #PFN

Taku Kudo @taku910

5K Followers 359 Following Morphological analysis and so on

東京大学松尾.. @Matsuo_Lab

35K Followers 3 Following The account of the Matsuo Lab at the University of Tokyo. We share information on classes and research, the atmosphere of the lab, and more.

Yuya Unno @unnonouno

14K Followers 2K Following Preferred Networks, VP of retail. Distribution, retail, logistics. Natural language processing, machine learning, language and robots. 『深層学習による自然言語処理』. Logistics robots, tidying robots, dialogue robots, Chainer, CuPy, Jubatus. Pentax K1 M-II, table tennis, cooking, cycling

しましま @shima__shima

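10K Followers 144 Following I research data mining and recommender systems. Explanatory and lecture materials on data mining, machine learning, and recommender systems are available at https://t.co/znf3970s1q. [Tweets here are my personal views]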

ollama @ollama

32K Followers 7 Following https://t.co/rx433zDvXt

Omar Sanseviero @osanseviero

32K Followers 2K Following Chief Llama Officer @huggingface 🦙 Founder @AI_Learners. Xoogler (SWE @Google Assistant, 20% PM TF Graphics). 100% Hacker Llama🇵🇪🇲🇽

cohere @cohere

99K Followers 4 Following Give your technology language.

Aaron Defazio @aaron_defazio

6K Followers 373 Following Research Scientist at Meta working on optimization. Fundamental AI Research (FAIR) team

Databricks @databricks

70K Followers 1K Following Databricks is the data and AI company, helping data teams solve the world’s toughest problems.

Jonathan Frankle @jefrankle

16K Followers 684 Following Chief Scientist, Neural Networks @Databricks via MosaicML. PhD @MIT_CSAIL. BS/MS @PrincetonCS. DC area native. Making AI efficient for everyone at @DbrxMosaicAI

Nathan Lambert @natolambert

25K Followers 693 Following Figuring out AI @allen_ai, "rl boi" DM me papers. Writes @interconnectsai, talks @retortai Has phd and some credentials

Daniel Han @danielhanchen

7K Followers 945 Following Building @UnslothAI. Finetune LLMs 30x faster https://t.co/aRyAAgKOR7. Prev ML at NVIDIA. Hyperlearn used by NASA. I like maths, making code go fast

工藤郁子 Fumiko K.. @inflorescencia

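3K Followers 640 Following When I introduced myself as an office worker in Minato Ward, someone next to me called it "the implosion of a concept". Co-author of 『ロボット・AIと法』, 『AIと憲法』, 『在野研究ビギナーズ』, and others. I like flowers and fruit.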

Atsuo Kishimoto @ooousta

651 Followers 73 Following A public work notebook for thinking about risk issues, by 岸本充生 (risk assessment, emerging technologies, economic analysis, governance, impact assessment). https://t.co/okxdvvwaW1

Andrew Campbell @AndrewC_ML

433 Followers 91 Following Machine Learning PhD student - Dept. Statistics University of Oxford

Saining Xie @sainingxie

14K Followers 1K Following researcher in #deeplearning #computervision | assistant professor at @NYU_Courant @nyuniversity | previous: research scientist @metaai (FAIR) @UCSanDiego

Bill Peebles @billpeeb

32K Followers 287 Following sora and agi @openai

Tim Brooks @_tim_brooks

29K Followers 76 Following Sora research lead @OpenAI

Hitomi Yanaka (谷中.. @verypluming

5K Followers 528 Following Associate Professor, the University of Tokyo, Neuro-Symbolic AI for Language Understanding #NLProc Ph.D. in Engineering (UTokyo) → RIKEN → Associate Professor, UTokyo IS/CS | 卓越研究員, Yanaka Lab | JST PRESTO | Board member, Japanese Society for Artificial Intelligence | ダンモ alum

Tom Jobbins @TheBlokeAI

16K Followers 237 Following My Hugging Face repos: https://t.co/yh7J4DFGTc Discord server: https://t.co/5h6rGsGfBx Patreon: https://t.co/yfQwFggGtx

Snorkel AI @SnorkelAI

16K Followers 156 Following Programmatic data development for production AI

Autonomous Vision Gro.. @AutoVisionGroup

12K Followers 371 Following Autonomous Vision Group of Andreas Geiger at the University of Tübingen. We are excited about Computer Vision, Machine Learning and Robotics.

Delip Rao e/σ @deliprao

46K Followers 5K Following Busy inventing the shipwreck. @Penn. Past: @johnshopkins, @UCSC, @Amazon, @Twitter ||Art: #NLProc, Vision, Speech, #DeepLearning || Life: 道元, improv, running 🌈

岩波書店 @Iwanamishoten

128K Followers 1K Following A general publisher founded in 1913. 岩波文庫, 岩波新書, humanities books, natural science books, children's books, and dictionaries including 広辞苑. To share questioning and thinking with people everywhere, we broadly convey the fruits of human creative activity in scholarship, thought, literature, the arts, and more. Please send questions by email. ☞ twitter_adあっとまーくhttps://t.co/yuT8uV3KlW

baibai @ibaibabaibai

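5K Followers 964 Following Member of the 岩波データサイエンス editorial committee. Opinions on scholarship and society expressed on this account do not in any sense represent or speak for my affiliated organizations.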

みずもともや @tomo_wb

2K Followers 727 Following LY Corporation / SB Intuitions / RIKEN AIP ← LINE ← Future ← RIKEN AIP ← Inui Lab, Tohoku University (specially appointed assistant professor) ← Ph.D. from Matsumoto Lab, NAIST. Natural language processing, especially research on automated scoring, grammatical error correction, automated proofreading, dialogue, and evaluation. https://t.co/7UfUp5qtvQ

Agrim Gupta @agrimgupta92

2K Followers 307 Following Simulating reality @stanford

Karsten Kreis @karsten_kreis

2K Followers 444 Following Senior Research Scientist at @NVIDIA | Former Physicist | Deep Generative Learning. Opinions are my own.

Ben Mildenhall @BenMildenhall

5K Followers 1K Following making stuff 3D. formerly research scientist at Google, phd at Berkeley.

Sebastian Raschka @rasbt

268K Followers 885 Following Machine learning & AI researcher writing at https://t.co/A0tXWzG1p5. LLM research engineer @LightningAI. Previously stats professor at UW-Madison.

Tianyu Gao @gaotianyu1350

3K Followers 687 Following CS PhD student @Princeton @Princeton_nlp working on NLP. Previously: @Tsinghua_Uni @TsinghuaNLP

Albert Gu @_albertgu

9K Followers 90 Following assistant prof @mldcmu. chief scientist @cartesia_ai. leading the ssm revolution.

Naoko Murai 村井七.. @murainaoko

934 Followers 2K Following Reporter in the business news section of the Asahi Shimbun @asahi, covering AI, digital platforms, and more / 2021-22 master's degree in Communication Studies (Digital Media in Europe), Vrije Universiteit Brussel (VUB) / icon by @akiko_ma

gavin leech @g_leech_

4K Followers 420 Following the subject of criticism @ArbResearch, @Bristol_AI_CDT, ESPR

Simon Batzner @simonbatzner

4K Followers 749 Following RS at Google DeepMind. Prev: PhD at Harvard, MIT, NASA, Google Brain.

Mira Murati @miramurati

274K Followers 527 Following CTO @OpenAI

Jim Fan @DrJimFan

231K Followers 3K Following @NVIDIA Sr. Research Manager & Lead of Embodied AI (GEAR Lab). Creating foundation models for Humanoid Robots & Gaming. @Stanford Ph.D. @OpenAI's first intern.

山崎聡@エムス.. @yamamuteking

5K Followers 5K Following Director and CTO of M3, Inc. / VPoP / executive officer for People Success / first VPoE / first CDO / Executive Officer and CTO of Mebix, Inc. / Director of M3 Solutions / co-founder of M3 Digikar, Inc. / and various others / busy developing new businesses / やっていき

Floor Eijkelboom @FEijkelboom

224 Followers 142 Following PhD candidate @UvA_Amsterdam | deep learning for (quantum) physics 🦭

Aravind Srinivas @AravSrinivas

87K Followers 952 Following CEO @perplexity_ai

David Ruhe @djjruhe

1K Followers 397 Following Student Researcher @GoogleDeepMind ; PhD Candidate Machine Learning @AmlabUva, @ai4science_lab @UvA_Amsterdam. Previously @MSFTResearch, @FlatironInst

Nat Friedman @natfriedman

183K Followers 288 Following https://t.co/Lhh178sIjq

Shinichi Takaŷanagi.. @_stakaya

5K Followers 505 Following Ph.D. (Statistical Science). Principal AI Engineer @BCG. Also serving as visiting associate professor at 徳島大学デザイン型AI教育センター, secretary of the IPSJ Big Data research group, and the fairy of 株式会社ホクソエム. Author, translator, or supervising editor of: 評価指標入門, データ分析失敗事例集, 効果検証入門, and others

elvis @omarsar0

190K Followers 486 Following Building with LLMs @dair_ai • Prev: Meta AI, Galactica LLM, PapersWithCode, Elastic, PhD • Creator of the Prompting Guide (~4M learners)

Yuandong Tian ✈️ .. @tydsh

17K Followers 808 Following Research Scientist and Senior Manager in Meta AI (FAIR). AI-guided Optimization and Representation Learning. Novelist in spare time. PhD in @CMU_Robotics.

Bindu Reddy @bindureddy

125K Followers 339 Following CEO of @abacusai, using Gen AI to build Applied AI and LLM agents and systems at scale, ex-AWS / Google, passionate about human behavior and open-source AGI

NeurIPS Conference @NeurIPSConf

112K Followers 35 Following New Orleans, Dec 10-16, 23. https://t.co/ga8aOw615g Tweets to this account are not monitored. Please send feedback to townhall@neurips.cc.

Oᴛᴀɢᴀᴋɪ Sat.. @otagaki

375 Followers 250 Following Metonymy, bread, small bookstores

Jakub Pachocki @merettm

21K Followers 0 Following OpenAI

T2 @t2_auto

115 Followers 6 Following Official account of 株式会社T2, a company aiming to realize trunk-line transport services using autonomous heavy trucks

Jason Weston @jaseweston

9K Followers 569 Following Research @MetaAI+NYU. Pretrain+SFT: NLP from Scratch (2011). Multilayer attention+position encode+LLM: MemNet (2015). Recent (2024): Self-Rewarding LLMs & more!

Databricks Mosaic Res.. @DbrxMosaicAI

30K Followers 115 Following We remove the barriers to state-of-the-art generative AI model development and make data + AI available to all.

Trending GitHub Repos.. @trending_repos

18K Followers 0 Following Tweeting the most starred GitHub repository of the: 📈 day - every day 🏅 week - every Monday 🏆 month - every 1st of the month

小磯まさひろ @koiso_masahiro

5 days ago

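『大規模言語モデルは新たな知能か』 (Iwanami Shoten) #読了 In my second year of high school, the author, Daisuke Okanohara, sat at the desk next to mine. He slept in class yet got good grades. The book explains ChatGPT and similar systems. I came to understand what is so impressive about them, such as the fact that no new training data is needed when tasks are performed through prompts (zero-shot learning).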

0 0 5 133 0

礒部達/プリファードロボティクス CEO @toru_isobe

5 days ago

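Kachaka Pro has been registered as eligible for the 中小企業省力化投資補助金 (a subsidy for labor-saving investment by small and medium-sized enterprises)! Small and medium-sized businesses that buy Kachaka Pro for food-serving use will be able to purchase it at half price! I am told it is the first registration as a serving robot!! If you are interested, please get in touch!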

1 17 69 12K 3

Masanori HIRANO @_mhirano

2 weeks ago

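We have released a model that further adapts the finance-specific model we published the other day to instructions. This time, believe it or not, it was built using model merging. Please give it a try! huggingface.co/pfnet/nekomata…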

1 14 113 53K 75

Sosuke Ito (伊藤創祐) @ito_sosuke

2 weeks ago

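Since I am now writing a new article for 数理科学, I have decided to make last year's article, "熱・統計力学と数学" (Thermal and Statistical Mechanics and Mathematics), public while I am at it. It does not contain much that is non-trivial, but it may be a good article for physics undergraduates and the like to get interested in information geometry and learn about our research. sosuke110.com/surikagaku2023…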

0 42 154 19K 78

Thomas Wolf @Thom_Wolf

2 weeks ago

Llama3 was trained on 15 trillion tokens of public data. But where can you find such datasets and recipes?? Here comes the first release of 🍷Fineweb. A high quality large scale filtered web dataset out-performing all current datasets of its scale. We trained 200+ ablation…

Guilherme Penedo @gui_penedo

2 weeks ago

We have just released 🍷 FineWeb: 15 trillion tokens of high quality web data. We filtered and deduplicated all CommonCrawl between 2013 and 2024. Models trained on FineWeb outperform RefinedWeb, C4, DolmaV1.6, The Pile and SlimPajama!

40 347 2K 589K 836

24 303 2K 292K 967

PFN 3D/4D Scan @pfn_3d

4 weeks ago

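We 3D-scanned the reception area of the Preferred Networks office. Even the moth orchids we received to commemorate the company's 10th anniversary are reproduced down to the fine details. #GaussianSplatting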

1 31 167 31K 27

Aaron Defazio @aaron_defazio

a month ago

Schedule-Free Learning github.com/facebookresear… We have now open sourced the algorithm behind my series of mysterious plots. Each plot was either Schedule-free SGD or Adam, no other tricks!

37 215 1K 425K 917

Teortaxes▶️ @teortaxesTex

a month ago

It seems that results of that Microsoft paper about ternary LLMs can be replicated after all – for 3B@100B at least. huggingface.co/1bitLLM/bitnet…

18 97 697 296K 310

Nathan Lambert @natolambert

a month ago

DBRX system prompt: "You are DBRX, created by Databricks. The current date is March 27, 2024. Your knowledge base was last updated in December 2023. You answer questions about events prior to and after December 2023 the way a highly informed individual in December 2023 would if…

13 36 323 142K 258

Takuya Akiba @iwiwi

a month ago

@hillbig !!! Thank you for the introduction!

0 0 4 2K 0

Taiji Suzuki @btreetaiji

2 months ago

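Kazusato Oko (大古一聡) of our lab has received the President's Grand Award (総長大賞), which is given to particularly outstanding achievements among the President's Awards. The award was for "Statistical and computational analysis of learning problems based on first-order gradient information", recognizing work such as his research on the optimality of diffusion models, a recent hot topic. u-tokyo.ac.jp/content/400163…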

0 24 170 26K 25

Abhi Venigalla @abhi_venigalla

2 months ago

@francoisfleuret The 30x is real and comes from this technical brief, page 15: nvdam.widen.net/s/xqt56dflgh/n… How is 30x possible given GB200 has only ~2.3x increase in memBW and FLOP/s over H100? It involves comparing per-chip generation throughput = output_tokens/s/chip. The two systems compared are…

8 12 157 29K 79

Thomas Wolf @Thom_Wolf

2 months ago

Slides (new deck, painful but worth it!): docs.google.com/presentation/d… What a joy to teach at ELLIS winter school, amazing students and questions Enjoy

Yuki @y_m_asano

2 months ago

And following up on this, we have @Thom_Wolf from @huggingface teaching us how to train LLMs with all the nitty gritty details. And the strong message to focus and inspect THE DATA. Also, second speaker in a row referring to @karpathy's tokenizer lecture 🎏.

3 6 47 43K 25

9 46 255 47K 251

Covariant @CovariantAI

2 months ago

Today, we are introducing RFM-1, our Robotics Foundation Model giving robots human-like reasoning capabilities.

10 75 485 115K 141

Yann LeCun @ylecun

2 months ago

@MLStreetTalk There is no inconsistency between those two segments. In fact, I'm pretty much making the same point. There are 4 models of computation: 1. y=f(x) where f has a fixed number of sequential non-linear steps 2. z(k+1)=g(z(k),x); y=f(z(K)) where K is in principle unbounded 3. ž =…

23 67 469 73K 532

Daniel Han @danielhanchen

2 months ago

Found more bugs for #Gemma: 1. Must add <bos> 2. There’s a typo for <end_of_turn>model 3. sqrt(3072)=55.4256 but bfloat16 is 55.5 4. Layernorm (w+1) must be in float32 5. Keras mixed_bfloat16 RoPE is wrong 6. RoPE is sensitive to y*(1/x) vs y/x 7. (Fixed) RoPE should be float32…

35 176 1K 555K 724
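
Item 3 in the list above (sqrt(3072) becoming 55.5 in bfloat16) can be checked by hand. The sketch below emulates bfloat16 rounding in pure Python, assuming round-to-nearest-even on the float32 bit pattern; `to_bfloat16` is a hypothetical helper, not code from Gemma or any framework, and it does not special-case NaN or infinity.

```python
import math
import struct

def to_bfloat16(x: float) -> float:
    """Round x to the nearest bfloat16 value and return it as a Python float."""
    bits = struct.unpack("<I", struct.pack("<f", x))[0]   # float32 bit pattern
    bias = 0x7FFF + ((bits >> 16) & 1)                    # ties go to the even mantissa
    bits = ((bits + bias) & 0xFFFFFFFF) & 0xFFFF0000      # keep sign, 8 exponent bits, 7 mantissa bits
    return struct.unpack("<f", struct.pack("<I", bits))[0]

exact = math.sqrt(3072)        # about 55.4256
rounded = to_bfloat16(exact)   # bfloat16 values near here are spaced 0.25 apart
```

Between 32 and 64, bfloat16 can only represent multiples of 0.25, so 55.4256 lands on 55.5 rather than 55.25.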

いもす @imos

2 months ago

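At the open house I will give a talk that thoroughly covers LLMs in general. It is based on slides I spent quite a lot of time preparing for internal use last year, and it broadly covers everything from history to the technical components. I think the other talks will be interesting too, so although the event is aimed at students, please come if you are interested.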

Preferred Networks @PreferredNetJP

2 months ago

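[Event for students] We are holding an open house for students who aspire to be engineers or researchers and are interested in large language model (LLM) development. 📅 Friday, March 29, 2024, from 18:30 🏢 PFN Otemachi office. We look forward to your applications! Event details: connpass.com/event/311791/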

0 41 78 77K 37

0 20 89 21K 19

Yann LeCun @ylecun

3 months ago

The Diffusion Transformer paper, by my former-FAIR-and-current-NYU colleague @sainingxie and former-Berkeley-student-and-current-OpenAI engineer William Peebles, was rejected from CVPR2023 for "lack of novelty", accepted at ICCV2023, and apparently forms the basis for Sora.…

Saining Xie @sainingxie

3 months ago

Here's my take on the Sora technical report, with a good dose of speculation that could be totally off. First of all, really appreciate the team for sharing helpful insights and design decisions – Sora is incredible and is set to transform the video generation community. What we…