abhimanyu singh @siinghsaa
#MachineLearning #Infosec #dreamer🙂 #DeepLearning India Joined June 2011-
Tweets144
-
Followers85
-
Following3K
-
Likes892
This is interesting as a first large diffusion-based LLM. Most of the LLMs you've been seeing are ~clones as far as the core modeling approach goes. They're all trained "autoregressively", i.e. predicting tokens from left to right. Diffusion is different - it doesn't go left to right, but all at once. You start with noise and gradually denoise into a token stream. Most of the image / video generation AI tools actually work this way and use Diffusion, not Autoregression. It's only text (and sometimes audio!) that have resisted. So it's been a bit of a mystery to me and many others why, for some reason, text prefers Autoregression, but images/videos prefer Diffusion. This turns out to be a fairly deep rabbit hole that has to do with the distribution of information and noise and our own perception of them, in these domains. If you look close enough, a lot of interesting connections emerge between the two as well. All that to say that this model has the potential to be different, and possibly showcase new, unique psychology, or new strengths and weaknesses. I encourage people to try it out!
We are excited to introduce Mercury, the first commercial-grade diffusion large language model (dLLM)! dLLMs push the frontier of intelligence and speed with parallel, coarse-to-fine text generation.
day 7 of #30DaysofFLCode started the @openminedorg courses finished L1/7 Introduction to Remote Data Science courses.openmined.org/courses/introd…
day 6 of #30DaysofFLCode yet another minicourse at DeepLearning.AI by Nicholas Lane and Daniel J. Beutel day 2/2 at Federated Fine-tuning of LLMs with Private Data finished
day 6 of #30DaysofFLCode yet another minicourse at DeepLearning.AI by Nicholas Lane and Daniel J. Beutel day 2/2 at Federated Fine-tuning of LLMs with Private Data finished
day 5 of hashtag#30DaysofFLCode yet another minicourse at DeepLearning.AI by Nicholas Lane and Daniel J. Beutel day 1/2 at Federated Fine-tuning of LLMs with Private Data lnkd.in/gwWiNGVQ the juiciest and longest chapter of the course yet to be finished
day 5 of hashtag#30DaysofFLCode yet another minicourse at DeepLearning.AI by Nicholas Lane and Daniel J. Beutel day 1/2 at Federated Fine-tuning of LLMs with Private Data lnkd.in/gwWiNGVQ the juiciest and longest chapter of the course yet to be finished
day 4 of #30DaysofFLCode day 2/2 finished the mini course just peeped into the SyftBox syftbox-documentation.openmined.org
day 4 of #30DaysofFLCode day 2/2 finished the mini course just peeped into the SyftBox syftbox-documentation.openmined.org
Day 3 of #30DaysOfFLCode day 1/2 mini-course by DeepLearning.AI on federated learning deeplearning.ai/short-courses/… authors have more emphasis on their framework libraries i.e. flower but still can get a rough idea. will try to replicate code on next day
Day 3 of #30DaysOfFLCode day 1/2 mini-course by DeepLearning.AI on federated learning deeplearning.ai/short-courses/… authors have more emphasis on their framework libraries i.e. flower but still can get a rough idea. will try to replicate code on next day
Day 2 #30DaysOfFLCode went through the video Intro to Privacy Preserving Artificial Intelligence - Andrew Trask youtube.com/watch?v=yUXwsN…
Day 2 #30DaysOfFLCode went through the video Intro to Privacy Preserving Artificial Intelligence - Andrew Trask youtube.com/watch?v=yUXwsN…
Day 1 of hashtag#30DaysOfFLCode went through Chapter 15, Intro to Federated Learning—Deep Learning on Unseen Data , of book Grokking Deep Learning by @iamtrask If you are a complete beginner in Deep Learning and federated learning please go through this chapter.
Day 1 of hashtag#30DaysOfFLCode went through Chapter 15, Intro to Federated Learning—Deep Learning on Unseen Data , of book Grokking Deep Learning by @iamtrask If you are a complete beginner in Deep Learning and federated learning please go through this chapter.
I'm publicly committing to the #30DaysOfFLCode Challenge! (federated learning) Join me in learning more about #FederatedLearning → 30DaysOfFLCode.com
I'm publicly committing to the #30DaysOfFLCode Challenge! (federated learning) Join me in learning more about #FederatedLearning → 30DaysOfFLCode.com
The upper bound for how long to pause AI is only a century, because “farming” (artificially selecting) higher-IQ humans could probably create competent IQ 200 safety researchers. It just takes C-sections to enable huge heads and medical science for other issues that come up.
📽️ New 4 hour (lol) video lecture on YouTube: "Let’s reproduce GPT-2 (124M)" youtu.be/l8pRSuU81PU The video ended up so long because it is... comprehensive: we start with empty file and end up with a GPT-2 (124M) model: - first we build the GPT-2 network - then we optimize it to train very fast - then we set up the training run optimization and hyperparameters by referencing GPT-2 and GPT-3 papers - then we bring up model evaluation, and - then cross our fingers and go to sleep. In the morning we look through the results and enjoy amusing model generations. Our "overnight" run even gets very close to the GPT-3 (124M) model. This video builds on the Zero To Hero series and at times references previous videos. You could also see this video as building my nanoGPT repo, which by the end is about 90% similar. Github. The associated GitHub repo contains the full commit history so you can step through all of the code changes in the video, step by step. github.com/karpathy/build… Chapters. On a high level Section 1 is building up the network, a lot of this might be review. Section 2 is making the training fast. Section 3 is setting up the run. Section 4 is the results. In more detail: 00:00:00 intro: Let’s reproduce GPT-2 (124M) 00:03:39 exploring the GPT-2 (124M) OpenAI checkpoint 00:13:47 SECTION 1: implementing the GPT-2 nn.Module 00:28:08 loading the huggingface/GPT-2 parameters 00:31:00 implementing the forward pass to get logits 00:33:31 sampling init, prefix tokens, tokenization 00:37:02 sampling loop 00:41:47 sample, auto-detect the device 00:45:50 let’s train: data batches (B,T) → logits (B,T,C) 00:52:53 cross entropy loss 00:56:42 optimization loop: overfit a single batch 01:02:00 data loader lite 01:06:14 parameter sharing wte and lm_head 01:13:47 model initialization: std 0.02, residual init 01:22:18 SECTION 2: Let’s make it fast. GPUs, mixed precision, 1000ms 01:28:14 Tensor Cores, timing the code, TF32 precision, 333ms 01:39:38 float16, gradient scalers, bfloat16, 300ms 01:48:15 torch.compile, Python overhead, kernel fusion, 130ms 02:00:18 flash attention, 96ms 02:06:54 nice/ugly numbers. vocab size 50257 → 50304, 93ms 02:14:55 SECTION 3: hyperpamaters, AdamW, gradient clipping 02:21:06 learning rate scheduler: warmup + cosine decay 02:26:21 batch size schedule, weight decay, FusedAdamW, 90ms 02:34:09 gradient accumulation 02:46:52 distributed data parallel (DDP) 03:10:21 datasets used in GPT-2, GPT-3, FineWeb (EDU) 03:23:10 validation data split, validation loss, sampling revive 03:28:23 evaluation: HellaSwag, starting the run 03:43:05 SECTION 4: results in the morning! GPT-2, GPT-3 repro 03:56:21 shoutout to llm.c, equivalent but faster code in raw C/CUDA 03:59:39 summary, phew, build-nanogpt github repo
“Are Transformers Effective for Time Series Forecasting?” represents a pivotal paper, decisively highlighting the shortcomings and deficiencies in research surrounding the use of transformers for #timeseries #forecasting. This paper effectively exposes the deceptive practices employed by various authors in their papers, such as inadequate benchmarking and other tactics, which have previously led to inflated claims regarding the performance of transformers in this domain.
#OWASP #Top10 for Large Language Model (#LLM) Applications v0.5.0 now out highlighting the top security & safety risks and issues that developers and security teams must consider when building applications leveraging #AI / LLMs: Download the PDF here👇 owasp.org/www-project-to…
How can you beat XGBoost, CatBoost, and TabNet on tabular data? Use a cocktail of 13 modern regularization techniques! (arxiv.org/abs/2106.11189) [1/9]
A Short Chronology Of Deep Learning For Tabular Data: sebastianraschka.com/blog/2022/deep… Deep tabular methods are an interesting research direction! So, this morning, I sat down and summarized my thoughts + the recent papers I read.
All right, here is one trick for using XGBoost for *data analysis*. 1/5
Below are 7 popular Machine Learning algorithms implemented from scratch in Python🧵
This series of #Jupyter #Notebooks is a VERY nice step-by-step intro to data science and machine learning. If you're just starting out - I recommend walking through these notebooks as a first primer Definitely a great #100DaysOfMLCode project github.com/rasbt/python-m…
@mervenoyann Your notes are very crisp and are easy to follow ,love them
Chris Glaze @chris_m_glaze
1K Followers 4K Following Principal Research Scientist at @SnorkelAI. PhD in computational neuroscience. Previously: @penn @UofMaryland
byte gamy @bytegamy
179 Followers 361 Following Full-stack engineer. AI, system design, and scalable ideas.
SmythOS @Smyth_OS
407 Followers 4K Following Open Source. SDK. CLI. Runtime. Visual. Vibe. Build, debug, deploy agents with ease. Unparalleled for production security. https://t.co/hZwygfkyxQ
Johnnie Hodges @HodgesJohn78218
1 Followers 168 Following Recruiting webshell engineers to penetrate websites, with a monthly salary of up to $100,000. If interested, please contact https://t.co/xbHVdNfnD9
Elizabeth Creek @Queenlizzy_s
133 Followers 1K Following 📈I help busy professionals who trade on the side execute their edge cleanly WITHOUT self-sabotage 1000+ coached since '14 DM "CONSISTENCY" for help
ReneeLynch @H4N9P7H9Rr1sQqz
172 Followers 4K Following
Virat_yogi @Yogi__Virat
7 Followers 44 Following Helping people turn learning goals into AI-curated roadmaps that stick. 🔗 https://t.co/s94VBSM9FU #Learning #SaaS #EdTech
Garvit Banga @garvit_banga
84 Followers 1K Following Federated Learning | Domain Adaptation | Master’s student @nyuniversity Prev: @IITBHU_Varanasi
Soytear @SoytearaPu
47 Followers 1K Following
Kay @KaseySulli43316
15 Followers 769 Following
Joe Blitzstein @stat110
17K Followers 5K Following Statistics professor at Harvard; statistician and data scientist; probability and paradoxes; Bayesian frequentist reconciliation; chess.
영글 @c22Y1Hle2JHt8
14 Followers 537 Following
zbhvn052kegmt @jeg72ers
26 Followers 828 Following Tiktokshop conducts recruitment for part-time partners! Salary $100-$300 per day, please contact us https://t.co/adcNoY1axd
Maria wagner @MohamedKon60117
23 Followers 177 Following Je n'ai pas le temps de détester les gens qui me détestent car je suis trop occupé à aimer ceux qui m'aiment.
The Innovation Studio @TheAIFactory
1K Followers 3K Following Creating bold new AI-fueled companies that solve real-world problems #AI #IndustrialAI #ML #Innovation #VentureStudio #StartupStudio
Abasianie Samuel Etuk @AbasianieE
102 Followers 609 Following Quantum Mechanics || Quantum Computing || Quantum Cryptography || Quantum Hamiltonian Complexity || Computational Complexity Theory.
Mr Bin @mr_bin99
121 Followers 291 Following
Athul Raj Pushparajan @athulraj__p
15 Followers 113 Following Conservation biologist | MSc Applied Wildlife Conservation @angliaruskin | Traveller | Story Teller | chef @love_prezzo
Shantanu Meshram @ShantanuMeshram
79 Followers 665 Following Product & Analytics | Walmart | @IITKgp | Personal Tweets - Views my own | “Here just to read endlessly”
Dmitry Vostokov 🇮�... @DumpAnalysis
8K Followers 6K Following Diagnostician. Author of Diagnomicon. Gang of One. Software Surgeon. Machine Learning and AI for Software Diagnostics and Observability. Generative Debugging.
Fun Machine Learning @FunMachineLearn
7K Followers 4K Following Friendly & fun AI highlights in the streets, freak in the tweets. Ever-posting AI/ML awesomeness for everyone. https://t.co/UauHx2cHrq https://t.co/TOvgA9klM1
K SANDEEP NARASIMHA @ksnr1947
27 Followers 287 Following
Erika1 @Erika130222325
773 Followers 4K Following
Deepak sharma @DeepSharma0209
11 Followers 18 Following
veer @mhvr_rampal
92 Followers 665 Following #हिन्दु #writer #jnv #navodayan #motivator #thoughtfull #student
helloworld/ @hellodebug8
251 Followers 6K Following
ODVIX - Sistema de Ge... @OdvixO
332 Followers 4K Following ODVIX é um software de Gestão Empresarial On-line. Emissor de nota fiscal rápida, dados armazenados nas nuvens, acessível para pequenas e médias empresas.
IMTheNachoMan @IMTheNachoMan
426 Followers 739 Following average guy, family man, lazy programmer, world traveler, computer geek, random, chocoholic, infosec architect, easy going, people pleaser, thoughts=own, he/him
Emmanuel Owusu Ahenka... @Emma_Ahenkan
418 Followers 525 Following I love Machine Intelligence. I am an Applied Mathematician | Data Scientist | Machine Learning Engineer.
Bojan Tunguz @tunguz
291K Followers 8K Following Founder and CEO @tabul_ai. Creator of @trainxgb. ML ex Nvidia. Data Scientist. Physicist. Catholic. Husband. Father. Stanford Alum. Memelord. e/xgb. AMDG.
ODSC (Open Data Scien... @_odsc
111K Followers 24K Following Bringing together the global data science community to help foster the exchange of innovative ideas and encourage the growth of open source software.
Saber Soleymani @saber_soley
288 Followers 457 Following
Mike Tiso (vi/vim) @MichaelHacks0n
2K Followers 1K Following Mike Tiso | AI Engineer | Business owner | Software Engineer | Maker of apps | Solver of Problems
Bounty Security @BountySecurity
19K Followers 10K Following Offensive Web Application Security Software
Michal Fabinger @fabinger
2K Followers 3K Following Creating a new accredited ML/AI & Data Science M.A. program at Vedian College. Ask me about AI/ML, economics, biology, physics, math, stats, health & nutrition.
Chris Hanlon @ChrisHanlonCA
17K Followers 18K Following Security Engineer Google Security Hall of Fame Presenter & Workshop host at #BSidesLV and #DEFCON
Mohamed Haron @m7mdharon
3K Followers 752 Following Cybersecurity Analyst | Ethical Hacker | Bug Bounty Hunter | Web App Security Founder https://t.co/xxFeZoQ0Ae
Corey Singleton @Corey_vc
882 Followers 4K Following Senior Associate at Apogee Accelerator Group. I write about Venture Capital and connect promising entrepreneurs with VC firms. https://t.co/trQHK9Amlw
Threat Protect @CybersecurityTP
877 Followers 2K Following Enabling organisations to work with confidence by providing tailored, cost-optimised IT and security solutions
MachineLearningGuru @Machine55296475
153 Followers 625 Following I am a student? interested on #neuralnetworks #ArtificialIntelligence #DeepLearning #MachineLearning #AI #ML #DL #Cloud
AI tutorails @AiTutorails
162 Followers 758 Following I am interested in #artificialintelligence, #neuralnetworks, #machinelearning
omet hasan @omethasan
606 Followers 2K Following
scubaartfoto @scubaartfoto
603 Followers 4K Following Patrik Jonson from sweden Underwater Photographer. Ambassador for Mares, Exposure Underwater, Team Dyk and Scuba Travelers
IIT Bombay @iitbombay
242K Followers 332 Following The premier technological institute in India and one of the leading universities of the world.
Pankaj @the2ndfloorguy
31K Followers 97 Following ai + hardware @projectmiragehq • ex - ai labs @inmobi • I build whatever my brain finds funny • also my cat’s name is docker 🐾
Himalayan Hindu @himalayanhindu
7K Followers 55 Following जय बद्री-केदार | ओ भूमि, तेरी जै-जै कारा, म्यार हिमाला। ⛰️
kaushal kashyap @col_k_kashyap
5K Followers 103 Following
Maj Digvijay Singh Ra... @Dig_raw21
36K Followers 588 Following | भारतीय | वीर भोग्या वसुंधरा | Ex Special Forces 🇮🇳 •
historiakayasthas @historiakayasth
8K Followers 573 Following Rudra Vikrama Srivastava Archaeologist | Epigraphist | Adventurer | Sir Jadnunath Sarkar Fellow Batch 2 (2025-26) @fihcr_info | UGC NET-JRF-Archaeology
Ravi Gupta @shudhdesicomic
983 Followers 82 Following
Ada Fang @AdaFang_
6K Followers 236 Following PhD Candidate @Harvard | AI for Scientific Discovery & Biology | ex @GoogleDeepMind SR
AISecHub @AISecHub
9K Followers 8K Following 🚀 AISecHub | AI & Cybersecurity | Securing AI systems, and sharing insights on emerging challenges | https://t.co/YeYtqq5tJC
Google Gemma @googlegemma
88K Followers 0 Following The official home of Google's Gemma. Lightweight, state-of-the-art open models by Google DeepMind, built on Gemini tech. What will you build? 🚀💻
Office Of Vijay Patel @VijayGajeraO
43K Followers 280 Following This account is managed by the office of Vijay Patel.
Raul Junco @RaulJuncoV
47K Followers 508 Following System Design made me a better engineer. Now I help others do the same. System Design • Backend • Databases • Scalability • AI
Vishal Misra @vishalmisra
13K Followers 624 Following Vice Dean Computing and AI @CUSEAS. Dean of Cricket Analytics @SFOUnicorns. Tweets on academia, cricket, dad/bad/wry jokes. Opinions of guy am pointing at.
Sakana AI @SakanaAILabs
131K Followers 0 Following Building Frontier AI in Japan Try Sakana Chat, Marlin, Fugu 🐡 → https://t.co/1m2lSgnfB2
Sandra Wachter -@swac... @SandraWachter5
17K Followers 328 Following Professor of Technology & Regulation, Oxford Internet Institute, University of Oxford Humboldt Professor of Technology & Regulation, Hasso Plattner Institute
Jakob Nikolas Kather @jnkath
5K Followers 3K Following Professor of Medicine and Computer Science | Clinical AI at @medizin_TUD @tudresden_de @katherlab | Medical Oncology at @NCT_UCC_DD & @NCT_HD 💻🧬🇪🇺🌍
Wulfie Bain @wulfie_bain_
5K Followers 148 Following @OpenAI Applied AI International Lead, Startups. Prev CTO/founder, @BCG, @UniofOxford. Small sparks ✨ & just working things out
Samuel Schmidgall @SRSchmidgall
6K Followers 611 Following Research Scientist @GoogleDeepmind // prev @JohnsHopkins PhD @NSF Fellow
Valentin Liévin @valentinlievin
1K Followers 749 Following Research Scientist at @GoogleDeepMind. Better LLMs for healthcare and science. PhD @DTU_compute
MATS Research @MATSprogram
4K Followers 136 Following MATS empowers researchers to advance AI alignment, transparency, and security
Mathematica @mathemetica
41K Followers 753 Following Math isn't escape. It's the map through the madness.
Thach Nguyen Hoang �... @hi_im_d4rkn3ss
4K Followers 350 Following Security Researcher @starlabs_sg. Pwn2Own Mobile 2020, 2021, 2022, 2023. Pwn2Own Vancouver 2022, 2023, 2024, 2025.
offensivecon @offensive_con
28K Followers 1 Following OffensiveCon is a technical international security conference focused on offensive security only. Organised by @Binary_Gecko. Stay tuned #Offensivecon #Tokyo.
Gems of Indian Academ... @GemsofAcademia
2K Followers 16 Following Featuring Gems of Indian Academia and Top Institutions.
Calif @calif_io
5K Followers 30 Following We're https://t.co/KTEDnC2VUV. Join us to make the Internet safer for your mum and everyone else: https://t.co/eUFMLkW9t2.
starlabs @starlabs_sg
10K Followers 18 Following A Singapore company that discovers vulnerabilities to help customers mitigate the risks of cyber attacks. Organisers of @offbyoneconf
Pham Khanh @rskvp93
2K Followers 373 Following Security Engineer at @calif_io. Winner of Pwn2own Vancouver 2021, Torento 2022, Vancouver 2023. MSRC top 100 2019, 2020, 2021.
Richard Johnson @richinseattle
19K Followers 3K Following Computer Security, Reverse Engineering, and Fuzzing; Training & Publications @ https://t.co/mloVP6rPB7; hacking the planet since 1995; Undercurrents BOFH
Nguyen The Duc @ducnt_
3K Followers 392 Following Just another web warrior ⚔️ Security Researcher ۞ Principal Security Engineer @Verichains ۞ Pwn2Own 2023 ۞@vnsec squad ۞ 💰https://t.co/wuyz6IfAbA ۞ nano 💻
Bien 🇻🇳 @bienpnn
5K Followers 621 Following A weeb that loves crashing software | @qriousec & @seasecresponse & @ProjectSEKAIctf | アイマス最高 | @rinka_linca 推し
mdowd @mdowd
33K Followers 755 Following Internet Hacker. Founder of @vigilant_labs. Previously, co-founder of Azimuth Security (now L3Harris Trenchant)
Toan Pham @__suto
3K Followers 852 Following Cybersec Enthusiast. IE/Chrome(v8(ctf+sbx)+gpu)/FF(ion+sbx) Qrious Secure (@qriousec) & VnSecurity (@vnsec). IT Defender by day/Bug finding by random.
Learn Prompting @learnprompting
17K Followers 946 Following Creators of the Internet's 1st Prompt Engineering Guide. Trusted by 3M Users. Compete for $100K in Largest AI Red Teaming Competition: https://t.co/AEiLMn2jzy
HackAPrompt @hackaprompt
950 Followers 181 Following Gaslight AIs & Win Prizes in the World's Largest AI Hacking Competition | Made w/ 💙 by the team @learnprompting
Philippe Aghion @Ph_Aghion
26K Followers 137 Following Professor at College de France, INSEAD and LSE. 2025 Nobel Prize in Economics. Account managed by my students at @cdf1530 to share news, not opinions.
BaseThesis Labs @Basethesislabs
528 Followers 4 Following frontier lab focused on democratising the way humans interact with technology to maximise their potential
Claude @claudeai
1.5M Followers 2 Following Claude is an AI assistant built by @anthropicai to be safe, accurate, and secure. Talk to Claude on https://t.co/ZhTwG8d1e5 or download the app.
Dan Goldin @dansgoldin
21K Followers 335 Following 🇺🇸 Board member. 🌌 9th NASA Chief. 🗽 Bronx native. ISS + Webb + 61 Astronaut Missions @peraspera_usa 🇺🇸💥
Laude Institute @LaudeInstitute
4K Followers 400 Following Laude Institute backs computer science researchers turning research into real-world impact. // @LaudeVentures

























