We love our community 🫶 thanks for an amazing guide for folks to use
dredyson.com/complete-begin…
"Atlas delivers stable performance without the latency spikes... can run sustained workloads for hours without degradation... matters more in practice than a high watermark number"🔥
Atlas Inference is in transformers 🔥 github.com/huggingface/tr…
With kernel-builder, served with Huggingface Hub 🌎
On a DGX Spark/GB10 there was no compiled fast path, so Qwen3.6 fell back to slow torch GDN. It now auto-loads our fast kernel instead. First of many @huggingface!
Happy to share that @AtlasInference is helping shape the MLPerf Edge LLM benchmark with @MLCommons taskforce 📢
We'll be contributing cross-architecture validation on @NVIDIAAI DGX Spark and @AMD Strix Halo. More details coming after official submissions later this year 📊
Cross-architecture from a single codebase is exactly why we built SCALE. Thrilled to see @AtlasInference getting this running! More performance optimizations for both @AMD and @nvidia are on the way.
scale-lang.com
Atlas Inference is running Qwen3.6-27B on AMD Strix Halo 🥳
Using @SpectralCom's SCALE ROCm backend, our CUDA kernels compile and run on RDNA⚙️
Cross-architecture inference from ONE codebase 🗣️
Thank you @AIatAMD for the gift 🙏
POC ✅ excited to keep tuning performance⚡️
@RisingSayak @NVIDIAAI Makes sense. We technically support vision for the Qwen3.6-suite but maybe not exactly what you're looking for just yet. Happy to build for any fitting use cases though!
@seree Thanks for taking the time to run through these! I think the default mem allocation may be higher than needed for a smaller dense model like this. Plz dm or post the details in #bugs regarding any of these other pieces, should be customizable/avoidable :) appreciate the feedback
@Alibaba_Qwen Excited to try Qwen3.7-Max (plz OSS release soon🙏) Look at how deep we are into optimizing @Alibaba_Qwen:
3.5/3.6-35B, 3.5/3.6-27B, 3.5-122B (EP=2), 3-Next-80B (GDN/Mamba-2), 3-VL, 3-Coder. Achieved 130 tok/s on 3.5-35B. The Qwen series is genuinely WHY we built Atlas!
It’s official: @AtlasInference is now a @Alibaba_Qwen ambassador! 🤝
Our mission started with Qwen. It remains our top priority and most optimized series. Qwen revolutionized open-source AI, and we’re excited to keep pushing its limits ⚡️
Thank you to our amazing community❤️🔥
@torfi_F_Olafss @huggingface Yes we optimize per {model}_{quant} pair! So to answer your question @torfi_F_Olafss this should definitely help the NVFP4 kernel landscape.
Also just as a random sidenote I have many more hours on Minecraft than Atlas inference so take that as you will lol
DGX Spark lovers 🚨
Thank you @huggingface for merging SM_121 support into kernel-builder, every dev can now pull optimized kernels via get_kernel() 🚀
@AtlasInference pushed to make sure the DGX Spark community had representation 💾
Let's keep squeezing these GB10 chips 📈