culi53 minutes ago
It's nice to see more focus on efficiency. All the recent new model releases have come along with massive jumps in certain benchmarks but when you dig into it it's almost always paired with a massive increase in token usage to achieve those results (ahem Google Deep Think ahem). For AI to truly be transformational it needs to solve the electricity problem
tankenmate26 minutes ago
And not just token usage, expensive token usage; when it comes to tokens/joule not all tokens are equal. Efficient use of MoE architectures does have an impact on tokens/joule and tokens/sec.
kristianp6 hours ago
Recent model released a couple of weeks ago. "Mixture of Experts (MoE) architecture, it selectively activates only 11B of its 196B parameters per token". Beats Kimi K2.5 and GLM 4.7 on more benchmarks than it loses to them.

Edit: there are 4 bit quants that can be run on an 128GB machine like a GB10 [1], AI Max+ 395, or mac studio.

[1] https://forums.developer.nvidia.com/t/running-step-3-5-flash...

danieltanfh954 hours ago
Hallucinates like crazy. use with caution. Tested it with a simple "Find me championship decks for X pokemon", "How does Y deck work". Opus 4.6, Deepseek and Kimi all performed well as expected.
wmf5 hours ago
That reverse x axis sure is confusing.
esafak3 hours ago
I imagine they thought they'd look better this way. I don't think they do.
SilverElfin4 hours ago
So who exactly is StepFun? What is their business (how do they make money)? Each time I click “About Stepfun” somewhere on their website, it sends me to a generic landing page in a loop.
kristopolous15 minutes ago
They've been around a couple years. This is the first model that has really broken into the anglosphere.

Keep a tab on aihubmix, the Chinese openrouter, if you want to stay on top of the latest models. They keep track of things like the Baichuan, Doubao, baai (beijing academy), Meituan, 01.AI (yi), xiaomi, etc...

Much larger chinese coverage than openrouter

0x19974 hours ago
SilverElfin3 hours ago
Thanks. Do they sell any of these products today or is it more like research? I am not able to find anything relating to pricing on their website. Just a chatbot.
0x19972 hours ago
Princing can be found on their docs website https://platform.stepfun.ai/docs/en/pricing/details
deaux4 hours ago
Might want to give it a search.
agentifysh2 hours ago
what country is behind this one ?
personalcompute2 hours ago
Step 3.5 Flash was made by Chinese company StepFun - https://en.wikipedia.org/wiki/StepFun