Thanks to the incredible team at DeepReinforce AI. We have a 9b model that's being compared with qwen3.5 35b.
AND IT FITS IN AN 8GB GPU/LAPTOP.
State-of-the-Art Coding Agents: Available in 9B-Dense, 31B-Dense, 35B-MoE, and 397B-MoE (post-trained on top of Gemma 4 and Qwen 3.5), achieving state-of-the-art performance among open-source models of comparable size on coding benchmarks such as Terminal-Bench 2.1, SWE-Bench, NL2Repo and OpenClaw.
Self-Improving Training Framework: Ornith-1.0 employs RL to learn to generate not only solution rollouts, but also the scallfold that drive those rollouts. By jointly optimizing the scaffold and the resulting solution, the model discovers better search trajectories and generates higher-quality solutions.
Licence: MIT licensed, globally accessible, and free from regional limitations.
I've used this model for 48 hours now and the results are very promising
(posting results in a few days!)
Check it out here: