Reasoning Models

Chain of thought, thinking models, math and logic reasoning

293 articles across 96 editions

Articles

  1. Consistency of Large Reasoning Models Under Multi-Turn Attacks -- 2026-02-16
  2. [Editorial] AI Testing and Quality Engineering -- 2026-02-16
  3. [Editorial] https://github.com/GMaN1911/claude-cognitive -- 2026-01-02
  4. SA-RAG: Using spreading activation to improve multi-hop retrieval in RAG systems -- 2026-01-02
  5. Is there a way to see what is trashing my context? -- 2026-01-02
  6. A zero-setup agent that benchmarks multiple open / closed source LLMs on your specific problem / data -- 2026-01-02
  7. What is a good model for assisting with patching source code? -- 2026-01-02
  8. Just got an RTX Pro 6000 - need recommendations for processing a massive dataset with instruction following -- 2026-01-02
  9. MiniMaxAI/MiniMax-M2.1 -- 2026-01-02
  10. [Editorial] https://www.linkedin.com/posts/stuart-winter-tear_realist-and-pluralist-conceptions-of-intelligence-activity-7397231918871703554-FmSP -- 2025-12-11
  11. VulnLLM-R: Specialized Reasoning LLM with Agent Scaffold for Vulnerability Detection -- 2025-12-11
  12. Nanbeige4-3B: Lightweight with strong reasoning capabilities -- 2025-12-10
  13. mistralai/Devstral-2-123B-Instruct-2512 -- 2025-12-10
  14. MDAR: A Multi-scene Dynamic Audio Reasoning Benchmark -- 2025-12-04
  15. PrimeIntellect/INTELLECT-3 -- 2025-12-04
  16. cerebras/MiniMax-M2-REAP-162B-A10B -- 2025-12-04
  17. 62-day fixed-prompt probe on Grok-4: strong semantic attractors, thematic inversion, and refusal onset (1,242 samples, fully public) -- 2025-12-03
  18. I built an open-source "Passport" for Claude Agents (MCP) so they can cryptographically sign their own actions -- 2025-12-01
  19. Implemented Anthropic's Programmatic Tool Calling with Langchain so you can use it with any models and tune it for your own use case -- 2025-12-01
  20. CodeModeToon -- 2025-12-01
  21. WeiboAI/VibeThinker-1.5B -- 2025-11-28
  22. [Editorial] https://ai.google.dev/gemini-api/docs/prompting-strategies#agentic-si-template -- 2025-11-28
  23. An explainer blog on attention, KV-caching, continuous batching -- 2025-11-28
  24. I built an open-source CLI that generates context.json bundles for React/TypeScript projects -- 2025-11-28
  25. GraphLite: An Embeddable Graph Database with ISO Graph Query Language Support -- 2025-11-26
  26. allenai/Olmo-3-32B-Think -- 2025-11-25
  27. tencent/HunyuanOCR -- 2025-11-25
  28. peteromallet/Qwen-Image-Edit-InScene -- 2025-11-25
  29. [Editorial] https://www.linkedin.com/posts/stuart-winter-tear_decision-making-amid-information-based-threats-activity-7396539314815533056-4pVx -- 2025-11-18
  30. [Editorial] https://gist.github.com/ruvnet/d6d2739400943037443b78c3ef86d8a5 -- 2025-11-18
  31. [Editorial] https://github.com/mrwadams/stride-gpt/blob/master/docs/operationalization-guide.md -- 2025-11-18
  32. janhq/Jan-v2-VL-high -- 2025-11-18
  33. Loong: Synthesize Long Chain-of-Thoughts at Scale through Verifiers -- 2025-11-18
  34. [Editorial] https://arxiv.org/pdf/2506.21734 -- 2025-11-11
  35. OPERA: A Reinforcement Learning--Enhanced Orchestrated Planner-Executor Architecture for Reasoning-Oriented Multi-Hop Retrieval -- 2025-11-11
  36. [Editorial] https://www.linkedin.com/posts/andriyburkov_this-paper-shows-a-27-million-parameter-model-activity-7393432619365052416-SFLO -- 2025-11-10
  37. Trajectory Distillation for Foundation Models -- 2025-11-10
  38. sail-sg/Precision-RL -- 2025-11-10
  39. inclusionAI/LLaDA2.0-flash-preview -- 2025-11-10
  40. AI Agents Reasoning Collapse Imminent (CMU, Berkeley) -- 2025-11-02
  41. Natural Language Programming: Run Natural Language as Script -- 2025-11-02
  42. Claude Code is a Beast – Tips from 6 Months of Hardcore Use -- 2025-11-02
  43. Latent Sketchpad: Sketching Visual Thoughts to Elicit Multimodal Reasoning in MLLMs -- 2025-10-31
  44. [Editorial] https://www.linkedin.com/posts/anthony-alcaraz-b80763155_ai-agents-cant-reason-without-semantic-structure-activity-7389222435906244608-QMMc -- 2025-10-31
  45. Raezil/lattice-agent -- 2025-10-31
  46. driaforall/mem-agent -- 2025-10-31
  47. Memp: Exploring Agent Procedural Memory -- 2025-10-31
  48. I built the HuggingChat Omni Router 🥳 🎈 -- 2025-10-28
  49. Claude Code 2.0.27 -- 2025-10-28
  50. ltjed/freephdlabor -- 2025-10-28
  51. Show HN: Whatdidido – CLI to summarize your work from Jira/Linear -- 2025-10-28
  52. Reasoning should be thought of as a drawback, not a feature -- 2025-10-21
  53. inclusionAI/Ring-1T-preview -- 2025-10-21
  54. Learning Lifted Action Models From Traces of Incomplete Actions and States -- 2025-10-20
  55. I got tired of OpenAI dependency. Built a multi-LLM control center instead. -- 2025-10-19
  56. Turn ChatGPT into a real-time meeting assistant (via MCP + Apps SDK) -- 2025-10-19
  57. Claude Code taking a coffee break 🤔 -- 2025-10-19
  58. Show HN: Cmux – Coding Agent Multiplexer -- 2025-10-19
  59. [Editorial] Sqlite vector -- 2025-10-17
  60. Meta Superintelligence group publishes paper on new RAG technique -- 2025-10-17
  61. [Editorial] ReasoningBank is a self-learning, local-first memory system -- 2025-10-16
  62. [Editorial] ReasoningBank is a self-learning, local-first memory system -- 2025-10-16
  63. I tested if tiny LLMs can self-improve through memory: Qwen3-1.7B gained +8% accuracy on MATH problems -- 2025-10-16
  64. Tested 9 RAG query transformation techniques – HydE is absurdly underrated -- 2025-10-16
  65. GPT-OSS from Scratch on AMD GPUs -- 2025-10-11
  66. How do I compare cost per token for serverless vs provisioned hardware? -- 2025-10-11
  67. OpenAI is good at deals -- 2025-10-11
  68. meituan-longcat/LongCat-Flash-Chat -- 2025-10-11
  69. adb1274/batchi -- 2025-10-11
  70. What are the best models for legal work in Oct 2025? -- 2025-10-07
  71. [Update] FamilyBench: New models tested - Claude Sonnet 4.5 takes 2nd place, Qwen 3 Next breaks 70%, new Kimi weirdly below the old version, same for GLM 4.6 -- 2025-10-07
  72. princeton-pli/RLMT -- 2025-10-07
  73. TGPO: Tree-Guided Preference Optimization for Robust Web Agent Reinforcement Learning -- 2025-10-07
  74. DSpAST: Disentangled Representations for Spatial Audio Reasoning with Large Language Models -- 2025-10-05
  75. swiss-ai/Apertus-8B-2509 -- 2025-10-04
  76. Qwen3-Omni thinking model running on local H100 (major leap over 2.5) -- 2025-09-30
  77. Seeking Advice: Best Model + Framework for Max Tokens/sec on Dual L40S (Testing Rig) -- 2025-09-30
  78. For local models, has anyone benchmarked tool calling protocols performance? -- 2025-09-30
  79. A step by step guide on how to build a LLM from scratch -- 2025-09-28
  80. YannQi/R-4B -- 2025-09-28
  81. New Agent benchmark from Meta Super Intelligence Lab and Hugging Face -- 2025-09-27
  82. evalops/dspy-micro-agent -- 2025-09-27
  83. nvidia/NVIDIA-Nemotron-Nano-9B-v2 -- 2025-09-27
  84. inclusionAI/Ling-flash-2.0 -- 2025-09-27
  85. Squeeze the Soaked Sponge: Efficient Off-policy Reinforcement Finetuning for Large Language Model -- 2025-09-26
  86. CogniSQL-R1-Zero: Lightweight Reinforced Reasoning for Efficient SQL Generation -- 2025-09-26
  87. A1: Asynchronous Test-Time Scaling via Conformal Prediction -- 2025-09-25
  88. DeepLink-org/DeepTrace -- 2025-09-25
  89. GLM 4.5 Air Template Breaking llamacpp Prompt Caching -- 2025-09-25
  90. Tracking prompt evolution for RAG systems - anyone else doing this? -- 2025-09-25
  91. MAESTRO v0.1.6 Update: Better support for models that struggle with JSON mode (DeepSeek, Kimi K2, etc.) -- 2025-09-25
  92. Dead-simple example code for Ollama function calling. -- 2025-09-25
  93. nvidia/NVIDIA-Nemotron-Nano-12B-v2 -- 2025-09-23
  94. DeepSeek-R1 incentivizes reasoning in LLMs through reinforcement learning -- 2025-09-22
  95. Atom-Searcher: Enhancing Agentic Deep Research via Fine-Grained Atomic Thought Reward -- 2025-09-22
  96. support for the upcoming Olmo3 model has been merged into llama.cpp -- 2025-09-21
  97. Running Nvidia CUDA Pytorch/vLLM projects and pipelines on AMD with no modifications -- 2025-09-21
  98. A Quick Look At The AMD Instinct MI355X With ROCm 7.0 -- 2025-09-21
  99. Uncensored AI model for from 4b Max 8b -- 2025-09-21
  100. GPT-OSS-120B Performance Benchmarks and Provider Trade-Offs -- 2025-09-20
  101. Why are there three different Codex variants? -- 2025-09-20
  102. zli12321/Vision-SR1 -- 2025-09-19
  103. lrzjason/Comfyui-QwenEditUtils -- 2025-09-19
  104. Mini-o3/Mini-o3 -- 2025-09-19
  105. vLLM is kinda awesome -- 2025-09-19
  106. Public AI on Hugging Face Inference Providers 🔥 -- 2025-09-19
  107. HyST: LLM-Powered Hybrid Retrieval over Semi-Structured Tabular Data -- 2025-09-19
  108. GPT-OSS:20b & Qwen 4b are a match made in heaven for 24GB VRAM builds -- 2025-09-18
  109. Was working in RAG recently got to know how well Gemma3 4B performs -- 2025-09-18
  110. [Editorial] which patterns truly survived compression -- 2025-09-16
  111. [Editorial] AI Kill Chain -- 2025-09-16
  112. TsinghuaC3I/Unify-Post-Training -- 2025-09-16
  113. [Editorial] Tricks from OpenAI gpt-oss YOU can use with transformers -- 2025-09-15
  114. openbmb/MiniCPM4.1-8B -- 2025-09-15
  115. nunchaku-tech/nunchaku-qwen-image -- 2025-09-15
  116. ggml-org/gpt-oss-20b-GGUF -- 2025-09-15
  117. MBZUAI releases K2 Think. 32B reasoning model based on Qwen 2.5 32B backbone, focusing on high performance in math, coding and science. -- 2025-09-14
  118. unsloth/Qwen3-Coder-30B-A3B-Instruct-1M-GGUF -- 2025-09-14
  119. [vllm] Hints to run Qwen3-235B MoE on 8x AMD mixed cards! -- 2025-09-12
  120. Inference for 24 people with a 5000€ budget -- 2025-09-12
  121. $142 upgrade kit and spare modules turn Nvidia RTX 4090 24GB to 48GB AI card -- 2025-09-12
  122. Tricks from OpenAI gpt-oss YOU 🫵 can use with transformers -- 2025-09-12
  123. Small Open Models Achieve Near Parity with Large Models in Low Resource Literary Translation at a Fraction of the Cost -- 2025-09-10
  124. Reasoning Vectors: Transferring Chain-of-Thought Capabilities via Task Arithmetic -- 2025-09-10
  125. Mini-o3: Scaling Up Reasoning Patterns and Interaction Turns for Visual Search -- 2025-09-10
  126. Introducing FineVision: a huge open-source dataset for training SOTA Vision Language Models -- 2025-09-10
  127. wildminder/ComfyUI-VibeVoice -- 2025-09-10
  128. bytedance/USO -- 2025-09-10
  129. Wan-AI/Wan2.2-I2V-A14B -- 2025-09-10
  130. [Editorial] Update from Anthropic regarding their poor perfomance of late -- 2025-09-09
  131. LiquidAI/LFM2-VL-450M -- 2025-09-09
  132. An LLM-powered Natural-to-Robotic Language Translation Framework with Correctness Guarantees -- 2025-09-09
  133. Qwen3 30B A3B 2507 Hybrid Deep Reasoning Showcase -- 2025-09-08
  134. Is the "cost of inference" going up or down? -- 2025-09-08
  135. Smartphone Sensors Unlocked: Turn Your Phone into a Physics Lab -- 2025-09-08
  136. UniSLU: Unified Spoken Language Understanding from Heterogeneous Cross-Task Datasets -- 2025-09-08
  137. Voice cloning -- 2025-09-08
  138. haasonsaas/dspy-0to1-guide -- 2025-09-06
  139. 16 reproducible failures → upgraded into a 300+ page Global Fix Map. one link inside, feedback wanted -- 2025-09-06
  140. Show HN: Entropy-Guided Loop – How to make small models reason -- 2025-09-06
  141. Kwaipilot/KAT-V1-40B -- 2025-09-06
  142. Beyond Scaling Law: A Data-Efficient Distillation Framework for Reasoning -- 2025-09-06
  143. nasa-ibm-ai4science/Surya-1.0 -- 2025-09-06
  144. Context Reasoning Benchmarks: GPT-5, Claude, Gemini, Grok on Real Tasks -- 2025-09-05
  145. The CLAUDE.md Framework: A Guide to Structured AI-Assisted Work (prompts included) -- 2025-09-05
  146. Team-intN18-SoybeanSeclab/Typhon -- 2025-09-05
  147. DatarusAI/Datarus-R1-14B-preview -- 2025-09-05
  148. Training & Querying 3 Ollama Models with Zer00logy: Symbolic Cognition Framework and Void-Math OS -- 2025-09-04
  149. I'm building local, open-source, fast, efficient, minimal, and extendible RAG library I always wanted to use -- 2025-09-03
  150. Creating the brain behind dumb models -- 2025-09-03
  151. 🌟Introducing Art-0-8B: Reasoning the way you want it to with Adaptive Thinking🌟 -- 2025-09-02
  152. Fine Tune Model for Home Assistant? -- 2025-09-02
  153. DeepSeek V3.1 improves on the multiplayer Step Game social reasoning benchmark -- 2025-08-31
  154. I built Husk, a native, private, and open-source iOS client for your local models -- 2025-08-31
  155. Would a “Knowledge Coverage Audit” tool be useful for RAG/chatbot builders? -- 2025-08-30
  156. baichuan-inc/Baichuan-M2-32B -- 2025-08-28
  157. Hierarchical Reasoning Model (HRM) implementation for text generation -- 2025-08-27
  158. Datarus-R1-14B-Preview, an adaptive multi-step reasoning LLM for automated data analysis -- 2025-08-24
  159. Fully Open source, serverless, community-driven MCP alternative built in Python, TS and Go -- 2025-08-24
  160. unsloth/Kimi-K2-Instruct-GGUF -- 2025-08-24
  161. DeepSeek V3.1 Reasoner improves over DeepSeek R1 on the Extended NYT Connections benchmark -- 2025-08-24
  162. DeepSeek-V3.1 (Thinking and Non Thinking) -- 2025-08-22
  163. Modify <think> to explore the impact on <answer> -- 2025-08-22
  164. Tiny finance “thinking” model (Gemma-3 270M) with verifiable rewards (SFT → GRPO) — structured outputs + auto-eval (with code) -- 2025-08-22
  165. Qwen/Qwen3-30B-A3B-Thinking-2507 -- 2025-08-22
  166. tencent/Hunyuan-7B-Instruct -- 2025-08-22
  167. 🐧 llama.cpp on Steam Deck (Ubuntu 25.04) with GPU (Vulkan) — step-by-step that actually works -- 2025-08-22
  168. Running Qwen3-Coder-30B-A3 Q4_LM in Cursor with Agent Mode unlocked -- 2025-08-22
  169. Docker now support AI Models, anyone using it? -- 2025-08-22
  170. Why does gpt-oss 120b run slower in ollama than in LM Studio in my setup? -- 2025-08-22
  171. Speculative decoding in archgw candidate release 0.4.0. Could use feedback, -- 2025-08-16
  172. Nvidia Tilus: A Tile-Level GPU Kernel Programming Language -- 2025-08-16
  173. SynapseRoute: An Auto-Route Switching Framework on Dual-State Large Language Model -- 2025-08-16
  174. HoML: vLLM's speed + Ollama like interface -- 2025-08-15
  175. HoML vs. Ollama: A Deep Dive into Performance -- 2025-08-15
  176. Sampler Settings for GLM 4.5-Air -- 2025-08-15
  177. Fully verbal LLM program for OSX using whisper, ollama & XTTS -- 2025-08-15
  178. Closing the Modality Gap for Mixed Modality Search -- 2025-08-07
  179. ByteDance drops Seed-Prover -- 2025-08-06
  180. naver-hyperclovax/HyperCLOVAX-SEED-Think-14B -- 2025-08-06
  181. Context Management by Trimming Conversation -- 2025-08-06
  182. Exploiting Primacy Effect To Improve Large Language Models -- 2025-08-06
  183. [Editorial] HRM -- 2025-08-03
  184. How are people running an MLX-compatible OpenAI API server locally? -- 2025-08-03
  185. I built the perfect MCP client for broke developers (Ollama powered) -- 2025-08-03
  186. character-ai/pipelining-sft -- 2025-08-03
  187. CoexistAI – LLM-Powered Research Assistant (Now with MCP, Vision, Local File Chat, and More) -- 2025-08-02
  188. Best <2B open-source LLMs for European languages? -- 2025-08-02
  189. Local TTS quality -- 2025-08-02
  190. [Editorial] The Anatomy of a Modern LLM -- 2025-07-31
  191. PowerInfer/SmallThinker-21BA3B-Instruct -- 2025-07-31
  192. Has vLLM made Ollama and llama.cpp redundant? -- 2025-07-30
  193. [Editorial] Alternative to vector db rag -- 2025-07-30
  194. How are people extracting system prompts? -- 2025-07-29
  195. cherrydra/mcpurl -- 2025-07-29
  196. [Editorial] neural networks don’t need to be giant to be powerful -- 2025-07-27
  197. Qwen/Qwen3-235B-A22B-Thinking-2507 -- 2025-07-27
  198. mistralai/Magistral-Small-2507 -- 2025-07-27
  199. Running Qwen3 235B-A22B 2507 on a Threadripper 3970X + 3x RTX 3090 Machine at 15 tok/s -- 2025-07-25
  200. The Latest GPT-5 Leaks and Teasers -- 2025-07-25
  201. Qwen3-235B-A22B-Thinking-2507 released! -- 2025-07-25
  202. From chaotic prompting to structured workflow: My Claude evolution -- 2025-07-24
  203. Never Come Up Empty: Adaptive HyDE Retrieval for Improving LLM Developer Support -- 2025-07-24
  204. MCPEval: Automatic MCP-based Deep Evaluation for AI Agent Models -- 2025-07-24
  205. Building an MCP Server and Client with FastMCP 2.0 -- 2025-07-24
  206. Does LLM architecture allow for injecting some more input tokens in the middle of token generation? -- 2025-07-24
  207. Lucy: A Mobile-Capable 1.7B Reasoning Model That Rivals Jan-Nano -- 2025-07-23
  208. microsoft/Phi-4-mini-flash-reasoning -- 2025-07-22
  209. LGAI-EXAONE/EXAONE-4.0-32B -- 2025-07-22
  210. Replacing thinking with tool usage enables reasoning in small language models -- 2025-07-22
  211. A Request for Comments (RFC) for MCP-alternative Universal Tool Calling Protocol (UTCP) was created -- 2025-07-22
  212. How to use the same context across LLMs and Agents -- 2025-07-22
  213. new models from NVIDIA: OpenReasoning-Nemotron 32B/14B/7B/1.5B -- 2025-07-21
  214. OpenAI Places Second Behind Human Coder at AtCoder Progmming Event -- 2025-07-21
  215. HelpingAI/Dhanishtha-2.0-preview -- 2025-07-21
  216. Probing for Arithmetic Errors in Language Models -- 2025-07-21
  217. Struggling to Generate Polished UI with Claude Code -- 2025-07-20
  218. IMO 2025 LLM Mathematical Reasoning Evaluation -- 2025-07-20
  219. A comprehensive study of LLM-based argument classification: from LLAMA through GPT-4o to Deepseek-R1 -- 2025-07-19
  220. Madness, the ignorant's question. Would it be possible to lighten an LLM model? -- 2025-07-18
  221. Open source and free iOS app to chat with your LLMs when you are away from home. -- 2025-07-16
  222. Requirements and architecture for a good enough model with scientific papers RAG -- 2025-07-16
  223. Excited to share updates to Open WebUI Starter! New docs, Docker support, and templates for everyone -- 2025-07-16
  224. OpenAI's open source LLM is a reasoning model, coming Next Thursday! -- 2025-07-14
  225. The BastionRank Showdown: Crowning the Best On-Device AI Models of 2025 -- 2025-07-14
  226. Local Llama with Home Assistant Integration and Multilingual-Fuzzy naming -- 2025-07-14
  227. Podcast generation app -- works with Ollama -- 2025-07-14
  228. support for Jamba hybrid Transformer-Mamba models has been merged into llama.cpp -- 2025-07-13
  229. Asynchronous Robot Inference: Decoupling Action Prediction and Execution -- 2025-07-13
  230. Suggestion: Grayscale-First Hack to Optimize Image Recognition in Grok—Save Compute Without Losing Accuracy? -- 2025-07-13
  231. Upskill your LLMs with Gradio MCP Servers -- 2025-07-09
  232. AGI is not multimodal -- 2025-07-09
  233. How Do Vision-Language Models Process Conflicting Information Across Modalities? -- 2025-07-09
  234. High Precision -- 2025-07-09
  235. skt/A.X-4.0 -- 2025-07-09
  236. Medical language model - for STT and summarize things -- 2025-07-09
  237. Ollama alternatives -- 2025-07-09
  238. Dealing with tool_calls hallucinations -- 2025-07-09
  239. SynapseRoute: An Auto-Route Switching Framework on Dual-State Large Language Model -- 2025-07-07
  240. i made a commit message generator that can be used offline and for free -- 2025-07-05
  241. THUDM/GLM-4.1V-9B-Thinking -- 2025-07-05
  242. baidu/ERNIE-4.5-VL-424B-A47B-Base-Paddle -- 2025-07-05
  243. LoRA Fine-Tuning Without GPUs: A CPU-Efficient Meta-Generation Framework for LLMs -- 2025-07-05
  244. skt/A.X-4.0-Light -- 2025-07-04
  245. ChatDOC/OCRFlux-3B -- 2025-07-04
  246. Is there a local model that can solve this text decoding riddle? -- 2025-07-03
  247. Seven replies to the viral Apple reasoning paper and why they fall short -- 2025-07-03
  248. R1-0528 won't stop thinking -- 2025-07-03
  249. Running Deepseek R1 0528 q4_K_M and mlx 4-bit on a Mac Studio M3 -- 2025-07-02
  250. Hoshinonyaruko/Gensokyo-MCP -- 2025-07-01
  251. THU-KEG/AdaptThink -- 2025-06-28
  252. modelcontextprotocol/registry -- 2025-06-27
  253. Skywork/Skywork-SWE-32B -- 2025-06-25
  254. moonshotai/Kimi-VL-A3B-Thinking-2506 -- 2025-06-25
  255. POLARIS-Project/Polaris-4B-Preview -- 2025-06-25
  256. XiaomiMiMo/MiMo -- 2025-06-22
  257. nvidia/AceReason-Nemotron-1.1-7B -- 2025-06-22
  258. Menlo/Jan-nano -- 2025-06-22
  259. MiniMax-AI/SynLogic -- 2025-06-15
  260. The Fractured Entangled Representation Hypothesis -- 2025-06-15
  261. mistralai/Magistral-Small-2506_gguf -- 2025-06-14
  262. Ruminate: From All-or-Nothing to Just-Right Reasoning in LLMs -- 2025-06-14
  263. [update] Restructured repo under rvn-tools — modular CLI for LLM formats -- 2025-06-14
  264. Testing Quant Quality for Shisa V2 405B -- 2025-06-14
  265. Old model, new implementation -- 2025-06-14
  266. Ollama vs Llamacpp: Different output for same model -- 2025-06-14
  267. How to improve my ViT model -- 2025-06-14
  268. From RPC to transactions and durable executions -- 2025-06-14
  269. Flattening Rust’s learning curve -- 2025-06-14
  270. Async from scratch 3: Pinned against the wall -- 2025-06-14
  271. How to get the most out of my AMD 7900XT? -- 2025-06-14
  272. typelevel/cats -- 2025-06-11
  273. wesm/pydata-book -- 2025-06-11
  274. lerobot/smolvla_base -- 2025-06-10
  275. jedisct1/openapi-mcp -- 2025-06-09
  276. sarvamai/sarvam-m -- 2025-06-07
  277. Qwen/Qwen3-Reranker-0.6B -- 2025-06-07
  278. arcee-ai/Homunculus -- 2025-06-04
  279. PRIME-RL/Entropy-Mechanism-of-RL -- 2025-06-02
  280. Atlas: Learning to Optimally Memorize the Context at Test Time -- 2025-06-02
  281. Gen-Verse/MMaDA -- 2025-06-01
  282. osmosis-ai/Osmosis-Structure-0.6B -- 2025-06-01
  283. 0-1 phase transitions in sparse spiked matrix estimation -- 2025-06-01
  284. 0-Step Capturability, Motion Decomposition and Global Feedback Control of the 3D Variable Height-Inverted Pendulum -- 2025-06-01
  285. simplescaling/s1 -- 2025-05-31
  286. FractalAIResearch/Fathom-R1-14B -- 2025-05-31
  287. unsloth/DeepSeek-R1-0528-Qwen3-8B-GGUF -- 2025-05-31
  288. deepseek-ai/DeepSeek-R1-0528-Qwen3-8B -- 2025-05-31
  289. I made Model Version Control Protocol for AI agents -- 2025-05-31
  290. AI Baby Monitor – fully local Video-LLM nanny (beeps when safety rules are violated) -- 2025-05-31
  291. LMStudio - llama.cpp - vLLM -- 2025-05-31
  292. Built an ADK Agent that finds Jobs based on your Resume -- 2025-05-31
  293. Should I resize the image before sending it to Qwen VL 7B? Would it give better results? -- 2025-05-31
  294. How to start a LLM project? -- 2025-05-31
  295. Beware of Fast-Math -- 2025-05-31