AI Infrastructure

Deployment, Kubernetes, scaling, cloud vs local, MLOps

678 articles across 155 editions

Articles

  1. klawsh/klaw.sh -- 2026-02-24
  2. [Editorial] -- 2026-02-24
  3. [Editorial] -- 2026-02-24
  4. [Editorial] -- 2026-02-24
  5. [Editorial] -- 2026-02-24
  6. In-Context Autonomous Network Incident Response: An End-to-End Large Language Model Agent Approach -- 2026-02-24
  7. [Editorial] Taala's Etches AI Models onto Transistors to Rocket-Boost Inference -- 2026-02-23
  8. Repurposing 800 RX 580s into an AI Inference Cluster: Mass Document OCR at 24x Lower Cost -- 2026-02-23
  9. Jolt Atlas: Verifiable Inference via Lookup Arguments in Zero Knowledge -- 2026-02-21
  10. [Editorial] RTI Genesis — Real-Time Infrastructure -- 2026-02-21
  11. [Editorial] RuVector & RVF Vector Database -- 2026-02-21
  12. [Editorial] RVDNA — Does It Work? -- 2026-02-21
  13. [Editorial] Enterprise Open Source AI Coding Is Changing the ROI Calculation -- 2026-02-20
  14. [Editorial] Think Tax: The Real Cost of AI-Generated Code -- 2026-02-20
  15. [Editorial] RuVector DNA Sequence Analysis Example -- 2026-02-20
  16. GGML and llama.cpp join HF to ensure the long-term progress of Local AI -- 2026-02-20
  17. [Editorial] Agentic AI for Enterprise -- 2026-02-20
  18. Vellium: open-source desktop app for creative writing with visual controls -- 2026-02-20
  19. [Editorial] AI Security, Governance, and Cybersecurity -- 2026-02-19
  20. AI-generated password isn't random, it just looks that way -- 2026-02-19
  21. I plugged a $30 radio into my Mac mini and told my AI "connect to this" — now I control my smart home and send voice messages over radio with zero internet -- 2026-02-19
  22. FlashLM v4: 4.3M ternary model trained on CPU in 2 hours — coherent stories from adds and subtracts only -- 2026-02-19
  23. [Editorial] Antigravity Awesome Skills -- 2026-02-18
  24. I built an MCP that connects your agent to 8,000+ skills with zero setup -- 2026-02-18
  25. OpenAI buys OpenClaw, hires creator Peter Steinberger -- 2026-02-18
  26. Why top talent is walking away from OpenAI and xAI -- 2026-02-18
  27. [Editorial] Enterprise AI Summit — Gene Kim -- 2026-02-18
  28. XiaomiRobotics/Xiaomi-Robotics-0 -- 2026-02-18
  29. [Editorial] Product Security Is About to Hit a Wall -- 2026-02-17
  30. [Editorial] Product Security Wall (follow-up) -- 2026-02-17
  31. [Editorial] Oligo Accelerates Vulnerability Intelligence with NVIDIA -- 2026-02-17
  32. [Editorial] RVF — Most Consequential AI Infrastructure -- 2026-02-16
  33. [Editorial] Introducing RVF Cognitive Container -- 2026-02-16
  34. 375ms Voice-to-Voice Latency: Local Nemotron-4 + Kokoro-82M on Blackwell Bare Metal -- 2026-02-16
  35. Expensively Quadratic: The LLM Agent Cost Curve -- 2026-02-16
  36. [Editorial] https://www.authsignal.com/blog/articles/account-recovery-is-the-identity-industrys-most-overlooked-challenge -- 2026-02-13
  37. [Editorial] https://raffy.ch/blog/2026/02/03/the-gaps-that-created-the-new-wave-of-siem-and-ai-soc-vendors -- 2026-02-13
  38. [Editorial] https://m.youtube.com/watch?v=w8p-yFqF13o -- 2026-02-13
  39. [Editorial] https://mrinal.com/articles/agent-identities -- 2026-02-13
  40. [Editorial] https://labs.zenity.io/p/perplexity-comet-a-reversing-story -- 2026-02-13
  41. [Editorial] https://arxiv.org/abs/2602.10117 -- 2026-02-13
  42. [Editorial] https://arxiv.org/abs/2602.09433 -- 2026-02-13
  43. [Editorial] https://www.linkedin.com/posts/hermanerrico_i-put-out-a-site-and-paper-defining-a-new-activity-7427822997593387008-zzYm -- 2026-02-13
  44. [Editorial] https://www.linkedin.com/pulse/ive-spent-three-decades-cybersecurity-ai-biggest-trust-brett-kelsey-v7r3c -- 2026-02-13
  45. [Editorial] https://www.linkedin.com/pulse/ai-red-teamers-advice-orgs-deploying-brian-chamberlain-utkse -- 2026-02-13
  46. [Editorial] https://www.linkedin.com/posts/cole-medin-727752184_vibe-coding-has-a-30-50-security-vulnerability-activity-7420461997537959938-y5uG -- 2026-02-13
  47. [Editorial] https://zeltser.com/ai-malware-analysis-remnux -- 2026-02-13
  48. I built a personal AI assistant in 815 lines of TypeScript — every capability is just a Markdown file -- 2026-02-13
  49. whisper.cpp + llama.cpp in a desktop app — local voice-to-text with LLM text cleanup -- 2026-02-13
  50. I built a social network where 6 Ollama agents debate each other autonomously — Mistral vs Llama 3.1 vs CodeLlama -- 2026-02-13
  51. Lorph: A Local AI Chat App with Advanced Web Search via Ollama -- 2026-02-13
  52. [Editorial] https://www.linkedin.com/pulse/when-brain-os-meets-real-operating-systems-rafael-knuth-4hcsf -- 2026-02-11
  53. [Editorial] https://docs.entire.io/core-concepts -- 2026-02-11
  54. Why System Prompts are failing your local agent builds (and why you need a Logic Floor) -- 2026-02-11
  55. I built an MCP server that syncs Cursor, Claude Desktop, and Windsurf with one brain [Open Source] -- 2026-02-11
  56. built a self-hosted API proxy that strips PII before prompts reach any LLM - works with Ollama too -- 2026-02-11
  57. Bitnet.cpp - Inference framework for 1-bit (ternary) LLM's -- 2026-02-11
  58. Last Week in Multimodal AI - Local Edition -- 2026-02-11
  59. [Editorial] https://www.zdnet.com/article/claude-code-alternative-free-local-open-source-goose -- 2026-02-10
  60. Recommend model for openclaw clawdbot running locally on old laptop 4gb vram 16g ram asus -- 2026-02-10
  61. [Editorial] https://github.com/ikennaokpala/forge -- 2026-02-09
  62. Trainable System Router and Industry standard Dual Method Memory System Release -- 2026-02-09
  63. [Editorial] https://blogs.microsoft.com/blog/2026/01/26/maia-200-the-ai-accelerator-built-for-inference -- 2026-02-09
  64. [Editorial] https://www.linkedin.com/posts/ownyourai_i-just-open-sourced-my-security-auditor-for-activity-7426565421375541248-rqGu -- 2026-02-09
  65. [Editorial] https://www.linkedin.com/posts/activity-7426382890004971520-VBdy -- 2026-02-09
  66. [Editorial] https://www.linkedin.com/posts/samuele-giampieri-b1b67597_redamon-airedteam-penetrationtesting-activity-7426292400534437889--0Ny -- 2026-02-09
  67. [Editorial] https://hackernoon.com/everyone-says-ai-is-insecure-so-i-measured-it -- 2026-02-09
  68. [Editorial] https://x.com/fr0gger_/status/2020025525784514671?ct=rw-li -- 2026-02-09
  69. Agent deleted production data because no policy layer said 'no' - what's your governance strategy? -- 2026-02-09
  70. [Editorial] https://github.com/usestrix/strix -- 2026-02-06
  71. [Editorial] https://github.com/GH05TCREW/pentestagent -- 2026-02-06
  72. [Editorial] https://www.edloveless.com/the-call-is-coming-from-inside-the-house-and-its-watching-netflix -- 2026-02-06
  73. eScan Antivirus Delivers Malware in Supply Chain Attack -- 2026-02-06
  74. Run Ollama on your Android! -- 2026-02-04
  75. Is using the officially supported local LLM integration in Claude Code for business/corporate use a violation of ToS? -- 2026-02-04
  76. [Editorial] https://www.linkedin.com/posts/hermanerrico_aisecurity-agenticai-cybersecurity-activity-7424484799123247104-40_F -- 2026-02-04
  77. m4xxxxx/AIxVuln -- 2026-02-04
  78. OpenClaw on edge Linux (systemd + cron) — quick experiment + a few questions -- 2026-02-03
  79. [Ollama Cloud] 29.7% failure rate, 3,500+ errors in one session, support ignoring tickets for 2 weeks - Is this normal? -- 2026-02-03
  80. OpenClaw is everywhere all at once, and a disaster waiting to happen -- 2026-02-03
  81. [Editorial] https://www.linkedin.com/posts/robvanderveer_iso42001-pren18282-pren18282-share-7423993903118290945--EO7 -- 2026-02-03
  82. Large categorized list of AI / LLM benchmarks & leaderboards -- 2026-02-03
  83. The cost of massive context: Burned 45M Gemini tokens in hours using OpenCode. Is Context Caching still a myth for most agents? -- 2026-01-30
  84. PSA: CHECK YOUR OPENAI PAYMENT CARD -- 2026-01-30
  85. [Editorial] https://humanemulator.co/ -- 2026-01-30
  86. Generating skills for api+local CUAs via noVNC demonstration recording MCP -- 2026-01-30
  87. Our Agent Rebuilt Itself in 26 Hours. AMA👀 -- 2026-01-30
  88. I built a multi-agent orchestration layer for Claude Code - sharing in case it's useful to anyone -- 2026-01-30
  89. AlfonsSkills/SkillSync -- 2026-01-30
  90. 1rgs/nanocode -- 2026-01-30
  91. We Got Claude to Build CUDA Kernels and teach open models! -- 2026-01-30
  92. Show: Fully Local Voice Assistant (with optional Voice Cloning) -- 2026-01-30
  93. Thoughts on PowerInfer as a way to break the memory bottleneck? -- 2026-01-30
  94. Ollama Models Ranked by VRAM Requirements -- 2026-01-30
  95. local-vision-bridge: OpenWebUI Function to intercept images, send them to a vision capable model, and forward description of images to text only model -- 2026-01-30
  96. We added an on-device AI meeting note taker into AnythingLLM to replace SaaS solutions -- 2026-01-29
  97. Stop wasting 30%+ of your context window on JSON braces. Meet SONA -- 2026-01-29
  98. 1.8-3.3x faster Embedding finetuning now in Unsloth (~3GB VRAM) -- 2026-01-29
  99. I built a free open-source TDD canvas for VS Code. Claude Code writes tests first, captures runtime traces when they fail, fixes until green -- 2026-01-29
  100. Show HN: Sandbox Agent SDK – unified API for automating coding agents -- 2026-01-29
  101. OSS ChatGPT WebUI – 530 Models, MCP, Tools, Gemini RAG, Image/Audio Gen -- 2026-01-29
  102. Renting out the cheapest GPUs ! (CPU options available too) -- 2026-01-29
  103. umputun/ralphex -- 2026-01-29
  104. rezonia/invoice-processor -- 2026-01-29
  105. [Editorial] https://www.linkedin.com/pulse/person-rights-responsibility-why-ai-contributors-break-ralf-d-m%C3%BCller-m2k9f -- 2026-01-29
  106. OpenStreetMap overwhelmed by bots scraping data -- 2026-01-28
  107. LLM-Generated Newspaper Provides Ultimate in Niche Publications -- 2026-01-28
  108. Anyscale's new data: Most AI clusters run at <50% utilization. Is "Disaggregation" the fix, or just faster cold starts? -- 2026-01-28
  109. Deploying Open WebUI for 2,000 Users (Solo) – Sanity Check Needed -- 2026-01-28
  110. A Tool to Calculate If a LLM Will Fit Your GPU -- 2026-01-27
  111. We indexed the entire Ollama Library (10TB+ VRAM). Here is how we run them all on 1 Node. -- 2026-01-27
  112. Is the next leap in AI architectural? Comparing VRAM-hungry Transformers with Compute-intensive Energy-Based Models -- 2026-01-27
  113. Show HN: A Local OS for LLMs. MIT License. Zero Hallucinations. Infinite Memory -- 2026-01-27
  114. facebookresearch/actionmesh -- 2026-01-27
  115. deepseek-ai/Engram -- 2026-01-27
  116. Multiplex Thinking: Reasoning via Token-wise Branch-and-Merge -- 2026-01-27
  117. [Editorial] https://github.com/ruvnet/ruvector/blob/claude/clawdbot-ruvector-setup-RHW3a/npm/packages/ruvbot/docs/FEATURE_COMPARISON.md -- 2026-01-27
  118. Can companies "hack" ChatGPT to promote them? -- 2026-01-27
  119. [Editorial] https://grahamhelton.com/blog/nodes-proxy-rce -- 2026-01-26
  120. Route leak incident on January 22, 2026 -- 2026-01-26
  121. [Editorial] https://www.linkedin.com/posts/jeffreyemanuel_agent-coding-life-hack-im-100-convinced-activity-7421442482082660352-l5AG -- 2026-01-26
  122. Model Persistence, Context Management, Multilayered Cognition, Data Export, Cross Provider Support --- Anybody interested? -- 2026-01-26
  123. Anyone else wish they could "branch" conversations like git branches? -- 2026-01-26
  124. xuzeyu91/WebCode -- 2026-01-26
  125. ast-grep: A CLI tool for code structural search, lint and rewriting -- 2026-01-26
  126. [Editorial] https://www.linkedin.com/pulse/ai-conversation-we-should-actually-having-renato-beninatto-vs55c -- 2026-01-26
  127. Designing AI-resistant technical evaluations -- 2026-01-26
  128. I put an RTX PRO 4000 Blackwell SFF in my MS-S1 Max (Strix Halo), some benchmarks -- 2026-01-26
  129. ClaraVerse | Local AI workspace (4 months ago) -> Your feedback -> Back with improvements. -- 2026-01-26
  130. Beyond Vendor Lock-In: A Framework for LLM Sovereignty -- 2026-01-26
  131. [Editorial] https://www.linkedin.com/posts/ivandj_early-claims-around-self-evolving-memory-activity-7421307316437676033-l0Jm -- 2026-01-26
  132. [Editorial] https://www.linkedin.com/posts/reuvencohen_introducing-ruvector-world-model-activity-7421556928910290944-cx4v -- 2026-01-26
  133. stepfun-ai/Step3-VL-10B -- 2026-01-26
  134. The value of $200 a month AI users -- 2026-01-23
  135. Claude Permanent Memory Leak - This could be the cause of issue 16157 - instally hitting usage limits -- 2026-01-23
  136. browser-use/agent-sdk -- 2026-01-23
  137. egebese/seo-research-mcp -- 2026-01-23
  138. Crates.io: Development Update -- 2026-01-23
  139. [Editorial] https://www.linkedin.com/posts/owais-drera-590750378_github-owaisdreraagent-slayer-activity-7419782518985486336-7WE3 -- 2026-01-23
  140. [Editorial] https://www.linkedin.com/posts/resilientcyber_prompt-injection-activity-7420165497230454784-NOHa -- 2026-01-23
  141. [Editorial] https://www.linkedin.com/posts/anshumanbhartiya_lets-talk-about-threat-modeling-and-skills-activity-7418130148312674305-arTh -- 2026-01-23
  142. [Editorial] https://www.linkedin.com/posts/reuvencohen_introducing-prime-radiant-a-real-time-activity-7420466084006223873-hOct -- 2026-01-23
  143. [Editorial] https://www.wiz.io/blog/wiz-research-codebreach-vulnerability-aws-codebuild -- 2026-01-23
  144. Aider's documentation for getting connected to local inference sucks. Hopefully this helps. -- 2026-01-22
  145. Polymcp Integrates Ollama – Local and Cloud Execution Made Simple -- 2026-01-22
  146. Show HN: LangGraph architecture that scales (hexagonal pattern, 110 tests) -- 2026-01-22
  147. [Editorial] https://www.linkedin.com/posts/reuvencohen_mcps-generally-kind-of-suck-and-the-community-activity-7420106621437095936-0qkE -- 2026-01-22
  148. [Editorial] docker ai sandbox -- 2026-01-22
  149. cvsouth/memories-mcp -- 2026-01-22
  150. TencentCloudADP/youtu-tip -- 2026-01-22
  151. What we learned processing 1M+ emails for context engineering -- 2026-01-22
  152. [Editorial] https://www.linkedin.com/posts/unsloth_you-can-now-run-glm-47-flash-locally-on-activity-7419220348719624192-CV65 -- 2026-01-22
  153. Here is how to get GLM 4.7 working on llama.cpp with flash attention and correct outputs -- 2026-01-22
  154. unsloth/GLM-4.7-Flash-GGUF -- 2026-01-22
  155. 768Gb Fully Enclosed 10x GPU Mobile AI Build -- 2026-01-22
  156. I used Ollama (Mistral Small 24B) + LightRAG to build a graph pipeline that catches hidden risks where standard Vector RAG fails. -- 2026-01-21
  157. Hey all- I built a self-hosted MCP server to run AI semantic search over your own databases, files, and codebases. Supports Ollama and cloud providers if you want. Thought you all might find a good use for it. -- 2026-01-21
  158. I built Semantiq - a universal MCP server that gives semantic code understanding to Claude Code, Cursor, and any AI coding tool (100% local, no API keys) -- 2026-01-21
  159. My company banned AI tools and I dont know what to do -- 2026-01-21
  160. AssetOpsBench: Bridging the Gap Between AI Agent Benchmarks and Industrial Reality -- 2026-01-21
  161. MatchTIR: Fine-Grained Supervision for Tool-Integrated Reasoning via Bipartite Matching -- 2026-01-21
  162. [Editorial] https://github.com/7h30th3r0n3/Evil-M5Project -- 2026-01-20
  163. [Editorial] https://7h30th3r0n3.fr/the-vulnerability-that-killed-freewifi_secure -- 2026-01-20
  164. x86 prefixes and escape opcodes flowchart -- 2026-01-20
  165. I need a feedback about an open-source CLI that scan AI models (Pickle, PyTorch, GGUF) for malware, verify HF hashes, and check licenses -- 2026-01-20
  166. Running multiple models locally on a single GPU, with model switching in 2-5 seconds. -- 2026-01-20
  167. EXAONE MoE support has been merged into llama.cpp -- 2026-01-20
  168. naklecha/simple-llm -- 2026-01-20
  169. [Editorial] https://substack.com/inbox/post/184924197 -- 2026-01-20
  170. [Editorial] https://www.linkedin.com/posts/mondweepchakravorty_this-article-details-how-to-get-started-using-ugcPost-7418423980123987969-uVok -- 2026-01-20
  171. Automating illustration for the Conan story "Tower of the Elephant"--Llama and Mistral for prompt generation, Qwen3-VL for image scoring, and image models. -- 2026-01-20
  172. Demo: On-device browser agent (Qwen) running locally in Chrome -- 2026-01-20
  173. Agent observability is way different from regular app monitoring - maintainer's pov -- 2026-01-20
  174. charIesding/agent-dashboard -- 2026-01-20
  175. Learning Latency-Aware Orchestration for Parallel Multi-Agent Systems -- 2026-01-20
  176. Binary Fuse Filters: Fast and Smaller Than XOR Filters -- 2026-01-19
  177. Read_once(), Write_once(), but Not for Rust -- 2026-01-19
  178. Show HN: HTTP:COLON – A quick HTTP header/directive inspector and reference -- 2026-01-19
  179. 7x Longer Context Reinforcement Learning in Unsloth -- 2026-01-19
  180. openbmb/AgentCPM-Explore -- 2026-01-19
  181. black-forest-labs/FLUX.2-klein-4B -- 2026-01-19
  182. [Editorial] https://sean.heelan.io/2026/01/18/on-the-coming-industrialisation-of-exploit-generation-with-llms -- 2026-01-19
  183. [Editorial] https://red.anthropic.com/2026/cyber-toolkits-update -- 2026-01-19
  184. [Editorial] https://github.com/trailofbits/skills -- 2026-01-19
  185. [Editorial] https://blog.cloudflare.com/fail-small-resilience-plan -- 2026-01-16
  186. [Editorial] https://www.linkedin.com/posts/akopytko_symbolicai-neurosymbolicai-deterministicai-activity-7417128350650912768-reUc -- 2026-01-16
  187. [Editorial] https://www.usenix.org/system/files/usenixsecurity25-zhang-xiang.pdf -- 2026-01-16
  188. [Editorial] https://state-of-iranblackout.whisper.security/ -- 2026-01-16
  189. [Editorial] https://equixly.com/blog/2026/01/14/can-ai-identify-0days -- 2026-01-16
  190. [Editorial] https://www.linkedin.com/posts/reuvencohen_announcing-claude-flow-v3-a-full-rebuild-activity-7417928335160262656-NYqJ -- 2026-01-16
  191. [Editorial] https://www.linkedin.com/posts/sandstream_i-just-shipped-ralph-inferno-10-to-npm-activity-7417606358654406657-zBPY -- 2026-01-16
  192. [Editorial] https://www.linkedin.com/posts/rasmuswiding_parallel-ai-agents-the-complete-infrastructure-activity-7417646422436777984-D1Zw -- 2026-01-16
  193. Ralph Loop inspired me to build this - AI decides what Claude Code does next orchestrating claude code until task is done -- 2026-01-16
  194. Local AI App With SD-1.5 Models -- 2026-01-15
  195. For RAG serving: how do you balance GPU-accelerated index builds with cheap, scalable retrieval at query time? -- 2026-01-15
  196. Home workstation vs NYC/NJ colo for LLM/VLM + Whisper video-processing pipeline (start 1 GPU, scale to 4–8) -- 2026-01-15
  197. Create specialized Ollama models in 30 seconds -- 2026-01-15
  198. [Editorial] https://www.linkedin.com/pulse/ai-race-moving-faster-than-our-security-standards-can-david-abutbul-zmvtf -- 2026-01-15
  199. [Editorial] https://www.linkedin.com/posts/josh-orenstein_iran-just-did-something-no-government-has-activity-7417294442811895811-oOTR -- 2026-01-15
  200. [Editorial] https://sanderschulhoff.substack.com/p/the-ai-security-industry-is-bullshit -- 2026-01-15
  201. [Editorial] https://hackthemodel.com/ai-security-isnt-bullshit-but-we-re-securing-the-wrong-thing-b925d04b517a -- 2026-01-15
  202. [Editorial] https://www.linkedin.com/posts/reuvencohen_qudag-bitchat-is-a-secure-peer-to-peer-messaging-activity-7417222548897329152-153E -- 2026-01-15
  203. Confer – End to end encrypted AI chat -- 2026-01-15
  204. Two ASRock Radeon AI Pro R9700's cooking in CachyOS. -- 2026-01-14
  205. which small model can i use to read this gauge? -- 2026-01-14
  206. Supertone/supertonic-2 -- 2026-01-14
  207. Qualcomm's RISC-Ventana Fusion -- 2026-01-14
  208. An Open Source Electromagnetic Resonance Tablet -- 2026-01-14
  209. [Editorial] https://github.com/VibiumDev/vibium -- 2026-01-13
  210. Battle of AI Gateways: Rust vs. Python for AI Infrastructure: Bridging a 3,400x Performance Gap -- 2026-01-13
  211. Built a local TTS app using Apple's MLX framework. No cloud, no API calls, runs entirely on device. -- 2026-01-13
  212. I built a tool to clean HTML pages for RAG (JSON / MD / low-noise HTML) -- 2026-01-13
  213. [Editorial] https://cloudsecurityalliance.org/blog/2026/01/09/the-first-question-security-should-ask-on-ai-projects -- 2026-01-12
  214. [Editorial] https://www.linkedin.com/posts/stephenbklein_the-age-of-pretend-the-ai-industry-just-spent-activity-7415779694509219842-8OkK -- 2026-01-12
  215. [Editorial] https://www.linkedin.com/posts/reuvencohen_most-people-talk-about-gpus-as-if-they-are-activity-7415778737486483456-7DQK -- 2026-01-12
  216. [Editorial] https://blog.openthreatresearch.com/evolving-the-threat-hunter-playbook-planning-hunts-with-agent-skills -- 2026-01-12
  217. [Editorial] https://maggiegray.us/p/the-age-of-ai-for-offensive-cyber -- 2026-01-12
  218. [Editorial] https://www.linkedin.com/posts/resilientcyber_llm-fingerprinting-activity-7415849264452739072-H9fw -- 2026-01-12
  219. [Editorial] https://www.linkedin.com/posts/johnbruggeman_kimwolf-tldr-whattodo-activity-7413983885392396289-xsd4 -- 2026-01-12
  220. [Editorial] https://www.linkedin.com/posts/clintgibler_cybersecurity-ai-activity-7407102282120462337-6URK -- 2026-01-12
  221. [Editorial] https://xoxruns.medium.com/feedback-driven-iteration-and-fully-local-webapp-pentesting-ai-agent-achieving-78-on-xbow-199ef719bf01 -- 2026-01-12
  222. [Editorial] https://www.linkedin.com/posts/yass-99637a105_i-spent-the-last-couple-of-months-building-activity-7415098924224499714-lCDV -- 2026-01-12
  223. A closer look at a BGP anomaly in Venezuela -- 2026-01-09
  224. Show HN: I visualized the entire history of Citi Bike in the browser -- 2026-01-09
  225. Modifying a QingPing Air Quality Monitor for Local MQTT Access -- 2026-01-09
  226. Arbitrage: Efficient Reasoning via Advantage-Aware Speculation -- 2026-01-09
  227. Connect any LLM to all your knowledge sources and chat with it -- 2026-01-08
  228. Have claude code interact with another claude code session interactively to test a plugin im building -- 2026-01-08
  229. [Editorial] https://www.linkedin.com/posts/robert-westin_vibecoding-google-chrome-ugcPost-7410672189860933633-rnuH -- 2026-01-08
  230. shootthesound/comfyUI-LongLook -- 2026-01-08
  231. tangxiaofeng7/cscan -- 2026-01-08
  232. Solar-Open-100B-GGUF is here! -- 2026-01-08
  233. [HW TUNING] Finding the best GPU power limit for inference -- 2026-01-08
  234. HomeGenie v2.0: 100% Local Agentic AI (Sub-5s response on CPU, No Cloud) -- 2026-01-08
  235. WebGPU llama.cpp running in browser with Unity to drive NPC interactions (demo) -- 2026-01-08
  236. Offline agent testing chat mode using Ollama as the judge (EvalView) -- 2026-01-08
  237. [Editorial] https://www.linkedin.com/posts/reuvencohen_we-are-hitting-the-ceiling-of-prompt-driven-activity-7415027558171488256-_Dvn -- 2026-01-08
  238. [Editorial] https://www.linkedin.com/posts/cole-medin-727752184_most-developers-using-ai-coding-assistants-activity-7414834730149376000-lecD -- 2026-01-08
  239. [Editorial] https://www.linkedin.com/posts/pratik-kadam-pk_i-wasted-3-weeks-building-ai-agents-the-wrong-activity-7414361937570078720-BNbD -- 2026-01-08
  240. [Editorial] https://www.linkedin.com/posts/ownyourai_you-know-claude-code-works-really-well-with-activity-7414678511967244288-VHxZ -- 2026-01-08
  241. [Editorial] https://backalleycoder.com/posts/passseeds-an-experiment-in-hijacking-passkeys-to-unlock-cryptographic-use-cases -- 2026-01-07
  242. [Editorial] https://hackbot.dad/writing/intro-to-gpus -- 2026-01-07
  243. [Editorial] https://www.linkedin.com/posts/vilhelm-von-ehrenheim_are-you-avoiding-the-dumb-zone-dex-dropped-activity-7414210431570993152-8AFP -- 2026-01-07
  244. [Editorial] https://www.linkedin.com/posts/cole-medin-727752184_2025-overpromised-on-ai-agents-2026-demands-activity-7414472389167841280-WDzs -- 2026-01-07
  245. [Editorial] https://www.linkedin.com/posts/ronitelman_the-missing-step-in-decision-intelligence-activity-7413638316899762177-Aifb -- 2026-01-07
  246. Achieving 30x Real-Time Transcription on CPU . Multilingual STT Openai api endpoint compatible. Plug and play in Open-webui - Parakeet -- 2026-01-07
  247. Local Image Edit API Server for Models like Qwen-Image-Edit or Flux2-dev -- 2026-01-07
  248. Using n8n to orchestrate DeepSeek/Llama3 Agents via SSH (True Memory Persistence) -- 2026-01-07
  249. [Editorial] https://arxiv.org/html/2512.24601v1 -- 2026-01-06
  250. [Editorial] https://www.linkedin.com/posts/javier-cullas-644179109_ruvllm-llm-onlinelearning-activity-7414118850759262208-1Epx -- 2026-01-06
  251. [Editorial] https://github.com/Cornjebus/rlm-replication-study -- 2026-01-06
  252. I built Ctrl: Execution control plane for high stakes agentic systems -- 2026-01-06
  253. I built a local GUI for vector DBs (pgvector, Qdrant, Chroma, more) -- 2026-01-06
  254. Has anyone tried routing Claude Code CLI to multiple model providers? -- 2026-01-06
  255. [Editorial] https://www.linkedin.com/posts/andriyburkov_one-of-the-fundamental-papers-that-advanced-activity-7412675071640485888-4gvc -- 2026-01-05
  256. [Editorial] https://anthropic.skilljar.com/claude-code-in-action -- 2026-01-05
  257. RTX 3090 vs RTX 4090 for local AI assistant - impact on Time To First Token (TTFT)? -- 2026-01-05
  258. Any Vision model on pair with GPT-OSS 120B? -- 2026-01-05
  259. tobilg/ai-observer -- 2026-01-05
  260. omniASR-server: OpenAI-compatible API for Meta's omniASR with streaming support -- 2026-01-05
  261. Tally – A tool to help agents classify your bank transactions -- 2026-01-05
  262. DiffSynth-Studio/Qwen-Image-i2L -- 2026-01-05
  263. orneryd/NornicDB -- 2026-01-02
  264. Build a Deep Learning Library -- 2026-01-02
  265. Liquid CO2 For Grid Scale Energy Storage Isn’t Just Hot Air -- 2026-01-02
  266. How llama.cpp implements 2.9x faster top-k sampling with bucket sort -- 2025-12-31
  267. Built an offline-first vector database (v0.2.0) looking for real-world feedback -- 2025-12-31
  268. Linux 7.0 Expected to Bring IO_uring Iopoll Polling Improvements -- 2025-12-31
  269. [Release] Dingo v2.0 – Open-source AI data quality tool now supports SQL databases, RAG evaluation, and Agent-as-a-Judge hallucination detection! -- 2025-12-31
  270. Securing MCP in production -- 2025-12-31
  271. Why I Ditched Serverless Neptune/OpenSearch for Dockerized Neo4j/pgvector on EC2 (60% Cost Cut) -- 2025-12-30
  272. [Editorial] https://ratatui.rs/ -- 2025-12-29
  273. [Editorial] https://github.com/ruvnet/ruvector/tree/main/crates/ruvector-mincut -- 2025-12-29
  274. [Editorial] https://loggingsucks.com/ -- 2025-12-29
  275. Karpathy on Programming -- 2025-12-29
  276. [Editorial] https://github.com/marcuspat/turbo-flow-claude -- 2025-12-29
  277. Claude Watch - Monitor Your Context Usage Across All Sessions -- 2025-12-29
  278. [Research] Jacobi Forcing: turning AR LLMs into diffusion-style parallel decoders, staying causal with 4x speedup -- 2025-12-23
  279. Variable Sized Experts in MoEs -- 2025-12-23
  280. [Editorial] https://www.linkedin.com/posts/harish-santhanalakshmi-ganesan-31ba96171_github-cisco-ai-defensemcp-scanner-scan-activity-7409036231025811456-y16c -- 2025-12-23
  281. [Editorial] PentestGPT -- 2025-12-23
  282. Untargeted Jailbreak Attack -- 2025-12-23
  283. AprielGuard: A Guardrail for Safety and Adversarial Robustness in Modern LLM Systems -- 2025-12-23
  284. Data in, Research Paper out. Fully autonomous. Open-sourced & Free Research Agent. -- 2025-12-23
  285. Why claude code compare to github copilot ? -- 2025-12-23
  286. black-forest-labs/flux2 -- 2025-12-23
  287. tulerfeng/OneThinker -- 2025-12-23
  288. My problem: my agent code got tied to one provider. I built a thin wrapper so I can swap OpenAI ↔ Ollama without rewrites. -- 2025-12-23
  289. Hey r/LocalLLaMA, I built a fully local AI agent that runs completely offline (no external APIs, no cloud) and it just did something pretty cool: It noticed that the "panic button" in its own GUI was completely invisible on dark theme (black text on black background), reasoned about the problem, a -- 2025-12-23
  290. Demo - RPI4 wakes up a server with dynamically scalable 7 gpus -- 2025-12-23
  291. Show HN: I Built an Image Captioning Tool Using Llama.cpp -- 2025-12-23
  292. Introducing Bilgecan: self-hosted, open-source local AI platform based on Ollama + Spring AI + PostgreSQL + pgvector -- 2025-12-22
  293. The Open WebUI Documentation just got a massive 2,600+ line overhaul (v0.6.42) -- 2025-12-22
  294. [Editorial] https://zymtrace.com/ -- 2025-12-22
  295. PLX/PEX PCIe 4.0 seems to help for LLMs and P2P! I.e. PEX88096 (1 PCIe 4.0 X16 to 5 PCIE 4.0 X16) and others, and comparison vs bifurcation. -- 2025-12-22
  296. Qubes OS 4.3.0 has been released -- 2025-12-22
  297. SeeSee21/Z-Image-Turbo-AIO -- 2025-12-22
  298. Designing a CPU for Native BASIC -- 2025-12-22
  299. Memory at the Speed of Light -- 2025-12-19
  300. MRI-style transformer scan, Llama 3.2 3B -- 2025-12-19
  301. TQTQliu/Light-X -- 2025-12-19
  302. [Editorial] https://www.linkedin.com/posts/rocklambros_aisecurity-devsecops-activity-7407423157445287937-Wc0Z -- 2025-12-19
  303. If Your AI App Only Works When You Sit Next To It -- 2025-12-19
  304. dsl-learn/cutile-learn -- 2025-12-18
  305. Errors in Rust: A Deep Dive -- 2025-12-18
  306. Plug Into USB, Read Hostname and IP Address -- 2025-12-18
  307. [Editorial] https://www.linkedin.com/posts/yotam-perkal_comparing-ai-agents-to-cybersecurity-professionals-activity-7407076565357887488-KI5M -- 2025-12-18
  308. Building an event-driven alternative to LangGraph because single-threaded loops are killing me. Roast my architecture. -- 2025-12-18
  309. Claude Code, GPT-5.2, DeepSeek v3.2, and Self-Hosted Devstral 2 on Fresh SWE-rebench (November 2025) -- 2025-12-18
  310. ai-sage/GigaChat3-702B-A36B-preview -- 2025-12-18
  311. Open Source Alternative to Perplexity -- 2025-12-16
  312. How I Self-Hosted a Local Reranker for Open WebUI with vLLM (No More Jina API) -- 2025-12-16
  313. [Editorial] https://openreview.net/pdf?id=nbMeRvNb7A -- 2025-12-16
  314. Feedback Wanted - Vector Compression Engine (benchmarked v FAISS) -- 2025-12-16
  315. Why it so hard to abliterated kimi k2 thinking model? -- 2025-12-16
  316. Price of a bot army revealed across online platforms -- 2025-12-15
  317. iOS 26.2 fixes 20 security vulnerabilities, 2 actively exploited -- 2025-12-15
  318. Litestream VFS -- 2025-12-15
  319. I got tired of my agents losing context on topic shifts, so I hacked together a branch router - thoughts? -- 2025-12-12
  320. Thoughts on decentralized training with Psyche? -- 2025-12-12
  321. DeepSeek V3.2 got gold at IMO and IOI - weights on HF, MIT license, but Speciale expires Dec 15 -- 2025-12-10
  322. Linux Foundation Announces the Formation of the Agentic AI Foundation (AAIF), Anchored by New Project Contributions Including Model Context Protocol (MCP), goose and AGENTS.md -- 2025-12-10
  323. Run Any Model Provider on OpenWebUI immediately by discovering AI services on your LAN -- 2025-12-08
  324. dynamic allocation of less used experts to slower memory -- 2025-12-08
  325. At What Point Does Owning GPUs Become Cheaper Than LLM APIs ? I -- 2025-12-05
  326. How to run phones while being struck by suicide drones -- 2025-12-04
  327. FreeBSD 15.0-Release Announcement -- 2025-12-04
  328. UEFI On ARM? More Likely Than You Think -- 2025-12-04
  329. This Week in Security: Cloudflare Wasn’t DNS, BADAUDIO, and Not a Vuln -- 2025-11-28
  330. [Editorial] https://www.linkedin.com/posts/reuvencohen_bigger-isnt-better-the-future-of-ai-isn-activity-7398720183797911554-wHsT/ -- 2025-11-28
  331. Building the largest known Kubernetes cluster, with 130k nodes -- 2025-11-26
  332. [Editorial] https://github.com/ChrisRoyse/UsefulPrompts/blob/main/pushrepo.md -- 2025-11-25
  333. Open-source package: let your coding agent generate interactive docs -- 2025-11-25
  334. Google's Antigravity - Another VS Code Fork! -- 2025-11-20
  335. Godbolt's Rule -- 2025-11-20
  336. Introducing AnyLanguageModel: One API for Local and Remote LLMs on Apple Platforms -- 2025-11-20
  337. [Editorial] https://www.linkedin.com/posts/avi-lumelsky-713111144_an-ai-powered-cyberattack-is-self-replicating-activity-7396569417549234177-n6ai -- 2025-11-19
  338. Native Sysmon functionality coming to Windows -- 2025-11-19
  339. A more surgical approach to abliteration -- 2025-11-19
  340. [30 Trillion token dataset] "HPLT 3.0: Very Large-Scale Multilingual Resources for LLM and MT. Mono- and Bi-lingual Data, Multilingual Evaluation, and Pre-Trained Models", Oepen et al. 2025 -- 2025-11-19
  341. Compress to Impress: Efficient LLM Adaptation Using a Single Gradient Step on 100 Samples -- 2025-11-19
  342. [Editorial] https://www.npmjs.com/package/neural-trader -- 2025-11-14
  343. Critical RCE patched in Imunify360 affects up to 50M+ websites -- 2025-11-14
  344. Kubernetes Ingress Nginx is retiring -- 2025-11-14
  345. About KeePassXC's Code Quality Control -- 2025-11-14
  346. I built my own self-hosted GPT with LM Studio, Caddy, and Cloudflare Tunnel -- 2025-11-14
  347. Can't find Model in Ollama -- 2025-11-14
  348. [PSA] Claude Code Web users: Want something useful to do with your $1k free credits? Help fix all the borked HuggingFace Spaces. -- 2025-11-14
  349. Hi reddit, I rebuilt Karpathy's Nanochat in pure Rust [nanochat-rs] -- 2025-11-14
  350. Building for an Open Future - our new partnership with Google Cloud -- 2025-11-14
  351. kubernetes-tenants/tenant-operator -- 2025-11-12
  352. Native LLM Router Integration with Cost Transparency for OpenWebUI -- 2025-11-12
  353. Working on a list of open source tools for a Kubernetes ML stack -- 2025-11-10
  354. I built a leaderboard for Rerankers -- 2025-11-10
  355. LiquidAI/LFM2-ColBERT-350M -- 2025-11-10
  356. Apache Iggy is a high-performance, persistent message streaming platform -- 2025-11-07
  357. [Editorial] https://www.linkedin.com/posts/gadievron_deep-dive-cursor-code-injection-runtime-activity-7391805842318077952-bRjD -- 2025-11-05
  358. Now you can deploy OpenStatus on Raspberry Pi -- 2025-11-03
  359. Qwen3-VL-32B Q8 speeds in llama.cpp vs vLLM FP8 on a RTX PRO 6000 -- 2025-11-03
  360. OCR models: HF demos vs local performance -- 2025-11-03
  361. Help me decide: EPYC 7532 128GB + 2 x 3080 20GB vs GMtec EVO-X2 -- 2025-11-03
  362. [Editorial] https://github.com/claraverse-space/ClaraVerse -- 2025-10-31
  363. You can now run Ollama models in Jan -- 2025-10-31
  364. Codex and Supabase -- 2025-10-31
  365. snowyfizz/Vision-Detection-API -- 2025-10-29
  366. Show HN: Apache Fory Rust – 10-20x faster serialization than JSON/Protobuf -- 2025-10-29
  367. huggingface_hub v1.0: Five Years of Building the Foundation of Open Machine Learning -- 2025-10-29
  368. Need help understanding OpenAIs API usage for text-embedding -- 2025-10-26
  369. [Editorial] Browsers you can socially engineer -- 2025-10-24
  370. [Editorial] share terminal sessions using Claude Code for web -- 2025-10-24
  371. [Editorial] New web -- 2025-10-23
  372. ContextGuard – Open-source security monitoring for MCP servers -- 2025-10-23
  373. Gemini AI owners, please, I beg you, let me disable canvas permanently -- 2025-10-23
  374. We rewrote OpenFGA in pure Postgres -- 2025-10-22
  375. Ntfsplus: NTFS Filesystem Remake -- 2025-10-22
  376. Reasoning should be thought of as a drawback, not a feature -- 2025-10-21
  377. inclusionAI/Ring-1T-preview -- 2025-10-21
  378. [Editorial] LinkedIn Alogrithm -- 2025-10-18
  379. State of AI Report 2025 -- 2025-10-17
  380. [Editorial] Asimov’s three laws — updated for the genAI age -- 2025-10-17
  381. Comparing Popular AI Evaluation Platforms for 2025 -- 2025-10-17
  382. I analyzed 200 e-commerce sites and found 73% of their traffic is fake -- 2025-10-17
  383. ZephrFish/OmniProx -- 2025-10-14
  384. [Editorial] Claude Flow updates -- 2025-10-11
  385. [Editorial] https://www.linkedin.com/pulse/from-chatbot-operating-system-what-openais-next-move-means-leimer-ju18c -- 2025-10-11
  386. 11 AI Agent Projects You Can Build Today (With Guides) -- 2025-10-11
  387. Anyone here building Agentic AI into their office workflow? How’s it going so far? -- 2025-10-11
  388. How would you address it (free alternatives) -- 2025-10-11
  389. meituan-longcat/LongCat-Flash-Chat -- 2025-10-11
  390. Ring Flash 2.0 104B A6B with Linear Attention released a few days ago -- 2025-10-07
  391. GDPVal: Measuring the performance of our models on real-world tasks -- 2025-09-26
  392. OpenGVLab/InternVL3_5-GPT-OSS-20B-A4B-Preview -- 2025-09-25
  393. microsoft/VibeVoice-1.5B -- 2025-09-25
  394. jhu-clsp/mmBERT-base -- 2025-09-25
  395. Sophia NLU Engine Upgrade - New and Improved POS Tagger -- 2025-09-22
  396. llama.ui: new updates! -- 2025-09-22
  397. [Tool] Intuitive branching/forking/merging of chats via ThreadIt -- 2025-09-22
  398. Google Android RAG SDK – Quick Comparison Study -- 2025-09-22
  399. Advice on building an enterprise-scale, privacy-first conversational assistant (local LLMs with Ollama vs fine-tuning) -- 2025-09-22
  400. I built a tool to do deep research on my local file system -- 2025-09-21
  401. sbcinnovation/lmux -- 2025-09-21
  402. Hypervisor from Scratch -- 2025-09-21
  403. Show HN: Ghostpipe – Connect files in your codebase to user interfaces -- 2025-09-21
  404. Scaleway on Hugging Face Inference Providers 🔥 -- 2025-09-21
  405. New version of AlchemyLab (another Claude Code alternative) -- 2025-09-20
  406. v0.6.29 Released - Major new version, major redesigns and many new features and performance improvements -- 2025-09-20
  407. What Facebook's Memcache Taught Me About Systems Thinking -- 2025-09-20
  408. Linus Torvalds Guitar Pedal Project -- 2025-09-20
  409. Alex Karp Insists Palantir Doesn't Spy on Americans. Here's What He's Not Saying -- 2025-09-20
  410. ArchGW 0.3.11 – Cross-API streaming (Anthropic client ↔ OpenAI models) -- 2025-09-19
  411. Public AI on Hugging Face Inference Providers 🔥 -- 2025-09-19
  412. VS Code Chat: Introducing auto model selection (preview) -- 2025-09-18
  413. ircfspace/masque-plus -- 2025-09-18
  414. Google Agentic Payments Protocol and X402: Agents Can Now Pay Each Other -- 2025-09-18
  415. The madness of SaaS chargebacks -- 2025-09-18
  416. Fix AI pipeline bugs before they hit your local stack: a semantic firewall + grandma clinic (beginner friendly, MIT) -- 2025-09-17
  417. Any idea how to use ollama (debian) with 2x GPUs to load larger models? -- 2025-09-14
  418. Rails on SQLite: new ways to cause outages -- 2025-09-14
  419. [Editorial] Defeating Nondeterminism in LLM Inference -- 2025-09-14
  420. Nvidia Unveils Rubin CPX Amidst Chart-Topping Blackwell Ultra MLPerf Results -- 2025-09-14
  421. [Editorial] v3 of the Smart Turn semantic VAD model. -- 2025-09-12
  422. Best Tiny Model for programming? -- 2025-09-12
  423. Day 9 of Working with 8 Concurrent Claude Codes -- 2025-09-12
  424. Danau5tin/multi-agent-coding-system -- 2025-09-12
  425. ApeRAG: Production-ready GraphRAG with multi-modal indexing and K8s deployment -- 2025-09-12
  426. Deploying 1.4KW GPUs (B300) what's the biggest bottleneck you've seen power delivery or cooling? -- 2025-09-10
  427. New approach to block decoding from Meta, claims that around 4x inference speedup is possible, with 4x less compute passes at the same time. -- 2025-09-10
  428. Qwen3 30B A3B Q40 @ 13 tok/sec on Raspberry Pi cluster -- 2025-09-10
  429. SERVE 8B model directly from iPhone -- 2025-09-10
  430. How do you handle integration blindness of AI coding? -- 2025-09-10
  431. From 14-year corporate job to AI-powered solo founder - Day 3 insights -- 2025-09-10
  432. Claude Code often pretends to execute tasks but doesn’t actually do them -- 2025-09-10
  433. I built a Graph RAG pipeline (VeritasGraph) that runs entirely locally with Ollama (Llama 3.1) and has full source attribution. -- 2025-09-09
  434. Built an offline AI CLI that generates apps and runs code safely -- 2025-09-09
  435. Three different models reviewing three different implementations coded by three different models -- 2025-09-09
  436. karpathy/rendergit -- 2025-09-09
  437. Shipping textures as PNGs is suboptimal -- 2025-09-09
  438. jkroepke/access-log-exporter -- 2025-09-08
  439. Nvidia Dynamo vs vLLM production stack — how do they compare in real-world multi-node serving? -- 2025-09-08
  440. A multi-interface (REST and MCP) server for automatic license plate recognition 🚗 -- 2025-09-05
  441. Replay - like Git for App States and Agent Context -- 2025-09-05
  442. Bringing Computer Use to the Web -- 2025-09-05
  443. Missing Agents -- 2025-09-05
  444. Show HN: Woomarks, transfer your Pocket links to this app or self-host it -- 2025-09-05
  445. Capture and Plot Serial Data in the Browser -- 2025-09-05
  446. ECA: free vendor lock alternative -- 2025-09-05
  447. AMD Ryzen 7 8700G for Local AI: User Experience with Integrated Graphics? -- 2025-09-05
  448. The Sense and Nonsense of Virtual Power Plants -- 2025-09-04
  449. Authenticate Thyself -- 2025-09-04
  450. Taco Bell Says 'No Más' to AI Drive-Thru Experiment -- 2025-09-02
  451. Setting up MCP in Codex is easy, don’t let the TOML trip you up -- 2025-09-02
  452. Make your ZeroGPU Spaces go brrr with PyTorch ahead-of-time compilation -- 2025-09-02
  453. Built a Confluence to OpenWebUI Knowledge Base Sync Tool -- 2025-09-02
  454. Trying to simplify RAG setups → built a free hybrid search sandbox (feedback welcome) -- 2025-09-01
  455. [Guide + Code] Fine-Tuning a Vision-Language Model on a Single GPU (Yes, With Code) -- 2025-09-01
  456. OpenWebUI-SDK Development -- 2025-09-01
  457. I built a local “second brain” AI that actually remembers everything (321 tests passed) -- 2025-08-30
  458. Gpt-oss Fine-tuning - now with 60K context length and fits on <13GB VRAM -- 2025-08-30
  459. facebookincubator/pces -- 2025-08-29
  460. Google Debuts Device-Bound Session Credentials Against Session Hijacking -- 2025-08-29
  461. Treasury Announces Federal Govt Will Phase Out Paper Checks on September 30th -- 2025-08-29
  462. Bearer token keeps getting forgotten - somehow -- 2025-08-29
  463. texttron/BrowseComp-Plus -- 2025-08-28
  464. Scaling RL to Long Videos -- 2025-08-28
  465. Meta's AI Companion Policy Is Outrageous -- 2025-08-27
  466. Developer sentenced to prison for activating “kill switch” to avenge his firing -- 2025-08-25
  467. How to Stop Zeus from Toasting Your Pi -- 2025-08-25
  468. superfashi/pwnbot-ng -- 2025-08-25
  469. Automated microgreens mini-farm ran by Claude Code -- 2025-08-25
  470. Faster prefill on CPU-MoE IK-llama? -- 2025-08-23
  471. Llamarunner, a llama.cpp manager and runner (with user presets!) -- 2025-08-23
  472. what's "load_in_4bit" in unsloth LORA training? -- 2025-08-23
  473. merve/smol-vision -- 2025-08-23
  474. Rubby2001/Rshell---A-Cross-Platform-C2 -- 2025-08-23
  475. Cloudflare incident on August 21, 2025 -- 2025-08-23
  476. DeepSeek-V3.1 (Thinking and Non Thinking) -- 2025-08-22
  477. Modify <think> to explore the impact on <answer> -- 2025-08-22
  478. Tiny finance “thinking” model (Gemma-3 270M) with verifiable rewards (SFT → GRPO) — structured outputs + auto-eval (with code) -- 2025-08-22
  479. Qwen/Qwen3-30B-A3B-Thinking-2507 -- 2025-08-22
  480. tencent/Hunyuan-7B-Instruct -- 2025-08-22
  481. Bringing Computer Use to the Web -- 2025-08-21
  482. vibheksoni/stealth-browser-mcp -- 2025-08-21
  483. RAG Web Search performs poorly -- 2025-08-21
  484. AGENTS.md – Open format for guiding coding agents -- 2025-08-21
  485. turtacn/kubestack-ai -- 2025-08-21
  486. Critical Cache Poisoning Vulnerability in Dnsmasq -- 2025-08-21
  487. gguf-eval: an evaluation framework for GGUF models using llama.cpp -- 2025-08-21
  488. My open-source agent Maestro is now faster and lets you configure context limits for better local model support -- 2025-08-21
  489. guide : running gpt-oss with llama.cpp -- 2025-08-21
  490. Docker container for running Claude Code in "dangerously skip permissions" mode -- 2025-08-21
  491. Llama Habitat Continues to Expand, Now Includes the PSP -- 2025-08-21
  492. OpenAI Cookbook - Verifying gpt-oss implementations -- 2025-08-21
  493. Docker Model Runner is really neat -- 2025-08-20
  494. Build a Powerful RAG Web Scraper with Ollama and LangChain -- 2025-08-20
  495. Ollama interface with memory -- 2025-08-20
  496. YuminosukeSato/pyproc -- 2025-08-20
  497. AGENTS.md – Open format for guiding coding agents -- 2025-08-20
  498. Need Help: So-Vits-SVC Vibrated/Glitchy Output + Source Vocal Has Residual Music (G=98k, Diff=57k) -- 2025-08-19
  499. GDPR meant nothing: chat control ends privacy for the EU [video] -- 2025-08-19
  500. From Zero to GPU: A Guide to Building and Scaling Production-Ready CUDA Kernels -- 2025-08-19
  501. [Editorial] XBOW vs HackerOne, Flawless victory! -- 2025-08-19
  502. GPT-5 doubles performance in offensive security benchmark -- 2025-08-19
  503. Drop-in Voice App Control for iOS with Local Models -- 2025-08-18
  504. GPT-OSS 20b runs on a RasPi 5, 16gb -- 2025-08-18
  505. PowerInfer/SmallThinker-21BA3B-Instruct -- 2025-08-18
  506. [Editorial] Beyond LLM and SLM.. -- 2025-08-18
  507. Why it’s a mistake to ask chatbots about their mistakes | Ars Technica -- 2025-08-18
  508. LocalAI Major Update: Modular Backends (update llama.cpp, stablediffusion.cpp, and others independently!), Qwen-VL, Qwen-Image Support, Image Editing & More -- 2025-08-18
  509. Could you use RAG and Wikidumps to keep AI in the loop? -- 2025-08-17
  510. Markdown-UI: an interactive UI inside Markdown for LLMs -- 2025-08-17
  511. How can I reduce financial model deployment time from 5–10 days to 2 using automation (Cline, SQL, Snowflake,Tableau/Sigma)? -- 2025-08-17
  512. Model intelligence is no longer the constraint for automation -- 2025-08-17
  513. Implementing a basic equivalent of OpenBSD's pflog in Linux nftables -- 2025-08-17
  514. Google Play Store bans wallets that don't have banking license -- 2025-08-17
  515. Physical Aimbot Shoots For Success In Valorant -- 2025-08-17
  516. [Reproducible] Constraint-guided knowledge file for Claude reduces long-chain drift (MIT PDF, 60-sec setup) -- 2025-08-17
  517. Looking for a way to mimic custom slash commands in Aider -- 2025-08-17
  518. NO WAY BACK -- 2025-08-14
  519. X-Omni-Team/X-Omni -- 2025-08-14
  520. SkyworkAI/Matrix-3D -- 2025-08-14
  521. Best (free) AI Model for learning/understanding large unfamiliar codebases? -- 2025-08-13
  522. MCPs that are part of my day-to-day Claude Code workflow -- 2025-08-13
  523. [Editorial] Rust, AI Agents -- 2025-08-13
  524. [Editorial] AI Winter? -- 2025-08-13
  525. sii-research/siiRL -- 2025-08-13
  526. Hand-picked selection of articles on AI fundamentals/concepts -- 2025-08-13
  527. Update for Maestro - A Self-Hosted Research Assistant. Now with Windows/macOS support, Word/MD files support, and a smarter writing agent -- 2025-08-13
  528. Llama.cpp Vulkan is awesome, It gave new life to my old RX580 -- 2025-08-13
  529. Pairs of GPUs for inference? -- 2025-08-13
  530. Ollama 2x mi50 32GB -- 2025-08-13
  531. Go 1.25 Is Released -- 2025-08-13
  532. [Editorial] New Red Team's Networking Techniques -- 2025-08-13
  533. [Editorial] GLM-4.5, enterprise use -- 2025-08-13
  534. GLiNER2: An Efficient Multi-Task Information Extraction System with Schema-Driven Interface -- 2025-08-13
  535. Nonescape: SOTA AI-Image Detection Model (Open-Source) -- 2025-08-12
  536. Activation-Guided Local Editing for Jailbreaking Attacks -- 2025-08-12
  537. Run GPT-OSS with MLX or GGUF in your CLI using 1 line of code -- 2025-08-11
  538. Built a new VLM (MicroLlaVA) on a single NVIDIA 4090 -- 2025-08-11
  539. nvidia/audio-flamingo-3 -- 2025-08-11
  540. LGAI-EXAONE/EXAONE-4.0-32B -- 2025-08-11
  541. Does GPT-5 have JSON output mode? -- 2025-08-10
  542. How to prevent claude from running `git -A` using hooks? -- 2025-08-10
  543. TSMC to go 3D with wafer-sized processors -- 2025-08-10
  544. Proton's New Two-Factor Authenticator App -- 2025-08-10
  545. CodeFu-7B-v0.1 - a Reinforcement Learning (RL)-trained 7B model for Competitive Programming -- 2025-08-08
  546. lucidrains/h-net-dynamic-chunking -- 2025-08-08
  547. Accelerate ND-Parallel: A Guide to Efficient Multi-GPU Training -- 2025-08-08
  548. Saidia: Offline-First AI Assistant for Educators in low-connectivity regions -- 2025-08-06
  549. Finding a local model for text table QA -- 2025-08-06
  550. Kodezi/Chronos -- 2025-08-06
  551. [Editorial] Voice ai and voice agents, howto -- 2025-08-05
  552. [Editorial] You don’t need a WebRTC server for your voice agents -- 2025-08-05
  553. Waiting on direct MCP integration—dev team, got a roadmap update? -- 2025-08-05
  554. Character Bitmap Graphics on the Pet 2001 -- 2025-08-04
  555. Caches: LRU vs. Random -- 2025-08-04
  556. Names are not type safety (2020) -- 2025-08-04
  557. [Editorial] HRM -- 2025-08-03
  558. How are people running an MLX-compatible OpenAI API server locally? -- 2025-08-03
  559. I built the perfect MCP client for broke developers (Ollama powered) -- 2025-08-03
  560. character-ai/pipelining-sft -- 2025-08-03
  561. 🚀 Qwen3-30B-A3B Small Update -- 2025-08-01
  562. These new Qwen3 models are cooking! -- 2025-08-01
  563. unsloth/Qwen3-235B-A22B-Instruct-2507-GGUF -- 2025-08-01
  564. [Editorial] Taking AI back from the sparkle ponies. -- 2025-07-28
  565. Autonomous AI Surveillance: Multimodal Deep Learning for Cognitive and Behavioral Monitoring -- 2025-07-28
  566. InditexTech/k8s-overcommit-operator -- 2025-07-27
  567. Migadu Email -- 2025-07-27
  568. Parquet Content-Defined Chunking -- 2025-07-27
  569. How are people staging AI training datasets from NVMe → DDR5 → GPU VRAM for fine-tuning on RTX 5090s? -- 2025-07-25
  570. Running Qwen3 235B-A22B 2507 on a Threadripper 3970X + 3x RTX 3090 Machine at 15 tok/s -- 2025-07-25
  571. mistral-small3.2:latest 15B takes 28GB VRAM? -- 2025-07-25
  572. Looking for help with terrible vLLM performance -- 2025-07-25
  573. Warashi/cage -- 2025-07-22
  574. The Most Powerful Server Embiggens a Bit with Power11 -- 2025-07-22
  575. Vintage Hardware Find Includes Time Capsule of Data -- 2025-07-22
  576. rip-zoyo/orbit-tls -- 2025-07-22
  577. vidore/colqwen-omni-v0.1 -- 2025-07-21
  578. Advancing Retrieval-Augmented Generation for Structured Enterprise and Internal Data -- 2025-07-21
  579. Super fast local CPU file processing with static embeddings! -- 2025-07-21
  580. Servo Web Engine Further Tuning Performance -- 2025-07-20
  581. Upcoming deprecation of GitHub Command Palette feature preview -- 2025-07-20
  582. I fed Gemini a lot of posts from this reddit and let it summarize the best practice -- 2025-07-19
  583. Consilium: When Multiple LLMs Collaborate -- 2025-07-19
  584. Defense Department to begin using Grok -- 2025-07-18
  585. LGAI-EXAONE/EXAONE-4.0-1.2B -- 2025-07-18
  586. This SSD Will Self Destruct in Ten Seconds… -- 2025-07-18
  587. Locally Running AI model with Intel GPU -- 2025-07-18
  588. Where local is lagging behind... Wish lists for the rest of 2025 -- 2025-07-18
  589. Devstral-Vision-Small-2507 -- 2025-07-18
  590. Xttsv2 model, Chatterbox on MacBook air 8 gb -- 2025-07-18
  591. Claude deleted my whole repository -- 2025-07-17
  592. Defeating Memory Leaks with Zig Allocators -- 2025-07-17
  593. OpenDPDv2: A Unified Learning and Optimization Framework for Neural Network Digital Predistortion -- 2025-07-17
  594. Stop monitoring systems; start monitoring outcomes -- 2025-07-16
  595. Migrating the Hub from Git LFS to Xet -- 2025-07-16
  596. RekaAI/reka-flash-3.1 -- 2025-07-15
  597. What kind of throughput can I expect with Llama 3.1 on a H200? -- 2025-07-15
  598. Local LLM to back Elastic AI -- 2025-07-13
  599. Blackwell FP8 W8A8 NVFP4 support discussion -- 2025-07-13
  600. Is there some localllm benchmarking tool to see how well your system will handle a model? -- 2025-07-13
  601. Unlocking AMD MI300X for High-Throughput, Low-Cost LLM Inference -- 2025-07-13
  602. AI4Research: A Survey of Artificial Intelligence for Scientific Research -- 2025-07-13
  603. Owen777/Kontext-Style-Loras -- 2025-07-13
  604. Best Practices for Integrating Onyx (Danswer) with Open WebUI Pipelines -- 2025-07-13
  605. Just shipped first uvx compatible public pypi release for my automated Open WebUI Postgres migration tool -- 2025-07-13
  606. [P-6] Decoding FLUX.1 Kontext: Flow Matching for In-Context Image Generation and Editing in Latent Space -- 2025-07-13
  607. Local AI server with Ollama and Tailscale integration looking for feedback -- 2025-07-12
  608. Running OpenWebUI Without RAG: Faster Web Search & Document Upload -- 2025-07-12
  609. Show HN: Pangolin – Open source alternative to Cloudflare Tunnels -- 2025-07-11
  610. SUS Lang: The SUS Hardware Description Language -- 2025-07-11
  611. Embedded USB Debug for Snapdragon -- 2025-07-11
  612. introducing cocoindex - super simple etl to prepare data for ai, with dynamic index (ollama integrated) -- 2025-07-08
  613. welltodopoker/kubernetes-dynamic-reclaimable-pvc-controllers -- 2025-07-08
  614. Python Pandas Ditches NumPy for Speedier PyArrow -- 2025-07-08
  615. Claude Max Integration - Roo Code 3.21.4 & 3.21.5 Release Notes -- 2025-07-05
  616. OpenAI to buy AI startup from Jony Ive -- 2025-07-05
  617. The Hobby Computer Culture -- 2025-07-05
  618. OpenMIDIStomper Makes Sure Your Gear Does What Your Foot Says -- 2025-07-05
  619. How realistic is it to run a media site entirely on AI-generated code with no developers? -- 2025-07-03
  620. Thinking about switching from cloud based AI to sth more local -- 2025-07-03
  621. There are no new ideas in AI only new datasets -- 2025-07-02
  622. trycua/cua -- 2025-06-30
  623. ml0-1337/claude-gate -- 2025-06-30
  624. mstrYoda/go-arctest -- 2025-06-30
  625. Hack of SEC's Edgar System Exposed Flaws in US Financial Security -- 2025-06-29
  626. $^{100}$Mo-enriched Li$_2$MoO$_4$ scintillating bolometers for $0\nu 2\beta$ decay search: from LUMINEU to CUPID-0/Mo projects -- 2025-06-29
  627. yushangxiao/claude2api -- 2025-06-28
  628. nlohmann/json -- 2025-06-25
  629. marimo-team/marimo -- 2025-06-25
  630. Python ASGI Framework Benchmarks -- 2025-06-24
  631. AI in my plasma physics research didn’t go the way I expected -- 2025-06-23
  632. identicallead/mse6 -- 2025-06-22
  633. GCC 13.4 Released with 129 additional bug fixes -- 2025-06-22
  634. Databricks acquires Neon -- 2025-06-22
  635. crumbyte/noxdir -- 2025-06-21
  636. strapi/strapi -- 2025-06-21
  637. n8n-io/n8n -- 2025-06-20
  638. Learning (The Basics of) Nftables -- 2025-06-18
  639. Linux Cgroup from First Principles -- 2025-06-18
  640. Databricks Free Edition -- 2025-06-18
  641. kn0x0x/CVE-2025-32756-POC -- 2025-06-17
  642. Magic Leap One Bootloader Exploit -- 2025-06-17
  643. Take9 Won't Improve Cybersecurity -- 2025-06-17
  644. nanonets/Nanonets-OCR-s -- 2025-06-16
  645. Menlo/Jan-nano-gguf -- 2025-06-16
  646. Olow304/memvid -- 2025-06-14
  647. carbon-language/carbon-lang -- 2025-06-12
  648. 1001 Ways of Scenario Generation for Testing of Self-driving Cars: A Survey -- 2025-06-11
  649. 100G Data Center Interconnections with Silicon Dual-Drive Mach-Zehnder Modulator and Direct Detection -- 2025-06-11
  650. litert-community/Gemma3-1B-IT -- 2025-06-11
  651. Qwen/Qwen3-Reranker-8B -- 2025-06-11
  652. open-thoughts/OpenThinker3-7B -- 2025-06-10
  653. Cosmos-Reason1: Physical AI Common Sense and Embodied Reasoning Models -- 2025-06-10
  654. How does vector dimension reduction work in new Qwen3 embedding models? -- 2025-06-10
  655. New Upgraded Deepseek R1 is now almost on par with OpenAI's O3 High model on LiveCodeBench! Huge win for opensource! -- 2025-06-10
  656. Mundane Robustness Benchmarks -- 2025-06-10
  657. My Local LLM plan for academic editing help -- 2025-06-10
  658. What is the best and affordable uncensored model to fine tune with your own data? -- 2025-06-10
  659. Backpropagation with Automatic Differentiation from Scratch in Python -- 2025-06-10
  660. DeepSeek-R1-0528 Released on Official API! -- 2025-06-10
  661. Misconceptions about the Unix Philosophy -- 2025-06-10
  662. Where hyperscale hardware goes to retire: Ars visits a big ITAD site -- 2025-06-10
  663. How to Connect an External RAG Database (FAISS, ChromaDB, etc.) to Open WebUI? -- 2025-06-10
  664. meilisearch/meilisearch -- 2025-06-09
  665. Weaponizing Dependabot: Pwn Request at its finest -- 2025-06-08
  666. Experts -- 2025-06-08
  667. 007: Democratically Finding The Cause of Packet Drops -- 2025-06-08
  668. GoogleCloudPlatform/generative-ai -- 2025-06-07
  669. EvolutionAPI/evo-ai -- 2025-06-06
  670. Stopping AI scrapers from taking down my server -- 2025-06-05
  671. kubeflow/kubeflow -- 2025-06-03
  672. Enhancing MySQL: MySQL improvement project -- 2025-06-03
  673. JefferyHcool/BiliNote -- 2025-06-02
  674. IDE for PostgreSQL in VS Code from Microsoft -- 2025-06-02
  675. 10,000 km Straight-line Transmission using a Real-time Software-defined GPU-Based Receiver -- 2025-06-02
  676. astral-sh/uv -- 2025-06-01
  677. Realtek's $10 tiny 10GbE NIC will hit motherboards soon -- 2025-06-01
  678. RSyncUI – A SwiftUI based macOS GUI for rsync -- 2025-06-01