AI Hardware

AI news coverage

480 articles across 145 editions

Articles

  1. [Editorial] Taala's Etches AI Models onto Transistors to Rocket-Boost Inference -- 2026-02-23
  2. Repurposing 800 RX 580s into an AI Inference Cluster: Mass Document OCR at 24x Lower Cost -- 2026-02-23
  3. [Editorial] Enterprise Open Source AI Coding Is Changing the ROI Calculation -- 2026-02-20
  4. [Editorial] Think Tax: The Real Cost of AI-Generated Code -- 2026-02-20
  5. [Editorial] RuVector DNA Sequence Analysis Example -- 2026-02-20
  6. 15 years of FP64 segmentation, and why the Blackwell Ultra breaks the pattern -- 2026-02-20
  7. Nvidia and OpenAI abandon unfinished $100B deal in favour of $30B investment -- 2026-02-20
  8. Google releases Gemini 3.1 Pro with Benchmarks -- 2026-02-20
  9. [Editorial] RVF — Most Consequential AI Infrastructure -- 2026-02-16
  10. [Editorial] Introducing RVF Cognitive Container -- 2026-02-16
  11. 375ms Voice-to-Voice Latency: Local Nemotron-4 + Kokoro-82M on Blackwell Bare Metal -- 2026-02-16
  12. Expensively Quadratic: The LLM Agent Cost Curve -- 2026-02-16
  13. Is the Nvidia T4 actually viable for 70B (EXL2) daily driving, or is it just pure cope compared to dual 3090s? -- 2026-02-13
  14. Open weight kimi k2.5 overtakes opus 4.5 non thinking on arena -- 2026-02-13
  15. When did we go from 400k to 256k? -- 2026-02-13
  16. [Editorial] https://blogs.microsoft.com/blog/2026/01/26/maia-200-the-ai-accelerator-built-for-inference -- 2026-02-09
  17. OpenClaw on edge Linux (systemd + cron) — quick experiment + a few questions -- 2026-02-03
  18. [Ollama Cloud] 29.7% failure rate, 3,500+ errors in one session, support ignoring tickets for 2 weeks - Is this normal? -- 2026-02-03
  19. OpenClaw is everywhere all at once, and a disaster waiting to happen -- 2026-02-03
  20. Companion MIDI Pedal Helps Roland Groovebox Along -- 2026-01-30
  21. Renting out the cheapest GPUs ! (CPU options available too) -- 2026-01-29
  22. What secondary GPU should I get, mainly for local prompting? -- 2026-01-28
  23. On-device tool calling with Llama 3.2 3B on iPhone - made it suggest sushi restaurants [Open Source, React Native] -- 2026-01-28
  24. I have written gemma3 inference in pure C -- 2026-01-28
  25. There's a hidden Android setting that spots fake cell towers -- 2026-01-23
  26. TerabyteDeals – Compare storage prices by $/TB -- 2026-01-23
  27. 768Gb Fully Enclosed 10x GPU Mobile AI Build -- 2026-01-22
  28. Drone Hacking Part 1: Dumping Firmware and Bruteforcing ECC -- 2026-01-21
  29. Looking at a Real Fake Raspberry Pi RP2040 Board -- 2026-01-21
  30. 3x3090 + 3060 in a mid tower case -- 2026-01-19
  31. Built an 8× RTX 3090 monster… considering nuking it for 2× Pro 6000 Max-Q -- 2026-01-19
  32. vLLM on 2x/4x Tesla v100 32GB -- 2026-01-16
  33. M.2 to 4x Pcie for extra GPU Power Question -- 2026-01-16
  34. New version of Raspberry Pie Generative AI card (HAT+ 2) -- 2026-01-16
  35. Qualcomm's RISC-Ventana Fusion -- 2026-01-14
  36. An Open Source Electromagnetic Resonance Tablet -- 2026-01-14
  37. sardanioss/httpcloak -- 2026-01-13
  38. Making a CRT Spin Right Round, Round, Round -- 2026-01-13
  39. [Editorial] https://www.linkedin.com/posts/stephenbklein_the-age-of-pretend-the-ai-industry-just-spent-activity-7415779694509219842-8OkK -- 2026-01-12
  40. [Editorial] https://www.linkedin.com/posts/reuvencohen_most-people-talk-about-gpus-as-if-they-are-activity-7415778737486483456-7DQK -- 2026-01-12
  41. Dual rx 9070 for LLMs? -- 2026-01-09
  42. Opus 4.5 head-to-head against Codex 5.2 xhigh on a real task. Neither won. -- 2026-01-09
  43. ARCANGEL0/EVA -- 2026-01-07
  44. k2-fsa/Flow2GAN -- 2026-01-07
  45. GNU Ddrescue 1.30 Released -- 2026-01-07
  46. Debunking the AI food delivery hoax that fooled Reddit -- 2026-01-07
  47. Who Cares about the Baltic Jammer? Terrestrial Navigation in Baltic Sea Region [video] -- 2025-12-30
  48. Streaming Music to Cassette -- 2025-12-30
  49. [Editorial] https://zymtrace.com/ -- 2025-12-22
  50. PLX/PEX PCIe 4.0 seems to help for LLMs and P2P! I.e. PEX88096 (1 PCIe 4.0 X16 to 5 PCIE 4.0 X16) and others, and comparison vs bifurcation. -- 2025-12-22
  51. Qubes OS 4.3.0 has been released -- 2025-12-22
  52. SeeSee21/Z-Image-Turbo-AIO -- 2025-12-22
  53. Designing a CPU for Native BASIC -- 2025-12-22
  54. Memory at the Speed of Light -- 2025-12-19
  55. Key Highlights of NVIDIA’s New Model: Nemotron 3 -- 2025-12-17
  56. The Open Evaluation Standard: Benchmarking NVIDIA Nemotron 3 Nano with NeMo Evaluator -- 2025-12-17
  57. [Editorial] https://docs.unsloth.ai/new/deploy-llms-phone -- 2025-12-17
  58. Full AI Voice Agent (Whisper + 700M LLM + NeuTTS) running entirely on an Nvidia Jetson Orin Nano ($250 hardware) with no internet access -- 2025-12-17
  59. 8x Radeon 7900 XTX Build for Longer Context Local Inference - Performance Results & Build Details -- 2025-12-17
  60. mistralai/Ministral-3-8B-Instruct-2512 -- 2025-12-12
  61. Benchmarked A100 vs H100 local storage for Multi-GPU loading. The Gen4 bottleneck is brutal for cold starts. -- 2025-12-11
  62. My first OSS project! Observability & Replay for AI agents -- 2025-12-11
  63. Ollama + OpenVINO -- 2025-12-11
  64. OpenTelemetry Distribution Builder -- 2025-12-11
  65. Miles + FSDP2 = Megatron-Level Performance with More Flexibility -- 2025-12-10
  66. [help] RTX pro 6000 - llama.cpp Qwen3-Next-80B maxes out at 70% gpu? -- 2025-12-09
  67. Rate/roast my setup -- 2025-12-09
  68. dynamic allocation of less used experts to slower memory -- 2025-12-08
  69. I built a personal assistant script, and the CPU inference speed beats my Llama setup. -- 2025-12-08
  70. At What Point Does Owning GPUs Become Cheaper Than LLM APIs ? I -- 2025-12-05
  71. A Deep Dive into Using PIO and DMA on the RP2350 -- 2025-12-05
  72. CUA Local Opensource -- 2025-12-05
  73. AMD PRO 395 Radeon 8060S Graphics - Any recent Benchmarks -- 2025-12-04
  74. LoRa Repeater Lasts 5 Years on PVC Pipe and D Cells -- 2025-12-03
  75. LM Studio beta supports Qwen3 80b Next. -- 2025-12-03
  76. 4xRTX 4000 Pro Blackwell vs 1x6000 RTX Pro -- 2025-12-02
  77. moonshotai/Kimi-Linear-48B-A3B-Instruct -- 2025-12-02
  78. orabazes/FLUX.2-dev-GGUF -- 2025-12-02
  79. Transformers v5: Simple model definitions powering the AI ecosystem -- 2025-12-02
  80. You can now do FP8 reinforcement learning locally! (<5GB VRAM) -- 2025-12-01
  81. stepfun-ai/Step-Audio-R1 -- 2025-11-28
  82. Strix Halo batching with tensor parallel and pipeline parallel using vllm benchmarked -- 2025-11-28
  83. RTX 3090 vs RX 7900 with ROCm, also Vulcan -- 2025-11-26
  84. moonshotai/Kimi-K2-Thinking -- 2025-11-26
  85. Can an expert chime in and explain what is holding Vulkan back from becoming the standard API for ML? -- 2025-11-25
  86. Ollama Not Using GPU on RTX 5070 Ti (Blackwell) -- 2025-11-25
  87. Microsoft makes Zork open-source -- 2025-11-25
  88. Possibly-Smallest ESP32 Board Uses Smallest-Footprint Parts -- 2025-11-25
  89. The Qtile Window Manager: A Python-Powered Tiling Experience -- 2025-11-25
  90. Most Stable Raspberry Pi? Better NTP with Thermal Management -- 2025-11-25
  91. zai-org/Glyph -- 2025-11-21
  92. tlennon-ie/qwen-edit-skin -- 2025-11-21
  93. Mating Cycles: Engineering Connectors to Last -- 2025-11-21
  94. Commmunication-Efficient and Accurate Approach for Aggregation in Federated Low-Rank Adaptation -- 2025-11-21
  95. Gemini 2.5 Flash Image / Nano Banana Tutorial -- 2025-11-21
  96. Built a tool to solve the "how much GPU do I actually need?" problem for LLM deployment -- 2025-11-20
  97. New Parameter Browser added to Llamacpp Model Launcher! experimental model parameter tuning(window/cuda only) -- 2025-11-20
  98. cuda device list mismatch - ggml_cuda_init / ubuntu - significance to using --main-gpu flag -- 2025-11-20
  99. What Size of LLM Can 4x RTX 5090 Handle? (96GB VRAM) -- 2025-11-20
  100. DOE gives Microsoft partner $1B loan to restart Three Mile Island reactor -- 2025-11-20
  101. PCIE Bifurcation - More than 4 GPUs on a consumer motherboard -- 2025-11-18
  102. Qual a melhor GPU para o llama 3(.1 ou .3) -- 2025-11-18
  103. PyTorch 2.10.0a0 w/ Blackwell (sm_120) Support — Patched & Packaged for One-Command Install -- 2025-11-17
  104. Half-trillion parameter model on a machine with 128 GB RAM + 24 GB VRAM -- 2025-11-17
  105. Real-Time BART in a Box Smaller Than Your Coffee Mug -- 2025-11-17
  106. Tiny386 on an Espressif ESP32-S3 -- 2025-11-14
  107. [Editorial] Balancing order, freedom, and technology -- 2025-11-12
  108. AMD warns the Intel and Nvidia partnership is a risk to its business -- 2025-11-12
  109. A Pentium In Your Hand -- 2025-11-12
  110. Hardware recommendations for Ollama for homelab -- 2025-11-10
  111. When Your Hash Becomes a String: Hunting Ruby's Million-to-One Memory Bug -- 2025-11-07
  112. Maude 3 Manual -- 2025-11-07
  113. [Editorial] https://www.suffsyed.com/futurememo/the-design-leaders-are-lying-to-you -- 2025-11-07
  114. Adding a RTX 5080 into a 2U server with OcuLink -- 2025-11-06
  115. Why does Image Recognition work in llama-server but not through Open WebUI? -- 2025-11-06
  116. 2025 Component Abuse Challenge: A Piezo Disk Powers A Transmitter -- 2025-11-06
  117. [D] It turns out WDDM driver mode is making our RAM - GPU transfer extremely slower compared to TCC or MCDM mode. Anyone has figured out the bypass NVIDIA software level restrictions? -- 2025-11-05
  118. [Editorial] https://blog.peerllm.com/2025/11/02/announcing-v0.7.6.html -- 2025-11-04
  119. Faster llama.cpp ROCm performance for AMD RDNA3 (tested on Strix Halo/Ryzen AI Max 395) -- 2025-11-04
  120. KTransformers Open Source New Era: Local Fine-tuning of Kimi K2 and DeepSeek V3 -- 2025-11-04
  121. ZOZO's Contact Solver for physics-based simulations -- 2025-11-01
  122. Need advice on building a GPU-based render/Al compute setup: Unsure about hardware direction -- 2025-11-01
  123. Flamingo 3 released in safetensors -- 2025-10-31
  124. My LLM-powered text adventure needed a dynamic soundtrack, so I'm training a MIDI generation model to compose it on the fly. Here's a video of its progress so far. -- 2025-10-31
  125. Optimizing gpt-oss-120B on AMD RX 6900 XT 16GB: Achieving 19 tokens/sec -- 2025-10-31
  126. [Editorial] https://tee.fail/ -- 2025-10-29
  127. Satellite Snooping Reveals Sensitive Unencrypted Data -- 2025-10-29
  128. GPT-OSS-20b TAKE THE HELM! Further experiments in autopilot. -- 2025-10-28
  129. 5060ti chads... ram overclocking, the phantom menace -- 2025-10-28
  130. Batch inference locally on 4080 -- 2025-10-28
  131. Qwen/Qwen3-VL-30B-A3B-Instruct-FP8 -- 2025-10-28
  132. Esonhugh/go-rex-java -- 2025-10-27
  133. SuperSonic – SuperCollider's audio engine in a Web AudioWorklet -- 2025-10-27
  134. 3-way FTP: Pushing files around with silly and unusual methods -- 2025-10-27
  135. HRV Gets Home Automation Upgrades -- 2025-10-27
  136. Looking for some advice/input for LLM and more -- 2025-10-26
  137. AlpinDale/ssh-dashboard -- 2025-10-26
  138. GPU 101 and Triton kernels -- 2025-10-26
  139. Llama.cpp is looking for M5 Neural Accelerator performance testers -- 2025-10-24
  140. NVIDIA sent me a 5090 so I can demo Qwen3-VL GGUF -- 2025-10-24
  141. AMD ROCm 7.9 and dwindling GPU support -- 2025-10-24
  142. I got Kokoro TTS running natively on iOS! 🎉 Natural-sounding speech synthesis entirely on-device -- 2025-10-22
  143. Mobile fully on device inference AI chat app with RAG support -- 2025-10-22
  144. I am generally impressed by iPhone 17 GPU -- 2025-10-22
  145. ⚡ Gemma 3 1B Smart Q4 — Bilingual (IT/EN) Offline AI for Raspberry Pi 4/5 -- 2025-10-22
  146. Valve Developer Contributes Major Improvement To RADV Vulkan For Llama.cpp AI -- 2025-10-22
  147. I want to build an AI inference server for 72B models...what should I do? -- 2025-10-22
  148. PlayDiffusion finetune for audio inpainting non-verbal tags -- 2025-10-21
  149. Nvidia has produced the first Blackwell wafer on US soil -- 2025-10-21
  150. Show HN: Syna – Minimal ML and RL Framework Built from Scratch with NumPy -- 2025-10-21
  151. C Project Turns Into Full-Fledged OS -- 2025-10-21
  152. Local VLLM Accelerated Evolution Framework -- 2025-10-19
  153. Nvidia DGX Spark, is it worth ? -- 2025-10-19
  154. Open source streaming STT (Parakeet + Silero + Pipecat Smart Turn) -- 2025-10-19
  155. Turn ChatGPT into a real-time meeting assistant (via MCP + Apps SDK) -- 2025-10-19
  156. [Editorial] Claude Skills are awesome, maybe a bigger deal than MCP -- 2025-10-18
  157. NVIDIA DGX Spark Benchmarks -- 2025-10-18
  158. Should I add another 5060 Ti 16GB or two? Already had 1 x 5070 Ti and 3 x 5060 Ti 16G -- 2025-10-18
  159. Last week in Multimodal AI - Local Edition -- 2025-10-16
  160. BosonAI's Higgs-Llama-3-70B AWQ Quantized (140GB → 37GB) -- 2025-10-16
  161. Worthwhile using Ollama without nVidia? -- 2025-10-16
  162. LiquidAI/LFM2-8B-A1B-GGUF -- 2025-10-16
  163. The Entire Process of Building an Open Source Analog ASIC -- 2025-10-15
  164. Smart Bulbs Are Turning Into Motion Sensors -- 2025-10-11
  165. Qwen3-VL-30B-A3B-Thinking GGUF with llama.cpp patch to run it -- 2025-10-10
  166. What and when 7900xtx is boosted? -- 2025-10-10
  167. Script to install a bunch of AI or Dev tools automatically.. what can I add to it or improve? -- 2025-10-10
  168. Qwen/Qwen3-VL-30B-A3B-Instruct -- 2025-10-10
  169. BenchVolt PD: USB PD Meets Benchtop Precision -- 2025-10-10
  170. Sneak Preview: Ollama Bench -- 2025-10-08
  171. When Curl Works but IntelliJ Doesn't: The Ollama Connection Mystery -- 2025-10-08
  172. XiangShan Vector Floating-Point Unit Design -- 2025-10-07
  173. google/timesfm-2.5-200m-pytorch -- 2025-10-07
  174. svg-project/flash-kmeans -- 2025-10-07
  175. [Editorial] Agentic Tribe -- 2025-10-06
  176. AlexanderYastrebov/onion-vanity-address -- 2025-10-06
  177. Open Printer is an open-source inkjet with DRM-free ink and no subscriptions -- 2025-10-06
  178. Yes, Gemini, A Wii Server Is Possible -- 2025-10-06
  179. Running Qwen3-VL-235B (Thinking & Instruct) AWQ on vLLM -- 2025-10-06
  180. Granite 4 H Tiny Q8 in RTX 3090, It's a context king. -- 2025-10-06
  181. Video2X 6.x — open-source upscaler + frame interpolation (Anime4K v4 / Real-ESRGAN / Real-CUGAN / RIFE) 🚀 -- 2025-10-06
  182. For llama.cpp/ggml AMD MI50s are now universally faster than NVIDIA P40s -- 2025-10-03
  183. MSI EdgeXpert Compact AI Supercomputer Based on NVIDIA DGX Spark -- 2025-10-03
  184. Kairos: Immutable Distro for K8s at the Edge -- 2025-10-03
  185. Nvidia Has Been Supplying NDA'ed Docs to Red Hat for Helping NVK Driver -- 2025-10-03
  186. Mini Laptop Needs Custom Kernel -- 2025-10-03
  187. K2-Think 32B - Reasoning model from UAE -- 2025-10-03
  188. MoonshotAI/checkpoint-engine -- 2025-10-03
  189. Whither the Chip Shortage? -- 2025-10-02
  190. [Editorial] https://github.com/emcie-co/parlant -- 2025-10-02
  191. Built a persistent memory system for LLMs - 3 months testing with Claude/Llama -- 2025-10-02
  192. Do I need to run /init on a repo if I already have AGENTS.md? -- 2025-10-02
  193. Upgrade to Kernel 6.16.9 solves 15.5GB Stix Halo memory limitation -- 2025-09-30
  194. Seeking Advice: Best Model + Framework for Max Tokens/sec on Dual L40S (Testing Rig) -- 2025-09-30
  195. OpenHelix-Team/VLA-Adapter -- 2025-09-29
  196. Reviving a Scrapped Sound Blaster 2.0 ISA Soundcard -- 2025-09-29
  197. kijai/ComfyUI-WanAnimatePreprocess -- 2025-09-29
  198. More money than brains... building a workstation for local LLM. -- 2025-09-28
  199. Fully-Local AI Agent Runs on Raspberry Pi, With a Little Patience -- 2025-09-28
  200. PC memory costs to climb as fabs chase filthy lucre in servers and HBM -- 2025-09-27
  201. Qwen3 235b Q2 with Celeron, 2x8gb of 2400 RAM, 96GB VRAM @ 18.71 t/s -- 2025-09-27
  202. This $5,999 RTX PRO 6000 Ebay listing is a scam, right? -- 2025-09-26
  203. Accelerating Local AI on Consumer GPUs: A Hardware-Aware Dynamic Strategy for YOLOv10s -- 2025-09-26
  204. How bad to have RTX Pro 6000 run at PCIE x8? -- 2025-09-24
  205. I Upgrade 4090's to have 48gb VRAM: Comparative LLM Performance -- 2025-09-23
  206. Some things I learned about installing flash-attn -- 2025-09-23
  207. Comparison H100 vs RTX 6000 PRO with VLLM and GPT-OSS-120B -- 2025-09-23
  208. My self-hosted app uses local Whisper for transcription and a local LLM for summaries & event extraction -- 2025-09-20
  209. I open-sourced a text2SQL RAG for all your databases and local models -- 2025-09-20
  210. firstbatchxyz/mem-agent-mcp -- 2025-09-20
  211. yangzhou24/OmniWorld -- 2025-09-18
  212. Analog Optical Computer for Inference and Combinatorial Optimization -- 2025-09-18
  213. Why is the name of a wireless mouse hard-coded into Windows Bluetooth drivers? -- 2025-09-17
  214. Chesars/whatsapp-mcp -- 2025-09-15
  215. Claude’s memory architecture is the opposite of ChatGPT’s -- 2025-09-15
  216. The Internet Will Be More Dead Than Alive Within 3 Years, Trend Shows | All signs point to a future internet where bot-driven interactions far outnumber human ones. -- 2025-09-15
  217. New "speech" mode in Imagine... -- 2025-09-15
  218. [vllm] Hints to run Qwen3-235B MoE on 8x AMD mixed cards! -- 2025-09-12
  219. Inference for 24 people with a 5000€ budget -- 2025-09-12
  220. $142 upgrade kit and spare modules turn Nvidia RTX 4090 24GB to 48GB AI card -- 2025-09-12
  221. Tricks from OpenAI gpt-oss YOU 🫵 can use with transformers -- 2025-09-12
  222. Finally: 3090 Successor: 5070 Ti super 24Gb 800$ -- 2025-09-08
  223. Fantastic pretraining optimizers and where to find them -- 2025-09-08
  224. Qwen/Qwen3-4B-Instruct-2507 -- 2025-09-08
  225. moonshotai/Kimi-K2-Instruct-0905 -- 2025-09-08
  226. Tenstorrent p150a tested against RTX5090, RTX3090, A100, H100 by Russian blogger -- 2025-09-08
  227. unsloth/gemma-3-270m-it-GGUF -- 2025-09-08
  228. LiquidGEMM: Seems interesting -- 2025-09-07
  229. How to use a Hugging Face embedding model in Ollama -- 2025-09-07
  230. Wal3: A Write-Ahead Log for Chroma, Built on Object Storage -- 2025-09-07
  231. Running LLM Locally with Ollama + RAG -- 2025-09-07
  232. yaof20/Flash-RL -- 2025-09-07
  233. Vulkan back ends, what do you use? -- 2025-09-06
  234. Relaxed-System-Lab/Flash-Sparse-Attention -- 2025-09-06
  235. Intel Files Patent for "Software Defined Super Cores" -- 2025-09-04
  236. The Sense and Nonsense of Virtual Power Plants -- 2025-09-04
  237. Configurable Stereo Preamp from Matrix Switch -- 2025-09-03
  238. Raspberry Pi 5 support (OpenBSD) -- 2025-09-03
  239. Measuring Nanoparticles by Scattering a Laser -- 2025-09-03
  240. gpt-oss:120b running on an AMD 7800X3D CPU and a 7900XTX GPU -- 2025-09-03
  241. unsloth/Qwen3-Coder-480B-A35B-Instruct-GGUF -- 2025-09-03
  242. I tried almost every tts model on my ryzen 7 5000 series 16gb ram rtx 3060 laptop 6-8GB Vram -- 2025-09-02
  243. devnen/Kitten-TTS-Server -- 2025-09-02
  244. TheDrummer is on fire!!! -- 2025-09-01
  245. NVIDIA-Nemotron-Nano-12B-v2 -- 2025-09-01
  246. Sparrow: Custom language model architecture for microcontrollers like the ESP32 -- 2025-08-30
  247. Claude Memory Lazy Method: The Graduation Path (From 4 Prompts to 1) -- 2025-08-30
  248. Lynx-R1 Headset Makers Release 6DoF SLAM Solution As Open Source -- 2025-08-30
  249. RTX PRO 6000 MAX-Q Blackwell for LLM -- 2025-08-28
  250. A PLL For Perfect Pitch -- 2025-08-27
  251. Local Inference for Very Large Models - a Look at Current Options -- 2025-08-27
  252. Is there any way to run 100-120B MoE models at >32k context at 30 tokens/second without spending a lot? -- 2025-08-26
  253. Right GPU for AI research -- 2025-08-26
  254. Help me decide between these two pc builds -- 2025-08-25
  255. Faster prefill on CPU-MoE IK-llama? -- 2025-08-23
  256. Llamarunner, a llama.cpp manager and runner (with user presets!) -- 2025-08-23
  257. what's "load_in_4bit" in unsloth LORA training? -- 2025-08-23
  258. merve/smol-vision -- 2025-08-23
  259. city96/Qwen-Image-gguf -- 2025-08-23
  260. Menlo/Lucy-128k -- 2025-08-23
  261. NVIDIA just accelerated output of OpenAI’s gpt-oss-120B by nearly 2x -- 2025-08-23
  262. Is openrouters tokens per second reading super bugged? -- 2025-08-22
  263. It’s a Pi, But it’s not Quite a Raspberry Pi -- 2025-08-22
  264. Security Researchers Find XZ Utils Backdoored Debian Images on Docker Hub -- 2025-08-20
  265. Open Source Lithium-Titanate Battery Management System -- 2025-08-20
  266. Prospect Theory Fails for LLMs: Revealing Instability of Decision-Making under Epistemic Uncertainty -- 2025-08-20
  267. Need Help: So-Vits-SVC Vibrated/Glitchy Output + Source Vocal Has Residual Music (G=98k, Diff=57k) -- 2025-08-19
  268. GDPR meant nothing: chat control ends privacy for the EU [video] -- 2025-08-19
  269. From Zero to GPU: A Guide to Building and Scaling Production-Ready CUDA Kernels -- 2025-08-19
  270. PSA: Don't waste time trying Gemma 3 27B on V100s - it's architecturally impossible -- 2025-08-16
  271. People with MacBook Pro with 36gb of memory, which models you are running for coding? -- 2025-08-16
  272. SparcStation 1+ Finally Gets Attention -- 2025-08-15
  273. llamacpp+ROCm7 beta is now supported on Lemonade -- 2025-08-10
  274. gpt-oss 120B runs ~13tps on laptop with igpu -- 2025-08-10
  275. Throwing a MI50 32Gb in a gaming pc -- 2025-08-10
  276. Suggestion for upgrading hardware for MOE inference and fine-tuning. -- 2025-08-09
  277. Best models under 16GB?? -- 2025-08-09
  278. Explicit tail calls are now available on Rust Nightly (become keyword) -- 2025-08-09
  279. HuggingFaceTB/SmolLM3-3B -- 2025-08-09
  280. mistralai/Magistral-Small-2507 -- 2025-08-09
  281. What to do with a NVIDIA Tesla V100S 32GB GPU -- 2025-08-07
  282. dsekz/chrome-x-browser-validation-header -- 2025-08-07
  283. MorDavid/BruteForceAI -- 2025-08-07
  284. Show HN: Aura – Like robots.txt, but for AI actions -- 2025-08-07
  285. I built a GitHub scanner that automatically discovers AI tools using a new .awesome-ai.md standard I created -- 2025-08-07
  286. Show HN: Tambo – build generative UX web apps -- 2025-08-06
  287. Brilliant Labs Has New Smart Glasses, With a New Display -- 2025-08-06
  288. Say hello to `hf`: a faster, friendlier Hugging Face CLI ✨ -- 2025-08-06
  289. [Editorial] Voice ai and voice agents, howto -- 2025-08-05
  290. [Editorial] You don’t need a WebRTC server for your voice agents -- 2025-08-05
  291. Waiting on direct MCP integration—dev team, got a roadmap update? -- 2025-08-05
  292. GLM-4.5 llama.cpp PR is nearing completion -- 2025-08-05
  293. glm-4.5-Air appreciation poist - if you have not done so already, give this model a try -- 2025-08-05
  294. peteromallet/Flux-Kontext-InScene -- 2025-08-02
  295. A Dual-Screen Cyberdeck To Rule Them All -- 2025-08-02
  296. NVIDIA RTX PRO 4000 Blackwell - 24GB GDDR7 -- 2025-08-02
  297. Help for new LLM Rig -- 2025-08-02
  298. bytillo/spyder-osint -- 2025-08-01
  299. Secure boot certificate rollover is real but probably won't hurt you -- 2025-08-01
  300. 2025 One Hertz Challenge: RPI TinynumberHat9 -- 2025-08-01
  301. Need help deciding on GPU options for inference -- 2025-07-31
  302. Hey AI, Generate Me a Hardware Code! Agentic AI-based Hardware Design & Verification -- 2025-07-30
  303. Building a quiet LLM machine for 24/7 use, is this setup overkill or smart? -- 2025-07-30
  304. Best Local LLM + Hardware Build for Coding With a $15k Budget (2025) -- 2025-07-29
  305. Optimizing inference on GPU + CPU -- 2025-07-29
  306. I got Ollama models running locally and exposed them via a public API with one command -- 2025-07-29
  307. I want to use llama 7b to check if a 5-7 sentence paragraph contains a given subject, what's the minimum GPU I need? -- 2025-07-29
  308. Teufel Introduces an Open Source Bluetooth Speaker -- 2025-07-29
  309. Build advice: Consumer AI workstation with RTX 3090 + dual MI50s for LLM inference and Stable Diffusion (~$5k budget) -- 2025-07-26
  310. RTX 5090 (32GB VRAM) - Full Fine-Tuning: What Can I Expect? -- 2025-07-26
  311. Is there a reason to prefer Nvidia over AMD for programming use cases? -- 2025-07-26
  312. Entry GPU options - 5060 8GB enough to play with? -- 2025-07-26
  313. How open-source models like Mistral, Devstral, and DeepSeek R1 compare for coding -- 2025-07-26
  314. mistralai/Voxtral-Mini-3B-2507 -- 2025-07-26
  315. Say hello to `hf`: a faster, friendlier Hugging Face CLI ✨ -- 2025-07-26
  316. Playtron's Linux-Based GameOS Hits the Road with 1.0 -- 2025-07-26
  317. Remembering Chiptunes, the Demoscene and the Illegal Music of Keygens -- 2025-07-26
  318. How are people staging AI training datasets from NVMe → DDR5 → GPU VRAM for fine-tuning on RTX 5090s? -- 2025-07-25
  319. Running Qwen3 235B-A22B 2507 on a Threadripper 3970X + 3x RTX 3090 Machine at 15 tok/s -- 2025-07-25
  320. mistral-small3.2:latest 15B takes 28GB VRAM? -- 2025-07-25
  321. Looking for help with terrible vLLM performance -- 2025-07-25
  322. What upgrade option is better with $2000 available for my configuration? -- 2025-07-24
  323. Foundry competition heats up as Japan's Rapidus says 2nm tech on track for 2027 -- 2025-07-23
  324. AI Model Juggler automatically and transparently switches between LLM and image generation backends and models -- 2025-07-22
  325. Looking to possibly replace my ChatGPT subscription with running a local LLM. What local models match/rival 4o? -- 2025-07-22
  326. Nvidia GTX-1080Ti 11GB Vram -- 2025-07-22
  327. I messed up my brother's Llama AI workstation.. looking for advice -- 2025-07-22
  328. How do Claude Code token counts translate to “prompts” for usage limits? -- 2025-07-22
  329. Claude is IN the files. -- 2025-07-21
  330. Bitcoin Devs Float Proposal to Freeze Quantum-Vulnerable Addresses -- 2025-07-21
  331. OpenSCAD: The Programmers Solid 3D CAD Modeller -- 2025-07-21
  332. Software Defined Retro ROMs -- 2025-07-21
  333. Arc Virtual Cell Challenge: A Primer -- 2025-07-21
  334. Recommend hardware for my use case? -- 2025-07-20
  335. Best Hardware Setup to Run DeepSeek-V3 670B Locally on $40K–$80K? -- 2025-07-20
  336. e6a5/flow -- 2025-07-20
  337. Improve Your KiCad Productivity With These Considered Shortcut Keys -- 2025-07-20
  338. LGAI-EXAONE/EXAONE-4.0-1.2B -- 2025-07-18
  339. This SSD Will Self Destruct in Ten Seconds… -- 2025-07-18
  340. Locally Running AI model with Intel GPU -- 2025-07-18
  341. Defeating Memory Leaks with Zig Allocators -- 2025-07-17
  342. OpenDPDv2: A Unified Learning and Optimization Framework for Neural Network Digital Predistortion -- 2025-07-17
  343. An Open-Concept 3D Printer Using Cantilever Arms -- 2025-07-17
  344. What kind of rig would you build with a 5k budget for local LLM? -- 2025-07-16
  345. What is your "perfect" £10,000 for Local LLM, Gaming, plex with the following conditional and context. -- 2025-07-16
  346. How to use Claude code -- 2025-07-16
  347. unsloth/Kimi-K2-Instruct-GGUF -- 2025-07-16
  348. moonshotai/Kimi-K2-Base -- 2025-07-16
  349. It's been a while, I'm out of date, suggest me a model -- 2025-07-16
  350. i need the best local llm i can run on my gaming pc -- 2025-07-16
  351. Arduino Saves Heat Pump -- 2025-07-16
  352. Japan Achieves World Record 1.02 Petabits per Second Internet Speed -- 2025-07-15
  353. Jcorp Nomad: ESP32-S3 Offline Media Server in a Thumbdrive -- 2025-07-15
  354. Qwen3-235B-A22B @ 0.7t/s. Hardware or configuration bottleneck? -- 2025-07-15
  355. Building a silent, budget 4-GPU LLM workstation—1×3090 + 3×P40, need advice -- 2025-07-15
  356. Enough resources for light AI workloads? -- 2025-07-15
  357. What can I expect from current amd igpu performance? -- 2025-07-15
  358. Nvidia RTX Pro 6000 (96 Gb) vs Apple M3 Ultra (512 Gb) -- 2025-07-14
  359. How fast is inference when utilizing DDR5 and PCIe 5.0x16? -- 2025-07-14
  360. DIY Navigation System Floats this Boat -- 2025-07-12
  361. Show HN: Pangolin – Open source alternative to Cloudflare Tunnels -- 2025-07-11
  362. SUS Lang: The SUS Hardware Description Language -- 2025-07-11
  363. Embedded USB Debug for Snapdragon -- 2025-07-11
  364. Creating custom kernels for the AMD MI300 -- 2025-07-10
  365. How are you selecting LLMs? -- 2025-07-10
  366. Getting started with local AI -- 2025-07-10
  367. How are commercial dense models so much faster? -- 2025-07-09
  368. Best model for a RX 6950xt? -- 2025-07-07
  369. SSD Upgrade for Mac Mini M4 -- 2025-07-06
  370. Nvidia DGX Spark - what's the catch? -- 2025-07-06
  371. Kyutai TTS is here: Real-time, voice-cloning, ultra-low-latency TTS, Robust Longform generation -- 2025-07-04
  372. Privacy preserving ChatGPT/Claude voice mode alternative -- 2025-07-04
  373. Subpixel Rendering For Impossibly Small Terminal Text -- 2025-07-04
  374. Apple Intelligence on device model available to developers -- 2025-07-03
  375. Is AMD Ryzen AI Max+ 395 really the only consumer option for running Llama 70B locally? -- 2025-07-02
  376. Ollama - Windows 11 > LXC Docker - Openwebui = constant BSOD with RTX 5090 Ventus on driver 576.80 -- 2025-07-02
  377. Running Open WebUI with NVIDIA GPU Support? -- 2025-07-02
  378. martinbowling/thinkchain -- 2025-06-25
  379. WireGuard vanity keygen -- 2025-06-25
  380. zeptoforth: A not-so-small Forth for ARM Cortex-M -- 2025-06-25
  381. Voyager: Real-Time Splatting City-Scale 3D Gaussians on Your Phone -- 2025-06-21
  382. Optimized Chatterbox TTS (Up to 2-4x non-batched speedup) -- 2025-06-20
  383. [Discussion] Thinking Without Words: Continuous latent reasoning for local LLaMA inference – feedback? -- 2025-06-20
  384. Created a more accurate local speech-to-text tool for your Mac -- 2025-06-20
  385. Attention by Hand - Practice attention mechanism on an interactive webpage -- 2025-06-20
  386. Major update to my voice extractor (speech dataset creation program) -- 2025-06-20
  387. Building a Text Adventure Game with Persistent AI Agents Using Ollama -- 2025-06-20
  388. Azure OpenAI with latest version of NVIDIA'S Nemo Guardrails throwing error -- 2025-06-20
  389. GitHub RAG MCP Server - A GitIngest alternative for any IDE -- 2025-06-20
  390. Ruby on Rails Audit Complete -- 2025-06-20
  391. largest context window model for 24GB VRAM? -- 2025-06-20
  392. 100ps time resolution with thin silicon pixel detectors and a SiGe HBT amplifier -- 2025-06-18
  393. 0-$\pi$ quantum transition in a carbon nanotube Josephson junction: universal phase dependence and orbital degeneracy -- 2025-06-18
  394. Learning (The Basics of) Nftables -- 2025-06-18
  395. Linux Cgroup from First Principles -- 2025-06-18
  396. 0.82 um 105 W diode-pumped thulium-doped all silica fiber laser -- 2025-06-17
  397. Chinese AI firms smuggling suitcases full of hard drives to dodge US chip curbs -- 2025-06-17
  398. UPDATE: Inference needs nontrivial amount of PCIe bandwidth (8x RTX 3090 rig, tensor parallelism) -- 2025-06-16
  399. IQ1_Smol_Boi -- 2025-06-16
  400. Qwen releases official MLX quants for Qwen3 models in 4 quantization levels: 4bit, 6bit, 8bit, and BF16 -- 2025-06-16
  401. Seeking Help Setting Up a Local LLM Assistant for TTRPG Worldbuilding + RAG on Windows 11 -- 2025-06-16
  402. New VS Code Pair Programming Extension, Need Help Testing -- 2025-06-16
  403. Claude-Trace -- 2025-06-16
  404. Common Corpus: The Largest Collection of Ethical Data for LLM Pre-Training -- 2025-06-16
  405. best fine tuned local LLM for Github Copilot Agent specificaly -- 2025-06-16
  406. A Simulation in C++ of Joseph Weizenbaum's 1966 Eliza -- 2025-06-16
  407. 0D-2D Heterostructure for making very Large Quantum Registers using itinerant Bose-Einstein Condensate of Excitons -- 2025-06-16
  408. 100-mJ class, sub-two-cycle, carrier-envelope phase-stable dual-chirped optical parametric amplification -- 2025-06-16
  409. Olow304/memvid -- 2025-06-14
  410. mistralai/Magistral-Small-2506_gguf -- 2025-06-14
  411. Sublime Text Build 4200 and Future Plugin Changes -- 2025-06-14
  412. Carimbo: Minimal 2D game engine in modern C++20 with SDL, scriptable in Lua -- 2025-06-14
  413. 1000 Days to First Light: Construction of the Perth-Lowell Telescope Facility 1968-71 -- 2025-06-14
  414. Semantic Search Demo Using Qwen3 0.6B Embedding (w/o reranker) in-browser Using transformers.js -- 2025-06-13
  415. 2x Instinct MI50 32G running vLLM results -- 2025-06-13
  416. Self-hosted GitHub Copilot via Ollama – Dual RTX 4090 vs. Chained M4 Mac Minis -- 2025-06-13
  417. Testing Claude, OpenAI and AI21 Studio for long context RAG assistant in enterprise -- 2025-06-13
  418. Deepseek-R1-0528 MLX 4 bit quant up -- 2025-06-13
  419. Open Source iOS OLLAMA Client -- 2025-06-13
  420. OpenPOWER Foundation – Open-Source / Open Hardware PowerPC CPU ISA -- 2025-06-13
  421. TIL: timeout in Bash scripts -- 2025-06-13
  422. 3x Modded 4090 48GB or RTX Pro 6000? -- 2025-06-13
  423. KwaiCoder-AutoThink-preview is a Good Model for Creative Writing! Any Idea about Coding and Math? Your Thoughts? -- 2025-06-13
  424. Faulty 120W charger analysis (Anker GAN Prime) [video] -- 2025-06-13
  425. maomaocun/dLLM-cache -- 2025-06-13
  426. AIR-THU/Asyncdriver-Tensorrt -- 2025-06-13
  427. System Prompt Learning: Teaching your local LLMs to learn problem-solving strategies from experience (optillm plugin) -- 2025-06-11
  428. GitHub - som1tokmynam/FusionQuant: FusionQuant Model Merge & GGUF Conversion Pipeline - Your Free Toolkit for Custom LLMs! -- 2025-06-11
  429. Semantic Search PoC for Hugging Face – Now with Parameter Size Filters (0-1B to 70B+) -- 2025-06-11
  430. Has anyone had success implementing a local FIM model? -- 2025-06-11
  431. Which agent-like terminal do you guys use? Something like Warp but free. -- 2025-06-11
  432. Rocm or vulkan support for AMD Radeon 780M? -- 2025-06-11
  433. What are the most important stages to learn ML properly, step by step? -- 2025-06-11
  434. What's the best open source coding agent as of now that can be run locally and can even test the created APIs by running the application and calling the endpoinst with various payloads? -- 2025-06-11
  435. Gemini 2.5: Our most intelligent models are getting even better -- 2025-06-11
  436. Hugging Face unveils two new humanoid robots -- 2025-06-11
  437. Quick reference: Configure Ollama, Open WebUI installation paths in Windows 11 -- 2025-06-11
  438. Is there an alternative to LM Studio with first class support for MLX models? -- 2025-06-11
  439. 0.52 V-mm ITO-based Mach-Zehnder Modulator in Silicon Photonics -- 2025-06-10
  440. 100 GHz Micrometer compact broadband Monolithic ITO Mach Zehnder Interferometer Modulator enabling 3500 times higher Packing Density -- 2025-06-06
  441. 0-$\pi$ phase-controllable $thermal$ Josephson junction -- 2025-06-06
  442. ByteDance Bagel 14B MOE (7B active) Multimodal with image generation (open source, apache license) -- 2025-06-05
  443. VLLM with 4x7900xtx with Qwen3-235B-A22B-UD-Q2_K_XL -- 2025-06-05
  444. I would really like to start digging deeper into LLMs. If I have $1500-$2000 to spend, what hardware setup would you recommend assuming I have nothing currently. -- 2025-06-05
  445. Having trouble getting to 1-2req/s with vllm and Qwen3 30B-A3B -- 2025-06-05
  446. Trying to get to 24gb of vram - what are some sane options? -- 2025-06-05
  447. Locally downloading Qwen pretrained weights for finetuning -- 2025-06-05
  448. Web Application Frameworks Best Suited for AI Coding Assistants - putting the chicken before the egg. -- 2025-06-05
  449. Debian AI General Resolution Withdrawn -- 2025-06-05
  450. Authors Are Accidentally Leaving AI Prompts in Their Novels -- 2025-06-05
  451. Cancelling internet & switching to a LLM: what is the optimal model? -- 2025-06-05
  452. Yappus. Your Terminal Just Started Talking Back (The Fuck, but Better) -- 2025-06-03
  453. GPU consideration: AMD Pro W7800 -- 2025-06-03
  454. Thoughts on which open source is best for what use-cases -- 2025-06-03
  455. I accidentally too many P100 -- 2025-06-03
  456. Has anyone come across a good (open source) -- 2025-06-03
  457. Need Suggestions regarding ML Laptop Configuration -- 2025-06-03
  458. Web search tool - bing decommissioning -- 2025-06-03
  459. LLM function calls don't scale; code orchestration is simpler, more effective -- 2025-06-03
  460. GitHub issues is almost the best notebook in the world -- 2025-06-03
  461. Strengths and limitations of diffusion language models -- 2025-06-03
  462. How LLM uses MCP tools setup in OpenWebUI ? -- 2025-06-03
  463. Database_url string for mysql -- 2025-06-03
  464. 10,000 km Straight-line Transmission using a Real-time Software-defined GPU-Based Receiver -- 2025-06-02
  465. An Almost Pointless Exercise in GPU Optimization -- 2025-05-31
  466. The Windows Registry Adventure #7: Attack surface analysis -- 2025-05-31
  467. DuckLake: SQL as a Lakehouse Format -- 2025-05-31
  468. 1000x Faster Camera and Machine Vision with Ordinary Devices -- 2025-05-31
  469. 0.75 Gbit/s high-speed classical key distribution with mode-shift keying chaos synchronization of Fabry-Perot lasers -- 2025-05-31
  470. Building a plug-and-play vector store for any data stream (text, audio, video, etc.)—searchable by your LLM via MCP -- 2025-05-29
  471. Building a real-world LLM agent with open-source models—structure > prompt engineering -- 2025-05-29
  472. New LocalLLM Hardware complete -- 2025-05-29
  473. Parameter-Efficient Fine-Tuning (PEFT) Explained -- 2025-05-29
  474. LLM help for recovering deleted data? -- 2025-05-29
  475. AI Runner v4.10.0 Release Notes -- 2025-05-29
  476. Unpopular opinion: RAG is actively hurting your coding agents -- 2025-05-29
  477. Teal – A statically-typed dialect of Lua -- 2025-05-29
  478. I think it's time to give Nix a chance -- 2025-05-29
  479. deepseek-ai/DeepSeek-R1-0528 -- 2025-05-29
  480. AM5 or TRX4 for local LLMs? -- 2025-05-29
  481. Catalog of Novel Operating Systems -- 2025-05-28