Fine-Tuning

LoRA, RLHF, GRPO, model adaptation, training techniques

188 articles across 72 editions

Articles

  1. [Editorial] -- 2026-02-24
  2. Anthropic Accuses DeepSeek, Moonshot AI, and MiniMax of Creating 24,000 Fake Claude Accounts -- 2026-02-24
  3. Gemini 3.1 Pro -- 2026-02-23
  4. 15 years of FP64 segmentation, and why the Blackwell Ultra breaks the pattern -- 2026-02-20
  5. Nvidia and OpenAI abandon unfinished $100B deal in favour of $30B investment -- 2026-02-20
  6. Google releases Gemini 3.1 Pro with Benchmarks -- 2026-02-20
  7. praetorian-inc/brutus -- 2026-02-19
  8. [Editorial] Unicornscan Getting Started -- 2026-02-19
  9. [Editorial] Unicornscan Alicorn -- 2026-02-19
  10. What Your Bluetooth Devices Reveal About You -- 2026-02-19
  11. [Editorial] When everyone can build software, who learns well? -- 2026-02-19
  12. Sonnet 4.6 feels like Opus 4.5 at Sonnet pricing -- 2026-02-19
  13. Anthropic Raises $30,000,000,000 As Run-Rate Revenue Grew 10x Annually Over Three Years -- 2026-02-19
  14. REASONING AUGMENTED RETRIEVAL (RAR) is the production-grade successor to single-pass RAG -- 2026-02-19
  15. Qwen Released Qwen 3.5 397B and Qwen 3.5 Plus! -- 2026-02-17
  16. Qwen3.5 NVFP4 (Blackwell) is up! -- 2026-02-17
  17. Running Gemma 3n E2B natively on Android via LiteRT -- 2026-02-17
  18. Deploying Open WebUI + vLLM on Amazon EKS -- 2026-02-17
  19. [Editorial] https://forge-quality.dev/articles/case-of-passing-tests-investigation -- 2026-02-02
  20. [Editorial] https://www.linkedin.com/posts/ownyourai_deepseek-just-released-the-first-vision-ai-activity-7421818927657385987-V1yo -- 2026-01-27
  21. Unsloth announces support for finetuning embedding models -- 2026-01-27
  22. matrixhub-ai/matrixhub -- 2026-01-14
  23. HM-RunningHub/ComfyUI_RH_DreamID-V -- 2026-01-14
  24. YouTube has removed the ability to search by upload date -- 2026-01-14
  25. Tried this open-source framework for LLM fine-tuning over UI -- 2025-12-12
  26. Golang optimizations for high‑volume services -- 2025-12-12
  27. [Editorial] https://www.linkedin.com/posts/stuart-winter-tear_assessing-llms-for-serendipity-discovery-activity-7396596796938153984-JY9u -- 2025-12-12
  28. Deprecations via warnings don't work for Python libraries -- 2025-12-11
  29. The "Confident Idiot" Problem: Why LLM-as-a-Judge fails in production. -- 2025-12-10
  30. Toyota unintended acceleration and the big bowl of "spaghetti" code (2013) -- 2025-12-09
  31. Free yourself from the Spotify desktop client with spotifyd -- 2025-12-04
  32. Llamacpp Parameters Tuning -- 2025-12-02
  33. [Editorial] https://huggingface.co/blog/grimjim/norm-preserving-biprojected-abliteration -- 2025-12-02
  34. Pong Gets the Boot -- 2025-11-26
  35. Building the largest known Kubernetes cluster, with 130k nodes -- 2025-11-26
  36. The Qtile Window Manager: A Python-Powered Tiling Experience -- 2025-11-25
  37. Most Stable Raspberry Pi? Better NTP with Thermal Management -- 2025-11-25
  38. Quantum physicists have shrunk and "de-censored" DeepSeek R1 -- 2025-11-20
  39. Gain 60% performance on RDNA 4 using this fix -- 2025-11-19
  40. Scale-out is the silent killer of LLM applications. Are we solving the wrong problem? -- 2025-11-19
  41. [Editorial] https://brianhorakh.medium.com/just-mcp-to-reduce-context-waste-in-spec-driven-development-3935922da5cf -- 2025-11-18
  42. [AutoBE] Qwen3-80B suddenly wrote doomsday AI mythology while generating a TODO app -- 2025-11-18
  43. My trick for better Claude Code collaboration: CLAUDE.md with conditional loading -- 2025-11-18
  44. A proper way to connect a local LLM to iMessage? -- 2025-11-13
  45. How do I level up from normie to normie pro with Claude -- 2025-11-13
  46. POC: Model Context Protocol integration for native Ollama app -- 2025-11-12
  47. Skills are in a weird middle ground between RAG and Custom GPTs, and I think that's why they feel so awkward -- 2025-11-12
  48. Native LLM Router Integration with Cost Transparency for OpenWebUI -- 2025-11-12
  49. Last week in Multimodal AI - Local Edition -- 2025-11-12
  50. DeepSeek-OCR GGUF model runs great locally - simple and fast -- 2025-11-12
  51. Qwen3-VL works really good with Zoom-in Tool -- 2025-11-12
  52. lightonai/LightOnOCR-1B-1025 -- 2025-11-12
  53. Qwen/Qwen3-VL-2B-Thinking -- 2025-11-12
  54. [Editorial] https://www.linkedin.com/posts/daniel-cuthbert0x_a-month-ago-gadi-evron-and-i-set-about-building-ugcPost-7393643597729845248-TSTD -- 2025-11-11
  55. Breakdown of New RunC Vulnerabilities -- 2025-11-11
  56. [Editorial] https://www.linkedin.com/posts/andriyburkov_this-paper-shows-a-27-million-parameter-model-activity-7393432619365052416-SFLO -- 2025-11-10
  57. Trajectory Distillation for Foundation Models -- 2025-11-10
  58. sail-sg/Precision-RL -- 2025-11-10
  59. inclusionAI/LLaDA2.0-flash-preview -- 2025-11-10
  60. Finetuning DeepSeek 671B locally with only 80GB VRAM and Server CPU -- 2025-11-07
  61. IPEX-LLM llama.cpp portable GPU and NPU working really well on laptop -- 2025-11-07
  62. Building a PV Solar-Powered Quadcopter -- 2025-11-07
  63. Adding a RTX 5080 into a 2U server with OcuLink -- 2025-11-06
  64. Why does Image Recognition work in llama-server but not through Open WebUI? -- 2025-11-06
  65. 2025 Component Abuse Challenge: A Piezo Disk Powers A Transmitter -- 2025-11-06
  66. [Editorial] Does the EU know that there are many countries outside of the EU that do not care at all about their -- 2025-11-03
  67. Ilya Sutskever's deposition reveals previously unknown details [pdf] -- 2025-11-03
  68. CISA and NSA share tips on securing Microsoft Exchange servers -- 2025-11-02
  69. The Smol Training Playbook: The Secrets to Building World-Class LLMs -- 2025-11-02
  70. Latest Update from Anthropic's new model - Neptune V6 -- 2025-11-02
  71. AI "Phone Farm" Startup Gets Funding from Marc Andreessen to Flood Social Media With Spam -- 2025-11-02
  72. Minimax-M2 cracks top 10 overall LLMs (production LLM performance gap shrinking: 7 points from GPT-5 in Artificial Analysis benchmark) -- 2025-11-01
  73. 🚨 OpenAI Gives Microsoft 27% Stake, Completes For-Profit Shift -- 2025-11-01
  74. FlashPack: High-throughput tensor loading for PyTorch -- 2025-11-01
  75. Kafka is Fast – I'll use Postgres -- 2025-11-01
  76. Analog Surround Sound Was Everywhere, But You Probably Didn’t Notice -- 2025-11-01
  77. Optimizing gpt-oss-120B on AMD RX 6900 XT 16GB: Achieving 19 tokens/sec -- 2025-10-31
  78. Flamingo 3 released in safetensors -- 2025-10-31
  79. Jeep Issues Emergency Recall for OTA-Bricked Wrangler 4xes -- 2025-10-29
  80. queenkiley/AI-Art-Generator -- 2025-10-26
  81. Unlock the power of images with AI Sheets -- 2025-10-26
  82. Open WebUI Context Menu -- 2025-10-26
  83. [Editorial] Browsers you can socially engineer -- 2025-10-24
  84. Update on Plans for Privacy Sandbox Technologies -- 2025-10-24
  85. PlayDiffusion finetune for audio inpainting non-verbal tags -- 2025-10-21
  86. Nvidia has produced the first Blackwell wafer on US soil -- 2025-10-21
  87. Show HN: Syna – Minimal ML and RL Framework Built from Scratch with NumPy -- 2025-10-21
  88. C Project Turns Into Full-Fledged OS -- 2025-10-21
  89. [Editorial] Agentic Orchestration -- 2025-10-19
  90. Chicken Squisher 3000: Squish-Proof Security -- 2025-10-19
  91. [Editorial] Claude Skills are awesome, maybe a bigger deal than MCP -- 2025-10-18
  92. [Editorial] Claude Skills -- 2025-10-18
  93. Copy-and-Patch: A Copy-and-Patch Tutorial -- 2025-10-17
  94. We built 3B and 8B models that rival GPT-5 at HTML extraction while costing 40-80x less - fully open source -- 2025-10-17
  95. Comparing Popular AI Evaluation Platforms for 2025 -- 2025-10-17
  96. State of AI Report 2025 -- 2025-10-17
  97. Nvidia breakthrough gives 4-bit pretraining technique the accuracy of FP8 -- 2025-10-15
  98. AI assisted suite - Doubt about n_gpu layer test -- 2025-10-15
  99. ibm-granite/granite-4.0-h-micro -- 2025-10-15
  100. Qwen/Qwen3-VL-235B-A22B-Instruct -- 2025-10-15
  101. Get your VLM running in 3 simple steps on Intel CPUs -- 2025-10-15
  102. A 5-minute, no-BS way to pick a local model for your real task -- 2025-10-14
  103. [Update] CodeLens.AI - Crowdsourced AI Leaderboard 3 Days Later: Blind Voting and What We Learned -- 2025-10-14
  104. ZephrFish/OmniProx -- 2025-10-14
  105. Preference optimization with ORPO and LoRA -- 2025-10-12
  106. [Show] SpiralTorch: A Rust-based PyTorch-style autograd engine (Python 3.14-ready) -- 2025-10-12
  107. 2G Gone? Bring It Back Yourself! -- 2025-10-12
  108. [Editorial] https://www.anthropic.com/research/small-samples-poison -- 2025-10-11
  109. [Editorial] https://www.linkedin.com/pulse/from-chatbot-operating-system-what-openais-next-move-means-leimer-ju18c -- 2025-10-11
  110. Rubygems.org AWS Root Access Event – September 2025 -- 2025-10-11
  111. Stop flexing Pass@N — show Pass-all-N -- 2025-10-11
  112. Architecting a project for optimal AI coding, any tips? -- 2025-10-11
  113. Basekick-Labs/arc -- 2025-10-11
  114. ServiceNow-AI/Apriel-1.5-15b-Thinker -- 2025-10-11
  115. meituan-longcat/LongCat-Flash-Chat -- 2025-10-11
  116. Did anyone try out GLM-4.5-Air-GLM-4.6-Distill ? -- 2025-10-10
  117. Thank you Anthropic & this community! Our little side project just hit 1M visits and even made it on National TV! -- 2025-10-10
  118. Sharing my free tool for easy handwritten fine-tuning datasets! -- 2025-10-09
  119. vllm setup for nvidia (can use llama) -- 2025-10-05
  120. Full-fine tuning doesn't require much vRAM with gradient checkpointing... -- 2025-10-05
  121. Qwen/Qwen3-Omni-30B-A3B-Thinking -- 2025-10-05
  122. inclusionAI/Ring-mini-linear-2.0 -- 2025-10-05
  123. llama.cpp: Quantizing from bf16 vs f16 -- 2025-10-05
  124. GLM 4.6 is nice -- 2025-10-04
  125. NVFP4 or MXFP4 MOE on sm120 (RTX 5900 RTX 6000 PRO) -- 2025-10-04
  126. Ask Hackaday: How Do You Distro Hop? -- 2025-09-30
  127. Uncensor Qwen3 models without retraining -- 2025-09-20
  128. Depth upscaling? -- 2025-09-20
  129. Definitive proof openai/gpt-oss-20b is dumb as hell -- 2025-09-19
  130. Free 10%+ Speedup for CPU/Hybrid Inference on Intel CPUs with Efficiency Cores -- 2025-09-17
  131. Claude Performance Report with Workarounds - September 7 to September 14 -- 2025-09-16
  132. PSA/RFC: KV Cache quantization forces excess processing onto CPU in llama.cpp -- 2025-09-15
  133. native tool calling support for DeepSeek V3.1 just merged in llama.cpp -- 2025-09-15
  134. model : add grok-2 support by CISC · Pull Request #15539 · ggml-org/llama.cpp -- 2025-09-15
  135. An Afternoon at the Recursive Café: Two Threads Interleaving -- 2025-09-14
  136. blacktop/go-hypervisor -- 2025-09-14
  137. TSYJ-He/AutoEnvForge -- 2025-09-14
  138. Hackaday Links: September 7, 2025 -- 2025-09-14
  139. Any idea how to use ollama (debian) with 2x GPUs to load larger models? -- 2025-09-14
  140. Rails on SQLite: new ways to cause outages -- 2025-09-14
  141. Qwen3-Coder-480B Q2_K_XL same speed as Qwen3-235b-instruct Q3_K_XL WHY? -- 2025-09-09
  142. Renting GPUs is hilariously cheap -- 2025-09-09
  143. Ex-Miner Turned Local LLM Enthusiast, now I have a Dilemma -- 2025-09-09
  144. Tencent-Hunyuan/HunyuanWorld-Voyager -- 2025-09-09
  145. How the “Kim” dump exposed North Korea's credential theft playbook -- 2025-09-09
  146. Further Adventures in Colorimeter Hacking -- 2025-09-09
  147. 🌟Introducing Art-0-8B: Reasoning the way you want it to with Adaptive Thinking🌟 -- 2025-09-02
  148. Fine Tune Model for Home Assistant? -- 2025-09-02
  149. [Editorial] Claude Code - massive issues -- 2025-09-01
  150. TheDrummer is on fire!!! -- 2025-09-01
  151. NVIDIA-Nemotron-Nano-12B-v2 -- 2025-09-01
  152. Gpt-oss Fine-tuning - now with 60K context length and fits on <13GB VRAM -- 2025-08-30
  153. A PLL For Perfect Pitch -- 2025-08-27
  154. Why are users still able to edit system prompts or memories even after disabling it? -- 2025-08-20
  155. Making your prompts better with GEPA-Lite using Ollama! -- 2025-08-15
  156. Optimizing OpenWebUI's speed through indexing (using PostgreSQL as a back-end) -- 2025-08-11
  157. PSA: DuckDuckGo search in OWUI routes to non-privacy friendly providers like Bing, Google, and Yahoo. -- 2025-08-11
  158. Open-webui Tools for Firewalla -- 2025-08-11
  159. Local RAG with 97% smaller index and Claude Code–compatible semantic search -- 2025-08-10
  160. Trump Announces 100% Tariff on Semiconductors, unless made in US -- 2025-08-08
  161. The Tape Speed Keyboard -- 2025-08-08
  162. My first finetune: Gemma 3 4B unslop via GRPO -- 2025-08-02
  163. Supervised Fine Tuning on Curated Data is Reinforcement Learning -- 2025-08-02
  164. Debugging the Pixel 8 kernel via KGDB -- 2025-07-31
  165. Ollama + Open WebUI -- is there a way for the same query to run through the same model multiple times (could be 3 times, could be 100 times), then gather all the answers together to summarise/count? -- 2025-07-25
  166. Localllama’s (first?) IFTA - I’ll Fine-Tune Anything -- 2025-07-20
  167. fsndzomga/metadspy -- 2025-07-10
  168. osmosis-ai/Osmosis-Apply-1.7B -- 2025-07-10
  169. IntervitensInc/pangu-pro-moe-model -- 2025-07-10
  170. Continual Gradient Low-Rank Projection Fine-Tuning for LLMs -- 2025-07-10
  171. Creating custom kernels for the AMD MI300 -- 2025-07-10
  172. i made a commit message generator that can be used offline and for free -- 2025-07-05
  173. THUDM/GLM-4.1V-9B-Thinking -- 2025-07-05
  174. baidu/ERNIE-4.5-VL-424B-A47B-Base-Paddle -- 2025-07-05
  175. LoRA Fine-Tuning Without GPUs: A CPU-Efficient Meta-Generation Framework for LLMs -- 2025-07-05
  176. trycua/cua -- 2025-06-30
  177. ml0-1337/claude-gate -- 2025-06-30
  178. mstrYoda/go-arctest -- 2025-06-30
  179. Accelerating Docker Builds by Halving EC2 Boot Time -- 2025-06-30
  180. Advanced Time Manipulation with GDB -- 2025-06-21
  181. Practical SDR: Getting started with software-defined radio -- 2025-06-21
  182. liaotxcn/Probabilistic-Filters -- 2025-06-19
  183. flohoss/gocron -- 2025-06-19
  184. Lessons from Mixing Rust and Java: Fast, Safe, and Practical -- 2025-06-19
  185. 100 prisoners and a lightbulb -- looking back -- 2025-06-19
  186. Paper2Poster/Paper2Poster -- 2025-06-10
  187. Octoberfest7/zip_smuggling -- 2025-05-30
  188. Silencing Firefox's Chattiness for Web App Testing -- 2025-05-30