Fine-Tuning

LoRA, RLHF, GRPO, model adaptation, training techniques

219 articles across 82 editions

Articles

  1. LLM Novice Uplift on Dual-Use Biology Tasks — 4x Accuracy Boost Bypasses Safeguards -- 2026-04-10
  2. [Editorial] Your AI Is Developing Capabilities Nobody Tested -- 2026-04-10
  3. The current state of the Chinese LLMs scene -- 2026-03-26
  4. Alibaba confirms they are committed to continuously open-sourcing new Qwen and Wan models -- 2026-03-26
  5. Cursor's Composer 2 apparently built on Kimi K2.5 without attribution -- 2026-03-26
  6. Nemotron Cascade 2 30B A3B -- 2026-03-26
  7. NVIDIA 2026 Conference LIVE. New Base model coming! -- 2026-03-20
  8. Nemotron 3 Nano 4B: A Compact Hybrid Model for Efficient Local AI -- 2026-03-20
  9. Granite 4.0 1B Speech: Compact, Multilingual, and Built for the Edge -- 2026-03-20
  10. [New Model & Agent] LocoTrainer-4B: A Claude Code-style local agent designed specifically to master the MS-SWIFT framework (4B, 32K, GGUF) -- 2026-03-20
  11. shallowdream204/BitDance-14B-16x -- 2026-03-20
  12. [Editorial] Karpathy autoresearch -- 2026-03-10
  13. [Editorial] Doc-to-LoRA -- 2026-03-10
  14. GPT-5.4 -- 2026-03-09
  15. [Editorial] -- 2026-03-09
  16. YuanLabAI/Yuan3.0-Ultra: 1010B MoE, fully open weights -- 2026-03-05
  17. We could be hours (or less than a week) away from true NVFP4 support in Llama.cpp GGUF format -- 2026-03-05
  18. Step-3.5-Flash-Base & Midtrain (in case you missed them) -- 2026-03-05
  19. Qwen3.5-9B Uncensored Aggressive Release (GGUF) -- 2026-03-05
  20. unknown -- 2026-03-05
  21. [Editorial] David Maynor Security Gist -- 2026-03-04
  22. [Editorial] arXiv:2602.23093 -- 2026-03-04
  23. unpromptedcon.org -- 2026-03-04
  24. Inside the M4 Apple Neural Engine, Part 1: Reverse Engineering -- 2026-03-03
  25. Hydroph0bia – fixed SecureBoot bypass for UEFI firmware from Insyde H2O (2025) -- 2026-03-03
  26. [Editorial] -- 2026-02-26
  27. [Editorial] -- 2026-02-26
  28. [Editorial] -- 2026-02-26
  29. Free ASIC Llama 3.1 8B inference at 16,000 tok/s - no, not a joke -- 2026-02-25
  30. [Editorial] Cognitum -- 2026-02-25
  31. Hetzner Prices increase 30-40% -- 2026-02-25
  32. [Editorial] -- 2026-02-24
  33. Anthropic Accuses DeepSeek, Moonshot AI, and MiniMax of Creating 24,000 Fake Claude Accounts -- 2026-02-24
  34. Gemini 3.1 Pro -- 2026-02-23
  35. 15 years of FP64 segmentation, and why the Blackwell Ultra breaks the pattern -- 2026-02-20
  36. Nvidia and OpenAI abandon unfinished $100B deal in favour of $30B investment -- 2026-02-20
  37. Google releases Gemini 3.1 Pro with Benchmarks -- 2026-02-20
  38. praetorian-inc/brutus -- 2026-02-19
  39. [Editorial] Unicornscan Getting Started -- 2026-02-19
  40. [Editorial] Unicornscan Alicorn -- 2026-02-19
  41. What Your Bluetooth Devices Reveal About You -- 2026-02-19
  42. [Editorial] When everyone can build software, who learns well? -- 2026-02-19
  43. Sonnet 4.6 feels like Opus 4.5 at Sonnet pricing -- 2026-02-19
  44. Anthropic Raises $30,000,000,000 As Run-Rate Revenue Grew 10x Annually Over Three Years -- 2026-02-19
  45. REASONING AUGMENTED RETRIEVAL (RAR) is the production-grade successor to single-pass RAG -- 2026-02-19
  46. Qwen Released Qwen 3.5 397B and Qwen 3.5 Plus! -- 2026-02-17
  47. Qwen3.5 NVFP4 (Blackwell) is up! -- 2026-02-17
  48. Running Gemma 3n E2B natively on Android via LiteRT -- 2026-02-17
  49. Deploying Open WebUI + vLLM on Amazon EKS -- 2026-02-17
  50. [Editorial] https://forge-quality.dev/articles/case-of-passing-tests-investigation -- 2026-02-02
  51. [Editorial] https://www.linkedin.com/posts/ownyourai_deepseek-just-released-the-first-vision-ai-activity-7421818927657385987-V1yo -- 2026-01-27
  52. Unsloth announces support for finetuning embedding models -- 2026-01-27
  53. matrixhub-ai/matrixhub -- 2026-01-14
  54. HM-RunningHub/ComfyUI_RH_DreamID-V -- 2026-01-14
  55. YouTube has removed the ability to search by upload date -- 2026-01-14
  56. Tried this open-source framework for LLM fine-tuning over UI -- 2025-12-12
  57. Golang optimizations for high‑volume services -- 2025-12-12
  58. [Editorial] https://www.linkedin.com/posts/stuart-winter-tear_assessing-llms-for-serendipity-discovery-activity-7396596796938153984-JY9u -- 2025-12-12
  59. Deprecations via warnings don't work for Python libraries -- 2025-12-11
  60. The "Confident Idiot" Problem: Why LLM-as-a-Judge fails in production. -- 2025-12-10
  61. Toyota unintended acceleration and the big bowl of "spaghetti" code (2013) -- 2025-12-09
  62. Free yourself from the Spotify desktop client with spotifyd -- 2025-12-04
  63. Llamacpp Parameters Tuning -- 2025-12-02
  64. [Editorial] https://huggingface.co/blog/grimjim/norm-preserving-biprojected-abliteration -- 2025-12-02
  65. Pong Gets the Boot -- 2025-11-26
  66. Building the largest known Kubernetes cluster, with 130k nodes -- 2025-11-26
  67. The Qtile Window Manager: A Python-Powered Tiling Experience -- 2025-11-25
  68. Most Stable Raspberry Pi? Better NTP with Thermal Management -- 2025-11-25
  69. Quantum physicists have shrunk and "de-censored" DeepSeek R1 -- 2025-11-20
  70. Gain 60% performance on RDNA 4 using this fix -- 2025-11-19
  71. Scale-out is the silent killer of LLM applications. Are we solving the wrong problem? -- 2025-11-19
  72. [Editorial] https://brianhorakh.medium.com/just-mcp-to-reduce-context-waste-in-spec-driven-development-3935922da5cf -- 2025-11-18
  73. [AutoBE] Qwen3-80B suddenly wrote doomsday AI mythology while generating a TODO app -- 2025-11-18
  74. My trick for better Claude Code collaboration: CLAUDE.md with conditional loading -- 2025-11-18
  75. A proper way to connect a local LLM to iMessage? -- 2025-11-13
  76. How do I level up from normie to normie pro with Claude -- 2025-11-13
  77. POC: Model Context Protocol integration for native Ollama app -- 2025-11-12
  78. Skills are in a weird middle ground between RAG and Custom GPTs, and I think that's why they feel so awkward -- 2025-11-12
  79. Native LLM Router Integration with Cost Transparency for OpenWebUI -- 2025-11-12
  80. Last week in Multimodal AI - Local Edition -- 2025-11-12
  81. DeepSeek-OCR GGUF model runs great locally - simple and fast -- 2025-11-12
  82. Qwen3-VL works really good with Zoom-in Tool -- 2025-11-12
  83. lightonai/LightOnOCR-1B-1025 -- 2025-11-12
  84. Qwen/Qwen3-VL-2B-Thinking -- 2025-11-12
  85. [Editorial] https://www.linkedin.com/posts/daniel-cuthbert0x_a-month-ago-gadi-evron-and-i-set-about-building-ugcPost-7393643597729845248-TSTD -- 2025-11-11
  86. Breakdown of New RunC Vulnerabilities -- 2025-11-11
  87. [Editorial] https://www.linkedin.com/posts/andriyburkov_this-paper-shows-a-27-million-parameter-model-activity-7393432619365052416-SFLO -- 2025-11-10
  88. Trajectory Distillation for Foundation Models -- 2025-11-10
  89. sail-sg/Precision-RL -- 2025-11-10
  90. inclusionAI/LLaDA2.0-flash-preview -- 2025-11-10
  91. Finetuning DeepSeek 671B locally with only 80GB VRAM and Server CPU -- 2025-11-07
  92. IPEX-LLM llama.cpp portable GPU and NPU working really well on laptop -- 2025-11-07
  93. Building a PV Solar-Powered Quadcopter -- 2025-11-07
  94. Adding a RTX 5080 into a 2U server with OcuLink -- 2025-11-06
  95. Why does Image Recognition work in llama-server but not through Open WebUI? -- 2025-11-06
  96. 2025 Component Abuse Challenge: A Piezo Disk Powers A Transmitter -- 2025-11-06
  97. [Editorial] Does the EU know that there are many countries outside of the EU that do not care at all about their -- 2025-11-03
  98. Ilya Sustkever's deposition reveals previously unknown details [pdf] -- 2025-11-03
  99. CISA and NSA share tips on securing Microsoft Exchange servers -- 2025-11-02
  100. The Smol Training Playbook: The Secrets to Building World-Class LLMs -- 2025-11-02
  101. Latest Update from Anthropic's new model - Neptune V6 -- 2025-11-02
  102. AI "Phone Farm" Startup Gets Funding from Marc Andreessen to Flood Social Media With Spam -- 2025-11-02
  103. Minimax-M2 cracks top 10 overall LLMs (production LLM performance gap shrinking: 7 points from GPT-5 in Artificial Analysis benchmark) -- 2025-11-01
  104. 🚨 OpenAI Gives Microsoft 27% Stake, Completes For-Profit Shift -- 2025-11-01
  105. FlashPack: High-throughput tensor loading for PyTorch -- 2025-11-01
  106. Kafka is Fast – I'll use Postgres -- 2025-11-01
  107. Analog Surround Sound Was Everywhere, But You Probably Didn’t Notice -- 2025-11-01
  108. Optimizing gpt-oss-120B on AMD RX 6900 XT 16GB: Achieving 19 tokens/sec -- 2025-10-31
  109. Flamingo 3 released in safetensors -- 2025-10-31
  110. Jeep Issues Emergency Recall for OTA-Bricked Wrangler 4xes -- 2025-10-29
  111. queenkiley/AI-Art-Generator -- 2025-10-26
  112. Unlock the power of images with AI Sheets -- 2025-10-26
  113. Open WebUI Context Menu -- 2025-10-26
  114. [Editorial] Browsers you can socially engineer -- 2025-10-24
  115. Update on Plans for Privacy Sandbox Technologies -- 2025-10-24
  116. PlayDiffusion finetune for audio inpainting non-verbal tags -- 2025-10-21
  117. Nvidia has produced the first Blackwell wafer on US soil -- 2025-10-21
  118. Show HN: Syna – Minimal ML and RL Framework Built from Scratch with NumPy -- 2025-10-21
  119. C Project Turns Into Full-Fledged OS -- 2025-10-21
  120. [Editorial] Agentic Orchestration -- 2025-10-19
  121. Chicken Squisher 3000: Squish-Proof Security -- 2025-10-19
  122. [Editorial] Claude Skills are awesome, maybe a bigger deal than MCP -- 2025-10-18
  123. [Editorial] Claude Skills -- 2025-10-18
  124. Copy-and-Patch: A Copy-and-Patch Tutorial -- 2025-10-17
  125. We built 3B and 8B models that rival GPT-5 at HTML extraction while costing 40-80x less - fully open source -- 2025-10-17
  126. Comparing Popular AI Evaluation Platforms for 2025 -- 2025-10-17
  127. State of AI Report 2025 -- 2025-10-17
  128. Nvidia breakthrough gives 4-bit pretraining technique the accuracy of FP8 -- 2025-10-15
  129. AI assisted suite - Doubt about n_gpu layer test -- 2025-10-15
  130. ibm-granite/granite-4.0-h-micro -- 2025-10-15
  131. Qwen/Qwen3-VL-235B-A22B-Instruct -- 2025-10-15
  132. Get your VLM running in 3 simple steps on Intel CPUs -- 2025-10-15
  133. A 5-minute, no-BS way to pick a local model for your real task -- 2025-10-14
  134. [Update] CodeLens.AI - Crowdsourced AI Leaderboard 3 Days Later: Blind Voting and What We Learned -- 2025-10-14
  135. ZephrFish/OmniProx -- 2025-10-14
  136. Preference optimization with ORPO and LoRA -- 2025-10-12
  137. [Show] SpiralTorch: A Rust-based PyTorch-style autograd engine (Python 3.14-ready) -- 2025-10-12
  138. 2G Gone? Bring It Back Yourself! -- 2025-10-12
  139. [Editorial] https://www.anthropic.com/research/small-samples-poison -- 2025-10-11
  140. [Editorial] https://www.linkedin.com/pulse/from-chatbot-operating-system-what-openais-next-move-means-leimer-ju18c -- 2025-10-11
  141. Rubygems.org AWS Root Access Event – September 2025 -- 2025-10-11
  142. Stop flexing Pass@N — show Pass-all-N -- 2025-10-11
  143. Architecting a project for optimal AI coding, any tips? -- 2025-10-11
  144. Basekick-Labs/arc -- 2025-10-11
  145. ServiceNow-AI/Apriel-1.5-15b-Thinker -- 2025-10-11
  146. meituan-longcat/LongCat-Flash-Chat -- 2025-10-11
  147. Did anyone try out GLM-4.5-Air-GLM-4.6-Distill ? -- 2025-10-10
  148. Thank you Anthropic & this community! Our little side project just hit 1M visits and even made it on National TV! -- 2025-10-10
  149. Sharing my free tool for easy handwritten fine-tuning datasets! -- 2025-10-09
  150. vllm setup for nvidia (can use llama) -- 2025-10-05
  151. Full-fine tuning doesn't require much vRAM with gradient checkpointing... -- 2025-10-05
  152. Qwen/Qwen3-Omni-30B-A3B-Thinking -- 2025-10-05
  153. inclusionAI/Ring-mini-linear-2.0 -- 2025-10-05
  154. llama.cpp: Quantizing from bf16 vs f16 -- 2025-10-05
  155. GLM 4.6 is nice -- 2025-10-04
  156. NVFP4 or MXFP4 MOE on sm120 (RTX 5900 RTX 6000 PRO) -- 2025-10-04
  157. Ask Hackaday: How Do You Distro Hop? -- 2025-09-30
  158. Uncensor Qwen3 models without retraining -- 2025-09-20
  159. Depth upscaling? -- 2025-09-20
  160. Definitive proof openai/gpt-oss-20b is dumb as hell -- 2025-09-19
  161. Free 10%+ Speedup for CPU/Hybrid Inference on Intel CPUs with Efficiency Cores -- 2025-09-17
  162. Claude Performance Report with Workarounds - September 7 to September 14 -- 2025-09-16
  163. PSA/RFC: KV Cache quantization forces excess processing onto CPU in llama.cpp -- 2025-09-15
  164. native tool calling support for DeepSeek V3.1 just merged in llama.cpp -- 2025-09-15
  165. model : add grok-2 support by CISC · Pull Request #15539 · ggml-org/llama.cpp -- 2025-09-15
  166. An Afternoon at the Recursive Café: Two Threads Interleaving -- 2025-09-14
  167. blacktop/go-hypervisor -- 2025-09-14
  168. TSYJ-He/AutoEnvForge -- 2025-09-14
  169. Hackaday Links: September 7, 2025 -- 2025-09-14
  170. Any idea how to use ollama (debian) with 2x GPUs to load larger models? -- 2025-09-14
  171. Rails on SQLite: new ways to cause outages -- 2025-09-14
  172. Qwen3-Coder-480B Q2_K_XL same speed as Qwen3-235b-instruct Q3_K_XL WHY? -- 2025-09-09
  173. Renting GPUs is hilariously cheap -- 2025-09-09
  174. Ex-Miner Turned Local LLM Enthusiast, now I have a Dilemma -- 2025-09-09
  175. Tencent-Hunyuan/HunyuanWorld-Voyager -- 2025-09-09
  176. How the “Kim” dump exposed North Korea's credential theft playbook -- 2025-09-09
  177. Further Adventures in Colorimeter Hacking -- 2025-09-09
  178. 🌟Introducing Art-0-8B: Reasoning the way you want it to with Adaptive Thinking🌟 -- 2025-09-02
  179. Fine Tune Model for Home Assistant? -- 2025-09-02
  180. [Editorial] Claude Code - massive issues -- 2025-09-01
  181. TheDrummer is on fire!!! -- 2025-09-01
  182. NVIDIA-Nemotron-Nano-12B-v2 -- 2025-09-01
  183. Gpt-oss Fine-tuning - now with 60K context length and fits on <13GB VRAM -- 2025-08-30
  184. A PLL For Perfect Pitch -- 2025-08-27
  185. Why are users still able to edit system prompts or memories even after disabling it? -- 2025-08-20
  186. Making your prompts better with GEPA-Lite using Ollama! -- 2025-08-15
  187. Optimizing OpenWebUI's speed through indexing (using PostgreSQL as a back-end) -- 2025-08-11
  188. PSA: DuckDuckGo search in OWUI routes to non-privacy friendly providers like Bing, Google, and Yahoo. -- 2025-08-11
  189. Open-webui Tools for Firewalla -- 2025-08-11
  190. Local RAG with 97% smaller index and Claude Code–compatible semantic search -- 2025-08-10
  191. Trump Announces 100% Tariff on Semiconductors, unless made in US -- 2025-08-08
  192. The Tape Speed Keyboard -- 2025-08-08
  193. My first finetune: Gemma 3 4B unslop via GRPO -- 2025-08-02
  194. Supervised Fine Tuning on Curated Data is Reinforcement Learning -- 2025-08-02
  195. Debugging the Pixel 8 kernel via KGDB -- 2025-07-31
  196. Ollama + Open WebUI -- is there a way for the same query to run through the same model multiple times (could be 3 times, could be 100 times), then gather all the answers together to summarise/count? -- 2025-07-25
  197. Localllama’s (first?) IFTA - I’ll Fine-Tune Anything -- 2025-07-20
  198. fsndzomga/metadspy -- 2025-07-10
  199. osmosis-ai/Osmosis-Apply-1.7B -- 2025-07-10
  200. IntervitensInc/pangu-pro-moe-model -- 2025-07-10
  201. Continual Gradient Low-Rank Projection Fine-Tuning for LLMs -- 2025-07-10
  202. Creating custom kernels for the AMD MI300 -- 2025-07-10
  203. i made a commit message generator that can be used offline and for free -- 2025-07-05
  204. THUDM/GLM-4.1V-9B-Thinking -- 2025-07-05
  205. baidu/ERNIE-4.5-VL-424B-A47B-Base-Paddle -- 2025-07-05
  206. LoRA Fine-Tuning Without GPUs: A CPU-Efficient Meta-Generation Framework for LLMs -- 2025-07-05
  207. trycua/cua -- 2025-06-30
  208. ml0-1337/claude-gate -- 2025-06-30
  209. mstrYoda/go-arctest -- 2025-06-30
  210. Accelerating Docker Builds by Halving EC2 Boot Time -- 2025-06-30
  211. Advanced Time Manipulation with GDB -- 2025-06-21
  212. Practical SDR: Getting started with software-defined radio -- 2025-06-21
  213. liaotxcn/Probabilistic-Filters -- 2025-06-19
  214. flohoss/gocron -- 2025-06-19
  215. Lessons from Mixing Rust and Java: Fast, Safe, and Practical -- 2025-06-19
  216. 100 prisoners and a lightbulb -- looking back -- 2025-06-19
  217. Paper2Poster/Paper2Poster -- 2025-06-10
  218. Octoberfest7/zip_smuggling -- 2025-05-30
  219. Silencing Firefox's Chattiness for Web App Testing -- 2025-05-30