Fine-Tuning
LoRA, RLHF, GRPO, model adaptation, training techniques
188 articles across 72 editions
Articles
- [Editorial] -- 2026-02-24
- Anthropic Accuses DeepSeek, Moonshot AI, and MiniMax of Creating 24,000 Fake Claude Accounts -- 2026-02-24
- Gemini 3.1 Pro -- 2026-02-23
- 15 years of FP64 segmentation, and why the Blackwell Ultra breaks the pattern -- 2026-02-20
- Nvidia and OpenAI abandon unfinished $100B deal in favour of $30B investment -- 2026-02-20
- Google releases Gemini 3.1 Pro with Benchmarks -- 2026-02-20
- praetorian-inc/brutus -- 2026-02-19
- [Editorial] Unicornscan Getting Started -- 2026-02-19
- [Editorial] Unicornscan Alicorn -- 2026-02-19
- What Your Bluetooth Devices Reveal About You -- 2026-02-19
- [Editorial] When everyone can build software, who learns well? -- 2026-02-19
- Sonnet 4.6 feels like Opus 4.5 at Sonnet pricing -- 2026-02-19
- Anthropic Raises $30,000,000,000 As Run-Rate Revenue Grew 10x Annually Over Three Years -- 2026-02-19
- REASONING AUGMENTED RETRIEVAL (RAR) is the production-grade successor to single-pass RAG -- 2026-02-19
- Qwen Released Qwen 3.5 397B and Qwen 3.5 Plus! -- 2026-02-17
- Qwen3.5 NVFP4 (Blackwell) is up! -- 2026-02-17
- Running Gemma 3n E2B natively on Android via LiteRT -- 2026-02-17
- Deploying Open WebUI + vLLM on Amazon EKS -- 2026-02-17
- [Editorial] https://forge-quality.dev/articles/case-of-passing-tests-investigation -- 2026-02-02
- [Editorial] https://www.linkedin.com/posts/ownyourai_deepseek-just-released-the-first-vision-ai-activity-7421818927657385987-V1yo -- 2026-01-27
- Unsloth announces support for finetuning embedding models -- 2026-01-27
- matrixhub-ai/matrixhub -- 2026-01-14
- HM-RunningHub/ComfyUI_RH_DreamID-V -- 2026-01-14
- YouTube has removed the ability to search by upload date -- 2026-01-14
- Tried this open-source framework for LLM fine-tuning over UI -- 2025-12-12
- Golang optimizations for high‑volume services -- 2025-12-12
- [Editorial] https://www.linkedin.com/posts/stuart-winter-tear_assessing-llms-for-serendipity-discovery-activity-7396596796938153984-JY9u -- 2025-12-12
- Deprecations via warnings don't work for Python libraries -- 2025-12-11
- The "Confident Idiot" Problem: Why LLM-as-a-Judge fails in production. -- 2025-12-10
- Toyota unintended acceleration and the big bowl of "spaghetti" code (2013) -- 2025-12-09
- Free yourself from the Spotify desktop client with spotifyd -- 2025-12-04
- Llamacpp Parameters Tuning -- 2025-12-02
- [Editorial] https://huggingface.co/blog/grimjim/norm-preserving-biprojected-abliteration -- 2025-12-02
- Pong Gets the Boot -- 2025-11-26
- Building the largest known Kubernetes cluster, with 130k nodes -- 2025-11-26
- The Qtile Window Manager: A Python-Powered Tiling Experience -- 2025-11-25
- Most Stable Raspberry Pi? Better NTP with Thermal Management -- 2025-11-25
- Quantum physicists have shrunk and "de-censored" DeepSeek R1 -- 2025-11-20
- Gain 60% performance on RDNA 4 using this fix -- 2025-11-19
- Scale-out is the silent killer of LLM applications. Are we solving the wrong problem? -- 2025-11-19
- [Editorial] https://brianhorakh.medium.com/just-mcp-to-reduce-context-waste-in-spec-driven-development-3935922da5cf -- 2025-11-18
- [AutoBE] Qwen3-80B suddenly wrote doomsday AI mythology while generating a TODO app -- 2025-11-18
- My trick for better Claude Code collaboration: CLAUDE.md with conditional loading -- 2025-11-18
- A proper way to connect a local LLM to iMessage? -- 2025-11-13
- How do I level up from normie to normie pro with Claude -- 2025-11-13
- POC: Model Context Protocol integration for native Ollama app -- 2025-11-12
- Skills are in a weird middle ground between RAG and Custom GPTs, and I think that's why they feel so awkward -- 2025-11-12
- Native LLM Router Integration with Cost Transparency for OpenWebUI -- 2025-11-12
- Last week in Multimodal AI - Local Edition -- 2025-11-12
- DeepSeek-OCR GGUF model runs great locally - simple and fast -- 2025-11-12
- Qwen3-VL works really good with Zoom-in Tool -- 2025-11-12
- lightonai/LightOnOCR-1B-1025 -- 2025-11-12
- Qwen/Qwen3-VL-2B-Thinking -- 2025-11-12
- [Editorial] https://www.linkedin.com/posts/daniel-cuthbert0x_a-month-ago-gadi-evron-and-i-set-about-building-ugcPost-7393643597729845248-TSTD -- 2025-11-11
- Breakdown of New RunC Vulnerabilities -- 2025-11-11
- [Editorial] https://www.linkedin.com/posts/andriyburkov_this-paper-shows-a-27-million-parameter-model-activity-7393432619365052416-SFLO -- 2025-11-10
- Trajectory Distillation for Foundation Models -- 2025-11-10
- sail-sg/Precision-RL -- 2025-11-10
- inclusionAI/LLaDA2.0-flash-preview -- 2025-11-10
- Finetuning DeepSeek 671B locally with only 80GB VRAM and Server CPU -- 2025-11-07
- IPEX-LLM llama.cpp portable GPU and NPU working really well on laptop -- 2025-11-07
- Building a PV Solar-Powered Quadcopter -- 2025-11-07
- Adding a RTX 5080 into a 2U server with OcuLink -- 2025-11-06
- Why does Image Recognition work in llama-server but not through Open WebUI? -- 2025-11-06
- 2025 Component Abuse Challenge: A Piezo Disk Powers A Transmitter -- 2025-11-06
- [Editorial] Does the EU know that there are many countries outside of the EU that do not care at all about their -- 2025-11-03
- Ilya Sutskever's deposition reveals previously unknown details [pdf] -- 2025-11-03
- CISA and NSA share tips on securing Microsoft Exchange servers -- 2025-11-02
- The Smol Training Playbook: The Secrets to Building World-Class LLMs -- 2025-11-02
- Latest Update from Anthropic's new model - Neptune V6 -- 2025-11-02
- AI "Phone Farm" Startup Gets Funding from Marc Andreessen to Flood Social Media With Spam -- 2025-11-02
- Minimax-M2 cracks top 10 overall LLMs (production LLM performance gap shrinking: 7 points from GPT-5 in Artificial Analysis benchmark) -- 2025-11-01
- 🚨 OpenAI Gives Microsoft 27% Stake, Completes For-Profit Shift -- 2025-11-01
- FlashPack: High-throughput tensor loading for PyTorch -- 2025-11-01
- Kafka is Fast – I'll use Postgres -- 2025-11-01
- Analog Surround Sound Was Everywhere, But You Probably Didn’t Notice -- 2025-11-01
- Optimizing gpt-oss-120B on AMD RX 6900 XT 16GB: Achieving 19 tokens/sec -- 2025-10-31
- Flamingo 3 released in safetensors -- 2025-10-31
- Jeep Issues Emergency Recall for OTA-Bricked Wrangler 4xes -- 2025-10-29
- queenkiley/AI-Art-Generator -- 2025-10-26
- Unlock the power of images with AI Sheets -- 2025-10-26
- Open WebUI Context Menu -- 2025-10-26
- [Editorial] Browsers you can socially engineer -- 2025-10-24
- Update on Plans for Privacy Sandbox Technologies -- 2025-10-24
- PlayDiffusion finetune for audio inpainting non-verbal tags -- 2025-10-21
- Nvidia has produced the first Blackwell wafer on US soil -- 2025-10-21
- Show HN: Syna – Minimal ML and RL Framework Built from Scratch with NumPy -- 2025-10-21
- C Project Turns Into Full-Fledged OS -- 2025-10-21
- [Editorial] Agentic Orchestration -- 2025-10-19
- Chicken Squisher 3000: Squish-Proof Security -- 2025-10-19
- [Editorial] Claude Skills are awesome, maybe a bigger deal than MCP -- 2025-10-18
- [Editorial] Claude Skills -- 2025-10-18
- Copy-and-Patch: A Copy-and-Patch Tutorial -- 2025-10-17
- We built 3B and 8B models that rival GPT-5 at HTML extraction while costing 40-80x less - fully open source -- 2025-10-17
- Comparing Popular AI Evaluation Platforms for 2025 -- 2025-10-17
- State of AI Report 2025 -- 2025-10-17
- Nvidia breakthrough gives 4-bit pretraining technique the accuracy of FP8 -- 2025-10-15
- AI assisted suite - Doubt about n_gpu layer test -- 2025-10-15
- ibm-granite/granite-4.0-h-micro -- 2025-10-15
- Qwen/Qwen3-VL-235B-A22B-Instruct -- 2025-10-15
- Get your VLM running in 3 simple steps on Intel CPUs -- 2025-10-15
- A 5-minute, no-BS way to pick a local model for your real task -- 2025-10-14
- [Update] CodeLens.AI - Crowdsourced AI Leaderboard 3 Days Later: Blind Voting and What We Learned -- 2025-10-14
- ZephrFish/OmniProx -- 2025-10-14
- Preference optimization with ORPO and LoRA -- 2025-10-12
- [Show] SpiralTorch: A Rust-based PyTorch-style autograd engine (Python 3.14-ready) -- 2025-10-12
- 2G Gone? Bring It Back Yourself! -- 2025-10-12
- [Editorial] https://www.anthropic.com/research/small-samples-poison -- 2025-10-11
- [Editorial] https://www.linkedin.com/pulse/from-chatbot-operating-system-what-openais-next-move-means-leimer-ju18c -- 2025-10-11
- Rubygems.org AWS Root Access Event – September 2025 -- 2025-10-11
- Stop flexing Pass@N — show Pass-all-N -- 2025-10-11
- Architecting a project for optimal AI coding, any tips? -- 2025-10-11
- Basekick-Labs/arc -- 2025-10-11
- ServiceNow-AI/Apriel-1.5-15b-Thinker -- 2025-10-11
- meituan-longcat/LongCat-Flash-Chat -- 2025-10-11
- Did anyone try out GLM-4.5-Air-GLM-4.6-Distill ? -- 2025-10-10
- Thank you Anthropic & this community! Our little side project just hit 1M visits and even made it on National TV! -- 2025-10-10
- Sharing my free tool for easy handwritten fine-tuning datasets! -- 2025-10-09
- vllm setup for nvidia (can use llama) -- 2025-10-05
- Full-fine tuning doesn't require much vRAM with gradient checkpointing... -- 2025-10-05
- Qwen/Qwen3-Omni-30B-A3B-Thinking -- 2025-10-05
- inclusionAI/Ring-mini-linear-2.0 -- 2025-10-05
- llama.cpp: Quantizing from bf16 vs f16 -- 2025-10-05
- GLM 4.6 is nice -- 2025-10-04
- NVFP4 or MXFP4 MOE on sm120 (RTX 5900 RTX 6000 PRO) -- 2025-10-04
- Ask Hackaday: How Do You Distro Hop? -- 2025-09-30
- Uncensor Qwen3 models without retraining -- 2025-09-20
- Depth upscaling? -- 2025-09-20
- Definitive proof openai/gpt-oss-20b is dumb as hell -- 2025-09-19
- Free 10%+ Speedup for CPU/Hybrid Inference on Intel CPUs with Efficiency Cores -- 2025-09-17
- Claude Performance Report with Workarounds - September 7 to September 14 -- 2025-09-16
- PSA/RFC: KV Cache quantization forces excess processing onto CPU in llama.cpp -- 2025-09-15
- native tool calling support for DeepSeek V3.1 just merged in llama.cpp -- 2025-09-15
- model : add grok-2 support by CISC · Pull Request #15539 · ggml-org/llama.cpp -- 2025-09-15
- An Afternoon at the Recursive Café: Two Threads Interleaving -- 2025-09-14
- blacktop/go-hypervisor -- 2025-09-14
- TSYJ-He/AutoEnvForge -- 2025-09-14
- Hackaday Links: September 7, 2025 -- 2025-09-14
- Any idea how to use ollama (debian) with 2x GPUs to load larger models? -- 2025-09-14
- Rails on SQLite: new ways to cause outages -- 2025-09-14
- Qwen3-Coder-480B Q2_K_XL same speed as Qwen3-235b-instruct Q3_K_XL WHY? -- 2025-09-09
- Renting GPUs is hilariously cheap -- 2025-09-09
- Ex-Miner Turned Local LLM Enthusiast, now I have a Dilemma -- 2025-09-09
- Tencent-Hunyuan/HunyuanWorld-Voyager -- 2025-09-09
- How the “Kim” dump exposed North Korea's credential theft playbook -- 2025-09-09
- Further Adventures in Colorimeter Hacking -- 2025-09-09
- 🌟Introducing Art-0-8B: Reasoning the way you want it to with Adaptive Thinking🌟 -- 2025-09-02
- Fine Tune Model for Home Assistant? -- 2025-09-02
- [Editorial] Claude Code - massive issues -- 2025-09-01
- TheDrummer is on fire!!! -- 2025-09-01
- NVIDIA-Nemotron-Nano-12B-v2 -- 2025-09-01
- Gpt-oss Fine-tuning - now with 60K context length and fits on <13GB VRAM -- 2025-08-30
- A PLL For Perfect Pitch -- 2025-08-27
- Why are users still able to edit system prompts or memories even after disabling it? -- 2025-08-20
- Making your prompts better with GEPA-Lite using Ollama! -- 2025-08-15
- Optimizing OpenWebUI's speed through indexing (using PostgreSQL as a back-end) -- 2025-08-11
- PSA: DuckDuckGo search in OWUI routes to non-privacy friendly providers like Bing, Google, and Yahoo. -- 2025-08-11
- Open-webui Tools for Firewalla -- 2025-08-11
- Local RAG with 97% smaller index and Claude Code–compatible semantic search -- 2025-08-10
- Trump Announces 100% Tariff on Semiconductors, unless made in US -- 2025-08-08
- The Tape Speed Keyboard -- 2025-08-08
- My first finetune: Gemma 3 4B unslop via GRPO -- 2025-08-02
- Supervised Fine Tuning on Curated Data is Reinforcement Learning -- 2025-08-02
- Debugging the Pixel 8 kernel via KGDB -- 2025-07-31
- Ollama + Open WebUI -- is there a way for the same query to run through the same model multiple times (could be 3 times, could be 100 times), then gather all the answers together to summarise/count? -- 2025-07-25
- Localllama’s (first?) IFTA - I’ll Fine-Tune Anything -- 2025-07-20
- fsndzomga/metadspy -- 2025-07-10
- osmosis-ai/Osmosis-Apply-1.7B -- 2025-07-10
- IntervitensInc/pangu-pro-moe-model -- 2025-07-10
- Continual Gradient Low-Rank Projection Fine-Tuning for LLMs -- 2025-07-10
- Creating custom kernels for the AMD MI300 -- 2025-07-10
- i made a commit message generator that can be used offline and for free -- 2025-07-05
- THUDM/GLM-4.1V-9B-Thinking -- 2025-07-05
- baidu/ERNIE-4.5-VL-424B-A47B-Base-Paddle -- 2025-07-05
- LoRA Fine-Tuning Without GPUs: A CPU-Efficient Meta-Generation Framework for LLMs -- 2025-07-05
- trycua/cua -- 2025-06-30
- ml0-1337/claude-gate -- 2025-06-30
- mstrYoda/go-arctest -- 2025-06-30
- Accelerating Docker Builds by Halving EC2 Boot Time -- 2025-06-30
- Advanced Time Manipulation with GDB -- 2025-06-21
- Practical SDR: Getting started with software-defined radio -- 2025-06-21
- liaotxcn/Probabilistic-Filters -- 2025-06-19
- flohoss/gocron -- 2025-06-19
- Lessons from Mixing Rust and Java: Fast, Safe, and Practical -- 2025-06-19
- 100 prisoners and a lightbulb -- looking back -- 2025-06-19
- Paper2Poster/Paper2Poster -- 2025-06-10
- Octoberfest7/zip_smuggling -- 2025-05-30
- Silencing Firefox's Chattiness for Web App Testing -- 2025-05-30