Fine-Tuning

LoRA, RLHF, GRPO, model adaptation, training techniques

323 articles across 108 editions

Articles

The Underhanded C Contest -- 2026-07-03
Senior SWE-Bench: open-source benchmark that assesses agents as senior engineers -- 2026-07-02
New bench designed for smaller models: ObviousBench.com -- 2026-07-02
I built an autonomous dev pipeline and ran the same project head to head: a 27B local on a modded 4090, then again on cheap cloud LLMs -- 2026-07-02
Stdlib or Third-Party? Empirical Performance and Correctness of LLM-Assisted Zero-Dependency Python Libraries -- 2026-07-02
DeepSeek V4 official version launching mid-July -- 2026-06-30
OpenPangu-2.0-Flash: 92B MoE (6B active) on Ascend with 512K context -- 2026-06-30
Anthropic's Amodei: "Open Source models [could take us to] a very dangerous place." -- 2026-06-30
Even Google still believes in small models for coding — Gemma 4 31B hackathon at 1500 tok/s -- 2026-06-30
Previewing GPT-5.6 Sol: a next-generation model -- 2026-06-29
[Editorial] -- 2026-06-29
[Editorial] -- 2026-06-29
We built a calibration-aware Q4_K_M quant of Qwen3.5 0.8B that recovers 96.5% of the BF16 gap vs pure llama.cpp Q4_K_M (SpectralQuant) -- 2026-06-29
Update: First Manual Results from Testing Procedural Skill Transfer in Small Models -- 2026-06-29
Multi Tier MoE Caching -- 2026-06-29
GLM-5.2 is a step change for open agents -- 2026-06-25
poolside/Laguna-M.1 · Hugging Face - 225B-A23B -- 2026-06-25
Mimo 2.5 is _fast_ at large context (dual RTX Pro 6000) -- 2026-06-25
The Eagle(3) has landed (for Qwen) -- 2026-06-25
CPU-only TTS benchmark: Kokoro 82M vs Supertonic 3 vs Inflect-Nano-v1 (4.6M params), with UTMOS scoring on every sample -- 2026-06-25
[Editorial] Video Content -- 2026-06-16
[Editorial] Video Content -- 2026-06-16
[Editorial] Video Content -- 2026-06-16
[Editorial] StandardAgents Arrow-JS — JavaScript Agent Framework -- 2026-06-16
archex: Local-First Deterministic Code-Context for AI Agents — No API Key, No Telemetry (Apache 2.0) -- 2026-06-16
Ironsmith: Open Source macOS App That Creates macOS Apps From Prompts — Works With Local Models -- 2026-06-16
zengxiao-he/tessera -- 2026-06-11
Tencent-Hunyuan/UniRL -- 2026-06-11
SafeAdapt: Provably Safe Policy Updates in Deep Reinforcement Learning -- 2026-06-11
Physics-Grounded Multi-Agent Architecture for Traceable, Risk-Aware Human-AI Decision Support in Manufacturing -- 2026-06-11
Config Files That Run Code: Supply Chain Security Blindspot -- 2026-06-10
Surveillance is not safety: A statement on the UK's latest threat to privacy -- 2026-06-10
FrontierCode -- 2026-06-10
Introducing North Mini Code: Cohere's First Model For Developers -- 2026-06-10
[Editorial] jedArden/ARMOR -- 2026-06-05
m-sec-org/wafkiller -- 2026-06-05
zmn-hamid/sni-spoofing-scanner -- 2026-06-05
Everything at Every Scale: Scale-Invariant Diffusion with Continuous Super-Resolution -- 2026-06-04
Gaussian Point Splatting -- 2026-06-04
DaVinci Resolve 21 -- 2026-06-04
Profiling in PyTorch (Part 1): A Beginner's Guide to torch.profiler -- 2026-06-04
microsoft/harrier-oss-v1-270m -- 2026-06-04
CohereLabs/tiny-aya-global -- 2026-06-04
Use your Nvidia GPU's VRAM as swap space on Linux -- 2026-06-03
[Editorial] chipotlai-max -- 2026-06-03
[Editorial] Video Submission -- 2026-06-03
It is an amazing time for programmers -- 2026-06-03
A 10 year old Xeon is all you need -- 2026-06-02
Inferencing at 10.33 t/s on Qwen 3.5 35B on a $300 laptop -- 2026-06-02
OSC: Hardware Efficient W4A4 Quantization via Outlier Separation in Channel Dimension -- 2026-06-02
Lightning OPD: Efficient Post-Training for Large Reasoning Models with Offline On-Policy Distillation -- 2026-06-02
Qwen3.6 huge quality gain from Q4 to Q6 for coding agent -- 2026-06-02
1-Bit Bonsai Image 4B Image Generation for Local Devices -- 2026-06-01
Old Mac Pro still proving its worth -- 2026-06-01
Heterogeneous GPU Weighting & Layer Splitting -- 2026-06-01
I finally put my NPU (Intel Arrow Lake) to use doing ASR for my smart home -- 2026-06-01
OpenMOSS-Team/MOSS-TTS-v1.5 · Hugging Face -- 2026-06-01
[Editorial] -- 2026-06-01
[Editorial] -- 2026-06-01
[Editorial] -- 2026-06-01
[Editorial] -- 2026-06-01
[Editorial] Barracuda Nightmare Eclipse Zero-Days -- 2026-05-29
[Editorial] Video -- 2026-05-29
Shipping a Trillion Parameters With a Hub Bucket: Delta Weight Sync in TRL -- 2026-05-28
PACZero: PAC-Private Fine-Tuning of Language Models via Sign Quantization -- 2026-05-28
If AI writes your code, why use Python? -- 2026-05-12
[Editorial] Lex Fridman — DeepSeek Deep Dive with Dylan Patel & Nathan Lambert -- 2026-05-12
[Editorial] Reuven Cohen on AI Industry Developments -- 2026-05-12
[Editorial] When Helpfulness Becomes Sycophancy -- 2026-05-08
[Editorial] Vibe-Cast JEPA — Joint Embedding Predictive Architecture Exploration -- 2026-05-08
[Editorial] -- 2026-05-07
ProgramBench: Can we really rebuild huge binaries from scratch? (doesn't look like it) -- 2026-05-07
Adding Benchmaxxer Repellant to the Open ASR Leaderboard -- 2026-05-07
Does the "6 months gap" still hold? -- 2026-05-07
Claude Code @ Opus 4.7 vs OpenCode @ qwen3.6:27b. Both shipped a playable cozy roguelite. -- 2026-05-07
Fine-tuned Qwen3.6-35B-A3B DeltaNet experiment -- 2026-05-07
Zyphra/ZAYA1-8B -- 2026-05-07
Anthropic ships Claude for Creative Work with nine MCP-native connectors -- 2026-05-05
n8n Just Got a New Tool (and it can SUPERCHARGE Claude Automations) -- 2026-05-05
[Editorial] How to Use ADRs in Ruflo -- 2026-05-05
Microsoft and OpenAI end their exclusive and revenue-sharing deal -- 2026-05-01
[Editorial] Video: AI Development Insights -- 2026-05-01
Talkie: a 13B vintage language model from 1930 -- 2026-05-01
Your phone is about to stop being yours -- 2026-04-29
OpenAI almost banned me because I tried to automate YouTube download -- 2026-04-29
[Editorial] Video editorial submission -- 2026-04-29
GPT-5.5 is out -- 2026-04-28
OpenAI CEO's Identity Verification Company Announced Fake Bruno Mars Partnership -- 2026-04-28
Trump picked a fight with Anthropic. Now the administration is backing off. -- 2026-04-28
[Editorial] -- 2026-04-28
Generalization at the Edge of Stability -- 2026-04-28
[Editorial] -- 2026-04-28
[Editorial] NIST Cybersecurity MLX Pipeline -- 2026-04-27
[Editorial] LLM Fine-Tuning Guide -- 2026-04-27
[Editorial] Open Generative AI — Curated Resource List -- 2026-04-27
[Editorial] SecWest: AMD VVI — Hardware-Level Vulnerability Research -- 2026-04-20
[Editorial] Phenoelit/Halvar — Legendary Security Research -- 2026-04-20
Lean proved this program correct; then I found a bug -- 2026-04-14
Multi-Agentic Software Development Is a Distributed Systems Problem -- 2026-04-14
[Editorial] The Room That Quoted Back -- 2026-04-14
[Editorial] -- 2026-04-13
Qwen3.5-397B is shockingly useful at Q2 -- 2026-04-13
[Editorial] -- 2026-04-13
Liquid AI releases LFM2.5-VL-450M - structured visual understanding at 240ms -- 2026-04-13
LLM Novice Uplift on Dual-Use Biology Tasks — 4x Accuracy Boost Bypasses Safeguards -- 2026-04-10
[Editorial] Your AI Is Developing Capabilities Nobody Tested -- 2026-04-10
The current state of the Chinese LLMs scene -- 2026-03-26
Alibaba confirms they are committed to continuously open-sourcing new Qwen and Wan models -- 2026-03-26
Cursor's Composer 2 apparently built on Kimi K2.5 without attribution -- 2026-03-26
Nemotron Cascade 2 30B A3B -- 2026-03-26
NVIDIA 2026 Conference LIVE. New Base model coming! -- 2026-03-20
Nemotron 3 Nano 4B: A Compact Hybrid Model for Efficient Local AI -- 2026-03-20
Granite 4.0 1B Speech: Compact, Multilingual, and Built for the Edge -- 2026-03-20
[New Model & Agent] LocoTrainer-4B: A Claude Code-style local agent designed specifically to master the MS-SWIFT framework (4B, 32K, GGUF) -- 2026-03-20
shallowdream204/BitDance-14B-16x -- 2026-03-20
[Editorial] Karpathy autoresearch -- 2026-03-10
[Editorial] Doc-to-LoRA -- 2026-03-10
GPT-5.4 -- 2026-03-09
[Editorial] -- 2026-03-09
YuanLabAI/Yuan3.0-Ultra: 1010B MoE, fully open weights -- 2026-03-05
We could be hours (or less than a week) away from true NVFP4 support in Llama.cpp GGUF format -- 2026-03-05
Step-3.5-Flash-Base & Midtrain (in case you missed them) -- 2026-03-05
Qwen3.5-9B Uncensored Aggressive Release (GGUF) -- 2026-03-05
unknown -- 2026-03-05
[Editorial] David Maynor Security Gist -- 2026-03-04
[Editorial] arXiv:2602.23093 -- 2026-03-04
unpromptedcon.org -- 2026-03-04
Inside the M4 Apple Neural Engine, Part 1: Reverse Engineering -- 2026-03-03
Hydroph0bia – fixed SecureBoot bypass for UEFI firmware from Insyde H2O (2025) -- 2026-03-03
[Editorial] -- 2026-02-26
[Editorial] -- 2026-02-26
[Editorial] -- 2026-02-26
Free ASIC Llama 3.1 8B inference at 16,000 tok/s - no, not a joke -- 2026-02-25
[Editorial] Cognitum -- 2026-02-25
Hetzner Prices increase 30-40% -- 2026-02-25
[Editorial] -- 2026-02-24
Anthropic Accuses DeepSeek, Moonshot AI, and MiniMax of Creating 24,000 Fake Claude Accounts -- 2026-02-24
Gemini 3.1 Pro -- 2026-02-23
15 years of FP64 segmentation, and why the Blackwell Ultra breaks the pattern -- 2026-02-20
Nvidia and OpenAI abandon unfinished $100B deal in favour of $30B investment -- 2026-02-20
Google releases Gemini 3.1 Pro with Benchmarks -- 2026-02-20
praetorian-inc/brutus -- 2026-02-19
[Editorial] Unicornscan Getting Started -- 2026-02-19
[Editorial] Unicornscan Alicorn -- 2026-02-19
What Your Bluetooth Devices Reveal About You -- 2026-02-19
[Editorial] When everyone can build software, who learns well? -- 2026-02-19
Sonnet 4.6 feels like Opus 4.5 at Sonnet pricing -- 2026-02-19
Anthropic Raises $30,000,000,000 As Run-Rate Revenue Grew 10x Annually Over Three Years -- 2026-02-19
REASONING AUGMENTED RETRIEVAL (RAR) is the production-grade successor to single-pass RAG -- 2026-02-19
Qwen Released Qwen 3.5 397B and Qwen 3.5 Plus! -- 2026-02-17
Qwen3.5 NVFP4 (Blackwell) is up! -- 2026-02-17
Running Gemma 3n E2B natively on Android via LiteRT -- 2026-02-17
Deploying Open WebUI + vLLM on Amazon EKS -- 2026-02-17
[Editorial] https://forge-quality.dev/articles/case-of-passing-tests-investigation -- 2026-02-02
[Editorial] https://www.linkedin.com/posts/ownyourai_deepseek-just-released-the-first-vision-ai-activity-7421818927657385987-V1yo -- 2026-01-27
Unsloth announces support for finetuning embedding models -- 2026-01-27
matrixhub-ai/matrixhub -- 2026-01-14
HM-RunningHub/ComfyUI_RH_DreamID-V -- 2026-01-14
YouTube has removed the ability to search by upload date -- 2026-01-14
Tried this open-source framework for LLM fine-tuning over UI -- 2025-12-12
Golang optimizations for high-volume services -- 2025-12-12
[Editorial] https://www.linkedin.com/posts/stuart-winter-tear_assessing-llms-for-serendipity-discovery-activity-7396596796938153984-JY9u -- 2025-12-12
Deprecations via warnings don't work for Python libraries -- 2025-12-11
The "Confident Idiot" Problem: Why LLM-as-a-Judge fails in production. -- 2025-12-10
Toyota unintended acceleration and the big bowl of "spaghetti" code (2013) -- 2025-12-09
Free yourself from the Spotify desktop client with spotifyd -- 2025-12-04
Llamacpp Parameters Tuning -- 2025-12-02
[Editorial] https://huggingface.co/blog/grimjim/norm-preserving-biprojected-abliteration -- 2025-12-02
Pong Gets the Boot -- 2025-11-26
Building the largest known Kubernetes cluster, with 130k nodes -- 2025-11-26
The Qtile Window Manager: A Python-Powered Tiling Experience -- 2025-11-25
Most Stable Raspberry Pi? Better NTP with Thermal Management -- 2025-11-25
Quantum physicists have shrunk and "de-censored" DeepSeek R1 -- 2025-11-20
Gain 60% performance on RDNA 4 using this fix -- 2025-11-19
Scale-out is the silent killer of LLM applications. Are we solving the wrong problem? -- 2025-11-19
[Editorial] https://brianhorakh.medium.com/just-mcp-to-reduce-context-waste-in-spec-driven-development-3935922da5cf -- 2025-11-18
[AutoBE] Qwen3-80B suddenly wrote doomsday AI mythology while generating a TODO app -- 2025-11-18
My trick for better Claude Code collaboration: CLAUDE.md with conditional loading -- 2025-11-18
A proper way to connect a local LLM to iMessage? -- 2025-11-13
How do I level up from normie to normie pro with Claude -- 2025-11-13
POC: Model Context Protocol integration for native Ollama app -- 2025-11-12
Skills are in a weird middle ground between RAG and Custom GPTs, and I think that's why they feel so awkward -- 2025-11-12
Native LLM Router Integration with Cost Transparency for OpenWebUI -- 2025-11-12
Last week in Multimodal AI - Local Edition -- 2025-11-12
DeepSeek-OCR GGUF model runs great locally - simple and fast -- 2025-11-12
Qwen3-VL works really good with Zoom-in Tool -- 2025-11-12
lightonai/LightOnOCR-1B-1025 -- 2025-11-12
Qwen/Qwen3-VL-2B-Thinking -- 2025-11-12
[Editorial] https://www.linkedin.com/posts/daniel-cuthbert0x_a-month-ago-gadi-evron-and-i-set-about-building-ugcPost-7393643597729845248-TSTD -- 2025-11-11
Breakdown of New RunC Vulnerabilities -- 2025-11-11
[Editorial] https://www.linkedin.com/posts/andriyburkov_this-paper-shows-a-27-million-parameter-model-activity-7393432619365052416-SFLO -- 2025-11-10
Trajectory Distillation for Foundation Models -- 2025-11-10
sail-sg/Precision-RL -- 2025-11-10
inclusionAI/LLaDA2.0-flash-preview -- 2025-11-10
Finetuning DeepSeek 671B locally with only 80GB VRAM and Server CPU -- 2025-11-07
IPEX-LLM llama.cpp portable GPU and NPU working really well on laptop -- 2025-11-07
Building a PV Solar-Powered Quadcopter -- 2025-11-07
Adding a RTX 5080 into a 2U server with OcuLink -- 2025-11-06
Why does Image Recognition work in llama-server but not through Open WebUI? -- 2025-11-06
2025 Component Abuse Challenge: A Piezo Disk Powers A Transmitter -- 2025-11-06
[Editorial] Does the EU know that there are many countries outside of the EU that do not care at all about their -- 2025-11-03
Ilya Sutskever's deposition reveals previously unknown details [pdf] -- 2025-11-03
CISA and NSA share tips on securing Microsoft Exchange servers -- 2025-11-02
The Smol Training Playbook: The Secrets to Building World-Class LLMs -- 2025-11-02
Latest Update from Anthropic's new model - Neptune V6 -- 2025-11-02
AI "Phone Farm" Startup Gets Funding from Marc Andreessen to Flood Social Media With Spam -- 2025-11-02
Minimax-M2 cracks top 10 overall LLMs (production LLM performance gap shrinking: 7 points from GPT-5 in Artificial Analysis benchmark) -- 2025-11-01
🚨 OpenAI Gives Microsoft 27% Stake, Completes For-Profit Shift -- 2025-11-01
FlashPack: High-throughput tensor loading for PyTorch -- 2025-11-01
Kafka is Fast – I'll use Postgres -- 2025-11-01
Analog Surround Sound Was Everywhere, But You Probably Didn’t Notice -- 2025-11-01
Optimizing gpt-oss-120B on AMD RX 6900 XT 16GB: Achieving 19 tokens/sec -- 2025-10-31
Flamingo 3 released in safetensors -- 2025-10-31
Jeep Issues Emergency Recall for OTA-Bricked Wrangler 4xes -- 2025-10-29
queenkiley/AI-Art-Generator -- 2025-10-26
Unlock the power of images with AI Sheets -- 2025-10-26
Open WebUI Context Menu -- 2025-10-26
[Editorial] Browsers you can socially engineer -- 2025-10-24
Update on Plans for Privacy Sandbox Technologies -- 2025-10-24
PlayDiffusion finetune for audio inpainting non-verbal tags -- 2025-10-21
Nvidia has produced the first Blackwell wafer on US soil -- 2025-10-21
Show HN: Syna – Minimal ML and RL Framework Built from Scratch with NumPy -- 2025-10-21
C Project Turns Into Full-Fledged OS -- 2025-10-21
[Editorial] Agentic Orchestration -- 2025-10-19
Chicken Squisher 3000: Squish-Proof Security -- 2025-10-19
[Editorial] Claude Skills are awesome, maybe a bigger deal than MCP -- 2025-10-18
[Editorial] Claude Skills -- 2025-10-18
Copy-and-Patch: A Copy-and-Patch Tutorial -- 2025-10-17
We built 3B and 8B models that rival GPT-5 at HTML extraction while costing 40-80x less - fully open source -- 2025-10-17
Comparing Popular AI Evaluation Platforms for 2025 -- 2025-10-17
State of AI Report 2025 -- 2025-10-17
Nvidia breakthrough gives 4-bit pretraining technique the accuracy of FP8 -- 2025-10-15
AI assisted suite - Doubt about n_gpu layer test -- 2025-10-15
ibm-granite/granite-4.0-h-micro -- 2025-10-15
Qwen/Qwen3-VL-235B-A22B-Instruct -- 2025-10-15
Get your VLM running in 3 simple steps on Intel CPUs -- 2025-10-15
A 5-minute, no-BS way to pick a local model for your real task -- 2025-10-14
[Update] CodeLens.AI - Crowdsourced AI Leaderboard 3 Days Later: Blind Voting and What We Learned -- 2025-10-14
ZephrFish/OmniProx -- 2025-10-14
Preference optimization with ORPO and LoRA -- 2025-10-12
[Show] SpiralTorch: A Rust-based PyTorch-style autograd engine (Python 3.14-ready) -- 2025-10-12
2G Gone? Bring It Back Yourself! -- 2025-10-12
[Editorial] https://www.anthropic.com/research/small-samples-poison -- 2025-10-11
[Editorial] https://www.linkedin.com/pulse/from-chatbot-operating-system-what-openais-next-move-means-leimer-ju18c -- 2025-10-11
Rubygems.org AWS Root Access Event – September 2025 -- 2025-10-11
Stop flexing Pass@N — show Pass-all-N -- 2025-10-11
Architecting a project for optimal AI coding, any tips? -- 2025-10-11
Basekick-Labs/arc -- 2025-10-11
ServiceNow-AI/Apriel-1.5-15b-Thinker -- 2025-10-11
meituan-longcat/LongCat-Flash-Chat -- 2025-10-11
Did anyone try out GLM-4.5-Air-GLM-4.6-Distill ? -- 2025-10-10
Thank you Anthropic & this community! Our little side project just hit 1M visits and even made it on National TV! -- 2025-10-10
Sharing my free tool for easy handwritten fine-tuning datasets! -- 2025-10-09
vllm setup for nvidia (can use llama) -- 2025-10-05
Full-fine tuning doesn't require much vRAM with gradient checkpointing... -- 2025-10-05
Qwen/Qwen3-Omni-30B-A3B-Thinking -- 2025-10-05
inclusionAI/Ring-mini-linear-2.0 -- 2025-10-05
llama.cpp: Quantizing from bf16 vs f16 -- 2025-10-05
GLM 4.6 is nice -- 2025-10-04
NVFP4 or MXFP4 MOE on sm120 (RTX 5090 RTX 6000 PRO) -- 2025-10-04
Ask Hackaday: How Do You Distro Hop? -- 2025-09-30
Uncensor Qwen3 models without retraining -- 2025-09-20
Depth upscaling? -- 2025-09-20
Definitive proof openai/gpt-oss-20b is dumb as hell -- 2025-09-19
Free 10%+ Speedup for CPU/Hybrid Inference on Intel CPUs with Efficiency Cores -- 2025-09-17
Claude Performance Report with Workarounds - September 7 to September 14 -- 2025-09-16
PSA/RFC: KV Cache quantization forces excess processing onto CPU in llama.cpp -- 2025-09-15
native tool calling support for DeepSeek V3.1 just merged in llama.cpp -- 2025-09-15
model : add grok-2 support by CISC · Pull Request #15539 · ggml-org/llama.cpp -- 2025-09-15
An Afternoon at the Recursive Café: Two Threads Interleaving -- 2025-09-14
blacktop/go-hypervisor -- 2025-09-14
TSYJ-He/AutoEnvForge -- 2025-09-14
Hackaday Links: September 7, 2025 -- 2025-09-14
Any idea how to use ollama (debian) with 2x GPUs to load larger models? -- 2025-09-14
Rails on SQLite: new ways to cause outages -- 2025-09-14
Qwen3-Coder-480B Q2_K_XL same speed as Qwen3-235b-instruct Q3_K_XL WHY? -- 2025-09-09
Renting GPUs is hilariously cheap -- 2025-09-09
Ex-Miner Turned Local LLM Enthusiast, now I have a Dilemma -- 2025-09-09
Tencent-Hunyuan/HunyuanWorld-Voyager -- 2025-09-09
How the “Kim” dump exposed North Korea's credential theft playbook -- 2025-09-09
Further Adventures in Colorimeter Hacking -- 2025-09-09
🌟Introducing Art-0-8B: Reasoning the way you want it to with Adaptive Thinking🌟 -- 2025-09-02
Fine Tune Model for Home Assistant? -- 2025-09-02
[Editorial] Claude Code - massive issues -- 2025-09-01
TheDrummer is on fire!!! -- 2025-09-01
NVIDIA-Nemotron-Nano-12B-v2 -- 2025-09-01
Gpt-oss Fine-tuning - now with 60K context length and fits on <13GB VRAM -- 2025-08-30
A PLL For Perfect Pitch -- 2025-08-27
Why are users still able to edit system prompts or memories even after disabling it? -- 2025-08-20
Making your prompts better with GEPA-Lite using Ollama! -- 2025-08-15
Optimizing OpenWebUI's speed through indexing (using PostgreSQL as a back-end) -- 2025-08-11
PSA: DuckDuckGo search in OWUI routes to non-privacy friendly providers like Bing, Google, and Yahoo. -- 2025-08-11
Open-webui Tools for Firewalla -- 2025-08-11
Local RAG with 97% smaller index and Claude Code–compatible semantic search -- 2025-08-10
Trump Announces 100% Tariff on Semiconductors, unless made in US -- 2025-08-08
The Tape Speed Keyboard -- 2025-08-08
My first finetune: Gemma 3 4B unslop via GRPO -- 2025-08-02
Supervised Fine Tuning on Curated Data is Reinforcement Learning -- 2025-08-02
Debugging the Pixel 8 kernel via KGDB -- 2025-07-31
Ollama + Open WebUI -- is there a way for the same query to run through the same model multiple times (could be 3 times, could be 100 times), then gather all the answers together to summarise/count? -- 2025-07-25
Localllama’s (first?) IFTA - I’ll Fine-Tune Anything -- 2025-07-20
fsndzomga/metadspy -- 2025-07-10
osmosis-ai/Osmosis-Apply-1.7B -- 2025-07-10
IntervitensInc/pangu-pro-moe-model -- 2025-07-10
Continual Gradient Low-Rank Projection Fine-Tuning for LLMs -- 2025-07-10
Creating custom kernels for the AMD MI300 -- 2025-07-10
i made a commit message generator that can be used offline and for free -- 2025-07-05
THUDM/GLM-4.1V-9B-Thinking -- 2025-07-05
baidu/ERNIE-4.5-VL-424B-A47B-Base-Paddle -- 2025-07-05
LoRA Fine-Tuning Without GPUs: A CPU-Efficient Meta-Generation Framework for LLMs -- 2025-07-05
trycua/cua -- 2025-06-30
ml0-1337/claude-gate -- 2025-06-30
mstrYoda/go-arctest -- 2025-06-30
Accelerating Docker Builds by Halving EC2 Boot Time -- 2025-06-30
Advanced Time Manipulation with GDB -- 2025-06-21
Practical SDR: Getting started with software-defined radio -- 2025-06-21
liaotxcn/Probabilistic-Filters -- 2025-06-19
flohoss/gocron -- 2025-06-19
Lessons from Mixing Rust and Java: Fast, Safe, and Practical -- 2025-06-19
100 prisoners and a lightbulb -- looking back -- 2025-06-19
Paper2Poster/Paper2Poster -- 2025-06-10
Octoberfest7/zip_smuggling -- 2025-05-30
Silencing Firefox's Chattiness for Web App Testing -- 2025-05-30