AI Agents

Agent frameworks, autonomy, MCP, tool use, multi-agent orchestration

1519 articles across 276 editions

Articles

SAGA: Workflow-Atomic Scheduling for AI Agent Inference on GPU Clusters -- 2026-07-09
OfficeCLI: Office suite for AI agents to read and edit Microsoft Office files -- 2026-07-09
Toolport: Use as many MCP servers as you want without the token tax -- 2026-07-09
GPT-5.6 Sol Ultra will be in Codex -- 2026-07-09
Qwen 3.6 27B absolutely fails at agentic work -- 2026-07-09
I tested Anthropic's new Jacobian Lens on open models, then it turned into a local-model hallucination router -- 2026-07-09
Complete local model asset generation pipeline -- 2026-07-09
krea-ai/krea-2 -- 2026-07-09
I made a tool that chains a small local model into a big coding model and auto-unloads VRAM between them -- 2026-07-09
Ternlight – 7 MB embedding model that runs in browser (WASM) -- 2026-07-09
I tested freshly merged DFlash in llama.cpp on Qwen 3.6 27B Local AI win. 4.44x faster at 36K context. Here are my findings RTX 6000 PRO. -- 2026-07-08
Ollama 0.31: Faster Gemma 4 on Apple Silicon with MTP. Here is my test showing a 56% boost on M1 Pro 16GB (2021) -- 2026-07-08
AMD Ryzen AI Halo – $4k AI Dev Kit -- 2026-07-08
So... anyone copped one of these? -- 2026-07-08
Uh.. Honey, how do you feel about takeout? -- 2026-07-08
COMAP: Co-Evolving World Models and Agent Policies for LLM Agents -- 2026-07-08
NVlabs/SpatialClaw -- 2026-07-08
LeRobot v0.6.0: Imagine, Evaluate, Improve -- 2026-07-08
AI-Builder-Club/skills -- 2026-07-08
My Agentic Workbench -- 2026-07-08
[Editorial] -- 2026-07-08
Feeling dumb day by day after using claude code -- 2026-07-08
SentinelMCP: Open-Source MCP Firewall & Security Gateway for AI Agents -- 2026-07-07
Codex CLI Jailbreak Guide — Customizing System Prompt via model_instructions_file -- 2026-07-07
[Editorial] Video Feature -- 2026-07-07
Waveloom: Terminal Coding Agent Optimized for DeepSeek Prefix Caching — 95-99% Cache Hit, 1/50th Cost -- 2026-07-07
Qwen3.6-27B Vibecodes A* Pathfinding in Java Game — 12 Hours of Autonomous Testing -- 2026-07-07
Codex Builds Pikachu Volleyball in UmLang — an Obscure Korean Esoteric Programming Language -- 2026-07-07
[Editorial] @metaharness/flywheel -- 2026-07-07
[Editorial] Beijing Looking to Curb Overseas Access to China's Top AI Models -- 2026-07-07
Alibaba to ban Claude Code in workplace over alleged backdoor risks, source says -- 2026-07-07
Virginia bans sale of geolocation data -- 2026-07-07
[Editorial] Open-Source AI Coding Agents Shell Injection Vulnerability -- 2026-07-06
[Editorial] arXiv Research Paper 2606.24496 -- 2026-07-06
Is it ever possible to have a malicious LLM with a backdoor -- 2026-07-06
Android Developer Verification: Threat masquerading as Protection -- 2026-07-06
[Editorial] Video Content -- 2026-07-06
[Editorial] The Future of Agentics Is Not the Model -- 2026-07-06
[Editorial] Loop Came Home -- 2026-07-06
[Editorial] ruvnet Technical Gist -- 2026-07-06
[Editorial] ruvnet Technical Gist -- 2026-07-06
[Editorial] Video Content -- 2026-07-06
[Editorial] Video Content -- 2026-07-06
[Editorial] Video Content -- 2026-07-06
Does code cleanliness affect coding agents? A controlled minimal-pair study -- 2026-07-06
The Safari MCP server for web developers -- 2026-07-06
cellebrite-labs/ghidra-rpc — Agentic Reverse Engineering Skill for Ghidra -- 2026-07-03
raiyanyahya/recall — Durable Offline Memory for Claude Code -- 2026-07-03
[Editorial] Agentic Rust Optimizer -- 2026-07-03
[Editorial] Nexu Open Design -- 2026-07-03
[Editorial] Agents Optimizing Their Own Behavior -- 2026-07-03
Forsy-AI/agent-apprenticeship -- 2026-07-03
Scalable Inference Architectures for Compound AI Systems: A Production Deployment Study -- 2026-07-03
OmniAct: Advancing Omnimodal Embodied Agents from Isolated Skills to Everyday Physical Autonomy -- 2026-07-03
[Editorial] OpenAI Proposes US Government Own 5% Stake -- 2026-07-03
OpenAI: In early talks to give 5% stake to US Government -- 2026-07-03
The number 1 public enemy of open-source. -- 2026-07-03
Model Registry: Torrents for open models using Hugging Face as a fallback web seed. -- 2026-07-03
Anatomy of a Failed (Nation-State?) Attack -- 2026-07-01
projectdiscovery/depx -- 2026-07-01
Claude Code suddenly tried to open a Remote Desktop connection on my PC. This seriously scared me. -- 2026-07-01
Google's masterclass on agentic engineering patterns -- 2026-06-30
[Editorial] AgentBBS — Bulletin Board System for AI Agents -- 2026-06-30
OpenKnowledge: Open source AI-first alternative to Obsidian/Notion with Claude/Codex integration -- 2026-06-30
Bingo - AI-powered Red Team Terminal (DeepSeek/Claude/GPT/GLM) -- 2026-06-30
[Editorial] REcon Conference for Reversers and Security Researchers -- 2026-06-30
Enhancing X11 Application Security with LXC -- 2026-06-30
TxBench-PP: Analyzing AI Agent Performance on Small-Molecule Preclinical Pharmacology -- 2026-06-29
Herdr: Agent multiplexer that lives in your terminal -- 2026-06-29
[Editorial] -- 2026-06-27
[Editorial] -- 2026-06-27
OpenAI Codex has a bug that could kill your SSD in under a year -- 2026-06-27
[Editorial] -- 2026-06-27
[Editorial] -- 2026-06-27
[Editorial] -- 2026-06-27
[Editorial] -- 2026-06-27
Unlimited-OCR is now on ModelScope! A 3.3B multilingual OCR model for one-shot parsing across single images, multi-page documents, and PDFs. License: MIT -- 2026-06-27
[Editorial] Xiaomi HarnessX — self-rewriting AI scaffolding -- 2026-06-26
yzfly/TokenCode -- 2026-06-26
How I'm handling per-agent isolation and environment lifecycle in a harness-agnostic orchestration library -- 2026-06-26
Andrezi: a local-first memory governance layer for Claude Code (honest writeup, MIT) -- 2026-06-26
[Editorial] Agentic QE v3.11.1 -- 2026-06-26
Computer Use in Gemini 3.5 Flash -- 2026-06-25
gemini-web2api: Convert Gemini Web to OpenAI-Compatible API — Zero Auth, Single File -- 2026-06-25
[Editorial] Mistral OCR 4 -- 2026-06-25
AADvark: Agent-Aided Design for Dynamic CAD Models with Moving Parts -- 2026-06-25
[Editorial] Agentic Context Engine -- 2026-06-19
[Editorial] The Most Important Idea in AI Today — Reuven Cohen -- 2026-06-19
[Editorial] Video Pick 2 -- 2026-06-19
Managing entire business banking through Claude MCP -- 2026-06-19
[Editorial] The Flat Curve Society — Steve Yegge -- 2026-06-19
Peopleless economy? Not technically impossible -- 2026-06-19
Local coding agents are good now, but only if you babysit them -- 2026-06-18
STOP Using Claude Code Without This Fable 5 Agentic OS -- 2026-06-18
[Editorial] -- 2026-06-18
openclaw/agent-skills -- 2026-06-18
Agentic Resource Discovery: Let agents search -- 2026-06-18
henliveira/av-curator -- 2026-06-18
[Editorial] Cursor: Agent Autonomy & Auto-Review -- 2026-06-17
paradigmxyz/centaur — Multiplayer, Self-Hosted, Secure Agents -- 2026-06-17
tastyeffectco/sandboxes — Self-Hosted Dev Sandboxes -- 2026-06-17
Autonomous LLM-Guided Disease Forecasting Matches CDC Expert Ensembles in Prospective Evaluation -- 2026-06-16
AI Giants Score Below 25% in UC Berkeley-Led Test of Real-World Application Across 50+ Industries -- 2026-06-16
[Editorial] Research Paper -- 2026-06-16
[Editorial] Claude Fable 5 Made This Entire Video -- 2026-06-15
[Editorial] Agent Harness Generator -- 2026-06-15
[Editorial] AI Demo/Showcase Video -- 2026-06-15
[Editorial] Ponytail — Open Source Tool -- 2026-06-15
[Editorial] CISO Perspective on Cyber + AI Convergence -- 2026-06-15
[Editorial] ISO 27001 Meets Agent Security -- 2026-06-15
AzureRedOps — Offensive Security Toolkit for Microsoft Entra ID -- 2026-06-15
[Editorial] AI Tooling Walkthrough Video -- 2026-06-15
[Editorial] Gadi Evron on Forcing Agents to Find -- 2026-06-12
townsendmerino/ken -- 2026-06-12
[Editorial] OB1 Project -- 2026-06-12
[Editorial] Vimeo Feature -- 2026-06-12
Google's Agents CLI: The CLI + Skills Combination to Ship AI Agents EASILY -- 2026-06-12
ultracode is the most powerful claude code feature in months -- 2026-06-12
Top 3 Underrated Open Source Repos Nobody Talks About -- 2026-06-12
[Editorial] -- 2026-06-11
Harness Engineering: What Separates Top Agentic Engineers Right Now -- 2026-06-11
Claude Plans, Gemini Designs: One Workflow for Beautiful Frontends (LIVE) -- 2026-06-11
[Editorial] Cole Medin: Most Powerful AI Coding Setup -- 2026-06-11
Codex Remote is a GAME CHANGER -- 2026-06-11
This Claude Code + Obsidian Command Center is INSANE -- 2026-06-11
Top 5 Web Design Plugins for Claude Code -- 2026-06-11
How an Agent Built a 3D Paris Gallery by Chaining Two Hugging Face Spaces -- 2026-06-09
Leap in DNA synthesis slashes time to build new genetic sequences -- 2026-06-09
Thi.ng – open-source building blocks for computational design and art -- 2026-06-09
Exploring the Potential of Probabilistic Transformer for Time Series Modeling: A Report on the ST-PT Framework -- 2026-06-09
Efficient-Large-Model/SANA-WM_bidirectional -- 2026-06-09
[Editorial] -- 2026-06-09
ajsai47/backdoor -- 2026-06-09
This Open Source Repo Just Solved Claude Code's #1 Problem -- 2026-06-09
KyrieCheungYep/ky-design-to-html-skill -- 2026-06-09
Show HN: Gitdot – A better GitHub. Open-source, written in Rust -- 2026-06-09
[Editorial] -- 2026-06-08
The Most Powerful Claude Code Feature In Months Dropped & Nobody is Talking About It -- 2026-06-08
ultracode is INSANE and nobody is talking about it -- 2026-06-08
ethanhq/cc-fleet -- 2026-06-08
[Editorial] -- 2026-06-08
[Editorial] -- 2026-06-08
[Editorial] -- 2026-06-08
[Editorial] -- 2026-06-08
perplexityai/bumblebee -- 2026-06-08
wexaai/cognodb -- 2026-06-08
[Editorial] -- 2026-06-08
[Editorial] -- 2026-06-08
MisoLabs/MisoTTS -- 2026-06-08
Anthropic Just Dropped a Masterclass on Building Agent Harnesses (for Large Codebases) -- 2026-06-05
[Editorial] chopratejas/headroom -- 2026-06-05
Is It Time To Switch to Codex? -- 2026-06-05
The Storyboard Trick That Stops AI Slop Code -- 2026-06-05
rulyone/Simple-ReAct-Agent -- 2026-06-05
AEM: Adaptive Entropy Modulation for Multi-Turn Agentic Reinforcement Learning -- 2026-06-05
Deciphering Shortcut Learning from an Evolutionary Game Theory Perspective -- 2026-06-05
DynaTree: Dynamic Agentic Retrieval Tree for Time-Sensitive News Retrieval -- 2026-06-04
Property-Guided LLM Program Synthesis for Planning -- 2026-06-04
Conversational Demand Response: Bidirectional Aggregator-Prosumer Coordination through Agentic AI -- 2026-06-04
mims-harvard/AutoScientists -- 2026-06-04
[Editorial] -- 2026-06-04
[Editorial] -- 2026-06-04
[Editorial] -- 2026-06-04
You Don't Understand the Power of a Claude Code Agentic OS -- 2026-06-04
[Editorial] Agentic Tracebit — AI-Powered Security Deception -- 2026-06-03
puck-security/puck-scout -- 2026-06-03
V0id-v2/Void-Tools-v2.0 -- 2026-06-03
ClaudioDrews/memory-os -- 2026-06-03
Gograph: AST-Based MCP Server That Cuts Claude Code Token Use by 95% in Go Repos -- 2026-06-03
[Editorial] The Future of Software Development -- 2026-06-03
I ran 8 open-weight models as agents in a persistent MMO for 10 days. Here's the 93k event dataset and some things that I learned -- 2026-06-01
How Qwen3.6-35B-A3B fails differently as a sub agent compared to solo -- 2026-06-01
I built a computer use sandbox framework for codex on headless linux. GPU passthrough, computer use, and sudo access for codex all work. -- 2026-06-01
ZJU-REAL/SDAR -- 2026-06-01
Claude Opus 4.8 -- 2026-06-01
[Editorial] -- 2026-06-01
New DeepSWE benchmark finds Claude Opus cheats -- 2026-05-29
ITBench-AA: Frontier Models Score Below 50% on Enterprise IT Tasks — by Artificial Analysis and IBM -- 2026-05-29
Context, Reasoning, and Hierarchy: Cost-Performance Study of Compound LLM Agent Design in an Adversarial POMDP -- 2026-05-29
10 years of AI robustness tricks (PGD, RLHF, Data Augmentation) are actually computing the same hidden matrix -- 2026-05-29
[Editorial] Dynamic Workflows in Claude Code -- 2026-05-29
Using AI to write better code more slowly -- 2026-05-29
Overnight autonomous coding with Claude Code -- 2026-05-29
[Editorial] Harness Engineering for AI Coding -- 2026-05-28
[Editorial] pacphi Gist -- 2026-05-28
Patdolitse/piia-engram -- 2026-05-28
yliust/Tactile: accessibility-first operating layer for agents -- 2026-05-28
hadriansecurity/OpenHack -- 2026-05-28
OpenAI cofounder Karpathy joins Anthropic to teach Claude to improve itself without humans -- 2026-05-28
China Clamps Down on Overseas Travel for AI Talent at Alibaba, DeepSeek -- 2026-05-28
MOSS: Self-Evolution through Source-Level Rewriting in Autonomous Agent Systems -- 2026-05-27
Bian Que: An Agentic Framework with Flexible Skill Arrangement for Online System Operations -- 2026-05-27
Same task in github-copilot, pi, claude-code, and opencode with Qwen3.6 27B -- 2026-05-27
[Editorial] Millions of AI Agents Imperiled by Critical Vulnerability in Open-Source Package -- 2026-05-27
ekomsSavior/Centipede — Self-replicating Linux worm with multi-layer C2 -- 2026-05-27
[Editorial] Editor's Pick (Video) -- 2026-05-27
Dead Weights, Live Signals: Feedforward Graphs of Frozen Language Models -- 2026-05-27
[Editorial] leochlon/ntkmirror -- 2026-05-27
BBuf/kernel-pilot -- 2026-05-27
A vector index can't tell if today's "Karpathy" is the same one it saw yesterday. Here's the fix -- 2026-05-26
[Editorial] -- 2026-05-26
[Editorial] -- 2026-05-26
DanOps-1/Gpt-Agreement-Payment -- 2026-05-26
A Workflow-Oriented Framework for Asynchronous Human-AI Collaboration in Hybrid and Compute-Intensive HPC Environments -- 2026-05-22
Simple Multi-Agent Architecture Running Across Our Entire Org. Keeping everything in Loop. -- 2026-05-22
eight-acres-lab/openmelon -- 2026-05-22
GPT 5.5 (Codex) leading the future prediction race -- 2026-05-22
Agentic Multi-Agent Architecture for Cybersecurity Risk Management -- 2026-05-21
AiSOC: Open-Source AI-Powered Security Operations Center -- 2026-05-21
Cuocuo: Encrypted Tunnel Relay (XChaCha20-Poly1305 + Protobuf) -- 2026-05-21
Trojan's Whisper: Stealthy Manipulation of Coding Agents via Injected Guidance -- 2026-05-21
Agent issued rm -rf / to test its own command blocking -- 2026-05-21
OpenSquilla: Token-Efficient AI Agent -- 2026-05-21
SmallCode: Open-source agentic coding tool stabilized after 90+ bug fixes -- 2026-05-21
Qwen3-Coder-Next lands on HuggingFace -- 2026-05-21
Open Relay v4.1–4.3: Terminal, Performance, and Code Block Overhaul for Open WebUI -- 2026-05-21
No Slop Grenade -- 2026-05-21
Gemini 3.5 Flash -- 2026-05-20
Cursor Introduces Composer 2.5 -- 2026-05-20
[Editorial] -- 2026-05-20
We let AIs run radio stations -- 2026-05-19
Agora-1: The Multi-Agent World Model -- 2026-05-19
Running agents 2x might be the simplest way to improve performance -- 2026-05-19
[Editorial] -- 2026-05-19
Open Source vs frontier models on a single-file HTML canvas driving animation - results -- 2026-05-19
Every Claude Code User NEEDS To Watch This -- 2026-05-19
The reason for the new limits is that SpacexAI is renting its servers to Anthropic. -- 2026-05-19
[Editorial] -- 2026-05-19
[Editorial] -- 2026-05-18
[Editorial] -- 2026-05-18
Claude Mythos Speeds Up macOS Security Exploit Research -- 2026-05-18
tiangolo/library-skills — Library Agent Skills -- 2026-05-15
taito — A package manager for local AI skill/agent bundles -- 2026-05-15
Let's build Claude Code from scratch — nanoclaude -- 2026-05-15
[Editorial] The Graveyard Folder -- 2026-05-15
Claude for Small Business -- 2026-05-15
[Editorial] -- 2026-05-15
Reimagining the mouse pointer for the AI era -- 2026-05-15
Twin Brothers Wipe 96 Government Databases Minutes After Being Fired -- 2026-05-14
[Editorial] Ghost SIM Attack — Black Hat SecTor Research -- 2026-05-14
From Skills to Talent: Organising Heterogeneous Agents as a Real-World Company -- 2026-05-13
GammaLabTechnologies/harmonist -- 2026-05-13
How OpenAI runs its Codex coding agent safely at scale -- 2026-05-13
amanning3390/deepswarm -- 2026-05-13
[Editorial] -- 2026-05-13
[Editorial] -- 2026-05-13
Rant: The realization that most of what ive been calling "evals" has been vibe checks. -- 2026-05-13
[Editorial] -- 2026-05-13
[Editorial] -- 2026-05-13
FaultLine - LLM memory with a bouncer at the door -- 2026-05-13
regent-vcs/re_gent — Version Control for AI Coding Agents -- 2026-05-12
Show HN: adamsreview – better multi-agent PR reviews for Claude Code -- 2026-05-12
nateherkai/token-dashboard — Claude Code Token Cost Analytics -- 2026-05-12
Open WebUI v0.9.3 (and v0.9.4) is out — massive performance wins, message editing finally fixed -- 2026-05-12
We built and open-sourced Caliby: An embedded, high-performance vector database for AI Agents (Beats pgvector by 4x) -- 2026-05-12
[Editorial] Arxiv Research Paper -- 2026-05-12
QKVShare: Quantized KV-Cache Handoff for Multi-Agent On-Device LLMs -- 2026-05-12
MachinaCheck: Building a Multi-Agent CNC Manufacturability System on AMD MI300X -- 2026-05-12
Agents need control flow, not more prompts -- 2026-05-11
[Editorial] NEEDLE — AI Search & Retrieval Tool -- 2026-05-11
[Editorial] Mesh — Agent Mesh Framework -- 2026-05-11
[Editorial] NEEDLE Getting Started Guide -- 2026-05-11
bschoepke/ableton-live-mcp — MCP Bridge for Ableton Live -- 2026-05-11
Recursive Agent Optimization — RL for Recursive Agent Spawning -- 2026-05-08
[Editorial] Anthropic Introducing Dreaming for Agents -- 2026-05-08
Qwen WebWorld 32B/14B/8B — Open Web World Model for Agent Training -- 2026-05-08
Tilde.run — Agent Sandbox with Transactional Versioned Filesystem -- 2026-05-08
[Editorial] -- 2026-05-07
-- 2026-05-07
context-labs/HALO -- 2026-05-07
[Editorial] -- 2026-05-07
GPT 5.5 just leaked its chain of thought to me in codex, and it looks like an idea from 5 months ago in this sub. -- 2026-05-07
Agents can now create Cloudflare accounts, buy domains, and deploy -- 2026-05-07
hacktivist123/agent-session-resume -- 2026-05-07
When everyone has AI and the company still learns nothing -- 2026-05-07
[Editorial] When AI Writes the AI Strategy -- 2026-05-06
[Editorial] Video -- 2026-05-06
Project Deal: Anthropic created a marketplace for their employees & tasked Claude with buying, selling and negotiating on employees behalf. -- 2026-05-06
Here's 45 seconds of Facebook telling me the White House shooter was a former staffer of literally almost every major sports team -- 2026-05-06
Lessons for Agentic Coding: What should we do when code is cheap? -- 2026-05-06
[Editorial] Claude's Multi-Stage Multi-Level Agentic -- 2026-05-06
What's new in CC 2.1.124 (+166 tokens) and 2.1.126 (-87 tokens) system prompt -- 2026-05-06
github.com -- 2026-05-06
Humanoid Robot Actuators -- 2026-05-05
[Editorial] Four Levels of Agentic Software Development -- 2026-05-04
[Editorial] Most Companies Aren't Ready for AI -- 2026-05-04
DeepClaude — Claude Code Agent Loop with DeepSeek V4 Pro -- 2026-05-04
Two Claude Code Agents Collaborating in a Shared Chat Room -- 2026-05-04
paradigm-memory: Local Cognitive Memory MCP for AI Coding Agents -- 2026-05-04
[Editorial] OIA Agentics — Open Interoperability for Agentic AI -- 2026-05-01
Agentic Microphysics: A Manifesto for Generative AI Safety -- 2026-05-01
[Editorial] Agentic AI: Lessons from the Trenches -- 2026-05-01
[Editorial] Human Work Time Allocation in the Hybrid AI Era -- 2026-05-01
Preference-Aligned LoRA Merging: Preserving Subspace Coverage and Addressing Directional Anisotropy -- 2026-04-30
[Editorial] Autonomous Knowledge Graph Exploration -- 2026-04-30
Caveman – Claude Code skill that cuts 75% of tokens by talking like caveman -- 2026-04-29
[Editorial] Matt Pocock's Claude Code Skills -- 2026-04-29
Opencode-power-pack – Claude Code skills ported to OpenCode -- 2026-04-29
[Editorial] ChatGPT Images 2 + Claude Design Guide -- 2026-04-29
Show HN: OSS Agent I built topped the TerminalBench on Gemini-3-flash-preview -- 2026-04-28
yzhao062/anywhere-agents -- 2026-04-28
run-llama/ParseBench -- 2026-04-28
An update on recent Claude Code quality reports -- 2026-04-27
MeshCore development team splits over trademark dispute and AI-generated code -- 2026-04-27
MemPalace: The highest-scoring AI memory system ever benchmarked -- 2026-04-27
[Editorial] AgentBox — Sandboxed Agent Execution -- 2026-04-27
[Editorial] Design Council -- 2026-04-27
[Editorial] Claude Code Game Studios -- 2026-04-27
S. Korea police arrest man over AI image of runaway wolf that misled authorities -- 2026-04-24
jkeatn/Rainmaker — Autonomous Weather Prediction Agent (73% Win Rate on Polymarket) -- 2026-04-24
0x0funky/agent-sprite-forge — AI Agent Skill for 2D Sprite Sheet Generation -- 2026-04-24
Over-editing refers to a model modifying code beyond what is necessary -- 2026-04-23
Scoring Show HN submissions for AI design patterns -- 2026-04-23
[Editorial] Video Feature -- 2026-04-23
[Editorial] BankerToolBench — Evaluating AI Agents -- 2026-04-23
Kimi vendor verifier – verify accuracy of inference providers -- 2026-04-23
[Editorial] AI Industry Perspective -- 2026-04-23
R2RAG: Routing-to-RAG — Award-Winning Dynamic RAG Architecture -- 2026-04-22
[Editorial] Agent Observability: Required but We're Not There Yet -- 2026-04-22
[Editorial] ArXiv Research Paper -- 2026-04-22
[Editorial] Video Content -- 2026-04-22
[Editorial] Lean Island: Castaneda's Philosophy as System Design -- 2026-04-22
Benchmarked 4 agent memory systems: Mem0 scores 49% recall (worse than a coin flip), Zep uses 340x more tokens for 15 points improvement. Here's what's actually going on. -- 2026-04-21
Open-sourced my OpenWebUI router — semantic routing, citation verification, and per-chat memory for any LLM. -- 2026-04-21
THIS SHOULD NOT BE POSSIBLE IN OPEN WEBUI: LIVE VISUALIZATION RENDERING - Inline Visualizer v2 is HERE! -- 2026-04-21
Is Your Site Agent-Ready? (By Cloudflare) -- 2026-04-21
Codex for almost everything -- 2026-04-21
I'm Building an AI Dark Factory That Ships Its Own Code (Public Experiment) -- 2026-04-21
The PR you would have opened yourself -- 2026-04-21
Anthropic says OpenClaw-style Claude CLI usage is allowed again -- 2026-04-21
[Editorial] -- 2026-04-21
Guy builds AI driven hardware hacker arm from duct tape, old cam and CNC machine -- 2026-04-21
mliu98/awesome-human-distillation -- 2026-04-21
[Editorial] -- 2026-04-21
[Editorial] arxiv:2603.19461 — AI Research Paper -- 2026-04-20
[Editorial] NousResearch Hermes Agent — Open Agentic Framework -- 2026-04-20
[Editorial] Agentic AI with Local LLMs (NousResearch) -- 2026-04-20
[Editorial] Evo — Evolutionary AI Framework -- 2026-04-20
linkedin.com -- 2026-04-20
[Editorial] IETF Agent Authentication Protocol Draft -- 2026-04-17
[Editorial] Video Submission -- 2026-04-17
[Editorial] Hermes Security Upgraded with ClawSec Skill -- 2026-04-17
[Editorial] GitHub Gist Submission -- 2026-04-17
[Editorial] Claude Opus 4.7 Launch -- 2026-04-17
[Editorial] Cole Medin on Opus 4.7 -- 2026-04-17
Qwen3.6-35B-A3B: Agentic coding power, now open to all -- 2026-04-17
[Editorial] Unsloth Qwen3.6 Model Docs -- 2026-04-17
Major drop in intelligence across most major models -- 2026-04-17
[Editorial] My Team Built 30 AI Agents with Claude — Ignored All of Them -- 2026-04-16
[Editorial] Video Content -- 2026-04-16
[Editorial] Your Agent Needs a SOUL.md -- 2026-04-16
[Editorial] Prompt Engineering Is Dead, Long Live Prompt Engineering -- 2026-04-16
InfoSeeker: A Scalable Hierarchical Parallel Agent Framework for Web Information Seeking -- 2026-04-16
[Editorial] Arxiv Research Paper -- 2026-04-16
[Editorial] Claude Code Quiet but Important Update -- 2026-04-16
[Editorial] duh — Developer Utility Harness -- 2026-04-16
[Editorial] Yet Another Harness — Here's Why -- 2026-04-16
[Editorial] Vibe Coding: Build a Website with AI -- 2026-04-16
[Editorial] ASI-01 Agent Goal Hijack: A Practical Security Guide -- 2026-04-15
[Editorial] AGHAST: Open Source Security Tool Release -- 2026-04-15
Ransomware Is Growing Three Times Faster Than the Spending Meant to Stop It -- 2026-04-15
Offensive Security Professional Blocked by Claude — Cyber Use Case Form Ignored -- 2026-04-15
[Editorial] Archon: AI Agent Framework -- 2026-04-15
CoreCoder: Minimal AI Coding Agent in ~950 Lines of Python -- 2026-04-15
HitCC: Complete Reverse-Engineering of Claude Code CLI v2.1.84 -- 2026-04-15
[Editorial] Pipecat Announcements -- 2026-04-15
[Editorial] LiteParse Samples by Jerry Liu -- 2026-04-15
[Editorial] exploraX Update -- 2026-04-15
Zed Industries zeta-2: Editor-Native AI Model -- 2026-04-15
Supply-Chain Poisoning Attacks Against LLM Coding Agent Skill Ecosystems -- 2026-04-14
elastic/supply-chain-monitor -- 2026-04-14
[Editorial] UK AI Security Institute + Claude -- 2026-04-14
Pro Max 5x quota exhausted in 1.5 hours despite moderate usage -- 2026-04-14
OpenAI releases new $100 Pro tier, rebalances Codex usage -- 2026-04-14
Claude used to push back, now it just agrees with everything -- 2026-04-14
Claude Thinking Blocks Are Being Summarized By A Second Agent -- 2026-04-14
SafeRL-Lab/nano-claude-code — Python reimplementation supporting any model -- 2026-04-14
LiteCode — free, open-source CLI coding agent for 8k-context LLMs -- 2026-04-14
GitHub Stacked PRs -- 2026-04-14
[Editorial] -- 2026-04-14
I still prefer MCP over skills -- 2026-04-13
[Editorial] -- 2026-04-13
[Editorial] -- 2026-04-13
[Editorial] -- 2026-04-13
[Editorial] -- 2026-04-13
[Editorial] -- 2026-04-13
[Editorial] -- 2026-04-13
Research-Driven Agents: When an agent reads before it codes -- 2026-04-13
aaronjmars/MiroShark -- 2026-04-13
[Editorial] -- 2026-04-13
[Editorial] Four Layers of Sandboxing LLM Agents -- 2026-04-10
Freestyle – Sandboxes for Coding Agents -- 2026-04-10
[Editorial] Arxiv Research Paper -- 2026-04-10
[Editorial] Provable Assurance for Agentic Systems -- 2026-04-10
[Editorial] Anthropic's New Managed Agents -- 2026-04-10
[Model Release] 9B Agentic Data Analyst LoRA — 89% Autonomous Workflow Completion -- 2026-04-10
Continuous Batching for Agent Swarms — 42 Minutes to 70 Seconds -- 2026-04-10
[Editorial] Karpathy Gist -- 2026-04-10
[Editorial] Arxiv Research Paper -- 2026-04-10
Box Maze: Process-Control Architecture for Reliable LLM Reasoning -- 2026-04-10
Major Cache Reuse Bug Traced to Qwen 3.5's Chat Template -- 2026-04-10
Anthropic's Claude Managed Agents Public Beta — Production Agent Infrastructure -- 2026-04-09
botctl — Process Manager for Autonomous AI Agents -- 2026-04-09
Feynman — AI Learning Companion -- 2026-04-09
Claude Code Video Toolkit -- 2026-04-09
[Editorial] -- 2026-04-08
Show HN: Hippo, biologically inspired memory for AI agents -- 2026-04-08
[Editorial] -- 2026-04-08
honeybadge-labs/virtui -- 2026-04-08
OpenWebUI integration, code intelligence for 248 languages, and more in Kreuzberg v4.7.0 -- 2026-04-08
CLI-Anything Just Brought Claude Code Into The Future -- 2026-04-08
Claude Code + Codex = AI GOD -- 2026-04-08
Any Custom Frontend with Gradio's Backend -- 2026-04-08
GLM-5.1: Towards Long-Horizon Tasks -- 2026-04-08
MegaTrain: Full Precision Training of 100B+ Parameter LLMs on a Single GPU -- 2026-04-08
kevinrgu/autoagent — autonomous harness engineering -- 2026-04-07
remorses/usecomputer — Fast computer automation CLI for AI agents -- 2026-04-07
Claude Code + LightRAG = UNSTOPPABLE -- 2026-04-07
[Editorial] Career-Ops -- 2026-04-07
Claude Code is unusable for complex engineering tasks -- 2026-04-07
Eight years of wanting, three months of building with AI -- 2026-04-07
'Addictive' agentic coding has developers losing sleep -- 2026-04-07
[Editorial] Video Feature -- 2026-04-06
[Editorial] Everything Claude Code -- 2026-04-06
[Editorial] Steve Yegge: Gas Town — From Clown Show to V1.0 -- 2026-04-06
Karpathy's Obsidian RAG + Claude Code = CHEAT CODE -- 2026-04-06
[Editorial] Elastic Open-Sources Their AI Tool -- 2026-04-06
[Editorial] CVE-2026-22738 Proof of Concept -- 2026-04-06
[Editorial] Linux Kernel — The Clearest Example -- 2026-04-06
[Editorial] FindEvil — Security Tooling Hackathon -- 2026-04-06
Coding agents could make free software matter again -- 2026-04-02
GitHub backs down, kills Copilot pull-request ads after backlash -- 2026-04-02
How do you know your AI audit tool actually checked everything? I was fairly confident that my skill suite did. It didn't. -- 2026-04-02
Slop is not necessarily the future -- 2026-04-02
[Editorial] -- 2026-04-02
Holo3: Breaking the Computer Use Frontier -- 2026-04-02
openyak/openyak -- 2026-04-02
[Editorial] -- 2026-04-02
Stanford, Harvard and MIT spent two weeks watching AI agents run loose. The paper is unsettling. -- 2026-04-01
[Editorial] Agent Responsibly — Vercel's Guide -- 2026-04-01
[Editorial] Slightly Safer Vibecoding by Adopting Better Practices -- 2026-04-01
[Editorial] When AI Writes the AI Strategy -- 2026-04-01
[Editorial] AI Development Video Tutorial -- 2026-04-01
[Editorial] Learn Claude Code — Engineering Guide -- 2026-04-01
[Editorial] Ask RuvNet — AI Assistant Demo -- 2026-04-01
Claude Wrote a Full FreeBSD Remote Kernel RCE with Root Shell (CVE-2026-4747) -- 2026-04-01
[Editorial] Claude Mythos Cracked the Linux Kernel -- 2026-04-01
[Editorial] AI-Powered Pentesting in Practice -- 2026-04-01
[Editorial] Meta-Harness: Learning to Harness AI Agents -- 2026-03-31
[Editorial] Video: AI Development Deep Dive -- 2026-03-31
Planner Agent V3 with SubAgents for Open WebUI -- 2026-03-31
Benchmarked 31 STT Models on Medical Audio: VibeVoice 9B Is the New Open-Source Leader -- 2026-03-31
The Missing Piece of Voxtral TTS: Enabling Voice Cloning -- 2026-03-31
[Editorial] NoFxAiOS/nofx -- 2026-03-31
We Rewrote JSONata with AI in a Day, Saved $500k/year -- 2026-03-31
ChatGPT won't let you type until Cloudflare reads your React state -- 2026-03-30
ClawShield: Security proxy for AI agents -- 2026-03-30
[Editorial] NanoClaw Milestones -- 2026-03-30
[Editorial] Chasing the Perfect Agent -- 2026-03-30
AI agent on a $7/month VPS with IRC as its transport layer -- 2026-03-30
[Editorial] Oh My Claude Code -- 2026-03-30
[Editorial] Eyes for Claude -- 2026-03-30
[Editorial] NotebookLM Python -- 2026-03-30
[Editorial] -- 2026-03-28
[Editorial] -- 2026-03-28
[Editorial] -- 2026-03-28
[Editorial] -- 2026-03-28
[Editorial] -- 2026-03-28
[Editorial] -- 2026-03-28
[Editorial] -- 2026-03-28
[Editorial] -- 2026-03-28
You can now enable Claude to use your computer to complete tasks ! -- 2026-03-27
Schedule tasks on the web -- 2026-03-27
Anthropic CEO predicts AI could handle end-to-end software development in 6–12 months -- 2026-03-27
[Editorial] Cron Jobs, Not Agents -- 2026-03-26
[Editorial] HuggingFace hf-mount -- 2026-03-26
[Editorial] YouTube Editorial Pick -- 2026-03-26
Superpowers for Open WebUI — brainstorm → spec → plan → execute workflow for local LLMs -- 2026-03-26
drivelineresearch/autoresearch-claude-code -- 2026-03-26
mgechev/skills-best-practices -- 2026-03-26
[Editorial] YouTube Editorial Pick -- 2026-03-26
[Editorial] YouTube Editorial Pick -- 2026-03-26
Netryx: Open-Source Street-Level Geolocation -- 2026-03-26
[Editorial] -- 2026-03-25
[Editorial] -- 2026-03-25
[Editorial] Second Brain with Pi -- 2026-03-25
[Editorial] ByteDance DeerFlow 2 Agent Runtime -- 2026-03-24
AgencyCLI: Lightweight CLI for Self-Managing AI Agent Teams -- 2026-03-24
SmarterRouter 2.2.1 — Self-Hosted AI Model Router (MoE Proxy) -- 2026-03-24
Anthropic Launches Claude Dispatch — Control Desktop AI Tasks from Your Phone -- 2026-03-24
You're Hardly Using What Claude Code Has to Offer (ColeMedin) -- 2026-03-24
[Editorial] Claude Code Deep Dive — ColeMedin -- 2026-03-24
[Editorial] Swictation v0.7.30 Release -- 2026-03-24
Walmart: ChatGPT Checkout Converted 3x Worse Than Website -- 2026-03-24
Agents of Chaos -- 2026-03-23
[Editorial] How Vulnerable Are AI Agents to Indirect Prompt Injection -- 2026-03-23
[Editorial] Autonomy Scales Exposure Before It Scales Value -- 2026-03-23
[Editorial] arxiv:2603.15371 -- 2026-03-23
leo-lilinxiao/codex-autoresearch -- 2026-03-23
[Editorial] The Book That Talked Back -- 2026-03-23
Show HN: Sub-millisecond VM sandboxes using CoW memory forking -- 2026-03-20
Mistral AI Releases Forge -- 2026-03-20
I built an open-source AI that lets you talk to your database — ask questions in plain English and get graphical insights instantly -- 2026-03-20
[Editorial] You Are Not Deploying Agents You... -- 2026-03-19
Build an Agent That Thinks Like a Data Scientist: How We Hit #1 on DABStep with Reusable Tool Generation -- 2026-03-19
HKUDS/CLI-Anything -- 2026-03-19
nextlevelbuilder/goclaw -- 2026-03-19
Create Browser Swarms with Claude Code + Playwright CLI -- 2026-03-19
[Editorial] Video Submission -- 2026-03-19
[Editorial] RTK AI Toolkit -- 2026-03-18
epiral/agent-clip -- 2026-03-18
[Editorial] Octobot -- 2026-03-18
Stripe's Coding Agents Ship 1,300 PRs EVERY Week - Here's How They Do It -- 2026-03-18
Leanstral: Open-source agent for trustworthy coding and formal proof engineering -- 2026-03-18
[Editorial] Hamilton Carter on AI Insights -- 2026-03-18
[Editorial] Pwning AWS AgentCore Code Interpreter -- 2026-03-18
[Editorial] xBow Raises $120M to Scale -- 2026-03-18
[Editorial] AI Cyber Magazine Winter 2026 -- 2026-03-18
I was backend lead at Manus. After building agents for 2 years, I stopped using function calling entirely. -- 2026-03-17
dennisonbertram/agentic-hosting -- 2026-03-17
[Editorial] Paperclip.ing -- 2026-03-17
Beyond Semantic Similarity: Introducing NVIDIA NeMo Retriever's Generalizable Agentic Retrieval Pipeline -- 2026-03-17
Holotron-12B - High Throughput Computer Use Agent -- 2026-03-17
LocoreMind/LocoOperator-4B -- 2026-03-17
[Editorial] MiroFish Demo -- 2026-03-17
[Editorial] MiroFish GitHub -- 2026-03-17
sstklen/trump-code -- 2026-03-17
[Editorial] Which Countries Use Claude AI the Most -- 2026-03-17
[Editorial] How My Agentic Workflow Actually Works (March 2026) -- 2026-03-16
Spine Swarm (YC S23) — AI Agents That Collaborate on a Visual Canvas -- 2026-03-16
[Editorial] The Developer Productivity Trap -- 2026-03-16
[Editorial] Clarity Was Always the Bottleneck -- 2026-03-16
[Editorial] The Hammer Problem -- 2026-03-16
Structured Distillation for Personalized Agent Memory: 11x Token Reduction -- 2026-03-16
Self-Flow by Black Forest Labs -- 2026-03-16
GATED_DELTA_NET for Vulkan Merged in llama.cpp -- 2026-03-16
[Editorial] AWS Security Agent -- 2026-03-16
[Editorial] Caido AI Hunting Platform -- 2026-03-16
ADPulse — Active Directory Security Pulse Tool -- 2026-03-16
[Editorial] Hackers Gonna Hack — Be Prepped -- 2026-03-16
1B Identity Records Exposed in ID Verification Data Leak -- 2026-03-16
goclaw: Self-hosted AI agent gateway written in Go -- 2026-03-14
axon: Graph-powered code intelligence engine for AI agents via MCP -- 2026-03-14
[Editorial] AAuth Full Demo — Authentication for Agentic Systems -- 2026-03-14
Trace your LLM API and MCP calls with zero code changes (eBPF, Linux) -- 2026-03-14
The Death of MCPs & The Rise of CLIs -- 2026-03-14
[Editorial] Decentralized Self-Improving AI System That Builds Itself -- 2026-03-14
[Editorial] How AI Agents Complete Two Months of Architecture Work in One Sprint -- 2026-03-14
[Editorial] Video Submission -- 2026-03-14
[Editorial] Everyone Reading This Works in a Profession That Didn't Exist in 1998 -- 2026-03-14
[Editorial] Video Submission -- 2026-03-14
[Editorial] Video Submission -- 2026-03-14
New Model: LeVo 2 (SongGeneration 2), an open-source music foundation model -- 2026-03-14
[Editorial] BinaryDefense NightBeacon -- 2026-03-13
[Editorial] Root Evidence -- 2026-03-13
[Editorial] tl;dr sec #319 -- 2026-03-13
[Editorial] NSA Ghidra 12.0.4 Release -- 2026-03-13
OmniCoder-9B: 9B coding agent fine-tuned on 425K agentic trajectories -- 2026-03-13
[Editorial] Context Maturity for AI Coding Teams -- 2026-03-13
Rudel: Analyzed 1,573 Claude Code Sessions to See How AI Agents Work -- 2026-03-13
[Editorial] OpenClaw -- 2026-03-13
Prompt-caching: Auto-Injects Anthropic Cache Breakpoints (90% Token Savings) -- 2026-03-13
[Editorial] Video Submission -- 2026-03-13
[Editorial] OpenAI: Designing Agents to Resist Prompt Injection -- 2026-03-13
[Editorial] Anthropic Research Paper -- 2026-03-13
[Editorial] Guardian: Mounting Concern Over Rogue AI Agents -- 2026-03-13
[Editorial] Security in the Age of Agents -- 2026-03-13
[Editorial] YousifAstar Post -- 2026-03-13
Sandboxing local agents: Zero-trust CrewAI running entirely on Local Qwen 2.5 7B via Ollama -- 2026-03-13
[Editorial] -- 2026-03-12
[Editorial] -- 2026-03-12
Whistleblower: DOGE member took Social Security data to new job -- 2026-03-12
[Editorial] McKinsey AI Chatbot Hacked -- 2026-03-11
AI Agent Hacks McKinsey -- 2026-03-11
[Editorial] Red Amon — Faster and Cheaper Recon -- 2026-03-11
[Editorial] The Agentic Coding Security Report -- 2026-03-11
[Editorial] Gas Town by Kilo -- 2026-03-11
[Editorial] Archive Feature -- 2026-03-11
[Editorial] Your AI Says Whatever You Want to Hear — Here's How to Measure It -- 2026-03-11
Agents That Run While I Sleep -- 2026-03-11
[Editorial] Claude Code Review Economics -- 2026-03-11
[Editorial] How AI Assistants are Moving the Security Goalposts -- 2026-03-10
[Editorial] Ai owasp -- 2026-03-10
[Editorial] SANS AI security -- 2026-03-10
89luca89/clampdown -- 2026-03-10
[Editorial] Sovereign Shield -- 2026-03-10
[Editorial] Openfang -- 2026-03-10
[Editorial] Claude Code Deep Dive: The SDK Strikes Back -- 2026-03-10
Upload files to PYODIDE code interpreter! MANY Open Terminal improvements AND MASSIVE PERFORMANCE GAINS - 0.8.9 is here! -- 2026-03-10
[Editorial] Turbo Flow -- 2026-03-10
[Editorial] Vibium browser automation -- 2026-03-10
[NEWS] White House Preparing Executive Order to Ban Anthropic AI From Federal Operations -- 2026-03-10
[Editorial] -- 2026-03-09
[Editorial] -- 2026-03-09
[Editorial] -- 2026-03-09
[Editorial] -- 2026-03-09
[Editorial] -- 2026-03-09
[Editorial] -- 2026-03-09
[Editorial] -- 2026-03-09
[Editorial] -- 2026-03-09
[Editorial] You Don't Give Agents Credentials, You Grant Them Power -- 2026-03-07
[Editorial] From Discovery to Drift: Securing -- 2026-03-07
[Editorial] Agents Change the Proof Standard -- 2026-03-07
[Editorial] Aegis -- 2026-03-07
AlexsJones/sympozium -- 2026-03-07
[Editorial] Open-Sourcing git-stint -- 2026-03-07
You can now train LLMs in VS Code for free via Google Colab & unsloth! -- 2026-03-07
[Editorial] Clinejection: When Your AI Tool Installs Another -- 2026-03-07
[Editorial] Memories Are All We Are: What the Road to AGI Is Missing -- 2026-03-06
[Editorial] Open Trajectory Gym for AI Agents -- 2026-03-06
[Editorial] Research Paper (arXiv 2603.03251) -- 2026-03-06
Wave-Field LLM: O(n log n) Language Model via Wave Equation Dynamics -- 2026-03-06
[Editorial] No-Cloud Tool-Calling Agents on Consumer Hardware (LFM2-24B-A2B) -- 2026-03-06
[Editorial] Claude Cowork: Collaborative AI Coding -- 2026-03-06
[Editorial] Google Workspace CLI -- 2026-03-06
[Editorial] Nitpicker: AI Code Review Tool -- 2026-03-06
[Editorial] Ramping Up on AI Development -- 2026-03-06
[Editorial] Video Pick -- 2026-03-05
[Editorial] SOFAI Workshop -- 2026-03-05
Show HN: Sub-500ms latency voice agent from scratch -- 2026-03-05
Agentic Engineering Patterns -- 2026-03-05
[Editorial] Claude Code and AI Developer Tools -- 2026-03-05
Open WebUI v0.8.6: Terminal integration, performance overhaul, security fixes -- 2026-03-05
[Editorial] World Intel MCP -- 2026-03-05
[Editorial] Public APIs Collection -- 2026-03-05
Reverse CAPTCHA: We tested whether invisible Unicode characters can hijack LLM agents: 8,308 outputs across 5 models -- 2026-03-04
[Editorial] Provos: Iron Curtain for AI Agents -- 2026-03-04
[Editorial] Niels Provos on InfoSec, AI Agents & LLM Security -- 2026-03-04
Catching an AI Red Teamer in the Wild: Using Reverse Prompt Injection as a Honeypot Detection Mechanism -- 2026-03-04
[Editorial] Steve Yegge: Welcome to the Wasteland — A Thousand Gas Towns -- 2026-03-04
[Editorial] manaflow-ai/cmux -- 2026-03-04
Claude Code with subagents inside subagents cooked for 3 days — Delivered 3D renderer that draws with terminal symbols -- 2026-03-04
[Editorial] obra/superpowers -- 2026-03-04
[Editorial] Daniel Miessler: Personal AI Infrastructure -- 2026-03-04
[Editorial] Ferricula -- 2026-03-04
bcurts/agentchattr -- 2026-03-03
[Editorial] ComposioHQ Agent Orchestrator -- 2026-03-03
Diffusing to Coordinate: Efficient Online Multi-Agent Diffusion Policies -- 2026-03-03
[Editorial] System Design Meets AI Reverse Engineering -- 2026-03-02
[Editorial] AI Agent Patterns & Implementation -- 2026-03-02
[Editorial] Advanced Agent Orchestration Techniques -- 2026-03-02
[Editorial] ArXiv Research — Novel AI Methods -- 2026-03-02
[Editorial] The AI Agent Security Gap Nobody Is Talking About -- 2026-03-02
[Editorial] Systematic Jailbreak Attack Surface Mapping -- 2026-03-02
[Editorial] Spec-Driven Development with Claude Code -- 2026-03-02
[Editorial] Claude Code on Your Phone -- 2026-03-02
If AI writes code, should the session be part of the commit? -- 2026-03-02
[Editorial] AI Development Deep Dive -- 2026-03-02
[Editorial] Portable Orchestra — AI Music Generation -- 2026-03-02
Pi – A minimal terminal coding harness -- 2026-02-28
[Editorial] -- 2026-02-28
[Editorial] -- 2026-02-28
[Editorial] -- 2026-02-28
[Editorial] Momentum building for ruvector, rvf, etc -- 2026-02-28
[Editorial] -- 2026-02-28
[Editorial] -- 2026-02-28
[Editorial] -- 2026-02-28
40,000+ AI Agents Exposed to the Internet with Full System Access -- 2026-02-28
[Editorial] -- 2026-02-28
We ran 56K multi-agent simulations - 1 misaligned agent collapses cooperation in a group of 5 -- 2026-02-28
[Editorial] -- 2026-02-28
[Editorial] -- 2026-02-28
[Editorial] Introducing Perplexity Computer -- 2026-02-27
[Editorial] Perplexity Computer Complete Guide -- 2026-02-27
[Editorial] Claude Cowork Might Be the Most Consequential -- 2026-02-27
[Editorial] Reid Hoffman: We're All Becoming Gamers -- 2026-02-27
What Claude Code chooses -- 2026-02-27
[Editorial] Video: AI Development Insights -- 2026-02-27
[Editorial] Claude Skills Collection -- 2026-02-27
[Editorial] AI Remediation Developers Actually Want to Use -- 2026-02-27
github.com -- 2026-02-27
[Editorial] AI Industry Commentary -- 2026-02-27
[Editorial] -- 2026-02-26
[Editorial] -- 2026-02-26
[Editorial] -- 2026-02-26
[Editorial] -- 2026-02-26
[Editorial] -- 2026-02-26
In the long run, everything will be local -- 2026-02-26
[Editorial] -- 2026-02-26
[Editorial] -- 2026-02-26
Void-Box: Capability-Bound Agent Runtime -- 2026-02-26
Show HN: enveil – hide your .env secrets from prAIng eyes -- 2026-02-26
[Editorial] -- 2026-02-26
[Editorial] -- 2026-02-26
[Editorial] -- 2026-02-26
The Age Verification Trap: Verifying age undermines everyone's data protection -- 2026-02-26
[Editorial] Clawker -- 2026-02-25
Built a honeypot token library for AI agents — detects prompt injection the moment it succeeds -- 2026-02-25
[Editorial] AppSec, CVE, and Open Source Security -- 2026-02-25
I Verified My LinkedIn Identity. Here's What I Handed Over -- 2026-02-25
[Editorial] How I Came to Understand the 100x Claim -- 2026-02-25
[Editorial] Claude Code for Live Structured Data -- 2026-02-25
[Release] LocalAgent v0.1.1: Local-first agent runtime (LM Studio / Ollama / llama.cpp + Playwright MCP + eval/replay) -- 2026-02-25
[Editorial] ContextGraph -- 2026-02-25
[Editorial] Starlog -- 2026-02-25
[Editorial] -- 2026-02-24
[Editorial] -- 2026-02-24
[Editorial] -- 2026-02-24
The Missing Semester of Your CS Education – Revised for 2026 -- 2026-02-24
[Editorial] -- 2026-02-24
[Editorial] -- 2026-02-24
BakeLens/crust -- 2026-02-24
hazcod/claudleak -- 2026-02-24
klawsh/klaw.sh -- 2026-02-24
[Editorial] -- 2026-02-24
[Editorial] -- 2026-02-24
[Editorial] Bugcrowd Guide to Prompt Injection -- 2026-02-23
[Editorial] arXiv Research -- 2026-02-23
[Editorial] Exploitation Validator -- 2026-02-23
What Breaks Embodied AI Security: LLM Vulnerabilities, CPS Flaws, or Something Else? -- 2026-02-23
[Editorial] The AI Automation Ceiling -- 2026-02-23
[Editorial] Faramesh — Research Paper -- 2026-02-23
[Editorial] Faramesh — Core Repository -- 2026-02-23
[Editorial] Faramesh — Video Introduction -- 2026-02-23
Charlotte: Open Source Browser MCP Server — 136x More Token-Efficient for Agents -- 2026-02-23
Kilntainers: Give Every Agent an Ephemeral Linux Sandbox via MCP [Open Source] -- 2026-02-23
[Editorial] Run-Agent -- 2026-02-23
[Editorial] Manifold -- 2026-02-23
[Editorial] arXiv Research -- 2026-02-23
[Editorial] Introducing AgentDB v3 -- 2026-02-23
[Editorial] Agentic Quality Engineering -- 2026-02-23
Zero-day CSS: CVE-2026-2441 exists in the wild -- 2026-02-21
Microsoft says bug causes Copilot to summarize confidential emails -- 2026-02-21
[Editorial] WebMCP — MCP for the Web -- 2026-02-21
[Editorial] Video: AI & Security Perspectives -- 2026-02-21
[Editorial] Why Probabilistic Engineering Breaks Deterministic Systems -- 2026-02-21
IBM and UC Berkeley Diagnose Why Enterprise Agents Fail Using IT-Bench and MAST -- 2026-02-20
Forensic audit on local AI assistant: 40.8% of tasks were fabricated -- 2026-02-20
[Editorial] OpenAI Practical Guide to Building Agents -- 2026-02-20
[Editorial] Generalized Hill Climbing Runtime -- 2026-02-20
[Editorial] Build Quality Skill: How I Ship Software 10x Faster -- 2026-02-20
AI45Lab/TrinityGuard: A Unified Framework for Safeguarding Multi-Agent System Safety -- 2026-02-20
HackMyClaw — Adversarial Security Challenge for AI Agents -- 2026-02-20
[Editorial] Video Feature -- 2026-02-20
Study: Self-generated Agent Skills are useless -- 2026-02-19
[Editorial] Claude Code RAG with Local Vector Database -- 2026-02-19
Ibrahim-3d/conductor-orchestrator-superpowers -- 2026-02-19
agno-agi/dash -- 2026-02-19
ST-EVO: Towards Generative Spatio-Temporal Evolution of Multi-Agent Communication Topologies -- 2026-02-18
Google Deepmind has released their take on multi-agent orchestration they're calling Intelligent AI Delegation -- 2026-02-18
[Editorial] BeadHub — AI Creative Tool -- 2026-02-18
I built a local AI coding agent with an 8-layer security sandbox — then had ChatGPT try to break it for 240+ rounds -- 2026-02-18
[Editorial] How to Sandbox Claude Code with Nono -- 2026-02-18
tomascupr/sandstorm — One API call. Full Claude agent. Completely sandboxed. -- 2026-02-18
[Editorial] AI Agent Security Strategy -- 2026-02-18
[Editorial] WebMCP and Enhanced Page Protocol -- 2026-02-17
WebMPC, has anyone used it? -- 2026-02-17
[Editorial] Voice-Controlled UI Agent Design -- 2026-02-17
[Editorial] The Agentic Operating System -- 2026-02-17
Forked OpenClaw to run fully air-gapped (no cloud deps) -- 2026-02-17
Anthropic still won't give the Pentagon unrestricted access to its AI models -- 2026-02-17
OpenAI uses internal version of ChatGPT to identify staffers who leak information: report -- 2026-02-17
FormalTask: Open-source declarative orchestration for Claude Code agents -- 2026-02-17
[Editorial] Get Shit Done -- 2026-02-17
[Editorial] Karpathy Gist -- 2026-02-17
[Editorial] AI DevOps and Developer Productivity -- 2026-02-16
OpenClaw Skill for Cost-Optimized Model Routing Based on Task Complexity -- 2026-02-16
[Editorial] O16G Platform -- 2026-02-16
[Editorial] GrubCrawler — Web Crawling Tool -- 2026-02-16
[Editorial] Storybook — UI Component Development -- 2026-02-16
[Editorial] Video Content -- 2026-02-16
[Editorial] ACM Research Paper -- 2026-02-16
[Editorial] https://mrinal.com/articles/agent-identities -- 2026-02-13
[Editorial] https://labs.zenity.io/p/perplexity-comet-a-reversing-story -- 2026-02-13
Jasonzzt/ComfyUI-CacheDiT -- 2026-02-12
ysharma3501/LuxTTS -- 2026-02-12
[Editorial] https://www.linkedin.com/pulse/when-brain-os-meets-real-operating-systems-rafael-knuth-4hcsf -- 2026-02-11
[Editorial] https://docs.entire.io/core-concepts -- 2026-02-11
Why System Prompts are failing your local agent builds (and why you need a Logic Floor) -- 2026-02-11
I built an MCP server that syncs Cursor, Claude Desktop, and Windsurf with one brain [Open Source] -- 2026-02-11
[Editorial] https://forge-quality.dev/articles/orchestra-learns-to-tune-itself -- 2026-02-10
I built an embodied agent in Minetest using Llama 3.2 + Vector Memory. Tonight, she passed the "Turing Test" by refusing to work because she was "tired. -- 2026-02-10
PlanDrop - Chrome extension to drop prompts from browser to AI coding agents on remote servers -- 2026-02-10
[Editorial] https://github.com/ikennaokpala/forge -- 2026-02-09
[Editorial] https://github.com/ruvnet/claude-flow/issues/1098 -- 2026-02-09
[Editorial] https://factory.strongdm.ai/ -- 2026-02-09
[Editorial] https://www.linkedin.com/posts/reuvencohen_both-the-new-codex-parallel-agents-and-the-activity-7425697703445196800-xCjI -- 2026-02-09
[Editorial] https://www.linkedin.com/posts/reuvencohen_most-intelligent-systems-fail-because-they-activity-7425306022862344192-TPtE -- 2026-02-06
[Editorial] https://www.linkedin.com/pulse/continuous-behavioral-verification-ongoing-path-done-ikenna-okpala-k9kme -- 2026-02-06
Need Help: AI Model for Local PDF & Image Extraction on Win11 (32GB RAM + RTX 2090) -- 2026-02-06
kmizu/embodied-claude -- 2026-02-06
benjiyaya/HeartMuLa_ComfyUI -- 2026-02-06
adithya-s-k/manim_skill -- 2026-02-04
Running DOOM and Super Mario 64 Inside a PDF File -- 2026-02-04
Nemotron-Personas-Brazil: Co-Designed Data for Sovereign AI -- 2026-02-04
[Editorial] https://www.linkedin.com/posts/patrickdebois_github-jedi4everaddt-run-ai-coding-agents-activity-7424653736788099072-7Aov -- 2026-02-04
MCP + Ghidra for AI-powered binary analysis — 110 tools, cross-version function matching via normalized hashing -- 2026-02-04
Arguably, the best AI code review MCP server (with Serena integration) -- 2026-02-04
EPYC 8124P (Siena) Build for Agentic Coding -- 2026-02-04
The 80% Problem in Agentic Coding – Addy Osmani -- 2026-02-04
[Editorial] https://www.linkedin.com/posts/dragan-spiridonov_agenticqe-agenticsfoundation-qualityengineering-ugcPost-7424143676773277696-EikW -- 2026-02-03
rodydavis/agent-skills-generator -- 2026-02-03
An Event Badge Re-Imagined As A Cyberdeck -- 2026-02-03
Open-Vocabulary Functional 3D Human-Scene Interaction Generation -- 2026-02-03
[Editorial] https://unhypedai.substack.com/p/the-knowledge-we-never-had-to-explain -- 2026-02-02
[Editorial] https://www.linkedin.com/posts/reuvencohen_i-keep-coming-back-to-this-realization-and-activity-7415150024868892672-E4rE -- 2026-02-02
[Editorial] https://humanemulator.co/ -- 2026-01-30
Generating skills for api+local CUAs via noVNC demonstration recording MCP -- 2026-01-30
Our Agent Rebuilt Itself in 26 Hours. AMA👀 -- 2026-01-30
I built a multi-agent orchestration layer for Claude Code - sharing in case it's useful to anyone -- 2026-01-30
I got tired of my AI agents overwriting each other's code, so I built a conflict manager for them -- 2026-01-27
Skill.md: An open standard for agent skills -- 2026-01-27
Unlocking Agentic RL Training for GPT-OSS: A Practical Retrospective -- 2026-01-27
[Editorial] https://github.com/ruvnet/ruvector/blob/claude/clawdbot-ruvector-setup-RHW3a/npm/packages/ruvbot/docs/FEATURE_COMPARISON.md -- 2026-01-27
Can companies "hack" ChatGPT to promote them? -- 2026-01-27
[Editorial] Vercel Labs' agent-browser + claude flow -- 2026-01-26
I wrote a URI scheme for agent identity that doesn't break when you move things -- 2026-01-26
[Open Sourse] I built a tool that forces 5 AIs to debate and cross-check facts before answering you -- 2026-01-26
An underrated way to turn AI code into real AI agents -- 2026-01-26
[Editorial] https://github.com/Combat-Drones-Detection-AI/Icarus -- 2026-01-26
[Editorial] https://unhypedai.substack.com/p/the-ai-operating-model-moment -- 2026-01-26
devstral small 2 vs glm 4.7 flash for agentic coding -- 2026-01-23
HeartMuLa/HeartMuLa-oss-3B -- 2026-01-23
[Editorial] agentic qe -- 2026-01-22
Demo: On-device browser agent (Qwen) running locally in Chrome -- 2026-01-22
Am I the only one in to enjoy the latest remote code sessions on Claude.ai with my full agentic config? Anyone else had some breakthrough with it? -- 2026-01-22
[Editorial] https://www.linkedin.com/posts/reuvencohen_llms-are-a-dead-end-not-because-they-are-activity-7419916372274470912-_5Lc -- 2026-01-22
All major AI stupid again, alternatives? -- 2026-01-22
[Resource] AI Guardrails: Open-source middleware to add PII Redaction & Injection Defense to local LLMs -- 2026-01-21
Jailbreak Challenge: Can You Break My Agent??? -- 2026-01-21
Do AI agents need TLS-style identities and ‘certificates’? -- 2026-01-21
Demo: On-device browser agent (Qwen) running locally in Chrome -- 2026-01-20
Agent observability is way different from regular app monitoring - maintainer's pov -- 2026-01-20
charIesding/agent-dashboard -- 2026-01-20
Learning Latency-Aware Orchestration for Parallel Multi-Agent Systems -- 2026-01-20
[Editorial] https://www.linkedin.com/posts/cole-medin-727752184_ive-been-testing-vercels-agent-browser-activity-7418832504754872320-PCA0 -- 2026-01-19
[Editorial] https://addyosmani.com/blog/good-spec -- 2026-01-19
AgentStudio: A VLA-based Kiosk Automation Agent using Gemini 3 and LangGraph -- 2026-01-19
Claude Skills Magic -- 2026-01-19
7x Longer Context Reinforcement Learning in Unsloth -- 2026-01-19
openbmb/AgentCPM-Explore -- 2026-01-19
black-forest-labs/FLUX.2-klein-4B -- 2026-01-19
[Editorial] https://www.linkedin.com/posts/reuvencohen_announcing-claude-flow-v3-a-full-rebuild-activity-7417928335160262656-NYqJ -- 2026-01-16
[Editorial] https://www.linkedin.com/posts/sandstream_i-just-shipped-ralph-inferno-10-to-npm-activity-7417606358654406657-zBPY -- 2026-01-16
[Editorial] https://www.linkedin.com/posts/rasmuswiding_parallel-ai-agents-the-complete-infrastructure-activity-7417646422436777984-D1Zw -- 2026-01-16
Ralph Loop inspired me to build this - AI decides what Claude Code does next orchestrating claude code until task is done -- 2026-01-16
[Editorial] https://www.linkedin.com/posts/calebsima_due-to-popular-demand-here-is-my-%F0%9D%97%96%F0%9D%97%BC%F0%9D%97%B1%F0%9D%97%B6-activity-7417371887598514176-J6eg -- 2026-01-15
[Editorial] https://www.linkedin.com/posts/cole-medin-727752184_ralph-wiggum-is-everywhere-in-ai-right-now-activity-7417369954963910656-PQ3c -- 2026-01-15
[Editorial] https://www.linkedin.com/posts/craigmcluckie_coding-agents-are-crippling-oss-communities-activity-7417250625391915009-pcbA -- 2026-01-15
Agent reliability testing is harder than we thought it would be -- 2026-01-15
The Ralph Loop Made Easy -- 2026-01-15
[Editorial] https://github.com/pnocera/skilld -- 2026-01-15
[Editorial] https://www.linkedin.com/posts/hiltch_today-we-are-launching-openwork-an-open-source-ugcPost-7417259004294488064-KvyW -- 2026-01-15
[Editorial] https://www.linkedin.com/posts/claudio-stamile_if-youre-building-agents-youve-probably-activity-7416401402438205440-t9V_ -- 2026-01-13
[Editorial] https://www.linkedin.com/posts/matthewrwadams_threatmodeling-agenticai-aiagents-ugcPost-7416389760795176960-Ytut -- 2026-01-13
The hidden memory problem in coding agents -- 2026-01-13
I gave Claude Code a single instruction file and let it autonomously solve Advent of Code 2025. It succeeded on 20/22 challenges without me writing a single line of code. -- 2026-01-13
CloudAI-X/claude-workflow -- 2026-01-13
[Editorial] https://www.linkedin.com/posts/reuvencohen_a-year-ago-deepseek-landed-and-everyone-argued-activity-7416833905653329921-Xt9R -- 2026-01-13
Qwen3 235 VL hallucinates Tool calls -- 2026-01-13
[Editorial] https://www.sciencedirect.com/science/article/abs/pii/S1084804511000774 -- 2026-01-13
AgentSense: LLMs Empower Generalizable and Explainable Web-Based Participatory Urban Sensing -- 2026-01-13
[Editorial] https://github.com/leochlon/pythea/tree/main/strawberry -- 2026-01-12
[Editorial] https://arxiv.org/abs/2509.11208 -- 2026-01-12
One cargo install gives your AI 142 tools to perceive and control your machine - rmcp-presence -- 2026-01-09
AI agents for searching and reasoning over internal documents -- 2026-01-09
I built Plano - a framework-friendly data plane with orchestration for agents -- 2026-01-09
I built a TUI to manage multiple Claude Code agents in devcontainers (works great on mobile too) -- 2026-01-09
System: Control your Mac from anywhere using natural language -- 2026-01-09
Connect any LLM to all your knowledge sources and chat with it -- 2026-01-08
Have claude code interact with another claude code session interactively to test a plugin im building -- 2026-01-08
Semantic geometry for visual grounding -- 2026-01-08
zai-org/AutoGLM-Phone-9B -- 2026-01-08
facebook/sam-audio-large -- 2026-01-08
[Editorial] https://www.linkedin.com/posts/andriyburkov_a-major-breakthrough-in-reinforcement-learning-activity-7414543177648472064-_omq -- 2026-01-08
AskUserQuestionTool: if I have another kid, I know what I am going to name them. -- 2026-01-07
[Editorial] https://www.linkedin.com/posts/reuvencohen_ralph-wiggum-as-people-are-talking-about-activity-7414663704081981440-54bK -- 2026-01-07
[Editorial] https://joshclemm.com/writing/ralph-wiggum-future-of-coding -- 2026-01-07
[Editorial] https://ghuntley.com/ralph -- 2026-01-07
[Editorial] https://github.com/coleam00/Linear-Coding-Agent-Harness -- 2026-01-05
MCP Chat Studio v2: Workspace mode, workflows, contracts, mocks, and more -- 2026-01-05
Way to build powerful agents using natural language and code -- 2026-01-05
GLM-4.7 running full agentic workflows in Claude Code for 15 min straight - no failures -- 2026-01-05
I (almost) built an open-source, self-hosted runtime for AI agents in TypeScript... -- 2026-01-02
How to get started with automated workflows? -- 2026-01-02
Safe, Untrusted, "Proof-Carrying" AI Agents: toward the agentic lakehouse -- 2026-01-02
I built HMLR, an open source (full MIT) memory layer for your agent -- 2025-12-31
I built a "Recursive Swarm" engine inside a VS Code fork. It forces the LLM to explore 10,000 logic branches (System 2) before committing to code—trading 20 minutes of compute for accuracy. -- 2025-12-31
BOAD: Discovering Hierarchical Software Engineering Agents via Bandit Optimization -- 2025-12-31
Built an MCP Server for Andrej Karpathy's LLM Council -- 2025-12-31
Bounded autonomy: how the "is it an agent?" question changed my QA bot design -- 2025-12-31
eliasjudin/oai-skills -- 2025-12-31
zai-org/GLM-ASR -- 2025-12-31
AI Video Generation Made Easier with Wan 2.6 -- 2025-12-31
HKUDS/MCPNext -- 2025-12-29
virtual pet / life simulation using Ollama and Unity 6 -- 2025-12-23
YatharthS/MiraTTS -- 2025-12-23
stepfun-ai/Step-Audio-R1 -- 2025-12-23
[Editorial] https://x.ai/news/grok-voice-agent-api -- 2025-12-19
[Editorial] https://www.linkedin.com/posts/reuvencohen_sitting-on-a-beach-in-playa-del-carmen-activity-7407460969188163584-HQup -- 2025-12-19
[Editorial] https://www.linkedin.com/posts/yotam-perkal_comparing-ai-agents-to-cybersecurity-professionals-activity-7407076565357887488-KI5M -- 2025-12-18
Building an event-driven alternative to LangGraph because single-threaded loops are killing me. Roast my architecture. -- 2025-12-18
Intent vectors for AI search + knowledge graphs for AI analytics -- 2025-12-17
Cracking a 25-Year-Old Password with Claude Code -- 2025-12-17
Weird Email Appliance Becomes AI Terminal -- 2025-12-17
Zero-Shot Vehicle Model Recognition via Text-Based Retrieval-Augmented Generation -- 2025-12-17
stepfun-ai/Step-Audio-EditX -- 2025-12-17
AIDC-AI/Ovis-Image-7B -- 2025-12-17
[Editorial] https://www.linkedin.com/posts/resilientcyber_levels-of-autonomy-for-ai-agents-activity-7406679623167803392-OFJK -- 2025-12-16
ManiAgent: An Agentic Framework for General Robotic Manipulation -- 2025-12-16
CUGA on Hugging Face: Democratizing Configurable AI Agents -- 2025-12-16
AI Agent from scratch: Django + Ollama + Pydantic AI - A Step-by-Step Guide -- 2025-12-12
[Editorial] https://github.com/humanlayer/humanlayer -- 2025-12-12
Large update: 12 new frontier models added to the Step Game social reasoning benchmark. -- 2025-12-11
DeepMath: A lightweight math reasoning Agent with SmolAgents -- 2025-12-11
Nanbeige4-3B: Lightweight with strong reasoning capabilities -- 2025-12-10
mistralai/Devstral-2-123B-Instruct-2512 -- 2025-12-10
Can codex create multiple outputs, I check which is best? -- 2025-12-10
stepfun-ai/GELab-Zero-4B-preview -- 2025-12-10
Need opinion/help on my Memory System for LLM -- 2025-12-09
FlowCoder: Visual agentic workflow customization for Claude Code and Codex -- 2025-12-09
I built a CLI tool to manage AI configs across repos (aipaca) 🦙 -- 2025-12-09
Counterfactual-based Agent Influence Ranker for Agentic AI Workflows -- 2025-12-08
Run Any Model Provider on OpenWebUI immediately by discovering AI services on your LAN -- 2025-12-08
We gave 5 LLMs $100K to trade stocks for 8 months -- 2025-12-08
DevCrew agent swarm for accelerating your software development -- 2025-12-08
Connect and use Nova 2 Lite with Claude Code -- 2025-12-08
The security risks of "Emoji Smuggling" and Hidden Prompts for Local Agents -- 2025-12-08
We were tired of guessing which local model to use for which query. built a speculative execution lib that figures it out (github) -- 2025-12-05
Claude vs Codex: Claude won again 🏅 -- 2025-12-04
NornicDB - API compatible with neo4j - MIT - GPU accelerated vector embeddings -- 2025-12-04
gregorydickson/memory-graph -- 2025-12-04
Building Deep Research: How we Achieved State of the Art -- 2025-12-03
Claude launched 3 'explore agents' by itself -- 2025-12-02
OpenAI realtime API opensource alternative -- 2025-12-02
Built a Modular Agentic RAG System – Zero Boilerplate, Full Customization -- 2025-12-02
[Editorial] https://www.linkedin.com/posts/ownyourai_i-just-finished-testing-the-new-metas-omnilingual-activity-7400801588635836416-gpo- -- 2025-12-01
Xthebuilder/JRVS -- 2025-12-01
tigillo/githubmodels-go -- 2025-12-01
A Bird Watching Assistant -- 2025-12-01
InteractComp: Evaluating Search Agents With Ambiguous Queries -- 2025-11-28
Agent framework chaos? > Better Agents CLI -- 2025-11-28
Communication and Verification in LLM Agents towards Collaboration under Information Asymmetry -- 2025-11-26
Sibyl: an open source orchestration layer for LLM workflows -- 2025-11-25
Looking for 10 early testers building with agents, need brutally honest feedback👋 -- 2025-11-25
Claud Agent Dashboard -- 2025-11-25
[Editorial] https://github.com/punkpeye/awesome-mcp-servers -- 2025-11-24
Cornserve: Microservices Architecture for Serving Any-to-Any Models like Qwen Omni! -- 2025-11-24
How I’m Building Declarative, Shareable AI Agents With Docker cagent -- 2025-11-24
An open-source "Slack" for AI Agents to orchestrate n8n, Flowise, and OpenAI agents in one place -- 2025-11-24
modelscope/AgentEvolver -- 2025-11-24
[Editorial] https://www.linkedin.com/posts/stuart-winter-tear_ibm-the-2025-chief-data-officer-study-activity-7397614050433462272-0GmF -- 2025-11-21
Do you sandbox MCPs / Claude Code / Opencode on Linux? How ? -- 2025-11-21
Verifying hardware quality of rented gpus -- 2025-11-21
Ollama signin docker compose -- 2025-11-21
What's your Claude Code workflow setup? -- 2025-11-21
Measuring political bias in Claude -- 2025-11-21
Looking for feedback - I built Socratic, an open source knowledge base builder where YOU stay in control -- 2025-11-21
[Editorial] https://www.linkedin.com/posts/quanta-magazine_the-awful-consequence-of-an-observer-free-activity-7396969815078236160-Vo5G?u -- 2025-11-20
[Editorial] https://www.linkedin.com/posts/stuart-winter-tear_harmful-traits-of-ai-companions-activity-7397309575928131584-8H4J -- 2025-11-20
[Editorial] https://www.linkedin.com/posts/stuart-winter-tear_realist-and-pluralist-conceptions-of-intelligence-activity-7397231918871703554-FmSP?utm_source=social_share_send&utm_medium=member_desktop_web&rcm=ACoAAAAEV6YBBmyIQkYRxMIFJ7EWVq99NXg4qV4 -- 2025-11-20
How are you all orchestrating multi-agent workflows (beyond one-shot prompt chaining)? -- 2025-11-20
deliveryhero/asya -- 2025-11-20
[Editorial] https://www.linkedin.com/posts/stuart-winter-tear_aws-a-more-realistic-evaluation-activity-7396951453182967808-_H_c -- 2025-11-19
Should Spec-Driven-Development have a procedural orchestrator, or an LLM? -- 2025-11-19
Where are the gaps in Claude's "reasoning" capabilities? -- 2025-11-19
Smart Bandage Leverages AI Model For Healing Purposes -- 2025-11-19
miromind-ai/MiroThinker-v1.0-72B -- 2025-11-19
GPT-5-pro is likely a universal agentic gateway / Large Agentic Model -- 2025-11-19
BSD MAC LLM UI: Minimal, Auditable LLM Front End for Secure Environments -- 2025-11-18
easy-oidc/easy-oidc -- 2025-11-18
Disrupting the first reported AI-orchestrated cyber espionage campaign -- 2025-11-18
The Challenge of Large File Checksums -- 2025-11-18
Building A Smart Speaker Outside The Corporate Cloud -- 2025-11-18
[Editorial] https://www.linkedin.com/posts/stuart-winter-tear_i-saved-forty-ai-research-papers-recently-activity-7395547917983580160-BuOX -- 2025-11-17
[Editorial] https://www.linkedin.com/posts/reuvencohen_i-just-finished-rebuilding-dspyts-on-top-activity-7395872853092495360-OFb8 -- 2025-11-17
[Editorial] https://www.marktechpost.com/2025/11/08/how-to-build-an-agentic-voice-ai-assistant-that-understands-reasons-plans-and-responds-through-autonomous-multi-step-intelligence/ -- 2025-11-17
Local-First LLM That Safely Runs Real System Tasks — Looking for Engineering Feedback -- 2025-11-17
[MCP] Open-sourced a CSV-to-PostgreSQL loader server (vibe-coded with Claude) -- 2025-11-17
MCP Server for Industrial IoT - Built for PolyMCP Agent Orchestration -- 2025-11-17
Mimir - Parallel Agent task orchestration - Drag and drop UI (preview) -- 2025-11-17
Claude helped me make a multi agent ecosystem where models interact with each other autonomously -- 2025-11-17
AnythingLLM MCP Bridge & Prompt Injector -- 2025-11-14
Katakate/k7 -- 2025-11-14
Dicklesworthstone/mcp_agent_mail -- 2025-11-14
[Editorial] https://www.linkedin.com/posts/ivandj_as-ai-agents-multiply-across-tools-and-protocols-activity-7394057385872556032-SlAQ -- 2025-11-13
[Editorial] https://www.linkedin.com/posts/henrikgothberg_anthropic-building-effective-ai-agents-ugcPost-7394348623796350977-tcq1 -- 2025-11-13
[Editorial] https://www.linkedin.com/posts/reuvencohen_the-latest-mcp-spec-feels-like-the-moment-activity-7394373616471072768-okAg -- 2025-11-13
How to link an AI to a code execution environment? -- 2025-11-13
[Editorial] https://www.linkedin.com/posts/emollick_we-need-more-papers-like-this-one-which-examines-ugcPost-7392918095805222912-YjvU?utm_source=social_share_send&utm_medium=member_desktop_web&rcm=ACoAAAAEV6YBBmyIQkYRxMIFJ7EWVq99NXg4qV4 -- 2025-11-12
Agent failures in production pushed me to simulation-based testing -- 2025-11-12
Building agents that work like a band, not a factory line - anyone experimenting with emergent multi-agent coordination? -- 2025-11-12
Qwen3-VL works really good with Zoom-in Tool -- 2025-11-12
[Update] mlx-knife 2.0 stable — MLX model manager for Apple Silicon -- 2025-11-12
Vascura BAT - configuration Tool for Llama.Cpp Server via simple BAT files. -- 2025-11-12
Beelzebub MCP: Securing AI Agents with Honeypot Functions, Prompt Injection Detection -- 2025-11-11
Problem Uploading PDFs in Self hosted AI -- 2025-11-11
openai/gpt-oss-safeguard-20b -- 2025-11-11
Dexmal/dexbotic -- 2025-11-11
Blender 5.1 -- 2025-11-11
Qwen/Qwen3-VL-2B-Instruct -- 2025-11-11
[Editorial] https://www.linkedin.com/posts/stuart-winter-tear_my-company-is-forcing-me-to-become-ai-agent-activity-7393927479004135424-hI8p -- 2025-11-11
Hephaestus: AI workflows that discover and create their own tasks as they work -- 2025-11-11
Built my own IDE -- 2025-11-11
Roo Code 3.30.3 Release Updates | kimi‑k2‑thinking support | UI improvements | Bug fixes -- 2025-11-11
Claude-Bumper-Lanes - Vibe Code with Review Discipline -- 2025-11-11
We just released a multi-agent framework. Please break it. -- 2025-11-10
⚡️ I scaled Coding-Agent RL to 32x H100s. Achieving 160% improvement on Stanford's TerminalBench. All open source! -- 2025-11-10
Agent Learning via Early Experience -- 2025-11-10
[Editorial] https://www.linkedin.com/posts/reuvencohen_claude-code-web-is-amazing-its-my-primary-activity-7393649498251644928-rAc8 -- 2025-11-10
CodeWiki: Research-Grade Repository Documentation at Scale [Open Source] -- 2025-11-10
Website builder powered by Claude AI - generating full websites in minutes -- 2025-11-10
“AI, Make Me A Degree Certificate” -- 2025-11-10
Self-hosted platform for running third-party AI agents with Ollama support (Apache-2.0) -- 2025-11-07
Open Source Alternative to NotebookLM/Perplexity -- 2025-11-07
Decade-qiu/Multi-Source-Media-MCP-Server -- 2025-11-07
v0.2.0 - GenFilesMCP -- 2025-11-07
⚡️ Scaling Coding-Agent RL to 32x H100s. Achieving 160% improvement on Stanford's TerminalBench -- 2025-11-06
Bifrost: A High-Performance Gateway for LLM-Powered AI Agents (50x Faster than LiteLLM) -- 2025-11-06
Stop fighting with AI to build your project -- 2025-11-06
OpenSkills - a open sourced and completely private Claude Skills -- 2025-11-05
I used Llama + Droidrun to create a self-running Twitter bot -- 2025-11-05
Thread vs. Session based short-term memory -- 2025-11-05
kayba-ai/agentic-context-engine -- 2025-11-05
[Editorial] Collaboration gap -- 2025-11-05
Looking for advanced workflow tips: How are power-users integrating Claude (and other LLMs) into high-volume legal practice? -- 2025-11-05
Lessons from interviews on deploying AI Agents in production -- 2025-11-05
[Open Source] We deployed numerous agents in production and ended up building our own GenAI framework -- 2025-11-04
First LangFlow Flow Official Release - Elephant v1.0 -- 2025-11-04
zeusftk/FTK_CANVAS_AGENT_for_Comfyui -- 2025-11-04
Qwen3-VL-32B Q8 speeds in llama.cpp vs vLLM FP8 on a RTX PRO 6000 -- 2025-11-03
thu-coai/Glyph -- 2025-11-03
AndroidControl-Curated: Revealing the True Potential of GUI Agents through Benchmark Purification -- 2025-11-03
[Editorial] Agentic Flow -- 2025-11-03
I'm making an AI similar to a vtuber using ollama, here's what I have so far! (looking for advice on anything, really) -- 2025-11-03
Remember that simple online PDF bank converter tool making $40k/month? I did the exact same workflow with my general AI agent (only 1 prompt needed!) -- 2025-11-03
[Editorial] Agent limits -- 2025-11-02
[Editorial] Agent Identity -- 2025-11-02
[Editorial] AI Defense -- 2025-11-02
I built a privacy focused AI assistant for WearOS that supports locally hosted LLMs -- 2025-11-02
VellumForge2 - A high performance, very configurable and really easy to use DPO dataset generation tool, create high quality datasets for completely free -- 2025-11-01
PokeeAI/pokee_research_7b -- 2025-11-01
[Editorial] https://itrevolution.com/articles/from-line-cookto-head-chef-orchestrating-ai-teams/ -- 2025-11-01
[Editorial] Cursor 2.0 -- 2025-11-01
Open Source Lovable with Custom Agents, Full Stack Support, and Local Models -- 2025-11-01
A highly adaptable toolkit to build APIs and agents, with friendly interfaces for streaming and multimodality -- 2025-11-01
Is it possible to enable mcp server on for specific sub agent? -- 2025-11-01
[Editorial] AGI Defined. -- 2025-11-01
[Editorial] Know Your Agent (KYA) -- 2025-10-30
Spent the last few weeks falling down the Claude Agent SDK rabbit hole... built AgCluster.dev (open source) -- 2025-10-30
Found a faster way to build Claude Skills -- 2025-10-30
Agentic AI for Financial Crime Compliance -- 2025-10-30
GraphScout: Intelligent Routing for Local LLM Agent Workflows -- 2025-10-30
Show HN: Butter – A Behavior Cache for LLMs -- 2025-10-30
katanemo/Arch-Router-1.5B -- 2025-10-30
QAgent: A modular Search Agent with Interactive Query Understanding -- 2025-10-30
[Open Source] We deployed numerous agents in production and ended up building our own GenAI framework -- 2025-10-29
Claude Skills but running locally in Apple container -- 2025-10-29
OpenSkills CLI - Use Claude Code Skills with ANY coding agent -- 2025-10-29
Prompts avoiding Yes Men moments? -- 2025-10-29
severity1/claude-code-prompt-improver -- 2025-10-29
Built Coyote — An AI Agent That Feels Like Texting a Friend and released first model supporting native Async Tools -- 2025-10-29
Distil NPC: Family of SLMs responsing as NPCs -- 2025-10-29
nvidia/audio-flamingo-3-hf -- 2025-10-29
microsoft/UserLM-8b -- 2025-10-29
Test-Time Scaling Strategies for Generative Retrieval in Multimodal Conversational Recommendations -- 2025-10-29
[Editorial] New calculus of coding -- 2025-10-28
we had 2 weeks to build 5 microservices with 3 devs, tried running multiple AI agents in parallel -- 2025-10-28
Claude Code 2.0.27 -- 2025-10-28
steveyegge/vc -- 2025-10-28
[Editorial] MCP Scanner, security -- 2025-10-28
[Editorial] Data provenance -- 2025-10-28
Who is Introducing the Failure? Automatically Attributing Failures of Multi-Agent Systems via Spectrum Analysis -- 2025-10-28
[Editorial] Virtual false positive, physical problems -- 2025-10-28
Show HN: A fast, privacy-first image converter that runs in browser -- 2025-10-28
Microsoft Releases AI Call Center Stack with Voice, SMS, and Memory -- 2025-10-28
Robot Phone Home…Or Else -- 2025-10-28
vngrs-ai/Kumru-2B -- 2025-10-27
Training Gemma 3n for Transcription and Translation -- 2025-10-27
Agentic Exploration of Physics Models -- 2025-10-27
[Editorial] For the vibes -- 2025-10-27
Best way to implement a detailed plan in an MD file? -- 2025-10-27
sci-m-wang/ACE-open -- 2025-10-27
StepWiser: Stepwise Generative Judges for Wiser Reasoning -- 2025-10-27
DeepAnalyze: Agentic Large Language Models for Autonomous Data Science -- 2025-10-26
Claude for Computer Use using Sonnet 4.5 -- 2025-10-26
Any way to have sub-agent's keep context between invocations? -- 2025-10-26
Learning to Steer: Input-dependent Steering for Multimodal LLMs -- 2025-10-26
[Editorial] Promethean Fire -- 2025-10-26
Google AI falsely named an innocent journalist as a notorious child murderer -- 2025-10-26
Built my own MCP server for my app and was pleasantly shocked by how good it is -- 2025-10-25
facebook/cwm -- 2025-10-25
When Judgment Becomes Noise: How Design Failures in LLM Judge Benchmarks Silently Undermine Validity -- 2025-10-25
Building the Open Agent Ecosystem Together: Introducing OpenEnv -- 2025-10-25
[Editorial] Leading AI Agent Swarms: The Agentic QE 1.2.0 Journey -- 2025-10-24
lupantech/AgentFlow -- 2025-10-24
jaguarliuu/xunlong -- 2025-10-24
[Editorial] Browsers you can socially engineer -- 2025-10-24
[Editorial] share terminal sessions using Claude Code for web -- 2025-10-24
[Project] VT Code — Rust coding agent now with Ollama (gpt-oss) support for local + cloud models -- 2025-10-24
How path-based pattern matching helps AI code follow your team's coding best practice -- 2025-10-24
Show HN: FlowLens – MCP server for debugging with Claude Code -- 2025-10-24
We built ContextAgent — a context-centric take on multi-agent systems (rethinking what an “agent” is) -- 2025-10-23
Claude Haiku 4.5 for Computer Use -- 2025-10-23
Sonnet 4.5 subagent Haiku question -- 2025-10-23
disler/big-3-super-agent -- 2025-10-23
usieye/flowma -- 2025-10-23
[Editorial] https://github.com/jingyaogong/minimind/blob/master/README_en.md -- 2025-10-23
After treating RL training like an SRE project, I see why they chose CISPO -- 2025-10-23
Chatgpt or Claude for web coding assitant -- 2025-10-22
Does Claude Desktop support MCP Server Notifications? -- 2025-10-22
Ollama Cloud API Tool usage -- 2025-10-22
[Editorial] https://www.linkedin.com/posts/mavlevin_aisecurity-zeroday-cybersecurity-activity-7386478715813330944-P9OP -- 2025-10-22
Linux Capabilities Revisited -- 2025-10-22
I got fed up with Open WebUI/LibreChat for local LLMs so I made an open source tool to turn my GPU server into an always-on assistant -- 2025-10-21
This is how I track usage and improve my AI assistant without exposing sensitive data -- 2025-10-21
Roadmap for building scalable AI agents! -- 2025-10-21
My TypeScript MCP server template `mcp-ts-template` just hit v2.3.7. Declarative tool definitions. Pluggable Storage. Edge-native (Cloudflare Workers). Optional OpenTelemetry. OAuth with Scope Enforcement, etc. -- 2025-10-21
virattt/dexter -- 2025-10-21
UI-AGILE: Advancing GUI Agents with Effective Reinforcement Learning and Precise Inference-Time Grounding -- 2025-10-21
[Editorial] https://www.linkedin.com/posts/gadievron_another-day-another-attack-on-ai-coding-activity-7386382494117466112-tXuF -- 2025-10-21
[Editorial] https://www.linkedin.com/posts/reuvencohen_ive-seen-the-future-of-coding-and-it-activity-7386187612597714944-jXQn -- 2025-10-21
Qwen3-vl:235b-cloud Ollama model error -- 2025-10-21
Expose MCP at the LLM server level? -- 2025-10-20
I got tired of copy-pasting NotebookLM answers into Claude, so I built an MCP server for it -- 2025-10-20
Use n8n in Open WebUI without maintaining pipe functions -- 2025-10-20
Slack sync into OpenWebUI Knowledge -- 2025-10-20
[Editorial] Chart a path -- 2025-10-18
[Editorial] Agentic Flow -- 2025-10-18
[Editorial] Turbo Flow -- 2025-10-18
Claudiomiro: How to Achieve 100% Autonomous (Complex) Coding -- 2025-10-18
Flowchart vs handoff: two paradigms for building AI agents -- 2025-10-18
Compare Claude Code and Codex from one prompt -- 2025-10-18
Claude Agent SDK + Cloudflare Containers is the perfect agent platform -- 2025-10-18
[Editorial] Getting more out of Claude Code SDK -- 2025-10-17
[Editorial] Agentic Flow - AI Agent Framework That Gets Smarter AND Faster Every Time It Runs -- 2025-10-17
oracle/agent-spec -- 2025-10-17
Holy Marketplaces, Batman! -- 2025-10-16
Show HN: Metorial (YC F25) – Vercel for MCP -- 2025-10-16
Enhancing Generative Auto-bidding with Offline Reward Evaluation and Policy Search -- 2025-10-15
How to handle long running tools in realtime conversations. -- 2025-10-15
Anyone else having reasoning parser issue with Qwen-cli + GLM4.6 combo in vllm? -- 2025-10-15
Plan mode coming to Codex CLI -- 2025-10-15
Something is wrong with Sonnet 4.5 -- 2025-10-15
Xrvitd/MeshMosaic -- 2025-10-15
Qwen/Qwen3-VL-235B-A22B-Instruct -- 2025-10-15
Scene Graph-Guided Proactive Replanning for Failure-Resilient Embodied Agent -- 2025-10-15
Vibe Coding and the Popularization of CLI Interfaces: Why Don’t Big Companies Use Millions of Users as Contributors to Improve Models? -- 2025-10-14
rexleimo/agno-Go -- 2025-10-14
The Silent Scientist: When Software Research Fails to Reach Its Audience -- 2025-10-14
OpenAI’s AgentKit makes building AI agents way easier, design, chat, test, and connect everything in one place! -- 2025-10-14
NewtonBench: Benchmarking Generalizable Scientific Law Discovery in LLM Agents -- 2025-10-14
A list of models released or updated this week on this sub, in case you missed any (10 Oct). -- 2025-10-14
Alibaba-NLP/Tongyi-DeepResearch-30B-A3B -- 2025-10-14
ibm-granite/granite-4.0-micro -- 2025-10-14
BasedBase/GLM-4.5-Air-GLM-4.6-Distill -- 2025-10-14
A 5-minute, no-BS way to pick a local model for your real task -- 2025-10-14
[Update] CodeLens.AI - Crowdsourced AI Leaderboard 3 Days Later: Blind Voting and What We Learned -- 2025-10-14
How to re-create OpenAI Assistants locally? -- 2025-10-14
M2 Max 96GB - llama.cpp with codex and gpt-oss 120b to edit files and github upload -- 2025-10-14
Why You Should Build AI Agents with Ollama First -- 2025-10-14
OpenWebUI en Docker no detecta modelo LLaMA3 instalado con Ollama en Linux -- 2025-10-14
[Editorial] The Reality of Agentic Development -- 2025-10-13
[AutoBE] achieved 100% compilation success of backend generation with "qwen3-next-80b-a3b-instruct" -- 2025-10-13
What ACTUALLY works after testing every AI coding tool for 6 months -- 2025-10-13
Issue with long parameter values when using tool calling with Anthropic API -- 2025-10-13
Moondream3 and Salesforce GTA-1 for UI grounding in computer-use agents -- 2025-10-12
vdpiya/batchi -- 2025-10-12
Agentic generative AI for media content discovery at the national football league -- 2025-10-12
demo: my open-source local LLM platform for developers -- 2025-10-10
Modelfile. Do I need these tags PER prompt? -- 2025-10-10
Script to install a bunch of AI or Dev tools automatically.. what can I add to it or improve? -- 2025-10-10
Claude Code compaction fails with “Conversation too long” even when context is below 75% -- 2025-10-10
Show HN: FleetCode – Open-source UI for running multiple coding agents -- 2025-10-10
Local Terminal Access -- 2025-10-10
xcLee001/SonicVale -- 2025-10-10
InternRobotics/VLAC -- 2025-10-10
meituan-longcat/LongCat-Flash-Thinking -- 2025-10-10
LiquidAI/LFM2-1.2B-Tool -- 2025-10-10
Hcompany/Holo1.5-7B -- 2025-10-09
[Editorial] Agentics Newsletter -- 2025-10-09
[Editorial] Latest batch from rUv. -- 2025-10-09
TheAgentArk/Toucan -- 2025-10-09
[Editorial] Increased edit speed, reduced LLM cost -- 2025-10-08
AI agents face off -- 2025-10-08
How to make Claude Code work for you at night? -- 2025-10-08
tfriedel/claude-office-skills -- 2025-10-08
What happens if AI agents start trusting everything they read? (I ran a test.) -- 2025-10-06
High-performance mice can be used as a microphone to spy on users -- 2025-10-06
How can I test bad behavior in model APIs without getting banned? -- 2025-10-06
Framework or custom for local rag/agentic systems -- 2025-10-05
Test your MCP server against Llama, no key required -- 2025-10-05
aiprodcoder/MIXAPI -- 2025-10-05
williavs/AGENTDL -- 2025-10-05
Ally finally got RAG – everything runs local now -- 2025-10-05
RawdodReverend/TermNet -- 2025-10-05
[Editorial] https://www.linkedin.com/posts/albertochierici_lol-i-cant-stop-thinking-about-this-we-activity-7379840898626502656-bUYZ -- 2025-10-03
Vyzer9/Valkan -- 2025-10-03
Bypassing TLS Certificate Validation with Ld_preload -- 2025-10-03
I built Solveig, it turns any LLM into an agentic assistant in your terminal that can safely use your computer -- 2025-10-02
# 🥔 Meet Tater Totterson — The Local AI Assistant That Doesn’t Need MCP Servers -- 2025-10-02
Do I need to run /init on a repo if I already have AGENTS.md? -- 2025-10-02
sshllm/sshai -- 2025-10-02
[Editorial] System prompts are getting outdated! -- 2025-10-02
[Editorial] https://github.com/emcie-co/parlant -- 2025-10-02
Microsoft Agent Framework (Preview): Making AI Agents Simple for Every Developer -- 2025-10-02
Codex is mind blowing -- 2025-09-29
LongLLaDA: Unlocking Long Context Capabilities in Diffusion LLMs -- 2025-09-29
Apple called out every major AI company for fake reasoning and Anthropic's response proves their point -- 2025-09-29
Help with running Ai models with internet connectivity -- 2025-09-28
AWS announces EC2 instance attestation -- 2025-09-28
The Perplexity Search API -- 2025-09-28
Reinforcement Learning with Rubric Anchors -- 2025-09-28
PHM-Bench: A Domain-Specific Benchmarking Framework for Systematic Evaluation of Large Models in Prognostics and Health Management -- 2025-09-28
Roo Code 3.28.6 Release Notes - GPT-5-Codex IS HERE!! -- 2025-09-28
Main thing I use claude for is to prevent Codex from gaslighting me -- 2025-09-28
Model answers include raw <br> tags when generating tables – how to fix in Open WebUI? -- 2025-09-28
How to embed images in responses? -- 2025-09-28
New Agent benchmark from Meta Super Intelligence Lab and Hugging Face -- 2025-09-27
evalops/dspy-micro-agent -- 2025-09-27
nvidia/NVIDIA-Nemotron-Nano-9B-v2 -- 2025-09-27
inclusionAI/Ling-flash-2.0 -- 2025-09-27
1K+ schemas of agentic projects visualized -- 2025-09-26
what AI agent framework is actually production viable and/or least problematic? -- 2025-09-26
Squeeze the Soaked Sponge: Efficient Off-policy Reinforcement Finetuning for Large Language Model -- 2025-09-26
CogniSQL-R1-Zero: Lightweight Reinforced Reasoning for Efficient SQL Generation -- 2025-09-26
Link a git repo to llama.cpp server? -- 2025-09-24
oxbshw/LLM-Agents-Ecosystem-Handbook -- 2025-09-24
Native MCP (streamable HTTP) may be on the way -- 2025-09-24
nvidia/NVIDIA-Nemotron-Nano-12B-v2 -- 2025-09-23
Gaia2 and ARE: Empowering the community to study agents -- 2025-09-23
Gaia2 and ARE: Empowering the community to study agents -- 2025-09-23
Open sourced my AI video generation project -- 2025-09-23
Zen, many Code CLI instances (/commands) for peaceful parallel task execution. -- 2025-09-23
twiggy-tools/Twiggy -- 2025-09-23
KubeAgentic-Community/KubeAgentic -- 2025-09-23
MyLocalAI - Enhanced Local AI Chat Interface (vibe coded first project!) -- 2025-09-23
Tesslate/WEBGEN-4B-Preview -- 2025-09-23
tencent/SRPO -- 2025-09-23
DeepSeek-R1 incentivizes reasoning in LLMs through reinforcement learning -- 2025-09-22
Atom-Searcher: Enhancing Agentic Deep Research via Fine-Grained Atomic Thought Reward -- 2025-09-22
[Editorial] A Multi-Agent LLM Defense Pipeline Against Prompt Injection Attacks -- 2025-09-21
Claude Code native subagents vs. Claude Flow vs. BMAD -- 2025-09-21
Hallucination in LLM-Based Code Generation: An Automotive Case Study -- 2025-09-21
Infherno: End-to-end Agent-based FHIR Resource Synthesis from Free-form Clinical Notes -- 2025-09-20
GGUF security concerns -- 2025-09-20
Democratizing AI Safety with RiskRubric.ai -- 2025-09-20
VoxCPM 0.5B : Tokenizer-Free TTS and Voice Cloning -- 2025-09-18
Alibaba-NLP/Tongyi-DeepResearch-30B-A3B · Hugging Face -- 2025-09-18
NexaAI/OmniNeural-4B -- 2025-09-18
MobileLLM-R1-950M meets Apple Silicon -- 2025-09-18
VS Code Chat: Introducing auto model selection (preview) -- 2025-09-18
ircfspace/masque-plus -- 2025-09-18
First AI Agent for DevOps/SRE and Platform Engineering -- 2025-09-17
This AI assistant became our go-to Unity co-pilot (not just another LLM) -- 2025-09-17
Runtime intelligence in games -- 2025-09-17
[Editorial] Villager -- 2025-09-16
Update: we got our revenge and now beat Deepmind, Microsoft, Zhipu AI and Alibaba -- 2025-09-16
Building Ai Agent from Scratch (Python) -- 2025-09-15
Siddhant-K-code/tokenvm -- 2025-09-15
Taming Uncertainty via Automation: Observing, Analyzing, and Optimizing Agentic AI Systems -- 2025-09-15
Qwen3-Next-80B-A3B - a big step up may be the best open source reasoning model so far -- 2025-09-14
Qwen/Qwen3-Next-80B-A3B-Thinking -- 2025-09-14
Nothing concrete to show yet, I just wanted to celebrate getting a remote MCP server\connector with oAuth working :) -- 2025-09-09
The Dark Side of LLMs Agent-based Attacks for Complete Computer Takeover -- 2025-09-09
An LLM-powered Natural-to-Robotic Language Translation Framework with Correctness Guarantees -- 2025-09-09
[Editorial] Why Language Models Hallucinate -- 2025-09-09
[Editorial] Compression Failures in LLMs -- 2025-09-09
[Editorial] Active Inference AI -- 2025-09-09
I built a Graph RAG pipeline (VeritasGraph) that runs entirely locally with Ollama (Llama 3.1) and has full source attribution. -- 2025-09-09
Environments Hub walkthrough: Your Language Model needs better (open) environments to learn -- 2025-09-08
The Landscape of Agentic Reinforcement Learning for LLMs -- 2025-09-08
What are your struggles with tool-calling and local models? -- 2025-09-08
[Project Update] From Brittle Scripts to a Resilient, Self-Auditing Architecture: The Evolution of MeganX 3.0 -- 2025-09-07
I accidentally beat Claude Code this weekend - multi-agent-coder now #12 on Stanford's TerminalBench 😅 -- 2025-09-07
Open-source tool to let Claude Code control your computer -- 2025-09-07
Trustworthy Agents for Electronic Health Records through Confidence Estimation -- 2025-09-07
Context Reasoning Benchmarks: GPT-5, Claude, Gemini, Grok on Real Tasks -- 2025-09-05
The CLAUDE.md Framework: A Guide to Structured AI-Assisted Work (prompts included) -- 2025-09-05
Team-intN18-SoybeanSeclab/Typhon -- 2025-09-05
DatarusAI/Datarus-R1-14B-preview -- 2025-09-05
Are there any SDKs that offer native tool calling functionality that can be used with any LLMs -- 2025-09-04
Open source wrapper around AugmentCode -- 2025-09-04
Producer Pal: control Ableton Live and make music with Claude -- 2025-09-04
ChatGPT on the Road: Leveraging Large Language Model-Powered In-vehicle Conversational Agents for Safer and More Enjoyable Driving Experience -- 2025-09-04
Jupyter Agent Dataset -- 2025-09-04
Training & Querying 3 Ollama Models with Zer00logy: Symbolic Cognition Framework and Void-Math OS -- 2025-09-04
unsloth/Qwen3-Coder-480B-A35B-Instruct-GGUF -- 2025-09-03
HeroBench: A Benchmark for Long-Horizon Planning and Structured Reasoning in Virtual Worlds -- 2025-09-03
Achieving 80% task completion: Training LLMs to actually USE tools -- 2025-09-03
githubnext/gh-aw -- 2025-09-03
Toad: Universal TUI for Agentinc Coding from Will McGugan (Rich/Textual) -- 2025-09-03
How do you do RL 100% locally without a NVIDIA GPU? -- 2025-08-31
NiceWebRL: a Python library for human subject experiments with reinforcement learning environments -- 2025-08-31
Coquette Mobile - Android App, Ollama with Agentic Properties - desktop control. -- 2025-08-30
Testers for Seed-OSS tool calling wanted! -- 2025-08-29
Codebase to Knowledge Graph generator -- 2025-08-29
GaohaoZhou-ops/Tello-LLM-ROS -- 2025-08-29
Exploring Autonomous Agents: A Closer Look at Why They Fail When Completing Tasks -- 2025-08-29
Built an AI Agent Orchestration Platform - Handles 70% of Our Dev Tasks -- 2025-08-29
Hobbyist project : enabling smaller language models to interact with large code bases -- 2025-08-28
Evaluate any computer-use agent with HUD + OSWorld-Verified -- 2025-08-28
The outer loop vs. the inner loop of agents. A simple mental model to evolve the agent stack quickly and push to production faster. -- 2025-08-28
AgentCheck: Local AI-powered code review agents for Claude Code -- 2025-08-28
[Editorial] The Complete Guide to BuildingAI Agents -- 2025-08-27
Tencent/Youtu-agent -- 2025-08-27
AgentScope 1.0: A Developer-Centric Framework for Building Agentic Applications -- 2025-08-27
CNCF Webinar–AI Model Packaging with KitOps -- 2025-08-27
[Editorial] Sense of Self and Time in Borderline Personality -- 2025-08-27
[Editorial] AI and security tools. -- 2025-08-27
MetaAgent: Automatically Constructing Multi-Agent Systems Based on Finite State Machines -- 2025-08-26
Models to complement GPT-5? -- 2025-08-26
Why claude.md fails and How CORE Fixes Memory in Claude Code -- 2025-08-26
Free Preview of Qoder: The Future of Agentic Coding? -- 2025-08-25
What MCP Servers are You Using -- 2025-08-25
I built real-time course correction for Claude Code... and it's also a Tamagotchi -- 2025-08-25
Not a model, but Open Source Memory framework claims to beat Mem0 on public benchmarks -- 2025-08-24
ComputerRL: Scaling End-to-End Online Reinforcement Learning for Computer Use Agents -- 2025-08-24
CausalPlan: Empowering Efficient LLM Multi-Agent Collaboration Through Causality-Driven Planning -- 2025-08-24
Jules is already making excuses like a senior dev trying to explain why they pushed to main on a Friday. -- 2025-08-23
Build a Local AI Agent with MCP Tools Using GPT-OSS, LangChain & Streamlit -- 2025-08-23
Codanna Adds TypeScript Parsing and Modular Language Registry. Context-First Coding. -- 2025-08-23
Presenton now supports presentation generation via MCP -- 2025-08-23
In 44 lines of code, we have an actually useful agent that runs entirely locally, powered by Qwen3 30B A3B Instruct -- 2025-08-20
Web Agent Memory Protocol (WAMP): Building a Shared Memory Layer for the Web -- 2025-08-20
Learning from building my first saas using claude code -- 2025-08-20
Generate Images with Claude and Hugging Face -- 2025-08-20
[Editorial] AI agents are rendering GitHub's human-centric collaboration tools obsolete -- 2025-08-18
dongguanting/ARPO -- 2025-08-18
MCP for Research: How to Connect AI to Research Tools -- 2025-08-18
Tencent-Hunyuan/HunyuanWorld-1.0 -- 2025-08-16
Rediscovering Microsoft’s Oddball Music Generator From The 1990s -- 2025-08-16
Trying to decide between Kilocode, Cline and Roo code -- 2025-08-15
GPT-5 vs Claude Opus 4.1: Which New AI Model Wins? -- 2025-08-15
bosonai/higgs-audio-v2-generation-3B-base -- 2025-08-14
Chain-GPT/Solidity-LLM -- 2025-08-14
Bottom-up Domain-specific Superintelligence: A Reliable Knowledge Graph is What We Need -- 2025-08-14
🇵🇭 FilBench - Can LLMs Understand and Generate Filipino? -- 2025-08-14
Miro ODR: Another Deep Research Agent model just went open source -- 2025-08-14
Is the Aider polyglot coding leaderboard still being updated? GPT-5? -- 2025-08-14
Claude going crazy on extended thinking? -- 2025-08-14
Building a self-hosted AI support agent (using GPT-OSS) that can both guide users and perform real actions – looking for feedback -- 2025-08-12
Local model recommendations for lightweight, repeated screenshot analysis on macOS? -- 2025-08-12
A free goldmine of tutorials for the components you need to create production-level agents Extensive open source resource with tutorials for creating robust AI agents -- 2025-08-12
declare-lab/jamify -- 2025-08-12
Intelligent-Internet/II-Search-4B -- 2025-08-12
THUDM/GLM-4.1V-9B-Thinking -- 2025-08-12
QwenLM/Qwen-Image -- 2025-08-12
A specific asynchronous workflow pattern -- 2025-08-11
mozilla-ai/any-llm -- 2025-08-11
SunzeY/SEAgent -- 2025-08-11
[Editorial] Three Things I Learned About Voice Agents from Kwindla Kramer -- 2025-08-09
NVIDIA AI-Q Achieves Top Score for Open, Portable AI Deep Research (LLM with Search Category) -- 2025-08-09
Vibe Coding an AI article generator using Onuro 🔥 -- 2025-08-09
Claude Code v1.0.71 - Background Commands -- 2025-08-09
Doriandarko/make-it-heavy -- 2025-08-09
universal-tool-calling-protocol/go-utcp -- 2025-08-09
[Editorial] Open source GUI for Claude Code -- 2025-08-08
DoubleAgents: Fine-tuning LLMs for Covert Malicious Tool Calls -- 2025-08-08
Hey folks, I’m one of the contributors to Bifrost, and we just launched it on Product Hunt -- 2025-08-08
Funny but annoying time bug -- 2025-08-08
A free goldmine of tutorials for the components you need to create production-level agents Extensive open source resource with tutorials for creating robust AI agents -- 2025-08-08
OS Agents: A Survey on MLLM-based Agents for General Computing Devices Use -- 2025-08-08
So multi agents.. and context.. how does that work -- 2025-08-07
Can you import chats in JSON? How? -- 2025-08-07
MAESTRO, a deep research assistant/RAG pipeline that runs on your local LLMs -- 2025-08-07
Quantize your own GGUFs the same way as your fav Unsloth Dynamic GGUFs -- 2025-08-07
Read your code -- 2025-08-07
weixin-omni/omni-bot-sdk-oss -- 2025-08-06
Kart – Distributed version-control for geospatial and tabular data -- 2025-08-06
[Editorial] Turn-Taking model for Voice AI Agents -- 2025-08-06
[Editorial] a more mature phase of the AI cycle. -- 2025-08-05
disler/claude-code-hooks-multi-agent-observability -- 2025-08-05
The Parallel Lives of an AI Engineer -- 2025-08-05
Any toolkits or predefined subagents for claude code that you think are a game changer? -- 2025-08-04
ramakay/claude-self-reflect -- 2025-08-04
[Editorial] Agentic Web: Weaving the Next Web with AI Agents -- 2025-08-03
[Editorial] Gemini Flow -- 2025-08-03
Pwn2Own Contestants hold on to Ollama exploits due to its rapid update cycle -- 2025-08-02
Claude Code sub agents not working as expected -- 2025-08-02
syou6162/cchook -- 2025-08-02
I need a tutorial for coding with any model (but currently trying with DeepSeek coder) -- 2025-08-02
The tradeoff between human and AI context -- 2025-08-02
Building a custom LLM trained on luciform prompts + ShadeOS daemon dialogues – seeking help -- 2025-08-01
I built a zsh plugin that turns natural language into shell commands using locally hosted Ollama -- 2025-08-01
Some thoughts on vibe / ai-driven coding -- 2025-08-01
[Editorial] AI in hostile environments... -- 2025-08-01
leesh3288/CVE-2025-32023 -- 2025-08-01
In search of riches, hackers plant 4G-enabled Raspberry Pi in bank network -- 2025-08-01
[Editorial] PRP, google cli fork -- 2025-07-31
[Editorial] Alternative to claude code cli -- 2025-07-31
Why I Forked Qwen Code -- 2025-07-31
Unwanted and unrelated changes to my code: my biggest gripe with ChatGPT -- 2025-07-31
How to Stop Claude from Being a Yes-Man? (Anchoring Bias Problem) -- 2025-07-31
We just open sourced NeuralAgent: The AI Agent That Lives On Your Desktop and Uses It Like You Do! -- 2025-07-30
Help with UnifyAI – Setting Up Local LLMs and UI Integration -- 2025-07-30
Show HN: Terminal-Bench-RL: Training Long-Horizon Terminal Agents with RL -- 2025-07-30
Show HN: Flyde 1.0 – Like n8n, but in your codebase -- 2025-07-30
Reachy The Robot Gets a Mini (Kit) Version -- 2025-07-30
Hey AI, Generate Me a Hardware Code! Agentic AI-based Hardware Design & Verification -- 2025-07-30
100 lines of Python is all you need: A radically minimal coding agent that scores 65% on SWE-bench (near SotA!) [Princeton/Stanford NLP group] -- 2025-07-30
[Editorial] laude Code Videos and Demos by Ruv (claude-swarm fame) -- 2025-07-29
[Editorial] It was fun while it lasted... bring on the $1000/mo max plan. -- 2025-07-29
Claude Code Best Practices/Tips/Tricks -- 2025-07-29
Everything I've Learned so far About OpenAI's Agents -- 2025-07-29
Why isn't this already a standard in robotics? -- 2025-07-28
The 14 Pains of Billing for AI Agents -- 2025-07-28
[Editorial] Product Requirement Prompts (PRP) -- 2025-07-28
Red flag phrases -- 2025-07-28
Say hello to `hf`: a faster, friendlier Hugging Face CLI ✨ -- 2025-07-28
[Editorial] Local voice AI, 235B LLM -- 2025-07-28
I stopped typing. Now I just use a hotkey. I built Agent-CLI to make it possible. -- 2025-07-28
Local cross-platform speech-to-speech and real-time captioning with OpenAI Whisper, Vulkan GPU acceleration and more -- 2025-07-28
Devstral & Magistral as adapters of Mistral -- 2025-07-28
[Editorial] Intersection of Product Management and Development -- 2025-07-27
🔓 I built Hearth-UI — A fully-featured desktop app for chatting with local LLMs (Ollama-ready, attachments, themes, markdown, and more) -- 2025-07-27
UIGEN-X 8B supports React Headless, Flutter, React Native, Static Site Generators, Tauri, Vue, Gradio/Python, Tailwind, and prompt-based design. GGUF/GPTQ/MLX Available -- 2025-07-27
Realtime codebase indexing for coding agents with ~ 50 lines of Python (open source) -- 2025-07-27
Freigeist - The new Vibe Coding Platform -- 2025-07-27
What are some unique uses of OpenWebUI that you can't get otherwise? -- 2025-07-27
Claude Code finally told me the truth about agents :) -- 2025-07-26
Airfare Discrimination as a Service: Airlines' Favorite New Pricing Trick -- 2025-07-25
would this make an ai dev's life easier? -- 2025-07-25
Let’s sync on CLI agents! What’s actually working for you? -- 2025-07-25
Security Issue - Recent Claude Code behavior favoring fast/easy/simple took an API key and hardcoded it as a default value -- 2025-07-25
What is the best agent framework for Qwen3? -- 2025-07-24
Qwen/Qwen3-Coder-480B-A35B-Instruct -- 2025-07-24
Tool calling or not, I will use anyway -- 2025-07-24
Do you give your LLM terminal and code execution access? -- 2025-07-24
Built Ollamaton - Universal MCP Client for Ollama (CLI/API/GUI) -- 2025-07-23
What models/ai-code editors don't train on my codebase? -- 2025-07-23
Can someone PLEASE ELI5 MCPs, Connectors, and Extensions for me? -- 2025-07-23
Made My Own Auto Tool System and Enhanced Web Search Tool + Questions -- 2025-07-23
omar-haris/cursor-buddy-mcp -- 2025-07-22
EU is being left behinde and it sucks! -- 2025-07-22
We built Explainable AI with pinpointed citations & reasoning — works across PDFs, Excel, CSV, Docs & more -- 2025-07-20
Ready to go multi agent workflow on github? -- 2025-07-20
How do we secure AI agents that act on their own? -- 2025-07-19
Migrating a semantically-anchored assistant from OpenAI to local environment (Domina): any successful examples of memory-aware agent migration? -- 2025-07-19
Trying to get my Ollama model to run faster, is my solution a good one? -- 2025-07-19
GitHub - boneylizard/Eloquent: A local front-end for open-weight LLMs with memory, RAG, TTS/STT, Elo ratings, and dynamic research tools. Built with React and FastAPI. -- 2025-07-18
A free goldmine of tutorials for the components you need to create production-level agents Extensive open source resource with tutorials for creating robust AI agents -- 2025-07-18
Five Big Improvements to Gradio MCP Servers -- 2025-07-18
Migrating a semantically-anchored assistant from OpenAI to local environment (Domina): any successful examples of memory-aware agent migration? -- 2025-07-18
ARGO - A Local-First, Offline AI Agent That Puts You in Control -- 2025-07-17
Why LangGraph overcomplicates AI agents (and my Go alternative) -- 2025-07-17
new MCP alt. just dropped -- 2025-07-17
pydantic/fasta2a -- 2025-07-17
Share your MCP servers and experiments! -- 2025-07-17
OPENCODE - Like Claude Code or Gemini CLI, but works with local models and/or paid ones as well -- 2025-07-15
I built a Deep Researcher agent and exposed it as an MCP server! -- 2025-07-15
awwaiid/gremllm -- 2025-07-15
Ollama calling tools -- 2025-07-15
🪝 Claude-Flow@Alpha v2: We've implemented the new Claude Code Hooks in the latest Claude Flow alpha release combining hive style swarms, neural pattern recognition, and 87 MCP tools (install using: npx claude-flow@alpha) -- 2025-07-14
k2-fsa/ZipVoice -- 2025-07-13
K-intelligence/Midm-2.0-Base-Instruct -- 2025-07-13
AutoTester.dev: First AI-Driven Automatic Test Tool for Web Apps -- 2025-07-13
eiondb/eion -- 2025-07-13
mistralai/Devstral-Small-2507 -- 2025-07-13
What product or extension is great at autocomplete and predictive typescript/javascript and kotlin code. Cursor is out because I'm not going to pay even $1 on a greedy and scammy product, and Windsurf performs moderately well -- 2025-07-11
trufflesecurity/force-push-scanner -- 2025-07-11
LEGO/kube-tf-reconciler -- 2025-07-11
agentica-org/DeepSWE-Preview -- 2025-07-11
Thanks to you, I built an open-source website that can watch your screen and trigger actions. It runs 100% locally and was inspired by all of you! -- 2025-07-11
Preceptor – A Local AI Focus App That Nudges You Back on Track | Waitlist + Suggestions needed -- 2025-07-11
AGI is not multimodal -- 2025-07-09
How Do Vision-Language Models Process Conflicting Information Across Modalities? -- 2025-07-09
Building a Potato-based GLaDOS as an Introduction to AI -- 2025-07-07
We built runtime API discovery for LLM agents using a simple agents.json -- 2025-07-06
OWUI 0.6.15 OpenTelemetry (Experimental) -- 2025-07-06
[Open Source] Moondream MCP - Vision for AI Agents -- 2025-07-05
Kyutai's STT with semantic VAD now opensource -- 2025-07-05
brizzai/auto-mcp -- 2025-07-05
Lifailon/openrouter-bot -- 2025-07-05
Augment Code?? -- 2025-07-04
Simple-Efficient/RL-Factory -- 2025-07-04
Ratler/airuler -- 2025-07-04
[Setup discussion] AMD RX 7900 XTX workstation for local LLMs — Linux or Windows as host OS? -- 2025-07-04
🧠💬 Introducing AI Dialogue Duo – A Two-AI Conversational Roleplay System (Open Source) -- 2025-07-04
Qwen 2.5 32B or Similar Models -- 2025-07-04
Extending Minds with Generative AI -- 2025-07-04
Trying to Make Llama Extract Smarter with a Schema-Building AI Agent -- 2025-07-02
Want help in retrieving links from DB -- 2025-07-02
Ingesting docs for context -- 2025-07-02
Agents via OpenWebUI Functions -- 2025-07-02
pfnet/plamo-2-translate -- 2025-06-30
Self-Adapting Language Models -- 2025-06-27
tencent/Hunyuan-A13B-Instruct -- 2025-06-27
maya-research/Veena -- 2025-06-27
jennyzzt/dgm -- 2025-06-26
Looking to build a local AI assistant - Where do I start? -- 2025-06-24
Real-time conversational AI running 100% locally in-browser on WebGPU -- 2025-06-24
UI + RAG solution for 5000 documents possible? -- 2025-06-24
Good stable voice cloning and TTS with NOT much complicated installation? -- 2025-06-24
🚀 I built a lightweight web UI for Ollama – great for local LLMs! -- 2025-06-24
How to train a VLM with a dataset that has text and images? -- 2025-06-24
Top open-source AI Agent in both SWE-bench Verified and Lite -- 2025-06-24
AllTracker: Efficient Dense Point Tracking at High Resolution -- 2025-06-24
I Read All of Cloudflare's Claude-Generated Commits -- 2025-06-24
Show HN: I created an tool that creates interactive product demos in 2 minutes -- 2025-06-24
I’m the Maintainer (and Team) behind Open WebUI – AMA 2025 Q2 -- 2025-06-24
Eleven v3 -- 2025-06-22
SAGA Update: Now with Autonomous Knowledge Graph Healing & A More Robust Core! -- 2025-06-21
A free goldmine of tutorials for the components you need to create production-level agents -- 2025-06-21
Build a full on-device rag app using qwen3 embedding and qwen3 llm -- 2025-06-21
Build LLM from Scratch | Mega Playlist of 43 videos -- 2025-06-21
Running an LLM on a PS Vita -- 2025-06-21
haiku.rag a local sqlite RAG library -- 2025-06-21
LLMs Fine-Tuning -- 2025-06-21
Do you still use GPT APIs for demo apps? I'm leaning towards open models. -- 2025-06-21
Guidelines on how to be a scientific sleuth released -- 2025-06-21
Which models are you able to use with MCP servers? -- 2025-06-21
Rig upgraded to 8x3090 -- 2025-06-21
moonshotai/Kimi-Dev-72B -- 2025-06-21
Show HN: DaedalOS – Desktop Environment in the Browser -- 2025-06-19
tencent/SongGeneration -- 2025-06-19
haasonsaas/ocode -- 2025-06-19
dagger/container-use -- 2025-06-13
lerobot/smolvla_base -- 2025-06-10
brendanhogan/picoDeepResearch -- 2025-06-08
sarvamai/sarvam-m -- 2025-06-07
Qwen/Qwen3-Reranker-0.6B -- 2025-06-07
Hcompany/Holo1-7B -- 2025-06-06
huggingface/smolagents -- 2025-06-05
openpubkey/opkssh -- 2025-06-05
hashicorp/terraform -- 2025-06-05
NousResearch/atropos -- 2025-06-04
google/A2A -- 2025-06-03
hydropix/TranslateBookWithLLM -- 2025-05-31
sisig-ai/doctor -- 2025-05-31