Skip to main content

Posts

Featured

Show HN: Statewright – Visual state machines that make AI agents reliable https://ift.tt/pGJxdgl

Show HN: Statewright – Visual state machines that make AI agents reliable Agentic problem solving in its current state is very brittle. I fell in love with it, but it creates as many problems as it solves. I'm Ben Cochran, I spent 20+ years in the trenches with full-stack Engineering, DevOps, high performance computing & ML with stints at NVIDIA, AMD and various other organizations most recently as a Distinguished Engineer. For agents to work reliably you either need massive parameter counts or massive context windows to keep the solution spaces workable. Most people are brute forcing reliability with bigger models and longer prompts. What if I made the problem smaller instead of making the model bigger? I took a different approach by using smaller models: models in the 13-20B parameter range and set them to task solving real SWE-bench problems. I constrained the tool and solution spaces using formal state machines. Each state in the machine defines which tools the model can a...

Latest Posts

Show HN: Mimik – open-source local-first alternative to Scribe and Tango https://ift.tt/F3PEVx7

Show HN: SyncBank – Self-hosted bank sync for EU banks https://ift.tt/7tuCiky

Show HN: adamsreview – better multi-agent PR reviews for Claude Code https://ift.tt/AKmVjhH

Show HN: I trained a chess engine to play like humans https://ift.tt/MUAYg2G

Show HN: Hustler Bingo – a tiny bingo game about startup Twitter clichés https://ift.tt/CvZ2IJt

Show HN: Mosaic – arrange iOS icons by color using an evolutionary algorithm https://ift.tt/Ueu8GCc

Show HN: Free OSS transcription app I made and found it's faster than wispr flow https://ift.tt/szJjNDA

Show HN: Create flashcards with Space CLI https://ift.tt/7FfI2H0

Show HN: tltv – Federation protocol for 24/7 TV channels https://ift.tt/36ARpnx

Show HN: The independent guide to agent orchestrators https://ift.tt/CpbUFdr