Every AI agent project I start ends up with the same boilerplate: chunk docs, pick an embedding model, set up a vector store, write retrieval logic, wire it into a custom tool.<p>It works, but it's plumbing — and it needs to be rebuilt for every new agent or runtime.<p>The idea I'm explori...
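For anyone who hasn't written it yet, the boilerplate listed at the top of that post (chunk, embed, store, retrieve, wrap as a tool) looks roughly like the sketch below. This is a minimal illustration, not the poster's code: `chunk`, `embed`, `VectorStore`, and `make_retrieval_tool` are hypothetical names, the "embedding model" is a toy normalized bag-of-words, and the "vector store" is an in-memory list. A real setup would swap in an actual model and database.

```python
import math
import re
from collections import Counter


def chunk(text: str, size: int = 40) -> list[str]:
    """Split text into chunks of at most `size` whitespace-separated words."""
    words = text.split()
    return [" ".join(words[i:i + size]) for i in range(0, len(words), size)]


def embed(text: str) -> dict[str, float]:
    """Toy stand-in for an embedding model: L2-normalized bag-of-words."""
    counts = Counter(re.findall(r"[a-z0-9]+", text.lower()))
    norm = math.sqrt(sum(c * c for c in counts.values())) or 1.0
    return {word: c / norm for word, c in counts.items()}


def cosine(a: dict[str, float], b: dict[str, float]) -> float:
    """Dot product of two sparse, already-normalized vectors."""
    return sum(weight * b.get(word, 0.0) for word, weight in a.items())


class VectorStore:
    """In-memory stand-in for a vector database (assumption, not a real client)."""

    def __init__(self) -> None:
        self.items: list[tuple[dict[str, float], str]] = []

    def add(self, docs: list[str]) -> None:
        # Chunk each document and store (vector, text) pairs.
        for doc in docs:
            for piece in chunk(doc):
                self.items.append((embed(piece), piece))

    def search(self, query: str, k: int = 3) -> list[str]:
        # Rank every stored chunk by similarity to the query, best first.
        q = embed(query)
        ranked = sorted(self.items, key=lambda item: cosine(q, item[0]), reverse=True)
        return [text for _, text in ranked[:k]]


def make_retrieval_tool(store: VectorStore):
    """Wrap the store as a plain callable that an agent runtime could register."""

    def search_docs(query: str) -> str:
        return "\n---\n".join(store.search(query))

    return search_docs


# Example wiring: index two short documents and expose them as one tool.
store = VectorStore()
store.add([
    "Invoices are emailed on the first day of each month.",
    "Password resets expire after one hour.",
])
search_docs = make_retrieval_tool(store)
print(search_docs("when are invoices emailed"))
```

Even in toy form, all five steps are there, and none of it is specific to one agent. That is the part that ends up being rewritten per runtime.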
Hey guys,<p>I wanted to share a small project I've been working on to solve a personal pain point: TinyTTS.<p>We all love our massive 70B+ LLMs, but when building local voice assistants, running a heavy TTS framework alongside them often eats up way too much precious VRAM and compute. I wanted ...