Author: sanchitmonga22 - Lobster Roll

Launch HN: RunAnywhere (YC W26) – Faster AI Inference on Apple Silicon (github.com)

HN 235 pts 147 comments by sanchitmonga22 1d ago AI / Machine Learning Apple / macOS / iOS thread

Hi HN, we're Sanchit and Shubham (YC W26). We built a fast inference engine for Apple Silicon. LLMs, speech-to-text, text-to-speech – MetalRT beats llama.cpp, Apple's MLX, Ollama, and sherpa-onnx on every modality we tested. Custom Metal shaders, no framework overhead.<p>Also, we've o...

Show HN: On-device browser agent (Qwen) running locally in Chrome (github.com)

HN 19 pts 3 comments by sanchitmonga22 Jan 20, 2026 Web Development thread

Demo of LOCAL Browser agent (powered by Web GPU Liquid LFM & Alibaba Qwen models) opening the All in Podcast on Youtube running as a chrome extension.<p>Source: <a href="https://github.com/RunanywhereAI/on-device-browser-agent" rel="nofollow">https://github.com&#x2F...

Fastest LLM decode engine on Apple Silicon. 658 tok/s on M4-max,beats mlx by 19% (runanywhere.ai)

HN 5 pts 3 comments by sanchitmonga22 5d ago AI / Machine Learning Apple / macOS / iOS thread

Runanywhere – Make every CPU and GPU count (github.com)

HN 5 pts 2 comments by sanchitmonga22 Aug 20, 2025 Systems / Low-Level / OS thread

🦞🌯 Lobster Roll

Stories by sanchitmonga22