Author: matt_d - Lobster Roll

Compiling LLMs into a MegaKernel: A path to low-latency inference (zhihaojia.medium.com)

HN 314 pts 76 comments by matt_d Jun 19, 2025 thread

Algorithms for Modern Processor Architectures (lemire.github.io)

HN 290 pts 58 comments by matt_d Jul 22, 2025 Science / Math / Physics thread

Test Results for AMD Zen 5 (agner.org)

HN 255 pts 79 comments by matt_d Jul 26, 2025 thread

Zen 5's AVX-512 Frequency Behavior (chipsandcheese.com)

HN 213 pts 66 comments by matt_d Mar 1, 2025 thread

High-Performance DBMSs with io_uring: When and How to use it (arxiv.org)

HN 194 pts 49 comments by matt_d Jan 6, 2026 thread

Evolving the OCaml Programming Language (2025) [pdf] (kcsrk.info)

HN 159 pts 48 comments by matt_d Sep 5, 2025 Programming (General) Programming Languages / CS Theory thread

Decompiling 2024: A Year of Resurgance in Decompilation Research (mahaloz.re)

HN 147 pts 61 comments by matt_d Jan 30, 2025 thread

You could have invented Fenwick trees (cambridge.org)

HN 131 pts 29 comments by matt_d Jan 25, 2025 thread

FlashAttention-T: Towards Tensorized Attention (dl.acm.org)

HN 116 pts 57 comments by matt_d Feb 3, 2026 thread

DWARF as a Shared Reverse Engineering Format (lief.re)

HN 109 pts 19 comments by matt_d May 28, 2025 thread

Using obscure graph theory to solve programming languages problems (reasonablypolymorphic.com)

HN 106 pts 28 comments by matt_d May 13, 2025 Programming (General) thread

TorchLean: Formalizing Neural Networks in Lean (leandojo.org)

HN 104 pts 20 comments by matt_d Mar 1, 2026 AI / Machine Learning thread

Slicing Is All You Need: Towards a Universal One-Sided Distributed MatMul (arxiv.org)

HN 99 pts 8 comments by matt_d Nov 19, 2025 thread

Property-Based Testing for the People (repository.upenn.edu)

HN 99 pts 67 comments by matt_d Jan 6, 2025 thread

Explainable Linear Programs (jeremykun.com)

HN 98 pts 31 comments by matt_d Feb 7, 2025 thread

Modern Minimal Perfect Hashing: A Survey (arxiv.org)

HN 89 pts 32 comments by matt_d Jun 10, 2025 thread

Demystifying ARM SME to Optimize General Matrix Multiplications (arxiv.org)

HN 88 pts 19 comments by matt_d Jan 31, 2026 thread

How to Think About GPUs (jax-ml.github.io)

HN 88 pts 1 comment by matt_d Aug 19, 2025 thread

Safepoints and Fil-C (fil-c.org)

HN 87 pts 44 comments by matt_d Sep 16, 2025 thread

Binding Application in Idris (andrevidela.com)

HN 87 pts 5 comments by matt_d Jul 10, 2025 thread

The Hoare Cube (johnwickerson.wordpress.com)

HN 86 pts 23 comments by matt_d Dec 4, 2024 thread

Gluon: a GPU programming language based on the same compiler stack as Triton (github.com)

HN 83 pts 24 comments by matt_d Sep 17, 2025 Systems / Low-Level / OS Programming (General) thread

GPEmu: A GPU emulator for rapid, low-cost deep learning prototyping [pdf] (vldb.org)

HN 82 pts 12 comments by matt_d Jun 30, 2025 AI / Machine Learning Gaming / Retro Computing thread

Orders of Infinity (terrytao.wordpress.com)

HN 82 pts 15 comments by matt_d May 4, 2025 thread

How to train your program verifier (risemsr.github.io)

HN 80 pts 16 comments by matt_d Feb 18, 2026 Programming (General) thread

Packed Data Support in Haskell (arthi-chaud.github.io)

HN 77 pts 12 comments by matt_d Apr 28, 2025 Programming Languages / CS Theory thread

The Calculated Typer (bahr.io)

HN 75 pts 5 comments by matt_d Mar 18, 2025 thread

Safe and efficient C++ interoperability via non-escapable types and lifetimes (forums.swift.org)
