🦞🌯 Lobster Roll

Sparsely-Gated Mixture of Experts (MoE) (eli.thegreenplace.net)

Stories related to "Sparsely-Gated Mixture of Experts (MoE)" across the full archive.

Sparsely-Gated Mixture of Experts (MoE) (eli.thegreenplace.net)
Sparsely-Gated Mixture of Experts (MoE) (eli.thegreenplace.net)
Sparsely-Gated Mixture of Experts (MoE) (eli.thegreenplace.net)
The Sparsely-Gated Mixture-of-Experts Layer (2017) [pdf] (arxiv.org)
Mixture-of-Experts with Expert Choice Routing (2022) (ai.googleblog.com)
Mixture of A Million Experts: PEER (parameter efficient expert retrieval) (arxiv.org)
Mixture of a Million Experts (web3.arxiv.org)
Mixture of Nested Experts: Adaptive Processing of Visual Tokens (arxiv.org)
Layerwise Recurrent Router for Mixture-of-Experts (arxiv.org)
A Visual Guide to Mixture of Experts (MoE) LLMs (newsletter.maartengrootendorst.com)
ARIA: An Open Multimodal Native Mixture-of-Experts Model (arxiv.org)
Mixture of Parrots: Experts improve memorization more than reasoning (arxiv.org)
DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts LMs (arxiv.org)
Mixture-of-Experts (MoE) LLMs (cameronrwolfe.substack.com)
Mixture-of-Experts (MoE) LLMs (cameronrwolfe.substack.com)
Scaling a 300B Mixture-of-Experts LING LLM Without Premium GPUs (arxiv.org)
Efficient and Portable Mixture-of-Experts Communication (perplexity.ai)
NanoMoE: Mixture-of-Experts (MoE) LLMs from Scratch in PyTorch (cameronrwolfe.substack.com)
Mixture of Experts: When Does It Deliver Energy Efficiency? (neuralwatt.com)
Mixture of Tunable Experts-DeepSeek R1 Behavior Modification at Inference Time (huggingface.co)
Pangu Pro MoE: Mixture of Grouped Experts for Efficient Sparsity (arxiv.org)
Kimi K2 is a state-of-the-art mixture-of-experts (MoE) language model (twitter.com)
GitHub: https://github.com/MoonshotAI/Kimi-K2
Kimi K2 is a state-of-the-art mixture-of-experts (MoE) language model (github.com)
A Mixture of Experts Approach to Handle Concept Drifts (arxiv.org)
Dor awards submission: Mixture Of Experts ft. AGI [video] (youtube.com)
REAP: One-Shot Pruning for Trillion-Parameter Mixture-of-Experts Models (cerebras.ai)
REAP: One-Shot Pruning for Trillion-Parameter Mixture-of-Experts Models (cerebras.ai)
Mixture-of-Experts explained with PyTorch implementation (medium.com)
Intro to Routing: Mixture-of-Experts and Expert Choice (neelsomaniblog.com)
Sparse Mixture of Experts for Game AI: An Accidental Architecture (github.com)