ModelsReddit r/LocalLLaMA — 13 d ago

poolside/Laguna-M.1 · Hugging Face - 225B-A23B

Laguna M.1 is a newly released 225 billion parameter Mixture-of-Experts (MoE) model featuring 23 billion activated parameters per token, optimized for agentic coding and long-horizon tasks. It employs a 70-layer architecture with 67 sparse MoE layers, 256 experts, and global attention, achieving competitive benchmark results on SWE-bench and Terminal-Bench 2.0. The model supports interleaved reasoning and has a context window of 262,144 tokens, making it a significant advancement for practitioners focusing on high-capacity AI applications.

lagunahugging_facemoerelevance 0.00 · engagement 0.00

Read at source ↗← all news