ai-digest.dev
last updated 2 h ago
ModelsReddit r/LocalLLaMA 13 d ago

poolside/Laguna-M.1 · Hugging Face - 225B-A23B

Laguna M.1 is a newly released 225 billion parameter Mixture-of-Experts (MoE) model featuring 23 billion activated parameters per token, optimized for agentic coding and long-horizon tasks. It employs a 70-layer architecture with 67 sparse MoE layers, 256 experts, and global attention, achieving competitive benchmark results on SWE-bench and Terminal-Bench 2.0. The model supports interleaved reasoning and has a context window of 262,144 tokens, making it a significant advancement for practitioners focusing on high-capacity AI applications.

lagunahugging_facemoerelevance 0.00 · engagement 0.00
Read at source ↗← all news
poolside/Laguna-M.1 · Hugging Face - 225B-A23B — AI News Digest