Coding
Add arch support for cohere2-MoE by michaelw9999 · Pull Request #24260 · ggml-org/llama.cpp
A pull request has been made to add architecture support for the Cohere2-MoE model in the llama.cpp framework. The North Mini Code model, developed by Cohere Labs, features a total of 30 billion parameters with 3 billion active parameters, optimized for code generation and software engineering tasks, and supports a context length of up to 256K tokens. This release is significant for practitioners as it enhances capabilities for handling large context inputs and improves performance in terminal-based applications.
coheremoepull_request