ai-digest.dev
last updated 2 h ago
AgentsReddit r/LocalLLaMA 12 d ago

Tmax-27b - a Qwen3.6-27b terminal agent for small GPUs trained with DPPO (RL)

Ai2 has released the Tmax-27B, a terminal agent LLM built on Qwen3.6, utilizing DPPO for reinforcement learning, achieving approximately 43% on Terminal Bench 2.0 and 69% on TB Lite. The original model is 54 GB at FP16, but various quantized versions (ranging from 2-5 bits-per-weight) have been developed to fit consumer GPUs, with sizes from approximately 8.47 GB to 14.05 GB, making it more accessible for practitioners. This enables developers to leverage advanced terminal capabilities in AI applications without requiring high-end hardware.

tmax-27bterminal-agentdpporelevance 0.00 · engagement 0.00
Read at source ↗← all news
Tmax-27b - a Qwen3.6-27b terminal agent for small GPUs trained with DPPO (RL) — AI News Digest