Daily digest — 2026-06-24

A Theory of Training Profit-Optimal LLMs

The paper presents a theoretical framework for optimizing the training of large language models (LLMs) based on economic principles, specifically focusing on the trade-offs between model size, training tokens, and associated costs. It establishes that in a compute-bound regime, the optimal model size and token budget should align with hardware efficiency, while in a data-bound regime, training expenditure scales quadratically with data availability and inversely with hardware efficiency. This model provides a basis for practitioners to make informed economic decisions regarding LLM training investments, highlighting the importance of balancing quality improvements with cost efficiency.

arXiv cs.AI — 15 d ago · found 13 d agoTraining

From Volume to Value: Preference-Aligned Memory Construction for On-Device RAG

The article introduces EPIC (Efficient Preference-aligned Index Construction), a novel approach for on-device Retrieval-Augmented Generation (RAG) that prioritizes user preferences to optimize memory usage and retrieval accuracy. EPIC demonstrates a dramatic reduction in indexing memory by 2,404 times, an 18.79% improvement in preference-following accuracy, and achieves 32.17 times lower retrieval latency compared to existing baselines, while operating within a memory constraint of under 1 MB and supporting latency between 5.21 to 29.35 ms per query across multiple platforms. This advancement is significant for practitioners as it enhances the efficiency and responsiveness of personal AI agents while maintaining user privacy through local context management.

arXiv cs.AI — 15 d ago · found 13 d agoRAG

CoRe-MoE: Contrastive Reweighted Mixture of Experts for Multi-Terrain Humanoid Locomotion with Gait Adaptation

The CoRe-MoE framework introduces a two-stage reinforcement learning approach for humanoid locomotion that effectively integrates gait adaptation and multi-terrain navigation. By decoupling gait generation from terrain adaptation, it employs a Mixture-of-Experts (MoE) architecture with a contrastive objective to enhance expert specialization and structured terrain representation. Simulation results indicate superior performance in success rate and stability, with real-world validation on a Unitree G1 robot demonstrating effective locomotion across diverse terrains, making it a significant advancement for practitioners in humanoid robotics and adaptive locomotion systems.

arXiv cs.AI — 15 d ago · found 13 d agoAgents

The day in AI, distilled.

A Theory of Training Profit-Optimal LLMs

From Volume to Value: Preference-Aligned Memory Construction for On-Device RAG

CoRe-MoE: Contrastive Reweighted Mixture of Experts for Multi-Terrain Humanoid Locomotion with Gait Adaptation

Models & Releases

Research

Safety & Security

Tooling & Open Source