Daily digest — 2026-06-17

CITRAS: Covariate-Informed Transformer for Time Series Forecasting

CITRAS, a decoder-only Transformer model for time series forecasting, has been introduced to effectively integrate observed and known covariates, addressing the common challenges in leveraging these variables due to length discrepancies. It features two novel mechanisms: Key-Value (KV) Shift, which aligns future known covariates with target variables, and Attention Score Smoothing, which enhances local dependencies into global variate-level dependencies. Experimental results indicate that CITRAS significantly improves forecasting accuracy across diverse real-world datasets, making it a valuable tool for practitioners aiming to enhance model performance in time series analysis.

arXiv cs.AI — 54 d agoTraining

NuWa: Deriving Lightweight Class-Specific Vision Transformers for Edge Devices

NuWa is a novel method for deriving lightweight class-specific Vision Transformers (ViTs) tailored for resource-constrained edge devices, addressing the limitations of existing model compression techniques. It employs self-knowledge purification to eliminate class-detrimental weights and utilizes closed-form optimization to create compact ViTs without the need for post-pruning retraining. Experimental results indicate that NuWa achieves up to 29% higher accuracy on class-specific tasks compared to state-of-the-art training-free pruning methods, with a 33.69x speedup in pruning and a 99.83% reduction in pruning costs, making it highly efficient for practitioners focused on deploying optimized models in edge environments.

arXiv cs.AI — 54 d agoResearch

Conditional Vendi Score: Prompt-Aware Diversity Evaluation for Generative AI Models and LLMs

The article introduces two new diversity evaluation metrics for generative AI models: Conditional-Vendi and Conditional-RKE, which are designed to assess prompt-induced variability in outputs. These metrics leverage conditional entropy from positive semidefinite matrices, with Conditional-RKE achieving an $O(1/\sqrt{n})$ convergence rate and Conditional-Vendi utilizing a truncated-spectrum approximation for scalability. The methods demonstrate effectiveness across various tasks, including text-to-image generation and image captioning, providing practitioners with improved tools for evaluating diversity in prompt-guided generation.

arXiv cs.AI — 54 d agoResearch

The day in AI, distilled.

CITRAS: Covariate-Informed Transformer for Time Series Forecasting

NuWa: Deriving Lightweight Class-Specific Vision Transformers for Edge Devices

Conditional Vendi Score: Prompt-Aware Diversity Evaluation for Generative AI Models and LLMs

Models & Releases

Research

Safety & Security