Training
GRID: Scaling Task-Agnostic Inference in Continual Prompt Tuning
GRID is a new framework for prompt-based continual learning that enhances the scalability and efficiency of adapting large language models (LLMs) across task sequences. It features an output-space-aware decoding mechanism and a gradient-guided prompt selection strategy that aggregates less informative prompts, resulting in improved backward transfer and reduced memory usage on models like T5, Qwen, and LLaMA. This development is significant for practitioners as it provides a more efficient approach to continual learning without the need for task identifiers, enabling better performance across diverse tasks in LLM applications.
prompt tuningcontinual learningllm