Research
Multi-Granular Node Pruning for Causal Circuit Discovery
The article presents a novel multi-granular node pruning framework for causal circuit discovery in large language models (LLMs), which enhances the scalability and granularity of existing methods. By utilizing learnable masks at various levels, from entire blocks to individual neurons, the approach achieves a 5-10x reduction in memory usage and identifies smaller, more efficient subnetworks while maintaining task performance. This method allows practitioners to optimize model architectures more effectively by targeting specific neuron-level contributions, potentially leading to more efficient deployments of LLMs.
circuit discoveryLLMspruning