Models
Making Time Editable in Video Diffusion Transformers
The paper adds explicit time editing to a pretrained video Diffusion Transformer (DiT). A lightweight temporal module gives control over motion speed and temporal structure while preserving the model's original generative prior. For practitioners, this means the temporal dynamics of generated video can be manipulated without extensive model redesign.
video, diffusion, temporal control