Injects reference image features into Anima’s DiT via decoupled cross-attention, enabling character-consistent image generation.