vision transformer | Diptesh Kanojia

StableTalk: Advancing Audio-to-Talking Face Generation with Stable Diffusion and Vision Transformer

Audio-to-talking face generation stands at the forefront of advancements in generative AI. It bridges the gap between audio and visual representations by generating synchronized and realistic talking faces. Despite recent progress, the lack of …