Taekyung Ki

I am a researcher at KAIST MLAI, working on generative models and computer vision. From March 2022 to March 2025, I conducted research on video generation as part of my mandatory military service in South Korea. I began working in deep learning in February 2021. I received my M.S. in Mathematics in February 2021 and my B.S. in Mathematics in February 2019.

I am interested in the following research topics:

  • Generative models (score models, flow models, one-step models, etc.)
  • Interactive visual avatar agents
  • Video generation
  • Audio-visual and vision-language representation

If you are interested in any of these topics, feel free to drop me an e-mail.

Email  /  Scholar  /  Twitter  /  GitHub  /  Hugging Face  /  LinkedIn


Highlights

  • [June 2025] One paper accepted to ICCV 2025!
  • [June 2025] New preprint out: Frame Guidance, a training-free frame-level guidance method for large video diffusion models
  • [Mar. 2025] Joined KAIST MLAI as a researcher
  • [July 2024] One paper accepted to ECCV 2024
  • [Mar. 2022 - Mar. 2025] Mandatory military service in South Korea


Publications

*: Equal contribution, †: Corresponding author


Frame Guidance is a training-free guidance method for large-scale video diffusion models that enables frame-level controllable video generation using only a single GPU.

FLOAT: Generative Motion Latent Flow Matching for Audio-driven Talking Portrait
Taekyung Ki, Dongchan Min, Gyeongsu Chae
International Conference on Computer Vision (ICCV), 2025
Project Page / arXiv / Code / Hugging Face

FLOAT is a flow matching-based audio-driven talking portrait generation method that can automatically enhance speech-driven emotional motion and edit head motion at test time using an implicitly learned motion orthonormal basis.

Learning to Generate Conditional Tri-plane for 3D-aware Expression Controllable Portrait Animation
Taekyung Ki, Dongchan Min, Gyeongsu Chae
European Conference on Computer Vision (ECCV), 2024
Project Page / Paper / arXiv / Supp

We propose (1) CLeBS, a contrastive pre-training framework for appearance-free facial expressions hidden in 3DMM expression parameters, and (2) Export3D, a 3D-aware, expression-controllable portrait animation method that leverages CLeBS and NeRF.

StyleLipSync: Style-based Personalized Lip-sync Video Generation
Taekyung Ki*, Dongchan Min*
International Conference on Computer Vision (ICCV), 2023
Project Page / Paper / arXiv / Code / Supp

StyleLipSync generates person-agnostic, audio-lip-synchronized videos by leveraging the strong facial prior of a style-based generator.

Deep Scattering Network with Max-pooling
Taekyung Ki, Youngmi Hur
IEEE Data Compression Conference (DCC), 2021
Paper / Code

We mathematically prove that the pooling operator is a crucial component for translation-invariant feature extraction in scattering networks.


Awards and Honors

  • [Oct. 2021] 1st Prize Winner, NLP-based Math-Word Problem Track in the 2021 AI Grand Challenge (AGC), funded by the South Korean Ministry of Science and ICT.
  • [Feb. 2019] Graduated with top honors, ranked 1st in the Department of Mathematics.


Academic Services

  • Reviewer: CVPR 2025, ICCV 2025.

This page is based on Jon Barron's website template.