: Generates a realistic, personalized talking-head video using a portrait and voice sample.
You can find more details, the full paper, and video demos on the official Paper2Video Project Page .
: Uses Vision-Language Models (VLMs) to create narration subtitles and visual-focus prompts.
: Automatically generates and refines LaTeX-based slides from the paper's text.
: Generates a realistic, personalized talking-head video using a portrait and voice sample.
You can find more details, the full paper, and video demos on the official Paper2Video Project Page .
: Uses Vision-Language Models (VLMs) to create narration subtitles and visual-focus prompts.
: Automatically generates and refines LaTeX-based slides from the paper's text.