This task can be performed using Seedance 2
Seedance 2: The Future of Multimodal Video Creation
Best product for this task
Seedance 2 is a powerful multimodal video generation model that supports text, image, video, and audio inputs, allowing creators to combine references freely and produce highly controllable, cinematic-quality videos.

What to expect from an ideal product
- Combine text prompts with reference images to guide the visual style and composition of your video scenes
- Upload audio tracks that automatically influence the video's pacing, mood, and visual elements to match the sound
- Use existing video clips as style references while adding your own text descriptions to create new scenes with similar cinematic quality
- Mix different input types in a single project to maintain consistent visual themes across multiple video segments
- Control specific visual elements like lighting, camera angles, and scene transitions by providing targeted image and text references together
