These are tools that either generate video (from text, images or video) or heavily assist video creators with AI enhancements:
Google Veo / Veo 3
Text → 8-second videos with built-in audio, cinematic motion, camera control. (Gemini) Very current “flagship” from Google. Supports vertical format now. (The Verge)
OpenAI Sora
Text → video, with community remixing features. (Zapier) Still maturing; model versions and limits vary.
Runway (Gen, Act-Two, etc.)
Image / video editing & generative video modeling, transformation, “video to video” style edits. (MASV) Very popular among creators for flexibility.
Synthesia
Text → avatar video generation (talking heads, narrators) in many languages. (Synthesia) Great for corporate / educational / explainer content.
InVideo AI
Text → video with scenes, transitions, voice, etc. (Invideo) Good for social, shorter content.
Pictory
Script / text → video with visuals, animations. (HubSpot Blog) Helpful for repurposing content (e.g. blog → video).
Hailuo MiniMax / Hailuo AI
Short video generation with strong prompt adherence. (Tom's Guide) Emerging tool; useful for experimental creative clips.
Kling AI
Visual realism, smooth motion, lip-syncing capabilities. (Tom's Guide) Often cited in comparison reviews.
Luma / Luma Labs
Image-to-video, 3D motion from static frames, transformation tools. (Tom's Guide) Especially useful when animating illustrations or renders.
Pika (Pika Labs)
Generative video from prompts / stylized animations. (Tom's Guide) Creative / experimental use.
HeyGen
Text / image / audio → video with narration, visuals, avatars. (HeyGen) Flexible option in the “all-in-one” space.
Tools / Models Frequently Used in the AI Video / Visual Workflow (Complementary)
These may not always generate full video themselves, but are heavily used by creators in their pipelines (for images, referencing, style consistency, assets, etc.):
Nano Banana (Gemini’s image model)
High-fidelity image generation & editing, character consistency, style preservation.
Not a full video tool, but often used to generate visuals that are later animated or included in videos. (Analytics Vidhya)
Midjourney / Stable Diffusion / DALL·E
Generative images, concept art, backgrounds, assets.
Many video creators start with stills / assets before animating.
ChatGPT / Large Language Models
Prompt engineering, script generation, scene description, storyboards.
Vital for turning ideas into structured prompts or video plans.
Audio / Voice Models (ElevenLabs, etc.)
Speech generation, lip sync, voiceovers.