Agents Directory

ai-video-generation

Skill, Community, Audit warnings

Generate AI videos with Google Veo, Seedance 2.0, HappyHorse, Wan, Grok and 40+ models via inference.sh CLI.
Models: Veo 3.1, Veo 3, Seedance 2.0, HappyHorse 1.0, Wan 2.5, Grok Imagine Video, OmniHuman, Fabric, HunyuanVideo.
Capabilities: text-to-video, image-to-video, reference-to-video, video editing, lipsync, avatar animation, video upscaling, foley sound.
Use for: social media videos, marketing content, explainer videos, product demos, AI avatars.
Triggers: video generation, ai video, text to video, image to video, veo, animate image, video from image, ai animation, video generator, generate video, t2v, i2v, ai video maker, create video with ai, runway alternative, pika alternative, sora alternative, kling alternative, seedance, happyhorse

Compatibility: Hermes, OpenClaw, Claude Code, Codex
Install:
npx skills add qu-skills/skills --skill ai-video-generation

Install the belt CLI skill: npx skills add belt-sh/cli

AI Video Generation

Generate videos with 40+ AI models via inference.sh CLI.

Quick Start

Requires the inference.sh CLI (belt); see its install instructions.

belt login

# Generate a video with Veo
belt app run google/veo-3-1-fast --input '{"prompt": "drone shot flying over a forest"}'
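
A prompt that contains double quotes or backslashes would break the inline JSON in the command above. The following is a minimal sketch of one way to build the `--input` payload from a shell variable, assuming a POSIX shell with `sed`. The helper names `json_escape` and `prompt_payload` are hypothetical and not part of belt, and newlines or other control characters in the prompt are not handled.

```shell
# Hypothetical helper (not part of belt): escape backslashes and double
# quotes so a string can be embedded in a JSON string literal.
# Newlines and other control characters are NOT handled by this sketch.
json_escape() {
  printf '%s' "$1" | sed -e 's/\\/\\\\/g' -e 's/"/\\"/g'
}

# Hypothetical helper: print a {"prompt": ...} payload suitable for --input.
prompt_payload() {
  printf '{"prompt": "%s"}' "$(json_escape "$1")"
}
```

With these defined, the Quick Start command could be written as `belt app run google/veo-3-1-fast --input "$(prompt_payload "$PROMPT")"`, where `PROMPT` is a variable you set yourself.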

Available Models

Text-to-Video

| Model | App ID | Best For |
|---|---|---|
| Veo 3.1 Fast | google/veo-3-1-fast | Fast, with optional audio |
| Veo 3.1 | google/veo-3-1 | Best quality, frame interpolation |
| Veo 3 | google/veo-3 | High quality with audio |
| Veo 3 Fast | google/veo-3-fast | Fast with audio |
| Veo 2 | google/veo-2 | Realistic videos |
| P-Video | pruna/p-video | Fast, economical, with audio support |
| WAN-T2V | pruna/wan-t2v | Economical 480p/720p |
| Grok Video | xai/grok-imagine-video | xAI, configurable duration |
| Seedance 2.0 | bytedance/seedance-2-0 | Text/image/ref-to-video with sync audio, up to 1080p |
| Seedance 2.0 Fast | bytedance/seedance-2-0-fast | Fast variant, same capabilities |
| HappyHorse T2V | alibaba/happyhorse-1-0-t2v | Physically realistic, up to 15s |

Image-to-Video

| Model | App ID | Best For |
|---|---|---|
| Wan 2.5 | falai/wan-2-5 | Animate any image |
| Wan 2.5 I2V | falai/wan-2-5-i2v | High quality i2v |
| WAN-I2V | pruna/wan-i2v | Economical 480p/720p |
| P-Video | pruna/p-video | Fast i2v with audio |
| Seedance 2.0 | bytedance/seedance-2-0 | Animate images with sync audio, up to 1080p |
| Seedance 2.0 Fast | bytedance/seedance-2-0-fast | Fast variant, same capabilities |
| HappyHorse I2V | alibaba/happyhorse-1-0-i2v | Animate images, up to 1080P/15s |
| HappyHorse R2V | alibaba/happyhorse-1-0-r2v | Character-preserving from references |

Avatar / Lipsync

| Model | App ID | Best For |
|---|---|---|
| OmniHuman 1.5 | bytedance/omnihuman-1-5 | Multi-character |
| OmniHuman 1.0 | bytedance/omnihuman-1-0 | Single character |
| Fabric 1.0 | falai/fabric-1-0 | Image talks with lipsync |
| PixVerse Lipsync | falai/pixverse-lipsync | Realistic lipsync |

Video Editing

| Model | App ID | Best For |
|---|---|---|
| HappyHorse Edit | alibaba/happyhorse-1-0-video-edit | Natural language video editing |

Utilities

| Tool | App ID | Description |
|---|---|---|
| HunyuanVideo Foley | infsh/hunyuanvideo-foley | Add sound effects to video |
| Topaz Upscaler | falai/topaz-video-upscaler | Upscale video quality |
| Media Merger | infsh/media-merger | Merge videos with transitions |
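
For scripting, the tables above can be condensed into a small lookup. This is only a sketch: the function name `app_for_task` and the task labels are invented for illustration, and the choice of one app per task is arbitrary. Only the app IDs themselves come from the tables.

```shell
# Hypothetical helper: map a task label (labels invented for this sketch)
# to one app ID taken from the tables above.
app_for_task() {
  case "$1" in
    t2v-fast)    echo "google/veo-3-1-fast" ;;
    t2v-quality) echo "google/veo-3-1" ;;
    i2v)         echo "falai/wan-2-5" ;;
    avatar)      echo "bytedance/omnihuman-1-5" ;;
    edit)        echo "alibaba/happyhorse-1-0-video-edit" ;;
    foley)       echo "infsh/hunyuanvideo-foley" ;;
    upscale)     echo "falai/topaz-video-upscaler" ;;
    merge)       echo "infsh/media-merger" ;;
    *)           echo "unknown task: $1" >&2; return 1 ;;
  esac
}
```

A call such as `belt app run "$(app_for_task t2v-fast)" --input '{"prompt": "..."}'` would then keep the app choice in one place.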

Browse All Video Apps

belt app store --category video

Examples

Text-to-Video with Veo

belt app run google/veo-3-1-fast --input '{
  "prompt": "A timelapse of a flower blooming in a garden"
}'

Grok Video

belt app run xai/grok-imagine-video --input '{
  "prompt": "Waves crashing on a beach at sunset",
  "duration": 5
}'

Image-to-Video with Wan 2.5

belt app run falai/wan-2-5 --input '{
  "image_url": "https://your-image.jpg"
}'

AI Avatar / Talking Head

belt app run bytedance/omnihuman-1-5 --input '{
  "image_url": "https://portrait.jpg",
  "audio_url": "https://speech.mp3"
}'

Fabric Lipsync

belt app run falai/fabric-1-0 --input '{
  "image_url": "https://face.jpg",
  "audio_url": "https://audio.mp3"
}'

Seedance 2.0 Text-to-Video with Audio

belt app run bytedance/seedance-2-0 --input '{
  "prompt": "a jazz band performing in a dimly lit club",
  "generate_audio": true,
  "duration": 10
}'

Seedance 2.0 Image-to-Video

belt app run bytedance/seedance-2-0 --input '{
  "image": "https://your-image.jpg",
  "prompt": "gentle camera movement, leaves rustling in the wind",
  "generate_audio": true
}'
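
The image input key is not the same across the examples in this document: the Wan 2.5 example uses `image_url`, while the Seedance 2.0 example uses `image`. A sketch of a helper that picks the key per app follows. It covers only the two apps whose keys appear here, the function names are hypothetical, and the schema of any other app should be checked rather than assumed.

```shell
# Hypothetical helper: return the image input key used in this document's
# examples for a given app. Only these two apps are covered.
image_key_for() {
  case "$1" in
    falai/wan-2-5)          echo "image_url" ;;
    bytedance/seedance-2-0) echo "image" ;;
    *) echo "no example in this document for: $1" >&2; return 1 ;;
  esac
}

# Hypothetical helper: print a minimal image-to-video payload for an app.
# The URL is assumed to contain no double quotes or backslashes.
i2v_payload() {
  key=$(image_key_for "$1") || return 1
  printf '{"%s": "%s"}' "$key" "$2"
}
```

For example, `belt app run falai/wan-2-5 --input "$(i2v_payload falai/wan-2-5 "$IMAGE_URL")"`, where `IMAGE_URL` is a variable you set yourself.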

Seedance 2.0 Reference-to-Video

belt app run bytedance/seedance-2-0 --input '{
  "prompt": "A person who looks like the reference walking through a garden",
  "reference_image": "https://portrait.jpg",
  "generate_audio": true
}'

HappyHorse Text-to-Video

belt app run alibaba/happyhorse-1-0-t2v --input '{
  "prompt": "a golden retriever running through autumn leaves, slow motion",
  "duration": 10,
  "resolution": "1080P"
}'

HappyHorse Video Editing

belt app run alibaba/happyhorse-1-0-video-edit --input '{
  "video": "https://your-video.mp4",
  "prompt": "change the background to a snowy mountain landscape"
}'

PixVerse Lipsync

belt app run falai/pixverse-lipsync --input '{
  "image_url": "https://portrait.jpg",
  "audio_url": "https://speech.mp3"
}'

Video Upscaling

belt app run falai/topaz-video-upscaler --input '{"video_url": "https://..."}'

Add Sound Effects (Foley)

belt app run infsh/hunyuanvideo-foley --input '{
  "video_url": "https://silent-video.mp4",
  "prompt": "footsteps on gravel, birds chirping"
}'

Merge Videos

belt app run infsh/media-merger --input '{
  "videos": ["https://clip1.mp4", "https://clip2.mp4"],
  "transition": "fade"
}'
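
When the number of clips is not fixed, the `videos` array can be assembled in the shell instead of written by hand. A minimal sketch, assuming a POSIX shell and URLs free of double quotes and backslashes; `merge_payload` is a hypothetical helper, and the only fields it emits are the two shown in the example above.

```shell
# Hypothetical helper: build the media-merger payload from a transition
# name followed by any number of clip URLs.
merge_payload() {
  transition=$1
  shift
  list=""
  for url in "$@"; do
    # Append the quoted URL, adding ", " only between entries.
    list="${list:+$list, }\"$url\""
  done
  printf '{"videos": [%s], "transition": "%s"}' "$list" "$transition"
}
```

Usage would look like `belt app run infsh/media-merger --input "$(merge_payload fade "$CLIP1" "$CLIP2" "$CLIP3")"`, with the clip variables set to your own URLs.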

Related Skills

# Full platform skill (all 250+ apps)
npx skills add inference-sh/skills@infsh-cli

# Pruna P-Video (fast & economical)
npx skills add inference-sh/skills@p-video

# Google Veo specific
npx skills add inference-sh/skills@google-veo

# Seedance 2.0
npx skills add inference-sh/skills@seedance

# HappyHorse 1.0
npx skills add inference-sh/skills@happyhorse

# AI avatars & lipsync
npx skills add inference-sh/skills@ai-avatar-video

# Text-to-speech (for video narration)
npx skills add inference-sh/skills@text-to-speech

# Image generation (for image-to-video)
npx skills add inference-sh/skills@ai-image-generation

# Twitter (post videos)
npx skills add inference-sh/skills@twitter-automation
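
To install several of the related skills at once, the commands above can be generated in a loop. The sketch below only prints the commands so they can be reviewed first; `install_cmds` is a hypothetical helper name, and the skill names passed to it are the ones listed above.

```shell
# Hypothetical helper: print (not run) one install command per skill name.
install_cmds() {
  for skill in "$@"; do
    echo "npx skills add inference-sh/skills@$skill"
  done
}
```

Running `install_cmds google-veo seedance happyhorse` prints three commands; piping that output to `sh` would execute them.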

Browse all apps: belt app store

Documentation

  • Running Apps - How to run apps via CLI
  • Streaming Results - Real-time progress updates
  • Content Pipeline Example - Building media workflows
Details:
  • Installs: 299,310
  • First seen: Jun 14, 2026
Security audits:
  • Gen Agent Trust Hub: PASS
  • Socket: WARN
  • Snyk: WARN (medium risk)

Auto-fetched from GitHub 6 days ago.

Stats via skills.sh.

© 2026 Agents Directory

Skills similar to ai-video-generation:

happyhorse-1-0

Skill
HappyHorse 1.0 is a text-to-video model hosted on the RunComfy Model API that generates 1080p video with synchronized audio. It is designed for multi-shot storytelling and maintaining character consistency across generated clips.
Image & Video
Generate 1080p video with synchronized audio and character consistency using the HappyHorse 1.0 model.
  • Installs: 283,823

ai-image-generation

Skill
Generate AI images with GPT-Image-2, FLUX, Gemini, Grok, Seedream, Reve and 50+ models via inference.sh CLI. Models: GPT-Image-2, FLUX Dev LoRA, FLUX.2 Klein LoRA, Gemini 3 Pro Image, Grok Imagine, Seedream 4.5, Reve, ImagineArt. Capabilities: text-to-image, image-to-image, inpainting, LoRA, image editing, upscaling, text rendering. Use for: AI art, product mockups, concept art, social media graphics, marketing visuals, illustrations. Triggers: flux, image generation, ai image, text to image, stable diffusion, generate image, ai art, midjourney alternative, dall-e alternative, text2img, t2i, image generator, ai picture, create image with ai, generative ai, ai illustration, grok image, gemini image, gpt image, openai image, chatgpt image
Generate AI images with GPT-Image-2, FLUX, Gemini, Grok, Seedream, Reve and 50+ models via inference.sh CLI.
  • Installs: 298,888

ai-avatar-video

Skill
Create AI avatar and talking head videos via inference.sh CLI. Recommended: P-Video-Avatar (fastest, cheapest, built-in TTS). Also: OmniHuman, Fabric, PixVerse. Audio: Inworld TTS-2 (100+ languages, emotion steering for characters), ElevenLabs, Kokoro. Capabilities: audio-driven avatars, text-to-avatar, lipsync videos, talking head generation, virtual presenters, UGC content. Use for: AI presenters, explainer videos, virtual influencers, dubbing, marketing videos, UGC ads, gaming avatars, NPC dialogue. Triggers: ai avatar, talking head, lipsync, avatar video, virtual presenter, ai spokesperson, audio driven video, heygen alternative, synthesia alternative, talking avatar, lip sync, video avatar, ai presenter, digital human, ugc, ugc video, ugc ad, avatar ugc
Create AI avatar and talking head videos via inference.sh CLI.
  • Installs: 298,843