Agents Directory
SkillsRankingsAgents
CategoriesModelsBenchmarksCompareAgent LeaderboardSkillsRankingsAgentsAbout

ai-image-generation

SkillCommunityAudit warnings

Generate AI images with GPT-Image-2, FLUX, Gemini, Grok, Seedream, Reve and 50+ models via inference.sh CLI. Models: GPT-Image-2, FLUX Dev LoRA, FLUX.2 Klein LoRA, Gemini 3 Pro Image, Grok Imagine, Seedream 4.5, Reve, ImagineArt. Capabilities: text-to-image, image-to-image, inpainting, LoRA, image editing, upscaling, text rendering. Use for: AI art, product mockups, concept art, social media graphics, marketing visuals, illustrations. Triggers: flux, image generation, ai image, text to image, stable diffusion, generate image, ai art, midjourney alternative, dall-e alternative, text2img, t2i, image generator, ai picture, create image with ai, generative ai, ai illustration, grok image, gemini image, gpt image, openai image, chatgpt image

Compatibility:
Hermes logoHermesOpenClaw logoOpenClawClaude Code logoClaude CodeCodex logoCodex
Visit ai-image-generation
Install:
npx skills add qu-skills/skills --skill ai-image-generation
View on skills.shInstall source

Install the belt CLI skill: npx skills add belt-sh/cli

AI Image Generation

Generate images with 50+ AI models via inference.sh CLI.

AI Image Generation

Quick Start

Requires inference.sh CLI (belt). Install instructions

belt login

# Generate an image with FLUX
belt app run falai/flux-dev-lora --input '{"prompt": "a cat astronaut in space"}'

Available Models

ModelApp IDBest For
GPT-Image-2openai/gpt-image-2Text-to-image, editing, inpainting
FLUX Dev LoRAfalai/flux-dev-loraHigh quality with custom styles
FLUX.2 Klein LoRAfalai/flux-2-klein-loraFast with LoRA support (4B/9B)
P-Imagepruna/p-imageFast, economical, multiple aspects
P-Image-LoRApruna/p-image-loraFast with preset LoRA styles
P-Image-Editpruna/p-image-editFast image editing
Gemini 3 Progoogle/gemini-3-pro-image-previewGoogle's latest
Gemini 2.5 Flashgoogle/gemini-2-5-flash-imageFast Google model
Grok Imaginexai/grok-imagine-imagexAI's model, multiple aspects
Seedream 4.5bytedance/seedream-4-52K-4K cinematic quality
Seedream 4.0bytedance/seedream-4-0High quality 2K-4K
Seedream 3.0bytedance/seedream-3-0-t2iAccurate text rendering
Revefalai/reveNatural language editing, text rendering
ImagineArt 1.5 Profalai/imagine-art-1-5-pro-previewUltra-high-fidelity 4K
FLUX Klein 4Bpruna/flux-klein-4bUltra-cheap ($0.0001/image)
Topaz Upscalerfalai/topaz-image-upscalerProfessional upscaling

Browse All Image Apps

belt app store --category image

Examples

GPT-Image-2

belt app run openai/gpt-image-2 --input '{
  "prompt": "professional product photo of sneakers, studio lighting",
  "quality": "high"
}'

GPT-Image-2 Editing

belt app run openai/gpt-image-2 --input '{
  "prompt": "change the background to a beach at sunset",
  "images": ["https://your-image.jpg"]
}'

Text-to-Image with FLUX

belt app run falai/flux-dev-lora --input '{
  "prompt": "professional product photo of a coffee mug, studio lighting"
}'

Fast Generation with FLUX Klein

belt app run falai/flux-2-klein-lora --input '{"prompt": "sunset over mountains"}'

Google Gemini 3 Pro

belt app run google/gemini-3-pro-image-preview --input '{
  "prompt": "photorealistic landscape with mountains and lake"
}'

Grok Imagine

belt app run xai/grok-imagine-image --input '{
  "prompt": "cyberpunk city at night",
  "aspect_ratio": "16:9"
}'

Reve (with Text Rendering)

belt app run falai/reve --input '{
  "prompt": "A poster that says HELLO WORLD in bold letters"
}'

Seedream 4.5 (4K Quality)

belt app run bytedance/seedream-4-5 --input '{
  "prompt": "cinematic portrait of a woman, golden hour lighting"
}'

Image Upscaling

belt app run falai/topaz-image-upscaler --input '{"image_url": "https://..."}'

Stitch Multiple Images

belt app run infsh/stitch-images --input '{
  "images": ["https://img1.jpg", "https://img2.jpg"],
  "direction": "horizontal"
}'

Related Skills

# Full platform skill (all 250+ apps)
npx skills add inference-sh/skills@infsh-cli

# Pruna P-Image (fast & economical)
npx skills add inference-sh/skills@p-image

# GPT-Image-2 (OpenAI)
npx skills add inference-sh/skills@gpt-image

# FLUX-specific skill
npx skills add inference-sh/skills@flux-image

# Upscaling & enhancement
npx skills add inference-sh/skills@image-upscaling

# Background removal
npx skills add inference-sh/skills@background-removal

# Video generation
npx skills add inference-sh/skills@ai-video-generation

# AI avatars from images
npx skills add inference-sh/skills@ai-avatar-video

Browse all apps: belt app store

Documentation

  • Running Apps - How to run apps via CLI
  • Image Generation Example - Complete image generation guide
  • Apps Overview - Understanding the app ecosystem
Share:
Details:
  • Installs


    298,888
  • First seen


    Jun 14, 2026
Security audits
Gen Agent Trust HubPASS
SocketWARN
SnykPASS (low risk)
View Repository

Auto-fetched from GitHub 6 days ago.

Stats via skills.sh.

Skills similar to ai-image-generation:

Website favicon

 

 
 
  • Installs


Website favicon

 

 
 
  • Installs


Website favicon

 

 
 
  • Installs


Browse:SkillsRankingsModelsBenchmarksProvidersAgentsAgent LeaderboardCompareCategories
Quick Links:AboutBlog

© 2026 Agents Directory

Skills similar to ai-image-generation:

gpt-image-edit

Skill
This skill provides access to the OpenAI GPT Image 2 edit endpoint via the RunComfy CLI. It is designed for tasks requiring high-fidelity image modifications, such as multilingual text replacement, layout adjustments, and multi-image composition while preserving subject identity.
Image & Video
Perform targeted image edits and text replacements using the OpenAI GPT Image 2 model.
  • Installs


    283,964

ai-video-generation

Skill
Generate AI videos with Google Veo, Seedance 2.0, HappyHorse, Wan, Grok and 40+ models via inference.sh CLI. Models: Veo 3.1, Veo 3, Seedance 2.0, HappyHorse 1.0, Wan 2.5, Grok Imagine Video, OmniHuman, Fabric, HunyuanVideo. Capabilities: text-to-video, image-to-video, reference-to-video, video editing, lipsync, avatar animation, video upscaling, foley sound. Use for: social media videos, marketing content, explainer videos, product demos, AI avatars. Triggers: video generation, ai video, text to video, image to video, veo, animate image, video from image, ai animation, video generator, generate video, t2v, i2v, ai video maker, create video with ai, runway alternative, pika alternative, sora alternative, kling alternative, seedance, happyhorse
Generate AI videos with Google Veo, Seedance 2.0, HappyHorse, Wan, Grok and 40+ models via inference.sh CLI.
  • Installs


    299,310

ai-avatar-video

Skill
Create AI avatar and talking head videos via inference.sh CLI. Recommended: P-Video-Avatar (fastest, cheapest, built-in TTS). Also: OmniHuman, Fabric, PixVerse. Audio: Inworld TTS-2 (100+ languages, emotion steering for characters), ElevenLabs, Kokoro. Capabilities: audio-driven avatars, text-to-avatar, lipsync videos, talking head generation, virtual presenters, UGC content. Use for: AI presenters, explainer videos, virtual influencers, dubbing, marketing videos, UGC ads, gaming avatars, NPC dialogue. Triggers: ai avatar, talking head, lipsync, avatar video, virtual presenter, ai spokesperson, audio driven video, heygen alternative, synthesia alternative, talking avatar, lip sync, video avatar, ai presenter, digital human, ugc, ugc video, ugc ad, avatar ugc
Create AI avatar and talking head videos via inference.sh CLI.
  • Installs


    298,843