Puppetry for AI Agents

Machine-readable capabilities and API integration guide

What Puppetry Can Do

🎬 Text → Video

Provide a portrait image + text script → get a talking head video with AI-generated speech

🎙️ Audio → Video

Provide a portrait image + audio file → get a lip-synced talking head video

🎥 Video → Video

Provide a portrait image + driving video → transfer motion to the portrait

🗣️ 500+ AI Voices

65+ languages including Arabic, Hindi, Spanish, French, Japanese, Korean, and more

Machine-Readable Resources

Resource             URL                           Format
OpenAPI Spec         /openapi.json                 OpenAPI 3.1
AI Plugin Manifest   /.well-known/ai-plugin.json   JSON
LLMs.txt             /llms.txt                     Plain text
LLMs Full            /llms-full.txt                Plain text
Robots.txt           /robots.txt                   Plain text
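
An agent can use these resources to discover the API before making any calls. The sketch below resolves the listed paths into absolute URLs and enumerates the operations declared in an OpenAPI document. It assumes the resources are served from https://puppetry.com (the host used in the Quick Start); `RESOURCE_PATHS`, `resource_url`, `list_operations`, and `fetch_openapi_spec` are hypothetical helper names for illustration, not part of Puppetry.

```python
import json
import urllib.request
from urllib.parse import urljoin

BASE_URL = "https://puppetry.com"  # assumption: same host as the API examples

# Paths copied from the resource list; the dictionary keys are made up here.
RESOURCE_PATHS = {
    "openapi": "/openapi.json",
    "ai_plugin": "/.well-known/ai-plugin.json",
    "llms": "/llms.txt",
    "llms_full": "/llms-full.txt",
    "robots": "/robots.txt",
}

# Keys of an OpenAPI path item that denote operations (others, such as
# "parameters" or "summary", are skipped).
HTTP_METHODS = {"get", "put", "post", "delete", "options", "head", "patch", "trace"}


def resource_url(name):
    """Return the absolute URL for one of the resources listed above."""
    return urljoin(BASE_URL, RESOURCE_PATHS[name])


def list_operations(spec):
    """Return (METHOD, path) pairs declared under the spec's "paths" object."""
    operations = []
    for path, item in spec.get("paths", {}).items():
        for key in item:
            if key.lower() in HTTP_METHODS:
                operations.append((key.upper(), path))
    return operations


def fetch_openapi_spec():
    """Download and decode the OpenAPI document (performs a network request)."""
    with urllib.request.urlopen(resource_url("openapi"), timeout=30) as response:
        return json.load(response)
```

Calling `list_operations(fetch_openapi_spec())` would then give a quick inventory of the available endpoints, provided the spec is reachable at the assumed URL.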

Quick Start

# 1. Create a talking head video from text
curl -X POST https://puppetry.com/api/v1/animate/text \
  -H "Authorization: YOUR_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "imageURL": "https://example.com/portrait.png",
    "inputText": "Hello! Welcome to my channel.",
    "outputFormat": "mp4"
  }'

# Response: { "taskId": "abc-123", "status": "PENDING" }

# 2. Poll for completion, using the taskId returned in step 1
curl https://puppetry.com/api/v1/task/abc-123 \
  -H "Authorization: YOUR_API_KEY"

# Response: { "status": "SUCCESS", "video_url": "https://..." }
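
The two curl steps above can be combined into a submit-then-poll loop. The following is a minimal Python sketch using only the standard library. The endpoints, header, and field names are taken from the Quick Start; the helper names (`build_text_request`, `send`, `fetch_task`, `wait_for_task`), the 10-second interval, and the 60-check limit are illustrative assumptions. The sketch also assumes that `PENDING` is the only in-progress status and treats any other non-`SUCCESS` status as a failure, which may need adjusting against the OpenAPI spec.

```python
import json
import time
import urllib.request

BASE_URL = "https://puppetry.com/api/v1"  # from the Quick Start


def build_text_request(api_key, image_url, input_text, output_format="mp4"):
    """Build the POST /animate/text request shown in step 1."""
    payload = {
        "imageURL": image_url,
        "inputText": input_text,
        "outputFormat": output_format,
    }
    return urllib.request.Request(
        f"{BASE_URL}/animate/text",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Authorization": api_key, "Content-Type": "application/json"},
        method="POST",
    )


def send(request):
    """Send a request and decode the JSON response (network call)."""
    with urllib.request.urlopen(request, timeout=30) as response:
        return json.load(response)


def fetch_task(api_key, task_id):
    """GET /task/{task_id}, as in step 2."""
    request = urllib.request.Request(
        f"{BASE_URL}/task/{task_id}", headers={"Authorization": api_key}
    )
    return send(request)


def wait_for_task(task_id, fetch, interval=10.0, max_attempts=60, sleep=time.sleep):
    """Call fetch(task_id) until the task reports SUCCESS, then return it.

    Assumption: PENDING means "still running"; any other status that is not
    SUCCESS is treated as a terminal failure.
    """
    for attempt in range(max_attempts):
        task = fetch(task_id)
        status = task.get("status")
        if status == "SUCCESS":
            return task
        if status != "PENDING":
            raise RuntimeError(f"task {task_id} ended with status {status!r}")
        if attempt < max_attempts - 1:
            sleep(interval)  # wait before the next check
    raise TimeoutError(f"task {task_id} still pending after {max_attempts} checks")
```

A typical call sequence would be `task = send(build_text_request(key, image_url, text))` followed by `wait_for_task(task["taskId"], lambda tid: fetch_task(key, tid))`, after which the returned dictionary should carry `video_url`. Passing `fetch` and `sleep` as parameters keeps the loop testable without network access.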

Integration Ideas

  • 📚 Education platforms: Auto-generate instructor videos from course scripts
  • 📧 Email marketing: Personalized video messages at scale
  • 🛒 E-commerce: Product demo videos from product descriptions
  • 📰 News/content: Convert articles into video summaries
  • 💬 Customer support: Video responses to common questions
  • 🎮 Game dev: NPC dialogue videos from character portraits
  • 🌐 Localization: Translate and re-voice videos in 65+ languages

Frequently Asked Questions

What is Puppetry?

Puppetry is an AI video creation platform that turns any photo into a talking head video. Upload a portrait, type a script or upload audio, and get a realistic lip-synced video in minutes.

How much does Puppetry cost?

Paid plans start at $3/month (Starter). Creator is $15/month and includes 100 videos and all features. Studio is $30/month and includes 300 videos/month, voice cloning, and priority rendering.
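
For budgeting automated workloads, the per-video cost at full quota follows directly from these figures. The short sketch below works it out for Creator and Studio; it assumes the Creator quota of 100 videos is monthly, and it leaves out Starter because no video quota is stated for that plan. `PLANS` and `cost_per_video` are illustrative names.

```python
# Prices and quotas as quoted in the answer above.
PLANS = {
    "Creator": {"usd_per_month": 15, "videos_per_month": 100},  # assumption: monthly quota
    "Studio": {"usd_per_month": 30, "videos_per_month": 300},
}


def cost_per_video(plan):
    """Monthly price divided by monthly quota, i.e. cost when the quota is fully used."""
    details = PLANS[plan]
    return details["usd_per_month"] / details["videos_per_month"]


for name in PLANS:
    print(f"{name}: ${cost_per_video(name):.2f} per video at full quota")
# Creator: $0.15 per video at full quota
# Studio: $0.10 per video at full quota
```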

Can I use my own voice?

Yes. You can upload any audio file up to 5 minutes, or clone your voice with just 30 seconds of sample audio.

What languages does Puppetry support?

Puppetry supports 65+ language codes including Arabic, Hindi, Spanish, French, Japanese, Korean, Mandarin, and many more. We offer 500+ AI voices across these languages.

Can I use Puppetry videos commercially?

Yes, all paid plans (Starter, Creator, Studio) include full commercial licensing rights.

Does Puppetry have an API?

Yes. All plans include API access for programmatic video generation, so you can integrate Puppetry into automated workflows.

How long does it take to generate a video?

Most videos render in under 10 minutes; the exact time depends on length and complexity.

Can I add captions to my videos?

Yes. Caption Studio provides automatic speech-to-text transcription, multiple caption styles, word-level highlighting, and exports with burned-in captions or SRT/VTT files.

Limits & Pricing

The free tier includes a limited number of video generations. Paid plans offer higher quotas and no watermarks. See pricing for details.

For enterprise API access or custom integrations, contact support@puppetry.com.