Click to zoomOverview
A fully automated video production pipeline that takes a text brief or landing page URL and outputs a complete 30-second vertical video ad with AI voiceover, B-roll footage, styled captions, and cloud delivery -- no manual editing required.
Stack
What needed solving
Creating video ads requires a full production team: copywriters, voiceover artists, video editors, and caption designers. Most small and mid-size brands cannot afford this pipeline or the time it takes.
How I approached it
A single n8n workflow that replaces the entire video production pipeline. Feed it a landing page or offer text and it outputs a complete 30-second vertical ad with cinematic B-roll, AI voiceover, synced captions, and automatic Google Drive delivery.
Step by step
- 01
GPT generates a 5 to 8 scene ad storyboard following Hook, Problem, Solution, Benefit, Call-to-Action structure
- 02
ElevenLabs generates a natural-sounding voiceover from the script
- 03
Pexels API matches each scene keyword to royalty-free B-roll footage
- 04
Whisper transcribes the voiceover and generates SRT captions with precise timing
- 05
Shotstack compiles all assets -- voiceover, B-roll, and captions -- into a rendered video
- 06
Caption overlay is styled and positioned with transparent background layers
- 07
Final MP4 is automatically uploaded to Google Drive for delivery
Outcomes
- ✓
30-second vertical video ad produced end-to-end with no manual editing
- ✓
Full production pipeline replaced by a single workflow
- ✓
Consistent output quality across every render
- ✓
Assets delivered automatically to Google Drive on completion
Key features
Cinematic B-roll matched to scene keywords via Pexels API
AI voiceover synced precisely to video timing via ElevenLabs
Auto-generated captions with styled overlay via Shotstack
Entire pipeline from text brief to final MP4 in one n8n workflow