Sora 2 Prompt Examples: 50 Production-Ready Prompts for AI Video
Sora 2 interprets prompts like a director briefing a camera crew. The model prioritizes subject behavior, character performance, and narrative events β what happens in the frame matters more than how the camera captures it. This means effective Sora 2 prompting focuses on action, interaction, and story beats first, then layers in camera and style direction.

These 50 prompts are structured to leverage Sora 2's specific strengths: complex multi-element scenes, extended duration up to 25 seconds, strong character performance, dialogue generation, and physics-aware motion. Each prompt is tested, production-ready, and organized by use case. You can test Sora 2 directly in the AI Video Generator β launch from the Sora 2 model page.
For a complete technical breakdown of Sora 2's architecture and production workflows, see the full Sora 2 guide.
How Sora 2 Prompting Differs
Sora 2 responds best when you describe what happens, not just what exists. A prompt that reads like a storyboard β with clear action beats, character behavior, and temporal progression β outperforms a prompt that reads like a photograph description.
Action-first structure: Lead with what the subject does, then describe the environment, camera, and style. Sora 2 allocates the most attention to the first elements it encounters.
Narrative beats: Break complex actions into sequential stages. "She enters the room, pauses at the doorway, then walks to the window" gives the model three clear temporal anchors.
Dialogue block: When using audio, place dialogue in a dedicated section with speaker attribution. Short, natural lines (under 15 words) produce the most reliable lip-sync.
Duration awareness: Sora 2 generates 4-25 second clips. Shorter clips (4-8s) produce tighter results. Longer clips (12-25s) work best with clear pacing described in the prompt.
Style as context: Append visual style as a modifier rather than leading with it. "A woman runs through rain β shot on 35mm, shallow depth of field, warm grain" lets the model prioritize the action.
Launch the AI Video Generator and select Sora 2 to start generating.
Narrative and Character Prompts (1-10)
1. Character Introduction β Walk and Reveal
A woman in a dark green coat walks slowly down a tree-lined street in autumn.
Leaves fall around her. She stops, turns to look at something off-camera,
and a slight smile crosses her face. The camera follows her at shoulder height
in a smooth tracking shot. Late afternoon golden light. Shot on 35mm film,
warm color grade with soft grain.
Settings: 12s, 16:9, Standard
2. Emotional Close-Up β Reaction Shot
Close-up of a man's face as he reads a letter. His expression shifts gradually
from neutral to surprise to quiet emotion. His eyes glisten. He lowers the letter
slowly and looks up toward a window. Soft diffused light from camera left.
Shallow depth of field. Intimate documentary feel.
Settings: 8s, 16:9, Pro
3. Two Characters β Dialogue Exchange
Medium two-shot inside a warm cafe. A woman across the table leans forward
and speaks. A man listens, nods, then responds. Natural conversational rhythm
with pauses between lines. Warm pendant lighting from above, shallow depth of field.
Dialogue:
[Woman]: "I think we should go. Before the season changes."
[Man]: "You always say that. And we never do."
Audio: quiet cafe ambience, ceramic cups, distant conversation.
Settings: 12s, 16:9, Pro
4. Child Discovery β Emotional Arc
A young girl crouches beside a puddle on a rainy sidewalk. She notices her
reflection and reaches toward the water. Her finger touches the surface and
ripples distort the reflection. She laughs and looks up at someone off-camera.
Overcast daylight, muted colors. Camera at child's eye level. Handheld,
documentary intimacy. Raindrops visible in shallow depth of field foreground.
Settings: 10s, 16:9, Standard
5. Solo Performance β Musician
A man sits alone on a wooden stool in a dimly lit room, playing acoustic guitar.
His fingers move naturally on the fretboard. He sways slightly to the rhythm.
A single warm spotlight from above, the rest of the room in shadow.
Medium shot showing full upper body and guitar. Smoke or dust particles
drift through the light beam. Intimate performance aesthetic.
Audio: acoustic guitar melody, room ambience, subtle string vibration.
Settings: 15s, 16:9, Pro
6. Morning Routine β Montage Feel
A woman wakes up in a sunlit bedroom. She stretches, sits up, and runs
her hand through her hair. She walks to the kitchen, pours coffee from
a french press, and stands at the window holding the mug with both hands.
Morning light floods the apartment. Camera follows her through the space
with smooth steadicam movement. Naturalistic, slice-of-life aesthetic.
Warm golden tones. 24fps cinematic cadence.
Settings: 20s, 16:9, Standard

7. Elderly Character β Portrait in Motion
An elderly man in a worn cardigan sits at a wooden workbench in a cluttered
workshop. He picks up a small wooden figure he has been carving and turns it
in his weathered hands, examining it closely. He sets it down among a row
of similar figures and reaches for his tools. Warm side-lighting from a
window, dust motes visible. Shot on 16mm film, documentary style.
Settings: 12s, 16:9, Pro
8. Farewell Scene β Two Characters
A train platform. A woman and a man stand facing each other. She adjusts
his collar. He takes her hand briefly. A train horn sounds in the distance.
She steps back and waves. He picks up his bag and turns toward the train.
Wide shot showing the full platform with distant passengers. Overcast light,
muted blue-gray palette. Melancholic but restrained.
Audio: train station ambience, distant horn, footsteps on concrete.
Settings: 15s, 16:9, Pro
9. Monologue β Direct Address
Medium shot of a woman in her 30s looking directly at camera. Simple neutral
background. Soft even lighting from both sides. She speaks clearly and naturally
with subtle hand gestures.
Dialogue:
[Woman, calm confident tone]: "Everyone told me it was impossible. But the
thing about impossible is β it only applies to people who've already decided."
Audio: quiet room tone, no music.
Settings: 8s, 16:9, Pro
10. Running Late β Comedy Beat
A man in a suit rushes down a city sidewalk carrying a coffee and a briefcase.
He dodges a cyclist, nearly trips on a curb, steadies himself, and checks his
watch with an exasperated expression. He takes a breath, straightens his tie,
and pushes through a revolving door. Bright morning city light. Slightly
handheld camera following from behind then circling to front. Urban comedy tone.
Settings: 12s, 16:9, Standard
Complex Scene Prompts (11-18)
11. Street Market β Multiple Characters
A busy outdoor food market on a warm afternoon. A vendor hands a wrapped item
to a customer. Behind them, two children chase each other between stalls.
An older woman examines produce at the next booth. Steam rises from a food
stall to the left. Camera drifts slowly through the scene at eye height.
Warm afternoon sunlight with dappled shade from canvas awnings.
Photorealistic, vibrant colors.
Audio: market chatter, sizzling food, children laughing, distant music.
Settings: 15s, 16:9, Pro
12. Office Meeting β Group Interaction
A glass-walled conference room. Four people sit around a table during a meeting.
One person stands at a whiteboard gesturing while explaining something.
Another nods and writes notes. A third leans back in their chair, arms crossed.
The fourth types on a laptop. Natural window light from the right.
Camera holds in a wide static shot capturing all four. Corporate documentary style.
Audio: muffled speaking, pen on paper, laptop keyboard clicks.
Settings: 10s, 16:9, Standard

13. Restaurant Kitchen β Controlled Chaos
Interior of a busy restaurant kitchen during service. A chef plates a dish
at the pass while calling out an order. A line cook flips something in a pan,
flames briefly visible. Another cook carries plates past camera. Steam rises
from multiple stations. The choreography of a working kitchen in full flow.
Warm overhead lighting mixed with blue-white from equipment. Handheld camera
moving through the space. Documentary veritΓ© style.
Audio: sizzling, clanking pans, chef calling orders, kitchen chatter.
Settings: 12s, 16:9, Pro
14. Park Scene β Family Gathering
A wide shot of a family picnic in a public park. Parents sit on a blanket
while two children play with a dog nearby. Grandparents approach from the
walking path carrying a picnic basket. The dog runs to greet them.
One child waves. Natural afternoon backlight creating warm rim lighting
on figures. Trees frame the scene. Camera holds wide and static.
Warm, naturalistic, nostalgic tone.
Settings: 15s, 16:9, Standard
15. Classroom β Teaching Moment
A teacher stands at the front of a classroom with a chalkboard behind her.
She writes an equation, then turns to the class and asks a question.
A student in the front row raises their hand. She points to them and
they begin to answer. Other students look on or take notes. Fluorescent
overhead lighting mixed with window light from the left. Camera holds
in a medium wide shot. Education documentary aesthetic.
Settings: 10s, 16:9, Standard
16. Dance Studio β Rehearsal
A group of five dancers rehearse a contemporary routine in a studio with
floor-to-ceiling mirrors. They move in loose synchronization β intentionally
imperfect, raw. A choreographer watches from the corner, occasionally
gesturing corrections. Natural daylight through high windows. Hardwood
floors reflect the dancers. Camera slowly circles the group.
Behind-the-scenes arts documentary feel.
Audio: dance shoes on wood floor, heavy breathing, muffled counting.
Settings: 15s, 16:9, Pro
17. Street Performance β Crowd Reaction
A street musician plays violin on a city sidewalk. A small crowd has gathered
in a semicircle. A child in the front watches wide-eyed. An older couple
sways gently. A passerby stops, listens, then drops money in the open case.
Late afternoon city light with long shadows. Camera captures both performer
and audience in a medium wide shot. Warm urban documentary aesthetic.
Audio: violin melody, distant traffic, coins dropping, crowd murmur.
Settings: 15s, 16:9, Pro
18. Construction Site β Workers in Action
Wide shot of a construction site at golden hour. A crane moves a steel beam
while workers below guide it with hand signals. A welder works sparks flying
to the left. A foreman checks plans on a table. Multiple activities happening
simultaneously with coordinated purpose. Hard directional light creating
dramatic shadows. Industrial documentary aesthetic.
Audio: machinery hum, metal clanking, welding crackle, distant shouting.
Settings: 12s, 16:9, Standard
Product and Commercial Prompts (19-26)
19. Perfume Ad β Aspirational Lifestyle
A woman in an elegant dress walks through the gardens of a grand estate
at golden hour. She trails her fingers along a stone balustrade. Wind catches
her dress and hair. She pauses at a fountain, looks back over her shoulder
with a subtle expression, and continues walking. Camera follows from behind
then circles to profile. Rich warm cinematic color grade. Luxury brand
commercial aesthetic, shot on anamorphic lens with characteristic flare.
Settings: 15s, 16:9, Pro

20. Car Commercial β Hero Drive
A silver sedan drives along a winding coastal road at sunset. Camera tracks
alongside from a low angle showing the car and ocean backdrop. The car rounds
a curve, sunlight flaring off the windshield. Cut to interior: driver's hands
on the leather steering wheel, dashboard instruments glowing. Cut back to
exterior wide shot of the car receding along the coastline. Cinematic commercial
quality. Warm golden palette with deep shadows.
Audio: engine purr, wind, subtle cinematic score.
Settings: 20s, 16:9, Pro
21. Tech Product β In-Context Usage
A person sits at a clean desk in a modern apartment. They open a laptop
and begin working, typing naturally. A notification appears and they pick up
their phone to check it, then set it down and return to the laptop.
Natural window light from camera left. Shallow depth of field isolating
the subject. Clean, aspirational tech lifestyle aesthetic.
No visible brand logos on screen.
Settings: 12s, 16:9, Standard
22. Food Commercial β Preparation and Serve
Close-up of hands slicing fresh tomatoes on a wooden cutting board.
The slices are arranged on a plate. Olive oil is drizzled in a thin stream.
Fresh basil leaves are torn and scattered. Camera pulls back to reveal
a complete caprese salad on a rustic table with wine glasses and bread.
Warm directional side-light from a window. Shallow depth of field.
Premium food photography aesthetic.
Audio: knife on cutting board, oil pouring, ambient kitchen warmth.
Settings: 15s, 16:9, Pro
23. Fitness Brand β Athlete Training
An athlete performs box jumps in an industrial gym. The camera captures the full
explosive movement from a low angle. Chalk dust rises from their hands on
impact. They step down, breathe, and immediately launch into the next rep.
Sweat visible. Hard side-lighting from industrial windows.
Motivational sports commercial aesthetic. Desaturated palette with high contrast.
Shot at high frame rate for subtle slow-motion quality.
Audio: explosive landing, heavy breathing, gym ambient.
Settings: 10s, 16:9, Pro
24. Beverage β Pour and Enjoy
A cold glass sits on a bar counter. A bartender pours amber beer from a tap
in a smooth continuous motion. Foam rises to the rim and settles.
Condensation beads form on the glass. A hand reaches in and picks up
the glass. Camera holds in a close-up throughout. Warm bar lighting
with backlit amber tones through the glass. Premium beverage commercial quality.
Audio: pouring liquid, foam settling, glass pickup, bar ambience.
Settings: 8s, 16:9, Pro
25. Fashion β Editorial Movement
A model walks through a sun-drenched courtyard in a flowing white dress.
The fabric catches wind and billows dramatically. She turns at the end
of the courtyard and walks back toward camera. Long shadows on stone floor.
Harsh midday sun creating high contrast. Camera locked off on a wide shot.
High-fashion editorial aesthetic. Minimal styling, maximum light and texture.
Settings: 12s, 9:16, Pro

26. Jewelry β Detail and Emotion
Close-up of a woman's hands as she clasps a delicate gold necklace
around her neck. Her fingers adjust the pendant. She looks in a mirror β
we see her reflection smiling softly. She touches the pendant one more time.
Warm intimate lighting. Shallow depth of field. Camera holds close throughout.
Luxury jewelry commercial quality. Skin tones warm and natural.
Settings: 8s, 16:9, Pro
Social Media and Vertical Prompts (27-34)
27. Get Ready With Me β Morning Prep
A woman applies skincare in a bright bathroom mirror. She blends moisturizer
with both hands, then reaches for a cosmetics product. Quick, confident
movements. She smiles at her reflection. Camera is static, framing her
from the mirror's perspective. Bright even bathroom lighting. Natural,
relatable UGC aesthetic.
Settings: 10s, 9:16, Standard
28. Street Style β Walking Shot
Full body shot of a man in a fitted leather jacket and sneakers walking
confidently down a wide urban sidewalk. Camera moves backward at his pace.
Morning city light, long shadows. He adjusts his sunglasses mid-stride.
Shallow depth of field, city blurred behind. Street fashion editorial aesthetic.
Cool desaturated color grade.
Settings: 8s, 9:16, Standard
29. Recipe Quick-Cut β Satisfying Assembly
Bird's eye view of hands rapidly assembling a layered dessert in a glass jar.
Granola base, yogurt layer, berry compote, more yogurt, fresh berries on top.
Each addition is quick and deliberate. Final product is centered and pristine.
Bright natural overhead lighting. Clean white surface. Satisfying food content
aesthetic optimized for vertical social.
Settings: 8s, 9:16, Standard
30. Motivational β Athlete Reel
Quick sequence: athlete laces up shoes, steps onto a track, takes starting
position, and explodes into a sprint. Camera angle starts close on hands
tying laces, then shifts to low angle on the track capturing the start.
Dawn light, dew visible on the track surface. Motivational sports content
tone. High contrast, slightly desaturated.
Audio: shoes on track, explosive start, breathing.
Settings: 10s, 9:16, Pro
31. Pet Content β Cat Personality
A tabby cat sits on a windowsill watching birds outside. It tilts its head
sharply, ears perked forward. It raises one paw and taps the glass twice.
The cat turns to look at camera with an expression of casual indifference,
then returns to watching. Soft natural window light. Camera holds static.
Charming pet content aesthetic. Warm homey tones.
Settings: 8s, 9:16, Standard

32. Unboxing β First Reveal
Hands carefully open a premium matte box with magnetic closure. Tissue paper
is peeled back revealing a product inside. Hands lift the item out and hold it
toward camera. Brief rotation to show angles. Soft studio lighting from above.
Clean background. First-person perspective. Satisfying unboxing aesthetic.
No visible brand logos.
Settings: 10s, 9:16, Standard
33. Travel Moment β Unexpected Beauty
A person steps out of a narrow alley and pauses as they see a sweeping panoramic
view of a coastal town below. Wind catches their hair. They take a breath
and smile. Camera is behind them, slowly widening from medium to wide
to reveal the view alongside them. Golden hour. Travel discovery moment.
Audio: wind, distant waves, seagulls, footsteps.
Settings: 10s, 9:16, Pro
34. UGC Style β Talking Head Product Review
A person in their late 20s holds a small product in their hand and speaks
to camera with natural energy. They gesture with the product, turn it to show
different angles, and nod while explaining. Bright natural lighting from a
window. Clean background, slightly out of focus. Shot on phone-style
front camera perspective with subtle handheld sway. Authentic UGC aesthetic.
Dialogue:
[Person, enthusiastic casual tone]: "Okay I've been using this for two weeks
and honestly? It's the only thing that actually worked."
Settings: 10s, 9:16, Standard
Extended Duration Prompts (35-40)
These prompts leverage Sora 2's unique 25-second maximum duration for complete narrative sequences.
35. Day in the Life β Complete Arc
Total duration: 20 seconds. Pacing: slow opening building to purposeful close.
A man wakes up in a minimal apartment, morning light through blinds.
He sits up slowly (0-4s). Walks to the kitchen and makes pour-over coffee,
methodical ritual (4-10s). Steps onto a balcony holding the mug and looks
out at a city skyline as morning fog lifts (10-15s). He takes a sip, nods
to himself with resolve, and walks back inside with purpose (15-20s).
Warm golden morning tones throughout. Smooth steadicam following movement.
Slice-of-life authenticity with cinematic quality.
Audio: alarm, footsteps, water pouring, coffee brewing, city morning ambient.
Settings: 20s, 16:9, Pro
36. Brand Story β Origin Moment
Total duration: 20 seconds.
A workshop. An artisan examines a piece of raw leather (0-5s). Hands
cut and shape the material with deliberate precision (5-10s). Stitching
by hand under a warm desk lamp, close-up of needle through leather (10-15s).
The finished wallet is placed on a wooden surface and the artisan steps back
to admire it (15-20s). Camera moves from detail close-ups to wider reveals
throughout. Workshop environment with tools and materials visible.
Warm practical lighting. Handcraft documentary aesthetic.
Audio: cutting, stitching, leather creaking, workshop ambient.
Settings: 20s, 16:9, Pro
37. Real Estate β Property Walkthrough
Total duration: 25 seconds.
Camera enters through a front door into a bright open-plan living space (0-5s).
Smooth steadicam forward through the living room showing high ceilings and
natural light (5-10s). Turn right into a modern kitchen with island counter
and pendant lights (10-15s). Continue past sliding glass doors that open
to a backyard patio with garden (15-20s). Final wide shot from the patio
looking back at the house (20-25s). Professional real estate video style.
Bright, airy, aspirational.
Settings: 25s, 16:9, Standard

38. Music Video β Moody Narrative
Total duration: 20 seconds.
Night city. A woman walks alone under streetlights, rain reflecting
neon on wet asphalt (0-5s). She stops and leans against a wall, looking
at the sky (5-10s). Pulls out a phone, hesitates, puts it back in her pocket
(10-15s). She pushes off the wall and walks into the distance, camera
holding as she recedes (15-20s). Teal and amber color grade.
Anamorphic lens characteristics. Film noir meets contemporary.
Audio: rain on pavement, distant city hum, melancholic ambient score.
Settings: 20s, 16:9, Pro
39. Cooking Tutorial β Full Process
Total duration: 25 seconds.
Overhead view: ingredients arranged on a counter β pasta, garlic, olive oil,
parmesan, pepper (0-3s). Hands boil water and add pasta (3-8s).
In a pan: garlic sizzles in olive oil, camera close on the action (8-13s).
Tongs transfer pasta into the pan, toss with sauce (13-18s).
Plated on a ceramic dish, parmesan grated over top, pepper cracked (18-23s).
Final beauty shot of the completed dish, steam rising (23-25s).
Warm kitchen lighting. Clean food video aesthetic.
Audio: boiling water, sizzling garlic, tongs on pan, grater, pepper grinder.
Settings: 25s, 16:9, Standard
40. Fitness β Full Workout Set
Total duration: 20 seconds.
An athlete stands before a loaded barbell, chalks hands (0-4s).
Grips the bar and performs a clean and jerk with explosive power (4-9s).
Holds the barbell overhead, controlled (9-12s). Drops the weight and
steps back, breathing hard (12-16s). Walks to a bench, sits down, wipes
face with towel, drinks water (16-20s). Industrial gym environment.
Hard directional lighting. Camera captures the full lift from a
45-degree angle then shifts to profile for the drop and recovery.
Audio: chalk clap, deep breath, explosive lift, weight drop, heavy breathing.
Settings: 20s, 16:9, Pro
Image-to-Video Prompts (41-44)
41. Product Photo β Add Life
Starting from the reference image, animate with natural subtle movement.
Add gentle steam if hot beverage is present. Fabric settles slightly as if
just placed. Environmental light shifts subtly as clouds pass a window.
Camera holds the exact composition from the source image. No structural
changes to the subject. Preserve all lighting and color from the original.
Audio: appropriate ambient sound for the scene depicted.
Settings: 8s, match source aspect ratio
42. Portrait β Breathe and React
Animate from reference with natural life. Subject breathes naturally,
blinks once, and makes subtle micro-expression as if listening to
something interesting. Slight head adjustment. Maintain exact identity,
clothing, and lighting from source. Camera static. Very restrained β
this should feel like a living photograph, not a performance.
Settings: 8s, match source aspect ratio
43. Architecture β Environmental Life
Animate from source with environmental movement only. Clouds move across
the sky. Trees sway in gentle wind. Shadows shift subtly with cloud movement.
If water is present, add realistic surface motion. Building and structural
elements remain completely static. Camera holds exact position from source.
Preserve all architectural geometry and lighting relationships.
Settings: 8s, match source aspect ratio

44. Food Photo β Appetizing Motion
Add steam rising from hot elements. Sauce glistens with subtle light
interaction. Herbs settle slightly. Condensation forms on cold glass
if present. Camera holds static from source composition. Only thermal
and micro-motion animation. Preserve exact plating, colors, and
lighting from the original photograph.
Audio: subtle ambient dining sounds.
Settings: 4s, match source aspect ratio
Dialogue and Audio Prompts (45-50)
45. Interview β Single Speaker
Medium close-up of a man in a casual button-down shirt seated in a
well-lit office. Bookshelves blurred behind. He speaks naturally, making
eye contact slightly off-camera left. Occasional hand gestures.
Warm key light from camera right, subtle fill from left.
Dialogue:
[Man, thoughtful professional tone]: "The hardest part wasn't building it.
The hardest part was convincing people it was worth building."
Audio: quiet office tone, subtle air conditioning hum.
Settings: 8s, 16:9, Pro
46. Couple Conversation β Emotional Beat
Medium shot of two people sitting on a park bench in autumn. Leaves fall
around them. He looks at the ground. She speaks first, gently.
Dialogue:
[Woman, soft warm tone]: "You don't have to figure it all out today."
[Man, after a pause, looking up]: "I know. I just wish I believed that."
Camera holds in a static two-shot. Late afternoon warm backlight.
Shallow depth of field with park blurred behind.
Audio: wind in leaves, distant park ambient, footsteps on path.
Settings: 12s, 16:9, Pro
47. Customer Testimonial β Authentic Feel
A woman in a bright kitchen speaks directly to camera with natural energy
and genuine enthusiasm. She holds a product casually, referencing it
during her testimonial. Bright window light. Slightly imperfect
phone-camera framing for authenticity.
Dialogue:
[Woman, genuine enthusiastic tone]: "I was skeptical at first. But after
a month? My mornings are completely different. I actually look forward to it."
Audio: kitchen ambient, natural room tone.
Settings: 8s, 9:16, Standard
48. Narrator Voice β B-Roll With VO
Aerial shot slowly gliding over a mountain range at dawn. Mist fills the
valleys. First light touches the peaks. No visible people. Pure landscape.
Camera drifts forward smoothly. Epic nature documentary aesthetic.
Dialogue:
[Narrator, deep measured tone, no visible speaker]: "Some places remind you
that the world existed long before you. And will continue long after."
Audio: wind, distant bird calls, subtle orchestral undertone.
Settings: 12s, 16:9, Pro
49. Multilingual β Code-Switch
Medium shot of two women at a small outdoor cafe table. Mediterranean
setting, warm afternoon light. String lights overhead.
Dialogue:
[Woman A, in Spanish]: "Este lugar me recuerda a Barcelona."
[Woman B, switching to English]: "Me too. We should go back this summer."
Natural head movement and expression. Lip-sync matched to each language.
Camera holds in a natural two-shot.
Audio: cafe ambient, distant street, gentle breeze.
Settings: 10s, 16:9, Pro

50. Comedic Timing β Dry Delivery
Medium shot of a man sitting at a desk in a very ordinary office cubicle.
Fluorescent lighting. He looks directly at camera with a completely
deadpan expression and speaks.
Dialogue:
[Man, completely flat deadpan delivery]: "They said the meeting could
have been an email. It couldn't. It should have been nothing."
He holds the stare for two beats, then looks back at his computer screen.
Audio: office ambient, keyboard from adjacent cubicle, distant printer.
Settings: 8s, 16:9, Standard
Prompting Quick Reference
Sora 2 Responds Best To
| Element | Approach |
|---|---|
| Subject action | Sequential beats: "enters, pauses, then walks to" |
| Multiple characters | Assign distinct roles and simultaneous actions |
| Camera | Append after action description, not before |
| Dialogue | Dedicated block with speaker attribution and tone |
| Style | Append as modifier: "shot on 35mm, warm grade" |
| Duration pacing | Describe temporal arc: "slow opening, building energy" |
Duration Strategy
| Duration | Best For |
|---|---|
| 4-8s | Tight single actions, reaction shots, product reveals |
| 8-15s | Complete scenes, dialogue exchanges, narrative beats |
| 15-25s | Full narrative arcs, walkthroughs, extended sequences |
Common Pitfalls
Avoid stacking too many simultaneous actions in complex scenes. Sora 2 handles multiple characters doing different things, but five characters each performing three-step actions will produce simplification. Keep the primary action clear and let secondary characters perform simpler supporting behaviors.
Avoid overly long dialogue. Keep spoken lines under 15 words per speaker per turn for reliable lip-sync. Longer monologues risk accelerated speech or audio drift.
Next Steps
These prompts are starting points. Adjust action specificity, camera direction, and style modifiers based on your results. Sora 2 rewards iteration β small changes to subject behavior descriptions produce meaningful output differences.

For prompt engineering principles that apply across all models, see the prompt engineering guide.
To compare Sora 2's strengths against other models and build routing logic, see the Kling 3.0 vs Sora 2 comparison.
Browse all available models including Kling 3.0, Veo 3, and Runway alongside Sora 2 in the AI models library.
Related Articles
- Sora 2 Complete Guide: Professional Video Generation Mastery
- Kling 3.0 vs Sora 2: Complete AI Video Model Comparison 2026
- Kling 3.0 Prompt Examples: 50 Production-Ready Prompts
- Veo 3 Prompt Examples: 50 Production-Ready Prompts
- AI Prompt Engineering: The Complete Guide 2026
- AI Video Generation: The Complete Guide 2026