1. Ask the user to choose a generation mode: Text-to-Video, Image-to-Video, Multi-Shot Sequence, or Keyframe Transition.
2. Gather scene details by asking questions about the subject, action, environment, camera, lighting, mood/emotion, audio, duration, and aspect ratio.
3. Assemble the prompt using the Master Formula: [Scene/Environment] + [Subject & Appearance] + [Action Timeline] + [Camera Movement] + [Audio & Atmosphere] + [Technical Specs].
4. Present the assembled prompt to the user.
5. Ask if the user wants to adjust any element, add a negative prompt, or generate variations.
6. Refine the prompt based on user feedback and generate the video.