1. Receive user input and detect the desired mode (Podcast, Explain, TTS, Image Generation).
2. Identify relevant parameters from the user input (topic, URL, voice).
3. Execute the appropriate script with the identified parameters.
4. Store the Episode ID so it can be used in later status queries.
5. Check the status of the generation process using the Episode ID.
6. If generation succeeds, present the results to the user along with the relevant links.
7. Handle user requests for downloads or further actions.
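
The steps above can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the actual implementation: the keyword table, the script names (`podcast.py` and so on), the `--topic`/`--url`/`--voice` flags, and the status strings `done` and `failed` are all hypothetical, since the source does not name them. The status check is passed in as a callable so the sketch does not assume any particular API.

```python
import re
import time
from dataclasses import dataclass
from typing import Callable, List, Optional

# Hypothetical trigger words per mode; the real ones are not given in the source.
# Modes are tried in this order and the first match wins.
MODE_KEYWORDS = {
    "image": ("image", "picture", "draw"),
    "tts": ("tts", "read aloud", "text to speech"),
    "explain": ("explain",),
    "podcast": ("podcast",),
}

URL_PATTERN = re.compile(r"https?://\S+")


@dataclass
class Request:
    mode: str
    topic: str
    url: Optional[str] = None
    voice: Optional[str] = None


def detect_mode(text: str) -> Optional[str]:
    """Step 1: return the first mode whose keyword appears as a whole word."""
    lowered = text.lower()
    for mode, keywords in MODE_KEYWORDS.items():
        for keyword in keywords:
            if re.search(r"\b" + re.escape(keyword) + r"\b", lowered):
                return mode
    return None


def parse_request(text: str, voice: Optional[str] = None) -> Request:
    """Step 2: pull the mode, an optional URL, and the remaining topic text."""
    mode = detect_mode(text)
    if mode is None:
        raise ValueError("could not detect a mode from the input")
    match = URL_PATTERN.search(text)
    url = match.group(0) if match else None
    topic = URL_PATTERN.sub("", text).strip()
    return Request(mode=mode, topic=topic, url=url, voice=voice)


def build_command(request: Request) -> List[str]:
    """Step 3: build the argument list for a (hypothetical) per-mode script."""
    command = ["python", f"{request.mode}.py", "--topic", request.topic]
    if request.url:
        command += ["--url", request.url]
    if request.voice:
        command += ["--voice", request.voice]
    return command


def wait_for_status(
    episode_id: str,
    check_status: Callable[[str], str],
    attempts: int = 10,
    delay: float = 2.0,
) -> str:
    """Steps 4-5: poll with the stored Episode ID until a final status appears."""
    for _ in range(attempts):
        status = check_status(episode_id)
        if status in ("done", "failed"):  # hypothetical status names
            return status
        time.sleep(delay)
    return "timeout"
```

The list returned by `build_command` could be passed to `subprocess.run`, and a result of `done` from `wait_for_status` would be the point at which links are shown to the user (step 6). Keeping the status check injectable makes the polling loop easy to exercise without a live backend.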