1. User provides the YouTube video URL.
2. Agent executes the `get_transcript.py` script with the provided URL.
3. The script fetches the transcript using `yt-dlp`.
4. Agent reads the transcript.
5. Agent summarizes the transcript or answers user questions based on the content.
6. Agent presents the summary or answers to the user.