Set the `ELEVENLABS_API_KEY` environment variable.
Determine the text to be converted to speech.
Optionally specify a voice using the `-v` flag or environment variables.
Optionally include audio tags for voice modulation.
Execute the `sag` command with the text and any optional parameters.
The audio is generated and played back locally or saved to a file.
If saving to a file, the path to the audio file is provided for use in subsequent steps.