Receive the path to the audio file and the desired output format.
Determine the Whisper model to use (defaults to `turbo`).
Check if the model is cached; download if needed.
Execute the Whisper CLI command with the specified parameters.
Transcribe the audio, or translate it into English.
Save the output to the specified directory in the chosen format.
Provide the path to the generated output file.
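
The steps above can be sketched in Python as a thin wrapper around the `whisper` command-line tool. This is a minimal sketch, not the actual implementation: the function names (`build_whisper_command`, `expected_output_path`, `transcribe`) are hypothetical, and it assumes the `openai-whisper` package is installed so that `whisper` is on the `PATH`. It relies on the CLI to download and cache the model on first use, and assumes the output file is named after the audio file's base name plus the format extension.

```python
import subprocess
from pathlib import Path

# Output formats and tasks accepted by this sketch (assumed subset of the CLI's options).
VALID_FORMATS = {"txt", "vtt", "srt", "tsv", "json"}
VALID_TASKS = {"transcribe", "translate"}


def build_whisper_command(audio_path, output_dir, output_format="txt",
                          model="turbo", task="transcribe"):
    """Build the argument list for the Whisper CLI (hypothetical helper)."""
    if output_format not in VALID_FORMATS:
        raise ValueError(f"unsupported output format: {output_format}")
    if task not in VALID_TASKS:
        raise ValueError(f"unsupported task: {task}")
    return [
        "whisper", str(audio_path),
        "--model", model,
        "--task", task,
        "--output_format", output_format,
        "--output_dir", str(output_dir),
    ]


def expected_output_path(audio_path, output_dir, output_format="txt"):
    """Guess where the CLI writes its result: <output_dir>/<audio stem>.<format>."""
    return Path(output_dir) / f"{Path(audio_path).stem}.{output_format}"


def transcribe(audio_path, output_dir, output_format="txt",
               model="turbo", task="transcribe"):
    """Run the Whisper CLI and return the path to the file it produced."""
    audio = Path(audio_path)
    if not audio.is_file():
        raise FileNotFoundError(audio)
    Path(output_dir).mkdir(parents=True, exist_ok=True)

    cmd = build_whisper_command(audio, output_dir, output_format, model, task)
    # The CLI fetches the model itself if it is not cached yet.
    subprocess.run(cmd, check=True)

    result = expected_output_path(audio, output_dir, output_format)
    if not result.is_file():
        raise RuntimeError(f"expected output not found: {result}")
    return result
```

A call such as `transcribe("talk.mp3", "transcripts", output_format="srt")` would then return `transcripts/talk.srt` if the run succeeds. Note that the Whisper documentation describes `turbo` as not trained for translation, so a different model is likely a better choice when `task="translate"`.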