Drop in audio or video. Get back a formatted, timestamped transcript as a Word document.
Files are decoded and chunked in your browser, then each chunk is sent to
this Worker's /api/transcribe route, which calls Workers AI's
Whisper model. Nothing is stored server-side. Long files take longer
because audio is processed in ~25-second segments, one request per segment.
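
To make the segmenting concrete, here is a minimal sketch of how decoded audio might be split into ~25-second ranges before upload. The function name `planChunks`, the `ChunkRange` shape, and the 16 kHz sample rate in the usage note are assumptions for illustration; the app's actual chunking code is not shown here and may differ.

```typescript
// Hypothetical helper, not the app's real code: plans ~25-second
// chunk boundaries over a decoded mono sample buffer.
interface ChunkRange {
  start: number;         // first sample index (inclusive)
  end: number;           // last sample index (exclusive)
  offsetSeconds: number; // chunk start time, for shifting timestamps
}

function planChunks(
  totalSamples: number,
  sampleRate: number,
  chunkSeconds: number = 25,
): ChunkRange[] {
  const samplesPerChunk = Math.floor(sampleRate * chunkSeconds);
  if (samplesPerChunk < 1) {
    throw new RangeError("sampleRate * chunkSeconds must be at least 1");
  }

  const ranges: ChunkRange[] = [];
  for (let start = 0; start < totalSamples; start += samplesPerChunk) {
    ranges.push({
      start,
      // The final chunk is shorter when the audio does not divide evenly.
      end: Math.min(start + samplesPerChunk, totalSamples),
      offsetSeconds: start / sampleRate,
    });
  }
  return ranges;
}
```

For one minute of audio at an assumed 16 kHz, `planChunks(960_000, 16_000)` yields three ranges starting at 0, 25, and 50 seconds. Each range's `offsetSeconds` could then be added to the timestamps returned for that chunk so the final transcript stays aligned with the original file.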