This is the multi-page printable view of this section. Click here to print.

Return to the regular view of this page.

Whisper AI

Whisper is a general-purpose speech recognition model. It is trained on a large dataset of diverse audio and is also a multitasking model that can perform multilingual speech recognition, speech translation, and language identification. You can generate transcripts from audio files.
    Responsible AI Checklist
    Can be used with customer data❌ No
    Can the output be used at customer✅ Yes
    Can be used with Xebia Internal secret data❌ No
    Can the output be used commercially by Xebia❓TODO VERIFY PROOF
    Is data being stored in the region you use it❌ No
    Do we have a Xebia license / managed solution for this tool❌ No

    What is Whisper AI

    Whisper is a general-purpose speech recognition model. It is trained on a large dataset of diverse audio and is also a multitasking model that can perform multilingual speech recognition, speech translation, and language identification. You can generate transcripts from audio files.

    How to use Whisper AI

    You can visit the website to follow instructions. In short. You download the tools and run whisper on your commandline. You can use multiple language models. The bigger the model, the more accurate the transcript. Although in many cases, small or medium is sufficient.

    License / Costs

    whisper is free

    Suitable to use with clients?

    It generates a transcript from an audio file. It does so locally. But please check customer. Also make sure the transcript does not end up somewhere.