Whisper is an AI-based speech recognition system trained with large-scale weak supervision. It is a general-purpose model that handles multilingual speech recognition, speech translation, and spoken language identification. It takes a sequence-to-sequence approach: these tasks are represented jointly as a sequence of tokens that the decoder predicts. It is open source and comes in five model sizes that trade speed against accuracy. The code and model weights are released under the MIT license.
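To make the size trade-off and the token-based task format more concrete, here is a minimal Python sketch. It does not call the Whisper library. The size names and approximate parameter counts are the ones given in the project's README, and the special-token strings follow the format described for the model. The helpers `pick_model` and `decoder_prompt` are hypothetical names introduced only for this illustration.

```python
from typing import List, Optional

# Approximate parameter counts in millions, as listed in the project README.
MODEL_PARAMS_M = {
    "tiny": 39,
    "base": 74,
    "small": 244,
    "medium": 769,
    "large": 1550,
}


def pick_model(max_params_m: float) -> Optional[str]:
    """Hypothetical helper: largest model size that fits a parameter budget."""
    fitting = [name for name, params in MODEL_PARAMS_M.items() if params <= max_params_m]
    if not fitting:
        return None
    return max(fitting, key=lambda name: MODEL_PARAMS_M[name])


def decoder_prompt(language: str, task: str = "transcribe", timestamps: bool = False) -> List[str]:
    """Hypothetical helper: special tokens that tell the decoder which task to perform."""
    if task not in ("transcribe", "translate"):
        raise ValueError("task must be 'transcribe' or 'translate'")
    tokens = ["<|startoftranscript|>", f"<|{language}|>", f"<|{task}|>"]
    if not timestamps:
        # Without this token the decoder is expected to emit timestamp tokens.
        tokens.append("<|notimestamps|>")
    return tokens


if __name__ == "__main__":
    print(pick_model(100))
    print(decoder_prompt("en"))
```

A smaller budget selects a faster, less accurate model, and switching `task` to `"translate"` changes only the task token in the prompt, which is how a single model can serve several tasks.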
