At Rev, our customers can access two types of transcription services, depending on their requirements:
AI-based transcription: performed using automated speech recognition. While fast and relatively less expensive, its accuracy is impacted by various factors, such as background noise, speaker accents, etc.
Human transcription: performed by humans and up to 99% accurate. However, it is also more expensive and takes longer.
Thus far, AI-based transcription services have been delivered through the Rev AI APIs and human transcription services through the Rev.com website. Today, we're happy to announce that we're breaking the wall and allowing developers to access human transcription services through the Rev AI APIs.
This new capability allows developers of downstream applications to provide transcripts with higher levels of accuracy than any ASR system can currently provide, while still retaining the option to obtain ASR-based transcripts as before, without significant additional integration time and effort.
In addition to requesting full human- or ASR-transcribed results, this new feature also allows developers to selectively mix and match both options and create hybrid models to flexibly meet end-user needs. For example, developers can request ASR-based transcription first and then selectively "upgrade" segments (of two minutes or longer) of the ASR transcript to human quality.
Notably, timestamps of the human-transcribed segments will be aligned to those of the ASR transcription result, making it extremely easy to merge the results. All the standard job parameters— profanity filtering, punctuation, diarization— as well as custom vocabularies submitted through the API will be honored by the human transcriber.
To learn more, read our tutorial on integrating human transcription into ASR applications, which includes detailed notes and code samples to help you get started with this new feature.