Amazon debuts automatic speech recognition service, Amazon Transcribe Medical
Amazon is expanding its automatic transcription service for AWS, Amazon Transcribe, to include support for medical speech, the company announced this morning at its AWS re:Invent conference. The new machine learning-powered service, Amazon Transcribe Medical, will allow physicians to quickly dictate their clinical notes and have their speech converted into accurate text in real time, without any human intervention, Amazon claims.
Unlike with some dictation services, physicians won’t have to say things like “comma” or “full stop,” but can speak normally during the dictation process. The text can then be fed to downstream systems, including electronic health record (EHR) systems or AWS language services, like Amazon Comprehend Medical for entity extraction.
The service is also HIPAA-eligible and scales with users’ needs, so you pay only for what you actually use, with no upfront fees, Amazon notes.
From a technical perspective, the service works as follows.
You first capture audio through a device’s microphone, then send the PCM audio to a streaming API based on the WebSocket protocol. The API responds with a series of JSON blobs containing the transcribed text, along with word-level time stamps and punctuation. The output can optionally be saved to an Amazon Simple Storage Service (S3) bucket.
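As a rough illustration of that flow, here is a minimal Python sketch of the two client-side steps around the network call: splitting captured PCM audio into small chunks for streaming, and pulling word-level time stamps out of a JSON response. It is not Amazon's client code. The sample rate, chunk size, helper names (pcm_chunks, words_with_times) and the JSON field names in the sample blob are all assumptions made for the example, and the WebSocket connection, request signing and message framing are left out entirely.

```python
import json
from typing import Iterator, List, Tuple

# Assumed audio format for this sketch: 16 kHz, 16-bit, mono PCM.
SAMPLE_RATE_HZ = 16_000
BYTES_PER_SAMPLE = 2
CHUNK_MS = 100  # assumed chunk duration, not a documented requirement


def pcm_chunks(pcm: bytes, chunk_ms: int = CHUNK_MS) -> Iterator[bytes]:
    """Split raw PCM bytes into fixed-duration chunks ready to be streamed."""
    chunk_bytes = SAMPLE_RATE_HZ * BYTES_PER_SAMPLE * chunk_ms // 1000
    for start in range(0, len(pcm), chunk_bytes):
        yield pcm[start:start + chunk_bytes]


def words_with_times(blob: str) -> List[Tuple[str, float, float]]:
    """Return (word, start, end) tuples from one JSON response blob.

    Partial results and punctuation items are skipped. The field names
    are assumptions for this sketch, not a confirmed schema.
    """
    message = json.loads(blob)
    words = []
    for result in message.get("Transcript", {}).get("Results", []):
        if result.get("IsPartial"):
            continue  # wait for the finalized version of this segment
        for alternative in result.get("Alternatives", [])[:1]:
            for item in alternative.get("Items", []):
                if item.get("Type") == "pronunciation":
                    words.append(
                        (item["Content"], item["StartTime"], item["EndTime"])
                    )
    return words


# Hypothetical response blob, hand-written for illustration only.
SAMPLE_BLOB = json.dumps({
    "Transcript": {
        "Results": [{
            "IsPartial": False,
            "Alternatives": [{
                "Transcript": "Patient stable.",
                "Items": [
                    {"Content": "Patient", "StartTime": 0.0,
                     "EndTime": 0.42, "Type": "pronunciation"},
                    {"Content": "stable", "StartTime": 0.5,
                     "EndTime": 0.95, "Type": "pronunciation"},
                    {"Content": ".", "StartTime": 0.95,
                     "EndTime": 0.95, "Type": "punctuation"},
                ],
            }],
        }]
    }
})
```

In a real client, each chunk from pcm_chunks would be written to the open WebSocket connection, and every incoming message would be passed through something like words_with_times before the text is handed to a downstream system.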