STT/ASR Competitive Analysis: Google & IBM
Google's Cloud Speech-to-Text API is available in 120 languages as a cloud service, and as an application, while IBM's Watson STT is a cloud service available in 17 languages. The details of each product were added to the attached spreadsheet and a summary of our findings below.
- Google's Cloud Speech-to-Text API is available in 120 languages and different variants for free between 0 and 60 minutes and for a price between $0.004 and $0.009 per 15 seconds for speech-to-text recognition above 60 minutes.
- While the product was created as a cloud service, it is also available for desktops, laptops, phones, or tablets.
- Additionally, it allows users to customize up to 5,000 words, phrases, and other specific business words like years, addresses, currencies, conversions, etc.
- IBM's Speech-to-Text program, Watson, is a cloud service available in 17 languages and variations, under three plans: Lite, Standard, and Premium.
- Lite: offers between 100 and 500 minutes per month for free without access to customization.
- Standard: Charges between $0.02 and $0.015 per minute between one and over 1 million minutes.
- Premium: Charges vary based on model combinations and customization.
- The product allows language customization based on business needs, as seen in the medical language sample video.