Changelog

2025-10-05

  • September 23'rd in 2019: Accessibility requirements apply to all public websites launched after this date.
  • September 23'rd in 2020: The law will be applied backwards too - that is, it will apply to all public sector sites.
  • June 23'rd in 2021: Legislation applies to mobile applications too.

Speech to text is an automatic speech recognition (ASR) system that consists primarily of statistical models which map continuous spoken utterances or speech waveforms to text in human language. The ASR system is put together from a language model, a pronunciation model (lexicon/dictionary), and an acoustic model. When the ASR system is consistently fed and trained with new speech data by multiple speakers it receives an extended vocabulary, and the accuracy of the ASRs transcript increases. Therefore, the more the ASR has used the better accuracy it receives. The accuracy levels are measured and set by the Word Error Rate (WER).

For an ASR model to be considered highly accurate, the WER correspondence needs to be less than 10%. Txtplays ASR model is considered being highly accurate and that's because we deliver an accuracy of 94%.

2025-09-01

  • Language barriers: subtitles allow people to understand languages that aren't their mother tongue/they don't speak by translating words or sentences into the user's preferred language.
  • Improved concentration: help for people with reading or learning difficulties. People with reading or learning difficulties can benefit from subtitles to understand the content better and faster.
  • Noisy environments: in noisy environments, for example in public places, aeroplanes or trains, subtitles make it possible to follow along in the dialogue.
  • Subtitles: is an excellent resource for language learning.
  • Avoid disturbing others; most people watch videos on mute while in public.
  • Citations and references: for researchers, writers and journalists, subtitles can be a valuable source of citations and references when writing about film or media content.
  • Transcription of long interviews: 1 hour of audio content takes about 6 hours to transcribe manually,

Speech-to-text and automatic speech recognition (ASR) enable audio content to be visually accessible for everyone by adding text. Adding speech-to-text and subtitles provides accessibility for hearing-impaired audiences who would otherwise be excluded from this content. Therefore, automatic speech recognition (ASR) has become a necessity to possess to make content accessible to everyone.

Making online video content available to everyone has become a new EU directive set by the European Disability Forum Guidelines and the Web Content Accessibility Guidelines (WCAG).

EU directive dates and legislation:

Changelog

Speech to text is an automatic speech recognition (ASR) system that consists primarily of statistical models which map continuous spoken utterances or speech waveforms to text in human language. The ASR system is put together from a language model, a pronunciation model (lexicon/dictionary), and an acoustic model. When the ASR system is consistently fed and trained with new speech data by multiple speakers it receives an extended vocabulary, and the accuracy of the ASRs transcript increases. Therefore, the more the ASR has used the better accuracy it receives. The accuracy levels are measured and set by the Word Error Rate (WER).

For an ASR model to be considered highly accurate, the WER correspondence needs to be less than 10%. Txtplays ASR model is considered being highly accurate and that's because we deliver an accuracy of 94%.

  • Language barriers: subtitles allow people to understand languages that aren't their mother tongue/they don't speak by translating words or sentences into the user's preferred language.
  • Improved concentration: help for people with reading or learning difficulties. People with reading or learning difficulties can benefit from subtitles to understand the content better and faster.
  • Noisy environments: in noisy environments, for example in public places, aeroplanes or trains, subtitles make it possible to follow along in the dialogue.
  • Subtitles: is an excellent resource for language learning.
  • Avoid disturbing others; most people watch videos on mute while in public.
  • Citations and references: for researchers, writers and journalists, subtitles can be a valuable source of citations and references when writing about film or media content.
  • Transcription of long interviews: 1 hour of audio content takes about 6 hours to transcribe manually,

Speech-to-text and automatic speech recognition (ASR) enable audio content to be visually accessible for everyone by adding text. Adding speech-to-text and subtitles provides accessibility for hearing-impaired audiences who would otherwise be excluded from this content. Therefore, automatic speech recognition (ASR) has become a necessity to possess to make content accessible to everyone.

Making online video content available to everyone has become a new EU directive set by the European Disability Forum Guidelines and the Web Content Accessibility Guidelines (WCAG).

EU directive dates and legislation:

  • September 23'rd in 2019: Accessibility requirements apply to all public websites launched after this date.
  • September 23'rd in 2020: The law will be applied backwards too - that is, it will apply to all public sector sites.
  • June 23'rd in 2021: Legislation applies to mobile applications too.