
Kaldi

A speech-to-text system for quick, cost-free transcription
This tool is no longer available via the BBC.

How does it work?

  1. Download and run the Debian package
  2. Upload video/speech files into the tool, using the instructions provided
  3. Edit the transcription, as required

What is Kaldi?

Kaldi is a speech recognition toolkit, built on open source software originally developed for speech recognition researchers. The quickest way to search through a piece of audio or video is via a transcript, but transcribing by hand is costly and time-consuming. Kaldi is designed to automate that process for free, requiring only a small amount of installation effort from a software developer. The BBC-Kaldi component provides a machine learning model, built using the tools in the open source Kaldi toolkit and audio and text data from BBC programmes, together with an easy-to-use interface that makes it simple to get up and running.
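
To illustrate why a transcript makes audio or video quick to search, here is a minimal Python sketch. It assumes the transcript is available as a list of (start time in seconds, word) pairs; that shape, the `find_phrase` helper, and the sample data are hypothetical and are not taken from the tool itself, whose output format is not described here.

```python
from typing import List, Tuple

# Hypothetical transcript shape: (start time in seconds, word).
# The real tool's output format is not documented on this page.
TimedWord = Tuple[float, str]


def find_phrase(transcript: List[TimedWord], phrase: str) -> List[float]:
    """Return the start time of every occurrence of `phrase`."""
    target = [w.lower() for w in phrase.split()]
    hits: List[float] = []
    if not target:
        return hits
    # Normalise case and strip trailing punctuation before comparing.
    words = [w.lower().strip(".,!?") for _, w in transcript]
    for i in range(len(words) - len(target) + 1):
        if words[i:i + len(target)] == target:
            hits.append(transcript[i][0])
    return hits


# Made-up sample data for illustration only.
example = [
    (0.0, "Good"), (0.4, "evening"), (1.1, "and"),
    (1.3, "welcome"), (2.0, "Good"), (2.3, "evening."),
]
print(find_phrase(example, "good evening"))  # [0.0, 2.0]
```

With word timings like these, a search returns the points in the recording to jump to, rather than requiring someone to listen through the whole file.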

What have we learnt from Kaldi?

Kaldi has been used extensively within the BBC, and each project has yielded its own lessons. BBC Newslabs' project showed that speech-to-text is a tremendous timesaver for journalists who need to be across a large number of video feeds. Their project proved that subtitles for viral video clips can be created much more quickly than was previously imagined. BBC Rewind also used speech-to-text to open up almost a million hours of material from the BBC Archive.
