Speech to Text

This plugin provides a macro and a recipe to convert speech to text using the DeepSpeech deep learning model.

Plugin information

Version	0.3.1
Author	Dataiku (Arthur DOUILLARD, Alexandre ABRAHAM, Alex COMBESSIE)
Released	2018-08
Last updated	2023-04
License	MPL-2.0

⚠️ The underlying DeepSpeech library requires the following system libraries:

libstdc++6 >= 4.8.5
glibc >= 2.19

Note that the required version of libstdc++6 is not installed by default on several Linux distributions. If it is missing, you will need sudo access to the server hosting your Dataiku DSS instance in order to upgrade libstdc++6.
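Before installing the plugin, it can help to check the server against the minimums listed above. The following is a minimal sketch, not part of the plugin: it uses Python's standard platform.libc_ver() to read the glibc version and compares it to 2.19. The helper names version_tuple, meets_minimum and glibc_ok are hypothetical, and the libstdc++6 version is not detected here; it would have to be read from your distribution's package manager.

```python
import platform
import re


def version_tuple(text):
    """Turn a dotted version such as '2.19' or '4.8.5' into a tuple of ints."""
    parts = []
    for piece in text.split("."):
        match = re.match(r"\d+", piece)
        if match is None:
            break  # stop at the first non-numeric component
        parts.append(int(match.group()))
    return tuple(parts)


def meets_minimum(found, required):
    """Return True when the 'found' version is at least the 'required' version."""
    return version_tuple(found) >= version_tuple(required)


def glibc_ok(required="2.19"):
    """Check the glibc linked to the running Python interpreter.

    Returns None when the C library is not glibc or cannot be detected.
    """
    lib, version = platform.libc_ver()
    if lib != "glibc" or not version:
        return None
    return meets_minimum(version, required)


if __name__ == "__main__":
    print("glibc >= 2.19:", glibc_ok())
```

The same meets_minimum helper can be applied to a libstdc++6 version string obtained by other means, for example meets_minimum("4.8.5", "4.8.5").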

How to use

Download DeepSpeech model macro

This macro downloads the weights of the DeepSpeech pre-trained model into a folder in your project. Note that this model has been trained on American English speech data.

Speech to Text recipe

This recipe takes two inputs: the folder of DeepSpeech weights created by the macro and a folder of audio files in WAV format. The output is a dataset with two columns: the audio file path and the associated transcription.
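The shape of the recipe's output can be illustrated with a short sketch. This is not the plugin's implementation: it only shows, under stated assumptions, how a folder of WAV files maps to rows of (path, transcription). The function transcribe_folder, the column names "path" and "transcription", and the transcribe callable (a stand-in for the speech-to-text model) are all hypothetical.

```python
import wave
from pathlib import Path


def read_wav(path):
    """Return (sample_rate, raw_frames) for a WAV file using the standard library."""
    with wave.open(str(path), "rb") as wav:
        return wav.getframerate(), wav.readframes(wav.getnframes())


def transcribe_folder(folder, transcribe):
    """Build one row per WAV file found under 'folder'.

    'transcribe' is a hypothetical callable taking (raw_frames, sample_rate)
    and returning text; in practice it would wrap the speech-to-text model.
    """
    root = Path(folder)
    rows = []
    for path in sorted(root.rglob("*")):
        # Skip directories and anything that is not a .wav file
        if not path.is_file() or path.suffix.lower() != ".wav":
            continue
        sample_rate, frames = read_wav(path)
        rows.append({
            "path": path.relative_to(root).as_posix(),
            "transcription": transcribe(frames, sample_rate),
        })
    return rows
```

Passing the model as a callable keeps the file handling separate from the transcription step, so the folder traversal can be tried with a placeholder function before any model weights are downloaded.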
