Janus Recognition Toolkit

Janus Recognition Toolkit (JRTk), sometimes referred to as Janus, is a general purpose speech recognition toolkit developed and maintained by the Interactive Systems Laboratories at Carnegie Mellon University and Karlsruhe Institute of Technology. It is useful for both research and application development and is part of the JANUS speech-to-speech translation system.[1]

The JRTk provides a flexible Tcl/Tk script based environment which enables researchers to build state-of-the-art speech recognizers and allows them to develop, implement, and evaluate new methods. It implements an object oriented approach that unlike other toolkits is not a set of libraries and precompiled modules but a programmable shell with transparent, yet efficient objects.

Since version 5 JRTk features the IBIS decoder, a one-pass decoder that is based on a re-entrant single pronunciation prefix tree and makes use of the concept of linguistic context polymorphism. It is therefore able to incorporate full linguistic knowledge at an early stage. It is possible to decode in one pass, using the same engine in combination with a statistical n-gram language model as well as context- free grammars. It is also possible to use the decoder to rescore lattices in a very efficient way.

JRTk utilizes the concept of Hidden Markov Models (HMMs) for acoustic modeling and offers many state-of-the-art techniques for acoustic pre-processing, acoustic model training, and speech decoding. Through its flexible, object oriented architecture it allows to configure all components in a very flexible way (e.g., pre-processing steps to execute, HMM topology, training sequence, algorithm parameters, adaptation sequences, etc.), without the need to modify source code or recompile.

JRTk has been used by the Interactive System Labs in many projects for speech recognition, such as:

References edit

  1. ^ Sebastian Stüker. "KIT - Janus Recognition Toolkit". Isl.ira.uka.de. Retrieved 2012-04-23.
  2. ^ Patricia Lichtblau (2012-02-01). "EU-Bridge - Homepage". Eu-bridge.eu. Retrieved 2012-04-23.
  3. ^ Esben Eidevik (2012-04-15). "EVEIL-3D-Startseite". Eveil-3d.eu. Retrieved 2012-04-23.
  4. ^ "IARPA - Solicitations - Office of Incisive Analysis, Babel Program". Iarpa.gov. Archived from the original on 2012-04-22. Retrieved 2012-04-23.
  5. ^ "Bienvenue sur le site de Quaero". Quaero.org. Retrieved 2012-04-23.
  6. ^ "SFB 588 - Humanoid Robots". Sfb588.uni-karlsruhe.de. Retrieved 2012-04-23.
  7. ^ "European Commission : CORDIS : Projects & Results Service : Technology and corpora for speech to speech translation". Cordis.europa.eu. Retrieved 2016-07-16.
  8. ^ "European Commission : CORDIS : Projects & Results Service : Facilitating Agent in Multicultural Exchange". Cordis.europa.eu. Retrieved 2016-07-16.
  9. ^ "European Commission : CORDIS : Projects & Results Service : NEgotiating through SPOken Language in E-commerce". Cordis.europa.eu. Retrieved 2016-07-16.