Institute of Phonetics and Speech Processing

Links and Functions

Language Selection

Deutsch

Breadcrumb Navigation

Startseite
Research
Software

Main Navigation

print

Content

Software

Parts of the research efforts of the IPS involve the development of software to aid in the scientific work with speech signals and corpora. The software is used both for basic research in speech science and for the creation of speech technology.

Data collection is costly and sometimes tedious in experiments on speech production and perception. But most steps in the workflow can be drastically sped up by using and combining computer tools. Some tools have been designed specifically for single steps along the workflow, others are apt for multiple tasks. For some tasks there is a variety of tools to choose from.

Some programs can be downloaded, some are offered as a web service. Web services can be used directly in the browser (more information here).

EMU-SDMS

Manages copora of speech recordings (both acoustic and physiogical) and allows for segmentation, annotation, hierarchical structuring and statistical evaluation.

WebMAUS

Performs forced alignment (phonetic segmentation) and labeling based on audio files and the corresponding orthographical/phonological transcription in several languages.

SpeechRecorder

Platform-independent speech recorder designed specifically for phonetic experiments.

Percy

Online perception experiment for mobile devices and computers.

WebTranscribe

Distributed orthographical transcription of spoken language.

WikiSpeech

Platform-independent web-based speech recorder.

Coala

Converts metadata tables to CMDI files.

G2P

Converts a written text into a canonical phonetic transcription. Several languages are supported.

Chunk Preparation

Generates BAS partitur files with a speech chunk tier TRN from various input formats. Used to enhance the performance of our MAUS segmentation services for sound files that contain long pauses or dialog data.

WebMINNI

Automatic phonetic segmentation and labelling without text input for several languages.

Pho2Syl

Syllabification of phonological and phonetic transcripts in several languages.

AudioEditorUI

Audio recorder and audio editor.

TextAlign

Aligns text sequence pairs by minimizing their edit distance.

MaryTTS

Web interface for MaryTTS, a text to speech system developed by phoneticians of Uni Saarland and DFKI.

Related Links

print

top