STx Script Examples

SPExL is the STx application for transcription (audio-to-text transcription). It is equally suited to support transcriptions and annotations of audio signals, optionally according to user controlled transcription templates. Sound file segments are created and automatically linked to *.wav files.

Introduction

STx provides all methods necessary for computation of Mel Frequency CepstralCoefficients (MFCC). All the methods are described in the STProgrammer Guide. This document shows how to use the STx Macro Language to extract MFCC feature vectors from a sound file. All macros and sound processing units (SPUs) are written in the file mfcc extraction.sts which is  listed in Chapter 3. This macro file has to be run from inside STx. Note that  you may have to adjust the path of the #soundfile variable which has to point to a valid WAV sound file.

Read More (PDF)

Type: Script
Version: -
STx Version: 3.7
File:
mfcc_extraction.sts
Author: Timo Becker
Contact: Timo Becker,
c/o Department of Speaker Identification and Audio Analysis, BKA (Federal Criminal Police) KT 54, 65173 Wiesbaden, Germany

A simple timer application which shows the elapsed time in a dialog and can be controlled via buttons or shell messages.
STx timer script

Type: Script
Version: 1.0
STx Version:
3.8 beta Revision 1853
File:
timer.sts
Author: This email address is being protected from spambots. You need JavaScript enabled to view it.

Extract speech components from an existing signal and resynthesise the signal. This script demonstrates the use of the Playwave class, the SPUnits window, csigin, lpcoefs and in2table. The script takes the following parameters:

 [ [ soundfile_path ] ; [ segment=Signal.All ] ; [ frame_length=20 ] ; [ frame_shift=10 ] ; [ number_of_coeficients=16 ] ]

If parameters are missing, the default values are used. There is no default soundfile_path; if no soundfile was specified, the user is prompted via a dialog to choose a file from the DataSet.

Type: Script
Version: 1.1
STx Version:
3.7
File:
speech_resynthesis.sts
Author: Jonnie White
Email:
This email address is being protected from spambots. You need JavaScript enabled to view it.

Upcoming Events

Improving speech technology with the open source VOiCES dataset

ARI guest talk by Michael R. Lomnitz

19. September 2019

14.30

Seminar Room, Wohllebengasse 12-14 / Ground Floor

Read more ...
 

SSW10

The 10th ISCA Speech Synthesis Workshop

20. - 22. September 2019

Vienna, Austria

 

News