IFA Dutch Spoken Language Corpus: Raw sentences

This service allows you to query a (PostgreSQL) sentence database of the IFA Dutch Spoken Language Corpus for speech. All speech has been chopped into "sentence-size" units. For information, please contact:Rob van Son Rob.van.Son@hum.uva.nl.

Copyrigth © 2000, 2012, Nederlandse Taalunie and R.J.J.H. van Son
All materials presented here are licensed under the GNU General Public License (GPL) unless explicitely stated otherwise. Please note that these materials are presented WITHOUT ANY WARRANTY. For information or further details see the License, GNU.org, or contact me at R.J.J.H.vanSon@uva.nl.

You can find information on the label protocol, the naming conventions, and the phoneme labels here (in Dutch) and here (some of it in English).

Text types are:

Speaking styles are:

Information on how to use our Web interface to the IFA-corpus PostgreSQL database, look here.

Available file-types are: A label file (Status *: unaligned, R: complete, handaligned, H: handaligned phonemestring, A: automatically aligned phoneme string), two separate channels from the same recording, from a fixed microphone (_fm) and from a head mounted microphone (_hm), and the corresponding spectral Center of Gravity (CoG) files. The Spectral Center of Gravity is a compact way to represent spectral changes, especially changes in speech source (noise and voicing) and changes related to articulatory movements. These are stored as Praat ASCII Sound files. Sound files are CD quality (directly recorded on Audio CD), i.e., 44.1 kHz and 16 bit.

Select the speech material (select Speaker: F/M - female/male, Age, ID A-Z; Text Type: Fixed or Variable; and Speaking Styles):
(use the info buttons to display more or less information, e.g., file sizes and descriptions in the status bar)

Info: default more info no listing POST GET method
Speaker
Text material
Speaking Style
Narrow down the selection of files:
ID does (~) or does NOT (!~) match regexp
<= Duration <=
Transliteration does (~) or does NOT (!~) match regexp

Link to labels with STATUS


Selected Files
(look at the bottom of the page for collected TAR archives)