README FILE FOR Chatino Speech Corpus Archive

Dataset Created by: Damir Cavar, Malgorzata Cavar, Hilaria Cruz

Name: Damir Cavar
Address: Indiana University, Department of Linguistics, Ballantine Hall, room 850
Address: 1021 E. Kirkwood Avenue, Bloomington, IN 47405-7005
URL: http://pages.iu.edu/~dcavar/
email: dcavar@iu.edu

---------------------------------------
FILE LIST

20150715-ctp-001.zip

---------------------------------------
FILE INFORMATION

This zip file contains WAV audio files and annotations. The recordings were
made with a digital audio recorder (ZOOM H6) and can be played with any
audio software that supports WAV files. The annotations can be viewed and
edited with the ELAN software package. ELAN
(https://tla.mpi.nl/tools/tla-tools/elan/) is a professional tool for
creating complex annotations of video and audio resources.

---------------------------------------
RESEARCH QUESTION(S)

The data is the result of experiments on creating speech technologies to
document a low-resourced or endangered language. The language we chose for
creating speech corpora and training forced-alignment tools is Eastern
Chatino, an unwritten and low-resourced language from Oaxaca, Mexico. As far
as we can tell, this is the first such resource available under a free
Creative Commons license.

---------------------------------------
COPYRIGHT & LICENSING INFORMATION

This data is licensed for reuse under a Creative Commons
Attribution-ShareAlike 4.0 International license.