dc.contributor.author Batzinger, Robert P.
dc.date.accessioned 2013-09-12T15:58:14Z
dc.date.available 2013-09-12T15:58:14Z
dc.date.issued 2011-10-07
dc.identifier.uri http://hdl.handle.net/2022/16793
dc.description Thesis (M.S.)--Indiana University South Bend, 2011. en
dc.description.abstract The goal of this study was to explore the use of machine learning techniques in the development of a web-based application that transcribes between multiple orthographies of the same language. To this end, the source text files used in publishing the Iu Mien Bible translation in four scripts were merged into a single textbase that served as the text corpus for this study. All syllables in the corpus were combined into a list of parallel renderings, which was then subjected to ID3 and to neural networks with backpropagation in an attempt to achieve machine learning of transcription between the different Iu Mien orthographies. The most effective set of neural net transcription rules was captured and incorporated into a web-based service where visitors could submit text in one writing system and receive a webpage containing the corresponding text rendered in the other writing systems of this language. Transcriptions that were more than 90% correct were achieved between one Roman script and another Roman script, and between one non-Roman script and another non-Roman script. Transcriptions between a Roman script and a non-Roman script yielded output that was only 50% correct. The system is still being tested and improved by linguists and volunteers from various organizations associated with the target community in Thailand, Laos, Vietnam and the USA. This study demonstrates the potential of this approach for developing written materials in languages with multiple scripts, and it provides useful insights into how this technology might be improved. en
dc.language.iso en_US en
dc.publisher Indiana University South Bend en
dc.subject Web services. en
dc.subject Indiana University South Bend--Dissertations. en
dc.subject Dissertations, Academic--Indiana--South Bend. en
dc.title Development of a Web-Based Service to Transcribe Between Multiple Orthographies of the Iu Mien Language en
dc.type Thesis en

