Show simple item record

dc.contributor.author Batzinger, Robert P.
dc.date.accessioned 2013-09-12T15:58:14Z
dc.date.available 2013-09-12T15:58:14Z
dc.date.issued 2011-10-07
dc.identifier.uri http://hdl.handle.net/2022/16793
dc.description Thesis (M.S.)--Indiana University South Bend, 2011. en_US
dc.description.abstract The goal of this study was to explore the use of machine learning techniques in the development of a web-based application that transcribes between multiple orthographies of the same language. To this end, source text files used in the publishing of the Iu Mien Bible translation in 4 scripts were merged into a single textbase that served as a text corpus for this study. All syllables in the corpus were combined into a list of parallel renderings which were subjected to ID3 and neural networks with the back propagation in an attempt to achieve machine learning of transcription between the different Iu Mien orthographies. The most effective set of neural net transcription rules were captured and incorporated into a web-based service where visitors could submit text in one writing system and receive a webpage containing the corresponding text rendered in the other writing systems of this language. Transcriptions that are in excess of 90% correct were achieved between a Roman script and another Roman script or between a non-Roman script and another non-Roman script. Transcriptions between a Roman script and a non-Roman yield output that were only 50% correct. This system is still being tested and improved by linguists and volunteers from various organizations associated with the target community within Thailand, Laos, Vietnam and the USA. This study demonstrates the potential of this approach for developing written materials in languages with multiple scripts. This study also provides useful insights on how this technology might be improved. en_US
dc.language.iso en_US en_US
dc.publisher Indiana University South Bend en_US
dc.subject Web services. en_US
dc.subject Indiana University South Bend--Dissertations. en_US
dc.subject Dissertations, Academic--Indiana--South Bend. en_US
dc.title Development of a Web-Based Service to Transcribe Between Multiple Orthographies of the Iu Mien Language en_US
dc.type Thesis en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search IUScholarWorks


Advanced Search

Browse

My Account

Statistics