| OVERVIEW | CRCL | CALLS | DICTS | FONTS | SOFTWARE | PAPERS | PROJECTS | WHO?... | LISTS | SPOKEN... | REF CARDS | SEEKING | BASICS... | HOW?... | CLOCKS | LOCAL | CONTENTS... |
All papers relevant to the Southeast Asian linguistics and computing community are welcome. Authors seeking opinions or constructive criticism are encouraged to submit papers-in-progress, and requests along the lines of:
To submit papers, please contact doug@nwg.nectec.or.th When you save your paper, be sure to use a plain, vanilla, PostScript driver -- install the plain 'PostScript Printer' driver if you're using Windows. Also, make sure that all non-standard fonts are embedded as part of the paper. It is wise to assume paper no larger than 8"X11". Please include a plain text file that describes your paper as below. The ABSTRACT entry, below, may be a plain text file (*.txt), or -- to allow foreign characters and illustrations -- a PostScript (*.ps) or GIF (*.gif) file. To make a gif, I recommend displaying the abstract in your favorite wsywyg word processor, then using a screen capture utility (the one in PaintShop Pro works great) to grab it. Your milage may vary, but in Microsoft Word, a 5 inch text line displayed at 100% is 600 pixels wide, which is ideal. Reduce to 2 colors, and save as a non-interlaced gif. See the how? page for more details. THAIOCR.ZIP TITLE -- Fuzzy Letters and Thai Optical Character Recognition FORMAT -- 239K, 12 pages, zipped postscript. WHO ---Doug Cooper, Computational Linguistics Research Center, Bangkok PUBLISHED -- Symposium on Natural Language Processing '95, Kasetsart University, Bangkok. ABOUT -- This paper discusses a strategy -- narrow down the possibilities, then post-process -- for Thai OCR. CONTACT -- doug@nwg.nectec.or.th ABSTRACT -- Available as viewable gif (16K) THAIFDES.ZIP TITLE -- Font Design for Thai/English Typesetting FORMAT -- 304K, 12 pages, zipped postscript. WHO --- Doug Cooper, Computational Linguistics Research Center, Bangkok PUBLISHED -- Symposium on Natural Language Processing '95, Kasetsart University, Bangkok. ABOUT -- Describes some of the conceptual and technical problems that arise in the design of two-alphabet fonts. CONTACT -- doug@nwg.nectec.or.th ABSTRACT -- Available as viewable gif (11K). TELLTHAI.ZIP TITLE -- How Do Thais Tell Letters Apart? FORMAT -- 483K, 17 pages, zipped postscript. This is a DRAFT -- please report bugs. WHO --- Doug Cooper, Center for Research in Computational Linguistics, Bangkok PUBLISHED -- to appear in Pan-Asiatic Linguistics '96 / 4th Int'l Symposium on Language and Linguistics. ABOUT -- Describes the secondary characteristics of Thai letters, and discusses implications for teaching and OCR. CONTACT -- doug@nwg.nectec.or.th ABSTRACT -- Available as viewable gif (19K) MIXFONT.ZIP TITLE -- As Easy As K. Kay -- A Brief Guide to Mixing Thai, Roman, and Transcribed Text FORMAT -- 165K, 8 pages, zipped postscript. WHO --- Doug Cooper, Computational Linguistics Research Center, Bangkok PUBLISHED -- Submitted to Journal of Language and Linguistics, Thammasat University. DRAFT: Please comment. ABOUT -- Advice on using Thai/Roman and SIL IPA fonts for high-quality text output. Contains many samples. CONTACT -- doug@nwg.nectec.or.th ABSTRACT -- Available as viewable gif (12k). THAISORT.ZIP TITLE -- How to Sort Thai Without Rewriting Sort FORMAT -- 136K, 6 pages, zipped postscript. WHO --- Doug Cooper, Center for Research in Computational Linguistics, Bangkok PUBLISHED -- DRAFT: Please comment. ABOUT -- Presents a method -- using ASCII signatures -- for using standard sort code to lexically order Thai. CONTACT -- doug@nwg.nectec.or.th ABSTRACT -- Available as viewable gif (16K) SOUNDSRT.ZIP TITLE -- Sorting by Sound -- Arbitrary Lexical Ordering for Transcribed Thai Text FORMAT -- 176K, 10 pages, zipped postscript. WHO --- Doug Cooper, Center for Research in Computational Linguistics, Bangkok PUBLISHED -- To appear in 10th Pacific Asia Conference on Language, Information and Computing. This is an extended version. ABOUT -- Presents a method -- using phonemic signatures -- for using standard sort code to create arbitrary lexical orders for transcribed text. CONTACT -- doug@nwg.nectec.or.th ABSTRACT -- Available as viewable gif (16K)
All original work © 1995 Doug Cooper. Please see this disclaimer, which takes responsibility for content, and the release notice, which gives you the right to copy it. We believe that all files referenced by these pages may be distributed for research / educational purposes. If any file should not be distributed, please let us know and we will remove it. |