border
border OVERVIEW | CRCL | CALLS | DICTS | FONTS | SOFTWARE | PAPERS | PROJECTS | WHO?... | LISTS | SPOKEN... | REF CARDS | SEEKING | BASICS... | HOW?... | CLOCKS | LOCAL | CONTENTS...

CRCL logo SOUTHEAST ASIAN
COMPUTING AND LINGUISTICS

Produced by Doug Cooper / Center for Research in Computational Linguistics,
Bangkok. Presented in cooperation with . . . ( 0.2 -- comment only )


RESEARCH PAPERS

border All papers relevant to the Southeast Asian linguistics and computing community are welcome. Authors seeking opinions or constructive criticism are encouraged to submit papers-in-progress, and requests along the lines of:
    IN PROGRESS -- Seeking comments about relevance of this work to Mon-Khmer languages. Am I on to something?
Please give your colleagues a hand by reading and commenting.

To submit papers, please contact doug@nwg.nectec.or.th

When you save your paper, be sure to use a plain, vanilla, PostScript driver -- install the plain 'PostScript Printer' driver if you're using Windows. Also, make sure that all non-standard fonts are embedded as part of the paper. It is wise to assume paper no larger than 8"X11". Please include a plain text file that describes your paper as below.

The ABSTRACT entry, below, may be a plain text file (*.txt), or -- to allow foreign characters and illustrations -- a PostScript (*.ps) or GIF (*.gif) file. To make a gif, I recommend displaying the abstract in your favorite wsywyg word processor, then using a screen capture utility (the one in PaintShop Pro works great) to grab it. Your milage may vary, but in Microsoft Word, a 5 inch text line displayed at 100% is 600 pixels wide, which is ideal. Reduce to 2 colors, and save as a non-interlaced gif. See the how? page for more details.


THAIOCR.ZIP
TITLE -- Fuzzy Letters and Thai Optical Character Recognition
FORMAT -- 239K, 12 pages, zipped postscript.
WHO ---Doug Cooper, Computational Linguistics Research Center, Bangkok
PUBLISHED -- Symposium on Natural Language Processing '95, Kasetsart University, Bangkok.
ABOUT -- This paper discusses a strategy -- narrow down the possibilities, then post-process -- for Thai OCR.
CONTACT -- doug@nwg.nectec.or.th
ABSTRACT -- Available as viewable gif (16K)
THAIFDES.ZIP
TITLE -- Font Design for Thai/English Typesetting
FORMAT -- 304K, 12 pages, zipped postscript.
WHO --- Doug Cooper, Computational Linguistics Research Center, Bangkok
PUBLISHED -- Symposium on Natural Language Processing '95, Kasetsart University, Bangkok.
ABOUT -- Describes some of the conceptual and technical problems that arise in the design of two-alphabet fonts.
CONTACT -- doug@nwg.nectec.or.th
ABSTRACT -- Available as viewable gif (11K).
TELLTHAI.ZIP
TITLE -- How Do Thais Tell Letters Apart?
FORMAT -- 483K, 17 pages, zipped postscript. This is a DRAFT -- please report bugs.
WHO --- Doug Cooper, Center for Research in Computational Linguistics, Bangkok
PUBLISHED -- to appear in Pan-Asiatic Linguistics '96 / 4th Int'l Symposium on Language and Linguistics.
ABOUT -- Describes the secondary characteristics of Thai letters, and discusses implications for teaching and OCR.
CONTACT -- doug@nwg.nectec.or.th
ABSTRACT -- Available as viewable gif (19K)
MIXFONT.ZIP
TITLE -- As Easy As K. Kay -- A Brief Guide to Mixing Thai, Roman, and Transcribed Text
FORMAT -- 165K, 8 pages, zipped postscript.
WHO --- Doug Cooper, Computational Linguistics Research Center, Bangkok
PUBLISHED -- Submitted to Journal of Language and Linguistics, Thammasat University. DRAFT: Please comment.
ABOUT -- Advice on using Thai/Roman and SIL IPA fonts for high-quality text output. Contains many samples.
CONTACT -- doug@nwg.nectec.or.th
ABSTRACT -- Available as viewable gif (12k).
THAISORT.ZIP
TITLE -- How to Sort Thai Without Rewriting Sort
FORMAT -- 136K, 6 pages, zipped postscript.
WHO --- Doug Cooper, Center for Research in Computational Linguistics, Bangkok
PUBLISHED -- DRAFT: Please comment.
ABOUT -- Presents a method -- using ASCII signatures -- for using standard sort code to lexically order Thai.
CONTACT -- doug@nwg.nectec.or.th
ABSTRACT -- Available as viewable gif (16K)
SOUNDSRT.ZIP
TITLE -- Sorting by Sound -- Arbitrary Lexical Ordering for Transcribed Thai Text
FORMAT -- 176K, 10 pages, zipped postscript.
WHO --- Doug Cooper, Center for Research in Computational Linguistics, Bangkok
PUBLISHED -- To appear in 10th Pacific Asia Conference on Language, Information and Computing. This is an extended version.
ABOUT -- Presents a method -- using phonemic signatures -- for using standard sort code to create arbitrary lexical orders for transcribed text.
CONTACT -- doug@nwg.nectec.or.th
ABSTRACT -- Available as viewable gif (16K)
OVERVIEW | CRCL | CALLS | DICTS | FONTS | SOFTWARE | PAPERS | PROJECTS | WHO?... | LISTS | SPOKEN... | REF CARDS | SEEKING | BASICS... | CLOCKS | HOW?... | LOCAL | CONTENTS...

All original work © 1995 Doug Cooper. Please see this disclaimer, which takes responsibility for content, and the release notice, which gives you the right to copy it. We believe that all files referenced by these pages may be distributed for research / educational purposes. If any file should not be distributed, please let us know and we will remove it.
red bar