[Corpora-List] ANN: First public release of Morphix-NLP

From: Zhang Le (ejoy@xinhuanet.com)
Date: Tue Nov 11 2003 - 16:58:29 MET


    Hi all,
      I'm pleased to announce that the first public release of the
      Morphix-NLP Live CD is now available for download.

    What is Morphix-NLP?
    ====================
       Morphix-NLP is a Live CD Linux distribution with a rich collection of
       Natural Language Processing (NLP) applications. Though the field of
       NLP has undergone decades of intensive research, software developed in
       the NLP community is often scattered around the net and is not
       known to the larger computer user community. Consequently, most NLP
       software cannot be found in mainstream distributions even years
       after its first public release.

       The purpose of this CD is twofold:
         * First, it tries to break down the software acquisition and
           installation barrier facing many researchers and students in the
           NLP community by providing most NLP-related software on a single
           Live CD.
         * Second, the CD can be used to promote Natural Language
           Processing among average computer users. By simply inserting
           the CD into a CD drive and watching some NLP applications in
           action, most users will gain some understanding of Natural
           Language Processing and what NLP can do.

    System Requirements
    ===================
    An x86 machine with more than 96 MB of RAM and a bootable CD-ROM drive
    (running under VMware is OK).

    What is included on this CD?
    ============================
    A broad range of NLP software is included for performing common NLP
    tasks, including:

    Tokenizers:
        Qtoken, MXTERMINATOR, Chinese word segmenters...

    POS Taggers:
        Brill's TBL Tagger, MXPOST, fnTBL tagger, QTag, Tree-Tagger,
        Memory-based Tagger...

    Parsers:
        Collins' Parser, Link Parser, LoPar...

    Language Modeling Tools:
        CMU SLM toolkit, Trigger Toolkit, Ngram Statistics Package...

    Speech Software:
        Festival Speech Synthesis System

    Development Tools:
        SVM-light, Maxent, SNoW, TiMBL, fnTBL

    Other software:
        WordNet Browser 2.0, Word Concordance program (antconc), unaccent,
        and many other programs...

    All software is well tested and documented. More software will be
    included in the next release.

    The compressed ISO image is only 448 MB (with kernel 2.4, XFACE, gimp1.3,
    gcc3.2, XFree86-4...), leaving plenty of room for future extension. One
    can easily add extra personal data (demo software, corpora...) to the CD
    before burning it.

    Where to get it?
    ================
    The CD is currently available at:
    http://www.nlplab.cn/zhangle/morphix-nlp/

    Online Manual:
    http://www.nlplab.cn/zhangle/morphix-nlp/manual/

    ISO Image:
    http://f4f.ivyol.com/morphix-nlp/morphix-nlp-1.1.iso
    http://f4f.ivyol.com/morphix-nlp/morphix-nlp-1.1.iso.md5
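
    Since an .md5 file is published next to the ISO image, it is worth
    checking the download before burning it. The following is only a
    minimal sketch of one way to do that in Python, not something shipped
    on the CD. It assumes both files have been downloaded to the current
    directory and that the .md5 file uses the common md5sum layout
    ("<hex digest>  <filename>"); the function names are made up for
    illustration.

```python
import hashlib
import sys
from pathlib import Path


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in 1 MB chunks
    so that a large ISO image never has to fit in memory at once."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def read_expected_md5(md5_path):
    """Return the digest stored in a checksum file.

    Assumption: the first whitespace-separated token on the first
    non-empty line is the hex digest (md5sum-style layout).
    """
    for line in Path(md5_path).read_text().splitlines():
        if line.strip():
            return line.split()[0].lower()
    raise ValueError("no digest found in " + str(md5_path))


def iso_matches(iso_path, md5_path):
    """True if the file's MD5 digest equals the one in the checksum file."""
    return md5_of_file(iso_path) == read_expected_md5(md5_path)


if __name__ == "__main__":
    # Usage: python check_iso.py morphix-nlp-1.1.iso morphix-nlp-1.1.iso.md5
    if len(sys.argv) == 3:
        ok = iso_matches(sys.argv[1], sys.argv[2])
        print("checksum OK" if ok else "checksum MISMATCH, download again")
```

    A mismatch usually means the download was truncated or corrupted, in
    which case fetching the image again is safer than burning it.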

    Comments, suggestions and bug reports are always welcome :-)

    Have fun!

    -- 
    Zhang Le
    Natural Language Processing Lab
    Northeastern University, P.R.China
    http://www.nlplab.cn/zhangle/
    


