* v0.3.8 Sat Mar 16 10:58:22 CET 2013 - fixed a serious problem with the sentence splitter - added a timout-option to uplug-hunalign * v0.3.7 - Sat Jan 19 19:03:24 CET 2013 - hunalign wrapper with tokenized and lowercased text - default dictionaries for hunalign - fixed bug in tools/xces2plain - fixed opus/ru/anotate - fixed problems with malformed data handling in uplug-tag, uplug-toktag and uplug-chunk * v0.3.6 - Sat Jan 5 19:34:41 CET 2013 - added tools/xces2plain - fixed tests that require java - added dependency for DB_File and GDBM_File - added auto_install * v0.3.5 - Fri Dec 28 23:28:20 CET 2012 - added all dependencies in Makefile.PL * v0.3.4 - Mon Dec 17 17:08:09 CET 2012 - fixed several issues with installation routines - added config files for preparing OPUS data - changed handling of Uplug environment variables and standard locations - fixed call to Grok tagger/chunker - fixed word alignment test - fixed POS-tagging for Swedish with Grok * v0.3.3 - Mon Dec 10 19:14:00 CET 2012 - fixed some problems with documentation - prepared language packs and Uplug add-ons - added documentation for language packs * v0.3.2 - Fri Dec 7 20:33:14 CET 2012 - added documentation - global installation - makefile for generating documentation in different formats * v0.3.1 - Thu Dec 6 20:33:14 CET 2012 - install now compiles external tools on systems without pre-compiled binaries - improved uplug-treetagger package (automatic download/install) - added flag for fallback modules - added test if modules exist to uplug startup script - added help information to uplug via command line flags - added documentation to uplug startup script - list available modules - fixed problem with uplug-giza * v0.3 - Mon Nov 26 11:57:26 CET 2012 - re-arranging modules, scripts, config-files - include installation routines - divide Uplug in several packages (language packs etc) - integrate MaltParser and pre-trained models - new pre-compiled binaries (also for Mac OSX) - improved pre-processing pipelines for several languages - pre-trained hunpos models - integration of Europarl/Moses sentence splitter/tokenizer - (some) unit tests - moved to git and bitbucket - added Moses-style word alignment format for clue alignment output * unversioned - toktag.pl: better handling of already tokenized documents - added wrapper for alpino (dutch) tokenizer) - tools/subtitles/srt2xml: handling of formating tags ( etc) is fixed! - tools/subtitles/srtalign.pl: added check for illegal subtitle synchronization scaling factor (<=0) and made it possible to use cognates and dictionaries together for automatic synchronization - ICA: show only non-empty sentence alignments (x:0 sentence alignments don't make sense for word alignment), changed in include/xmldoc.inc - added -adj to all word alignment config files for the 'link' step * v0.2.0c and v0.2.0d - added scripts to tools/subtitles to handle (pre-process and sentence align) movie subtitle in srt format - robust conversion of encodings in tag.pl/toktag.pl/chunk.pl - added treetagger startup scripts for es and nl, replace "nbsp" to " " - robust conversion between encodings in bitext-indexer.pl/opus-indexer.pl - added startup scripts for spanish and dutch tree-tagger models - updated startup scripts for other treetagger models according to latest TreeTagger distribution - fixed hunalign (bug in converting alignment output to xml) - added missing ';' at line 40 in Uplug.pm * 13-10-2006 joerg - ISA & ICA with simple user management (multiple users, password protection) - integrated hunalign sentence aligner - integrated GMA sentence aligner - alternative sentence aligner with cognate filter: uplugalign * 26-05-2006 joerg - improved ISA & ICA (some fixes, allow empty alignments in ISA ...) - bin/wordalign.pl can bee run in server mode (hard-coded port, localhost) to serve ICA * 16-03-2006 joerg - nothing really new, serveral bug-fixes done since last time - ISA & ICA - two web interfaces for interactive alignment (in web/php) * 28-07-2005 joerg - fixed a bug in learning bootstrapped clues with linkclue.pl (target language feature strings were not produced correctly) - took away Makefile.PL (isn't used anyway and seems to confuse) (there should be a proper installation script but there is none ...) * July 2005 joerg - added web interfaces for interactive alignment (web/php) - fixed some bugs in word alignment (problems with bitext segments other than 1:1 sentence) * 21-06-2005 joerg - fixed some stupid bugs in bin/markup.pl * 20-06-2005 joerg - fixed some more bugs in word alignment configuration files - added subtree indexing for XML files (enables jumping to certain positions in the corpus) - added experimental align.php file for alignment visualization * 24-03-2005 joerg - found&fixed some bugs in systems/align/word/link and systems/align/word/test/link (shortcut definitions for dynamic clue weights) - fixed a bug in Uplug/Config.pm in CheckParam * 03-03-2005 joerg - re-implementation of Uplug.pm, Uplug/Config.pm, uplug - some minor changes in Uplug/IO* (mainly cleaning up, it's still messy though) - some minor changes in the configuration files for pre-processing and * 15-02-2005 joerg - new version of ini/DataStreams.ini - new/changed configuration files for word-alignment (basic, default, tagged, advanced) * 15-02-2005 lars - started manual - started installation script - changed Uplug.pm to call converter.pl from $UPLUGHOME * 26-10-2004 joerg - Grok source code files removed from Uplug release - added Giza.pm for running GIZA++ * 15-02-2005 joerg - giza.pl creates 2 clue-dbms when running in both alignment directions - standard word alignment settings: basic, default, tagged, advanced - some minor bug-fixes