Revision history for Perl extension Alvis::NLPPlatform.

0.6
        - Word segmentation: the non-breaking space character (\xA0) was missing
        - Bug fix in the Yatea wrapper, when producing the Yatea input
        - Bug fix in the Alvis document reader (Alvis::NLPPlatform::Document::get_langage)
        - Workaround for a missing feature in Module::Info: if no prefix is set when running "perl Build.PL", the additional directories 'etc' and 'conf' are not installed (or not installed correctly)
        - The location of the rc file (yatea.rc) is set automatically during configuration (in Build.PL)
        - In standalone mode, annotated documents are printed right after processing (and no longer kept in memory)
        - Change in the default term tagger wrapper to take into account carriage returns within a sentence (in case of dirty text)
        - Corrections in the LICENSE file
        - Semantic tags provided by the default term tagger are integrated at the semantic features level
        - Bug fix in the argument management of the script ogmios-nlp-standalone
        - On some OSes, Config::General returns while setting variables because yatea.rc is in read-only mode

0.5
        - Addition of missing packages in the installation file (Build.PL)
        - Switch from Makefile.PL to Build.PL
        - Correction in the Yatea wrapper, in the handling of the output file

0.4
        - Correction in the sigint handler function: nlp_host and nlp_port are now declared as global
        - Correction in TermTagging: the language switch is now properly taken into account
        - Correction in the management of the ".proc_id" file
        - Correction in the computation of the XML rendering time (the variable is set to zero ;-)
        - stderr of the NLP tools, when they are called, is redirected to a log file
        - Addition of a DEBUG variable defining a debug mode (temporary files are not removed)
        - alvis-nlp-standalone can read a file given as an argument or from the STDIN stream
        - Documentation of the modules and scripts is gathered at the end of each file
        - Addition of DTD and XSD files in the documentation (etc directory)
        - Additional functionality: loading files in various formats (PDF, LaTeX, Word, etc.) before carrying out linguistic annotations
        - Addition of the modules Alvis::NLPPlatform::Convert and Alvis::NLPPlatform::Document for converting files in various formats to ALVIS XML
        - Definition of ogmios-standalone, ogmios-nlp-server, ogmios-nlp-client: annotation scripts for various formats
        - Improvement in the sentence segmentation: sectioning is taken into account (!)
        - Addition of a Build.PL file
        - Empty markups can now be loaded
        - Better management of UTF8 (use of the Encode module)
        - Various fixes and optimizations
        - Yatea wrapper: new variable to choose whether or not to get a Yatea XML output
        - Yatea wrapper: addition of the Yatea output to the XML output of the platform
        - Bug fixes
        - Rewrite of the TreeTagger wrapper, still using hash_words_punct but less complex
        - Modification in the bioLG wrapper: options are set in the XML form
        - Integration of the cleaning of the bioLG output into the code
        - Corrections in the LGbio wrapper
        - Output data can be stored in a descriptor or a scalar
        - Addition of the constituents in the BioLG wrapper (UserNLPWrapper.pm)
        - Addition of examples
        - Better management of the options (if they are not set)
#       - Bad hack for the quick integration of the semantic tagging
#         (tool SemanticTypeTagger) ** COMMENTED CODE **

0.3
        - Additional options for the link parser wrapper, to write link parser postscript output (PARSING_IN_P0STSCRIPT) and/or link parser graphics output (PARSING_GRAPHICS) to a file
        - Bug fix in the default term wrapper (a term embedded in a named entity was not detected)
        - Bug fix in the default syntactic parser: empty sentences are now taken into account when parsing
        - Modification of the TermTagger: the term list is loaded once
        - Display of the processing time for each step
        - Bug fix in the XML loader of semantic units/named entities
        - Definition of a section to manage XML input: the option PRESERVEWHITESPACE is set in it; addition of the option LINGUISTIC_ANNOTATION_LOADING
        - Definition of a section to manage XML output
        - Render time is saved in the XML file (client/server and standalone modes)

0.2
        - Renamed client.pl, server.pl, go.pl to alvis-nlp-client, alvis-nlp-server, alvis-nlp-standalone
        - Set the default parsing wrapper (forgotten in version 0.1)
        - Configuration files have been moved into alvis-nlpplatform
        - Wrote functions printing the header and footer of the documentCollection
        - Management of the annotation of several documents in standalone mode
        - Correction in the term wrapper, in the generation of the semantic units
        - Correction in the default syntactic wrapper, in the generation of the head and modifier elements. Corrections also applied to the syntactic wrapper in the UserNLPWrapper module
        - The default term tagger wrapper takes into account the given canonical form of terms if the term tagger returns it
        - Modified dependencies for syntactic parsing (now needs PoS tagging)
        - Control characters are not visible
        - Addition of an option to preserve or not the white spaces (XML blank nodes, but also spaces at the beginning and end of lines). The default behaviour is to remove the white spaces.
        - Addition of a patch for Link Grammar Parser 4.1a and 4.1b. This patch adds an option (!whitespaces) to avoid LG tokenization.

0.1
        - Version with CPAN configuration