Revision history for Parse-MediaWikiDump

1.0.6 Dec 04, 2010
    * Fix for bug #58196 - error "not a MediaWiki link dump file" due to
      absence of 'LOCK TABLES ...' line in link dump file?
    * Added option to pass input to ::Pages constructor via named parameters
      so that MediaWiki::DumpFile::Compat and ::Pages share the same API
    * Software is nearly retired; only MediaWiki::DumpFile::Compat needs more
      testing. Please begin using MediaWiki::DumpFile::Compat instead of this
      package and report any problems you encounter.

1.0.5 Apr 21, 2010
    * Updated docs pointing people to MediaWiki::DumpFile::Compat

1.0.4 Jan 04, 2010
    * Fixed bug #53361: Incorrectly assigned usernames with anon edits
    * Added support for getting access to IP of anonymous edits

1.0.3 Nov 21, 2009
    * Fixed bug #51607 "Build failed CPAN smoke test for for i686pc solaris"
      by specifying minimum version numbers for all dependencies in
      Makefile.PL.

1.0.2 Nov 15, 2009
    * Fixed bug #51461 "Warnings During Build" - now the test process
      squelches the harmless warning from Test::Memory::Cycle.

1.0.1 Nov 11, 2009
    * CPAN indexer did not like previous version number

1.0.0 Nov 11, 2009
    * Stable status achieved
    * Slight speed tweak on character handler for XML parser
    * Added dependency on Devel::Cycle 1.11 as 1.10 causes a false positive
      to be thrown on the memory leak test

0.98 Oct 28, 2009
    * Bumped processing speed back up
    * Fixed possible infinite loop scenario
    * Ordered tests
    * Added test to find circular references

0.97 Oct 23, 2009
    * Fixed all known memory leaks
    * No more Object::Destroyer
    * Cleaned out some old cruft

0.96 Oct 22, 2009
    * Allowed parsing of version 0.4 XML dump files, but without support for
      the new features
    * Added a method to retrieve the version number of the XML dump file

0.95 Oct 14, 2009
    * Found and removed a circular reference but it did not stop the memory
      leak
    * Fixed bug 50092 - sometimes $page->text would return a reference to an
      undefined value
    * Implemented support for compressed file GLOB objects per bug 50241

0.94 Sep 28, 2009
    * Fix bug 49979 - "redirect in newer Wikipedia dumps" by allowing unknown
      tag names to exist

0.93 Sep 15, 2009
    * Made ::Pages a subclass of ::Revisions
    * Discovered a regression: ::Pages and ::Revisions leak memory/are not
      properly garbage collected

0.92 Apr 15, 2009
    * Completed documentation for all modules
    * Added test for backwards compatibility to the pre-factory
      Parse::MediaWikiDump interface

0.91 May 13, 2009
    * Updated documentation to more explicitly list what kind of dump files
      each parser object can deal with.
    * Added dependency on perl 5.8.8 for :utf8 compatibility.
    * Split up lib/ into multiple files.
    * Fix for bug #46054 - using categories method of
      Parse::MediaWikiDump::page object causes script to crash.

0.90 May 07, 2009
    * Implemented new parsing engine and called it
      Parse::MediaWikiDump::Revisions. Soon it will be replacing
      Parse::MediaWikiDump::Pages as a base engine. It is fully backwards
      compatible so please feel free to test it in your existing utilities
      and report success and failure to the author.
    * Moved namespace logic into Parse::MediaWikiDump::page and updated
      Parse::MediaWikiDump::Pages to support it.

0.51 May 31, 2008
    * Fix for bug 36255 "Parse::MediaWikiDump::page::namespace may return a
      string which is not really a namespace" provided by Amir E. Aharoni.
    * Moved test data into t/ and moved speed_test.pl into examples/
    * Exceedingly complicated functions (parse_head() and parse_page()) are
      not funny. Added some comments on how to rectify that situation.
    * Tightened up the tests a little bit.

0.50 Jun 27, 2006
    * Added category links parser.
    * Removed all instances of shift() from the code.

0.40 Jun 21, 2006
    * Increased processing speed by around 40%! Thank you Andrew Rodland.

0.33 Jun 18, 2006
    * Added current_byte and size methods to page dumper.

0.32 Feb 25, 2006
    * Added a line to create a package named Parse::MediaWikiDump so the
      module will get listed on CPAN search and the cpan command line tool.

0.31 Jan 10, 2006
    * Fix bug 16981 - Parse::MediaWikiDump::page->redirect does not work with
      redirects that have a : in them.
    * Fix bug 16981 part two: title with a non-breaking space in it would
      come through as undefined.

0.30 December 23, 2005
    * The Pages and Links classes now both use a method named next() to get
      the next record. The old link() and page() methods are retained for
      now but should be migrated away from as soon as is convenient.
    * Added list of dump files that this module can process to the README
      file.

0.24 December 19, 2005
    * Fixed bug #16616 - the category method only works properly on English
      language dump files.

0.23 December 19, 2005
    * Fixed email address for author.
    * Fixed omission of namespace method for pages objects in the
      documentation.
    * Added limitations section to README.
    * Fixed http://rt.cpan.org bug #16583 - Module dies when parsing the
      20051211 German Wikipedia dump.
    * Added some comments to the source code.

0.22 September 15, 2005
    * Created some new and more comprehensive examples.
    * Parse::MediaWikiDump::Pages now dies with a specific error if it is
      asked to parse a comprehensive (full pages) dump file.
    * Updated Parse::MediaWikiDump::Links to new dump file format.
    * Added tests for Parse::MediaWikiDump::Links.
    * Solved a bug: Expat's current_byte method returns a 32 bit signed
      integer and the English Wikipedia dumps cause the number to wrap;
      implemented a workaround for this.

0.21 September 10, 2005
    * Improve testing of Parse::MediaWikiDump::Pages
    * Fix silly bug related to opening file handle references
    * Found new bug: The links dump format has been changed and the existing
      code cannot parse the new format
    * Found new bug: comprehensive dump files like
      20050909_pages_full.xml.gz cause the stack to grow too large and the
      module to abort early.

0.2 September 9, 2005
    * Add tests and test data

0.1 September 6, 2005
    * First version, released on an unsuspecting world.