Corpuscatcher

A Catalogue of Free/Open Source Software for Translators

Jump to: navigation, search
Corpuscatcher
Category: Language Tools
Typology: Corpus processing
http://translate.sourceforge.net/wiki/corpuscatcher/index
Operating systems: Windows, GNU/Linux, Mac OS X
Requirements: Python 2.4+, mechanize module (only tested with version 0.1.7b), pysearch module (only tested with version 3.0)
Latest release: 0.1 (2008-07-27)
License: GNU General Public License v.2
Affiliation: http://translate.org.za
Available Resources
Download page: http://sourceforge.net/projects/translate/files/CorpusCatcher/
Documentation: http://translate.sourceforge.net/wiki/corpuscatcher/readme
IRC: irc://irc.freenode.net/#pootle
Project Details
Green.png
Source code repository.info.pnghttps://translate.svn.sourceforge.net/svnroot/translate/src/trunk/corpuscatcher/



From the project's web-site:
CorpusCatcher is a corpus collection toolset. It can help you to build language or topic specific corpora from publically available web resources. It was originally written to simplify the use of BootCaT (http://sslmit.unibo.it/~baroni/tools_and_resources.html), but has grown to replace the used BootCaT parts with Python ports.



You need JavaScript enabled for viewing comments