Web lists-archives.com

Bug#898432: ITP: libai-categorizer-perl -- Automatic Text Categorization




Package: wnpp
Severity: wishlist
Owner: Laurent Baillet <laurent.baillet@xxxxxxxxx>

* Package name    : libai-categorizer-perl
  Version         : 0.09
  Upstream Author : Ken Williams <kwilliams@xxxxxxxx>
* URL             : http://search.cpan.org/~kwilliams/AI-Categorizer-0.09/lib/AI/Categorizer.pm
* License         : GPL-1+ or Artistic
  Programming Lang: Perl
  Description     : Automatic Text Categorization

AI::Categorizer is a framework for automatic text categorization.
It consists of a collection of Perl modules that implement common
categorization tasks, and a set of defined relationships among 
those modules. The various details are flexible - for example, 
you can choose what categorization algorithm to use, what 
features (words or otherwise) of the documents should be used 
(or how to automatically choose these features), what format the 
documents are in, and so on.

The basic process of using this module will typically involve 
obtaining a collection of pre-categorized documents, creating a 
"knowledge set" representation of those documents, training a 
categorizer on that knowledge set, and saving the trained 
categorizer for later use. There are several ways to carry out 
this process. The top-level AI::Categorizer module provides an 
umbrella class for high-level operations, or you may use the 
interfaces of the individual classes in the framework.

I will package it within the Debian Perl Team.