Package dragon.ir.classification.featureselection

Feature Selectors for Text Classification or Other Related Applications.

Interface Summary
FeatureSelector Interface for feature selectors, which often work together with text classifiers.
 

Class Summary
AbstractFeatureSelector Abstract base class for feature selection
ChiFeatureSelector A Feature Selector which uses the chi-square (CHI) statistic to select top features
DocFrequencySelector Unsupervised Feature Selector which excludes features whose document frequency is less than a given threshold
InfoGainFeatureSelector A Feature Selector which uses information gain to select top features
MutualInfoFeatureSelector A Feature Selector which uses mutual information to select top features
NullFeatureSelector A Null Feature Selector
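
Two of the statistics named in the class summary can be illustrated with a small sketch. This is not the toolkit's implementation: SelectionScores and its method names are hypothetical, and the formulas follow the two-by-two contingency-table definitions of chi-square and mutual information used in the Yang and Pedersen paper cited in the package description.

```java
// Illustrative only: SelectionScores is a hypothetical helper, not a class in this package.
final class SelectionScores {
    private SelectionScores() {}

    /**
     * Chi-square statistic for a term t and a category.
     * a: documents in the category that contain t
     * b: documents outside the category that contain t
     * c: documents in the category that do not contain t
     * d: documents outside the category that do not contain t
     */
    static double chiSquare(long a, long b, long c, long d) {
        double n = a + b + c + d;
        double denominator = (double) (a + c) * (b + d) * (a + b) * (c + d);
        if (denominator == 0.0) {
            return 0.0; // an empty marginal means the table carries no evidence
        }
        double diff = (double) a * d - (double) c * b;
        return n * diff * diff / denominator;
    }

    /** Mutual information estimate log(a * n / ((a + c) * (a + b))), natural logarithm. */
    static double mutualInformation(long a, long b, long c, long d) {
        if (a == 0) {
            return Double.NEGATIVE_INFINITY; // term and category never co-occur
        }
        double n = a + b + c + d;
        return Math.log(a * n / ((double) (a + c) * (a + b)));
    }
}
```

For a table with a = 40, b = 10, c = 10, d = 40, chiSquare returns 36.0, while a table with all four cells equal returns 0.0 because the term and the category are independent. A supervised selector would typically compute such a score for every term and keep the highest-scoring ones.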
 

Package dragon.ir.classification.featureselection Description

Package Specification

To create your own feature selector, implement the FeatureSelector interface. Alternatively, extend AbstractFeatureSelector instead of coding from scratch. A feature selector can be either supervised or unsupervised. See the paper by Yang and Pedersen cited below for more details.
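
The interface-plus-abstract-base pattern described above can be sketched as follows. The method signatures of FeatureSelector are not listed on this page, so every name in the sketch (SketchSelector, AbstractSketchSelector, DocFrequencyThresholdSelector, train, isSelected, selectedCount) is a hypothetical stand-in and not the toolkit's API. The concrete class is an unsupervised selector that keeps terms whose document frequency reaches a threshold.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical stand-in for the role FeatureSelector plays; NOT the toolkit's interface.
interface SketchSelector {
    /** Chooses features from a documents-by-terms count matrix; labels may be null. */
    void train(int[][] docTermCounts, int[] labels);

    boolean isSelected(int termIndex);

    int selectedCount();
}

// Hypothetical base class: handles bookkeeping so subclasses only pick features.
abstract class AbstractSketchSelector implements SketchSelector {
    private boolean[] selected = new boolean[0];

    /** Returns the indices of the terms to keep. */
    protected abstract int[] chooseFeatures(int[][] docTermCounts, int[] labels);

    @Override
    public final void train(int[][] docTermCounts, int[] labels) {
        int termCount = docTermCounts.length == 0 ? 0 : docTermCounts[0].length;
        selected = new boolean[termCount];
        for (int term : chooseFeatures(docTermCounts, labels)) {
            selected[term] = true;
        }
    }

    @Override
    public boolean isSelected(int termIndex) {
        return termIndex >= 0 && termIndex < selected.length && selected[termIndex];
    }

    @Override
    public int selectedCount() {
        int count = 0;
        for (boolean flag : selected) {
            if (flag) {
                count++;
            }
        }
        return count;
    }
}

// Unsupervised example: keeps terms that occur in at least minDocFrequency documents.
final class DocFrequencyThresholdSelector extends AbstractSketchSelector {
    private final int minDocFrequency;

    DocFrequencyThresholdSelector(int minDocFrequency) {
        this.minDocFrequency = minDocFrequency;
    }

    @Override
    protected int[] chooseFeatures(int[][] docTermCounts, int[] labels) {
        // Labels are ignored because document frequency needs no class information.
        int termCount = docTermCounts.length == 0 ? 0 : docTermCounts[0].length;
        List<Integer> kept = new ArrayList<>();
        for (int term = 0; term < termCount; term++) {
            int docFrequency = 0;
            for (int[] doc : docTermCounts) {
                if (doc[term] > 0) {
                    docFrequency++;
                }
            }
            if (docFrequency >= minDocFrequency) {
                kept.add(term);
            }
        }
        return kept.stream().mapToInt(Integer::intValue).toArray();
    }
}
```

A supervised selector would follow the same shape but use the labels argument inside chooseFeatures, for example to score each term against each category before ranking.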

Yiming Yang and Jan O. Pedersen, A comparative study on feature selection in text categorization, Proceedings of ICML-97, 14th International Conference on Machine Learning, pp. 412-420