Package dragon.nlp.extract

Various concept extractors and relationship extractors.

See:
          Description

Interface Summary
ConceptExtractor Interface of Concept Extractors
ConceptFilter Interface of concept filter
DualConceptExtractor Interface of Dual Concept Extractor
PhraseExtractor Interface of Mutliword Phrase Extractor
TermExtractor Interface of Ontological Term Extractors
TokenExtractor Interface of token extractors
TripleExtractor Interface of triple extractors
 

Class Summary
Abbreviation Checking abbreviation for associated terms
AbstractConceptExtractor Abstract class for concept extraction which is the super class of AbstractPhraseExtractor, AbstractTermExtractor, AbstractTokenExtrator, and AbstractTripleExtractor
AbstractDualConceptExtractor Dual concept extractor for dual indexing
AbstractPhraseExtractor Abstract class for phrase extraction
AbstractTermExtractor Abstract class for UMLS term (CUI) extraction
AbstractTokenExtractor Abstract class for token extraction
AbstractTripleExtractor Abstract class for triple extraction
AttributeChecker Checking whetherterms are of attribute type
BasicConceptFilter Basic Concept Filter
BasicPhraseExtractor Phrase extraction
BasicTermExtractor Ontological term extraction
BasicTokenExtractor Token extraction
BasicTripleExtractor Triple Extraction
ClauseFinder Identify clauses in an English sentence
CnSimpleDocumentParser Title:
CompoundTermFinder Finding compond terms
CoordinatingChecker Coordinating Component Identification
CoReference Find out co-reference within one sentence or consecutive sentences
EngDocumentParser Document Parser for English Text
 

Package dragon.nlp.extract Description

Various concept extractors and relationship extractors.

Package Specification

The toolkit defines three types of concept extractors. The first is token extractor, which extracts a sequence of individual words from a sentence or a document. The second is phrase extractor, namely extracting multiword phrases from a sentence or a document. The phrase extractor needs a phrase dictionary as input; the phrase dictionary could be automatically built by phrase tools such as Xtract. The third is term extractor, which extracts ontological terms from a sentence or a document.