|
||||||||||
| PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
| SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD | |||||||||
java.lang.Object | +--dragon.nlp.extract.AbstractDualConceptExtractor
Dual concept extractor for dual indexing
Copyright: Copyright (c) 2005
Company: IST, Drexel University
| Field Summary | |
protected java.util.ArrayList |
firstConceptList
|
protected DocumentParser |
parser
|
protected java.util.ArrayList |
secondConceptList
|
| Constructor Summary | |
AbstractDualConceptExtractor()
|
|
| Method Summary | |
boolean |
extractFromDoc(Article article)
Extracts concepts from a semi-structured article |
boolean |
extractFromDoc(java.lang.String doc)
Extracts concepts from a raw document |
protected java.lang.String |
getArticleContent(Article article)
|
DocumentParser |
getDocumentParser()
Gets document parser. |
java.util.ArrayList |
getFirstConceptList()
Gets the first set of concepts extracted before |
java.util.ArrayList |
getSecondConceptList()
Gets the second set of concepts extracted before |
void |
initDocExtraction()
It is required to call this method before one calls extractFromDoc method. |
boolean |
isExtractionMerged()
|
void |
setDocumentParser(DocumentParser parser)
Sets the document parser for the concept extractor. |
boolean |
supportConceptEntry()
Tests if the extracted concept has an entry ID. |
boolean |
supportConceptName()
Tests if the extracted concept has a name. |
| Methods inherited from class java.lang.Object |
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
| Methods inherited from interface dragon.nlp.extract.DualConceptExtractor |
extractFromDoc, extractFromSentence |
| Field Detail |
protected java.util.ArrayList firstConceptList
protected java.util.ArrayList secondConceptList
protected DocumentParser parser
| Constructor Detail |
public AbstractDualConceptExtractor()
| Method Detail |
public boolean extractFromDoc(Article article)
DualConceptExtractor
extractFromDoc in interface DualConceptExtractorarticle - a semi-structured article for extraction
public boolean extractFromDoc(java.lang.String doc)
DualConceptExtractor
extractFromDoc in interface DualConceptExtractordoc - a raw document for extraction
public java.util.ArrayList getFirstConceptList()
DualConceptExtractor
getFirstConceptList in interface DualConceptExtractorpublic java.util.ArrayList getSecondConceptList()
DualConceptExtractor
getSecondConceptList in interface DualConceptExtractorpublic boolean isExtractionMerged()
public boolean supportConceptName()
DualConceptExtractor
supportConceptName in interface DualConceptExtractorpublic boolean supportConceptEntry()
DualConceptExtractor
supportConceptEntry in interface DualConceptExtractorpublic void initDocExtraction()
DualConceptExtractor
initDocExtraction in interface DualConceptExtractorprotected java.lang.String getArticleContent(Article article)
public DocumentParser getDocumentParser()
DualConceptExtractor
getDocumentParser in interface DualConceptExtractorpublic void setDocumentParser(DocumentParser parser)
DualConceptExtractor
setDocumentParser in interface DualConceptExtractorparser - document parser
|
||||||||||
| PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
| SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD | |||||||||