strusAnalyzer  0.17
Classes | Public Member Functions | List of all members
strus::PosTaggerDataInterface Class Referenceabstract

Interface for the data built by a POS tagger. More...

#include <posTaggerDataInterface.hpp>

Classes

class  Element
 Output element declaration for POS tagging. More...
 

Public Member Functions

virtual ~PosTaggerDataInterface ()
 
virtual void declareIgnoredToken (const std::string &value)=0
 Declare a token to be ignored in the document elements, if it does not match. More...
 
virtual void insert (int docno, const std::vector< Element > &sequence)=0
 Add a tagged text chunk. More...
 
virtual void markupSegment (TokenMarkupContextInterface *markupContext, int docno, int &docitr, const SegmenterPosition &segmentpos, const char *segmentptr, std::size_t segmentsize) const =0
 Get a text chunk tagged. More...
 

Detailed Description

Interface for the data built by a POS tagger.

Constructor & Destructor Documentation

virtual strus::PosTaggerDataInterface::~PosTaggerDataInterface ( )
inlinevirtual

Member Function Documentation

virtual void strus::PosTaggerDataInterface::declareIgnoredToken ( const std::string &  value)
pure virtual

Declare a token to be ignored in the document elements, if it does not match.

Parameters
[in]valuevalue of the token
Remarks
e.g. delimiter that might or might not be part of document segmentation because it belongs to the set of tokens potentially added in the input creation as delimiter.
virtual void strus::PosTaggerDataInterface::insert ( int  docno,
const std::vector< Element > &  sequence 
)
pure virtual

Add a tagged text chunk.

Parameters
[in]sequencetagged text chunk
virtual void strus::PosTaggerDataInterface::markupSegment ( TokenMarkupContextInterface markupContext,
int  docno,
int &  docitr,
const SegmenterPosition segmentpos,
const char *  segmentptr,
std::size_t  segmentsize 
) const
pure virtual

Get a text chunk tagged.

Parameters
[in]markupContextdocument context to do the markup of the POS tags
[in]docnodocument number of the segment to tag
[in,out]docitriterator in the document (initial value 0)
[in]segmentposposition of the segment in the original source
[in]segmentptrpointer to content of segment to tag
[in]segmentsizesize of content of segment to tag in bytes

The documentation for this class was generated from the following file: