#ifndef _STRUS_ANALYZER_TOKENIZER_FUNCTION_INSTANCE_INTERFACE_HPP_INCLUDED
#define _STRUS_ANALYZER_TOKENIZER_FUNCTION_INSTANCE_INTERFACE_HPP_INCLUDED
virtual std::vector<analyzer::Token> tokenize(const char* src, std::size_t srcsize) const = 0;
Structure describing a token in the document by its start position and size.
Structure describing the internal representation of a normalizer/tokenizer/aggregator function in the...
Definition: functionView.hpp:29
Interface for tokenization.
Definition: tokenizerFunctionInstanceInterface.hpp:21
virtual bool concatBeforeTokenize() const =0
Flag defined by tokenizer indicating that different segments defined by the tag hierarchy should be c...
virtual std::vector< analyzer::Token > tokenize(const char *src, std::size_t srcsize) const =0
Tokenize a segment into a list of tokens.
virtual analyzer::FunctionView view() const =0
Get the definition of the function as structure for introspection.
virtual ~TokenizerFunctionInstanceInterface()
Destructor.
Definition: tokenizerFunctionInstanceInterface.hpp:25
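The members listed above suggest how a concrete tokenizer might look. The following is a minimal sketch only, not the library's implementation: SketchToken and SketchTokenizerInterface are hypothetical stand-ins for analyzer::Token and TokenizerFunctionInstanceInterface, reduced to what this page describes (a token given by start position and size, plus the tokenize and concatBeforeTokenize methods). The view() method is left out because the layout of analyzer::FunctionView is not shown here, and the whitespace splitting rule is an assumption chosen for illustration.

```cpp
#include <cassert>
#include <cctype>
#include <cstddef>
#include <vector>

// Hypothetical stand-in for analyzer::Token: only the start position and
// size described on this page. The real class may carry more fields.
struct SketchToken
{
    std::size_t pos;   // byte offset of the token in the segment
    std::size_t size;  // length of the token in bytes
};

// Hypothetical stand-in mirroring two of the interface's pure virtual methods.
class SketchTokenizerInterface
{
public:
    virtual ~SketchTokenizerInterface() {}
    virtual bool concatBeforeTokenize() const = 0;
    virtual std::vector<SketchToken> tokenize(const char* src, std::size_t srcsize) const = 0;
};

// Illustrative implementation: splits a segment on ASCII whitespace.
class WhitespaceTokenizer : public SketchTokenizerInterface
{
public:
    // Each segment is tokenized on its own; no concatenation is requested.
    bool concatBeforeTokenize() const override { return false; }

    std::vector<SketchToken> tokenize(const char* src, std::size_t srcsize) const override
    {
        assert(src != nullptr || srcsize == 0);
        std::vector<SketchToken> tokens;
        std::size_t i = 0;
        while (i < srcsize)
        {
            // Skip the whitespace run preceding the next token.
            while (i < srcsize && std::isspace(static_cast<unsigned char>(src[i]))) ++i;
            const std::size_t start = i;
            // Advance to the end of the token.
            while (i < srcsize && !std::isspace(static_cast<unsigned char>(src[i]))) ++i;
            if (i > start) tokens.push_back(SketchToken{start, i - start});
        }
        return tokens;
    }
};
```

In this sketch the segment is passed as a pointer plus length rather than a null-terminated string, matching the signature of tokenize above, so the loop never reads past srcsize.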