strusAnalyzer
0.17
|
strus | Strus toplevel namespace |
analyzer | Analyzer parameter and return value objects namespace |
AggregatorView | Structures describing the internal representation of an aggregator function in a document analyzer |
ContentStatisticsElementView | Structure describing the internal representation of a content statistics library element |
ContentStatisticsItem | Defines an item describing the statistics in a collection |
ContentStatisticsResult | Defines the content statistics result from a collection sample |
ContentStatisticsView | Structure describing the internal representation of a content statistics library |
Document | Structure of a document created as result of a document analysis |
DocumentAnalyzerMapElementView | Structure describing the internal representation of one element of a document analyzer map for introspection |
DocumentAnalyzerMapView | Structure describing the internal representation of a document analyzer map for introspection |
DocumentAnalyzerView | Structure describing the internal representation of a document analyzer for introspection |
DocumentAttribute | Structure describing a document attribute |
DocumentClass | Defines a description of the properties of an original document processed by the segmenter |
DocumentMetaData | Structure describing a document meta data element |
DocumentTerm | Structure describing a typed document term |
FeatureOptions | Options to stear the creation of terms in the analyzer |
FeatureView | Structure describing the internal representation of a feature in the document analyzer |
FunctionView | Structure describing the internal representation of a normalizer/tokenizer/aggregator function in the analyzer |
is_atomic | Conditional for atomic type |
PatternLexem | Structure describing a token with an id used for pattern matching |
PatternMatcherResult | Structure desribing a result of a token pattern matcher |
PatternMatcherResultItem | Result item structure of a pattern match result |
PatternMatcherStatistics | Object descriping the statistics of a token pattern match run for runtime analysis |
Item | Statistics item |
Position | Structure describing a position in a document source by segment and offset |
QueryAnalyzerView | Structure describing the internal representation of a document analyzer for introspection |
QueryElementView | Structure describing the internal representation of a feature in the document analyzer |
QueryTerm | Structure describing a typed query term |
QueryTermExpression | Expression of a query terms as result of a query analysis |
Instruction | Query instruction |
SegmenterOptions | Options to stear the segmenter behaviour or the structure of input if not self defined in the documents processed |
SubContentDefinitionView | Structure describing the internal representation of a sub content definition in the document analyzer |
SubDocumentDefinitionView | Structure describing the internal representation of a sub document definition in the document analyzer |
Token | Structure describing a token in the document by its start and end position |
TokenMarkup | Structure defining an annotation of text in a document |
Attribute | Structure describing a document markup attribute |
AggregatorFunctionInstanceInterface | Interface for a parameterized aggregator function |
AggregatorFunctionInterface | Interface for the aggregator function constructor |
AnalyzerObjectBuilderInterface | Interface providing a mechanism to create complex multi component objects for the document and query analysis in strus |
ContentIteratorInterface | Defines an iterator on content provided by a segmenter |
ContentStatisticsContextInterface | Defines a program for analyzing a document, splitting it into normalized terms that can be fed to the strus IR engine |
ContentStatisticsInterface | Defines a program for analyzing a document, splitting it into normalized terms that can be fed to the strus IR engine |
DocumentAnalyzerContextInterface | Defines the context for analyzing multi part documents, iterating on the sub documents defined, splitting them into normalized terms that can be fed to the strus IR engine |
DocumentAnalyzerInstanceInterface | Defines a program for analyzing a document, splitting it into normalized terms that can be fed to the strus IR engine |
DocumentAnalyzerMapInterface | Defines a program for analyzing a document, splitting it into normalized terms that can be fed to the strus IR engine |
DocumentClassDetectorInterface | Defines a detector that returns a content description for a document content it recognizes |
TagAttributeMarkupInterface | |
DocumentTagMarkupDef | |
PatternResultFormatContext | Context for mapping result format strings (allocator,maps,etc.) |
PatternResultFormatVariableMap | Interface to map variables to a pointer to string |
PatternResultFormatTable | Parser for result format strings |
PatternResultFormatChunk | Single chunk of a result format for iterating ans build the pattern match result |
PatternResultFormatMap | Result format for the output of pattern match results with names of members as variables in curly brackets '{' '}' |
PatternSerializer | Object with all interfaces needed for serialization |
NormalizerFunctionInstanceInterface | Interface for a parameterized normalization function |
NormalizerFunctionInterface | Interface for the normalizer constructor |
PatternLexerContextInterface | Interface for detecting lexems used as basic entities by pattern matching in text |
PatternLexerInstanceInterface | Interface for building the automaton for detecting lexems used as basic entities by pattern matching in text |
PatternLexerInterface | Interface for instantiating the data structure of an automaton for detecting lexems used as basic entities by pattern matching in text |
PatternMatcherContextInterface | Interface for detecting patterns (structures formed by atomic tokens) in one document |
PatternMatcherInstanceInterface | Interface for building the automaton for detecting patterns in text |
PatternMatcherInterface | Interface for creating an automaton for detecting patterns of tokens in a document stream |
PatternTermFeederInstanceInterface | Instance interface for defining a mapping of terms of the document analysis outout as lexems used as basic entities by pattern matching |
PatternTermFeederInterface | Interface for instantiating the data structure of an automaton for detecting lexems used as basic entities by pattern matching in text |
PosTaggerContextInterface | Context to markup documents with tags derived from POS tagging |
PosTaggerDataInterface | Interface for the data built by a POS tagger |
Element | Output element declaration for POS tagging |
PosTaggerInstanceInterface | Interface to define a POS tagger instance for creating the input for POS tagging to build the data and to create to context for tagging with the data build from the POS tagging output |
PosTaggerInterface | Interface for the construction of a POS tagger instance for a specified segmenter |
QueryAnalyzerContextInterface | Defines the context for analyzing queries for the strus IR engine |
QueryAnalyzerInstanceInterface | Defines a program for analyzing chunks of a query |
SegmenterContextInterface | Defines the context for segmenting one document |
SegmenterInstanceInterface | Defines a program for splitting a source text it into chunks with an id correspoding to a selecting expression |
SegmenterInterface | Defines an interface for creating instances of programs for document segmentation |
SegmenterMarkupContextInterface | Defines the context for inserting markups into one document |
TextProcessorInterface | Interface for the object providing tokenizers and normalizers used for creating terms from segments of text and functions for collecting overall document statistics |
TokenizerFunctionInstanceInterface | Interface for tokenization |
TokenizerFunctionInterface | Interface for a tokenizer function |
TokenMarkupContextInterface | Interface for annotation of text in one document |
TokenMarkupInstanceInterface | Interface for building the automaton for detecting patterns of tokens in a document stream |