Recent

Blogs.

We write regularly about different terminology and jargon you’ll hear in the text analytics industry. Join us in our blog to make the complex, simple.

Custom text analysis engine

A custom text analysis engine is a software application that is designed to process and analyze text data for a specific purpose.…

Corpus topic probabilities

Corpus topic probabilities refers to the probability that a given topic will be present in a document, based on the presence of…

Common communication layer

The common communication layer is an industry-specific term that refers to a software component that allows different text analytics applications to exchange…

Common analysis structure consumer

The term common analysis structure consumer is used to describe someone who reads and analyzes texts for the purpose of extracting information.…

Common analysis structure

The common analysis structure, or CAS, is a framework that is used to organize and interpret text data. This framework is particularly…

Character normalization

Character normalization is the process of converting different character forms into a single form. The purpose of this is to reduce the…

Boost Class

The term “boost class” is used in the text analytics industry to refer to a group of terms that are given extra…

Bi Variant

The term “Bi Variant” is used in the text analytics industry to mean a word or phrase that has two different possible…

Base annotators

The term “base annotators” is used to refer to the algorithms or models that are used to provide the initial annotations for…

Bag-of-n-grams model

The Bag-of-n-grams model is a statistical language model for text prediction that is used in the text analytics industry. This model predicts…

Open Computer Vision

Open Computer Vision and programming functions are tools for text analytics. OpenCV is an open source computer vision and machine learning software…

Normalized form

Normalized form is defined as a process that is used to convert data into a standard format. This standard format allows for…
This is a staging enviroment