Part of Speech (PoS) Tagging and Sentiment Bearing Phrases

The first step in determining the tone of a document is to break it into its basic parts of speech, a process known as PoS tagging. PoS tagging is a mature technology that labels each word in a document or sentence with its grammatical role: verb, noun, adjective, adverb, and so on.

We use well-defined, well-understood techniques that tag the various parts of speech with very high accuracy.

You may not realize you’re doing it, but when you judge the sentiment of a document, you’re mentally picking out the parts of speech that indicate emotion. In most cases these are adjective-noun combinations like "horrible pitching" and "devastating loss". These combinations are called “sentiment-bearing phrases”, and our software looks for them in the same way.
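
To make the idea concrete, here is a minimal Python sketch of pulling adjective-noun pairs out of text that has already been PoS tagged. It is an illustration only, not our production code: the function name `adjective_noun_phrases`, the Penn Treebank style tags (`JJ` for adjectives, `NN` for nouns), and the hand-tagged sample sentence are all assumptions made for the example.

```python
from typing import List, Tuple

# A tagged token is a (word, tag) pair. The tags here follow the
# Penn Treebank convention, which is an assumption for this sketch.
TaggedToken = Tuple[str, str]


def adjective_noun_phrases(tagged: List[TaggedToken]) -> List[str]:
    """Return every adjacent adjective + noun pair as a lowercase phrase."""
    phrases = []
    # Walk over each token together with the token that follows it.
    for (word, tag), (next_word, next_tag) in zip(tagged, tagged[1:]):
        if tag.startswith("JJ") and next_tag.startswith("NN"):
            phrases.append(f"{word} {next_word}".lower())
    return phrases


# Hand-tagged sample sentence (hypothetical; not output from a real tagger).
tagged_sentence = [
    ("Horrible", "JJ"), ("pitching", "NN"), ("led", "VBD"), ("to", "TO"),
    ("a", "DT"), ("devastating", "JJ"), ("loss", "NN"), (".", "."),
]

print(adjective_noun_phrases(tagged_sentence))
# prints ['horrible pitching', 'devastating loss']
```

In practice the tagged tokens would come from a real PoS tagger, and the pattern would likely cover more than simple two-word adjective-noun pairs.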

The difference, of course, is that our software needs to assign an actual number to the sentiment, whereas you, as a reader, just think “darn, they lost again”. To do this, we have built a very large dictionary of sentiment-bearing phrases and their relative scores. Each score is determined in advance by how frequently the phrase occurs near a set of known good words (e.g. good, wonderful, spectacular) and near a set of known bad words (e.g. bad, horrible, awful).
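
One way such a score could be computed is sketched below in Python. This is a simplified illustration under stated assumptions, not the actual scoring formula: the function `phrase_score`, the five-word window, the smoothing constant, the log-ratio itself, and the three-sentence sample corpus are all hypothetical choices made for the example. The idea is simply to count good and bad words near each occurrence of a phrase and compare the two counts.

```python
import math
import re
from typing import List

# Seed words taken from the examples in the text above.
GOOD_WORDS = {"good", "wonderful", "spectacular"}
BAD_WORDS = {"bad", "horrible", "awful"}


def phrase_score(phrase: str, documents: List[str],
                 window: int = 5, smoothing: float = 0.5) -> float:
    """Log-ratio of good-word to bad-word hits near a phrase (assumed formula).

    Positive means the phrase tends to appear near good words,
    negative means it tends to appear near bad words.
    """
    phrase_tokens = re.findall(r"[a-z']+", phrase.lower())
    n = len(phrase_tokens)
    good_hits = 0
    bad_hits = 0
    for doc in documents:
        tokens = re.findall(r"[a-z']+", doc.lower())
        for i in range(len(tokens) - n + 1):
            if tokens[i:i + n] != phrase_tokens:
                continue
            # Words within `window` positions on either side of the phrase,
            # excluding the phrase itself.
            context = tokens[max(0, i - window):i] + tokens[i + n:i + n + window]
            good_hits += sum(t in GOOD_WORDS for t in context)
            bad_hits += sum(t in BAD_WORDS for t in context)
    # Smoothing avoids division by zero and log of zero for rare phrases.
    return math.log2((good_hits + smoothing) / (bad_hits + smoothing))


# Tiny made-up corpus, for illustration only.
docs = [
    "The devastating loss was awful and the crowd felt bad.",
    "A spectacular win followed by wonderful news.",
    "Another devastating loss made for a horrible night.",
]

print(phrase_score("devastating loss", docs))  # negative score
print(phrase_score("spectacular win", docs))   # positive score
```

A phrase that never appears in the corpus gets a score of exactly zero with this formula, which is one reason a very large corpus and dictionary matter.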
