Determining Context: Clustering, N-grams, Part-of-speech-based Extraction, Themes, and Facets

Context Extraction

The “bread and butter” text analytics operations answer three questions: “who is being discussed”, “what is the context of the conversation”, and “what is the tone/sentiment of that conversation”. This paper is specifically about the techniques available for determining context.

It turns out that nouns and noun phrases are the most generically useful parts of speech with which to determine context. To be even more specific, it’s the nouns that you’re not getting to through entity extraction. Entity extraction deals (roughly speaking) with proper nouns. Here we’re considering the nouns and noun phrases that, largely, are not proper nouns. Some proper nouns may still be picked up as part of the “noun analysis” that you’re doing, but only because they were not identified as entities.

Consider the following sentence. It’s politically controversial, but gives a good example of how important it is to separately recognize the entities and the context.

President Barack Obama did a great job with that awful oil spill.

Entity extraction will give you “President Barack Obama” as a person. Sentiment analysis will note a positive sentiment pointed back towards the person “President Barack Obama”. However, without understanding the additional nouns, you’ll have no idea of the context in which President Barack Obama is receiving praise.

And so, other than a vague positive sentiment, you don’t really know anything. Compare that with knowing that some author (or someone being quoted by some author) is giving a thumbs up to President Barack Obama’s mad oil spill handling skillz.

Extracting non-entity phrases is an excellent next step to greater understanding of the content.
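To make the oil spill example concrete, here is a minimal Python sketch of splitting a sentence into entity candidates and non-entity context phrases. It is an illustration under stated assumptions, not the method of any particular product: the part-of-speech tags are assigned by hand (a real pipeline would get them from a tagger), the function name `split_phrases` is hypothetical, and the rule used (runs of proper nouns are entities, adjectives followed by common nouns are context) is a deliberate simplification.

```python
# Hand-assigned Penn Treebank-style tags. This is an assumption made for
# illustration; a real pipeline would obtain tags from a POS tagger.
TAGGED = [
    ("President", "NNP"), ("Barack", "NNP"), ("Obama", "NNP"),
    ("did", "VBD"), ("a", "DT"), ("great", "JJ"), ("job", "NN"),
    ("with", "IN"), ("that", "DT"), ("awful", "JJ"), ("oil", "NN"),
    ("spill", "NN"), (".", "."),
]


def split_phrases(tagged):
    """Return (entities, context_phrases) from a list of (word, tag) pairs.

    Simplified rule: a run of proper nouns (NNP) is an entity candidate;
    zero or more adjectives (JJ) followed by one or more common nouns (NN)
    is a context phrase. Everything else is skipped.
    """
    entities, context = [], []
    i = 0
    while i < len(tagged):
        tag = tagged[i][1]
        if tag == "NNP":
            j = i
            while j < len(tagged) and tagged[j][1] == "NNP":
                j += 1
            entities.append(" ".join(word for word, _ in tagged[i:j]))
            i = j
        elif tag in ("JJ", "NN"):
            j = i
            while j < len(tagged) and tagged[j][1] == "JJ":
                j += 1
            k = j
            while k < len(tagged) and tagged[k][1] == "NN":
                k += 1
            if k > j:
                # At least one noun follows the adjectives: keep the phrase.
                context.append(" ".join(word for word, _ in tagged[i:k]))
                i = k
            else:
                # Adjectives with no noun after them: skip past them.
                i = max(j, i + 1)
        else:
            i += 1
    return entities, context


entities, context = split_phrases(TAGGED)
print(entities)  # ['President Barack Obama']
print(context)   # ['great job', 'awful oil spill']
```

The entity alone tells you who is being discussed; “great job” and “awful oil spill” are the phrases that tell you what the praise is about.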

We're going to talk about five computational techniques for extracting these contextual phrases: clustering, N-grams, noun-phrase extraction, themes, and facets.
