Categorization is a core function of text mining software. It's most easily explained with an example: I wonder who will win in the California mid-term congressional elections? would be classified or associated with a topic called “Politics”. If your input contains many documents similar to the above, our text mining tool will show you that your customers are very interested in politics, without you ever having to read a single document yourself. In fact, if you tune the classification models further, they’ll show just which area of politics people are discussing: in this case, the California congressional mid-terms.
Below you will find a high-level overview of categories and Lexalytics’ categorization functions. For a more in-depth explanation, read through “Categorization: Sorting Relevant Content”. You can also check out our web demo to see Lexalytics in action, or get in touch to schedule a live demo with our team of data ninjas.
Everyone categorizes content, but few text analytics tools do it as well as Lexalytics. Our categorization processes are effective, reliable, and fully customizable, capable of showing you everything from the broader picture down to the minutiae that drive informed business decisions.
Categories help you sort large volumes of text, without actually reading them. Take 10,000 consumer Tweets and classify them under politics, gaming, religion, food, or whatever else the consumers are discussing; sort through hundreds of academic papers to find the ones relevant to your research; sift through thousands of TripAdvisor reviews to see what areas of your hotel need improving. Analyzing the equivalent number of documents by hand would take thousands of man-hours; automatic categorizing of text saves you time and returns immediately-actionable results.
Lexalytics provides three powerful ways to categorize content: query topics (simple search categories), model-based classifiers (machine-learning systems), and the Lexalytics Concept Matrix, a sophisticated web of relationships and associations between words and phrases.
All three categorization systems provide powerful, reliable content categorization and are fully customizable to the user’s needs. Users can utilize Lexalytics’ pre-defined query topics and pre-trained models, or train their own model-based classifiers to sort content into whatever categories fit their business.
Our classification techniques deliver meaningful information on the themes and topics that your consumers are focusing on — so that you can act immediately, safe in the knowledge that you are making an informed decision to further your business.