Lexalytics Announces Availability of Salience in Chinese
Lexalytics, a leader in text and sentiment analysis software, is pleased to announce the release of the new Chinese Native Language Pack for their Salience Engine. Chinese is the first character-based written language to be supported by Salience. Lexalytics is one of the few text analysis companies to offer a native language based approach to […]
Lexalytics is pleased to announce the release of its latest Native Language Pack for Salience in Chinese (Mandarin). Mandarin is the sixth language supported by Salience, following English, French, Spanish, Portuguese, and German.
Lexalytics is one of the few text analysis companies to offer a native language based approach to foreign language analytics. “Applying sentiment analysis in native Chinese, as opposed to first translating the text to English, gives us much more accurate and nuanced results. By eliminating that middle step, we’re able to catch things that would otherwise have been lost in translation,” explains Ori Sasson, CEO of Simulation Software and Technology (S2T).
The Chinese Native Language Pack includes all the standard Salience tools, redesigned to accomodate the unique structure of the Chinese language. This includes n-grams at the character level, rather than word level, as well as Named Entity Recognition, Text Classification, Sentiment, and everything else needed to identify the “who”, “what”, and “how” that is being discussed. The Chinese language pack currently only supports Mandarin, but does support both Simplified and Traditional characters.
“What made Chinese language support such a great challenge was the way it forced us to think outside the contraints of how our language is structured,” said Seth Redmore, Lexalytics’ VP of Product Management. “ For instance, Chinese doesn’t have any seperation between words the way English and the other languages we’ve worked with so far do. In order to solve this problem we developed composite word support, so that our software could recognize words without needing spaces or other indicators to divide them. That actually opened the doors to analyzing social media hashtags, like “#ilovefood”, which we hadn’t been able to do previously.”
The Salience Engine is the leading text processing engine that provides in-depth text analysis, categorization and classification for over 3 billion documents per day. Chinese is the sixth language to be supported by Salience, following German, Spanish, Portuguese, French, and English.
About Simulation Software and Technology
Simulation Software & Technology (S2T) Pte Ltd is a systems house based in Singapore. S2T is the leading supplier of text analytics systems in South East Asia for applications in National Security, Law Enforcement, Social Media Monitoring, and Competitive Intelligence. S2T’s GoldenSpear line of products provides customers a single platform where insights can be extracted from both social media and web sources as well as internal sources such as CRM data and internal reports. For more information visit http://www.simulation.com.sg
Lexalytics processes billions of words every day, globally, for data analytics companies and enterprise data analyst teams that need to tell powerful stories from text data. The company’s Salience®, Semantria® and Lexalytics Intelligence Platform™ products combine natural language processing with artificial intelligence to transform text in all its forms into usable data. Lexalytics solutions can be deployed on premises, in the cloud or within hybrid cloud infrastructure to reveal context-rich patterns and insights for voice of customer, voice of employee, customer experience management, market research, social listening, news monitoring and other business intelligence programs. Based in Boston, Lexalytics has offices in the United States and Canada. For more information, please visit www.lexalytics.com, email sales(at)lexalytics(dot)com or call 1-617-249-1049. Follow Lexalytics on Twitter, Facebook, and LinkedIn.