Salience 5.1.1 Feature Highlight: Ingest HTML

Prior to 5.1.1, processing a web page meant stripping out all of the HTML yourself. To make our customers’ lives easier, we’ve made it so that you can feed HTML directly into Salience, and we’ll deal with all of the tags. You simply have to let the engine know that you’re feeding it an HTML document by setting the Salience option “Process HTML”.

This is great for folks who are pulling in RSS feeds or building web spiders: the engine handles all of the HTML stripping directly, so you don’t have to worry about it as part of your pipeline.
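
For a sense of what that do-it-yourself stripping step tends to look like, here is a minimal sketch in Python using only the standard library. It is not Salience code and not how the engine handles HTML internally; the names `TextExtractor` and `strip_html` are made up for this illustration, and the list of block-level tags is an assumption that a real pipeline would likely need to extend.

```python
from html.parser import HTMLParser


class TextExtractor(HTMLParser):
    """Hypothetical helper: collects visible text from an HTML document."""

    SKIPPED = {"script", "style"}  # contents of these tags are never shown to readers
    BLOCK = {"p", "div", "br", "li", "tr", "h1", "h2", "h3", "h4", "h5", "h6"}

    def __init__(self):
        super().__init__(convert_charrefs=True)  # decode entities such as &amp;
        self._chunks = []
        self._skip_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIPPED:
            self._skip_depth += 1
        elif tag in self.BLOCK:
            self._chunks.append(" ")  # keep adjacent blocks from running together

    def handle_endtag(self, tag):
        if tag in self.SKIPPED:
            if self._skip_depth > 0:
                self._skip_depth -= 1
        elif tag in self.BLOCK:
            self._chunks.append(" ")

    def handle_data(self, data):
        if self._skip_depth == 0:
            self._chunks.append(data)

    def text(self):
        # Collapse runs of whitespace left behind by the removed markup.
        return " ".join("".join(self._chunks).split())


def strip_html(html):
    """Return the plain text of an HTML string (illustrative, not exhaustive)."""
    parser = TextExtractor()
    parser.feed(html)
    parser.close()
    return parser.text()


if __name__ == "__main__":
    page = (
        "<html><head><style>p {color: red}</style></head><body>"
        "<p>Great &amp; fast <b>service</b>!</p><p>Will return.</p>"
        "<script>var x = 1;</script></body></html>"
    )
    print(strip_html(page))
```

Even a small version like this has to make decisions about scripts, entities, and whitespace, which is exactly the kind of pipeline code the “Process HTML” option is meant to spare you from writing.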

