Weekly News Digest
October 25, 2012 — In addition to this week's NewsBreaks article and the monthly NewsLink Spotlight, Information Today, Inc. (ITI) offers Weekly News Digests that feature recent product news and company announcements. Watch for additional coverage to appear in the next print issue of Information Today. For other up-to-the-minute news, check out ITI's Twitter account: @ITINewsBreaks.
dtSearch Rolls Out More Document Filters
dtSearch Corp., a supplier of enterprise and developer text retrieval software along with document filters, announces version 7.70 of the dtSearch product line. The release improves the document filters embedded across the entire dtSearch product portfolio. For customers in need of data parsing, conversion, and extraction only, the dtSearch Engine APIs (native 64-bit/32-bit, Windows/Linux C++, Java, and .NET through 4.x) also offer the document filters for separate OEM licensing.
dtSearch’s proprietary document filters support a broad range of data types:
- “Office” documents: Microsoft Office, OpenOffice, RTF, PDF, etc.
- Emails: Microsoft Exchange, Outlook, Thunderbird, etc., all with nested attachments
- Compression formats: ZIP, RAR, GZIP/TAR, etc.
- Web-ready data: HTML, XML/XSL, and PDF, all including integrated image and text support
- Dynamic data (including image and text support): PHP, ASP.NET, SharePoint, etc.
- Databases: SQL including BLOB data (through the dtSearch Engine APIs), Microsoft Access, XBASE, XML, CSV, etc.
The document filters support parsing of all of these data types as well as text extraction and/or conversion to HTML as required for browser display with highlighted hits.
Embedded image and other enhancements: The new version also extends the document filters to add image support to Word (.doc/.docx), PowerPoint (.ppt/.pptx), Excel (.xls/.xlsx), Access (.mdb/.accdb), RTF, and email files including Thunderbird (mbox/.eml) and Outlook (.pst/.msg) files. The release displays these formats with highlighted hits in context, showing both text and now images. The release also adds support for Japanese Ichitaro documents.
Multilevel nested file enhancements: The release also increases support for documents and images in multilevel nested configurations. For example, the new version supports not only viewing images in an email file but also images in a PowerPoint presentation embedded in a Word document attached as a zipped file to an email message. A new “object extraction” API lets developers navigate through the structure of each embedded object as a hierarchy and optionally extract each object.
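To make the nested-container idea concrete, here is a minimal Python sketch of walking such a hierarchy. It is purely illustrative and is not the dtSearch object extraction API: the EmbeddedObject class, the walk and find_images functions, and all file names are hypothetical, chosen only to mirror the email, ZIP, Word, PowerPoint, and image chain described above.

```python
from dataclasses import dataclass, field
from typing import Iterator, List, Tuple


@dataclass
class EmbeddedObject:
    """Hypothetical node: one file or object nested inside a container."""
    name: str
    children: List["EmbeddedObject"] = field(default_factory=list)


def walk(
    node: EmbeddedObject, path: Tuple[str, ...] = ()
) -> Iterator[Tuple[Tuple[str, ...], EmbeddedObject]]:
    """Depth-first traversal yielding (path, node) for every nested object."""
    current = path + (node.name,)
    yield current, node
    for child in node.children:
        yield from walk(child, current)


def find_images(root: EmbeddedObject) -> List[str]:
    """Return the full container path of every image, at any nesting depth."""
    image_exts = (".png", ".jpg", ".jpeg", ".gif")
    return [
        " > ".join(path)
        for path, node in walk(root)
        if node.name.lower().endswith(image_exts)
    ]


# Hypothetical example: an email with an inline image and a zipped Word
# document that embeds a PowerPoint presentation containing an image.
message = EmbeddedObject("message.eml", [
    EmbeddedObject("inline-photo.jpg"),
    EmbeddedObject("attachment.zip", [
        EmbeddedObject("report.docx", [
            EmbeddedObject("slides.pptx", [EmbeddedObject("chart.png")]),
        ]),
    ]),
])

for image_path in find_images(message):
    print(image_path)
```

In this sketch, each container is treated as a tree node, so an image four levels deep is reached by the same traversal that finds an image attached directly to the email.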
The new release also covers dtSearch Web with Spider for quickly publishing instantly searchable data to an internet or intranet site, dtSearch Network with Spider for instantly searching across a network, dtSearch Publish for publishing searchable data to portable media, and dtSearch Desktop with Spider for desktop search.
Source: dtSearch Corp.
Send correspondence concerning the Weekly News Digest to NewsBreaks Editor