Weekly News Digest

October 25, 2012 — In addition to this week's NewsBreaks article and the monthly NewsLink Spotlight, Information Today, Inc. (ITI) offers Weekly News Digests that feature recent product news and company announcements. Watch for additional coverage to appear in the next print issue of Information Today.


dtSearch Rolls Out More Document Filters

dtSearch Corp., a supplier of enterprise and developer text retrieval software along with document filters, announces version 7.70 of the dtSearch product line. The release improves the document filters embedded across the entire dtSearch product portfolio. For customers who need only data parsing, conversion, and extraction, the dtSearch Engine APIs (native 64-bit/32-bit, Windows/Linux C++, Java, and .NET through 4.x) also offer the document filters for separate OEM licensing.

dtSearch’s proprietary document filters support a broad range of data types:

  • “Office” documents: Microsoft Office, OpenOffice, RTF, PDF, etc.
  • Emails: Microsoft Exchange, Outlook, Thunderbird, etc., all with nested attachments
  • Compression formats: ZIP, RAR, GZIP/TAR, etc.
  • Web-ready data: HTML, XML/XSL, and PDF, all with integrated image and text support
  • Dynamic data (including image and text support): PHP, ASP.NET, SharePoint, etc.
  • Databases: SQL including BLOB data (through the dtSearch Engine APIs), Microsoft Access, XBASE, XML, CSV, etc.

The document filters support parsing of all of these data types as well as text extraction and/or conversion to HTML as required for browser display with highlighted hits.
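To illustrate what "conversion to HTML with highlighted hits" means in practice, here is a minimal Python sketch. It is not dtSearch code and does not use the dtSearch Engine APIs; the function name `highlight_hits` and the choice of the `<mark>` tag are assumptions made purely for illustration. It escapes already-extracted plain text for browser display and wraps each whole-word match of a search term in a highlight tag.

```python
import html
import re


def highlight_hits(text, terms):
    """Return an HTML fragment with each whole-word hit wrapped in <mark>.

    Hypothetical helper for illustration only; not part of any dtSearch API.
    """
    if not terms:
        return html.escape(text)
    # One alternation of all terms, matched case-insensitively on word boundaries.
    pattern = re.compile(
        r"\b(" + "|".join(re.escape(t) for t in terms) + r")\b",
        re.IGNORECASE,
    )
    parts = []
    last = 0
    for match in pattern.finditer(text):
        # Escape the text between hits so markup in the source is shown literally.
        parts.append(html.escape(text[last:match.start()]))
        parts.append("<mark>" + html.escape(match.group(0)) + "</mark>")
        last = match.end()
    parts.append(html.escape(text[last:]))
    return "".join(parts)
```

A production filter would also have to carry over the formatting and images of the original document; this sketch covers only the escaping and highlighting step.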

Embedded image and other enhancements: The new version also extends the document filters to add image support to Word (.doc/.docx), PowerPoint (.ppt/.pptx), Excel (.xls/.xlsx), Access (.mdb/.accdb), RTF, and email files including Thunderbird (mbox/.eml) and Outlook (.pst/.msg) files. The release displays these formats with highlighted hits in context, showing both text and, new in this version, images. The release also adds support for Japanese Ichitaro documents.

Multilevel nested file enhancements: The release also increases support for documents and images in multilevel nested configurations. For example, the new version supports not only viewing images in an email file but also images in a PowerPoint presentation embedded in a Word document attached as a zipped file to an email message. A new “object extraction” API lets developers navigate through the structure of each embedded object as a hierarchy and optionally extract each object.
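The idea of treating nested content as a hierarchy can be sketched with Python's standard library. This is not the dtSearch "object extraction" API, whose actual interface is not described here; the function name `walk_container` is hypothetical, and the sketch descends only through ZIP-based containers rather than the full range of formats listed above. It visits each embedded object depth-first and reports how deeply it is nested.

```python
import io
import zipfile


def walk_container(data, name="root", depth=0):
    """Yield (depth, name, payload) for an object and everything nested in it.

    Hypothetical illustration: only ZIP-based containers are opened.
    """
    yield depth, name, data
    if not zipfile.is_zipfile(io.BytesIO(data)):
        return  # Leaf object: nothing further to open.
    with zipfile.ZipFile(io.BytesIO(data)) as archive:
        for info in archive.infolist():
            if info.is_dir():
                continue
            # Recurse so that archives inside archives are expanded as well.
            yield from walk_container(archive.read(info), info.filename, depth + 1)
```

Because .docx and .pptx files are themselves ZIP packages, this walk would also descend into their internal parts. A caller could print `"  " * depth + name` for each yielded item to display the tree, or write selected payloads to disk to extract them.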

The new release also covers dtSearch Web with Spider for quickly publishing instantly searchable data to an internet or intranet site, dtSearch Network with Spider for instantly searching across a network, dtSearch Publish for publishing searchable data to portable media, and dtSearch Desktop with Spider for desktop search.

Source: dtSearch Corp.

Send correspondence concerning the Weekly News Digest to NewsBreaks Editor Brandi Scardilli

