Information Today, Inc. Corporate Site KMWorld CRM Media Streaming Media Faulkner Speech Technology Unisphere/DBTA
PRIVACY/COOKIES POLICY
Other ITI Websites
American Library Directory Boardwalk Empire Database Trends and Applications DestinationCRM Faulkner Information Services Fulltext Sources Online InfoToday Europe KMWorld Literary Market Place Plexus Publishing Smart Customer Service Speech Technology Streaming Media Streaming Media Europe Streaming Media Producer Unisphere Research



News & Events > NewsBreaks
 



Back Index Forward
Twitter RSS Feed
Weekly News Digest

May 12, 2015 — In addition to this week's NewsBreaks article and the monthly NewsLink Spotlight, Information Today, Inc. (ITI) offers Weekly News Digests that feature recent product news and company announcements. Watch for additional coverage to appear in the next print issue of Information Today. For other up-to-the-minute news, check out ITI’s Twitter account: @ITINewsBreaks.

CLICK HERE to view more Weekly News Digest items.

HathiTrust Premieres New Dataset

The HathiTrust Research Center (HTRC) released the HTRC Extracted Features Dataset, which was sourced from 4.8 million public domain volumes from the HathiTrust Digital Library collection. These volumes contain more than 734 billion words in dozens of languages, as well as works from multiple centuries.

The dataset’s features include volume-level metadata, part-of-speech-tagged token counts, header and footer identification, sentence and line count, and algorithmic language detection. Researchers can use these and other page- and line-level features to analyze large worksets of volumes at previously difficult-to-implement scales.

For more information, read the press release.



Send correspondence concerning the Weekly News Digest to NewsBreaks Editor Brandi Scardilli

Related Articles

4/25/2013New Data Mining and Analytics Tools for the HathiTrust Digital Library
12/3/2013HathiTrust Doesn’t Monkey Around With Metadata Management
6/5/2014HathiTrust Dataset Analyzes Page-Level Features
6/30/2016HathiTrust Makes Digitized Books Available for Blind and Print-Disabled Readers
3/2/2017HathiTrust Collection Reaches 15 Million Volumes


Comments Add A Comment

              Back to top