Information Today, Inc. Corporate Site KMWorld CRM Media Streaming Media Faulkner Speech Technology Unisphere/DBTA
Other ITI Websites
American Library Directory Boardwalk Empire Database Trends and Applications DestinationCRM Faulkner Information Services Fulltext Sources Online InfoToday Europe KMWorld Literary Market Place Plexus Publishing Smart Customer Service Speech Technology Streaming Media Streaming Media Europe Streaming Media Producer Unisphere Research


News & Events > NewsBreaks
Back Index Forward
Threads bluesky LinkedIn FaceBook Instagram RSS Feed
Weekly News Digest

June 5, 2014 — In addition to this week's NewsBreaks article and the monthly NewsLink Spotlight, Information Today, Inc. (ITI) offers Weekly News Digests that feature recent product news and company announcements. Watch for additional coverage to appear in the next print issue of Information Today.

CLICK HERE to view more Weekly News Digest items.

HathiTrust Dataset Analyzes Page-Level Features

The HathiTrust Research Center (HTRC) released the alpha version of a new dataset of page-level features (notable or informative text characteristics) extracted from HathiTrust’s original, scanned representations of public domain volumes.

Extracted features include occurrences of terms as parts of speech, term-frequency counts, and line and sentence counts on each page of text, with a total of more than 67 million pages. Pages are broken into header, body, and footer sections so they can be analyzed at scale.

The HTRC welcomes feedback on how the dataset can help researchers.

Source: HathiTrust

Send correspondence concerning the Weekly News Digest to NewsBreaks Editor Brandi Scardilli

Related Articles

3/2/2017HathiTrust Collection Reaches 15 Million Volumes
6/30/2016HathiTrust Makes Digitized Books Available for Blind and Print-Disabled Readers
5/12/2015HathiTrust Premieres New Dataset
4/16/2015HathiTrust Adds Duke University Press Backlist Titles
12/3/2013HathiTrust Doesn’t Monkey Around With Metadata Management
9/10/2013HathiTrust Records Go Live on the DPLA
4/25/2013New Data Mining and Analytics Tools for the HathiTrust Digital Library
10/15/2012HathiTrust Lawsuit Decision Reaffirms Libraries in the Digital Age

Comments Add A Comment

              Back to top