HTRC Data API Performance Study

dc.altmetrics.displaytrue
dc.contributor.authorSun, Yiming
dc.contributor.authorPlale, Beth
dc.contributor.authorZeng, Jiaan
dc.date.accessioned2014-09-11T12:45:39Z
dc.date.available2014-09-11T12:45:39Z
dc.description.abstractHathiTrust Research Center (HTRC) allows users to access more than 3 million volumes through a service called Data API. Data API plays an important role in HTRC infrastructure. It hides internal complexity from user, protects against malicious or inadvertent damages to data and separates underlying storage solution with interface so that underlying storage may be replaced with better solutions without affecting client code. We carried out extensive evaluations on the HTRC Data API performance over the Spring 2013. Specifically, we evaluated the rate at which data can be retrieved from the Cassandra cluster under different conditions, impact of different compression levels, and HTTP/HTTPS data transfer. The evaluation presents performance aspects of different software pieces in Data API as well as guides us to have optimal settings for Data API.
dc.identifier.urihttps://hdl.handle.net/2022/18721
dc.language.isoen_US
dc.subjectCassandra
dc.subjectPerformance
dc.subjectHTRC
dc.subjectAPI
dc.subjectperformance evaluation
dc.subjectHathiTrust Research Center
dc.titleHTRC Data API Performance Study
dc.typeTechnical Report

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
dataapi-report-v2.pdf
Size:
1.03 MB
Format:
Adobe Portable Document Format
Description:
Can’t use the file because of accessibility barriers? Contact us