2024-03-29T01:25:34Z
https://scholarworks.iu.edu/dspace-oai/request
oai:scholarworks.iu.edu:2022/12987
2014-06-27T17:21:33Z
com_2022_357
com_2022_356
com_2022_19673
col_2022_12986
oai:scholarworks.iu.edu:2022/14133
2021-10-19T01:18:11Z
com_2022_357
com_2022_356
com_2022_19673
col_2022_12986
Plale, Beth A. et al.
2012-01-24T15:41:49Z
2012-01-24T15:41:49Z
2012
http://hdl.handle.net/2022/14133
PIs (exec mgt team): Beth A. Plale, Indiana University (IU); Marshall Scott Poole, University of Illinois Urbana-Champaign (UIUC); Robert McDonald (IU); John Unsworth (UIUC). Senior investigators: Loretta Auvil (UIUC); Johan Bollen (IU); Randy Butler (UIUC); Dennis Cromwell (IU); Geoffrey Fox (IU); Eileen Julien (IU); Stacy Kowalczyk (IU); Danny Powell (UIUC); Beth Sandore (UIUC); Craig Stewart (IU); John Towns (UIUC); Carolyn Walters (IU); Michael Welge (UIUC); Eric Wernert (IU)
Submitted by Jennifer Laherty (jlaherty@indiana.edu) on 2012-01-24T15:41:34Z
No. of bitstreams: 1
HTRC-proposal 20100701.pdf: 377315 bytes, checksum: f63de331854d2b0f9207ad80dc116f2f (MD5)
Approved for entry into archive by Jennifer Laherty (jlaherty@indiana.edu) on 2012-01-24T15:41:49Z (GMT) No. of bitstreams: 1
HTRC-proposal 20100701.pdf: 377315 bytes, checksum: f63de331854d2b0f9207ad80dc116f2f (MD5)
Made available in DSpace on 2012-01-24T15:41:49Z (GMT). No. of bitstreams: 1
HTRC-proposal 20100701.pdf: 377315 bytes, checksum: f63de331854d2b0f9207ad80dc116f2f (MD5)
Previous issue date: 2012
en_US
HathiTrust Research Center: Computational Research on the HathiTrust Repository
true
ORIGINAL
HTRC-proposal 20100701.pdf
HTRC-proposal 20100701.pdf
application/pdf
377315
https://scholarworks.iu.edu/dspace/bitstream/2022/14133/1/HTRC-proposal%2020100701.pdf
f63de331854d2b0f9207ad80dc116f2f
MD5
1
LICENSE
license.txt
license.txt
text/plain
2057
https://scholarworks.iu.edu/dspace/bitstream/2022/14133/2/license.txt
4059155151d7b8399aeacceb7905bae5
MD5
2
TEXT
HTRC-proposal 20100701.pdf.txt
HTRC-proposal 20100701.pdf.txt
Extracted text
text/plain
78277
https://scholarworks.iu.edu/dspace/bitstream/2022/14133/3/HTRC-proposal%2020100701.pdf.txt
c49e85aec0d28f6e95146bf52f82c154
MD5
3
THUMBNAIL
HTRC-proposal 20100701.pdf.jpg
HTRC-proposal 20100701.pdf.jpg
IM Thumbnail
image/jpeg
1690
https://scholarworks.iu.edu/dspace/bitstream/2022/14133/4/HTRC-proposal%2020100701.pdf.jpg
0144f770e4fa9ac9c1dc56f6a692dde9
MD5
4
2022/14133
oai:scholarworks.iu.edu:2022/14133
2021-10-18 21:18:11.277
IUScholarWorks
iusw@indiana.edu
TGljZW5zZSBncmFudGVkIGJ5IEplbm5pZmVyIExhaGVydHkgKGpsYWhlcnR5QGluZGlhbmEuZWR1KSBvbiAyMDEyLTAxLTI0VDE1OjQxOjM0WiAoR01UKToKCklVU2Nob2xhcldvcmtzIExpY2Vuc2UKCkJ5IHNpZ25pbmcgYW5kIHN1Ym1pdHRpbmcgdGhpcyBsaWNlbnNlLCB5b3UgKHRoZSBhdXRob3Iocykgb3IgY29weXJpZ2h0IApvd25lcikgZ3JhbnQgdG8gSW5kaWFuYSBVbml2ZXJzaXR5IHRoZSBub24tZXhjbHVzaXZlIHJpZ2h0IHRvIHJlcHJvZHVjZSwgCnRyYW5zbGF0ZSAoYXMgZGVmaW5lZCBiZWxvdyksIGFuZC9vciBkaXN0cmlidXRlIHlvdXIgc3VibWlzc2lvbiAoaW5jbHVkaW5nIAp0aGUgYWJzdHJhY3QpIHdvcmxkd2lkZSBpbiBwcmludCBhbmQgZWxlY3Ryb25pYyBmb3JtYXQgYW5kIGluIGFueSBtZWRpdW0sIAppbmNsdWRpbmcgYnV0IG5vdCBsaW1pdGVkIHRvIGF1ZGlvIG9yIHZpZGVvLgoKWW91IGFncmVlIHRoYXQgSW5kaWFuYSBVbml2ZXJzaXR5IG1heSwgd2l0aG91dCBjaGFuZ2luZyB0aGUgY29udGVudCwgCnRyYW5zbGF0ZSB0aGUgc3VibWlzc2lvbiB0byBhbnkgbWVkaXVtIG9yIGZvcm1hdCBmb3IgdGhlIHB1cnBvc2Ugb2YgCnByZXNlcnZhdGlvbi4KCllvdSBhbHNvIGFncmVlIHRoYXQgSW5kaWFuYSBVbml2ZXJzaXR5IG1heSBrZWVwIG1vcmUgdGhhbiBvbmUgY29weSBvZiB0aGlzIApzdWJtaXNzaW9uIGZvciBwdXJwb3NlcyBvZiBzZWN1cml0eSwgYmFjay11cCBhbmQgcHJlc2VydmF0aW9uLgoKWW91IHJlcHJlc2VudCB0aGF0IHRoZSBzdWJtaXNzaW9uIGlzIHlvdXIgb3JpZ2luYWwgd29yaywgYW5kIHRoYXQgeW91IGhhdmUgCnRoZSByaWdodCB0byBncmFudCB0aGUgcmlnaHRzIGNvbnRhaW5lZCBpbiB0aGlzIGxpY2Vuc2UuIFlvdSBhbHNvIApyZXByZXNlbnQgdGhhdCB5b3VyIHN1Ym1pc3Npb24gZG9lcyBub3QsIHRvIHRoZSBiZXN0IG9mIHlvdXIga25vd2xlZGdlLCAKaW5mcmluZ2UgdXBvbiBhbnlvbmUncyBjb3B5cmlnaHQuCgpJZiB0aGUgc3VibWlzc2lvbiBjb250YWlucyBtYXRlcmlhbCBmb3Igd2hpY2ggeW91IGRvIG5vdCBob2xkIGNvcHlyaWdodCwgCnlvdSByZXByZXNlbnQgdGhhdCB5b3UgaGF2ZSBvYnRhaW5lZCB0aGUgdW5yZXN0cmljdGVkIHBlcm1pc3Npb24gb2YgdGhlIApjb3B5cmlnaHQgb3duZXIgdG8gZ3JhbnQgSW5kaWFuYSBVbml2ZXJzaXR5IHRoZSByaWdodHMgcmVxdWlyZWQgYnkgdGhpcyAKbGljZW5zZSwgYW5kIHRoYXQgc3VjaCB0aGlyZC1wYXJ0eSBvd25lZCBtYXRlcmlhbCBpcyBjbGVhcmx5IGlkZW50aWZpZWQgCmFuZCBhY2tub3dsZWRnZWQgd2l0aGluIHRoZSB0ZXh0IG9yIGNvbnRlbnQgb2YgdGhlIHN1Ym1pc3Npb24uCgpJRiBUSEUgU1VCTUlTU0lPTiBJUyBCQVNFRCBVUE9OIFdPUksgVEhBVCBIQVMgQkVFTiBTUE9OU09SRUQgT1IgU1VQUE9SVEVEIApCWSBBTiBBR0VOQ1kgT1IgT1JHQU5JWkFUSU9OIE9USEVSIFRIQU4gSU5ESUFOQSBVTklWRVJTSVRZLCBZT1Ug
UkVQUkVTRU5UIApUSEFUIFlPVSBIQVZFIEZVTEZJTExFRCBBTlkgUklHSFQgT0YgUkVWSUVXIE9SIE9USEVSIE9CTElHQVRJT05TIFJFUVVJUkVEIApCWSBTVUNIIENPTlRSQUNUIE9SIEFHUkVFTUVOVC4KCkluZGlhbmEgVW5pdmVyc2l0eSB3aWxsIGNsZWFybHkgaWRlbnRpZnkgeW91ciBuYW1lKHMpIGFzIHRoZSBhdXRob3Iocykgb3IgCm93bmVyKHMpIG9mIHRoZSBzdWJtaXNzaW9uLCBhbmQgd2lsbCBub3QgbWFrZSBhbnkgYWx0ZXJhdGlvbiwgb3RoZXIgdGhhbiAKYXMgYWxsb3dlZCBieSB0aGlzIGxpY2Vuc2UsIHRvIHlvdXIgc3VibWlzc2lvbi4KCklGIFlPVSBBUkUgU1VCTUlUVElORyBUSElTIElURU0gT04gQkVIQUxGIE9GIFRIRSBSSUdIVFMtSE9MREVSLCBZT1UgTVVTVCAKSEFWRSBUSEUgUklHSFRTIE9XTkVSJ1MgPGEgaHJlZj0iaHR0cDovL3d3dy5pbmRpYW5hLmVkdS9+aXVzdy9wZXJtaXNzaW9uLyIgdGFyZ2V0PSJyZXNvdXJjZSB3aW5kb3ciPldSSVRURU4gUEVSTUlTU0lPTjwvYT4gVE8gQUNDRVBUIFRISVMgTElDRU5TRSBPTiAKSElTL0hFUiBCRUhBTEYuCgo=
oai:scholarworks.iu.edu:2022/14665
2021-10-19T02:10:18Z
com_2022_357
com_2022_356
com_2022_19673
col_2022_12986
Chen, Peng
Plale, Beth
Aktas, Mehmet S.
2012-09-14T20:11:54Z
2012-09-14T20:11:54Z
2012-09
Chen, Peng, Beth Plale, and Mehmet Aktas. “Temporal Representation for Scientific Data Provenance.” Preprint of paper accepted for the 8th IEEE International Conference on eScience (eScience 2012), submitted September 14, 2012. http://hdl.handle.net/2022/14665.
http://hdl.handle.net/2022/14665
Provenance of digital scientific data is an important piece of the metadata of a data object. It can, however, grow voluminous quickly because the granularity level of capture can be high. It can also be quite feature rich. We propose a representation of the provenance data based on logical time that reduces the feature space. Creating time and frequency domain representations of the provenance, we apply clustering, classification, and association rule mining to the abstract representations to determine the usefulness of the temporal representation. We evaluate the temporal representation using an existing 10 GB database of provenance captured from a range of scientific workflows.
Made available in DSpace on 2012-09-14T20:11:54Z (GMT). No. of bitstreams: 1
eSciencePrePrint.pdf: 616873 bytes, checksum: 38a8ab866b7aaf1b8fc355d9181753f3 (MD5)
Previous issue date: 2012-09
Submitted by Peng Chen (chenpeng@indiana.edu) on 2012-09-14T19:53:27Z
No. of bitstreams: 1
eSciencePrePrint.pdf: 616873 bytes, checksum: 38a8ab866b7aaf1b8fc355d9181753f3 (MD5)
Approved for entry into archive by Stacy Konkiel (skonkiel@indiana.edu) on 2012-09-14T20:11:54Z (GMT) No. of bitstreams: 1
eSciencePrePrint.pdf: 616873 bytes, checksum: 38a8ab866b7aaf1b8fc355d9181753f3 (MD5)
NASA grant NNX10AM03G
en_US
provenance representation
logical clock
temporal data mining
Temporal Representation for Scientific Data Provenance
Preprint
true
ORIGINAL
eSciencePrePrint.pdf
eSciencePrePrint.pdf
application/pdf
616873
https://scholarworks.iu.edu/dspace/bitstream/2022/14665/1/eSciencePrePrint.pdf
38a8ab866b7aaf1b8fc355d9181753f3
MD5
1
LICENSE
license.txt
license.txt
text/plain
2050
https://scholarworks.iu.edu/dspace/bitstream/2022/14665/2/license.txt
4578444c044d7b8f40ff9374a9be0ecf
MD5
2
TEXT
eSciencePrePrint.pdf.txt
eSciencePrePrint.pdf.txt
Extracted text
text/plain
42429
https://scholarworks.iu.edu/dspace/bitstream/2022/14665/3/eSciencePrePrint.pdf.txt
39e62fee167c165b102c877600e08a2e
MD5
3
THUMBNAIL
eSciencePrePrint.pdf.jpg
eSciencePrePrint.pdf.jpg
IM Thumbnail
image/jpeg
4432
https://scholarworks.iu.edu/dspace/bitstream/2022/14665/4/eSciencePrePrint.pdf.jpg
8caa99d01deba112f1bc139fc37c811b
MD5
4
2022/14665
oai:scholarworks.iu.edu:2022/14665
2021-10-18 22:10:18.206
IUScholarWorks
iusw@indiana.edu
TGljZW5zZSBncmFudGVkIGJ5IFBlbmcgQ2hlbiAoY2hlbnBlbmdAaW5kaWFuYS5lZHUpIG9uIDIwMTItMDktMTRUMTk6NTM6MjZaIChHTVQpOgoKSVVTY2hvbGFyV29ya3MgTGljZW5zZQoKQnkgc2lnbmluZyBhbmQgc3VibWl0dGluZyB0aGlzIGxpY2Vuc2UsIHlvdSAodGhlIGF1dGhvcihzKSBvciBjb3B5cmlnaHQgCm93bmVyKSBncmFudCB0byBJbmRpYW5hIFVuaXZlcnNpdHkgdGhlIG5vbi1leGNsdXNpdmUgcmlnaHQgdG8gcmVwcm9kdWNlLCAKdHJhbnNsYXRlIChhcyBkZWZpbmVkIGJlbG93KSwgYW5kL29yIGRpc3RyaWJ1dGUgeW91ciBzdWJtaXNzaW9uIChpbmNsdWRpbmcgCnRoZSBhYnN0cmFjdCkgd29ybGR3aWRlIGluIHByaW50IGFuZCBlbGVjdHJvbmljIGZvcm1hdCBhbmQgaW4gYW55IG1lZGl1bSwgCmluY2x1ZGluZyBidXQgbm90IGxpbWl0ZWQgdG8gYXVkaW8gb3IgdmlkZW8uCgpZb3UgYWdyZWUgdGhhdCBJbmRpYW5hIFVuaXZlcnNpdHkgbWF5LCB3aXRob3V0IGNoYW5naW5nIHRoZSBjb250ZW50LCAKdHJhbnNsYXRlIHRoZSBzdWJtaXNzaW9uIHRvIGFueSBtZWRpdW0gb3IgZm9ybWF0IGZvciB0aGUgcHVycG9zZSBvZiAKcHJlc2VydmF0aW9uLgoKWW91IGFsc28gYWdyZWUgdGhhdCBJbmRpYW5hIFVuaXZlcnNpdHkgbWF5IGtlZXAgbW9yZSB0aGFuIG9uZSBjb3B5IG9mIHRoaXMgCnN1Ym1pc3Npb24gZm9yIHB1cnBvc2VzIG9mIHNlY3VyaXR5LCBiYWNrLXVwIGFuZCBwcmVzZXJ2YXRpb24uCgpZb3UgcmVwcmVzZW50IHRoYXQgdGhlIHN1Ym1pc3Npb24gaXMgeW91ciBvcmlnaW5hbCB3b3JrLCBhbmQgdGhhdCB5b3UgaGF2ZSAKdGhlIHJpZ2h0IHRvIGdyYW50IHRoZSByaWdodHMgY29udGFpbmVkIGluIHRoaXMgbGljZW5zZS4gWW91IGFsc28gCnJlcHJlc2VudCB0aGF0IHlvdXIgc3VibWlzc2lvbiBkb2VzIG5vdCwgdG8gdGhlIGJlc3Qgb2YgeW91ciBrbm93bGVkZ2UsIAppbmZyaW5nZSB1cG9uIGFueW9uZSdzIGNvcHlyaWdodC4KCklmIHRoZSBzdWJtaXNzaW9uIGNvbnRhaW5zIG1hdGVyaWFsIGZvciB3aGljaCB5b3UgZG8gbm90IGhvbGQgY29weXJpZ2h0LCAKeW91IHJlcHJlc2VudCB0aGF0IHlvdSBoYXZlIG9idGFpbmVkIHRoZSB1bnJlc3RyaWN0ZWQgcGVybWlzc2lvbiBvZiB0aGUgCmNvcHlyaWdodCBvd25lciB0byBncmFudCBJbmRpYW5hIFVuaXZlcnNpdHkgdGhlIHJpZ2h0cyByZXF1aXJlZCBieSB0aGlzIApsaWNlbnNlLCBhbmQgdGhhdCBzdWNoIHRoaXJkLXBhcnR5IG93bmVkIG1hdGVyaWFsIGlzIGNsZWFybHkgaWRlbnRpZmllZCAKYW5kIGFja25vd2xlZGdlZCB3aXRoaW4gdGhlIHRleHQgb3IgY29udGVudCBvZiB0aGUgc3VibWlzc2lvbi4KCklGIFRIRSBTVUJNSVNTSU9OIElTIEJBU0VEIFVQT04gV09SSyBUSEFUIEhBUyBCRUVOIFNQT05TT1JFRCBPUiBTVVBQT1JURUQgCkJZIEFOIEFHRU5DWSBPUiBPUkdBTklaQVRJT04gT1RIRVIgVEhBTiBJTkRJQU5BIFVOSVZFUlNJVFksIFlPVSBSRVBSRVNF
TlQgClRIQVQgWU9VIEhBVkUgRlVMRklMTEVEIEFOWSBSSUdIVCBPRiBSRVZJRVcgT1IgT1RIRVIgT0JMSUdBVElPTlMgUkVRVUlSRUQgCkJZIFNVQ0ggQ09OVFJBQ1QgT1IgQUdSRUVNRU5ULgoKSW5kaWFuYSBVbml2ZXJzaXR5IHdpbGwgY2xlYXJseSBpZGVudGlmeSB5b3VyIG5hbWUocykgYXMgdGhlIGF1dGhvcihzKSBvciAKb3duZXIocykgb2YgdGhlIHN1Ym1pc3Npb24sIGFuZCB3aWxsIG5vdCBtYWtlIGFueSBhbHRlcmF0aW9uLCBvdGhlciB0aGFuIAphcyBhbGxvd2VkIGJ5IHRoaXMgbGljZW5zZSwgdG8geW91ciBzdWJtaXNzaW9uLgoKSUYgWU9VIEFSRSBTVUJNSVRUSU5HIFRISVMgSVRFTSBPTiBCRUhBTEYgT0YgVEhFIFJJR0hUUy1IT0xERVIsIFlPVSBNVVNUIApIQVZFIFRIRSBSSUdIVFMgT1dORVInUyA8YSBocmVmPSJodHRwOi8vd3d3LmluZGlhbmEuZWR1L35pdXN3L3Blcm1pc3Npb24vIiB0YXJnZXQ9InJlc291cmNlIHdpbmRvdyI+V1JJVFRFTiBQRVJNSVNTSU9OPC9hPiBUTyBBQ0NFUFQgVEhJUyBMSUNFTlNFIE9OIApISVMvSEVSIEJFSEFMRi4KCg==
oai:scholarworks.iu.edu:2022/14692
2021-10-18T18:38:04Z
com_2022_357
com_2022_356
com_2022_19673
col_2022_12986
Chen, Peng
Plale, Beth
Cheah, You-Wei
Ghoshal, Devarshi
Jensen, Scott
Luo, Yuan
2012-09-28T17:44:00Z
2012-09-28T17:44:00Z
2012-09
Peng Chen, Beth Plale, You-Wei Cheah, Devarshi Ghoshal, Scott Jensen, and Yuan Luo. “Visualization of Network Data Provenance.” Preprint of paper accepted for Workshop on Massive Data Analytics on Scalable Systems (DataMASS 2012), co-located with the IEEE International Conference on High Performance Computing (HiPC), submitted September 25, 2012.
http://hdl.handle.net/2022/14692
Visualization facilitates the understanding of scientific data both through exploration and explanation of the visualized data. Provenance also contributes to the understanding of data by capturing the contributing factors behind a result. The visualization of provenance, although supported in existing workflow management systems, generally focuses on small- to medium-sized provenance data and lacks techniques to deal with big data of high complexity. This paper discusses visualization techniques developed for exploration and explanation of provenance, including a layout algorithm, visual style, graph abstraction techniques, and a graph matching algorithm, to deal with the high complexity. We demonstrate the techniques through application to two extensively analyzed case studies that involved provenance capture and use over three-year projects, the first involving provenance of a satellite imagery ingest processing pipeline and the second involving provenance in a large-scale computer network testbed.
Submitted by Peng Chen (chenpeng@indiana.edu) on 2012-09-25T16:37:42Z
No. of bitstreams: 1
DataMASS_preprint.pdf: 7417068 bytes, checksum: 035a9648beeb28c1e4baf30d0ad033e3 (MD5)
Approved for entry into archive by Malinda Husk (mlingwal@iu.edu) on 2012-09-28T17:44:00Z (GMT) No. of bitstreams: 1
DataMASS_preprint.pdf: 7417068 bytes, checksum: 035a9648beeb28c1e4baf30d0ad033e3 (MD5)
Made available in DSpace on 2012-09-28T17:44:00Z (GMT). No. of bitstreams: 1
DataMASS_preprint.pdf: 7417068 bytes, checksum: 035a9648beeb28c1e4baf30d0ad033e3 (MD5)
Previous issue date: 2012-09
en
provenance visualization
graph matching
Visualization of Network Data Provenance
Preprint
true
ORIGINAL
DataMASS_preprint.pdf
DataMASS_preprint.pdf
application/pdf
7417068
https://scholarworks.iu.edu/dspace/bitstream/2022/14692/1/DataMASS_preprint.pdf
035a9648beeb28c1e4baf30d0ad033e3
MD5
1
LICENSE
license.txt
license.txt
text/plain
1748
https://scholarworks.iu.edu/dspace/bitstream/2022/14692/2/license.txt
8a4605be74aa9ea9d79846c1fba20a33
MD5
2
TEXT
DataMASS_preprint.pdf.txt
DataMASS_preprint.pdf.txt
Extracted text
text/plain
63961
https://scholarworks.iu.edu/dspace/bitstream/2022/14692/3/DataMASS_preprint.pdf.txt
b776495a1d1d1a6ee5dc40c094e48172
MD5
3
THUMBNAIL
DataMASS_preprint.pdf.jpg
DataMASS_preprint.pdf.jpg
IM Thumbnail
image/jpeg
1933
https://scholarworks.iu.edu/dspace/bitstream/2022/14692/4/DataMASS_preprint.pdf.jpg
29f68e3a4bf427f14a5ffce2f8810bcd
MD5
4
2022/14692
oai:scholarworks.iu.edu:2022/14692
2021-10-18 14:38:04.006
IUScholarWorks
iusw@indiana.edu
Tk9URTogUExBQ0UgWU9VUiBPV04gTElDRU5TRSBIRVJFClRoaXMgc2FtcGxlIGxpY2Vuc2UgaXMgcHJvdmlkZWQgZm9yIGluZm9ybWF0aW9uYWwgcHVycG9zZXMgb25seS4KCk5PTi1FWENMVVNJVkUgRElTVFJJQlVUSU9OIExJQ0VOU0UKCkJ5IHNpZ25pbmcgYW5kIHN1Ym1pdHRpbmcgdGhpcyBsaWNlbnNlLCB5b3UgKHRoZSBhdXRob3Iocykgb3IgY29weXJpZ2h0Cm93bmVyKSBncmFudHMgdG8gRFNwYWNlIFVuaXZlcnNpdHkgKERTVSkgdGhlIG5vbi1leGNsdXNpdmUgcmlnaHQgdG8gcmVwcm9kdWNlLAp0cmFuc2xhdGUgKGFzIGRlZmluZWQgYmVsb3cpLCBhbmQvb3IgZGlzdHJpYnV0ZSB5b3VyIHN1Ym1pc3Npb24gKGluY2x1ZGluZwp0aGUgYWJzdHJhY3QpIHdvcmxkd2lkZSBpbiBwcmludCBhbmQgZWxlY3Ryb25pYyBmb3JtYXQgYW5kIGluIGFueSBtZWRpdW0sCmluY2x1ZGluZyBidXQgbm90IGxpbWl0ZWQgdG8gYXVkaW8gb3IgdmlkZW8uCgpZb3UgYWdyZWUgdGhhdCBEU1UgbWF5LCB3aXRob3V0IGNoYW5naW5nIHRoZSBjb250ZW50LCB0cmFuc2xhdGUgdGhlCnN1Ym1pc3Npb24gdG8gYW55IG1lZGl1bSBvciBmb3JtYXQgZm9yIHRoZSBwdXJwb3NlIG9mIHByZXNlcnZhdGlvbi4KCllvdSBhbHNvIGFncmVlIHRoYXQgRFNVIG1heSBrZWVwIG1vcmUgdGhhbiBvbmUgY29weSBvZiB0aGlzIHN1Ym1pc3Npb24gZm9yCnB1cnBvc2VzIG9mIHNlY3VyaXR5LCBiYWNrLXVwIGFuZCBwcmVzZXJ2YXRpb24uCgpZb3UgcmVwcmVzZW50IHRoYXQgdGhlIHN1Ym1pc3Npb24gaXMgeW91ciBvcmlnaW5hbCB3b3JrLCBhbmQgdGhhdCB5b3UgaGF2ZQp0aGUgcmlnaHQgdG8gZ3JhbnQgdGhlIHJpZ2h0cyBjb250YWluZWQgaW4gdGhpcyBsaWNlbnNlLiBZb3UgYWxzbyByZXByZXNlbnQKdGhhdCB5b3VyIHN1Ym1pc3Npb24gZG9lcyBub3QsIHRvIHRoZSBiZXN0IG9mIHlvdXIga25vd2xlZGdlLCBpbmZyaW5nZSB1cG9uCmFueW9uZSdzIGNvcHlyaWdodC4KCklmIHRoZSBzdWJtaXNzaW9uIGNvbnRhaW5zIG1hdGVyaWFsIGZvciB3aGljaCB5b3UgZG8gbm90IGhvbGQgY29weXJpZ2h0LAp5b3UgcmVwcmVzZW50IHRoYXQgeW91IGhhdmUgb2J0YWluZWQgdGhlIHVucmVzdHJpY3RlZCBwZXJtaXNzaW9uIG9mIHRoZQpjb3B5cmlnaHQgb3duZXIgdG8gZ3JhbnQgRFNVIHRoZSByaWdodHMgcmVxdWlyZWQgYnkgdGhpcyBsaWNlbnNlLCBhbmQgdGhhdApzdWNoIHRoaXJkLXBhcnR5IG93bmVkIG1hdGVyaWFsIGlzIGNsZWFybHkgaWRlbnRpZmllZCBhbmQgYWNrbm93bGVkZ2VkCndpdGhpbiB0aGUgdGV4dCBvciBjb250ZW50IG9mIHRoZSBzdWJtaXNzaW9uLgoKSUYgVEhFIFNVQk1JU1NJT04gSVMgQkFTRUQgVVBPTiBXT1JLIFRIQVQgSEFTIEJFRU4gU1BPTlNPUkVEIE9SIFNVUFBPUlRFRApCWSBBTiBBR0VOQ1kgT1IgT1JHQU5JWkFUSU9OIE9USEVSIFRIQU4gRFNVLCBZT1UgUkVQUkVTRU5UIFRIQVQgWU9VIEhBVkUKRlVMRklMTEVEIEFOWSBSSUdIVCBPRiBS
RVZJRVcgT1IgT1RIRVIgT0JMSUdBVElPTlMgUkVRVUlSRUQgQlkgU1VDSApDT05UUkFDVCBPUiBBR1JFRU1FTlQuCgpEU1Ugd2lsbCBjbGVhcmx5IGlkZW50aWZ5IHlvdXIgbmFtZShzKSBhcyB0aGUgYXV0aG9yKHMpIG9yIG93bmVyKHMpIG9mIHRoZQpzdWJtaXNzaW9uLCBhbmQgd2lsbCBub3QgbWFrZSBhbnkgYWx0ZXJhdGlvbiwgb3RoZXIgdGhhbiBhcyBhbGxvd2VkIGJ5IHRoaXMKbGljZW5zZSwgdG8geW91ciBzdWJtaXNzaW9uLgo=
oai:scholarworks.iu.edu:2022/14744
2021-10-18T07:06:09Z
com_2022_357
com_2022_356
com_2022_19673
col_2022_12986
Cheah, You-Wei
Plale, Beth
2012-10-22T13:24:45Z
2012-10-22T13:24:45Z
2012
http://hdl.handle.net/2022/14744
Data provenance, a key piece of metadata that describes the lifecycle of a data product, is crucial in helping scientists better understand scientific results and in facilitating their reproducibility and reuse. Provenance collection systems often capture provenance on the fly, and the protocol between application and provenance tool may not be reliable. As a result, data provenance can become ambiguous or simply inaccurate. In this paper, we identify likely quality issues in data provenance. We also establish quality dimensions that are especially critical for the evaluation of provenance quality. We analyze synthetic and real-world provenance based on these quality dimensions and summarize our contributions to provenance quality.
Submitted by You-Wei Cheah (yocheah@umail.iu.edu) on 2012-10-18T19:49:47Z
No. of bitstreams: 1
Escience-Preprint.pdf: 1026596 bytes, checksum: 62b86270ce1a31b9e136934babf711a2 (MD5)
Approved for entry into archive by Stacy Konkiel (skonkiel@indiana.edu) on 2012-10-22T13:24:45Z (GMT) No. of bitstreams: 1
Escience-Preprint.pdf: 1026596 bytes, checksum: 62b86270ce1a31b9e136934babf711a2 (MD5)
Made available in DSpace on 2012-10-22T13:24:45Z (GMT). No. of bitstreams: 1
Escience-Preprint.pdf: 1026596 bytes, checksum: 62b86270ce1a31b9e136934babf711a2 (MD5)
Previous issue date: 2012
en_US
Data Provenance
Provenance Quality
Scientific Workflows
Provenance Analysis
Provenance Analysis: Towards Quality Provenance
true
ORIGINAL
Escience-Preprint.pdf
Escience-Preprint.pdf
application/pdf
1026596
https://scholarworks.iu.edu/dspace/bitstream/2022/14744/1/Escience-Preprint.pdf
62b86270ce1a31b9e136934babf711a2
MD5
1
LICENSE
license.txt
license.txt
text/plain
1966
https://scholarworks.iu.edu/dspace/bitstream/2022/14744/2/license.txt
da47e32a232df8b677e5e865df861531
MD5
2
TEXT
Escience-Preprint.pdf.txt
Escience-Preprint.pdf.txt
Extracted text
text/plain
44817
https://scholarworks.iu.edu/dspace/bitstream/2022/14744/3/Escience-Preprint.pdf.txt
fa7d1797c18f67de055814b715ccaec2
MD5
3
THUMBNAIL
Escience-Preprint.pdf.jpg
Escience-Preprint.pdf.jpg
IM Thumbnail
image/jpeg
2036
https://scholarworks.iu.edu/dspace/bitstream/2022/14744/4/Escience-Preprint.pdf.jpg
7c373d1137ad0758360f983fbbd0903c
MD5
4
2022/14744
oai:scholarworks.iu.edu:2022/14744
2021-10-18 03:06:09.713
IUScholarWorks
iusw@indiana.edu
SVVTY2hvbGFyV29ya3MgTGljZW5zZQoKQnkgc2lnbmluZyBhbmQgc3VibWl0dGluZyB0aGlzIGxpY2Vuc2UsIHlvdSAodGhlIGF1dGhvcihzKSBvciBjb3B5cmlnaHQgCm93bmVyKSBncmFudCB0byBJbmRpYW5hIFVuaXZlcnNpdHkgdGhlIG5vbi1leGNsdXNpdmUgcmlnaHQgdG8gcmVwcm9kdWNlLCAKdHJhbnNsYXRlIChhcyBkZWZpbmVkIGJlbG93KSwgYW5kL29yIGRpc3RyaWJ1dGUgeW91ciBzdWJtaXNzaW9uIChpbmNsdWRpbmcgCnRoZSBhYnN0cmFjdCkgd29ybGR3aWRlIGluIHByaW50IGFuZCBlbGVjdHJvbmljIGZvcm1hdCBhbmQgaW4gYW55IG1lZGl1bSwgCmluY2x1ZGluZyBidXQgbm90IGxpbWl0ZWQgdG8gYXVkaW8gb3IgdmlkZW8uCgpZb3UgYWdyZWUgdGhhdCBJbmRpYW5hIFVuaXZlcnNpdHkgbWF5LCB3aXRob3V0IGNoYW5naW5nIHRoZSBjb250ZW50LCAKdHJhbnNsYXRlIHRoZSBzdWJtaXNzaW9uIHRvIGFueSBtZWRpdW0gb3IgZm9ybWF0IGZvciB0aGUgcHVycG9zZSBvZiAKcHJlc2VydmF0aW9uLgoKWW91IGFsc28gYWdyZWUgdGhhdCBJbmRpYW5hIFVuaXZlcnNpdHkgbWF5IGtlZXAgbW9yZSB0aGFuIG9uZSBjb3B5IG9mIHRoaXMgCnN1Ym1pc3Npb24gZm9yIHB1cnBvc2VzIG9mIHNlY3VyaXR5LCBiYWNrLXVwIGFuZCBwcmVzZXJ2YXRpb24uCgpZb3UgcmVwcmVzZW50IHRoYXQgdGhlIHN1Ym1pc3Npb24gaXMgeW91ciBvcmlnaW5hbCB3b3JrLCBhbmQgdGhhdCB5b3UgaGF2ZSAKdGhlIHJpZ2h0IHRvIGdyYW50IHRoZSByaWdodHMgY29udGFpbmVkIGluIHRoaXMgbGljZW5zZS4gWW91IGFsc28gCnJlcHJlc2VudCB0aGF0IHlvdXIgc3VibWlzc2lvbiBkb2VzIG5vdCwgdG8gdGhlIGJlc3Qgb2YgeW91ciBrbm93bGVkZ2UsIAppbmZyaW5nZSB1cG9uIGFueW9uZSdzIGNvcHlyaWdodC4KCklmIHRoZSBzdWJtaXNzaW9uIGNvbnRhaW5zIG1hdGVyaWFsIGZvciB3aGljaCB5b3UgZG8gbm90IGhvbGQgY29weXJpZ2h0LCAKeW91IHJlcHJlc2VudCB0aGF0IHlvdSBoYXZlIG9idGFpbmVkIHRoZSB1bnJlc3RyaWN0ZWQgcGVybWlzc2lvbiBvZiB0aGUgCmNvcHlyaWdodCBvd25lciB0byBncmFudCBJbmRpYW5hIFVuaXZlcnNpdHkgdGhlIHJpZ2h0cyByZXF1aXJlZCBieSB0aGlzIApsaWNlbnNlLCBhbmQgdGhhdCBzdWNoIHRoaXJkLXBhcnR5IG93bmVkIG1hdGVyaWFsIGlzIGNsZWFybHkgaWRlbnRpZmllZCAKYW5kIGFja25vd2xlZGdlZCB3aXRoaW4gdGhlIHRleHQgb3IgY29udGVudCBvZiB0aGUgc3VibWlzc2lvbi4KCklGIFRIRSBTVUJNSVNTSU9OIElTIEJBU0VEIFVQT04gV09SSyBUSEFUIEhBUyBCRUVOIFNQT05TT1JFRCBPUiBTVVBQT1JURUQgCkJZIEFOIEFHRU5DWSBPUiBPUkdBTklaQVRJT04gT1RIRVIgVEhBTiBJTkRJQU5BIFVOSVZFUlNJVFksIFlPVSBSRVBSRVNFTlQgClRIQVQgWU9VIEhBVkUgRlVMRklMTEVEIEFOWSBSSUdIVCBPRiBSRVZJRVcgT1IgT1RIRVIgT0JMSUdBVElPTlMgUkVRVUlSRUQgCkJZIFNV
Q0ggQ09OVFJBQ1QgT1IgQUdSRUVNRU5ULgoKSW5kaWFuYSBVbml2ZXJzaXR5IHdpbGwgY2xlYXJseSBpZGVudGlmeSB5b3VyIG5hbWUocykgYXMgdGhlIGF1dGhvcihzKSBvciAKb3duZXIocykgb2YgdGhlIHN1Ym1pc3Npb24sIGFuZCB3aWxsIG5vdCBtYWtlIGFueSBhbHRlcmF0aW9uLCBvdGhlciB0aGFuIAphcyBhbGxvd2VkIGJ5IHRoaXMgbGljZW5zZSwgdG8geW91ciBzdWJtaXNzaW9uLgoKSUYgWU9VIEFSRSBTVUJNSVRUSU5HIFRISVMgSVRFTSBPTiBCRUhBTEYgT0YgVEhFIFJJR0hUUy1IT0xERVIsIFlPVSBNVVNUIApIQVZFIFRIRSBSSUdIVFMgT1dORVInUyA8YSBocmVmPSJodHRwOi8vd3d3LmluZGlhbmEuZWR1L35pdXN3L3Blcm1pc3Npb24vIiB0YXJnZXQ9InJlc291cmNlIHdpbmRvdyI+V1JJVFRFTiBQRVJNSVNTSU9OPC9hPiBUTyBBQ0NFUFQgVEhJUyBMSUNFTlNFIE9OIApISVMvSEVSIEJFSEFMRi4KCg==
oai:scholarworks.iu.edu:2022/15247
2021-10-18T11:16:27Z
com_2022_357
com_2022_356
com_2022_19673
col_2022_12986
Plale, Beth
McDonald, Robert H.
Chandrasekar, Kavitha
Kouper, Inna
Konkiel, Stacy
Hedstrom, Margaret L.
Myers, Jim
Kumar, Praveen
2013-01-22T18:25:07Z
2013-01-22T18:25:07Z
2013-01-16
Plale B, McDonald R, Chandrasekar K, Kouper I, Konkiel S, Hedstrom M, Myers J & Kumar P. (2013). "SEAD Virtual Archive: Building a Federation of Institutional Repositories for Long Term Data Preservation." Practice Paper presented at the 8th International Digital Curation Conference. Amsterdam, Netherlands. 14-16 January 2013.
http://hdl.handle.net/2022/15247
Major research universities are grappling with their response to the deluge of scientific data emerging through research by their faculty. Many are looking to their libraries and the institutional repository as a solution. Scientific data introduces substantial challenges that the document-based institutional repository may not be suited to deal with. The Sustainable Environment - Actionable Data (SEAD) Virtual Archive specifically addresses the challenges of “long tail” scientific data. In this paper, we propose requirements, policy and architecture to support not only the preservation of scientific data today using institutional repositories, but also its rich access and use into the future.
Submitted by Stacy Konkiel (skonkiel@indiana.edu) on 2013-01-22T18:13:56Z
No. of bitstreams: 2
IDCC_SVA_FINAL_skonkiel.pdf: 134707 bytes, checksum: 0158d122c86b48c4ab1b7de092955626 (MD5)
SEADVA_presentation IDCC final.pdf: 988510 bytes, checksum: 060a2837cbc7040a4764aed0e334eff8 (MD5)
Approved for entry into archive by Stacy Konkiel (skonkiel@indiana.edu) on 2013-01-22T18:25:07Z (GMT) No. of bitstreams: 2
IDCC_SVA_FINAL_skonkiel.pdf: 134707 bytes, checksum: 0158d122c86b48c4ab1b7de092955626 (MD5)
SEADVA_presentation IDCC final.pdf: 988510 bytes, checksum: 060a2837cbc7040a4764aed0e334eff8 (MD5)
Made available in DSpace on 2013-01-22T18:25:07Z (GMT). No. of bitstreams: 2
IDCC_SVA_FINAL_skonkiel.pdf: 134707 bytes, checksum: 0158d122c86b48c4ab1b7de092955626 (MD5)
SEADVA_presentation IDCC final.pdf: 988510 bytes, checksum: 060a2837cbc7040a4764aed0e334eff8 (MD5)
Previous issue date: 2013-01-16
This work was funded by the National Science Foundation under cooperative agreement #OCI0940824 and by a grant from the Council on Library and Information Resources and the Alfred P. Sloan Foundation, award #4112440.
en_US
digital curation
institutional repositories
sustainability science
data management
SEAD Virtual Archive: Building a Federation of Institutional Repositories for Long-Term Data Preservation in Sustainability Science
Article
TRUE
true
ORIGINAL
IDCC_SVA_FINAL_skonkiel.pdf
IDCC_SVA_FINAL_skonkiel.pdf
Paper - Full Text
application/pdf
134707
https://scholarworks.iu.edu/dspace/bitstream/2022/15247/1/IDCC_SVA_FINAL_skonkiel.pdf
0158d122c86b48c4ab1b7de092955626
MD5
1
SEADVA_presentation IDCC final.pdf
SEADVA_presentation IDCC final.pdf
Presentation Slides
application/pdf
988510
https://scholarworks.iu.edu/dspace/bitstream/2022/15247/2/SEADVA_presentation%20IDCC%20final.pdf
060a2837cbc7040a4764aed0e334eff8
MD5
2
LICENSE
license.txt
license.txt
text/plain
1966
https://scholarworks.iu.edu/dspace/bitstream/2022/15247/3/license.txt
da47e32a232df8b677e5e865df861531
MD5
3
TEXT
IDCC_SVA_FINAL_skonkiel.pdf.txt
IDCC_SVA_FINAL_skonkiel.pdf.txt
Extracted text
text/plain
24840
https://scholarworks.iu.edu/dspace/bitstream/2022/15247/4/IDCC_SVA_FINAL_skonkiel.pdf.txt
c80eeb1d1fc9dc1734bf17df77e9243b
MD5
4
SEADVA_presentation IDCC final.pdf.txt
SEADVA_presentation IDCC final.pdf.txt
Extracted text
text/plain
6203
https://scholarworks.iu.edu/dspace/bitstream/2022/15247/5/SEADVA_presentation%20IDCC%20final.pdf.txt
603e56fbb782411528ca1b3280e91e09
MD5
5
THUMBNAIL
IDCC_SVA_FINAL_skonkiel.pdf.jpg
IDCC_SVA_FINAL_skonkiel.pdf.jpg
IM Thumbnail
image/jpeg
1234
https://scholarworks.iu.edu/dspace/bitstream/2022/15247/6/IDCC_SVA_FINAL_skonkiel.pdf.jpg
466e8abbb4960c28dd3ca9b2686ca944
MD5
6
SEADVA_presentation IDCC final.pdf.jpg
SEADVA_presentation IDCC final.pdf.jpg
IM Thumbnail
image/jpeg
2374
https://scholarworks.iu.edu/dspace/bitstream/2022/15247/7/SEADVA_presentation%20IDCC%20final.pdf.jpg
107de1b8751d6052d7a8b9048ae76dba
MD5
7
2022/15247
oai:scholarworks.iu.edu:2022/15247
2021-10-18 07:16:27.075
IUScholarWorks
iusw@indiana.edu
SVVTY2hvbGFyV29ya3MgTGljZW5zZQoKQnkgc2lnbmluZyBhbmQgc3VibWl0dGluZyB0aGlzIGxpY2Vuc2UsIHlvdSAodGhlIGF1dGhvcihzKSBvciBjb3B5cmlnaHQgCm93bmVyKSBncmFudCB0byBJbmRpYW5hIFVuaXZlcnNpdHkgdGhlIG5vbi1leGNsdXNpdmUgcmlnaHQgdG8gcmVwcm9kdWNlLCAKdHJhbnNsYXRlIChhcyBkZWZpbmVkIGJlbG93KSwgYW5kL29yIGRpc3RyaWJ1dGUgeW91ciBzdWJtaXNzaW9uIChpbmNsdWRpbmcgCnRoZSBhYnN0cmFjdCkgd29ybGR3aWRlIGluIHByaW50IGFuZCBlbGVjdHJvbmljIGZvcm1hdCBhbmQgaW4gYW55IG1lZGl1bSwgCmluY2x1ZGluZyBidXQgbm90IGxpbWl0ZWQgdG8gYXVkaW8gb3IgdmlkZW8uCgpZb3UgYWdyZWUgdGhhdCBJbmRpYW5hIFVuaXZlcnNpdHkgbWF5LCB3aXRob3V0IGNoYW5naW5nIHRoZSBjb250ZW50LCAKdHJhbnNsYXRlIHRoZSBzdWJtaXNzaW9uIHRvIGFueSBtZWRpdW0gb3IgZm9ybWF0IGZvciB0aGUgcHVycG9zZSBvZiAKcHJlc2VydmF0aW9uLgoKWW91IGFsc28gYWdyZWUgdGhhdCBJbmRpYW5hIFVuaXZlcnNpdHkgbWF5IGtlZXAgbW9yZSB0aGFuIG9uZSBjb3B5IG9mIHRoaXMgCnN1Ym1pc3Npb24gZm9yIHB1cnBvc2VzIG9mIHNlY3VyaXR5LCBiYWNrLXVwIGFuZCBwcmVzZXJ2YXRpb24uCgpZb3UgcmVwcmVzZW50IHRoYXQgdGhlIHN1Ym1pc3Npb24gaXMgeW91ciBvcmlnaW5hbCB3b3JrLCBhbmQgdGhhdCB5b3UgaGF2ZSAKdGhlIHJpZ2h0IHRvIGdyYW50IHRoZSByaWdodHMgY29udGFpbmVkIGluIHRoaXMgbGljZW5zZS4gWW91IGFsc28gCnJlcHJlc2VudCB0aGF0IHlvdXIgc3VibWlzc2lvbiBkb2VzIG5vdCwgdG8gdGhlIGJlc3Qgb2YgeW91ciBrbm93bGVkZ2UsIAppbmZyaW5nZSB1cG9uIGFueW9uZSdzIGNvcHlyaWdodC4KCklmIHRoZSBzdWJtaXNzaW9uIGNvbnRhaW5zIG1hdGVyaWFsIGZvciB3aGljaCB5b3UgZG8gbm90IGhvbGQgY29weXJpZ2h0LCAKeW91IHJlcHJlc2VudCB0aGF0IHlvdSBoYXZlIG9idGFpbmVkIHRoZSB1bnJlc3RyaWN0ZWQgcGVybWlzc2lvbiBvZiB0aGUgCmNvcHlyaWdodCBvd25lciB0byBncmFudCBJbmRpYW5hIFVuaXZlcnNpdHkgdGhlIHJpZ2h0cyByZXF1aXJlZCBieSB0aGlzIApsaWNlbnNlLCBhbmQgdGhhdCBzdWNoIHRoaXJkLXBhcnR5IG93bmVkIG1hdGVyaWFsIGlzIGNsZWFybHkgaWRlbnRpZmllZCAKYW5kIGFja25vd2xlZGdlZCB3aXRoaW4gdGhlIHRleHQgb3IgY29udGVudCBvZiB0aGUgc3VibWlzc2lvbi4KCklGIFRIRSBTVUJNSVNTSU9OIElTIEJBU0VEIFVQT04gV09SSyBUSEFUIEhBUyBCRUVOIFNQT05TT1JFRCBPUiBTVVBQT1JURUQgCkJZIEFOIEFHRU5DWSBPUiBPUkdBTklaQVRJT04gT1RIRVIgVEhBTiBJTkRJQU5BIFVOSVZFUlNJVFksIFlPVSBSRVBSRVNFTlQgClRIQVQgWU9VIEhBVkUgRlVMRklMTEVEIEFOWSBSSUdIVCBPRiBSRVZJRVcgT1IgT1RIRVIgT0JMSUdBVElPTlMgUkVRVUlSRUQgCkJZIFNV
Q0ggQ09OVFJBQ1QgT1IgQUdSRUVNRU5ULgoKSW5kaWFuYSBVbml2ZXJzaXR5IHdpbGwgY2xlYXJseSBpZGVudGlmeSB5b3VyIG5hbWUocykgYXMgdGhlIGF1dGhvcihzKSBvciAKb3duZXIocykgb2YgdGhlIHN1Ym1pc3Npb24sIGFuZCB3aWxsIG5vdCBtYWtlIGFueSBhbHRlcmF0aW9uLCBvdGhlciB0aGFuIAphcyBhbGxvd2VkIGJ5IHRoaXMgbGljZW5zZSwgdG8geW91ciBzdWJtaXNzaW9uLgoKSUYgWU9VIEFSRSBTVUJNSVRUSU5HIFRISVMgSVRFTSBPTiBCRUhBTEYgT0YgVEhFIFJJR0hUUy1IT0xERVIsIFlPVSBNVVNUIApIQVZFIFRIRSBSSUdIVFMgT1dORVInUyA8YSBocmVmPSJodHRwOi8vd3d3LmluZGlhbmEuZWR1L35pdXN3L3Blcm1pc3Npb24vIiB0YXJnZXQ9InJlc291cmNlIHdpbmRvdyI+V1JJVFRFTiBQRVJNSVNTSU9OPC9hPiBUTyBBQ0NFUFQgVEhJUyBMSUNFTlNFIE9OIApISVMvSEVSIEJFSEFMRi4KCg==
oai:scholarworks.iu.edu:2022/15248
2021-10-18T11:20:46Z
com_2022_357
com_2022_356
com_2022_19673
col_2022_12986
Hedstrom, Margaret
Plale, Beth
McDonald, Robert H.
Chandrasekar, Kavitha
Kouper, Inna
Konkiel, Stacy
Kumar, Praveen
Myers, James
2013-01-22T18:25:17Z
2013-01-22T18:25:17Z
2013-01-14
McDonald R, Plale B, Myers J, Hedstrom M, Kumar P, Chandrasekar K, Kouper I, & Konkiel S. (2013). "The SEAD (Sustainable Environment-Actionable Data) DataNet Prototype: Preserving data for environmental sciences in areas of climate, land use and environmental management." Poster presentation at the 8th International Digital Curation Conference. Amsterdam, Netherlands. 14-16 January 2013.
http://hdl.handle.net/2022/15248
Submitted by Stacy Konkiel (skonkiel@indiana.edu) on 2013-01-22T18:23:41Z
No. of bitstreams: 2
idcc13poster2.pdf: 4941537 bytes, checksum: 8bda8cc248a4ca2d8e74f87562d5a30f (MD5)
IDCC Minute Madness-SEAD-McDonald.pptx: 186274 bytes, checksum: cbbacf9ba354347f85e4103336e47a80 (MD5)
Approved for entry into archive by Stacy Konkiel (skonkiel@indiana.edu) on 2013-01-22T18:25:17Z (GMT) No. of bitstreams: 2
idcc13poster2.pdf: 4941537 bytes, checksum: 8bda8cc248a4ca2d8e74f87562d5a30f (MD5)
IDCC Minute Madness-SEAD-McDonald.pptx: 186274 bytes, checksum: cbbacf9ba354347f85e4103336e47a80 (MD5)
Made available in DSpace on 2013-01-22T18:25:17Z (GMT). No. of bitstreams: 2
idcc13poster2.pdf: 4941537 bytes, checksum: 8bda8cc248a4ca2d8e74f87562d5a30f (MD5)
IDCC Minute Madness-SEAD-McDonald.pptx: 186274 bytes, checksum: cbbacf9ba354347f85e4103336e47a80 (MD5)
Previous issue date: 2013-01-14
SEAD is funded by the National Science Foundation under cooperative agreement #OCI0940824
sustainability science
data management
institutional repositories
SEAD: Preserving Data for Environmental Sciences in Areas of Climate, Land-Use, and Environmental Management
Other
TRUE
true
ORIGINAL
idcc13poster2.pdf
idcc13poster2.pdf
Poster
application/pdf
4941537
https://scholarworks.iu.edu/dspace/bitstream/2022/15248/1/idcc13poster2.pdf
8bda8cc248a4ca2d8e74f87562d5a30f
MD5
1
IDCC Minute Madness-SEAD-McDonald.pptx
IDCC Minute Madness-SEAD-McDonald.pptx
application/octet-stream
186274
https://scholarworks.iu.edu/dspace/bitstream/2022/15248/2/IDCC%20Minute%20Madness-SEAD-McDonald.pptx
cbbacf9ba354347f85e4103336e47a80
MD5
2
LICENSE
license.txt
license.txt
text/plain
1966
https://scholarworks.iu.edu/dspace/bitstream/2022/15248/3/license.txt
da47e32a232df8b677e5e865df861531
MD5
3
TEXT
idcc13poster2.pdf.txt
idcc13poster2.pdf.txt
Extracted text
text/plain
3826
https://scholarworks.iu.edu/dspace/bitstream/2022/15248/4/idcc13poster2.pdf.txt
8d0ae4486e49472355c579bc7571c3ec
MD5
4
THUMBNAIL
idcc13poster2.pdf.jpg
idcc13poster2.pdf.jpg
IM Thumbnail
image/jpeg
7080
https://scholarworks.iu.edu/dspace/bitstream/2022/15248/5/idcc13poster2.pdf.jpg
0bcf53927f357486662580d36eeb355a
MD5
5
2022/15248
oai:scholarworks.iu.edu:2022/15248
2021-10-18 07:20:46.111
IUScholarWorks
iusw@indiana.edu
SVVTY2hvbGFyV29ya3MgTGljZW5zZQoKQnkgc2lnbmluZyBhbmQgc3VibWl0dGluZyB0aGlzIGxpY2Vuc2UsIHlvdSAodGhlIGF1dGhvcihzKSBvciBjb3B5cmlnaHQgCm93bmVyKSBncmFudCB0byBJbmRpYW5hIFVuaXZlcnNpdHkgdGhlIG5vbi1leGNsdXNpdmUgcmlnaHQgdG8gcmVwcm9kdWNlLCAKdHJhbnNsYXRlIChhcyBkZWZpbmVkIGJlbG93KSwgYW5kL29yIGRpc3RyaWJ1dGUgeW91ciBzdWJtaXNzaW9uIChpbmNsdWRpbmcgCnRoZSBhYnN0cmFjdCkgd29ybGR3aWRlIGluIHByaW50IGFuZCBlbGVjdHJvbmljIGZvcm1hdCBhbmQgaW4gYW55IG1lZGl1bSwgCmluY2x1ZGluZyBidXQgbm90IGxpbWl0ZWQgdG8gYXVkaW8gb3IgdmlkZW8uCgpZb3UgYWdyZWUgdGhhdCBJbmRpYW5hIFVuaXZlcnNpdHkgbWF5LCB3aXRob3V0IGNoYW5naW5nIHRoZSBjb250ZW50LCAKdHJhbnNsYXRlIHRoZSBzdWJtaXNzaW9uIHRvIGFueSBtZWRpdW0gb3IgZm9ybWF0IGZvciB0aGUgcHVycG9zZSBvZiAKcHJlc2VydmF0aW9uLgoKWW91IGFsc28gYWdyZWUgdGhhdCBJbmRpYW5hIFVuaXZlcnNpdHkgbWF5IGtlZXAgbW9yZSB0aGFuIG9uZSBjb3B5IG9mIHRoaXMgCnN1Ym1pc3Npb24gZm9yIHB1cnBvc2VzIG9mIHNlY3VyaXR5LCBiYWNrLXVwIGFuZCBwcmVzZXJ2YXRpb24uCgpZb3UgcmVwcmVzZW50IHRoYXQgdGhlIHN1Ym1pc3Npb24gaXMgeW91ciBvcmlnaW5hbCB3b3JrLCBhbmQgdGhhdCB5b3UgaGF2ZSAKdGhlIHJpZ2h0IHRvIGdyYW50IHRoZSByaWdodHMgY29udGFpbmVkIGluIHRoaXMgbGljZW5zZS4gWW91IGFsc28gCnJlcHJlc2VudCB0aGF0IHlvdXIgc3VibWlzc2lvbiBkb2VzIG5vdCwgdG8gdGhlIGJlc3Qgb2YgeW91ciBrbm93bGVkZ2UsIAppbmZyaW5nZSB1cG9uIGFueW9uZSdzIGNvcHlyaWdodC4KCklmIHRoZSBzdWJtaXNzaW9uIGNvbnRhaW5zIG1hdGVyaWFsIGZvciB3aGljaCB5b3UgZG8gbm90IGhvbGQgY29weXJpZ2h0LCAKeW91IHJlcHJlc2VudCB0aGF0IHlvdSBoYXZlIG9idGFpbmVkIHRoZSB1bnJlc3RyaWN0ZWQgcGVybWlzc2lvbiBvZiB0aGUgCmNvcHlyaWdodCBvd25lciB0byBncmFudCBJbmRpYW5hIFVuaXZlcnNpdHkgdGhlIHJpZ2h0cyByZXF1aXJlZCBieSB0aGlzIApsaWNlbnNlLCBhbmQgdGhhdCBzdWNoIHRoaXJkLXBhcnR5IG93bmVkIG1hdGVyaWFsIGlzIGNsZWFybHkgaWRlbnRpZmllZCAKYW5kIGFja25vd2xlZGdlZCB3aXRoaW4gdGhlIHRleHQgb3IgY29udGVudCBvZiB0aGUgc3VibWlzc2lvbi4KCklGIFRIRSBTVUJNSVNTSU9OIElTIEJBU0VEIFVQT04gV09SSyBUSEFUIEhBUyBCRUVOIFNQT05TT1JFRCBPUiBTVVBQT1JURUQgCkJZIEFOIEFHRU5DWSBPUiBPUkdBTklaQVRJT04gT1RIRVIgVEhBTiBJTkRJQU5BIFVOSVZFUlNJVFksIFlPVSBSRVBSRVNFTlQgClRIQVQgWU9VIEhBVkUgRlVMRklMTEVEIEFOWSBSSUdIVCBPRiBSRVZJRVcgT1IgT1RIRVIgT0JMSUdBVElPTlMgUkVRVUlSRUQgCkJZIFNV
Q0ggQ09OVFJBQ1QgT1IgQUdSRUVNRU5ULgoKSW5kaWFuYSBVbml2ZXJzaXR5IHdpbGwgY2xlYXJseSBpZGVudGlmeSB5b3VyIG5hbWUocykgYXMgdGhlIGF1dGhvcihzKSBvciAKb3duZXIocykgb2YgdGhlIHN1Ym1pc3Npb24sIGFuZCB3aWxsIG5vdCBtYWtlIGFueSBhbHRlcmF0aW9uLCBvdGhlciB0aGFuIAphcyBhbGxvd2VkIGJ5IHRoaXMgbGljZW5zZSwgdG8geW91ciBzdWJtaXNzaW9uLgoKSUYgWU9VIEFSRSBTVUJNSVRUSU5HIFRISVMgSVRFTSBPTiBCRUhBTEYgT0YgVEhFIFJJR0hUUy1IT0xERVIsIFlPVSBNVVNUIApIQVZFIFRIRSBSSUdIVFMgT1dORVInUyA8YSBocmVmPSJodHRwOi8vd3d3LmluZGlhbmEuZWR1L35pdXN3L3Blcm1pc3Npb24vIiB0YXJnZXQ9InJlc291cmNlIHdpbmRvdyI+V1JJVFRFTiBQRVJNSVNTSU9OPC9hPiBUTyBBQ0NFUFQgVEhJUyBMSUNFTlNFIE9OIApISVMvSEVSIEJFSEFMRi4KCg==
oai:scholarworks.iu.edu:2022/16599
2021-10-18T11:21:41Z
com_2022_357
com_2022_356
com_2022_19673
col_2022_12986
Plale, Beth
Kouper, Inna
Seiffert, Kurt
Konkiel, Stacy R
2013-05-29T14:28:12Z
2013-05-29T14:28:12Z
2013
http://hdl.handle.net/2022/16599
In this back-of-envelope study we calculate the 15-year fixed and variable costs of setting up and running a data repository (or database) to store and serve the publications and datasets derived from research funded by the National Science Foundation (NSF). Costs are computed on a yearly basis using a fixed estimate of the number of papers published each year that list NSF as their funding agency. We assume each paper has one dataset and estimate the size of that dataset based on experience. By our estimates, the number of papers generated each year is 64,340. The average dataset size across all seven NSF directorates is 32 gigabytes (GB). The total amount of data added to the repository is two petabytes (PB) per year, or 30 PB over 15 years. The architecture of the data/paper repository is based on a hierarchical storage model that uses a combination of fast disk for rapid access and tape for high reliability and cost-efficient long-term storage. Data are ingested through workflows that are used in university institutional repositories, which add metadata and ensure data integrity. Average fixed costs are approximately 0.90 cents per GB over a 15-year span. Variable costs are estimated on a sliding scale of 150-100 dollars per new dataset for up-front curation, or 4.87-3.22 dollars per GB. Variable costs reflect a 3% annual decrease in curation costs, as efficiency gains and automated metadata and provenance capture are anticipated to reduce what are now largely manual curation efforts. The total projected cost of the data and paper repository is estimated at 167,000,000 dollars over 15 years of operation, curating close to one million datasets and one million papers. After 15 years and 30 PB of data accumulated and curated, we estimate the cost per gigabyte at 5.56 dollars. This $167 million cost is a direct cost in that it does not include federally allowable indirect costs return (ICR).
After 15 years, it is reasonable to assume that some datasets will be compressed and rarely accessed. Others may be deemed no longer valuable, e.g., because they are replaced by more accurate results. Therefore, at some point the data growth in the repository will need to be adjusted by use of strategic preservation.
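The headline figures in this abstract can be roughly cross-checked with a few lines of arithmetic. The sketch below is illustrative only and is not the authors' cost model: the function names are hypothetical, decimal units (1 PB = 1,000,000 GB) are assumed, and the 3% decline is assumed to compound annually from 150 dollars per dataset. Under those assumptions the total cost divided by 30 PB comes to about 5.57 dollars per GB, close to the reported 5.56; the small gap presumably reflects rounding in the published inputs.

```python
# Rough cross-check of the abstract's figures (assumptions are labeled).
PAPERS_PER_YEAR = 64_340        # from the abstract
AVG_DATASET_GB = 32             # from the abstract
YEARS = 15                      # from the abstract
TOTAL_COST_USD = 167_000_000    # from the abstract
GB_PER_PB = 1_000_000           # assumption: decimal units


def annual_data_gb() -> int:
    """Data added per year, assuming one dataset per paper."""
    return PAPERS_PER_YEAR * AVG_DATASET_GB


def curation_cost_per_dataset(year: int, start: float = 150.0,
                              decline: float = 0.03) -> float:
    """Curation cost in a given year (0-based), assuming a compounding 3% decline."""
    return start * (1 - decline) ** year


def cost_per_gb(total_cost_usd: float, total_pb: float) -> float:
    """Average cost per GB over the whole period."""
    return total_cost_usd / (total_pb * GB_PER_PB)


if __name__ == "__main__":
    print(f"data added per year: {annual_data_gb() / GB_PER_PB:.2f} PB")
    print(f"datasets after {YEARS} years: {PAPERS_PER_YEAR * YEARS:,}")
    print(f"curation cost in final year: {curation_cost_per_dataset(YEARS - 1):.2f} USD")
    print(f"average cost per GB: {cost_per_gb(TOTAL_COST_USD, 30):.2f} USD")
```

The per-year volume works out to slightly over two petabytes, and the final-year curation cost lands just under 100 dollars, which is consistent with the "150-100 dollars" sliding scale quoted above.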
Submitted by Inna Kouper (inkouper@indiana.edu) on 2013-05-29T13:54:43Z
No. of bitstreams: 0
Approved for entry into archive by Stacy Konkiel (skonkiel@indiana.edu) on 2013-05-29T14:28:12Z (GMT) No. of bitstreams: 0
Made available in DSpace on 2013-05-29T14:28:12Z (GMT). No. of bitstreams: 0
Previous issue date: 2013
en_US
Research Subject Categories
data repository
National Science Foundation (NSF)
cost model
digital preservation
data curation
Repository of NSF-funded Publications and Related Datasets: “Back of Envelope” Cost Estimate for 15 years
Technical Report
Working Paper
TRUE
true
LICENSE
license.txt
license.txt
text/plain
2036
https://scholarworks.iu.edu/dspace/bitstream/2022/16599/1/license.txt
36196899241d5157c049b9335ee93d86
MD5
1
ORIGINAL
Plale-2013-NSF-repository-estimate.pdf
Plale-2013-NSF-repository-estimate.pdf
application/pdf
133535
https://scholarworks.iu.edu/dspace/bitstream/2022/16599/2/Plale-2013-NSF-repository-estimate.pdf
dec9f71249670d58e6a68d3a27b802a2
MD5
2
TEXT
Plale-2013-NSF-repository-estimate.pdf.txt
Plale-2013-NSF-repository-estimate.pdf.txt
Extracted text
text/plain
24171
https://scholarworks.iu.edu/dspace/bitstream/2022/16599/3/Plale-2013-NSF-repository-estimate.pdf.txt
d3bf12d970a3b08c5cde4f6baed81c5e
MD5
3
THUMBNAIL
Plale-2013-NSF-repository-estimate.pdf.jpg
Plale-2013-NSF-repository-estimate.pdf.jpg
IM Thumbnail
image/jpeg
1436
https://scholarworks.iu.edu/dspace/bitstream/2022/16599/4/Plale-2013-NSF-repository-estimate.pdf.jpg
5c3d4ed9bffc11e4b0f3c52947aff6a6
MD5
4
2022/16599
oai:scholarworks.iu.edu:2022/16599
2021-10-18 07:21:41.631
IUScholarWorks
iusw@indiana.edu
Qnkgc2lnbmluZyBhbmQgc3VibWl0dGluZyB0aGlzIGxpY2Vuc2UsIHlvdSAodGhlIGNyZWF0b3Igb3IgY29weXJpZ2h0IG93bmVyKSBncmFudCB0byBJbmRpYW5hIFVuaXZlcnNpdHkgdGhlIG5vbi1leGNsdXNpdmUgcmlnaHQgdG8gcmVwcm9kdWNlLCB0cmFuc2xhdGUgKGFzIGRlZmluZWQgYmVsb3cpLCBhbmQvb3IgZGlzdHJpYnV0ZSB5b3VyIHN1Ym1pc3Npb24gKGluY2x1ZGluZyB0aGUgYWJzdHJhY3QpIHdvcmxkd2lkZSBpbiBwcmludCBhbmQgZWxlY3Ryb25pYyBmb3JtYXQgYW5kIGluIGFueSBtZWRpdW0sIGluY2x1ZGluZyBidXQgbm90IGxpbWl0ZWQgdG8gYXVkaW8gb3IgdmlkZW8uCgpZb3UgYWdyZWUgdGhhdCBJbmRpYW5hIFVuaXZlcnNpdHkgbWF5LCB3aXRob3V0IGNoYW5naW5nIHRoZSBjb250ZW50LCB0cmFuc2xhdGUgdGhlIHN1Ym1pc3Npb24gdG8gYW55IG1lZGl1bSBvciBmb3JtYXQgZm9yIHRoZSBwdXJwb3NlIG9mIHByZXNlcnZhdGlvbiwgYW5kIHByb3ZpZGUgYmFzaWMgbWV0YWRhdGEgdGhhdCBkZXNjcmliZXMgdGhlIGNvbnRlbnRzIGZvciBkaXNjb3ZlcnkgYW5kIHByZXNlcnZhdGlvbiBwdXJwb3Nlcy4KCllvdSBhbHNvIGFncmVlIHRoYXQgSW5kaWFuYSBVbml2ZXJzaXR5IG1heSBrZWVwIG1vcmUgdGhhbiBvbmUgY29weSBvZiB0aGlzIHN1Ym1pc3Npb24gZm9yIHB1cnBvc2VzIG9mIHNlY3VyaXR5LCBiYWNrLXVwIGFuZCBwcmVzZXJ2YXRpb24uIAoKWW91IHJlcHJlc2VudCB0aGF0IHRoZSBzdWJtaXNzaW9uIGlzIHlvdXIgb3JpZ2luYWwgd29yaywgYW5kIHRoYXQgeW91IGhhdmUgdGhlIHJpZ2h0IHRvIGdyYW50IHRoZSByaWdodHMgY29udGFpbmVkIGluIHRoaXMgbGljZW5zZS4gWW91IGFsc28gcmVwcmVzZW50IHRoYXQgeW91ciBzdWJtaXNzaW9uIGRvZXMgbm90LCB0byB0aGUgYmVzdCBvZiB5b3VyIGtub3dsZWRnZSwgaW5mcmluZ2UgdXBvbiBhbnlvbmUncyBjb3B5cmlnaHQuIAoKSWYgdGhlIHN1Ym1pc3Npb24gY29udGFpbnMgbWF0ZXJpYWwgZm9yIHdoaWNoIHlvdSBkbyBub3QgaG9sZCBjb3B5cmlnaHQsIHlvdSByZXByZXNlbnQgdGhhdCB5b3UgaGF2ZSBvYnRhaW5lZCB0aGUgdW5yZXN0cmljdGVkIHBlcm1pc3Npb24gb2YgdGhlIGNvcHlyaWdodCBvd25lciB0byBncmFudCBJbmRpYW5hIFVuaXZlcnNpdHkgdGhlIHJpZ2h0cyByZXF1aXJlZCBieSB0aGlzIGxpY2Vuc2UsIGFuZCB0aGF0IHN1Y2ggdGhpcmQtcGFydHkgb3duZWQgbWF0ZXJpYWwgaXMgY2xlYXJseSBpZGVudGlmaWVkIGFuZCBhY2tub3dsZWRnZWQgd2l0aGluIHRoZSB0ZXh0IG9yIGNvbnRlbnQgb2YgdGhlIHN1Ym1pc3Npb24uIAoKSWYgdGhlIHN1Ym1pc3Npb24gaXMgYmFzZWQgdXBvbiB3b3JrIHRoYXQgaGFzIGJlZW4gc3BvbnNvcmVkIG9yIHN1cHBvcnRlZCBieSBhbiBhZ2VuY3kgb3Igb3JnYW5pemF0aW9uIG90aGVyIHRoYW4gSW5kaWFuYSBVbml2ZXJzaXR5LCB5b3UgcmVwcmVzZW50IHRoYXQgeW91IGhhdmUgZnVsZmlsbGVk
IGFueSByaWdodCBvZiByZXZpZXcgb3Igb3RoZXIgb2JsaWdhdGlvbnMgcmVxdWlyZWQgYnkgc3VjaCBjb250cmFjdCBvciBhZ3JlZW1lbnQuCgpJbmRpYW5hIFVuaXZlcnNpdHkgd2lsbCBjbGVhcmx5IGlkZW50aWZ5IHlvdXIgbmFtZSBhcyB0aGUgY3JlYXRvciBhbmQvb3IgY29weXJpZ2h0IG93bmVyIG9mIHRoZSBzdWJtaXNzaW9uLCBhbmQgd2lsbCBub3QgbWFrZSBhbnkgYWx0ZXJhdGlvbiwgb3RoZXIgdGhhbiBhcyBhbGxvd2VkIGJ5IHRoaXMgbGljZW5zZSwgdG8geW91ciBzdWJtaXNzaW9uLiBXZSBhZ3JlZSB0byBub3QgbWFrZSBhdmFpbGFibGUgYW55IGl0ZW1zIHRoYXQgYXJlIGVtYmFyZ29lZCB1bnRpbCB0aGUgZW1iYXJnbyBoYXMgZXhwaXJlZC4KCklmIHlvdSBhcmUgc3VibWl0dGluZyB0aGlzIGl0ZW0gb24gYmVoYWxmIG9mIHRoZSByaWdodHNob2xkZXIsIHlvdSBtdXN0IGhhdmUgdGhlIHJpZ2h0cyBvd25lcu+/vXMgd3JpdHRlbiBwZXJtaXNzaW9uIHRvIGFjY2VwdCB0aGlzIGxpY2Vuc2Ugb24gaGlzL2hlciBiZWhhbGYuIAo=
oai:scholarworks.iu.edu:2022/16745
2021-10-18T08:49:44Z
com_2022_357
com_2022_356
com_2022_19673
col_2022_12986
Chen, Peng
Plale, Beth
Evans, Tom
2013-08-28T15:57:17Z
2013-08-28T15:57:17Z
2013-08
Chen, Peng, Beth Plale, and Tom Evans. “Dependency Provenance in Agent Based Modeling.” Preprint of paper accepted for the 9th IEEE International Conference on eScience (eScience 2013), submitted August 1, 2013.
http://hdl.handle.net/2022/16745
Researchers who use agent-based models (ABM) to model social patterns often focus on the model's aggregate phenomena. However, aggregation of individuals complicates the understanding of agent interactions and the uniqueness of individuals. We develop a method for tracing and capturing the provenance of individuals and their interactions in the NetLogo ABM, and from this create a "dependency provenance slice", which combines a data slice and a program slice to yield insights into the cause-effect relations among system behaviors. To cope with the large volume of fine-grained provenance traces, we propose use-inspired filters to reduce the amount of provenance, and a provenance slicing technique called "non-preprocessing provenance slicing" that directly queries over provenance traces without recovering all provenance entities and dependencies beforehand. We evaluate performance and utility using a well known ecological NetLogo model called "wolf-sheep-predation".
Submitted by Peng Chen (chenpeng@indiana.edu) on 2013-08-20T02:55:51Z
No. of bitstreams: 1
prov in ABM - preprint.pdf: 502854 bytes, checksum: 1c6461462d51c686165ef8150420c403 (MD5)
Approved for entry into archive by Department IUScholarWorks (iusw@indiana.edu) on 2013-08-28T15:57:17Z (GMT) No. of bitstreams: 1
prov in ABM - preprint.pdf: 502854 bytes, checksum: 1c6461462d51c686165ef8150420c403 (MD5)
Made available in DSpace on 2013-08-28T15:57:17Z (GMT). No. of bitstreams: 1
prov in ABM - preprint.pdf: 502854 bytes, checksum: 1c6461462d51c686165ef8150420c403 (MD5)
Previous issue date: 2013-08
en_US
Agent Based Modeling
Dependency provenance
Dependency Provenance in Agent Based Modeling
Preprint
true
ORIGINAL
prov in ABM - preprint.pdf
prov in ABM - preprint.pdf
application/pdf
502854
https://scholarworks.iu.edu/dspace/bitstream/2022/16745/1/prov%20in%20ABM%20-%20preprint.pdf
1c6461462d51c686165ef8150420c403
MD5
1
LICENSE
license.txt
license.txt
text/plain
2036
https://scholarworks.iu.edu/dspace/bitstream/2022/16745/2/license.txt
36196899241d5157c049b9335ee93d86
MD5
2
TEXT
prov in ABM - preprint.pdf.txt
prov in ABM - preprint.pdf.txt
Extracted text
text/plain
93074
https://scholarworks.iu.edu/dspace/bitstream/2022/16745/3/prov%20in%20ABM%20-%20preprint.pdf.txt
7cb380bc9e053fe74393d3d043921cff
MD5
3
THUMBNAIL
prov in ABM - preprint.pdf.jpg
prov in ABM - preprint.pdf.jpg
IM Thumbnail
image/jpeg
4615
https://scholarworks.iu.edu/dspace/bitstream/2022/16745/4/prov%20in%20ABM%20-%20preprint.pdf.jpg
e7c92596a255166c5522fb19d37271fc
MD5
4
2022/16745
oai:scholarworks.iu.edu:2022/16745
2021-10-18 04:49:44.164
IUScholarWorks
iusw@indiana.edu
oai:scholarworks.iu.edu:2022/18472
2021-10-18T10:01:34Z
com_2022_357
com_2022_356
com_2022_19673
col_2022_12986
Ruan, Guangchen
Plale, Beth
2014-07-02T14:58:09Z
2014-07-02T14:58:09Z
2014-07-02
http://hdl.handle.net/2022/18472
As digital data sources grow in number and size, they present an opportunity for computational investigation by means of text mining, NLP, and other text analysis techniques. The HathiTrust Research Center (HTRC) was recently established to provision automated analytical techniques over the more than 11 million digitized volumes (books) of the HathiTrust digital repository. The HTRC data store that hosts and provisions access to HathiTrust volumes needs to be efficient, fault-tolerant, and large-scale. In this paper, we propose three schema designs for a Cassandra NoSQL store to represent the HathiTrust corpus and perform an extensive performance evaluation using simulated workloads. The experimental results demonstrate that encapsulating the whole volume within a single row with regular columns delivers the best overall performance.
Submitted by Guangchen Ruan (gruan@indiana.edu) on 2014-07-02T14:21:58Z
No. of bitstreams: 1
schemaeval.pdf: 574660 bytes, checksum: 7343f2f63d692e860dbbfa57bc15dfef (MD5)
Approved for entry into archive by Jan Holloway (holloway@indiana.edu) on 2014-07-02T14:58:09Z (GMT) No. of bitstreams: 1
schemaeval.pdf: 574660 bytes, checksum: 7343f2f63d692e860dbbfa57bc15dfef (MD5)
Made available in DSpace on 2014-07-02T14:58:09Z (GMT). No. of bitstreams: 1
schemaeval.pdf: 574660 bytes, checksum: 7343f2f63d692e860dbbfa57bc15dfef (MD5)
en
http://www.cs.indiana.edu/cgi-bin/techreports/TRNNN.cgi?trnum=TR713
Cassandra
schema design
performance evaluation
Evaluation of Data Storage in HathiTrust Research Center Using Cassandra
Technical Report
false
LICENSE
license.txt
license.txt
text/plain
2036
https://scholarworks.iu.edu/dspace/bitstream/2022/18472/2/license.txt
36196899241d5157c049b9335ee93d86
MD5
2
TEXT
schemaeval.pdf.txt
schemaeval.pdf.txt
Extracted text
text/plain
54392
https://scholarworks.iu.edu/dspace/bitstream/2022/18472/3/schemaeval.pdf.txt
a53f804e3df3776c063292f324704788
MD5
3
ORIGINAL
schemaeval.pdf
schemaeval.pdf
application/pdf
574523
https://scholarworks.iu.edu/dspace/bitstream/2022/18472/4/schemaeval.pdf
b26d3c2529844ed7c0eb47bc7846d685
MD5
4
THUMBNAIL
schemaeval.pdf.jpg
schemaeval.pdf.jpg
IM Thumbnail
image/jpeg
4250
https://scholarworks.iu.edu/dspace/bitstream/2022/18472/5/schemaeval.pdf.jpg
e1b0decc4a78eefcaaf79e8062a45236
MD5
5
2022/18472
oai:scholarworks.iu.edu:2022/18472
2021-10-18 06:01:34.311
IUScholarWorks
iusw@indiana.edu
oai:scholarworks.iu.edu:2022/18488
2021-10-18T07:26:49Z
com_2022_357
com_2022_356
com_2022_19673
col_2022_12986
Plale, Beth
2014-07-10T13:33:16Z
2014-07-10T13:33:16Z
2013-11-13
http://hdl.handle.net/2022/18488
The ubiquity of today's data is not just transforming what is, it is transforming what will be,
laying the groundwork to drive new innovation. Today, research questions are
addressed by complex models, by large data analysis tasks, and by sophisticated data
visualization techniques, all requiring data. To address the growing global need for data
infrastructure, the Research Data Alliance (RDA) was launched in FY13 as an
international community-driven organization. We propose to bring together members of
RDA with the HPC community to create a shared conversation around the utility of RDA
for data-driven challenges in HPC.
Submitted by Inna Kouper (inkouper@indiana.edu) on 2014-07-10T13:31:57Z
No. of bitstreams: 1
BigDataHPCRDA-SC13BOF.pdf: 149672 bytes, checksum: 71c8183d503f6065546dcf79142dfe94 (MD5)
Approved for entry into archive by Inna Kouper (inkouper@indiana.edu) on 2014-07-10T13:33:16Z (GMT) No. of bitstreams: 1
BigDataHPCRDA-SC13BOF.pdf: 149672 bytes, checksum: 71c8183d503f6065546dcf79142dfe94 (MD5)
Made available in DSpace on 2014-07-10T13:33:16Z (GMT). No. of bitstreams: 1
BigDataHPCRDA-SC13BOF.pdf: 149672 bytes, checksum: 71c8183d503f6065546dcf79142dfe94 (MD5)
Previous issue date: 2013-11-13
en_US
free to use with appropriate citation and attribution
data, big data, research data alliance, RDA, high performance computing, supercomputing
Big Data and HPC: Exploring Role of Research Data Alliance (RDA), a Report On Supercomputing 2013 Birds of a Feather
Technical Report
true
ORIGINAL
BigDataHPCRDA-SC13BOF.pdf
BigDataHPCRDA-SC13BOF.pdf
report in pdf format
application/pdf
149672
https://scholarworks.iu.edu/dspace/bitstream/2022/18488/1/BigDataHPCRDA-SC13BOF.pdf
71c8183d503f6065546dcf79142dfe94
MD5
1
LICENSE
license.txt
license.txt
text/plain
2036
https://scholarworks.iu.edu/dspace/bitstream/2022/18488/2/license.txt
36196899241d5157c049b9335ee93d86
MD5
2
TEXT
BigDataHPCRDA-SC13BOF.pdf.txt
BigDataHPCRDA-SC13BOF.pdf.txt
Extracted text
text/plain
5884
https://scholarworks.iu.edu/dspace/bitstream/2022/18488/3/BigDataHPCRDA-SC13BOF.pdf.txt
8925b9d6cea3d385c5d2f0e7d50326f2
MD5
3
THUMBNAIL
BigDataHPCRDA-SC13BOF.pdf.jpg
BigDataHPCRDA-SC13BOF.pdf.jpg
IM Thumbnail
image/jpeg
4143
https://scholarworks.iu.edu/dspace/bitstream/2022/18488/4/BigDataHPCRDA-SC13BOF.pdf.jpg
f8f6c3dd1b3eb512c5703d4181643567
MD5
4
2022/18488
oai:scholarworks.iu.edu:2022/18488
2021-10-18 03:26:49.403
IUScholarWorks
iusw@indiana.edu
oai:scholarworks.iu.edu:2022/18721
2021-10-18T12:10:32Z
com_2022_357
com_2022_356
com_2022_19673
col_2022_12986
Sun, Yiming
Plale, Beth
Zeng, Jiaan
2014-09-11T12:45:39Z
2014-09-11T12:45:39Z
http://hdl.handle.net/2022/18721
The HathiTrust Research Center (HTRC) allows users
to access more than 3 million volumes through a service
called the Data API. The Data API plays an important role in the HTRC
infrastructure. It hides internal complexity from users, protects
against malicious or inadvertent damage to data, and separates
the underlying storage solution from the interface so that the underlying
storage may be replaced with better solutions without affecting
client code. We carried out extensive evaluations of the HTRC
Data API performance during Spring 2013. Specifically, we
evaluated the rate at which data can be retrieved from the
Cassandra cluster under different conditions, the impact of different
compression levels, and HTTP/HTTPS data transfer. The
evaluation presents performance aspects of the different software
components in the Data API and guides us toward optimal settings
for the Data API.
Submitted by Jiaan Zeng (jiaazeng@indiana.edu) on 2014-09-09T21:06:42Z
No. of bitstreams: 1
dataapi-report.pdf: 1076011 bytes, checksum: 50e5a733806e73932577adfdc3a5e108 (MD5)
Approved for entry into archive by Inna Kouper (inkouper@indiana.edu) on 2014-09-11T12:45:39Z (GMT) No. of bitstreams: 1
dataapi-report.pdf: 1076011 bytes, checksum: 50e5a733806e73932577adfdc3a5e108 (MD5)
Made available in DSpace on 2014-09-11T12:45:39Z (GMT). No. of bitstreams: 1
dataapi-report.pdf: 1076011 bytes, checksum: 50e5a733806e73932577adfdc3a5e108 (MD5)
en_US
Cassandra
Performance
HTRC
API
performance evaluation
HathiTrust Research Center
HTRC Data API Performance Study
Technical Report
true
ORIGINAL
dataapi-report-v2.pdf
dataapi-report-v2.pdf
application/pdf
1077625
https://scholarworks.iu.edu/dspace/bitstream/2022/18721/4/dataapi-report-v2.pdf
8a2a3a94a8950724529e2fbf73849c24
MD5
4
LICENSE
license.txt
license.txt
text/plain
2036
https://scholarworks.iu.edu/dspace/bitstream/2022/18721/2/license.txt
36196899241d5157c049b9335ee93d86
MD5
2
TEXT
dataapi-report-v2.pdf.txt
dataapi-report-v2.pdf.txt
Extracted text
text/plain
21880
https://scholarworks.iu.edu/dspace/bitstream/2022/18721/5/dataapi-report-v2.pdf.txt
3446278337c9a2044b724d101707d097
MD5
5
THUMBNAIL
dataapi-report-v2.pdf.jpg
dataapi-report-v2.pdf.jpg
IM Thumbnail
image/jpeg
4925
https://scholarworks.iu.edu/dspace/bitstream/2022/18721/6/dataapi-report-v2.pdf.jpg
ff157f16699ba838cd0a394acd2e0284
MD5
6
2022/18721
oai:scholarworks.iu.edu:2022/18721
2021-10-18 08:10:32.151
IUScholarWorks
iusw@indiana.edu
oai:scholarworks.iu.edu:2022/19277
2021-10-18T13:21:16Z
com_2022_357
com_2022_356
com_2022_19673
col_2022_12986
Plale, Beth
Prakash, Atul
McDonald, Robert
2015-02-04T16:57:05Z
2015-02-04T16:57:05Z
http://hdl.handle.net/2022/19277
Digital texts with access and use protections form a unique and fast-growing collection of materials. Growing equally quickly is the development of text and data mining algorithms that process large text-based collections in order to explore their content computationally. There is a strong need for research to establish the foundations for secure computational and data technologies that can ensure a non-consumptive environment for use-protected texts such as the copyrighted works in the HathiTrust Digital Library.
Developing a secure computation and data environment for non-consumptive research for the HathiTrust Research Center is funded through a grant from the Alfred P. Sloan Foundation. In this research, researchers at HTRC and the University of Michigan are developing a “data capsule framework” that is founded on a principle of “trust but verify”. The project has resulted in a novel experimental framework that permits analytical investigation of a corpus but prohibits data from leaving the capsule. The HTRC Data Capsule is both a system architecture and set of policies that enable computational investigation over the protected content of the HT digital repository that is carried out and controlled directly by a researcher.
Submitted by Miao Chen (miaochen@indiana.edu) on 2015-02-04T16:54:58Z
No. of bitstreams: 1
HTRCSloanReport_ScholarWorks.pdf: 321502 bytes, checksum: 938041b005fc74ecb136bcbbabb9efb7 (MD5)
Approved for entry into archive by Inna Kouper (inkouper@indiana.edu) on 2015-02-04T16:57:05Z (GMT) No. of bitstreams: 1
HTRCSloanReport_ScholarWorks.pdf: 321502 bytes, checksum: 938041b005fc74ecb136bcbbabb9efb7 (MD5)
Made available in DSpace on 2015-02-04T16:57:05Z (GMT). No. of bitstreams: 1
HTRCSloanReport_ScholarWorks.pdf: 321502 bytes, checksum: 938041b005fc74ecb136bcbbabb9efb7 (MD5)
Alfred P. Sloan Foundation
en
HathiTrust Research Center, HTRC Data Capsule, non-consumptive research
The Data Capsule for Non-Consumptive Research: Final Report
Technical Report
true
ORIGINAL
HTRCSloanReport_ScholarWorks.pdf
HTRCSloanReport_ScholarWorks.pdf
HTRC Sloan Project Final Report
application/pdf
321502
https://scholarworks.iu.edu/dspace/bitstream/2022/19277/1/HTRCSloanReport_ScholarWorks.pdf
938041b005fc74ecb136bcbbabb9efb7
MD5
1
LICENSE
license.txt
license.txt
text/plain
2012
https://scholarworks.iu.edu/dspace/bitstream/2022/19277/2/license.txt
2d12280d99d4502d180cd616e4c6d855
MD5
2
TEXT
HTRCSloanReport_ScholarWorks.pdf.txt
HTRCSloanReport_ScholarWorks.pdf.txt
Extracted text
text/plain
14620
https://scholarworks.iu.edu/dspace/bitstream/2022/19277/3/HTRCSloanReport_ScholarWorks.pdf.txt
d88f56cc33ae82096a056d47dcd5ff16
MD5
3
THUMBNAIL
HTRCSloanReport_ScholarWorks.pdf.jpg
HTRCSloanReport_ScholarWorks.pdf.jpg
IM Thumbnail
image/jpeg
4258
https://scholarworks.iu.edu/dspace/bitstream/2022/19277/4/HTRCSloanReport_ScholarWorks.pdf.jpg
695a392ba5dbbb08863798f4953dd5c3
MD5
4
2022/19277
oai:scholarworks.iu.edu:2022/19277
2021-10-18 09:21:16.119
IUScholarWorks
iusw@indiana.edu
CkJ5IHNpZ25pbmcgYW5kIHN1Ym1pdHRpbmcgdGhpcyBsaWNlbnNlLCB5b3UgKHRoZSBjcmVhdG9yIG9yIGNvcHlyaWdodCBvd25lcikgZ3JhbnQgdG8gSW5kaWFuYSBVbml2ZXJzaXR5IGEgbm9uLWV4Y2x1c2l2ZSwgcGVycGV0dWFsLCBpcnJldm9jYWJsZSByaWdodCB0byByZXByb2R1Y2UsIHRyYW5zbGF0ZSAoYXMgZGVmaW5lZCBiZWxvdyksIGFuZC9vciBkaXN0cmlidXRlIHlvdXIgc3VibWlzc2lvbiAoaW5jbHVkaW5nIHRoZSBhYnN0cmFjdCkgd29ybGR3aWRlIGluIHByaW50IGFuZCBlbGVjdHJvbmljIGZvcm1hdCBhbmQgaW4gYW55IG1lZGl1bSwgaW5jbHVkaW5nIGJ1dCBub3QgbGltaXRlZCB0byBhdWRpbyBvciB2aWRlby4KCllvdSBhZ3JlZSB0aGF0IEluZGlhbmEgVW5pdmVyc2l0eSBtYXksIHdpdGhvdXQgY2hhbmdpbmcgdGhlIGNvbnRlbnQsIHRyYW5zbGF0ZSB0aGUgc3VibWlzc2lvbiB0byBhbnkgbWVkaXVtIG9yIGZvcm1hdCBmb3IgcHJlc2VydmF0aW9uIG9yIGFjY2VzcywgYW5kIHByb3ZpZGUgYmFzaWMgbWV0YWRhdGEgdGhhdCBkZXNjcmliZXMgdGhlIGNvbnRlbnRzIGZvciBkaXNjb3ZlcnkuCgpZb3UgYWxzbyBhZ3JlZSB0aGF0IEluZGlhbmEgVW5pdmVyc2l0eSBtYXkga2VlcCBtb3JlIHRoYW4gb25lIGNvcHkgb2YgdGhpcyBzdWJtaXNzaW9uIGZvciBzZWN1cml0eSwgYmFjay11cCBhbmQgcHJlc2VydmF0aW9uLgoKWW91IHJlcHJlc2VudCB0aGF0IHRoZSBzdWJtaXNzaW9uIGlzIHlvdXIgb3JpZ2luYWwgd29yaywgYW5kIHRoYXQgeW91IGhhdmUgdGhlIHJpZ2h0IHRvIGdyYW50IHRoZSByaWdodHMgY29udGFpbmVkIGluIHRoaXMgbGljZW5zZS4gWW91IGFsc28gcmVwcmVzZW50IHRoYXQgeW91ciBzdWJtaXNzaW9uIGRvZXMgbm90LCB0byB0aGUgYmVzdCBvZiB5b3VyIGtub3dsZWRnZSwgaW5mcmluZ2UgdXBvbiBhbnlvbmUncyBjb3B5cmlnaHQuCgpJZiB0aGUgc3VibWlzc2lvbiBjb250YWlucyBtYXRlcmlhbCBmb3Igd2hpY2ggeW91IGRvIG5vdCBob2xkIGNvcHlyaWdodCwgeW91IHJlcHJlc2VudCB0aGF0IHlvdSBoYXZlIG9idGFpbmVkIHRoZSB1bnJlc3RyaWN0ZWQgcGVybWlzc2lvbiBvZiB0aGUgY29weXJpZ2h0IG93bmVyIHRvIGdyYW50IEluZGlhbmEgVW5pdmVyc2l0eSB0aGUgcmlnaHRzIHJlcXVpcmVkIGJ5IHRoaXMgbGljZW5zZSwgYW5kIHRoYXQgc3VjaCB0aGlyZC1wYXJ0eSBvd25lZCBtYXRlcmlhbCBpcyBjbGVhcmx5IGlkZW50aWZpZWQgYW5kIGFja25vd2xlZGdlZCB3aXRoaW4gdGhlIHRleHQgb3IgY29udGVudCBvZiB0aGUgc3VibWlzc2lvbi4KCklmIHRoZSBzdWJtaXNzaW9uIGlzIGJhc2VkIHVwb24gd29yayB0aGF0IGhhcyBiZWVuIHNwb25zb3JlZCBvciBzdXBwb3J0ZWQgYnkgYW4gYWdlbmN5IG9yIG9yZ2FuaXphdGlvbiBvdGhlciB0aGFuIEluZGlhbmEgVW5pdmVyc2l0eSwgeW91IHJlcHJlc2VudCB0aGF0IHlvdSBoYXZlIGZ1bGZpbGxlZCBhbnkgcmlnaHQgb2YgcmV2aWV3IG9y
IG90aGVyIG9ibGlnYXRpb25zIHJlcXVpcmVkIGJ5IHN1Y2ggY29udHJhY3Qgb3IgYWdyZWVtZW50LgoKSW5kaWFuYSBVbml2ZXJzaXR5IHdpbGwgY2xlYXJseSBpZGVudGlmeSB5b3VyIG5hbWUgYXMgdGhlIGNyZWF0b3IgYW5kL29yIGNvcHlyaWdodCBvd25lciBvZiB0aGUgc3VibWlzc2lvbiwgYW5kIHdpbGwgbm90IG1ha2UgYW55IGFsdGVyYXRpb24sIG90aGVyIHRoYW4gYXMgYWxsb3dlZCBieSB0aGlzIGxpY2Vuc2UsIHRvIHlvdXIgc3VibWlzc2lvbi4gV2UgYWdyZWUgdG8gbm90IG1ha2UgYXZhaWxhYmxlIGFueSBmaWxlcyB0aGF0IGFyZSBlbWJhcmdvZWQgdW50aWwgdGhlIGVtYmFyZ28gaGFzIGV4cGlyZWQuCgpJZiB5b3UgYXJlIHN1Ym1pdHRpbmcgdGhpcyBpdGVtIG9uIGJlaGFsZiBvZiB0aGUgcmlnaHRzIGhvbGRlciwgeW91IG11c3QgaGF2ZSB0aGUgcmlnaHRzIG93bmVyJ3Mgd3JpdHRlbiBwZXJtaXNzaW9uIHRvIGFjY2VwdCB0aGlzIGxpY2Vuc2Ugb24gaGlzL2hlciBiZWhhbGYuCgo=
oai:scholarworks.iu.edu:2022/19760
2021-10-18T13:45:56Z
com_2022_357
com_2022_356
com_2022_19673
col_2022_12986
Plale, Beth
Jones, Matt
Thain, Douglas
2015-03-31T22:45:51Z
2015-03-31T22:45:51Z
2015-03-31
http://hdl.handle.net/2022/19760
The second annual NSF Software Infrastructure for Sustained Innovation (SI2) PI meeting took place in Arlington, VA February 24-25, 2014. It was hosted by Beth Plale, Indiana University; Douglas Thain, University of Notre Dame; and Matt Jones, National Center for Ecological Analysis and Synthesis.
This report captures the challenges and outcomes emerging from the meeting across the four topic areas discussed: i) Attribution and Citation, ii) Reproducibility, Reusability, and Preservation, iii) Project/Software Sustainability, and iv) Career Paths. The report is an academic synthesis with credit to all the participants and to the notetakers, who took prodigious notes and synthesized the results from which the conclusions of this report are derived.
Submitted by Beth Plale (plale@indiana.edu) on 2015-03-31T22:42:09Z
No. of bitstreams: 1
2014SI2FinalReportSoftwareinScience.pdf: 114197 bytes, checksum: 94739672617fb051bd863aadb29468bd (MD5)
Approved for entry into archive by Beth Plale (plale@indiana.edu) on 2015-03-31T22:45:51Z (GMT) No. of bitstreams: 1
2014SI2FinalReportSoftwareinScience.pdf: 114197 bytes, checksum: 94739672617fb051bd863aadb29468bd (MD5)
Made available in DSpace on 2015-03-31T22:45:51Z (GMT). No. of bitstreams: 1
2014SI2FinalReportSoftwareinScience.pdf: 114197 bytes, checksum: 94739672617fb051bd863aadb29468bd (MD5)
Previous issue date: 2015-03-31
National Science Foundation, award # 1419131
en_US
CC BY 4.0
software, reproducibility, citation, annotation, career paths
Software in Science: a Report of Outcomes of the 2014 National Science Foundation Software Infrastructure for Sustained Innovation (SI2) Meeting
Technical Report
true
TEXT
2014SI2FinalReportSoftwareinScience.pdf.txt
2014SI2FinalReportSoftwareinScience.pdf.txt
Extracted text
text/plain
23931
https://scholarworks.iu.edu/dspace/bitstream/2022/19760/3/2014SI2FinalReportSoftwareinScience.pdf.txt
1ad1f7995bd908e556b8a5c122938c1c
MD5
3
ORIGINAL
2014SI2FinalReportSoftwareinScience.pdf
2014SI2FinalReportSoftwareinScience.pdf
NSF Workshop report
application/pdf
114197
https://scholarworks.iu.edu/dspace/bitstream/2022/19760/1/2014SI2FinalReportSoftwareinScience.pdf
94739672617fb051bd863aadb29468bd
MD5
1
LICENSE
license.txt
license.txt
text/plain
2012
https://scholarworks.iu.edu/dspace/bitstream/2022/19760/2/license.txt
2d12280d99d4502d180cd616e4c6d855
MD5
2
THUMBNAIL
2014SI2FinalReportSoftwareinScience.pdf.jpg
2014SI2FinalReportSoftwareinScience.pdf.jpg
IM Thumbnail
image/jpeg
4151
https://scholarworks.iu.edu/dspace/bitstream/2022/19760/4/2014SI2FinalReportSoftwareinScience.pdf.jpg
c29a2392e6552bad9dfa2f8d1acdc72e
MD5
4
2022/19760
oai:scholarworks.iu.edu:2022/19760
2021-10-18 09:45:56.132
IUScholarWorks
iusw@indiana.edu
oai:scholarworks.iu.edu:2022/20739
2021-10-18T12:33:10Z
com_2022_357
com_2022_356
com_2022_19673
col_2022_12986
Ruan, Guangchen
Zhang, Hui
Wernert, Eric
Plale, Beth
2016-03-10T19:37:54Z
2016-03-10T19:37:54Z
2014-07-13
Ruan, G., Zhang, H., Wernert, E., & Plale, B. (2014, July 13). TextRWeb: Large-Scale Text Analytics with R on the Web. Paper presented at XSEDE14, Atlanta, GA. doi:10.1145/2616498.2616557
http://hdl.handle.net/2022/20739
10.1145/2616498.2616557
As digital data sources grow in number and size, they pose an opportunity for computational investigation by means of text mining, NLP, and other text analysis techniques. R is a popular and powerful text analytics tool; however, it needs to run in parallel and requires special handling to protect copyrighted content against full access (consumption). The HathiTrust Research Center (HTRC) currently has 11 million volumes (books), of which 7 million are copyrighted. In this paper we propose HTRC TextRWeb, an interactive R software environment which employs complexity-hiding interfaces and automatic code generation to allow large-scale text analytics in a non-consumptive manner. For our principal test case of copyrighted data in the HathiTrust Digital Library, TextRWeb permits us to code, edit, and submit text analytics methods through a family of interactive web user interfaces. All these methods combine to reveal a new interactive paradigm for large-scale text analytics on the web.
Submitted by Winona Snapp-Childs (wsnappch@indiana.edu) on 2016-03-10T19:36:06Z
No. of bitstreams: 1
a63-ruan(1).pdf: 2055237 bytes, checksum: fdb25c30274484fc21447120d4e56b8d (MD5)
Approved for entry into archive by Winona Snapp-Childs (wsnappch@indiana.edu) on 2016-03-10T19:37:54Z (GMT) No. of bitstreams: 1
a63-ruan(1).pdf: 2055237 bytes, checksum: fdb25c30274484fc21447120d4e56b8d (MD5)
Made available in DSpace on 2016-03-10T19:37:54Z (GMT). No. of bitstreams: 1
a63-ruan(1).pdf: 2055237 bytes, checksum: fdb25c30274484fc21447120d4e56b8d (MD5)
Previous issue date: 2014-07-13
en_US
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from permissions@acm.org.
R, text analysis, interactive, non-consumptive use, parallel computing
TextRWeb: Large-Scale Text Analytics with R on the Web
Presentation
true
ORIGINAL
a63-ruan(1).pdf
a63-ruan(1).pdf
Main Paper
application/pdf
2055237
https://scholarworks.iu.edu/dspace/bitstream/2022/20739/1/a63-ruan%281%29.pdf
fdb25c30274484fc21447120d4e56b8d
MD5
1
LICENSE
license.txt
license.txt
text/plain
2042
https://scholarworks.iu.edu/dspace/bitstream/2022/20739/2/license.txt
ec02ee705cdd83d448aa3c67b4ef102c
MD5
2
TEXT
a63-ruan(1).pdf.txt
a63-ruan(1).pdf.txt
Extracted text
text/plain
43879
https://scholarworks.iu.edu/dspace/bitstream/2022/20739/3/a63-ruan%281%29.pdf.txt
0d7bb29a061594bf1b72da8c458bcae1
MD5
3
THUMBNAIL
a63-ruan(1).pdf.jpg
a63-ruan(1).pdf.jpg
IM Thumbnail
image/jpeg
3430
https://scholarworks.iu.edu/dspace/bitstream/2022/20739/4/a63-ruan%281%29.pdf.jpg
d2a6d445f77e13fd07d7ed0376453823
MD5
4
2022/20739
oai:scholarworks.iu.edu:2022/20739
2021-10-18 08:33:10.914
IUScholarWorks
iusw@indiana.edu
Qnkgc2lnbmluZyBhbmQgc3VibWl0dGluZyB0aGlzIGxpY2Vuc2UsIHlvdSAodGhlIGNyZWF0b3Igb3IgY29weXJpZ2h0IG93bmVyKSBncmFudCB0byBJbmRpYW5hIFVuaXZlcnNpdHkgYSBub24tZXhjbHVzaXZlLCBwZXJwZXR1YWwsIGlycmV2b2NhYmxlIHJpZ2h0IHRvIHJlcHJvZHVjZSwgdHJhbnNsYXRlIChhcyBkZWZpbmVkIGJlbG93KSwgYW5kL29yIGRpc3RyaWJ1dGUgeW91ciBzdWJtaXNzaW9uIChpbmNsdWRpbmcgdGhlIGFic3RyYWN0KSB3b3JsZHdpZGUgaW4gcHJpbnQgYW5kIGVsZWN0cm9uaWMgZm9ybWF0IGFuZCBpbiBhbnkgbWVkaXVtLCBpbmNsdWRpbmcgYnV0IG5vdCBsaW1pdGVkIHRvIGF1ZGlvIG9yIHZpZGVvLgoKWW91IGFncmVlIHRoYXQgSW5kaWFuYSBVbml2ZXJzaXR5IG1heSwgd2l0aG91dCBjaGFuZ2luZyB0aGUgY29udGVudCwgdHJhbnNsYXRlIHRoZSBzdWJtaXNzaW9uIHRvIGFueSBtZWRpdW0gb3IgZm9ybWF0LCBub3cga25vd24gb3IgbGF0ZXIgZGV2ZWxvcGVkLCBmb3IgcHJlc2VydmF0aW9uIG9yIGFjY2VzcywgYW5kIHByb3ZpZGUgYmFzaWMgbWV0YWRhdGEgdGhhdCBkZXNjcmliZXMgdGhlIGNvbnRlbnRzIGZvciBkaXNjb3ZlcnkuCgpZb3UgYWxzbyBhZ3JlZSB0aGF0IEluZGlhbmEgVW5pdmVyc2l0eSBtYXkga2VlcCBtb3JlIHRoYW4gb25lIGNvcHkgb2YgdGhpcyBzdWJtaXNzaW9uIGZvciBzZWN1cml0eSwgYmFjay11cCBhbmQgcHJlc2VydmF0aW9uLgoKWW91IHJlcHJlc2VudCB0aGF0IHRoZSBzdWJtaXNzaW9uIGlzIHlvdXIgb3JpZ2luYWwgd29yaywgYW5kIHRoYXQgeW91IGhhdmUgdGhlIHJpZ2h0IHRvIGdyYW50IHRoZSByaWdodHMgY29udGFpbmVkIGluIHRoaXMgbGljZW5zZS4gWW91IGFsc28gcmVwcmVzZW50IHRoYXQgeW91ciBzdWJtaXNzaW9uIGRvZXMgbm90LCB0byB0aGUgYmVzdCBvZiB5b3VyIGtub3dsZWRnZSwgaW5mcmluZ2UgdXBvbiBhbnlvbmUncyBjb3B5cmlnaHQuCgpJZiB0aGUgc3VibWlzc2lvbiBjb250YWlucyBtYXRlcmlhbCBmb3Igd2hpY2ggeW91IGRvIG5vdCBob2xkIGNvcHlyaWdodCwgeW91IHJlcHJlc2VudCB0aGF0IHlvdSBoYXZlIG9idGFpbmVkIHRoZSB1bnJlc3RyaWN0ZWQgcGVybWlzc2lvbiBvZiB0aGUgY29weXJpZ2h0IG93bmVyIHRvIGdyYW50IEluZGlhbmEgVW5pdmVyc2l0eSB0aGUgcmlnaHRzIHJlcXVpcmVkIGJ5IHRoaXMgbGljZW5zZSwgYW5kIHRoYXQgc3VjaCB0aGlyZC1wYXJ0eSBvd25lZCBtYXRlcmlhbCBpcyBjbGVhcmx5IGlkZW50aWZpZWQgYW5kIGFja25vd2xlZGdlZCB3aXRoaW4gdGhlIHRleHQgb3IgY29udGVudCBvZiB0aGUgc3VibWlzc2lvbi4KCklmIHRoZSBzdWJtaXNzaW9uIGlzIGJhc2VkIHVwb24gd29yayB0aGF0IGhhcyBiZWVuIHNwb25zb3JlZCBvciBzdXBwb3J0ZWQgYnkgYW4gYWdlbmN5IG9yIG9yZ2FuaXphdGlvbiBvdGhlciB0aGFuIEluZGlhbmEgVW5pdmVyc2l0eSwgeW91IHJlcHJlc2VudCB0aGF0IHlvdSBoYXZlIGZ1
bGZpbGxlZCBhbnkgcmlnaHQgb2YgcmV2aWV3IG9yIG90aGVyIG9ibGlnYXRpb25zIHJlcXVpcmVkIGJ5IHN1Y2ggY29udHJhY3Qgb3IgYWdyZWVtZW50LgoKSW5kaWFuYSBVbml2ZXJzaXR5IHdpbGwgY2xlYXJseSBpZGVudGlmeSB5b3VyIG5hbWUgYXMgdGhlIGNyZWF0b3IgYW5kL29yIGNvcHlyaWdodCBvd25lciBvZiB0aGUgc3VibWlzc2lvbiwgYW5kIHdpbGwgbm90IG1ha2UgYW55IGFsdGVyYXRpb25zLCBvdGhlciB0aGFuIGFzIGFsbG93ZWQgYnkgdGhpcyBsaWNlbnNlLCB0byB5b3VyIHN1Ym1pc3Npb24uIFdlIGFncmVlIHRvIG5vdCBtYWtlIGF2YWlsYWJsZSBhbnkgZmlsZXMgdGhhdCBhcmUgZW1iYXJnb2VkIHVudGlsIHRoZSBlbWJhcmdvIGhhcyBleHBpcmVkLgoKSWYgeW91IGFyZSBzdWJtaXR0aW5nIHRoaXMgaXRlbSBvbiBiZWhhbGYgb2YgdGhlIHJpZ2h0cyBob2xkZXIsIHlvdSBtdXN0IGhhdmUgdGhlIHJpZ2h0cyBvd25lcidzIHdyaXR0ZW4gcGVybWlzc2lvbiB0byBhY2NlcHQgdGhpcyBsaWNlbnNlIG9uIGhpcy9oZXIgYmVoYWxmLgo=
oai:scholarworks.iu.edu:2022/20803
2021-10-18T10:02:17Z
com_2022_357
com_2022_356
com_2022_19673
col_2022_12986
Plale, Beth
Study undertaken April 2016
2016-04-14T14:57:12Z
2016-04-14T14:57:12Z
http://hdl.handle.net/2022/20803
This study was undertaken to assess the computational and storage needs for a large-scale research activity to study water in the State of Indiana. It draws its data and compute numbers from the Vortex II Forecast Data study of 2010 carried out by the Data To Insight Center at Indiana University. Details of that study can be found in each of the archived data products (each of which contains the results of a single weather forecast plus 42 visualizations created for that forecast). See https://scholarworks.iu.edu/dspace/handle/2022/15153 for an example archived data product.
Submitted by Beth Plale (plale@indiana.edu) on 2016-04-14T14:55:25Z
No. of bitstreams: 1
ComputeStorageEstimate-WaterGC.pdf: 84597 bytes, checksum: 4f2bbb931cd10cd7bfa109ddbd0255b3 (MD5)
Approved for entry into archive by Beth Plale (plale@indiana.edu) on 2016-04-14T14:57:11Z (GMT) No. of bitstreams: 1
ComputeStorageEstimate-WaterGC.pdf: 84597 bytes, checksum: 4f2bbb931cd10cd7bfa109ddbd0255b3 (MD5)
Made available in DSpace on 2016-04-14T14:57:12Z (GMT). No. of bitstreams: 1
ComputeStorageEstimate-WaterGC.pdf: 84597 bytes, checksum: 4f2bbb931cd10cd7bfa109ddbd0255b3 (MD5)
en_US
none
Cyberinfrastructure, HPC, data management, water resource models, hydroinformatics
Grand Challenge of Indiana Water: Estimate of Compute and Data Storage Needs
Working Paper
false
ORIGINAL
ComputeStorageEstimate-WaterGC.pdf
ComputeStorageEstimate-WaterGC.pdf
application/pdf
84597
https://scholarworks.iu.edu/dspace/bitstream/2022/20803/1/ComputeStorageEstimate-WaterGC.pdf
4f2bbb931cd10cd7bfa109ddbd0255b3
MD5
1
LICENSE
license.txt
license.txt
text/plain
2042
https://scholarworks.iu.edu/dspace/bitstream/2022/20803/2/license.txt
ec02ee705cdd83d448aa3c67b4ef102c
MD5
2
TEXT
ComputeStorageEstimate-WaterGC.pdf.txt
ComputeStorageEstimate-WaterGC.pdf.txt
Extracted text
text/plain
7444
https://scholarworks.iu.edu/dspace/bitstream/2022/20803/3/ComputeStorageEstimate-WaterGC.pdf.txt
aa068b886dc0f7f16c8474d78afbf436
MD5
3
THUMBNAIL
ComputeStorageEstimate-WaterGC.pdf.jpg
ComputeStorageEstimate-WaterGC.pdf.jpg
IM Thumbnail
image/jpeg
4586
https://scholarworks.iu.edu/dspace/bitstream/2022/20803/4/ComputeStorageEstimate-WaterGC.pdf.jpg
67c02cc30e6ce0fc39abc9c629049cc3
MD5
4
2022/20803
oai:scholarworks.iu.edu:2022/20803
2021-10-18 06:02:17.296
IUScholarWorks
iusw@indiana.edu
oai:scholarworks.iu.edu:2022/20809
2021-10-18T11:23:25Z
com_2022_357
com_2022_356
com_2022_19673
col_2022_12986
Chen, Peng
Evans, Tom
Plale, Beth
2016-04-26T02:59:32Z
2016-04-26T02:59:32Z
http://hdl.handle.net/2022/20809
We conjecture that meaningful analysis of large-scale provenance can be preserved by analyzing provenance data in limited memory while the data is still in motion; that is, the provenance need not be fully resident before analysis can occur. As a proof of concept, this paper defines a stream model for reasoning about provenance data in motion for Big Data provenance. We propose a novel streaming algorithm for the backward provenance query and apply it to the live provenance captured from agent-based simulations. The performance test demonstrates high throughput, low latency, and good scalability in a distributed stream processing framework built on Apache Kafka and Spark Streaming.
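As a rough illustration of what a single-pass backward provenance query over an edge stream can look like, the sketch below keeps a running ancestor set per node as derivation edges arrive. It is not the algorithm proposed in the paper: the function name `backward_provenance`, the `(child, parent)` edge format, and the assumption that all edges into a node arrive before that node is used as a parent are hypothetical simplifications, and the sketch makes no attempt to bound memory or to run on Kafka or Spark Streaming.

```python
from collections import defaultdict


def backward_provenance(edge_stream, targets):
    """Consume (child, parent) derivation edges exactly once.

    Returns, for each requested target, the set of all ancestors observed
    in the stream. Assumes (hypothetically) that edges arrive in dependency
    order, so a parent's ancestor set is complete when it is first reused.
    """
    ancestors = defaultdict(set)
    for child, parent in edge_stream:
        # The direct parent is an ancestor of the child.
        ancestors[child].add(parent)
        # Everything the parent was derived from is also an ancestor.
        ancestors[child] |= ancestors[parent]
    return {t: set(ancestors[t]) for t in targets}


# Toy stream: b derived from a, c from b, d from c and x.
edges = [("b", "a"), ("c", "b"), ("d", "c"), ("d", "x")]
result = backward_provenance(iter(edges), ["d", "b"])
```

The stream is read once and never stored, which is the property the abstract emphasizes; a memory-constrained variant would additionally have to evict or summarize ancestor sets.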
Submitted by Peng Chen (chenpeng@indiana.edu) on 2016-04-26T02:23:32Z
No. of bitstreams: 1
streamProv.pdf: 920742 bytes, checksum: 2b0d1787defce578550516863c338fbf (MD5)
Approved for entry into archive by Beth Plale (plale@indiana.edu) on 2016-04-26T02:59:32Z (GMT) No. of bitstreams: 1
streamProv.pdf: 920742 bytes, checksum: 2b0d1787defce578550516863c338fbf (MD5)
Made available in DSpace on 2016-04-26T02:59:32Z (GMT). No. of bitstreams: 1
streamProv.pdf: 920742 bytes, checksum: 2b0d1787defce578550516863c338fbf (MD5)
the National Science Foundation under award number 1360463
en_US
live data provenance
stream processing
agent-based model
Analysis of Memory Constrained Live Provenance
Preprint
false
ORIGINAL
streamProv.pdf
streamProv.pdf
application/pdf
920742
https://scholarworks.iu.edu/dspace/bitstream/2022/20809/1/streamProv.pdf
2b0d1787defce578550516863c338fbf
MD5
1
LICENSE
license.txt
license.txt
text/plain
2042
https://scholarworks.iu.edu/dspace/bitstream/2022/20809/2/license.txt
ec02ee705cdd83d448aa3c67b4ef102c
MD5
2
TEXT
streamProv.pdf.txt
streamProv.pdf.txt
Extracted text
text/plain
30751
https://scholarworks.iu.edu/dspace/bitstream/2022/20809/3/streamProv.pdf.txt
17807b7a3cab0031a18e8c1caa381df9
MD5
3
THUMBNAIL
streamProv.pdf.jpg
streamProv.pdf.jpg
IM Thumbnail
image/jpeg
3655
https://scholarworks.iu.edu/dspace/bitstream/2022/20809/4/streamProv.pdf.jpg
c43f5ac6a2615f5ea666ff265625462c
MD5
4
2022/20809
oai:scholarworks.iu.edu:2022/20809
2021-10-18 07:23:25.586
IUScholarWorks
iusw@indiana.edu
oai:scholarworks.iu.edu:2022/21024
2021-10-18T10:29:08Z
com_2022_357
com_2022_356
com_2022_19673
col_2022_12986
Chen, Peng
Evans, Tom
Frisby, Michael
Izquierdo, Eduardo
Plale, Beth
2016-10-03T03:16:41Z
2016-10-03T03:16:41Z
2016
http://hdl.handle.net/2022/21024
An Agent Based Model (ABM) is a powerful tool because of its ability to represent heterogeneous agents that, through their interactions, can reveal emergent phenomena. For this to occur, though, the set of agents in an ABM has to accurately model a real-world population to reflect its heterogeneity. But when studying human behavior in less well developed settings, the availability of real population data can be limited, making it impossible to create agents directly from the real population. In this paper, we propose a hybrid method to deal with this data scarcity: we first use the available real population data as the baseline to preserve the true heterogeneity, and fill in the missing characteristics based on survey and remote sensing datasets; then, for the remaining undetermined agent characteristics, we use the Microbial Genetic Algorithm to search for a set of values that optimizes the replicative validity of the model, that is, its match to data observed in the real world. We apply our method to the creation of a synthetic population of household agents for the simulation of agricultural decision-making processes in rural Zambia. The results show that the synthetic population created from the farmer register correctly reflects the marginal distributions and the randomness of the survey data, and minimizes the difference between the distribution of simulated yield and that of the observed yield in the Post Harvest Survey (PHS).
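The search step mentioned in the abstract can be illustrated with a minimal Microbial Genetic Algorithm sketch. This is not the authors' code: the function name `microbial_ga`, the real-valued genome in [0, 1], the parameter values, and the toy fitness function are all assumptions made for illustration. In the paper's setting the fitness would instead score how closely simulated yields match the observed yields.

```python
import random


def microbial_ga(fitness, genome_len, pop_size=20, tournaments=2000,
                 recomb_rate=0.5, mut_rate=0.05, seed=0):
    """Minimal Microbial GA over real-valued genomes in [0, 1].

    Each tournament picks two individuals, leaves the winner untouched,
    and overwrites some of the loser's genes with the winner's
    ("infection") before mutating the loser.
    """
    rng = random.Random(seed)
    pop = [[rng.random() for _ in range(genome_len)] for _ in range(pop_size)]
    for _ in range(tournaments):
        a, b = rng.sample(range(pop_size), 2)
        if fitness(pop[a]) >= fitness(pop[b]):
            winner, loser = a, b
        else:
            winner, loser = b, a
        for g in range(genome_len):
            if rng.random() < recomb_rate:
                # Copy the winner's gene into the loser.
                pop[loser][g] = pop[winner][g]
            if rng.random() < mut_rate:
                # Gaussian mutation, clamped to the valid range.
                mutated = pop[loser][g] + rng.gauss(0.0, 0.1)
                pop[loser][g] = min(1.0, max(0.0, mutated))
    return max(pop, key=fitness)


def toy_fitness(genome):
    """Hypothetical stand-in: negative squared distance to a fixed target."""
    target = [0.2, 0.8, 0.5, 0.1]
    return -sum((g - t) ** 2 for g, t in zip(genome, target))


best = microbial_ga(toy_fitness, genome_len=4)
```

Because only the loser of each tournament is modified, the fittest individual found so far is never overwritten, which gives the method a built-in form of elitism.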
Submitted by Peng Chen (chenpeng@indiana.edu) on 2016-10-02T23:47:39Z
No. of bitstreams: 1
simulation.pdf: 777511 bytes, checksum: 7c6d87bbc11dbd2b88b883e9b85cce2c (MD5)
Approved for entry into archive by Richard Higgins (rshiggin@indiana.edu) on 2016-10-03T03:16:41Z (GMT) No. of bitstreams: 1
simulation.pdf: 777511 bytes, checksum: 7c6d87bbc11dbd2b88b883e9b85cce2c (MD5)
Made available in DSpace on 2016-10-03T03:16:41Z (GMT). No. of bitstreams: 1
simulation.pdf: 777511 bytes, checksum: 7c6d87bbc11dbd2b88b883e9b85cce2c (MD5)
Previous issue date: 2016
The research is supported in part by the National Science Foundation under grants BCS1026776 and SES-1360463, and by the Pervasive Technology Institute at Indiana University.
en_US
A Hybrid Approach to Population Construction For Agricultural Agent-Based Simulation
Preprint
ORIGINAL
simulation.pdf
simulation.pdf
application/pdf
777511
https://scholarworks.iu.edu/dspace/bitstream/2022/21024/1/simulation.pdf
7c6d87bbc11dbd2b88b883e9b85cce2c
MD5
1
LICENSE
license.txt
license.txt
text/plain
2042
https://scholarworks.iu.edu/dspace/bitstream/2022/21024/2/license.txt
ec02ee705cdd83d448aa3c67b4ef102c
MD5
2
TEXT
simulation.pdf.txt
simulation.pdf.txt
Extracted text
text/plain
46148
https://scholarworks.iu.edu/dspace/bitstream/2022/21024/3/simulation.pdf.txt
5ea260d2b59cde561a6d1daf40091e69
MD5
3
THUMBNAIL
simulation.pdf.jpg
simulation.pdf.jpg
IM Thumbnail
image/jpeg
4539
https://scholarworks.iu.edu/dspace/bitstream/2022/21024/4/simulation.pdf.jpg
ab64d3ff407949f0b6faf7a652e3f730
MD5
4
2022/21024
oai:scholarworks.iu.edu:2022/21024
2021-10-18 06:29:08.146
IUScholarWorks
iusw@indiana.edu
oai:scholarworks.iu.edu:2022/22312
2018-08-02T07:19:15Z
com_2022_357
com_2022_356
com_2022_19673
col_2022_12986
Plale, Beth
Kouper, Inna
2018-08-01T19:41:29Z
2018-08-01T19:41:29Z
2017-07-23
http://hdl.handle.net/2022/22312
Hands-on tutorial on using Azure VMs to give data science students practical experience. Students analyze PM 2.4 data in real time. The tutorial is a partnership of the Pacific Rim Applications and Grid Middleware Assembly (PRAGMA) with a team in Taiwan deploying the Airbox sensor network. Presented at the ESIP Summer 2017 meeting in Bloomington, IN.
Submitted by Yu Luo (luoyu@indiana.edu) on 2018-07-31T18:14:47Z
No. of bitstreams: 3
1 SEADTrain ESIP Slide Overview - 20 min.pptx: 1395661 bytes, checksum: f4e5cc9f9246711162f0b84e70db6824 (MD5)
2 SEADTrain ESIP Ingest Demo - 20 min.pptx: 1406270 bytes, checksum: 18452d21c1c6ec5f531d1e23c57bc1ba (MD5)
3 & 4 SEADTrain ESIP Help and Analysis - (20+30) min.pptx: 1608671 bytes, checksum: d333221e0e3d50ec9184546b78401424 (MD5)
Approved for entry into archive by Richard Higgins (rshiggin@indiana.edu) on 2018-08-01T19:41:29Z (GMT) No. of bitstreams: 3
1 SEADTrain ESIP Slide Overview - 20 min.pptx: 1395661 bytes, checksum: f4e5cc9f9246711162f0b84e70db6824 (MD5)
2 SEADTrain ESIP Ingest Demo - 20 min.pptx: 1406270 bytes, checksum: 18452d21c1c6ec5f531d1e23c57bc1ba (MD5)
3 & 4 SEADTrain ESIP Help and Analysis - (20+30) min.pptx: 1608671 bytes, checksum: d333221e0e3d50ec9184546b78401424 (MD5)
Made available in DSpace on 2018-08-01T19:41:29Z (GMT). No. of bitstreams: 3
1 SEADTrain ESIP Slide Overview - 20 min.pptx: 1395661 bytes, checksum: f4e5cc9f9246711162f0b84e70db6824 (MD5)
2 SEADTrain ESIP Ingest Demo - 20 min.pptx: 1406270 bytes, checksum: 18452d21c1c6ec5f531d1e23c57bc1ba (MD5)
3 & 4 SEADTrain ESIP Help and Analysis - (20+30) min.pptx: 1608671 bytes, checksum: d333221e0e3d50ec9184546b78401424 (MD5)
Previous issue date: 2017-07-23
National Science Foundation award #1234983 and Microsoft
en
ESIP Summer 2017; persistent IDs; Azure data analysis; on-line learning
SEADTrain Data Analysis
Presentation
ORIGINAL
1 SEADTrain ESIP Slide Overview - 20 min.pptx
1 SEADTrain ESIP Slide Overview - 20 min.pptx
SEADTrain ESIP Slide Overview
application/vnd.openxmlformats-officedocument.presentationml.presentation
1395661
https://scholarworks.iu.edu/dspace/bitstream/2022/22312/1/1%20SEADTrain%20ESIP%20Slide%20Overview%20-%2020%20min.pptx
f4e5cc9f9246711162f0b84e70db6824
MD5
1
2 SEADTrain ESIP Ingest Demo - 20 min.pptx
2 SEADTrain ESIP Ingest Demo - 20 min.pptx
SEADTrain ESIP Ingest Demo
application/vnd.openxmlformats-officedocument.presentationml.presentation
1406270
https://scholarworks.iu.edu/dspace/bitstream/2022/22312/2/2%20SEADTrain%20ESIP%20Ingest%20Demo%20-%2020%20min.pptx
18452d21c1c6ec5f531d1e23c57bc1ba
MD5
2
3 & 4 SEADTrain ESIP Help and Analysis - (20+30) min.pptx
3 & 4 SEADTrain ESIP Help and Analysis - (20+30) min.pptx
SEADTrain ESIP Help and Analysis
application/vnd.openxmlformats-officedocument.presentationml.presentation
1608671
https://scholarworks.iu.edu/dspace/bitstream/2022/22312/3/3%20%26%204%20SEADTrain%20ESIP%20Help%20and%20Analysis%20-%20%2820%2b30%29%20min.pptx
d333221e0e3d50ec9184546b78401424
MD5
3
LICENSE
license.txt
license.txt
text/plain
2730
https://scholarworks.iu.edu/dspace/bitstream/2022/22312/4/license.txt
8b5355396084efce0c43805344408e89
MD5
4
TEXT
1 SEADTrain ESIP Slide Overview - 20 min.pptx.txt
1 SEADTrain ESIP Slide Overview - 20 min.pptx.txt
Extracted text
text/plain
4359
https://scholarworks.iu.edu/dspace/bitstream/2022/22312/5/1%20SEADTrain%20ESIP%20Slide%20Overview%20-%2020%20min.pptx.txt
6c5358efffb9a85b5ad8370181ad74bf
MD5
5
2 SEADTrain ESIP Ingest Demo - 20 min.pptx.txt
2 SEADTrain ESIP Ingest Demo - 20 min.pptx.txt
Extracted text
text/plain
4100
https://scholarworks.iu.edu/dspace/bitstream/2022/22312/6/2%20SEADTrain%20ESIP%20Ingest%20Demo%20-%2020%20min.pptx.txt
b3369b94fe9a76521c7ff447b1402be1
MD5
6
3 & 4 SEADTrain ESIP Help and Analysis - (20+30) min.pptx.txt
3 & 4 SEADTrain ESIP Help and Analysis - (20+30) min.pptx.txt
Extracted text
text/plain
4601
https://scholarworks.iu.edu/dspace/bitstream/2022/22312/7/3%20%26%204%20SEADTrain%20ESIP%20Help%20and%20Analysis%20-%20%2820%2b30%29%20min.pptx.txt
10a85a831a4431fac51ae633599c4a2b
MD5
7
2022/22312
oai:scholarworks.iu.edu:2022/22312
2018-08-02 03:19:15.642
IUScholarWorks
iusw@indiana.edu
T3BlbiBBY2Nlc3MgUG9saWN5CgpJZiB5b3UgYXJlIHN1Ym1pdHRpbmcgYSBzY2hvbGFybHkgYXJ0aWNsZSBwdXJzdWFudCB0byB0aGUgSW5kaWFuYSBVbml2ZXJzaXR5IEJsb29taW5ndG9uIE9wZW4gQWNjZXNzIFBvbGljeSwgeW91ciBzdWJtaXNzaW9uIGlzIGdvdmVybmVkIGJ5IHRoZSB0ZXJtcyBvZiB0aGUgbm9uLWV4Y2x1c2l2ZSBsaWNlbnNlIGdyYW50ZWQgdW5kZXIgdGhlIE9wZW4gQWNjZXNzIFBvbGljeS4gSWYgeW91IG5lZWQgYXNzaXN0YW5jZSBkZXRlcm1pbmluZyB3aGV0aGVyIHlvdXIgYXJ0aWNsZSBpcyBzdWJqZWN0IHRvIHRoZSBPcGVuIEFjY2VzcyBQb2xpY3ksIHBsZWFzZSBjb250YWN0IHRoZSBTY2hvbGFybHkgQ29tbXVuaWNhdGlvbiBEZXBhcnRtZW50LiBUbyByZWFkIHRoZSBwb2xpY3ksIHZpc2l0OiBodHRwOi8vZ28uaXUuZWR1LzIxdHguIFRvIHJlYWQgdGhlIEZBUSBvbiB0aGUgcG9saWN5LCB2aXNpdDogaHR0cHM6Ly9vcGVuc2Nob2xhcnNoaXAuaW5kaWFuYS5lZHUvcG9saWN5LWZhcS4gCgpBbGwgb3RoZXIgc3VibWlzc2lvbnMgdG8gSVVTY2hvbGFyV29ya3MgYXJlIHN1YmplY3QgdG8gdGhlIHRlcm1zIG9mIHRoZSBub24tZXhjbHVzaXZlIGxpY2Vuc2UgYmVsb3c6CgpJVVNjaG9sYXJXb3JrcyBSZXBvc2l0b3J5IExpY2Vuc2UgCgpCeSBzdWJtaXR0aW5nIHlvdXIgd29yayB0byB0aGUgSVVTY2hvbGFyV29ya3MgcmVwb3NpdG9yeSwgeW91ICh0aGUgY3JlYXRvciBvciBjb3B5cmlnaHQgb3duZXIpIGdyYW50IHRvIEluZGlhbmEgVW5pdmVyc2l0eSBhIG5vbi1leGNsdXNpdmUsIHBlcnBldHVhbCwgaXJyZXZvY2FibGUgcmlnaHQgdG8gcmVwcm9kdWNlLCByZWZvcm1hdCAoYXMgZGVmaW5lZCBiZWxvdyksIGFuZC9vciBkaXN0cmlidXRlIHlvdXIgc3VibWlzc2lvbiAoaW5jbHVkaW5nIHRoZSBhYnN0cmFjdCkgd29ybGR3aWRlIGluIHByaW50IGFuZCBlbGVjdHJvbmljIGZvcm1hdCBhbmQgaW4gYW55IG1lZGl1bSwgaW5jbHVkaW5nIGJ1dCBub3QgbGltaXRlZCB0byBhdWRpbyBvciB2aWRlby4gWW91IGFncmVlIHRoYXQgSW5kaWFuYSBVbml2ZXJzaXR5IG1heSwgd2l0aG91dCBjaGFuZ2luZyB0aGUgY29udGVudCwgcmVmb3JtYXQgdGhlIHN1Ym1pc3Npb24gdG8gYW55IG1lZGl1bSBvciBmb3JtYXQsIG5vdyBrbm93biBvciBsYXRlciBkZXZlbG9wZWQsIGZvciBwcmVzZXJ2YXRpb24gb3IgYWNjZXNzLCBhbmQgcHJvdmlkZSBiYXNpYyBtZXRhZGF0YSB0aGF0IGRlc2NyaWJlcyB0aGUgY29udGVudHMgZm9yIGRpc2NvdmVyeS4gWW91IGFsc28gYWdyZWUgdGhhdCBJbmRpYW5hIFVuaXZlcnNpdHkgbWF5IGtlZXAgbW9yZSB0aGFuIG9uZSBjb3B5IG9mIHRoaXMgc3VibWlzc2lvbiBmb3Igc2VjdXJpdHksIGJhY2stdXAgYW5kIHByZXNlcnZhdGlvbi4gWW91IHJlcHJlc2VudCB0aGF0IHRoZSBzdWJtaXNzaW9uIGlzIHlvdXIgb3JpZ2luYWwgd29yaywgYW5kIHRoYXQgeW91IGhhdmUgdGhlIHJpZ2h0IHRvIGdy
YW50IHRoZSByaWdodHMgY29udGFpbmVkIGluIHRoaXMgbGljZW5zZS4gWW91IGFsc28gcmVwcmVzZW50IHRoYXQgeW91ciBzdWJtaXNzaW9uIGRvZXMgbm90LCB0byB0aGUgYmVzdCBvZiB5b3VyIGtub3dsZWRnZSwgaW5mcmluZ2UgdXBvbiBhbnlvbmUncyBjb3B5cmlnaHQuIElmIHRoZSBzdWJtaXNzaW9uIGNvbnRhaW5zIG1hdGVyaWFsIGZvciB3aGljaCB5b3UgZG8gbm90IGhvbGQgY29weXJpZ2h0LCB5b3UgcmVwcmVzZW50IHRoYXQgeW91IGhhdmUgb2J0YWluZWQgdGhlIHVucmVzdHJpY3RlZCBwZXJtaXNzaW9uIG9mIHRoZSBjb3B5cmlnaHQgb3duZXIgdG8gZ3JhbnQgSW5kaWFuYSBVbml2ZXJzaXR5IHRoZSByaWdodHMgcmVxdWlyZWQgYnkgdGhpcyBsaWNlbnNlLCBhbmQgdGhhdCBzdWNoIHRoaXJkLXBhcnR5IG93bmVkIG1hdGVyaWFsIGlzIGNsZWFybHkgaWRlbnRpZmllZCBhbmQgYWNrbm93bGVkZ2VkIHdpdGhpbiB0aGUgdGV4dCBvciBjb250ZW50IG9mIHRoZSBzdWJtaXNzaW9uLiBJZiB0aGUgc3VibWlzc2lvbiBpcyBiYXNlZCB1cG9uIHdvcmsgdGhhdCBoYXMgYmVlbiBzcG9uc29yZWQgb3Igc3VwcG9ydGVkIGJ5IGFuIGFnZW5jeSBvciBvcmdhbml6YXRpb24gb3RoZXIgdGhhbiBJbmRpYW5hIFVuaXZlcnNpdHksIHlvdSByZXByZXNlbnQgdGhhdCB5b3UgaGF2ZSBmdWxmaWxsZWQgYW55IHJpZ2h0IG9mIHJldmlldyBvciBvdGhlciBvYmxpZ2F0aW9ucyByZXF1aXJlZCBieSBzdWNoIGNvbnRyYWN0IG9yIGFncmVlbWVudC4gSW5kaWFuYSBVbml2ZXJzaXR5IHdpbGwgY2xlYXJseSBpZGVudGlmeSB5b3VyIG5hbWUgYXMgdGhlIGNyZWF0b3IgYW5kL29yIGNvcHlyaWdodCBvd25lciBvZiB0aGUgc3VibWlzc2lvbiwgYW5kIHdpbGwgbm90IG1ha2UgYW55IGFsdGVyYXRpb25zLCBvdGhlciB0aGFuIGFzIGFsbG93ZWQgYnkgdGhpcyBsaWNlbnNlLCB0byB5b3VyIHN1Ym1pc3Npb24uIFdlIGFncmVlIHRvIG5vdCBtYWtlIGF2YWlsYWJsZSBhbnkgZmlsZXMgdGhhdCBhcmUgZW1iYXJnb2VkIHVudGlsIHRoZSBlbWJhcmdvIGhhcyBleHBpcmVkLiBJZiB5b3UgYXJlIHN1Ym1pdHRpbmcgdGhpcyBpdGVtIG9uIGJlaGFsZiBvZiB0aGUgcmlnaHRzIGhvbGRlciwgeW91IG11c3QgaGF2ZSB0aGUgcmlnaHRzIG93bmVyJ3Mgd3JpdHRlbiBwZXJtaXNzaW9uIHRvIGFjY2VwdCB0aGlzIGxpY2Vuc2Ugb24gaGlzL2hlciBiZWhhbGYuIAoKcmV2LiAxMS8zLzIwMTcK
oai:scholarworks.iu.edu:2022/22313
2021-10-18T14:43:56Z
com_2022_357
com_2022_356
com_2022_19673
col_2022_12986
Luo, Yu
Ratharanjan, Kunalan
Zhou, Quan
Plale, Beth
2018-08-01T19:41:57Z
2018-08-01T19:41:57Z
2018-05-09
http://hdl.handle.net/2022/22313
A poster presenting investigations of the PRAGMA Airbox sensor data and PRAGMA Rice Genomics projects. The poster sections demonstrate the PID assignment strategy for streaming data under the Airbox use case, and the viability of provenance as part of the PID KI record under the Rice Genomics use case.
Submitted by Yu Luo (luoyu@indiana.edu) on 2018-07-31T18:35:44Z
No. of bitstreams: 1
Pragma 34 Poster (D2I center).pdf: 1066161 bytes, checksum: b440daf7026fa358b22154207fcedcb0 (MD5)
Approved for entry into archive by Richard Higgins (rshiggin@indiana.edu) on 2018-08-01T19:41:57Z (GMT) No. of bitstreams: 1
Pragma 34 Poster (D2I center).pdf: 1066161 bytes, checksum: b440daf7026fa358b22154207fcedcb0 (MD5)
Made available in DSpace on 2018-08-01T19:41:57Z (GMT). No. of bitstreams: 1
Pragma 34 Poster (D2I center).pdf: 1066161 bytes, checksum: b440daf7026fa358b22154207fcedcb0 (MD5)
Previous issue date: 2018-05-09
Funded in part by the National Science Foundation under grants #1550126, #1234983, #1659310 and by a grant from Research Data Alliance/US.
en
PRAGMA 34; persistent IDs; Airbox Sensor data; International Rice Research Institute
Persistent IDs: Application to Workflow and Sensor Applications
Presentation
ORIGINAL
Pragma 34 Poster (D2I center).pdf
Pragma 34 Poster (D2I center).pdf
Poster for PRAGMA 34
application/pdf
1066161
https://scholarworks.iu.edu/dspace/bitstream/2022/22313/1/Pragma%2034%20Poster%20%28D2I%20center%29.pdf
b440daf7026fa358b22154207fcedcb0
MD5
1
LICENSE
license.txt
license.txt
text/plain
2730
https://scholarworks.iu.edu/dspace/bitstream/2022/22313/2/license.txt
8b5355396084efce0c43805344408e89
MD5
2
TEXT
Pragma 34 Poster (D2I center).pdf.txt
Pragma 34 Poster (D2I center).pdf.txt
Extracted text
text/plain
4619
https://scholarworks.iu.edu/dspace/bitstream/2022/22313/3/Pragma%2034%20Poster%20%28D2I%20center%29.pdf.txt
64f9d07b916b44ffc03a1201e6cd4818
MD5
3
THUMBNAIL
Pragma 34 Poster (D2I center).pdf.jpg
Pragma 34 Poster (D2I center).pdf.jpg
IM Thumbnail
image/jpeg
5850
https://scholarworks.iu.edu/dspace/bitstream/2022/22313/4/Pragma%2034%20Poster%20%28D2I%20center%29.pdf.jpg
4d3dee511fe0704eba5eff39532f7e6a
MD5
4
2022/22313
oai:scholarworks.iu.edu:2022/22313
2021-10-18 10:43:56.163
IUScholarWorks
iusw@indiana.edu
oai:scholarworks.iu.edu:2022/27120
2022-02-05T08:01:27Z
com_2022_357
com_2022_356
com_2022_19673
col_2022_12986
Plale, Beth
2013-2014
2022-02-04T20:07:37Z
2022-02-04T20:07:37Z
https://hdl.handle.net/2022/27120
In mid-2014, early in the life of the international Research Data Alliance (RDA), a consortium of volunteers, initial consensus products that promote data sharing are beginning to emerge. The RDA community is grappling with adoption: specifically, what is RDA’s role in advancing the adoption of the products emerging from its working groups?
This whitepaper posits that RDA has an active role to play in promoting the adoption of its products (“RDA Recommendations”). This role includes reaching potential adopters in the early stages of the technology adoption process. This whitepaper provides a contextual framework for adoption, products, and adopters. It then examines current RDA activities (circa 2014) and highlights potential gaps.
Submitted by Beth Plale (plale@indiana.edu) on 2022-02-03T15:21:34Z
No. of bitstreams: 1
RDA Adoption Plale v2 2014.pdf: 329046 bytes, checksum: 57500f2476964cf4038afe7a29530589 (MD5)
Approved for entry into archive by Department IUScholarWorks (iusw@indiana.edu) on 2022-02-04T20:07:37Z (GMT) No. of bitstreams: 1
RDA Adoption Plale v2 2014.pdf: 329046 bytes, checksum: 57500f2476964cf4038afe7a29530589 (MD5)
Made available in DSpace on 2022-02-04T20:07:37Z (GMT). No. of bitstreams: 1
RDA Adoption Plale v2 2014.pdf: 329046 bytes, checksum: 57500f2476964cf4038afe7a29530589 (MD5)
National Science Foundation award 1349002
Whitepaper is observational with recommendations
en
CC0 1.0 Universal - Public Domain
https://creativecommons.org/publicdomain/zero/1.0/
Research Data Alliance, adoption curve, data sharing, open science
A Role for the Research Data Alliance (RDA) in Adoption of RDA Products: a Whitepaper
Working Paper
ORIGINAL
RDA Adoption Plale v2 2014.pdf
RDA Adoption Plale v2 2014.pdf
main article
application/pdf
329046
https://scholarworks.iu.edu/dspace/bitstream/2022/27120/1/RDA%20Adoption%20Plale%20v2%202014.pdf
57500f2476964cf4038afe7a29530589
MD5
1
LICENSE
license.txt
license.txt
text/plain
34
https://scholarworks.iu.edu/dspace/bitstream/2022/27120/2/license.txt
327ffa635a664c7e1b0634c333adc7e8
MD5
2
TEXT
RDA Adoption Plale v2 2014.pdf.txt
RDA Adoption Plale v2 2014.pdf.txt
Extracted text
text/plain
10039
https://scholarworks.iu.edu/dspace/bitstream/2022/27120/3/RDA%20Adoption%20Plale%20v2%202014.pdf.txt
135b1e28e8feb736380b0976954dde9a
MD5
3
THUMBNAIL
RDA Adoption Plale v2 2014.pdf.jpg
RDA Adoption Plale v2 2014.pdf.jpg
IM Thumbnail
image/jpeg
4749
https://scholarworks.iu.edu/dspace/bitstream/2022/27120/4/RDA%20Adoption%20Plale%20v2%202014.pdf.jpg
46c4b917fc26afc9e842294a8413e461
MD5
4
2022/27120
oai:scholarworks.iu.edu:2022/27120
2022-02-05 03:01:27.055
IUScholarWorks
iusw@indiana.edu
IUScholarWorks Repository License
oai:scholarworks.iu.edu:2022/27837
2022-07-14T14:12:00Z
com_2022_357
com_2022_356
com_2022_19673
col_2022_12986
Plale, Beth
2022-07-14T12:03:06Z
2022-07-14T12:03:06Z
https://hdl.handle.net/2022/27837
For a network of FAIR digital objects (a “data space”) to be fully realized at a global scale, its architecture must present low barriers to entry for newcomer data providers. Barriers to entry measure the up-front resource demands (costs) required to enter a line of business or participate in a multi-organizational endeavor. The biodiversity community’s notion of the Extended Specimen is a good match for a FAIR Digital Objects (FDO) data space. The Extended Specimen interconnects a physical specimen with all manner of derived and related data, reflecting new sources of data and information about collected specimens. We look at two possible manifestations of a FAIR digital object data space for the global biodiversity community: the DiSSCo project in Europe and an early evaluation being undertaken in the US.
Applying the lens of barriers to entry in this context strongly suggests that the FAIR Digital Object data space should adopt a policy of flexibility in the requirements it imposes on newcomers.
Submitted by Beth Plale (plale@indiana.edu) on 2022-07-13T22:03:39Z
No. of bitstreams: 1
22 FDO Global network of FAIR data Plale v4.pdf: 180757 bytes, checksum: 9ddb5153ae6a526d05ea37b4c44ae1ed (MD5)
Approved for entry into archive by Beth Plale (plale@indiana.edu) on 2022-07-14T12:03:06Z (GMT) No. of bitstreams: 1
22 FDO Global network of FAIR data Plale v4.pdf: 180757 bytes, checksum: 9ddb5153ae6a526d05ea37b4c44ae1ed (MD5)
Made available in DSpace on 2022-07-14T12:03:06Z (GMT). No. of bitstreams: 1
22 FDO Global network of FAIR data Plale v4.pdf: 180757 bytes, checksum: 9ddb5153ae6a526d05ea37b4c44ae1ed (MD5)
en
CC BY-NC 4.0
https://creativecommons.org/licenses/by-nc/4.0/
open science
research data infrastructure
FAIR data
barriers to entry
Achieving low barriers to entry in the FAIR Digital Objects (FDO) data space: a Use Case in Biodiversity Extended Specimen Networks
Working Paper
ORIGINAL
22 FDO Global network of FAIR data Plale v4.pdf
22 FDO Global network of FAIR data Plale v4.pdf
application/pdf
180757
https://scholarworks.iu.edu/dspace/bitstream/2022/27837/1/22%20FDO%20Global%20network%20of%20FAIR%20data%20Plale%20v4.pdf
9ddb5153ae6a526d05ea37b4c44ae1ed
MD5
1
LICENSE
license.txt
license.txt
text/plain
34
https://scholarworks.iu.edu/dspace/bitstream/2022/27837/2/license.txt
327ffa635a664c7e1b0634c333adc7e8
MD5
2
TEXT
22 FDO Global network of FAIR data Plale v4.pdf.txt
22 FDO Global network of FAIR data Plale v4.pdf.txt
Extracted text
text/plain
12580
https://scholarworks.iu.edu/dspace/bitstream/2022/27837/3/22%20FDO%20Global%20network%20of%20FAIR%20data%20Plale%20v4.pdf.txt
da849ba17019602d5c63f1b778b8eee9
MD5
3
THUMBNAIL
22 FDO Global network of FAIR data Plale v4.pdf.jpg
22 FDO Global network of FAIR data Plale v4.pdf.jpg
IM Thumbnail
image/jpeg
4768
https://scholarworks.iu.edu/dspace/bitstream/2022/27837/4/22%20FDO%20Global%20network%20of%20FAIR%20data%20Plale%20v4.pdf.jpg
77bdd9fccf8aa0acc011f2e199a6ac04
MD5
4
2022/27837
oai:scholarworks.iu.edu:2022/27837
2022-07-14 10:12:00.47
IUScholarWorks
iusw@indiana.edu
IUScholarWorks Repository License