Indiana University

 

Home

Research
Journals
Data
Storage & Preservation Services
How to Prepare Your Data
Data Content Guidelines
Policies
Workshops and Consultations

Contact Us

About IUScholarWorks

Scholarly Communications

data
Archive and increase availability of your research data

Data Content Guidelines

IUScholarWorks General Data Content Guidelines

Note: These guidelines form an initial framework that is intended to aid the IUScholarWorks team in judging whether data may be accepted for deposit into the IUScholarWorks Data Service. We envision that this rubric will eventually be much broader in scope in the years to come, with a more robust definition of what defines "data".

  • Data accepted for deposit is generally defined as the facts and statistics produced for reference or analysis in the course of scientific research undertaken by IU faculty or researchers. (E.g. "numeric data sets": research results, surveys, instrument outputs, databases, and measurements.) Licensed data (such as that originating from the ICPSR, Bureau of Labor Statistics, etc.) is not accepted.
  • The data must be produced, submitted, or sponsored by IU faculty or researchers. Student-created data is accepted with faculty sponsorship of research.
  • The data should be complete and ready for distribution.
    • Classified or confidential data requiring formal, contractual, or legal restrictions to access, such as HIPAA-designated Protected Health Information, will not be included in IUScholarWorks, but may be archived on the Scholarly Data Archive (SDA).
  • We accept all file formats, although depending on the format of the file there may be limitations on the IU Libraries' ability to preserve it and for the ability of others to use it. We encourage standards and formats that are open (e.g., .txt), commonly used (e.g., .xls), or community-recognized (e.g., .mat) be used for all files deposited.
  • The dataset may contain multiple files. Individual data files (including zipped/tarred files) uploaded directly to IUScholarWorks or the SDA may not exceed 150MB or 50TB, respectively.

IUScholarWorks Repository Data Content Guidelines

  • Must meet the General guidelines, above.
  • The author/owner must be willing and able to grant IU the right to preserve and distribute the data (IUSW Deposit License), in accordance with user-defined embargo restrictions.
  • Each dataset submitted to IUScholarWorks repository should contain a 'readme' file listing:
    • The contents of the submitted dataset (i.e., file names, formats and sizes of each file).
    • A list of software used to produce, render and compress the data (where applicable).
    • Where in the research process the data falls (e.g. raw/unanalyzed data, processed/analyzed data, rendered/visualized data).
  • Submitters are responsible for adequate metadata (descriptive fields) of the contributed dataset for the purposes of discovery and other informational purposes.
  • It is recommended that datasets be described sufficiently, using the IUScholarWorks platform, so that other knowledgeable researchers can find, understand and use the data.

IUScholarWorks SDA Data Content Guidelines*

  • Must meet the General guidelines, above.
  • Each dataset submitted to IUScholarWorks repository should contain a 'readme' file listing:
    • The contents of the submitted dataset (i.e., file names, formats and sizes of each file).
    • A list of software used to produce, render and compress the data (where applicable).
    • Where in the research process the data falls (e.g. raw/unanalyzed data, processed/analyzed data, rendered/visualized data).
  • The filename should be human-readable and adequately descriptive of the content.

* This set of guidelines governs only "big data" submitted to IUScholarWorks, not to data stored in researcher-owned SDA accounts.