HomePublish with usData Availability Guide

Data Availability Guide

Data Availability Guide

Download as PDF

1. What should be shared

  • Data: raw data or processed datasets used for analysis (as permitted by ethics and law).

  • Code: scripts for cleaning, analysis, and figure generation (e.g., Python/R/Stata).

  • Materials: survey instruments, experimental protocols, interview guides, codebooks, and variable definitions.

  • Documentation: README describing how to reproduce results, and a data dictionary/codebook where relevant.

3. When data cannot be shared openly

Sometimes data cannot be made public due to privacy, legal, or contractual restrictions. In these cases, authors should:

  • Explain the restriction clearly (e.g., personal data, confidentiality agreements).

  • Share what is possible: anonymized data, synthetic data, code, and documentation.

  • Provide a controlled access pathway if feasible (data use agreement, secure access, or on-request access).

4. Data Availability Statement (required)

Include a Data Availability Statement in every manuscript. Choose the option below that best applies and adapt it.

5.1. Option A — Open data

Example statement:

“The data and code supporting the findings of this study are available in Zenodo at https://doi.org/10.5281/zenodo.1234567.”

5.2. Option B — Open code, restricted data

Example statement:

“Analysis code and documentation are available at https://doi.org/10.5281/zenodo.1234567. The underlying dataset contains sensitive personal information and cannot be shared publicly. Access may be granted to qualified researchers subject to ethics approval and a data use agreement; requests should be sent to [email].”

5.3. Option C — Available on request

Example statement:

“Data are available from the corresponding author upon reasonable request.”

Note: Avoid this option if a public repository is feasible.

5.5. Option E — Not applicable

Example statement:

“No datasets were generated or analyzed during the current study.”

6. How to cite data (APA 7 examples)

Cite datasets in your reference list when they are used as research outputs or evidence.

7.1. Dataset with DOI

Format:

Author/Organization. (Year). Title of dataset (Version) [Data set]. Repository. https://doi.org/xxxxx

Example:

Institute for Social Research. (2024). Survey dataset on labor market outcomes (Version 2.1) [Data set]. Zenodo. https://doi.org/10.5281/zenodo.1234567

7.2. Code repository (software) citation

Example:

Doe, J. (2025). Replication code for: Market efficiency under structural breaks [Computer software]. GitHub. https://github.com/example/repo