Karen Cranston, Hilmar Lapp
sharing, publishing, archiving
For the remaining slides we are going to assume that we are at the point of publication.
Piwowar & Vision (2013) “Data reuse and the open data citation advantage.” PeerJ, e175
Figure 1: Citation density for papers with and without publicly available microarray data, by year of study publication.
Wicherts et al (2011) “Willingness to Share Research Data Is Related to the Strength of the Evidence and the Quality of Reporting of Statistical Results.” PLoS ONE 6(11): e26828
Figure 1. Distribution of reporting errors per paper for papers from which data were shared and from which no data were shared.
Catalog the artifacts you produced this morning.
You will likely have different artifacts:
READMEthat describes the data or software package
CONTRIBUTINGfile for new issues and pull requests
Documenting your research:
Put a license or waiver on it
Copyright applies to creative works
Typically not copyrightable:
Depends on jurisdiction and case:
Morin, Andrew, Jennifer Urban, and Piotr Sliz. 2012. “A Quick Guide to Software Licensing for the Scientist-Programmer.” PLoS Computational Biology 8 (7): e1002598.
The Open Definition sets out principles that define “openness” in relation to data and content. It makes precise the meaning of “open” in the terms open data, open content, and open source:
“Open means anyone can freely access, use, modify, and share for any purpose (subject, at most, to requirements that preserve provenance and openness).”
or more succinctly:
“Open data and content can be freely used, modified, and shared by anyone for any purpose”
CC0 enables scientists, educators, artists and other creators and owners of copyright- or database-protected content to waive those interests in their works and thereby place them as completely as possible in the public domain, so that others may freely build upon, enhance and reuse the works for any purposes without restriction under copyright or database law.
Dryad’s use of CC0 to make the terms of reuse explicit has some important advantages:
- Universality: CC0 is a single mechanism that is both global and universal, covering all data and all countries. It is also widely recognized.
- Simplicity: There is no need for humans to make, or respond to, individual data requests, and no need for click-through agreements. This allows more scientists to spend their time doing science.
From the Panton Principles:
[…] in the scholarly research community the act of citation is a commonly held community norm when reusing another community member’s work.
Community norms can be a much more effective way of encouraging positive behaviour, such as citation, than applying licenses. A well functioning community supports its members in their application of norms, whereas licences can only be enforced through court action and thus invite people to ignore them when they are confident that this is unlikely.
Let scientists do science without having to talk to lawyers.