Open Access
OUT OF CITE, OUT OF MIND: THE CURRENT STATE OF PRACTICE, POLICY, AND TECHNOLOGY FOR THE CITATION OF DATA CODATA-ICSTI Task Group on Data Citation Standards and Practices
TLDR
The CODATA-ICSTI Task Group as mentioned in this paper examines a number of key issues related to data identification, attribution, citation, and linking, as well as other functions such as attribution of credit and establishing provenance.Abstract:
PREFACE The growth in the capacity of the research community to collect and distribute data presents huge opportunities. It is already transforming old methods of scientific research and permitting the creation of new ones. However, the exploitation of these opportunities depends upon more than computing power, storage, and network connectivity. Among the promises of our growing universe of online digital data are the ability to integrate data into new forms of scholarly publishing to allow peer-examination and review of conclusions or analysis of experimental and observational data and the ability for subsequent researchers to make new analyses of the same data, including their combination with other data sets and uses that may have been unanticipated by the original producer or collector. The use of published digital data, like the use of digitally published literature, depends upon the ability to identify, authenticate, locate, access, and interpret them. Data citations provide necessary support for these functions, as well as other functions such as attribution of credit and establishment of provenance. References to data, however, present challenges not encountered in references to literature. For example, how can one specify a particular subset of data in the absence of familiar conventions such as page numbers or chapters? The traditions and good practices for maintaining the scholarly record by proper references to a work are well established and understood in regard to journal articles and other literature, but attributing credit by bibliographic references to data are not yet so broadly implemented. Recognizing the needs for better data referencing and citation practices and investing effort to address those needs has come at different rates in different fields and disciplines. As competing conventions and practices emerge in separate communities, inconsistencies and incompatibilities can interfere with promoting the sharing and use of research data. In order to reconcile this problem, sharing experiences across communities may be necessary, or at least helpful, to achieving the full potential of published data. Practical and consistent data citation standards and practices are thus important for providing the incentives, recognition, and rewards that foster scientific progress. New requirements from funding agencies to develop data management plans emphasize the need to develop standards and data citation practices. Together with representatives from several other organizations, the CODATA-ICSTI Task Group examines a number of key issues related to data identification, attribution, citation, and linking. Additionally, the Task Group helps coordinate international activities in this area and …read more
Citations
More filters
Mapping the backbone of science.
TL;DR: In this article, the authors presented a new map representing the structure of all of science, based on journal articles, including both the natural and social sciences, which provides a bird's eye view of today's scientific landscape.
Posted Content
Scientific Data Management in the Coming Decade
Jim Gray,David T. Liu,Maria Nieto-Santisteban,Alexander S. Szalay,David J. DeWitt,Gerd Heber +5 more
TL;DR: Analyzing this data to find the subtle effects missed by previous studies requires algorithms that can simultaneously deal with huge datasets and that can find very subtle effects --- finding both needles in the haystack and finding very small haystacks that were undetected in previous measurements.
Journal ArticleDOI
Toward the Geoscience Paper of the Future: Best practices for documenting and sharing research from data to software to provenance
Yolanda Gil,Cédric H. David,Ibrahim Demir,Bakinam T. Essawy,Robinson W. Fulweiler,Jonathan L. Goodall,Leif Karlstrom,Huikyo Lee,Heath J. Mills,Ji-Hyun Oh,Ji-Hyun Oh,Suzanne A. Pierce,Allen Pope,Allen Pope,Mimi W. Tzeng,Sandra R. Villamizar,Xuan Yu +16 more
TL;DR: The Geoscience Paper of the Future (GPF) as discussed by the authors is an approach to fully document, share, and cite all their research products including data, software, and computational provenance.
Journal ArticleDOI
Software in the scientific literature: Problems with seeing, finding, and using software mentioned in the biology literature
James Howison,Julia Bullard +1 more
TL;DR: A coding scheme is developed to identify software “mentions” and classify them according to their characteristics and ability to realize the functions of citations, providing recommendations to improve the practice of software citation.
References
More filters
Journal ArticleDOI
The file drawer problem and tolerance for null results
TL;DR: Quantitative procedures for computing the tolerance for filed and future null results are reported and illustrated, and the implications are discussed.
Why Most Published Research Findings Are False
TL;DR: In this paper, the authors discuss the implications of these problems for the conduct and interpretation of research and suggest that claimed research findings may often be simply accurate measures of the prevailing bias.
Journal ArticleDOI
Why Most Published Research Findings Are False
TL;DR: In this paper, the authors discuss the implications of these problems for the conduct and interpretation of research and conclude that the probability that a research claim is true may depend on study power and bias, the number of other studies on the same question, and the ratio of true to no relationships among the relationships probed in each scientifi c fi eld.
Journal ArticleDOI
Computational Social Science
David Lazer,Alex Pentland,Lada A. Adamic,Sinan Aral,Sinan Aral,Albert-László Barabási,Devon Brewer,Nicholas A. Christakis,Noshir Contractor,James H. Fowler,Myron P. Gutmann,Tony Jebara,Gary King,Michael W. Macy,Deb Roy,Marshall Van Alstyne,Marshall Van Alstyne +16 more
TL;DR: In this article, a field is emerging that leverages the capacity to collect and analyze data at a scale that may reveal patterns of individual and group behaviors at a large scale, such as behavior patterns.
Book
The Fourth Paradigm: Data-Intensive Scientific Discovery
TL;DR: This presentation will set out the eScience agenda by explaining the current scientific data deluge and the case for a “Fourth Paradigm” for scientific exploration.
Related Papers (5)
Achieving human and machine accessibility of cited data in scholarly publications
Joan Starr,Eleni Castro,Mercè Crosas,Michel Dumontier,Robert R. Downs,Ruth Duerr,Laurel L Haak,Melissa A. Haendel,Ivan Herman,Simon Hodson,Joe Hourclé,John Ernest Kratz,Jennifer Lin,Lars Holm Nielsen,Amy Nurnberger,Stefan Proell,Andreas Rauber,Simone Sacchi,Arthur P. Smith,Mike Taylor,Timothy Clark +20 more