The NIH Open Citation Collection: A public access, broad coverage resource

by B. Ian Hutchins, Kirk L. Baker, Matthew T. Davis, Mario A. Diwersy, Ehsanul Haque, Robert M. Harriman, Travis A. Hoppe, Stephen A. Leicht, Payam Meyer, George M. Santangelo

Citation data have remained hidden behind proprietary, restrictive licensing agreements, which raises barriers to entry for analysts wishing to use the data, increases the expense of performing large-scale analyses, and reduces the robustness and reproducibility of the conclusions. For the past several years, the National Institutes of Health (NIH) Office of Portfolio Analysis (OPA) has been aggregating and enhancing citation data that can be shared publicly. Here, we describe the NIH Open Citation Collection (NIH-OCC), a public access database for biomedical research that is made freely available to the community. This dataset, which has been carefully generated from unrestricted data sources such as MedLine, PubMed Central (PMC), and CrossRef, now underlies the citation statistics delivered in the NIH iCite analytic platform. We have also included data from a machine learning pipeline that identifies, extracts, resolves, and disambiguates references from full-text articles available on the internet. Open citation links are available to the public in a major update of iCite (

Source link

Related posts

Ipr1 Regulation by Cyclic GMP-AMP Synthase/Interferon Regulatory Factor 3 and Modulation of Irgm1 Expression via p53 [Research Article]


Transverse anisotropy in the deformation of the muscle during dynamic contractions [RESEARCH ARTICLE]


PROTEIN PHOSHATASE 2A B{alpha} and {beta} Maintain Centromeric Sister Chromatid Cohesion during Meiosis in Arabidopsis


This website uses cookies to improve your experience. We'll assume you're ok with this, but you can opt-out if you wish. Accept Read More

Privacy & Cookies Policy