Dryad Grows into a General Repository
We are excited to see Dryad’s role in the preservation of data expand into new areas and fields in 2016. Researchers submitted more data involving human subjects and data from social media. In addition, a quick look at our most popular data shows that two of the top five downloaded packages were from the fields of cardiology and science journalism. While Dryad’s origins are in the life sciences, it is increasingly being used as a general repository for data from a myriad of fields.
Let’s take a look at the numbers for 2016:
Increase in Number of Data Packages and Data Files
Our curators were busy! The total number of published data packages (sets of data files associated with a publication) at the end of the year was a whopping 15,325. Our curators meticulously archived 4,307 packages, a 10% increase from 2015. The size of data packages also continued to grow – from an average of 481MB to an average of 573MB, an increase of about 20%.
At the end of 2016, we were closing in on 50,000 archived data files; by January of this year, we passed that mark.
In a future blog, we’ll talk about the integration of new journals into the Dryad submission process, new members, and new partnerships. For now, we’ll just note that there was a 22% increase in the number of journals that have data in Dryad linking back to the article.
We’ve seen a significant uptick in human subjects data and social media data this year, which has prompted us to develop an FAQ on cleaning and de-identification of human subjects data for public access. As the idea of what data should be preserved continues to broaden, submissions of these kinds of data will only increase. We’ll keep you updated about this trend in future blogs.
Let’s take a look at the most popular data published in 2016, in terms of downloads. Among the top 5 downloads includes data on plant genetics, the early history of ray-finned fishes, and, not surprisingly in this age, the effects of climate change on boreal forests.
Also of interest are data from an article in Science evaluating how people make use of Sci-Hub, an open source scholarly library. Our guest blog on these data by science journalist John Bohannon generated a lot of interest this year and was one of our most popular blog posts ever.
Another significant development in 2016 came from the medical sciences. A comparison of coronary diagnostic techniques marked Dryad’s first submission from one of the top five cardiology journals, JACC: Cardiovascular Interventions.
The fact that 2 of the 5 top downloads come from fields outside of life sciences clearly indicates that data in Dryad now cover a broad range of fields.
Top 5 Downloads of Data Archived in 2016
|Article||Dryad DOI||Number of Downloads|
|Wagner MR et al. (2016) Host genotype and age shape the leaf and root microbiomes of a wild perennial plant. Nature Communications 7: 12151.||http://doi.org/10.5061/dryad.g60r3||3123|
|Bohannon J et al. (2016) Who’s downloading pirated papers? Everyone. Science 352(6285): 508-512.||http://doi.org/10.5061/dryad.q447c||2969|
|D’Orangeville L et al. (2016) Northeastern North America as a potential refugium for boreal forests in a warming climate. Science 352(6292): 1452-1455.||http://doi.org/10.5061/dryad.785cv||741|
|Johnson NP et al. (2016) Continuum of vasodilator stress from rest to contrast medium to adenosine hyperemia for fractional flow reserve assessment. JACC. Cardiovascular Interventions 9(8): 757-767.||http://doi.org/10.5061/dryad.f76nv||453|
|Lu J et al. (2016) The oldest actinopterygian highlights the cryptic early history of the hyperdiverse ray-finned fishes. Current Biology 26(12): 1602–1608.||http://doi.org/10.5061/dryad.t6j72||423|
Overall, we’ve had a great year and are delighted to be seeing a broader range of data from an increasing number of journals and fields. Thanks to our Board of Directors, members, and of course our staff for providing their support to make 2016 a notable year for Dryad!