
WiLDSI Data Portal

This portal was developed as part of the WiLDSI project with input from the DSI Scientific Network to enable the exploration and quantification of the use and provision of nucleotide sequence data (NSD) / Digital Sequence Information (DSI) in the scientific literature. The underlying dataset is the result of an ETL pipeline that extracts sequence records from the European Nucleotide Archive and links them to citations in open-access publications aggregated in Europe PubMed Central. The dataset is updated regularly using automatic methods.
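
As a rough illustration of the linking idea described above, the following sketch joins sequence records to the publications that cite their accession numbers. It is not the WiLDSI pipeline itself: the field names (`accession`, `country`, `pmcid`, `accessions`), the function `link_records`, and all sample values are hypothetical placeholders chosen for this example.

```python
from collections import defaultdict

# Hypothetical, made-up sample data. Real ENA records and Europe PMC
# citations carry many more fields, and these identifiers are placeholders.
sequence_records = [
    {"accession": "XX000001", "country": "Japan"},
    {"accession": "XX000002", "country": "Germany"},
]
citations = [
    {"pmcid": "PMC-A", "accessions": ["XX000001", "XX000002"]},
    {"pmcid": "PMC-B", "accessions": ["XX000001", "ZZ999999"]},
]


def link_records(records, cited):
    """Map each known accession to the sorted ids of publications citing it."""
    known = {record["accession"] for record in records}
    links = defaultdict(set)
    for citation in cited:
        for accession in citation["accessions"]:
            # Accessions without a matching sequence record are skipped.
            if accession in known:
                links[accession].add(citation["pmcid"])
    return {accession: sorted(ids) for accession, ids in links.items()}


links = link_records(sequence_records, citations)
for accession, pmcids in sorted(links.items()):
    print(accession, "cited in", ", ".join(pmcids))
```

In this sketch the unmatched accession `ZZ999999` is silently dropped; a real pipeline would likely need to log or otherwise handle such cases.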

A weekly database dump is available for download:

This web application enables the discovery of data appropriate for bio-geographical studies, the exploration of collaborative networks, and the profiling of the flow of access and benefit relating to sequence data, for example:

A data note and a research article published in tandem at GigaScience provide more detailed information on our methods for extracting and linking nucleotide sequence data with associated publications, as well as interpretation and potential implications of these results:

A persistent copy of the dataset version used in these papers is published under the DOI: 10.5447/ipk/2021/8.

What is new?

  • Interactive filtering: static charts with preset filter conditions were removed and replaced by generic ones that support individual visual analysis of DSI usage patterns. Supported filter criteria are:
    • Literature type: primary vs. secondary literature
    • Time window: DSI submission date and paper publication date
    • Taxonomic class: ENA taxonomic division
    • Annex 1: list of crops covered under the multilateral system of access and benefit sharing
    • Author role: position in the author list, which may indicate a first or senior author role
  • Weekly data update from INSDC and Europe PubMed Central
  • Extended collaboration groups: further groups of countries were added to map economic, regional, and megabiodiversity groups at a finer grain
  • Data drill-down for each chart: a raw data table is available below each chart
  • Additional raw data tables: all data tables used for chart computation are available as interactive reports
  • Tutorial video: introduces the UI concepts and features
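
The filter criteria listed under interactive filtering can be read as optional conditions that are combined with a logical AND. The sketch below shows that idea on a few made-up records. It is only an illustration, not the portal's implementation: the `LinkedRecord` fields, the `filter_records` function, and the sample values (including the division codes) are assumptions introduced for this example.

```python
from dataclasses import dataclass
from datetime import date
from typing import Iterable, List, Optional, Tuple


@dataclass(frozen=True)
class LinkedRecord:
    """Hypothetical record linking one sequence entry to one publication."""
    accession: str
    literature_type: str      # "primary" or "secondary"
    submission_date: date     # DSI submission date
    publication_date: date    # paper publication date
    tax_division: str         # illustrative taxonomic division code
    annex1_crop: bool         # True if the organism is an Annex 1 crop
    author_role: str          # "first", "senior", or "other"


def filter_records(
    records: Iterable[LinkedRecord],
    *,
    literature_type: Optional[str] = None,
    submitted_between: Optional[Tuple[date, date]] = None,
    published_between: Optional[Tuple[date, date]] = None,
    tax_division: Optional[str] = None,
    annex1_only: bool = False,
    author_role: Optional[str] = None,
) -> List[LinkedRecord]:
    """Keep records that satisfy every filter that was actually supplied."""
    selected = []
    for r in records:
        if literature_type is not None and r.literature_type != literature_type:
            continue
        if submitted_between is not None and not (
            submitted_between[0] <= r.submission_date <= submitted_between[1]
        ):
            continue
        if published_between is not None and not (
            published_between[0] <= r.publication_date <= published_between[1]
        ):
            continue
        if tax_division is not None and r.tax_division != tax_division:
            continue
        if annex1_only and not r.annex1_crop:
            continue
        if author_role is not None and r.author_role != author_role:
            continue
        selected.append(r)
    return selected


# Made-up sample records with placeholder identifiers.
records = [
    LinkedRecord("XX000001", "primary", date(2015, 3, 1), date(2016, 1, 15), "PLN", True, "first"),
    LinkedRecord("XX000002", "secondary", date(2018, 7, 9), date(2019, 2, 1), "PLN", False, "other"),
    LinkedRecord("XX000003", "primary", date(2020, 5, 20), date(2021, 4, 2), "PRO", False, "senior"),
]

primary_plants = filter_records(records, literature_type="primary", tax_division="PLN")
print([r.accession for r in primary_plants])
```

Leaving a filter at its default leaves that criterion unconstrained, which mirrors how a chart without active filters would show all records.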