GAPeDNA is an interactive web application for exploring taxonomic coverage gaps in eDNA metabarcoding reference databases. For a given primer pair and taxonomic group, it maps the proportion of species represented in the reference database across spatial regions worldwide.
🌐 Live app: shiny.cefe.cnrs.fr/GAPeDNA
If the link is unavailable, the server may be under maintenance (typically resolved within a few days). Please open an issue if the outage persists.
Reference sequences can now be recovered even in the absence of the primer binding sites by in silico PCR. GAPeDNA accounts for up to 3 mismatches per primer.
The database is updated periodically. Current version: MIDORI2 GenBank 269 (2025-12-09).
- Interactive choropleth maps — visualise per-region primer coverage as a percentage of species with a reference sequence
- Species table — click any polygon to display the full species list for that region, including IUCN conservation status and sequencing status for the selected marker
- Sequence extraction — upload a downloaded species table to retrieve the corresponding reference sequences and per-primer mismatch counts
- Download — export species tables and sequence data as CSV
| Group | Spatial resolutions | Checklist source |
|---|---|---|
| Marine fish | Ecoregions, Provinces, World | Albouy et al. 2019 |
| Freshwater fish | Drainage basins, World | Tedesco et al. 2017 |
| Elasmobranchs (sharks & rays) | Ecoregions, Provinces, Basins, World | IUCN Red List spatial data |
- Choose a taxon — marine fish, freshwater fish, or elasmobranchs
- Choose a spatial resolution — available options update automatically based on the taxon
- Choose a mitochondrial marker position — 12S, 16S, COI, CytB, or 18S
- Choose a primer pair — the interactive map renders immediately
Hover over any polygon to display the percentage of species sequenced. Click a polygon to load the full species list in the table below, with IUCN status and sequencing status per species. The table supports filtering and sorting. Use the Download button to export it as CSV.
Upload a CSV file exported from the World maps pane. The app will join each species to its reference sequence for the corresponding marker and display per-primer mismatch counts. The full table (including complete sequences) can be downloaded.
You need R (≥ 4.1) with the following packages installed:
shiny, leaflet, sf, dplyr, DT, htmltools, shinythemes, shinycssloaders
Option 1 — directly from GitHub:
library(shiny)
runGitHub("GAPeDNA", "virginiemarques")Option 2 — clone and run:
git clone https://github.com/virginiemarques/GAPeDNA
cd GAPeDNAlibrary(shiny)
runApp()The
data/data_for_GAPeDNA.Rdatafile (~100 MB) is not included in the repository. It is generated by the companion pipeline inGenerate_data_GAPeDNAor available on request.
Contributions to expand GAPeDNA's taxonomic scope are welcome. To propose a new group, please open an issue labelled enhancement and provide:
- One or more primer pairs targeting the group
- A global, spatialised species checklist at an appropriate resolution
If you use GAPeDNA in your work, please cite:
Marques V. et al. (2021). GAPeDNA: Assessing and mapping global species gaps in genetic databases for eDNA metabarcoding. Diversity and Distributions, 27, 1142–1155. https://doi.org/10.1111/ddi.13142
- Development & maintenance: Virginie Marques
- Illustration: P. Lopez (UMR MARBEC)
- Server deployment: A. Granier (CEFE)
- In silico PCR: CRABS
MIT © Virginie Marques — see LICENSE for details.

