AURORA (compounds in Nordic plants) is a Streamlit app, that integrates several plants databases like:
- COCONUT (Collection of Open Natural Products database),
- Laji.fi (Finnish Biodiversity Information Facility), and
- GBIF (Global Biodiversity Information Facility).
# create & activate a virtual environment (recommended)
python -m venv .venv
source .venv/bin/activate # on Windows use: .venv\Scripts\activate
# install dependencies
pip install -r requirements.txt
# run the app
streamlit run app.pyYou can also run the app inside Docker:
docker build -t aurora-app .
docker run -it --rm -p 8501:8501 aurora-app- COCONUT (Collection of Open Natural Products database) - CC0 license
- Laji.fi (Finnish Biodiversity Information Facility) - CC-BY license
- GBIF (Global Biodiversity Information Facility) - CC0/CC-BY/CC-BY-NC licenses (depending on the dataset)
All rights and data terms respected according to source guidelines.
- Add ETL scripts for automated retrieval and cleaning of GBIF, Laji.fi, and COCONUT datasets
- Provide a small reproducible test dataset for CI and demo purposes
- Expand CI workflow (unit tests, schema validation with Pandera)
- Optimize data loading and performance for larger datasets
- Add more interactive visualizations in the Streamlit app
Processed data files are expected under data/. Some data
sources (Laji.fi, COCONUT) may require manual export. ETL scripts will
be added in a future update.

