aqua90m

Remote interaction with freshwater and related data, mainly via OGC processes deployed on pygeoapi instances.

The module aqua90m contains various functions that ...

How to test the modules in python:

# activate virtual env:

# go to directory
cd .path/to/aqua90m/
# go up one directory, to be able to import the module "aqua90m":
cd ..

# run tests for one specific module:
python aqua90m/geofresh/basic_queries.py

The module pygeoapi_processes contains the processes that can be deployed on pygeoapi.

How to test the processes using requests.post():

# log in to server via ssh
ssh ...

# activate virtual env:

# go to directory
cd .../pygeoapi/pygeoapi/process/aqua90m/pygeoapi_processes/geofresh

# run all tests:
for f in ./*.py; do python "$f"; done

List of processes

Not up to date!

local subcatchment

get_snapped_points
get_snapped_points_plus
get_local_subcids
get_local_subcids_plural
get_local_streamsegments
get_local_streamsegments_subcatchments

upstream

get_upstream_subcids
get_upstream_streamsegments
get_upstream_bbox
get_upstream_subcatchments
get_upstream_dissolved
get_upstream_dissolved_aip (special version for usage by the AIP search interface, kept constant)

downstream

get_shortest_path_two_points
get_shortest_path_to_outlet

data access

extract_point_stats
subset_by_polygon (currently fails on aqua due to dependency issues with gdal)
subset_by_bbox (currently fails on aqua due to dependency issues with gdal)

utils

get_ddas_galaxy_link_textfile

Pygeoapi deployment

To deploy these processes, you first need a running pygeoapi instance (please see https://pygeoapi.io/ and follow their recommendations and best practices, e.g. about web servers, reverse-proxy usage and TLS certificates / HTTPS). We run pygeoapi via starlette behing an nginx webserver, via HTTPS.

To deploy the processes listed above on this pygeoapi instance, please modify the following existing files:

File pygeoapi/pygeoapi/plugin.py:

        # local subcatchment
        'SnappedPointsGetter': 'pygeoapi.process.aqua90m.pygeoapi_processes.rivernetwork.get_snapped_points.SnappedPointsGetter',
        'SnappedPointsGetterPlus': 'pygeoapi.process.aqua90m.pygeoapi_processes.rivernetwork.get_snapped_points_plus.SnappedPointsGetterPlus',
        'LocalSubcidGetter': 'pygeoapi.process.aqua90m.pygeoapi_processes.rivernetwork.get_local_subcids.LocalSubcidGetter',
        'LocalSubcidPluralGetter': 'pygeoapi.process.aqua90m.pygeoapi_processes.rivernetwork.get_local_subcids_plural.LocalSubcidPluralGetter',
        'LocalStreamSegmentsGetter': 'pygeoapi.process.aqua90m.pygeoapi_processes.rivernetwork.get_local_streamsegments.LocalStreamSegmentsGetter',
        'LocalStreamSegmentSubcatchmentGetter': 'pygeoapi.process.aqua90m.pygeoapi_processes.rivernetwork.get_local_streamsegments_subcatchments.LocalStreamSegmentSubcatchmentGetter',
        # upstream
        'UpstreamSubcidGetter': 'pygeoapi.process.aqua90m.pygeoapi_processes.rivernetwork.get_upstream_subcids.UpstreamSubcidGetter',
        'UpstreamStreamSegmentsGetter': 'pygeoapi.process.aqua90m.pygeoapi_processes.rivernetwork.get_upstream_streamsegments.UpstreamStreamSegmentsGetter',
        'UpstreamBboxGetter': 'pygeoapi.process.aqua90m.pygeoapi_processes.rivernetwork.get_upstream_bbox.UpstreamBboxGetter',
        'UpstreamSubcatchmentGetter': 'pygeoapi.process.aqua90m.pygeoapi_processes.rivernetwork.get_upstream_subcatchments.UpstreamSubcatchmentGetter',
        'UpstreamDissolvedGetterCont': 'pygeoapi.process.aqua90m.pygeoapi_processes.rivernetwork.get_upstream_dissolved.UpstreamDissolvedGetter',
        'UpstreamDissolvedGetter': 'pygeoapi.process.aqua90m.pygeoapi_processes.rivernetwork.get_upstream_dissolved_aip.UpstreamDissolvedGetter',
        # downstream
        'ShortestPathTwoPointsGetter': 'pygeoapi.process.aqua90m.pygeoapi_processes.rivernetwork.get_shortest_path_two_points.ShortestPathTwoPointsGetter',
        'ShortestPathToOutletGetter': 'pygeoapi.process.aqua90m.pygeoapi_processes.rivernetwork.get_shortest_path_to_outlet.ShortestPathToOutletGetter',
        # data access
        'ExtractPointStatsProcessor': 'pygeoapi.process.aqua90m.pygeoapi_processes.data_access.extract_point_stats.ExtractPointStatsProcessor',
        #'SubsetterBbox': 'pygeoapi.process.aqua90m.pygeoapi_processes.data_access.subset_by_bbox.SubsetterBbox',
        #'SubsetterPolygon': 'pygeoapi.process.aqua90m.pygeoapi_processes.data_access.subset_by_polygon.SubsetterPolygon',
        # utils
        'HelferleinProcessor': 'pygeoapi.process.aqua90m.pygeoapi_processes.data_access.get_ddas_galaxy_link_textfile.HelferleinProcessor',

File pygeoapi/pygeoapi-config.yml:

   # local subcatchment

    get-snapped-points:
        type: process
        processor:
            name: SnappedPointsGetter

    get-snapped-point-plus:
        type: process
        processor:
            name: SnappedPointsGetterPlus

    get-local-subcids:
        type: process
        processor:
            name: LocalSubcidGetter

    get-local-subcids-plural:
        type: process
        processor:
            name: LocalSubcidPluralGetter

    get-local-streamsegments:
        type: process
        processor:
            name: LocalStreamSegmentsGetter

    get-local-streamsegments-subcatchments:
        type: process
        processor:
            name: LocalStreamSegmentSubcatchmentGetter



    # upstream

    get-upstream-subcids:
        type: process
        processor:
            name: UpstreamSubcidGetter

    get-upstream-streamsegments:
        type: process
        processor:
            name: UpstreamStreamSegmentsGetter

    get-upstream-bbox:
        type: process
        processor:
            name: UpstreamBboxGetter

    get-upstream-subcatchments:
        type: process
        processor:
            name: UpstreamSubcatchmentGetter

    get-upstream-dissolved:
        type: process
        processor:
            name: UpstreamDissolvedGetter

    get-upstream-dissolved-cont:
        type: process
        processor:
            name: UpstreamDissolvedGetterCont



    # downstream

    get-shortest-path-two-points:
        type: process
        processor:
            name: ShortestPathTwoPointsGetter

    get-shortest-path-to-outlet:
        type: process
        processor:
            name: ShortestPathToOutletGetter



    # data access

    extract-point-stats:
        type: process
        processor:
            name: ExtractPointStatsProcessor

    #FAILS (GDAL) get-subset-by-bbox:
        #type: process
        #processor:
              #name: SubsetterBbox

    # FAILS (GDAL) get-subset-by-polygon:
        #type: process
        #processor:
              #name: SubsetterPolygon



    # utils
    get-ddas-galaxy-link-textfile:
        type: process
        processor:
            name: HelferleinProcessor

Dependencies, to be added to dependencies.txt

TODO
# Also, how to install them?

In addition, you need to add a JSON config file for process-relevant config, located at any readable place, e.g. pygeoapi/config.json. The contents vary by processes (see process-specific details), but some entries are shared by all or many processes, such as:

download_dir: The path to a directory from where users are able to download files, e.g. the static directory of the webserver that is running pygeoapi. In our case, it is /var/www/nginx/download/. It needs to be writeable by the Linux user who runs pygeoapi (in our case, pyguser), and readable by the Linux user who runs the webserver (in our case, www-data).
download_url: The URL under which the contents of the above download_dir can be accessed by outside users. Here, the process results will be made available to the end users. (You can decide to make the directory password-protected by the web server if the results should not be public).

For many of the processes, you also need credentials to access to IGB's GeoFRESH database, which is restricted. For more information, please check https://geofresh.org/ or contact IGB Berlin.

For some other processes, R and the R package hydrographr need to be installed and runnable by the Linux user running pygeoapi.

Process-specific details

extract-point-stats (old)

For this process, R and the R package hydrographr are needed.

For this process, the config file needs to contain these items:

rasterlayer_lookup_table: Mapping between variable names and a local path or remote URL where the corresponding raster layer can be found, as GeoTIFF or VRT or any layer that gdallocationinfo can work with.
hydrographr_bash_files: Path where the executable bash files of the hydrographr R package can be found.
download_dir
download_url

Example:

    "rasterlayer_lookup_table": {
        "basin": "https://2007367-nextcloud.a3s.fi/igb/vrt/basin.vrt",
        "sti": "/opt/aquainfra_inputs/Hydrography90m/sti_h18v02.tif"
    },
    "hydrographr_bash_files": "/opt/pyg_upstream_dev/pygeoapi/pygeoapi/process/hydrographr/inst/sh",
    "download_dir": "/var/www/nginx/download/",
    "download_url": "https://aqua.igb-berlin.de/download/",

Implementation details/questions

Easy next steps (2026-01-09)

TODO Add main tests to all processes
TODO Go over all process descriptions
TODO Put singular and plural into one process (already done: get_shortest_distance_between_points.py)

Less easy:

TODO: get_shortest_distance_between_points.py: Have to accept CSV inputs!
Missing processes:
- get_shortest_distance_to_outlet.py
- get_basin_subc_ids.py
get_local_ids should return GeoJSON!

Design Questions (2026-01-09)

What to do if points are in different basins? Currently, we throw errors (except for outlet-routing)
How should output look, when there is one result per subc, or per site? Should we return one result per subcatchment? Or per site_id?
Processes that can return list of ids, or geometries: Separate processes, or joint?
- Separate: get_upstream_subcids, get_upstream_streamsegments, get_upstream_subcatchments
- Joint: get_path_between_points (distinguish by “result_format” or “add_geometries” or “add_distances”, something like that): get_subcids_between_points, get_linestrings_between_points, get_distances_between_points (this is already separate)
- Joint: get_path_to_outlet: get_subcids_to_outlet, get_linestrings_to_outlet, get_distance_to_outlet (this is already separate)
- Joint: get_outlets_for_polygon: get_outlet_subcids, get_outlet_points

routing module (2026-01-09)

Contains these functions that are called by processes:

get_dijkstra_ids_one_to_one()

Input: two individual points (represented by subc_ids)
Can be used for routing between two points
Can be used for routing between from a point to an outlet

Called by singular processes:

get_shortest_path_between_points.py
get_shortest_path_to_outlet.py

get_dijkstra_ids_to_outlet_plural()

Input: Lots of points! As GeoJSON or as dataframe
They don't need to be inside on region or basin. The function splits the points into batches, per basin. As there is one outlet per basin, it makes sense to run one one_to_many SQL query per basin.

Called by plural process:

get_shortest_path_to_outlet_plural.py

get_dijkstra_ids_many_to_many()

Input: Lots of points! As GeoJSON or as dataframe
The points have to be inside one region and basin. We cannot perform routing across basins, and already in one basin, the output is a complex matrix. Now if we accepted various basins, we'd have to return the results separately per basin anyway, so the user can as well make several queries.

Called by plural process:

get_shortest_path_between_points_plural.py

Name		Name	Last commit message	Last commit date
Latest commit History 578 Commits
geofresh		geofresh
mapclient		mapclient
pygeoapi_processes		pygeoapi_processes
test_input_data		test_input_data
utils		utils
.gitignore		.gitignore
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

aqua90m

List of processes

Pygeoapi deployment

Process-specific details

extract-point-stats (old)

Implementation details/questions

Easy next steps (2026-01-09)

Design Questions (2026-01-09)

routing module (2026-01-09)

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

aqua90m

List of processes

Pygeoapi deployment

Process-specific details

extract-point-stats (old)

Implementation details/questions

Easy next steps (2026-01-09)

Design Questions (2026-01-09)

routing module (2026-01-09)

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages