Parallel I/O output module enhancements #655

SeanBryan51 · 2025-11-18T06:18:51Z

📚 Documentation preview 📚: https://cable--655.org.readthedocs.build/en/655/

SeanBryan51 · 2025-11-18T23:16:08Z

Hi @Whyborn, I've opened this PR to highlight the output module specific diffs (includes the aggregators and the list data structure for output variables). The implementation still contains a bunch of TODO's, but it would be awesome if I could get your feedback at this stage.

I've been testing it only in serial (testing in parallel will only be possible once the met data can be read in via parallel I/O) and currently gives the same answers as before for Qh.

However, I did have to change the NetCDF format in the old output module from classic to NetCDF4/HDF5 to easily compare outputs between new and old. This was a hack as the new NetCDF API has the NetCDF4 format hardcoded whenever creating or opening new files. I'll make that configurable so that both classic and NetCDF4 is supported.

Whyborn · 2025-11-19T00:29:06Z

First pass comments:

I think the output should be dropped from cable_output_aggregator_t- the aggregator works exactly perfectly fine outside the output module, and can be used any time you need temporal aggregation for output or science.
I don't think the output variable should handle the aggregation frequency. I think the control should be with the relevant science (I guess particularly the relevant science output module) module as to when an aggregator should be accumulated.
What's the benefit to having the active argument in the cable_output_add_variable call, over an if statement?
I think the general approach should be to have the aggregators defined first, and then pass as arguments to the cable_output_add_variable call. Something like:

class(aggregator_t), dimension(:), allocatable :: Qh_aggregators
...
Qh_aggregators = create_aggregators(canopy%fh, ["mean", "max"])
call cable_output_add_variable(
  ...
  aggregator=Qh_aggregators
  ...
)

This way it makes the aggregators available for use where necessary (e.g. the Tmx) variable. Although I'm not sure if you could make the elements of the returned Qh_aggregators targets?

I don't think the shapes should be pre-defined. I think they should be built by the caller by passing dimensions, something like:

cable_output_shape_land_patch = create_decomposition(["land", "patch"])
cable_output_shape_land_patch_soil = create_decomposition(["land", "patch", "soil"])

where the dimensions are defined earlier in initialisation with something like:

call define_cable_dimension("soil", 6)
call define_cable_dimension("rad", 2)

I think this makes the design more easily extensible? This makes it trivial for someone developing new code which may have new dimensions to output their own stuff.

In aggregators.f90, what's the point of the aggregator_store_t class? This seems like it could be

type aggregator_store_t
  class(aggregator_t), allocatable :: aggregators(:)
  integer :: num_aggregators
end type aggregator_store_t

where num_aggregators functions in the same way the current num_aggregators functions.

I do think we need a time module, but we may be able to import that from elsewhere for a full featured timing module.

SeanBryan51 · 2025-11-19T03:25:51Z

Thanks @Whyborn for your thoughts!

I think the output should be dropped from cable_output_aggregator_t- the aggregator works exactly perfectly fine outside the output module, and can be used any time you need temporal aggregation for output or science.

I don't think the output variable should handle the aggregation frequency. I think the control should be with the relevant science (I guess particularly the relevant science output module) module as to when an aggregator should be accumulated.

I agree that it would be nice if time related control were handled at a higher level (for example with the relevant science output module as you say). However, it wasn't obvious to me how this approach could allow for driving the intermediate aggregators required for Tmx and Tmn, where the intermediate aggregator accumulates at all frequencies and aggregates at daily frequencies, and the resulting aggregator accumulates at daily frequencies and aggregates at monthly frequencies. The cable_output_aggregator_t, along with the handling of the aggregation and accumulation frequencies at the aggregator level, was introduced so that aggregators can be driven independently at any accumulation or aggregation frequency which seemed easier for supporting the intermediate aggregator case. There might be a better solution here though, so I'm definitely more than happy to discuss further.

What's the benefit to having the active argument in the cable_output_add_variable call, over an if statement?

For two reasons:

I think it's nicer for enabling variables depending on some sort of boolean logic. For example for groups of variables, we could just .or. the variable specific logical with the group logical rather than updating the output inclusion type instance as is done here.
The advantage of using active over nesting the call within if (active) is that this allows us to build a list of all possible supported outputs (both active and inactive) which could used to generate documentation on the supported outputs. This was inspired from the CTSM/CLM folks.

I think the general approach should be to have the aggregators defined first, and then pass as arguments to the cable_output_add_variable call. Something like:
class(aggregator_t), dimension(:), allocatable :: Qh_aggregators
...
Qh_aggregators = create_aggregators(canopy%fh, ["mean", "max"])
call cable_output_add_variable(
  ...
  aggregator=Qh_aggregators
  ...
)
This way it makes the aggregators available for use where necessary (e.g. the Tmx) variable. Although I'm not sure if you could make the elements of the returned Qh_aggregators targets?

Oh yep I remember you mentioning this in our discussions. I think this is something we could definitely add in the future - I was hesitant to introduce this now as I want to avoid diverging in functionality from the current output module which assumes a single aggregation method per variable.

As for a list of objects being targets, I got around that problem by working with arrays of type aggregator_handle_t which contain a reference to an aggregator rather than the actual aggregator instance.

I don't think the shapes should be pre-defined. I think they should be built by the caller by passing dimensions, something like:
cable_output_shape_land_patch = create_decomposition(["land", "patch"])
cable_output_shape_land_patch_soil = create_decomposition(["land", "patch", "soil"])
where the dimensions are defined earlier in initialisation with something like:
call define_cable_dimension("soil", 6)
call define_cable_dimension("rad", 2)
I think this makes the design more easily extensible? This makes it trivial for someone developing new code which may have new dimensions to output their own stuff.

I like this suggestion, I agree it's definitely not that obvious what one needs to do to create a new CABLE_OUTPUT_SHAPE_TYPE_*. I think rather than determining the shape of the decomposition object from the defined dimensions, I would just initialise a new decomposition via the io_decomp_* procedures, or use the io_decomp instance. Removing the SHAPE_TYPE_* variables means I will need to rethink about how each variable will infer the temporary buffer used for grid cell averaging. I'll reflect on that a bit.

In aggregators.f90, what's the point of the aggregator_store_t class? This seems like it could be
type aggregator_store_t
  class(aggregator_t), allocatable :: aggregators(:)
  integer :: num_aggregators
end type aggregator_store_t
where num_aggregators functions in the same way the current num_aggregators functions.

aggregator_store_t was introduced because allocatable arrays of a polymorphic type are problematic (i.e. class(aggregator_t), allocatable :: aggregators(:)), as the compiler doesn't know in general what the type (and hence size) of each element will be on allocation. In practice, you need a container derived type which itself is not polymorphic, like aggregator_store_t, as the list element. It's similar to how you would declare an array of pointers in fortran. Interestingly the IBM compiler seems to actually allow for allocatable arrays of a polymorphic type: https://www.ibm.com/docs/en/xl-fortran-aix/16.1.0?topic=concepts-array-constructors, but its not standard as far as I know.

I do think we need a time module, but we may be able to import that from elsewhere for a full featured timing module.

I agree, a timing module for wider use in the code would be great. The time_step_matches function was my hacky attempt to clean up the timing logic code in write_output, definitely something we could introduce properly (independent of these changes).

Whyborn · 2025-11-19T06:02:45Z

However, it wasn't obvious to me how this approach could allow for driving the intermediate aggregators required for Tmx and Tmn, where the intermediate aggregator accumulates at all frequencies and aggregates at daily frequencies, and the resulting aggregator accumulates at daily frequencies and aggregates at monthly frequencies. The cable_output_aggregator_t, along with the handling of the aggregation and accumulation frequencies at the aggregator level, was introduced so that aggregators can be driven independently at any accumulation or aggregation frequency which seemed easier for supporting the intermediate aggregator case.

Does this allow aggregators to be driven independently? It seems to me like specifically doesn't allow this, because every aggregator which accumulates e.g. on_timestep, daily etc would be accumulated at the same time. This would mess with things that happen at the end of the day, using some aggregation of on_timestep quantities. I think the Tmn and Tmx variables are examples- I'm fairly sure these feed into CASA science. That would mean, to get the correct daily minima and maxima, the order of operations would have to look like:

do_biogeophysics -> do_timestep_accumulation -> do_CASA -> do_daily_accumulation

There has to be a way to trigger accumulation at specific times, for instances like this. This is why I think the aggregators have to be available to work with as standalone objects.

I think it's nicer for enabling variables depending on some sort of boolean logic. For example for groups of variables, we could just .or. the variable specific logical with the group logical rather than updating the output inclusion type instance as is done here.
The advantage of using active over nesting the call within if (active) is that this allows us to build a list of all possible supported outputs (both active and inactive) which could used to generate documentation on the supported outputs. This was inspired from the CTSM/CLM folks.

Yea I can see that, it might be more easily readable to have every call contain a

output%variable .or output%variable_group .or. output%variable_module

instead of a construction like

if (output%variable_module) then
  if (output%variable_group) then
    if (output%variable) then
      ...
    end 
  end
end

Although I don't see why the latter couldn't also be used to create the same documentation in the same way.

I like this suggestion, I agree it's definitely not that obvious what one needs to do to create a new CABLE_OUTPUT_SHAPE_TYPE_. I think rather than determining the shape of the decomposition object from the defined dimensions, I would just initialise a new decomposition via the io_decomp_* procedures, or use the io_decomp instance. Removing the SHAPE_TYPE_ variables means I will need to rethink about how each variable will infer the temporary buffer used for grid cell averaging. I'll reflect on that a bit.

Yea that's what I meant, whether the io_decomp is created via dimensions names or just numbers, which could be accessible with something like dim_length(dim_name), is much of the same to me. I do think it's important that the various dimensions can be accessed in this way, since it's something done already throughout the code via use statements, and this would help eliminate that method of variable sharing.

aggregator_store_t was introduced because allocatable arrays of a polymorphic type are problematic (i.e. class(aggregator_t), allocatable :: aggregators(:)), as the compiler doesn't know in general what the type (and hence size) of each element will be on allocation. In practice, you need a container derived type which itself is not polymorphic, like aggregator_store_t, as the list element. It's similar to how you would declare an array of pointers in fortran. Interestingly the IBM compiler seems to actually allow for allocatable arrays of a polymorphic type: https://www.ibm.com/docs/en/xl-fortran-aix/16.1.0?topic=concepts-array-constructors, but its not standard as far as I know.

Are you sure? I thought arrays of polymorphic classes were part of the standard, I used them in some of my aggregator testing. You just need select type blocks to access any of the child components.

I agree, a timing module for wider use in the code would be great. The time_step_matches function was my hacky attempt to clean up the timing logic code in write_output, definitely something we could introduce properly (independent of these changes).

There's the datetime-fortran which we already have a spack package for? I haven't actually looked at it's features yet though.

…rid cell averaging buffers

… of main loop

This change allows the `cable_abort_module` to be used in modules where `cable_def_types_mod` or `cable_IO_vars_module` are a dependee of that module as the removal of `range_abort` avoids introducing cyclic module dependencies. The impact of this change is minimal as `range_abort` is only called from the `cable_abort_module` in the code base.

…regator rather than abstract aggregator type This is done so that new_aggregator can instantiate non-polymorphic aggregator instances as well as polymorphic aggregator instances.

SeanBryan51 · 2025-11-25T02:03:32Z

Hi @Whyborn, I've made the updates we discussed last week. I think it's looking much better now with your suggestions on how aggregators are organised in the output module, thanks as always for the comments! Please let me know if there is anything else that catches your eye on the design.

I'm going to try add support for specifying non-time varying data in the output and restart files. For this I'm thinking we could introduce a cable_output_add_parameter subroutine (similar to cable_output_add_variable) for specifying the non-time varying parameters in the output, and a similar cable_output_add_restart_variable for the restart variables.

Whyborn · 2025-11-25T06:14:49Z

Just a couple of comments:

The cable_output_add_variable routine doesn't have an aggregation_frequency yet- I think we need that and a minimum frequency
I think it would be good to decouple the output and the grid cell averaging (and reductions in general). A bit like the aggregators- it's a functionality useful for the output, but should be considered a distinct module that the output module uses.
Along that line, I think it would be nice to define reducers/mapper by name. So you could define the variable like:

if (output%patch)
  decomp = decomp_land_real32
  reducer = "patch"
else
  decomp = decomp_land_patch_real32
  reducer = "none"
end if

call cable_output_add_variable(
  name="Qh",
  ...
  decomp=decomp,
  reducer=reducer
)

And then in the write_variable you could have:

subroutine write_variable(output_variable, output_file, time_index)
...
  if (output_variable%reducer == "patch") then
    call output_file%write_darray(
        var_name=output_variable%name,
        values=compute_patch_average(aggregator%storage)
        decomp=output_variable%decomp,
        frame=time_index
  else if (output_variable%reducer == "none") then
    call output_file%write_darray(
        var_name=output_variable%name,
        values=aggregator%storage
        decomp=output_variable%decomp,
        frame=time_index
  end if

Then you wouldn't have to have all the buffers. The compute_patch_average would be a function interface, with different rank/type module procedures.

Can you remind me why the all the decompositions for different element types are required to have distinct variables? Aren't the decomps basically just indexing arrays?

SeanBryan51 · 2025-11-26T00:19:54Z

The cable_output_add_variable routine doesn't have an aggregation_frequency yet- I think we need that and a minimum frequency

I'm definitely happy to add the frequency limit to cable_output_add_variable as that information is no doubt specific to each variable. I removed the aggregation_frequency argument because the aggregation frequency is tied to the output frequency which is set at the "profile" level.

I think it would be good to decouple the output and the grid cell averaging (and reductions in general). A bit like the aggregators- it's a functionality useful for the output, but should be considered a distinct module that the output module uses.

That sounds good to me, I will rename the module containing the grid cell averaging stuff to be more general (and maybe put this under utils/)?

Can you remind me why the all the decompositions for different element types are required to have distinct variables? Aren't the decomps basically just indexing arrays?

The distinction of types for decompositions is required by PIO via the basepiotype argument of pio_initdecomp.

Along that line, I think it would be nice to define reducers/mapper by name.

I'm happy to introduce a string argument for reducer instead of the grid_cell_average logical.

Then you wouldn't have to have all the buffers. The compute_patch_average would be a function interface, with different rank/type module procedures.

I did consider using a function interface instead of a subroutine interface, however I opted for the subroutine approach with the temporary buffer as this avoids introducing a potentially large allocation when computing the grid cell average. I want to limit the number of unnecessary allocations and copy operations in the write procedures where possible. It might be possible to return a preallocated array from a function. I can look into this a bit more.

Thank you again for the feedback on this!

src/util/aggregator_types.F90

ccarouge · 2025-12-01T01:46:16Z

src/offline/cable_output_prototype_v2.F90

+      end do
+    end if
+
+    if (time_step_matches(dels, time_index, output%averaging, leaps, start_year)) then


Note, we need to change "output%averaging" to "output%aggregating" since it's not always an average on that time window.

Totally agree, I probably won't change the name as part of the parallel I/O changes as its likely a breaking change to the CABLE namelist.

Do we have an issue for documenting namelist improvements? I know Jhan proposed some improvements to the organisation of namelists in #588. And there is also:

CABLE/src/offline/cable_driver_common.F90

Lines 166 to 179 in 8e20bac

! TODO(Sean): we should not be setting namelist parameters in the following if

! block - all options are all configurable via the namelist file and is

! unclear that these options are being overwritten. A better approach would be

! to error for bad combinations of namelist parameters.

IF (icycle > CASAONLY_ICYCLE_MIN) THEN

icycle = icycle - CASAONLY_ICYCLE_MIN

CASAONLY = .TRUE.

CABLE_USER%CASA_DUMP_READ = .TRUE.

CABLE_USER%CASA_DUMP_WRITE = .FALSE.

ELSE IF (icycle == 0) THEN

CABLE_USER%CASA_DUMP_READ = .FALSE.

spincasa = .FALSE.

CABLE_USER%CALL_POP = .FALSE.

END IF

And

CABLE/src/offline/cable_driver_common.F90

Lines 181 to 183 in 8e20bac

! TODO(Sean): overwriting l_casacnp defeats the purpose of it being a namelist

! option - we should either deprecate the l_casacnp option or not overwrite

! its value.

Note, we need to change "output%averaging" to "output%aggregating" since it's not always an average on that time window.

Just thought of this now, a way to make this a bit nicer is to have an aggregation_frequency component in the cable_output_profile_t type which is assigned to output%averaging on initialisation. I think this makes sense as the sampling frequency is intended to be set at the profile level.

ccarouge · 2025-12-01T01:49:43Z

src/offline/cable_serial.F90

+          ! TODO(Sean): this is a hack for determining if the current time step
+          ! is the last of the month. Better way to do this?
+          IF(ktau == 1) THEN
+            !MC - use met%year(1) instead of CABLE_USER%YearStart for non-GSWP forcing and leap years
+            IF ( TRIM(cable_user%MetType) .EQ. '' ) THEN
+                start_year = met%year(1)
+            ELSE
+                start_year = CABLE_USER%YearStart
+            ENDIF
+          END IF
+


If we read a met file during the initialisation (outside the time loop), we could then initialise "CABLE_USER%YearStart = met%year(1)" for the MetType = ''?
It would remove the need for this if condition and it makes sure that cable_user%YearStart is always defined and always has the same meaning.

Not sure that's what you mean in your TODO. It seems to apply more to the code in cable_timing_utils.F90 than here.

Yep I definitely want to get back to this, I'll test out the solution you propose. I mean to try out the site case (which I believe corresponds to MetType == ' ') to confirm I haven't broken things here for this case.

Not sure that's what you mean in your TODO. It seems to apply more to the code in cable_timing_utils.F90 than here.

You've interpreted that correctly, it was a reminder to myself to look into an alternative algorithm for monthly timings that doesn't require the start year of the simulation.

As an alternate, we could consider an additional "ktau"-like variable that is reset every year. (maybe there is one in the code, I don't know). The time loop tells us when we change year (to check if this is true with the site simulations).

What's the reason not to have cable_user%yearstart be something that is unconditionally required in the namelist, and have that be the final truth about the start point of the simulation? Having that able to be overwritten is bound to lead to confusion in the future.

As an alternate, we could consider an additional "ktau"-like variable that is reset every year. (maybe there is one in the code, I don't know). The time loop tells us when we change year (to check if this is true with the site simulations).

I've seen this pattern used in other models (see #656). I agree that tracking information like this throughout the run makes sense

What's the reason not to have cable_user%yearstart be something that is unconditionally required in the namelist, and have that be the final truth about the start point of the simulation? Having that able to be overwritten is bound to lead to confusion in the future.

That's a great point, it might make sense to instead error if met%year(1) does not match cable_user%yearstart, and rely on cable_user%yearstart for timing logic.

I don't really understand why met%year(1) is required for this case, I'll see what happens when we rely on cable_user%yearstart alone

Isn't ktau already that? The timestep within the current year, with ktau_tot being the total timestep in the simulation?

I believe ktau can extend beyond an year for site configurations (dependent on kend). For the MetType = 'gswp3' case that I'm testing at the moment, ktau is reset every year. This is probably a symptom of the many met forcing formats / configurations which are supported by the driver.

It looks like ktau loops with do ktau = kstart, kend here and kend is set to be the number of steps in a year in almost every instance, except for gswp which determines it based off the length of the time dimension in the met file (this case block)

src/offline/cable_output_definitions.F90

…_daily aggregators to canopy_type

…ily aggregators

Specific changes include: 1. Remove `output_aggregator_t` and instead assign each aggregator handle to either the `accumulations_time_step`, `accumulations_daily` list based on the accumulation frequency. 2. When updating outputs at each time step, accumulate aggregators according to the assigned accumulation frequency. If writing output variables, then perform the normalisation and resetting of each aggregator before and after the write operation. Intermediate aggregators (such as tscrn_max_daily) are assumed to be driven correctly outside the output module, i.e. they are treated the same as other working variables. 3. Add range check functionality

…al for land dimension

…l profile

…ariables

SeanBryan51 · 2025-12-02T01:12:01Z

Thanks for the comments @ccarouge, I'll add in those suggestions!

ccarouge · 2025-12-02T01:15:57Z

src/offline/cable_output_prototype_v2.F90

+      end do
+    end if
+
+    if (time_step_matches(dels, time_index, output%averaging, leaps, start_year)) then


I would split this subroutine in two (or more) to split out where we accumulate the aggregators and where we write out the data. I think we should have an update and a write procedure. I've seen you discussed this with Lachlan as well. I understand the idea of replicating the current behaviour. I'm just worried this is going to push us towards a design we would have to rethink to split to get the accumulation step handled by the science modules.

Lachlan and I sort of came to the agreement that any aggregators used as part of non-output related computations would be driven outside the output module (similar to the canopy%tscrn_max_daily aggregator), and that aggregators for writing output would be driven by the output module. Is there a specific use case you have in mind where it makes sense to have the accumulation step for the aggregators for writing output handled by the science modules?

It doesn't have to go to one of the science modules. I think it still makes sense to accumulate and write in different routines.

ccarouge · 2025-12-02T01:18:17Z

src/offline/cable_output_prototype_v2.F90

+
+  end subroutine write_variable
+
+  subroutine write_variable_grid_cell_average(output_variable, output_file, time_index, patch, landpt)


If I follow correctly, the reason the grid averaging is done within the write procedure is to save on memory allocation. But then, I don't understand why we need to save the averaging within output_variable%, why not use local variables to store the data for the write?

Personally I would like to move this functionality out of the output module, and have a reducer defined in the output variable initialisation, that would recognise various reductions i.e. grid cell average, dominant patch. But I think this is quite difficult to do with the tools Fortran offers

@Whyborn , my first comment was to say I'd prefer the grid averaging to be done separately from the write. But then, I thought Sean must have had a reason to do it this way. I thought hard (and not that long) and thought the memory thing could be it.

I don't understand why we need to save the averaging within output_variable%, why not use local variables to store the data for the write?

Just to confirm, are you asking whether the time aggregated quantities should be the grid cell averaged values rather than the values per patch?

@ccarouge I think it's actually quite difficult to decouple them completely, because the grid cell averages don't have working variables to be associated with.

…omputing mean aggregations

…bles in the same list

Restore pack functionality, move if (active) block from cable_output_add_variable to cable_output_commit.

…utput_commit

SeanBryan51 added 3 commits November 18, 2025 17:18

src/offline/cable_output.F90: enable NetCDF4 format in output

2e70c2c

Add modified version of Lachlan's aggregator implementation

82ba3fc

Add parallel I/O output module implementation

8abda2c

SeanBryan51 force-pushed the parallelio-playground-output-module-enhancements branch from 4b2f171 to 8abda2c Compare November 18, 2025 22:39

SeanBryan51 added 3 commits November 20, 2025 11:11

src/offline/cable_output_prototype_v2.F90: fix incorrect shapes for g…

682a50e

…rid cell averaging buffers

Remove CABLE_OUTPUT_SHAPE_TYPE_* constants

90ffdfa

src/offline/cable_serial.F90: move output module finalisation outside…

78c8d3c

… of main loop

SeanBryan51 mentioned this pull request Nov 21, 2025

Managing time in CABLE #656

Open

SeanBryan51 added 2 commits November 25, 2025 11:35

src/util/aggregator.F90: Return concrete aggregator type from new_agg…

1e98b3f

…regator rather than abstract aggregator type This is done so that new_aggregator can instantiate non-polymorphic aggregator instances as well as polymorphic aggregator instances.

ccarouge reviewed Nov 30, 2025

View reviewed changes

src/util/aggregator_types.F90 Outdated Show resolved Hide resolved

ccarouge reviewed Nov 30, 2025

View reviewed changes

src/util/aggregator_types.F90 Outdated Show resolved Hide resolved

ccarouge reviewed Dec 1, 2025

View reviewed changes

src/util/aggregator_types.F90 Outdated Show resolved Hide resolved

ccarouge reviewed Dec 1, 2025

View reviewed changes

src/offline/cable_output_definitions.F90 Show resolved Hide resolved

SeanBryan51 added 7 commits December 2, 2025 11:09

src/offline/cable_define_types.F90: add tscrn_max_daily and tscrn_min…

8ba77b7

…_daily aggregators to canopy_type

src/offline/cable_serial.F90: update tscrn_max_daily and tscrn_min_da…

d9fcfb0

…ily aggregators

src/offline/cable_output_prototype_v2.F90: mland should be mland_glob…

a2fbb16

…al for land dimension

Rename cable_output_utils_mod to cable_grid_reductions_mod

339a76a

Check if sampling and accumulation frequency is valid

5fb101d

Replace grid_cell_averaging logical with reduction_method string

1ca7b3e

SeanBryan51 added 7 commits December 2, 2025 11:23

Remove cable_output_add_aggregator subroutine

b7ada3b

Only add active output variables to list of output variables in globa…

93bce35

…l profile

src/util/aggregator_types.F90: sample source_data for point_accumulate

666a93b

src/offline/cable_output_prototype_v2.F90: make time_index optional

365a754

Add writing of parameters

9e1a41a

src/offline/cable_io_decomp.F90: fix dim specification for 2D patch v…

3ca423d

…ariables

src/offline/cable_output_definitions.F90: Add albsoil test case

d37196a

ccarouge reviewed Dec 2, 2025

View reviewed changes

src/util/aggregator_types.F90: Add incremental averaging method for c…

fb4febe

…omputing mean aggregations

SeanBryan51 force-pushed the parallelio-playground-output-module-enhancements branch from 03eaa9b to fb4febe Compare December 2, 2025 03:32

SeanBryan51 added 12 commits December 2, 2025 14:48

Rename storage to aggregated_data

676e718

Remove normalise procedure

5708824

src/offline/cable_output_prototype_v2.F90: store parameters and varia…

6b923d7

…bles in the same list

Reorganise cable_output_add_variable and cable_output_commit

ba4669f

Restore pack functionality, move if (active) block from cable_output_add_variable to cable_output_commit.

src/util/netcdf/cable_netcdf.F90: add redef

636acbb

src/util/netcdf/cable_netcdf.F90: make dim_names optional

dde58d9

src/util/netcdf/cable_netcdf.F90: Add iotype and mode arguments

8947cd6

src/offline/cable_output_prototype_v2.F90: specify iotype argument

b15c20c

src/offline/cable_output_prototype_v2.F90: end define mode in cable_o…

6a92ce9

…utput_commit

src/offline/cable_io_decomp.F90: remove restart specific decompositions

864f14a

Add cable_restart_mod and cable_restart_write_mod

5c36fd1

src/offline/cable_serial.F90: write restarts via new restart modules

9263f1e

SeanBryan51 force-pushed the parallelio-playground-output-module-enhancements branch from 4724830 to 9263f1e Compare December 10, 2025 00:55

SeanBryan51 mentioned this pull request Dec 10, 2025

Rename namelist option output%averaging #661

Open

	! TODO(Sean): we should not be setting namelist parameters in the following if
	! block - all options are all configurable via the namelist file and is
	! unclear that these options are being overwritten. A better approach would be
	! to error for bad combinations of namelist parameters.
	IF (icycle > CASAONLY_ICYCLE_MIN) THEN
	icycle = icycle - CASAONLY_ICYCLE_MIN
	CASAONLY = .TRUE.
	CABLE_USER%CASA_DUMP_READ = .TRUE.
	CABLE_USER%CASA_DUMP_WRITE = .FALSE.
	ELSE IF (icycle == 0) THEN
	CABLE_USER%CASA_DUMP_READ = .FALSE.
	spincasa = .FALSE.
	CABLE_USER%CALL_POP = .FALSE.
	END IF

	! TODO(Sean): overwriting l_casacnp defeats the purpose of it being a namelist
	! option - we should either deprecate the l_casacnp option or not overwrite
	! its value.


		end subroutine write_variable

		subroutine write_variable_grid_cell_average(output_variable, output_file, time_index, patch, landpt)

Parallel I/O output module enhancements #655

Are you sure you want to change the base?

Parallel I/O output module enhancements #655

Uh oh!

Conversation

SeanBryan51 commented Nov 18, 2025 • edited by github-actions bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

SeanBryan51 commented Nov 18, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Whyborn commented Nov 19, 2025

Uh oh!

SeanBryan51 commented Nov 19, 2025

Uh oh!

Whyborn commented Nov 19, 2025

Uh oh!

SeanBryan51 commented Nov 25, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Whyborn commented Nov 25, 2025

Uh oh!

SeanBryan51 commented Nov 26, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

SeanBryan51 Dec 3, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

ccarouge Dec 1, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

SeanBryan51 commented Dec 2, 2025

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

SeanBryan51 commented Nov 18, 2025 •

edited by github-actions bot

Loading

SeanBryan51 commented Nov 18, 2025 •

edited

Loading

SeanBryan51 commented Nov 25, 2025 •

edited

Loading

SeanBryan51 commented Nov 26, 2025 •

edited

Loading

SeanBryan51 Dec 3, 2025 •

edited

Loading

ccarouge Dec 1, 2025 •

edited

Loading