Rlssm class make model dist by cpaniaguam · Pull Request #915 · lnccbrown/HSSM

cpaniaguam · 2026-03-02T18:40:53Z

This pull request introduces reinforcement learning sequential sampling model (RLSSM) support to the HSSM package. It adds a new RLSSM class, supporting configuration, likelihood construction, and data validation for RL+SSM models, and refines the configuration workflow to require a fully annotated log-likelihood function. The changes also improve pre-commit configuration and update the package's public API.

Major features and changes:

1. RLSSM Model Integration

Added a new RLSSM class in src/hssm/rl/rlssm.py to support models that combine reinforcement learning processes with sequential sampling models. This class builds a differentiable pytensor Op from an annotated JAX log-likelihood function and enforces strict data requirements for balanced panels.
Introduced a utility function validate_balanced_panel in src/hssm/rl/utils.py to ensure input data forms a balanced panel, which is required for RLSSM models.

2. Configuration Enhancements

Extended RLSSMConfig in src/hssm/config.py to require an ssm_logp_func (an annotated JAX SSM log-likelihood function), replacing the previous loglik/loglik_kind workflow. Added runtime validation to ensure this function is callable and properly annotated. [1] [2] [3]
Updated from_rlssm_dict to accept a config dictionary and extract ssm_logp_func and model_name directly from it, simplifying model instantiation.

3. Public API and Package Structure

Registered RLSSM and RLSSMConfig in the package's public API via src/hssm/__init__.py and created a new src/hssm/rl/__init__.py for RL-related exports. [1] [2] [3]

4. Developer Experience

Updated .pre-commit-config.yaml to exclude the tests/ directory from ruff and mypy checks, streamlining development workflows.

…r RL parameters

…del structure

Copilot

Pull request overview

Adds first-class RL + SSM (RLSSM) support to HSSM by introducing a new RLSSM model that builds a differentiable PyTensor Op from an annotated JAX SSM log-likelihood and plugs it into the existing distribution-building pipeline.

Changes:

Introduces RLSSM model class plus RL utility validate_balanced_panel.
Extends configuration via RLSSMConfig.ssm_logp_func and exposes RLSSM in the public API.
Adds test coverage for RLSSM initialization/model build and updates RLSSMConfig validation tests.

Reviewed changes

Copilot reviewed 7 out of 7 changed files in this pull request and generated 4 comments.

Show a summary per file

File	Description
`src/hssm/rl/rlssm.py`	New RLSSM model implementation integrating RL likelihood `Op` into HSSMBase.
`src/hssm/rl/utils.py`	Adds balanced-panel validation helper for RLSSM datasets.
`src/hssm/rl/__init__.py`	RL subpackage exports for RLSSM and utilities.
`src/hssm/config.py`	Adds `ssm_logp_func` to `RLSSMConfig` and validates presence.
`src/hssm/__init__.py`	Exposes `RLSSM` / `RLSSMConfig` at top-level.
`tests/test_rlssm.py`	New end-to-end-ish RLSSM tests (init, model build, balanced panel, smoke sampling).
`tests/test_rlssm_config.py`	Updates RLSSMConfig tests to include the new required field.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Copilot · 2026-03-02T19:01:39Z

+            f"same number of trials. Observed trial counts: {dict(counts)}"
+        )
+
+    return int(len(counts)), int(counts.iloc[0])


validate_balanced_panel only checks equal trial counts, but the RL likelihood builder reshapes the row order into (n_participants, n_trials, ...) (see make_rl_logp_func), which assumes each participant’s trials are in one contiguous block (and usually in-trial order). With interleaved participants, the panel can be “balanced” yet produce a silently incorrect likelihood. Consider validating contiguity (each participant appears in exactly one run of length n_trials) and/or sorting by participant_col (+ an optional trial_col if present) before returning (n_participants, n_trials).

Suggested change

return int(len(counts)), int(counts.iloc[0])

# Ensure that each participant's trials form a single contiguous block

# of rows of length n_trials. This is required because downstream code

# reshapes the data into (n_participants, n_trials, ...) based on row

# order, assuming no interleaving across participants.

n_trials = int(counts.iloc[0])

# Identify contiguous "blocks" of identical participant IDs.

blocks = data[participant_col].ne(data[participant_col].shift()).cumsum()

block_counts = data.groupby([participant_col, blocks]).size()

# Each participant must appear in exactly one block, and that block

# must have length n_trials.

blocks_per_participant = block_counts.groupby(level=0).size()

invalid_multi_blocks = blocks_per_participant[blocks_per_participant != 1]

invalid_block_sizes = block_counts[block_counts != n_trials]

if not invalid_multi_blocks.empty or not invalid_block_sizes.empty:

raise ValueError(

"Data must be ordered so that each participant's trials appear in "

"a single contiguous block of rows of length n_trials. "

"Participants with non-contiguous or incorrectly sized blocks "

f"were found. Consider sorting your data by '{participant_col}' "

"and, if available, by a trial index column before building the "

"RL likelihood."

)

return int(len(counts)), n_trials

@cpaniaguam this one may have merit actually. Can we unpack this on tues (Apr. 21st)?

Copilot · 2026-03-02T19:01:40Z

+            "Please provide the correct participant column name via "
+            "`participant_col`."
+        )
+


groupby(participant_col) drops NaN participant IDs by default, which can make n_participants/n_trials incorrect without an explicit error. Consider adding a check like data[participant_col].isna().any() and raising a clear ValueError if participant IDs are missing.

Suggested change

# Ensure there are no missing participant IDs, since groupby will drop NaNs

# silently, which would make n_participants / n_trials incorrect.

if data[participant_col].isna().any():

raise ValueError(

f"Column '{participant_col}' contains missing values. "

"Please fill or remove rows with missing participant IDs before "

"calling validate_balanced_panel."

)

Are we checking for NaN participant id's in the data validator?
If yes this is fine here no?

… function

…ble and has required attributes

… callable and properly annotated

Copilot

Pull request overview

Copilot reviewed 8 out of 8 changed files in this pull request and generated 2 comments.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Copilot · 2026-03-02T19:40:46Z

+            )
+
+        # Rearrange data so missing rows come first (no-op when missing_data=False).
+        self.data = _rearrange_data(self.data)
+


_rearrange_data(self.data) changes row order, but the RL logp Op reshapes trials purely by row order into (n_participants, n_trials, ...). If any rows are moved (e.g., when missing_data=True and rt == -999), this will break per-participant trial sequences and invalidate the RL learning dynamics. Since missing-data networks are not supported for RLSSM, consider raising an explicit error when missing_data/deadline handling is requested (or implement a participant-wise rearrangement that preserves within-subject order).

Copilot · 2026-03-02T19:40:47Z

+    counts = data.groupby(participant_col).size()
+    if counts.nunique() != 1:
+        raise ValueError(
+            "Data must form balanced panels: all participants must have the "
+            f"same number of trials. Observed trial counts: {dict(counts)}"
+        )
+
+    return int(len(counts)), int(counts.iloc[0])


validate_balanced_panel() only checks equal trial counts via groupby().size(), but it does not validate that rows are ordered/grouped by participant. The RL likelihood builder (make_rl_logp_func) reshapes arrays with .reshape(n_participants, n_trials, -1) based purely on row order, so interleaved participant rows will silently mix subjects/trials and produce an incorrect likelihood. Consider either (a) enforcing contiguous blocks per participant (and optionally stable-sorting by participant_col + a trial index column if available) or (b) returning a sorted copy of the data and using that downstream.

…preserve trial sequence integrity

… RLSSM initialization

Copilot

Pull request overview

Copilot reviewed 8 out of 8 changed files in this pull request and generated 4 comments.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Copilot · 2026-03-02T20:22:54Z

+        # All RLSSM parameters are treated as trialwise: the Op expects arrays of
+        # length n_total_trials for every parameter, and make_distribution.logp
+        # broadcasts scalar / (1,)-shaped tensors up to (n_obs,) accordingly.
+        params_is_trialwise = [
+            True for param_name in self.params if param_name != "p_outlier"
+        ]
+
+        extra_fields_data = (
+            None
+            if not self.extra_fields
+            else [deepcopy(self.data[field].values) for field in self.extra_fields]
+        )
+
+        assert self.list_params is not None, "list_params should be set"
+        # self.loglik was set to the pytensor Op built in __init__; cast to
+        # narrow the inherited union type so make_distribution's type-checker
+        # accepts it without a runtime penalty.
+        loglik_op = cast("Callable[..., Any] | Op", self.loglik)
+        return make_distribution(
+            rv=self.model_name,
+            loglik=loglik_op,
+            list_params=self.list_params,
+            bounds=self.bounds,
+            lapse=self.lapse,
+            extra_fields=extra_fields_data,
+            params_is_trialwise=params_is_trialwise,
+        )


params_is_trialwise is derived from self.params (excluding p_outlier), but it is passed alongside list_params=self.list_params. If self.list_params includes p_outlier (common in HSSMBase), this makes params_is_trialwise shorter and potentially misaligned with list_params, which can cause incorrect broadcasting or length-check failures in make_distribution. Build params_is_trialwise from self.list_params in the same order, marking p_outlier as non-trialwise.

…ation

Copilot

Pull request overview

Copilot reviewed 8 out of 8 changed files in this pull request and generated 2 comments.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

…ary assertion for list_params

… for independent copies

…ror for unsupported usage

…consistency

…asses-basemodelconfig-only-dict-supported' into inject-RLSSMConfig-directly-into-HSSMBase

…ect position

AlexanderFengler

small comments.

AlexanderFengler · 2026-03-26T22:55:09Z

+        data: pd.DataFrame,
+        rlssm_config: RLSSMConfig,
+        participant_col: str = "participant_id",
+        include: list[dict[str, Any] | Any] | None = None,


What would you call it instead here?
We would want to make that change globally not just for this class I guess.

Either way, would do that as a separate PR.

AlexanderFengler · 2026-03-27T00:22:43Z

+            )
+        if deadline is not False:
+            raise ValueError(
+                "RLSSM does not support `deadline` handling. "


@krishnbera do we actually have a solution for this?

again "[...] yet."

AlexanderFengler · 2026-03-27T00:25:47Z

        """
        # Start with defaults
-        config = cls.config_class.from_defaults(model, loglik_kind)
+        # get_config_class is provided by Config/RLSSMConfig mixin through MRO


why does RLSSMConfig show up here in this file?

All this will be cleaned up after #936 and #931 get merged into their respective base branches.

…ed data constants

…sistency

…LSSMConfig and related tests

…conditional checks

…emoving conditional checks" This reverts commit 7cf8bca.

…dation

…to-HSSMBase Inject rlssm config directly into hssm base

…y-injection-into-model-classes-basemodelconfig-only-dict-supported Handle configs via dependency injection

review-notebook-app · 2026-03-31T15:52:06Z

Check out this pull request on

See visual diffs & provide feedback on Jupyter Notebooks.

Powered by ReviewNB

…oglik_kind key in RLSSMConfig; update model instantiation parameter name

AlexanderFengler

few substantive comments, but this should be very very close.
Thanks @cpaniaguam ( @krishnbera for visibility )

AlexanderFengler · 2026-04-21T01:12:59Z

+for models that couple a reinforcement learning (RL) learning process with a
+sequential sampling decision model (SSM).
+
+The key difference from :class:`HSSM` is the likelihood:


I just want to flag that we should bring this logic up tomorrow (Apr 21st) in the team meeting once more explicitly.

For now I want to make sure this works and we can add models, but reflecting on it a bit, I wonder if we leave some room for harmonization on the table.

AlexanderFengler · 2026-04-21T01:14:06Z

+        # Raise early so the user gets a clear message before model construction.
+        if missing_data is not False:
+            raise ValueError(
+                "RLSSM does not support `missing_data` handling. "


"[...] yet".

AlexanderFengler · 2026-04-21T01:14:28Z

+            )
+        if deadline is not False:
+            raise ValueError(
+                "RLSSM does not support `deadline` handling. "


again "[...] yet."

AlexanderFengler · 2026-04-21T01:21:18Z

+        # Build the differentiable pytensor Op from the annotated SSM function.
+        # This Op supersedes the loglik/loglik_kind workflow: it is passed as
+        # `loglik` to HSSMBase so Config.validate() is satisfied, and
+        # _make_model_distribution() uses it directly without any further wrapping.
+        #
+        # Pass copies of list_params / extra_fields so the closure inside
+        # make_rl_logp_func captures its own isolated list objects.  HSSMBase will
+        # later append "p_outlier" to self.list_params (which is the SAME list
+        # object as `list_params` above), and that mutation must NOT be visible to
+        # the Op's _validate_args_length check at sampling time.
+        loglik_op = make_rl_logp_op(
+            ssm_logp_func=rlssm_config.ssm_logp_func,
+            n_participants=n_participants,
+            n_trials=n_trials,
+            data_cols=list(data_cols),
+            list_params=list(list_params),
+            extra_fields=list(extra_fields),
+        )


@cpaniaguam does this comment have merit?

AlexanderFengler · 2026-04-21T01:22:33Z

+            params_is_trialwise=params_is_trialwise,
+        )
+
+    def _get_prefix(self, name_str: str) -> str:


this utility might be more generically useful than placing it in this .py file?

AlexanderFengler · 2026-04-21T01:23:58Z

+            "Please provide the correct participant column name via "
+            "`participant_col`."
+        )
+


Are we checking for NaN participant id's in the data validator?
If yes this is fine here no?

AlexanderFengler · 2026-04-21T01:24:46Z

+            f"same number of trials. Observed trial counts: {dict(counts)}"
+        )
+
+    return int(len(counts)), int(counts.iloc[0])


@cpaniaguam this one may have merit actually. Can we unpack this on tues (Apr. 21st)?

AlexanderFengler · 2026-04-21T01:33:51Z

    hooks:
      - id: ruff
        args: [--fix, --exit-non-zero-on-fix]
+        exclude: ^tests/


why actually exclude tests from ruff?

AlexanderFengler · 2026-04-21T01:34:13Z

    hooks:
      - id: mypy
        args: [--no-strict-optional, --ignore-missing-imports]
+        exclude: ^tests/


same here. just asking, is that actually a pattern people follow? Hadn't seen that before.

AlexanderFengler · 2026-04-21T01:35:36Z

correct to take this one mostly as copy/paste from our original HSSM class?

cpaniaguam added 4 commits March 2, 2026 12:33

Add ssm_logp_func to RLSSMConfig and update validation tests

20ddc4c

Add RLSSM model and utilities for reinforcement learning integration

d97dcee

Refactor RLSSM parameter handling and add custom prefix resolution fo…

a6a0238

…r RL parameters

Add tests for RLSSM class covering initialization, validation, and mo…

d880977

…del structure

cpaniaguam changed the base branch from main to cp-main-sb March 2, 2026 18:41

Refactor loglik handling in RLSSM to improve type safety with casting

bef8d6c

cpaniaguam requested a review from Copilot March 2, 2026 18:54

Copilot started reviewing on behalf of cpaniaguam March 2, 2026 18:55 View session

Copilot AI reviewed Mar 2, 2026

View reviewed changes

cpaniaguam added 5 commits March 2, 2026 14:29

Add NaN value check for participant column in validate_balanced_panel…

3981ef6

… function

Add validation for ssm_logp_func in RLSSMConfig to ensure it is calla…

d84a800

…ble and has required attributes

Add exclude rules for ruff and mypy hooks to skip tests directory

15ad6e2

Add validation tests for ssm_logp_func in RLSSMConfig to ensure it is…

262ec07

… callable and properly annotated

Add tests for NaN participant_id and unannotated ssm_logp_func in RLSSM

381275a

cpaniaguam requested a review from Copilot March 2, 2026 19:33

Copilot started reviewing on behalf of cpaniaguam March 2, 2026 19:33 View session

Copilot AI reviewed Mar 2, 2026

View reviewed changes

cpaniaguam added 2 commits March 2, 2026 15:13

Reject missing data and deadline handling in RLSSM initialization to …

0e9ba42

…preserve trial sequence integrity

Add tests to validate error handling for missing data and deadline in…

4f28c68

… RLSSM initialization

cpaniaguam requested a review from Copilot March 2, 2026 20:17

Copilot AI reviewed Mar 2, 2026

View reviewed changes

Copilot started reviewing on behalf of cpaniaguam March 2, 2026 20:23 View session

cpaniaguam added 3 commits March 2, 2026 15:25

Refactor path handling for loading RLDM fixture dataset in tests

5e9f566

Add fixture to set floatX to float32 for module tests

67ac2ce

Ensure params_is_trialwise aligns with list_params in RLSSM initializ…

e1c05df

…ation

cpaniaguam requested a review from Copilot March 2, 2026 21:05

Copilot started reviewing on behalf of cpaniaguam March 2, 2026 21:10 View session

Copilot AI reviewed Mar 2, 2026

View reviewed changes

Comment thread src/hssm/rl/rlssm.py Outdated

Comment thread src/hssm/rl/rlssm.py Outdated

cpaniaguam added 2 commits March 2, 2026 16:18

Clarify comments on default_priors in ModelConfig and remove unnecess…

564232b

…ary assertion for list_params

Update RLSSM to use to_numpy(copy=True) for extra_fields and add test…

bafc037

… for independent copies

cpaniaguam added 13 commits March 18, 2026 14:51

Add test to ensure RLSSMConfig.from_defaults raises NotImplementedError

27d505e

Clarify RLSSMConfig.from_defaults behavior and raise NotImplementedEr…

ce8e187

…ror for unsupported usage

Inject JAX backend into RLSSMConfig during initialization

7c7fd32

Refactor RLSSM class to use model_config instead of rlssm_config for …

e604406

…consistency

Merge branch '930-pass-configs-via-dependency-injection-into-model-cl…

582a6fe

…asses-basemodelconfig-only-dict-supported' into inject-RLSSMConfig-directly-into-HSSMBase

Fix merge conflicts with base branch

a3898d7

Remove commented out lines

4d99410

Remove RLSSMConfig import from __init__.py

f04f47e

Reorganize import statements by moving RLSSMConfig import to the corr…

11115af

…ect position

Move RLSSMConfig import to the correct module in test files

6a9384f

Update docstring in __init__.py and exports

0285f04

Remove RLSSMConfig class and its associated methods from config.py

5807a71

Move RLSSMConfig class hssm.rl module

4bf67ea

AlexanderFengler reviewed Mar 27, 2026

View reviewed changes

cpaniaguam added 11 commits March 27, 2026 10:54

Refactor config.py to remove RLSSM-specific defaults and unify observ…

5d74bfe

…ed data constants

Fix formatting of error messages in TestRLSSMConfigValidation for con…

ef000cd

…sistency

Enhance validation in RLSSMConfig for ssm_logp_func attributes

91b1098

Add validation test for non-callable values in ssm_logp_func.computed

c3a4f52

Rename 'learning_process_loglik_kind' to 'learning_process_kind' in R…

692dc5d

…LSSMConfig and related tests

Simplify response and list_params assignment in HSSMBase by removing …

7cf8bca

…conditional checks

Revert "Simplify response and list_params assignment in HSSMBase by r…

c46f923

…emoving conditional checks" This reverts commit 7cf8bca.

Refactor RLSSMConfig to dynamically retrieve required fields for vali…

fac838e

…dation

Update RLSSMConfig to handle field exceptions in from_rlssm_dict method

3084df4

Merge pull request #936 from lnccbrown/inject-RLSSMConfig-directly-in…

babee92

…to-HSSMBase Inject rlssm config directly into hssm base

Merge pull request #931 from lnccbrown/930-pass-configs-via-dependenc…

b0e2179

…y-injection-into-model-classes-basemodelconfig-only-dict-supported Handle configs via dependency injection

cpaniaguam marked this pull request as ready for review March 31, 2026 15:56

cpaniaguam requested a review from AlexanderFengler March 31, 2026 15:57

Fix import path for RLSSM and RLSSMConfig; correct learning_process_l…

1233cd7

…oglik_kind key in RLSSMConfig; update model instantiation parameter name

AlexanderFengler reviewed Apr 21, 2026

View reviewed changes

-    return int(len(counts)), int(counts.iloc[0])
+    # Ensure that each participant's trials form a single contiguous block
+    # of rows of length n_trials. This is required because downstream code
+    # reshapes the data into (n_participants, n_trials, ...) based on row
+    # order, assuming no interleaving across participants.
+    n_trials = int(counts.iloc[0])
+    # Identify contiguous "blocks" of identical participant IDs.
+    blocks = data[participant_col].ne(data[participant_col].shift()).cumsum()
+    block_counts = data.groupby([participant_col, blocks]).size()
+    # Each participant must appear in exactly one block, and that block
+    # must have length n_trials.
+    blocks_per_participant = block_counts.groupby(level=0).size()
+    invalid_multi_blocks = blocks_per_participant[blocks_per_participant != 1]
+    invalid_block_sizes = block_counts[block_counts != n_trials]
+    if not invalid_multi_blocks.empty or not invalid_block_sizes.empty:
+        raise ValueError(
+            "Data must be ordered so that each participant's trials appear in "
+            "a single contiguous block of rows of length n_trials. "
+            "Participants with non-contiguous or incorrectly sized blocks "
+            f"were found. Consider sorting your data by '{participant_col}' "
+            "and, if available, by a trial index column before building the "
+            "RL likelihood."
+        )
+    return int(len(counts)), n_trials

+    # Ensure there are no missing participant IDs, since groupby will drop NaNs
+    # silently, which would make n_participants / n_trials incorrect.
+    if data[participant_col].isna().any():
+        raise ValueError(
+            f"Column '{participant_col}' contains missing values. "
+            "Please fill or remove rows with missing participant IDs before "
+            "calling validate_balanced_panel."
+        )

Conversation

cpaniaguam commented Mar 2, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Uh oh!

Copilot AI Mar 2, 2026

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Copilot AI Mar 2, 2026

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Copilot AI Mar 2, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Mar 2, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Copilot AI Mar 2, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Uh oh!

Uh oh!

AlexanderFengler left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

review-notebook-app bot commented Mar 31, 2026

Uh oh!

AlexanderFengler left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

cpaniaguam commented Mar 2, 2026 •

edited

Loading