Chronos2 forecasting model#112

Open

topepo wants to merge 20 commits into

mainfrom

Member

topepo commented May 20, 2026

This is a pure R torch version of Chronos version 2. It is a pre-trained model; weights are downloaded and cached on first use.

A GPU is not required, but extremely helpful at prediction time. Unlike most other models, this implementation appears to achieve good speedups on Apple GPUs (aka MPS devices).

topepo added 16 commits

May 8, 2026 18:54


          initial files

041e5b9


          steps to miove to a more traditional brulee interface

e94f3eb


          better download for model weights

28f5355


          output as tibble

4ce2b37


          use quantile prediction class in output

202495a


          add note

81877e2


          oy with the dashes!

7a59a1d


          some api changes


          improve testing and docs

b0ca5ae


          ignore extra columns

fec27bc


          fix error in print method

aa325b6


          update GHA

0e7489b


          small doc changes

d38fe54


          more tests

6e16ca4


          Merge branch 'main' into chronos

5e9ca7c


          update tests and docs

258f57a

github-actions Bot reviewed

View reviewed changes

github-actions Bot left a comment

Remaining comments which cannot be posted as a review comment to avoid GitHub Rate Limit

air

[air] _{reported by reviewdog 🐶}

brulee/tests/testthat/test-chronos2-predict.R

Lines 788 to 794 in 258f57a

    
           d_model = 32L, d_ff = 64L, d_kv = 16L, num_heads = 2L, 
        
           num_layers = 1L, dropout_rate = 0.0, layer_norm_epsilon = 1e-6, 
        
           rope_theta = 10000, vocab_size = 2L, pad_token_id = 0L, 
        
           reg_token_id = 1L, context_length = 64L, input_patch_size = 4L, 
        
           input_patch_stride = 4L, output_patch_size = 4L, 
        
           max_output_patches = 4L, quantiles = c(0.1, 0.5, 0.9), 
        
           use_arcsinh = FALSE, use_reg_token = TRUE,

[air] _{reported by reviewdog 🐶}

brulee/tests/testthat/test-chronos2-predict.R

Lines 829 to 835 in 258f57a

    
           d_model = 32L, d_ff = 64L, d_kv = 16L, num_heads = 2L, 
        
           num_layers = 1L, dropout_rate = 0.0, layer_norm_epsilon = 1e-6, 
        
           rope_theta = 10000, vocab_size = 2L, pad_token_id = 0L, 
        
           reg_token_id = 1L, context_length = 64L, input_patch_size = 4L, 
        
           input_patch_stride = 4L, output_patch_size = 4L, 
        
           max_output_patches = 4L, quantiles = c(0.1, 0.5, 0.9), 
        
           use_arcsinh = FALSE, use_reg_token = TRUE,

[air] _{reported by reviewdog 🐶}

brulee/tests/testthat/test-chronos2-predict.R

Lines 863 to 869 in 258f57a

    
           d_model = 32L, d_ff = 64L, d_kv = 16L, num_heads = 2L, 
        
           num_layers = 1L, dropout_rate = 0.0, layer_norm_epsilon = 1e-6, 
        
           rope_theta = 10000, vocab_size = 2L, pad_token_id = 0L, 
        
           reg_token_id = 1L, context_length = 64L, input_patch_size = 4L, 
        
           input_patch_stride = 4L, output_patch_size = 4L, 
        
           max_output_patches = 4L, quantiles = c(0.1, 0.5, 0.9), 
        
           use_arcsinh = FALSE, use_reg_token = TRUE,

[air] _{reported by reviewdog 🐶}

brulee/tests/testthat/test-chronos2-predict.R

Lines 898 to 904 in 258f57a

    
           d_model = 32L, d_ff = 64L, d_kv = 16L, num_heads = 2L, 
        
           num_layers = 1L, dropout_rate = 0.0, layer_norm_epsilon = 1e-6, 
        
           rope_theta = 10000, vocab_size = 2L, pad_token_id = 0L, 
        
           reg_token_id = 1L, context_length = 64L, input_patch_size = 4L, 
        
           input_patch_stride = 4L, output_patch_size = 4L, 
        
           max_output_patches = 4L, quantiles = c(0.1, 0.5, 0.9), 
        
           use_arcsinh = FALSE, use_reg_token = TRUE,

[air] _{reported by reviewdog 🐶}

brulee/tests/testthat/test-chronos2-predict.R

Lines 931 to 937 in 258f57a

    
           d_model = 32L, d_ff = 64L, d_kv = 16L, num_heads = 2L, 
        
           num_layers = 1L, dropout_rate = 0.0, layer_norm_epsilon = 1e-6, 
        
           rope_theta = 10000, vocab_size = 2L, pad_token_id = 0L, 
        
           reg_token_id = 1L, context_length = 64L, input_patch_size = 4L, 
        
           input_patch_stride = 4L, output_patch_size = 4L, 
        
           max_output_patches = 4L, quantiles = c(0.1, 0.5, 0.9), 
        
           use_arcsinh = FALSE, use_reg_token = TRUE,

[air] _{reported by reviewdog 🐶}

brulee/tests/testthat/test-chronos2-predict.R

Line 951 in 258f57a

    
           ctx_list <- list(torch::torch_tensor(rnorm(16), dtype = torch::torch_float32()))

[air] _{reported by reviewdog 🐶}

brulee/tests/testthat/test-chronos2-predict.R

Lines 965 to 971 in 258f57a

    
           d_model = 32L, d_ff = 64L, d_kv = 16L, num_heads = 2L, 
        
           num_layers = 1L, dropout_rate = 0.0, layer_norm_epsilon = 1e-6, 
        
           rope_theta = 10000, vocab_size = 2L, pad_token_id = 0L, 
        
           reg_token_id = 1L, context_length = 64L, input_patch_size = 4L, 
        
           input_patch_stride = 4L, output_patch_size = 4L, 
        
           max_output_patches = 4L, quantiles = c(0.1, 0.5, 0.9), 
        
           use_arcsinh = FALSE, use_reg_token = TRUE,

[air] _{reported by reviewdog 🐶}

brulee/tests/testthat/test-chronos2-predict.R

Lines 1000 to 1006 in 258f57a

    
           d_model = 32L, d_ff = 64L, d_kv = 16L, num_heads = 2L, 
        
           num_layers = 1L, dropout_rate = 0.0, layer_norm_epsilon = 1e-6, 
        
           rope_theta = 10000, vocab_size = 2L, pad_token_id = 0L, 
        
           reg_token_id = 1L, context_length = 64L, input_patch_size = 4L, 
        
           input_patch_stride = 4L, output_patch_size = 4L, 
        
           max_output_patches = 4L, quantiles = c(0.1, 0.5, 0.9), 
        
           use_arcsinh = FALSE, use_reg_token = TRUE,

[air] _{reported by reviewdog 🐶}

brulee/tests/testthat/test-chronos2-predict.R

Lines 1022 to 1023 in 258f57a

    
           tiny_model, tiny_config, torch::torch_device("cpu"), 
        
           inputs, prediction_length = 4L, num_output_patches = 1L

[air] _{reported by reviewdog 🐶}

brulee/tests/testthat/test-chronos2-predict.R

Lines 1035 to 1041 in 258f57a

    
           d_model = 32L, d_ff = 64L, d_kv = 16L, num_heads = 2L, 
        
           num_layers = 1L, dropout_rate = 0.0, layer_norm_epsilon = 1e-6, 
        
           rope_theta = 10000, vocab_size = 2L, pad_token_id = 0L, 
        
           reg_token_id = 1L, context_length = 64L, input_patch_size = 4L, 
        
           input_patch_stride = 4L, output_patch_size = 4L, 
        
           max_output_patches = 4L, quantiles = c(0.1, 0.5, 0.9), 
        
           use_arcsinh = FALSE, use_reg_token = TRUE,

topepo commented

View reviewed changes

R/chronos2-misc.R

+              # Pinned default revision for `amazon/chronos-2`. Bump this deliberately
+              # when we're ready to ship a new set of weights -- never let users silently
+              # track a moving HuggingFace branch.
+              chronos2_default_revision <- function() {

Member Author

topepo May 20, 2026

Maybe we change this along the way as more versions are populated. For now, it's keyed to this particular version.

topepo commented

View reviewed changes

R/chronos2-misc.R

+                  cli::cli_progress_step("Downloading {.url {url}}")
+                  err <- tryCatch(
+                    {
+                      curl::curl_download(url, dest, mode = "wb", quiet = TRUE)

Member Author

topepo May 20, 2026

R's internal file.download() had a lot of issues with downloading this file, so we went with curl.

topepo commented

View reviewed changes

R/chronos2-misc.R

+              chronos2_download <- function(
+                model_id = "amazon/chronos-2",
+                revision = chronos2_default_revision(),
+                cache_dir = file.path(Sys.getenv("HOME"), ".cache", "chronos-r")

Member Author

topepo May 20, 2026

This seems like a good place to put the weights. I'm not aware of there is a canonical location for cached objects related to R.

topepo added 3 commits

May 19, 2026 22:34

air

4db018e


          more air


          enable install of brulee before trying to download the weights

00acf03

topepo marked this pull request as ready for review

May 20, 2026 12:47


          protect against empty data frames

d020141

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet