cipi >> {blockstrap}; exp-plan >> {adsasi}; geo-mod >> bin workflow; reg-ord >> resource

ercbk · ercbk · commit e6e850be468e · 2026-02-02T11:02:08.000-05:00
diff --git a/qmd/confidence-and-prediction-intervals.qmd b/qmd/confidence-and-prediction-intervals.qmd
@@ -205,6 +205,8 @@
         -   The BCa approach can be unsatisfactory for relatively small sample sizes
     -   Packages
         -   [{]{style="color: #990000"}[bayesboot](https://github.com/rasmusab/bayesboot){style="color: #990000"}[}]{style="color: #990000"} - Implements Rubin's (1981) Bayesian bootstrap
+        -   [{]{style="color: #990000"}[blockstrap](https://numbats.github.io/blockstrap/){style="color: #990000"}[}]{style="color: #990000"} - Sample complete groups (“blocks”) from a grouped data frame. (just does sampling)
+            -   Implements a simple block bootstrap style sampler: instead of sampling individual rows, you sample entire groups preserving the intra-group structure.
         -   [{]{style="color: #990000"}[boot.pval](https://cran.r-project.org/web/packages/boot.pval/){style="color: #990000"}[}]{style="color: #990000"} - Computation of bootstrap p-values through inversion of confidence intervals, including convenience functions for regression models.
             -   Linear models fitted using `lm`,
             -   Generalized linear models fitted using `glm` or `glm.nb`,
diff --git a/qmd/experiments-planning.qmd b/qmd/experiments-planning.qmd
@@ -114,6 +114,9 @@
 
 -   Packages
 
+    -   [{]{style="color: #990000"}[adsasi](https://cran.r-project.org/web/packages/adsasi/index.html){style="color: #990000"}[}]{style="color: #990000"} - Adaptive Sample Size Simulator
+        -   The user writes a function that takes as argument a sample size and returns a boolean (for whether or not the trial is a success). The 'adsasi' functions will then use it to find the correct sample size empirically.
+        -   The unavoidable mis-specification is obviated by trying sample size values close to the right value, the latter being understood as the value that gives the probability of success the user wants (usually 80 or 90% in biostatistics, corresponding to 20 or 10% type II error).
     -   [{]{style="color: #990000"}[BayesPower](https://cran.r-project.org/web/packages/BayesPower/index.html){style="color: #990000"}[}]{style="color: #990000"} - Sample Size and Power Calculation for Bayesian Testing with Bayes Factor
     -   [{]{style="color: #990000"}[gsMAMS](https://cran.r-project.org/web/packages/gsMAMS/index.html){style="color: #990000"}[}]{style="color: #990000"} ([Vignette](https://joss.theoj.org/papers/10.21105/joss.06322)): an R package for Designing Multi-Arm Multi-Stage Clinical Trials
         -   For designing group sequential multi-arm multi-stage (MAMS) trials with continuous, ordinal, and survival outcomes, which is computationally very efficient even for a number of stages greater than 3.
@@ -150,7 +153,7 @@
 -   [Sample Size Justification](https://online.ucpress.edu/collabra/article/8/1/33267/120491/Sample-Size-Justification) (See article for more details on each type)
 
     |  |  |
-    |:-----------------------------------|:-----------------------------------|
+    |:---|:---|
     | Type of justification  | When is this justification applicable?  |
     | Measure entire population  | A researcher can specify the entire population, it is finite, and it is possible to measure (almost) every entity in the population.  |
     | Resource constraints  | Limited resources are the primary reason for the choice of the sample size a researcher can collect.  |
@@ -162,7 +165,7 @@
 -   Considerations when deciding on an effect size (See [Sample Size Justification](https://online.ucpress.edu/collabra/article/8/1/33267/120491/Sample-Size-Justification) \>\> What is Your Inferential Goal? for more details)
 
     |  |  |
-    |:-----------------------------------|:-----------------------------------|
+    |:---|:---|
     | Type of evaluation  | Which question should a researcher ask?  |
     | Smallest effect size of interest  | What is the smallest effect size that is considered theoretically or practically interesting?  |
     | The minimal statistically detectable effect  | Given the test and sample size, what is the critical effect size that can be statistically significant?  |
diff --git a/qmd/geospatial-modeling.qmd b/qmd/geospatial-modeling.qmd
@@ -351,7 +351,7 @@
 -   [{gstat}]{style="color: #990000"} Kriging Functions
 
     | Function | Description |
-    |------------------------------------|------------------------------------|
+    |----|----|
     | `krige` | Simple, Ordinary or Universal, global or local, Point or Block Kriging, or simulation |
     | `krige.cv` | kriging cross validation, n-fold or leave-one-out |
     | `krigeSTTg` | Trans-Gaussian spatio-temporal kriging |
@@ -679,11 +679,19 @@
             -   At least 100 location pairs in the first bin
             -   There should 6–8 bins before the sill (to fit the variogram model)
         -   Try Sensitivity Analysis for low location counts (e.g. 53 locations)
-            1.  Start with $\Delta \approx 40$ which is [width = 40]{.arg-text} (first bin \~60 pairs; noisy but usable)
-            2.  Fit only *simple* variogram models (Exp, spherical)
-            3.  Downweight or ignore the first bin if needed
-            4.  Check sensitivity of the *range estimate* to:
-                -   $\Delta = 30, 20, 15, \text{and maybe}\;50$
+        -   Workflow
+            1.  Choose $N_{\text{min}}$ (See RSE table) (e.g. 75 effective pairs)
+            2.  Compute $\Delta_{\text{min}}$\
+                $$
+                \Delta_{\text{min}} = \sqrt{\frac{2N_{\text{min}}H^2}{n(n-1)}}
+                $$
+            3.  Assess sensitivity of range estimate for $\Delta  \gt \Delta_{\text{min}}$
+                1.  If $\Delta_{\text{min}} \approx 40$ , then start with $\Delta = 40$ which is [width = 40]{.arg-text} (first bin \~60 pairs; noisy but usable)
+                2.  Fit only *simple* variogram models (Exp, spherical)
+                3.  Downweight or ignore the first bin if needed
+                4.  Check sensitivity of the *range estimate* to:
+                    -   $\Delta = 30, 20, 15, \text{and maybe}\;50$
+            4.  If results are unstable $\rightarrow$ data limitation, not a tuning failure
 
 #### Types {#sec-geo-gmod-interp-krig-types .unnumbered}
 
diff --git a/qmd/llms-mcp.qmd b/qmd/llms-mcp.qmd
@@ -36,6 +36,11 @@
     -   [R Econometrics MCP Server](https://github.com/gojiplus/rmcp/)
     -   [Posit Skills](https://github.com/posit-dev/skills) for Claude Code (Claude Skills)
         -   Package development, Testing, Shiny, and Quarto brand_yml, Crafting Release Posts
+    -   [Claude Code R Skills: A curated collection of Claude Code configurations for modern R use](https://github.com/ab604/claude-code-r-skills)
+        -   Modular Skills (tidyverse, rlang, performance, OOP, testing)
+        -   Enforcement Rules (security, testing, git workflow)
+        -   Workflow Commands (planning, code review, TDD)
+        -   Context Management Hooks
 
 -   Using MCP servers rather than the cloud service CLI tools (e.g. BigQuery CLI) provides better security control over what LLM products (e.g. Claude Code) can access, especially for handling sensitive data that requires logging or has potential privacy concerns.
 
diff --git a/qmd/regression-ordinal.qmd b/qmd/regression-ordinal.qmd
@@ -80,6 +80,7 @@
         -   Uses [{mgcv}]{style="color: #990000"} for modeling and [{sure}]{style="color: #990000"} diagnostics
 -   Resources
     -   [Chapter 23](https://bookdown.org/content/3686/ordinal-predicted-variable.html) (Kurz's brms, tidyverse version), Doing Bayesian Analysis by Kruschke
+    -   [Ordinal regression models made easy: A tutorial on parameter interpretation, data simulation and power analysis](https://onlinelibrary.wiley.com/doi/full/10.1002/ijop.13243) (Also in R \>\> Documents \>\> Regression \>\> ordinal)
 -   Power and Sample Size Calculations for a Proportional Odds Model ([Harrell](https://www.fharrell.com/post/pop/))
 -   Paired data: Use robust cluster sandwich covariance adjustment to allow ordinal regression to work on paired data. ([Harrell](https://twitter.com/f2harrell/status/1690837549340598272?s=20))
 -   Ordered probit regression: This is very, very similar to running an ordered logistic regression. The main difference is in the interpretation of the coefficients.
@@ -2133,7 +2134,7 @@
 
 -   As with a binary outcome, the logit and probit analysis will nearly always lead to the same conclusions
 
--   The coefficients of represent the change in the z-score (standard normal quantile) for being at or below a certain category for a one-unit change in the predictor.
+-   The coefficients of represent the change in the z-score (standard normal quantile) for being at or below a certain category of the response for a one-unit change in the predictor.
 
     -   [Example]{.ribbon-highlight}:\
         ![](_resources/Regression,_Ordinal.resources/po-probit-coef-interp-1.webp){.lightbox width="532"}