rmk118
diff --git a/‎CONDUCT.md‎
Lines changed: 13 additions & 0 deletions b/‎CONDUCT.md‎
Lines changed: 13 additions & 0 deletions
diff --git a/‎CONTRIBUTING.md‎
Lines changed: 41 additions & 0 deletions b/‎CONTRIBUTING.md‎
Lines changed: 41 additions & 0 deletions
diff --git a/‎LICENSE‎
Lines changed: 0 additions & 2 deletions b/‎LICENSE‎
Lines changed: 0 additions & 2 deletions
diff --git a/‎R/density_int.R‎
Lines changed: 197 additions & 0 deletions b/‎R/density_int.R‎
Lines changed: 197 additions & 0 deletions
diff --git a/‎R/infl_pt.R‎
Lines changed: 7 additions & 7 deletions b/‎R/infl_pt.R‎
Lines changed: 7 additions & 7 deletions
diff --git a/‎R/piecewise.R‎
Lines changed: 2 additions & 2 deletions b/‎R/piecewise.R‎
Lines changed: 2 additions & 2 deletions
diff --git a/‎README.Rmd‎
Lines changed: 16 additions & 2 deletions b/‎README.Rmd‎
Lines changed: 16 additions & 2 deletions
diff --git a/‎README.md‎
Lines changed: 17 additions & 2 deletions b/‎README.md‎
Lines changed: 17 additions & 2 deletions
@@ -0,0 +1,13 @@
+# Contributor Code of Conduct
+
+As contributors and maintainers of this project, we pledge to respect all people who contribute through reporting issues, posting feature requests, updating documentation, submitting pull requests or patches, and other activities.
+
+We are committed to making participation in this project a harassment-free experience for everyone, regardless of regardless of age, body size, visible or invisible disability, ethnicity, sex characteristics, gender identity and expression, level of experience, education, socioeconomic status, nationality, personal appearance, race, caste, color, religion, or sexual identity and orientation.
+
+Examples of unacceptable behavior by participants include the use of sexual language or imagery, derogatory comments or personal attacks, trolling, public or private harassment, insults, or other unprofessional conduct.
+
+Project maintainers have the right and responsibility to remove, edit, or reject comments, commits, code, wiki edits, issues, and other contributions that are not aligned to this Code of Conduct. Project maintainers who do not follow the Code of Conduct may be removed from the project team.
+
+Instances of abusive, harassing, or otherwise unacceptable behavior may be reported by opening an issue or contacting one or more of the project maintainers.
+
+This Code of Conduct is adapted from the [Contributor Covenant](https://www.contributor-covenant.org/), version 2.1, available at <https://www.contributor-covenant.org/version/2/1/code_of_conduct.html>
@@ -0,0 +1,41 @@
+# Contributing
+
+We love pull requests from everyone. By participating in this project, you
+agree to abide by our [code of conduct](CONDUCT.md).
+
+## Getting Started
+
+* Make sure you have a [GitHub account](https://github.com/signup/free). If you are not familiar with git and GitHub, take a look at <http://happygitwithr.com/> to get started.
+* [Submit a post for your issue](https://github.com/rmk118/morphmat/issues), assuming one does not already exist.
+  * Clearly describe your issue, including steps to reproduce when it is a bug, or some justification for a proposed improvement.
+* [Fork](https://github.com/rmk118/morphmat/fork) the repository on GitHub or in your shell terminal to make a copy of the repository on your account.
+    
+## Making changes
+
+* Before you make a Pull Request, make sure you have discussed your proposed change in an issue post and that the team support your proposed change.
+* We recommend that you create a Git branch for each pull request (PR).
+* Edit the files, save often, and make commits of logical units, where each commit indicates one concept
+* Follow the Tidyverse [style guide](https://style.tidyverse.org/).
+* Make sure you write [good commit messages](http://tbaggery.com/2008/04/19/a-note-about-git-commit-messages.html).
+* We use testthat. Contributions with test cases included are easier to accept.
+* For user-facing changes, add a bullet to the top of NEWS.md below the current development version header describing the changes made followed by your GitHub username, and links to relevant issue(s)/PR(s).
+* Run _all_ the tests using `devtools::check()` to assure nothing else was accidentally broken.
+* If you need help or unsure about anything, post an update to [your issue](https://github.com/rmk118/morphmat/issues/).
+
+## Submitting your changes
+
+Push to your fork and [submit a pull request](https://github.com/rmk118/morphmat/compare/).
+
+At this point you're waiting on us. We like to at least comment on pull requests within a few days (and, typically, one business day). We may suggest some changes or improvements or alternatives.
+
+Some things you can do that will increase the chance that your pull request is accepted:
+
+* Engage in discussion on [your issue](https://github.com/rmk118/morphmat/issues/).
+* Be familiar with the background literature cited in the [README](README.Rmd)
+* Write tests that pass `devtools::check()`.
+* Follow the [Tidyverse style guide](https://style.tidyverse.org/).
+
+## Acknowledgements
+
+This contributing guide was modified from the CONTRIBUTING.md file from the rrtools package (https://github.com/benmarwick/rrtools/blob/master/CONTRIBUTING.md), an excellent collection of tools for writing reproducible reports and articles in R.
+
@@ -0,0 +1,197 @@
+#' Distribution interval derived (DID) cutline method
+#'
+#' @description Implementation of the method described by Richar & Foy (2022) (DOI:
+#'   10.1139/facets-2021-0061).
+#'
+#' @param dat data frame or matrix containing the data
+#' @param xvar Name of column (or integer or double vector) containing
+#'   measurements for the x-axis variable (e.g., carapace width).
+#' @param yvar Name of column (or integer or double vector) containing
+#'   measurements for the y-axis variable (e.g., claw height).
+#' @param log Boolean; should both variables be log-transformed before
+#'   performing the regression? Defaults to FALSE.
+#' @param upper Integer or double; the upper bound for possible SM50 values.
+#'   Must be on the same scale as the data. Defaults to the 80th percentile of
+#'   the x-variable.
+#' @param lower Integer or double; the lower bound for possible SM50 values.
+#'   Must be on the same scale of the data. Defaults to the 20th percentile of
+#'   the x-variable.
+#' @param int_num Integer; how many intervals between the lower and upper bound
+#'   should be used? Defaults to 25. With fewer intervals, each interval will
+#'   contain more points, increasing the accuracy of the estimated density
+#'   minimum for a given interval. However, the linear regression of the minima
+#'   distributions (the divisions between immature and mature individuals within
+#'   an interval) against the midpoints of those intervals may be more reliable
+#'   with more intervals.
+#' @param plot Boolean; should a plot of the data with the calculated minima and
+#'   discriminating line be displayed?
+#' @param adjust the bandwidth used for the kernel density estimate is actually
+#'   adjust*bw. This makes it easy to specify values like ‘half the default’
+#'   bandwidth.
+#'
+#' @returns Something
+#'
+#' @examples
+#' set.seed(12)
+#' fc <- fake_crustaceans(n = 1000, L50 = 100, allo_params = c(1, 0.2, 1.1, 0.2))
+#' density_int(dat = fc, xvar = "x", yvar = "y", upper = 120)
+#' density_int(dat = fc, xvar = "x", yvar = "y", upper = log(120), log = TRUE)
+density_int <- density_int <- function(dat,
+                                       xvar,
+                                       yvar,
+                                       lower = NULL,
+                                       upper = NULL,
+                                       int_num = 25,
+                                       log = FALSE,
+                                       plot = FALSE,
+                                       adjust = 1) {
+  if (isTRUE(log)) {
+    dat$xvar <- log(dat[[xvar]])
+    dat$yvar <- log(dat[[yvar]])
+  }
+  else {
+    dat$xvar <- dat[[xvar]]
+    dat$yvar <- dat[[yvar]]
+  }
+
+  if (is.null(lower)) {
+    lower <- stats::quantile(dat$xvar, 0.2, names = FALSE)
+  }
+
+  if (is.null(upper)) {
+    upper <- stats::quantile(dat$xvar, 0.8, names = FALSE)
+  }
+
+  int_width <- (upper - lower) / int_num
+
+  i <- 1
+  int_bottom <- lower
+  df_ints <- data.frame(
+    int_bottom = rep(NA, int_num),
+    int_top = rep(NA, int_num),
+    min = rep(NA, int_num),
+    n_obs = rep(NA, int_num)
+  )
+
+  ##### BEGIN LOOP
+  while (i < int_num + 1) {
+    int_top_temp <- int_bottom + int_width
+    temp_df <- dat %>% filter(xvar >= int_bottom, xvar <= int_top_temp)
+    n_obs <- nrow(temp_df)
+
+    if (n_obs < 5) {
+      abort(
+        paste(
+          "Each interval must contain at least 5 data points. The interval from",
+          round(int_bottom, 3),
+          "to",
+          round(int_top_temp, 3),
+          "only contains",
+          n_obs,
+          "points.",
+          sep = " "
+        )
+      )
+    }
+
+    df_ints$int_bottom[i] <- int_bottom
+    df_ints$int_top[i] <- int_top_temp
+    df_ints$n_obs[i] <- n_obs
+
+    # compute a kernel density estimate
+    density_test <- stats::density(temp_df$yvar, adjust = adjust)
+
+    # convert into a data frame
+    density_test <- data.frame(x = density_test$x, density = density_test$y)
+
+    span <- 5
+
+    # find the local maxima - should be two modes
+    density_test$is_max <- splus2R::peaks(x = density_test$density,
+                                          span = span,
+                                          strict = TRUE)
+    modes <- density_test %>%
+      dplyr::filter(.data$is_max == TRUE) %>%
+      dplyr::pull(x)
+
+
+    while(length(modes) > 2) {
+      span <- span + 2
+
+      density_test$is_max <- splus2R::peaks(x = density_test$density,
+                                            span = span,
+                                            strict = TRUE)
+      modes <- density_test %>%
+        dplyr::filter(.data$is_max == TRUE) %>%
+        dplyr::pull(x)
+
+    }
+
+    if(length(modes) < 2) {
+      #   int_num <- int_num - 1
+      #   int_width <- (upper - lower) / int_num
+      #   i <- 1
+      #   int_bottom <- lower
+      #   df_ints <- data.frame(
+      #     int_bottom = rep(NA, int_num),
+      #     int_top = rep(NA, int_num),
+      #     min = rep(NA, int_num),
+      #     n_obs = rep(NA, int_num)
+      #   )
+      abort(
+        paste(
+          "Each interval should contain two peaks in the density of points along the y-axis. The interval from",
+          round(int_bottom, 3),
+          "to",
+          round(int_top_temp, 3),
+          "only contains",
+          length(modes),
+          "peaks. Try decreasing the int_num argument or changing the adjust argument, which is a multiplier for the smoothing bandwidth.",
+          sep = " "
+        )
+      )
+    }
+    # else {
+    between_modes <- density_test %>% filter(x > modes[1], x < modes[2])
+
+    interval_min <- between_modes[which.min(between_modes$density), "x"]
+
+    df_ints$min[i] <- interval_min
+
+    int_bottom <- int_top_temp
+    i <- i + 1
+    # }
+  } # end loop
+
+  df_ints$midpt <- (df_ints$int_bottom + df_ints$int_top) / 2
+
+  # optionally visualize the data with the discriminant line
+  if (plot == TRUE) {
+    if (log == TRUE) {
+      xlab <- paste0("ln(", xvar, ")")
+      ylab <- paste0("ln(", yvar, ")")
+    }
+    else {
+      xlab <- xvar
+      ylab <- yvar
+    }
+
+    lm_density <- stats::lm(min ~ midpt, data = df_ints)
+    pred_line <- data.frame(x = dat$xvar,
+                            y = stats::predict(lm_density, data.frame(midpt = dat$xvar)))
+    print(
+      ggplot2::ggplot() +
+        ggplot2::geom_point(data = dat, aes(x = xvar, y = yvar)) +
+        ggplot2::geom_point(
+          data = na.omit(df_ints),
+          aes(x = midpt, y = min),
+          color = "red"
+        ) +
+        ggplot2::geom_line(data = pred_line, aes(x, y)) +
+        ggplot2::labs(x = xlab, y = ylab) +
+        ggplot2::theme_light()
+    )
+  }
+
+  return(df_ints)
+}
@@ -14,10 +14,10 @@
 #' y-var/x-var ratio for all points, then finding the local minimum separating
 #' the two peaks representing the maturity clusters.
 #'
-#' @param dat optional data frame or matrix containing the data
-#' @param x Name of column (or integer or double vector) containing measurements
+#' @param dat data frame or matrix containing the data
+#' @param xvar Name of column (or integer or double vector) containing measurements
 #'   for the x-axis variable (e.g., carapace width).
-#' @param y Name of column (or integer or double vector) containing measurements
+#' @param yvar Name of column (or integer or double vector) containing measurements
 #'   for the y-axis variable (e.g., claw height).
 #' @param log Boolean; should both variables be log-transformed before performing the
 #'   regression? Defaults to FALSE.
@@ -42,13 +42,13 @@
 #' infl_pt(fc, "x", "y", plot = TRUE)
 #' infl_pt(fc, "x", "y", log = TRUE, plot = TRUE)
 #'
-infl_pt <- function(dat, x, y, log = FALSE, plot = FALSE) {
+infl_pt <- function(dat, xvar, yvar, log = FALSE, plot = FALSE) {
   # find the ratio between the two morphometric variables
   if (isTRUE(log)) {
-    ratio <- log(dat[[y]])/log(dat[[x]])
+    ratio <- log(dat[[yvar]])/log(dat[[xvar]])
   }
   else {
-    ratio <- dat[[y]]/dat[[x]]
+    ratio <- dat[[yvar]]/dat[[xvar]]
   }
 
 
@@ -60,7 +60,7 @@ infl_pt <- function(dat, x, y, log = FALSE, plot = FALSE) {
 
   # find the local minimum between the two peaks
   density_test$is_min <- splus2R::peaks(
-    x = -density_test$y, span = 3, strict = FALSE)
+    x = -density_test$y, span = 3, strict = TRUE)
 
   min <- density_test %>%
     dplyr::filter(.data$is_min == TRUE) %>%
 
@@ -19,10 +19,10 @@
 #' @param log Boolean; should both variables be log-transformed before performing the
 #'   regression? Defaults to FALSE.
 #' @param upper Integer or double; the upper bound for possible SM50 values.
-#'   Must be on the same trans of the data. Defaults to the 80th percentile of
+#'   Must be on the same scale as the data. Defaults to the 80th percentile of
 #'   the x-variable.
 #' @param lower Integer or double; the lower bound for possible SM50 values.
-#'   Must be on the same trans of the data. Defaults to the 20th percentile of
+#'   Must be on the same scale of the data. Defaults to the 20th percentile of
 #'   the x-variable.
 #'
 #' @returns An estimate of SM50 from the specified method(s).
 
@@ -24,6 +24,8 @@ knitr::opts_chunk$set(
 
 <!-- badges: end -->
 
+## Description
+
 A compilation of methods used to estimate size at (sexual) maturity based on morphometric data, most commonly applied to crabs, lobsters, and other crustaceans. Approaches include modeling approaches based on piecewise/segmented linear regression as well as numerous clustering-based methods.
 
 `morphmat` is intended to help fisheries scientists and managers to implement the multitude of computational methods that have been developed for estimating crustacean size at maturity. This package is being developed as part of my graduate work at the University of Maine. **This is still a work in progress and should not yet be used for research purposes.**
@@ -57,8 +59,9 @@ The following scripts do not use morphometric data and require individuals to al
 You can install the development version of morphmat from [GitHub](https://github.com/) with:
 
 ``` r
-# install.packages("devtools")
-devtools::install_github("rmk118/morphmat")
+remotes::install_github("rmk118/morphmat")
+# or
+pak::pak("rmk118/morphmat")
 ```
 
 ## Articles/vignettes
@@ -110,4 +113,15 @@ mod <- glm(data = out_df, pred_mat_num ~ x, family = binomial(link = "logit"))
 unname(-coef(mod)[1] / coef(mod)[2])
 ```
 
+
+## Contributing
+
+If you would like to contribute to this project, please start by reading the [Guide to Contributing](CONTRIBUTING.md). Please note that this project is released with a [Contributor Code of Conduct](CONDUCT.md). By participating in this project you agree to abide by its terms.
+
+## How to Cite
+
+> Krasnow, R. (2025). morphmat:
+> An R package to estimate crustacean size at maturity using morphometric data.
+> <https://ruby.science/morphmat>.
+
 ## References
@@ -13,6 +13,8 @@ public.](https://www.repostatus.org/badges/latest/wip.svg)](https://www.repostat
 
 <!-- badges: end -->
 
+## Description
+
 A compilation of methods used to estimate size at (sexual) maturity
 based on morphometric data, most commonly applied to crabs, lobsters,
 and other crustaceans. Approaches include modeling approaches based on
@@ -74,8 +76,9 @@ You can install the development version of morphmat from
 [GitHub](https://github.com/) with:
 
 ``` r
-# install.packages("devtools")
-devtools::install_github("rmk118/morphmat")
+remotes::install_github("rmk118/morphmat")
+# or
+pak::pak("rmk118/morphmat")
 ```
 
 ## Articles/vignettes
@@ -140,6 +143,18 @@ unname(-coef(mod)[1] / coef(mod)[2])
 #> [1] 77.70282
 ```
 
+## Contributing
+
+If you would like to contribute to this project, please start by reading
+the [Guide to Contributing](CONTRIBUTING.md). Please note that this
+project is released with a [Contributor Code of Conduct](CONDUCT.md). By
+participating in this project you agree to abide by its terms.
+
+## How to Cite
+
+> Krasnow, R. (2025). morphmat: An R package to estimate crustacean size
+> at maturity using morphometric data. <https://ruby.science/morphmat>.
+
 ## References
 
 <div id="refs" class="references csl-bib-body hanging-indent"