Bayesian bootstrap is not more precice after accounting for oversampling

Hey Matteo -

Thank you for your blog post on the Bayesian boostrap! I've found it quite helpful in adapting to my own problems and gaining a better understanding of the differences between bayesian and classic bootstrap.

I was trying to replicate your analysis by rewriting some of the code, and I noticed that in the two-level sampling part of your blog, you oversample from the dataframe 10x (cell 19). This is the reason you get a more precise / narrow posterior distribution, not just the use of the bayesian boostrap. You can check this yourself by oversampling in your classic bootstrap procedure, which results in this:

<img width="661" alt="image" src="https://user-images.githubusercontent.com/5107405/200388347-3882eeab-75f7-4c06-9b10-4d081ab66836.png">

Within the wider context of the blog post, I think you do need to oversample to account for the rare events cases you describe later in the blog post. If you don't oversample, you're going to have instances of sampling where you won't get the rare event. You could try this for yourself with a regression that's unable to additionally take weights and would require the two-level sampling procedure. This would also result in instances where you might not be able to fit the model (since it is actually resampling) or end up with parameter estimates at extreme values.



Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Bayesian bootstrap is not more precice after accounting for oversampling #3

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Bayesian bootstrap is not more precice after accounting for oversampling #3

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions