Resampling

What Resampling is

Resampling is a statistical method used to estimate the properties of an entire population by analyzing a random sample. It involves repeatedly drawing samples from the population and calculating the desired statistic on each sample. The results of the calculations are then combined to estimate the properties of the entire population.

Steps for Resampling:

  1. Collect a sample of data from the population.

  2. Calculate the statistic of interest on the sample.

  3. Repeat steps 1 and 2 a number of times.

  4. Record the results of each statistic calculation.

  5. Calculate the mean (or other summary statistic) of the results.

  6. Compare the mean to the statistic of interest for the population.

  7. Interpret the results.

Examples

  1. Resampling is commonly used to estimate population parameters from a sample. For example, to estimate the population mean from a given sample, one can use resampling techniques such as bootstrapping or jackknifing.

  2. Resampling is also used to compare two populations to determine if their means are significantly different. For example, a researcher could use resampling to compare the average heights of male and female students in a school to determine if there is a statistically significant difference between the two groups.

  3. Resampling is used to assess the accuracy of a model. For example, a researcher could use resampling to measure how well a machine learning model predicts a binary outcome (e.g. whether a person will default on a loan or not).

Related Topics