Statistics practice problems with answers
Ten exam-style statistics problems with complete worked solutions, covering descriptive statistics, the normal distribution, confidence intervals, hypothesis testing, the binomial distribution, and regression interpretation. Attempt each problem fully before opening the solution — the retrieval attempt is what makes practice testing the most effective study technique in the literature. Free to use, no signup.
Problem 1
For the sample , calculate the sample mean and the sample standard deviation.
Show worked solution
Mean: . Squared deviations: . Sample variance divides by : , so .
Problem 2
Exam scores are normally distributed with mean 70 and standard deviation 8. What proportion of students score above 86?
Show worked solution
Standardize: . From the standard normal table, . About 2.3 percent of students score above 86.
Problem 3
A random sample of observations has mean and sample standard deviation . Construct a 95 percent confidence interval for the population mean.
Show worked solution
The standard error is . With the normal approximation is reasonable, so the interval is , giving . Interpretation: the procedure that generated this interval captures the true mean in 95 percent of repeated samples — not "there is a 95 percent probability the mean is in this interval."
Problem 4
Test against at the 5 percent significance level, given , , .
Show worked solution
Test statistic: . With , the two-sided critical value at the 5 percent level is approximately . Since , we fail to reject — narrowly. Note what this is not: it is not evidence that ; the data are simply not quite strong enough to rule it out at this significance level.
Problem 5
A study reports a p-value of 0.03. Which statement is correct? (a) There is a 3 percent probability the null hypothesis is true. (b) If the null hypothesis were true, data this extreme or more would occur in about 3 percent of samples. (c) The effect has a 97 percent probability of being real.
Show worked solution
Statement (b). The p-value is a conditional probability computed assuming is true: it measures how surprising the observed data would be under the null. It says nothing direct about the probability that the null is true or that the effect is real — those statements require a prior and a Bayesian framework. Confusing (a) for (b) is among the most common errors in applied statistics.
Problem 6
A factory tests whether a batch of components meets specification. Define Type I and Type II errors in this context, and state which error the significance level controls.
Show worked solution
Type I error: rejecting a batch that actually meets specification (a false alarm). Type II error: accepting a batch that is actually defective (a miss). The significance level is the probability of a Type I error the test tolerates; the probability of a Type II error is , and power . Lowering without changing anything else raises — the two errors trade off.
Problem 7
A fair production process produces defective items with probability . In a random sample of 10 items, what is the probability of exactly 2 defects?
Show worked solution
Binomial with , : . There is roughly a 30 percent chance of exactly two defective items.
Problem 8
A regression of exam score () on hours studied () gives . Interpret the slope coefficient, and predict the score for a student who studies 10 hours.
Show worked solution
The slope means each additional hour of study is associated with a 0.8-point higher predicted exam score, on average. Prediction at : . Two cautions: the relationship is an association, not proven causation, and predictions far outside the observed range of (extrapolation) are unreliable.
Problem 9
The correlation between monthly ice cream sales and monthly drowning deaths is . Does ice cream cause drowning? Explain the statistical issue.
Show worked solution
No — this is the classic confounding example. A third variable, summer weather, raises both swimming (hence drownings) and ice cream sales. Correlation measures linear association, not causation; a strong is fully consistent with no causal link between the correlated variables. Establishing causality requires experimental design or causal-inference methods that control for confounders.
Problem 10
Explain why the standard error of the sample mean is , and compute how the standard error changes when the sample size increases from 25 to 100.
Show worked solution
Averaging reduces noise: independent sampling errors partially cancel, and the variance of the mean is , so its standard deviation is . Increasing from 25 to 100 multiplies by 2, halving the standard error. The square root is the practical headache: each halving of the standard error requires quadrupling the sample size.
Keep practicing
Want problems based on your actual course?
These problems are general. Upload your own lecture notes and ExamTeX generates a full practice exam — with an answer key — matched to what your professor actually taught.
Generate a full practice exam from your own notes →