July/August 2011 Statistical IntervalsPart 1: The Confidence Interval
Statistical intervals play an important role in statistical inference. But practitioners often confuse the difference among confidence, prediction and tolerance type intervals. This article series will clarify these differences and indicate when each type of interval should be used.Q: What is a confidence interval? A: The most common type of statistical interval used in practice is the confidence interval (CI).1 CIs apply to unknown parameters of either a probability distribution or some other statistical model, such as the parameters in a regression model. Common examples of parameters to which this might be applied include a mean, a standard deviation, a percentile or a proportion. To construct a two-sided CI for an unknown parameter, we start with some observed data representing the model we are using and proceed to calculate the confidence limits a and b (with a < b) such that it may be claimed that the parameter lies between a and b with the stated confidence. We say that the interval would capture or cover the parameter of interest with the stated confidence. The term confidence refers to the long run proportion of the time (probability) that the interval would capture the parameter of interest. For example, if the unknown parameter is the mean, μ, of a normal distribution, the two-sided CI would take the form a ≤ μ ≤ b. For one-sided intervals, this would take the form μ ≤ b or μ ≥ a. It is common practice to call the confidence value a confidence coefficient and denote it as 100(1 - α)%, where 0 ≤ α ≤ 1. In other words, there is a probability of 1 - α of selecting a sample for which the CI will contain the true value of μ. Confidence coefficients of 90 percent, 95 percent and 99 percent are commonly used. The manner in which the CI end points are calculated is context dependent. If the parameter of interest is a mean and the standard deviation is known, then the CI calculation is from Equation 1, where the standard normal distribution is used to produce the desired confidence level. The quantity zα/2 is the upper 100(α/2) percentage point from the standard normal distribution, such that Example 1
This CI indicates that the true value for the mean impact energy for A238 steel at 60oC lies between 63.84J and 65.08J with 95 percent confidence. In practice, any CI either contains the true value or it does not. The correct interpretation of the CI is to say that if many random samples were taken, and a 95 percent CI were computed from each sample, then 95 percent of these intervals would contain the true value of the mean impact energy, μ. In other words, 5 percent of these CIs would fail to contain μ. Since σ is typically unknown, we must estimate it with the sample standard deviation s and use Equation 2, where the student’s t distribution is used to produce the desired confidence level.
The quantity tα/2 is the upper 100(α/2) percentage point from student’s t distribution with n - 1 degrees of freedom such that The interval construction in Equation 2 is said to capture or contain the unknown mean, μ, with the stated confidence 100(1 - α)%. For sample mean and sample standard deviation s, we can write a probability statement for Equation 2, assuming that the sample comes from a normal distribution or that the sample size is at least 30. This statement takes the form of Equation 3.
Equation 3 is a true probability statement because Example 2 Table 1 — Alloy Specimen Load at Failure, in Megapascals The sample statistics are:
Based on the central limit theorem, we can also state a more general form for a confidence interval for any parameter of interest, say θ. Let
Suppose that we wish to construct a confidence interval for a population proportion, p. When we observe x observations in the sample of size n that belong to a class of interest,
Example 3
To summarize, a confidence interval for an unknown parameter of a statistical model is a random interval that probably covers or contains the true value of the parameter for the sample we are using. The containment probability, called confidence, does not apply to the particular interval we are calculating. Confidence applies to the run proportion of such intervals that would be expected to cover the parameter if we were to use many samples of the same size under the same conditions. Reference Stephen N. Luko, Hamilton Sundstrand, Windsor Locks, Conn., is the immediate past chairman of Committee E11 on Quality and Statistics, and a fellow of ASTM International. Dean V. Neubauer, Corning Inc., Corning, N.Y., is an ASTM fellow; he serves as vice chairman of Committee E11 on Quality and Statistics, chairman of Subcommittee E11.30 on Statistical Quality Control, chairman of E11.90.03 on Publications and coordinator of the DataPoints column. In the next article in this series, we will discuss prediction intervals for a future observation. | |||||||