Table of Contents
A Bland-Altman plot is a fundamental statistical tool specifically designed to visualize and quantify the agreement between two distinct quantitative measurement methods or instruments. Developed by statisticians Martin Bland and Douglas Altman, this plotting technique has become the standard method for method comparison across numerous scientific disciplines.
The core purpose of employing this plot is to rigorously assess the interchangeability of a new, potentially cheaper or faster measurement technique when compared against an established reference standard, often referred to as the “gold standard.” This assessment is indispensable in critical fields such as clinical research, where patient safety depends on accurate readings, as well as in engineering and laboratory sciences.

The structure of the Bland-Altman plot provides both a visual and quantitative evaluation of agreement. The plot maps specific metrics derived from paired observations: the x-axis consistently represents the average measurement recorded by the two instruments, while the y-axis displays the actual difference between those same measurements (Method 1 minus Method 2).
Interpreting the agreement hinges upon three crucial horizontal lines superimposed onto the scatter plot. These lines define the central tendency and the expected range of discrepancy:
- The central line, which represents the mean difference in measurements between the two instruments. This value is commonly referred to as the systematic bias.
- The upper limit of the 95% confidence interval for the difference, defining the upper bound of the Limits of Agreement (LoA).
- The lower limit of the 95% confidence interval for the difference, defining the lower bound of the Limits of Agreement (LoA).
Utilizing these visual markers, the Bland-Altman analysis allows researchers to determine two primary statistical characteristics that define the relationship between the two measurement methods: systematic error (bias) and random error (variability).
Understanding Systematic Error (Bias)
The first key outcome determined by the plot is the average difference in measurements between the two instruments, which quantifies the systematic error. The horizontal line drawn in the center of the chart represents this mean difference. This specific value is formally known as the bias or systematic error.
If this central bias line is positioned far from zero, it suggests a significant systematic difference in measurements, meaning one method consistently yields results higher or lower than the other across the entire measurement range. Researchers must evaluate whether this degree of systematic error is clinically or practically acceptable for the intended application of the new instrument.
Quantifying Random Error (Limits of Agreement)
The second essential outcome is the determination of the typical range of agreement between the two instruments, which captures the random error. The upper and lower confidence interval lines define the crucial Limits of Agreement (LoA).
By statistical convention, 95% of the differences between the paired measurements are expected to fall within these confidence limits. The distance between the upper and lower LoA lines directly reflects the magnitude of the random differences. A wider distance between these limits indicates greater variability and suggests poorer precision or consistency between the methods, implying that the two instruments may not be reliably interchangeable.
Bland-Altman plots are particularly valuable because they can also reveal proportional bias—a pattern where the difference between methods changes systematically as the magnitude of the measurement increases. Such a pattern would manifest as the data points forming a funnel or wedge shape, requiring more complex statistical treatment than simple bias correction.
Note: A Bland-Altman plot is sometimes referred to by its earlier derivation, the Tukey mean-difference plot. These names describe the same powerful visualization technique used interchangeably across various statistical domains.
Step 1: Collect Paired Measurement Data
To illustrate the methodology, consider a scenario where a researcher seeks to assess the agreement between two different instruments, Instrument A and Instrument B, both designed to measure the weight of biological specimens, specifically frogs, in grams. To conduct this robust method comparison, the researcher must utilize both Instrument A and Instrument B to weigh the identical set of subjects (in this case, 20 frogs).
This initial phase of paired data collection forms the essential foundation of the Bland-Altman analysis. By ensuring that the same subject is measured by both instruments, any subsequent observed difference can be confidently attributed to method-related discrepancy rather than inherent subject variability, which significantly enhances the reliability of the comparison.

Step 2: Calculate the Average Measurement and the Difference
Before plotting can commence, the raw data collected in Step 1 must be transformed into the required X and Y coordinates. This involves two parallel calculations for every single paired observation:
We calculate the average measurement (the X-coordinate, represented mathematically as (A + B) / 2 ) and the difference in measurements (the Y-coordinate, represented mathematically as A – B) for every single observation.
The resulting average value column (X-coordinate) accounts for the overall magnitude of the measurement being taken, ensuring that potential proportional bias is captured. Meanwhile, the difference column (Y-coordinate) precisely captures the disparity, or lack of agreement, between the two instruments for that specific subject.

Step 3: Determine the Mean Difference and Limits of Agreement
The next crucial step involves establishing the exact placement of the three horizontal lines that will guide the interpretation of the plot. These key statistical metrics are calculated solely from the values contained within the Difference column.
First, calculating the average of the values in the Difference column yields the mean difference, also known as the systematic bias. In this illustrative example, the calculated mean difference is exactly 0.5 grams.
Second, the variability of the differences is quantified by calculating the standard deviation (s) of the values in the Difference column. This measure of spread is determined to be 1.235.
Finally, the Limits of Agreement (LoA)—the upper and lower bounds of the 95% confidence interval—are calculated using the mean difference ($bar{x}$) plus or minus 1.96 times the standard deviation (s). This formula defines the range in which 95% of future measurement differences are expected to fall:
Upper Limit of Agreement: $bar{x}$ + 1.96*s = 0.5 + 1.96 * 1.235 = 2.92 grams.
Lower Limit of Agreement: $bar{x}$ – 1.96*s = 0.5 – 1.96 * 1.235 = -1.92 grams.
These derived statistical values allow for a clear, quantitative interpretation regarding the comparison between Instrument A and Instrument B:
- On average, Instrument A consistently exhibits a systematic bias, weighing the specimens 0.5 grams heavier than Instrument B.
- The random variability indicates that 95% of the differences in weight measurements between the two instruments are expected to fall within the range of -1.92 grams and 2.92 grams.
With these statistical benchmarks established, the data is now fully prepared for visual representation in the Bland-Altman plot.
Step 4: Create and Analyze the Plot
The final step synthesizes the data transformation and statistical calculations into the definitive visual tool. We create a scatter plot where the calculated average measurement (X-axis) is plotted against the calculated difference in measurements (Y-axis). The resulting scatter of points immediately reveals the distribution of differences across the entire range of measured values.
We then superimpose the three critical horizontal lines—the mean difference (0.5), the upper Limit of Agreement (2.92), and the lower Limit of Agreement (-1.92). This visualization allows the analyst to quickly identify whether the variability is consistent (homoscedasticity) or whether the random error increases with measurement magnitude (proportional bias). It also helps in identifying potential outliers that fall outside the 95% LoA boundaries, allowing for a rapid and comprehensive assessment of method agreement.

Additional Resources for Statistical Analysis
For those interested in delving deeper into method agreement statistics, further reading on concepts like Cohen’s Kappa for categorical data, or advanced regression techniques for complex comparisons, is highly recommended. Understanding the nuances of systematic error and variability is essential for rigorous scientific practice, and the application of the Bland-Altman method is foundational for validating new diagnostic tools and ensuring measurement stability over time.
Cite this article
Mohammed looti (2025). Understanding Bland-Altman Plots: A Guide to Comparing Measurement Methods. PSYCHOLOGICAL STATISTICS. Retrieved from https://statistics.arabpsychology.com/what-is-a-bland-altman-plot-definition-example/
Mohammed looti. "Understanding Bland-Altman Plots: A Guide to Comparing Measurement Methods." PSYCHOLOGICAL STATISTICS, 5 Nov. 2025, https://statistics.arabpsychology.com/what-is-a-bland-altman-plot-definition-example/.
Mohammed looti. "Understanding Bland-Altman Plots: A Guide to Comparing Measurement Methods." PSYCHOLOGICAL STATISTICS, 2025. https://statistics.arabpsychology.com/what-is-a-bland-altman-plot-definition-example/.
Mohammed looti (2025) 'Understanding Bland-Altman Plots: A Guide to Comparing Measurement Methods', PSYCHOLOGICAL STATISTICS. Available at: https://statistics.arabpsychology.com/what-is-a-bland-altman-plot-definition-example/.
[1] Mohammed looti, "Understanding Bland-Altman Plots: A Guide to Comparing Measurement Methods," PSYCHOLOGICAL STATISTICS, vol. X, no. Y, ص Z-Z, November, 2025.
Mohammed looti. Understanding Bland-Altman Plots: A Guide to Comparing Measurement Methods. PSYCHOLOGICAL STATISTICS. 2025;vol(issue):pages.