![]() Biology, images, analysis, design... |
|
"It has long been an axiom of mine that the little things are infinitely the most important" |
|
Measures of validity for measurement variables: Use and misuse(Bland Altman plot, calibration, regression, correlation)Statistics courses, especially for biologists, assume formulae = understanding and teach how to do Use and Misuse
Data validation aims to assess measurement bias and ensure that a variable measures what it is supposed to measure. We deal elsewhere For measurement variables, data validation consists of comparing two measurements each on the same subject. The first measurement is of the variable you are able to measure in practice - sometimes called the practical variable. The second measurement is of the true value of the variable, or at least as close as you can get to the true value. This variable is sometimes called the criterion variable, or gold standard. With questionnaire data, the response of the interviewee would be the practical variable - for example if you are asking a farmer to tell you the number of cattle he or she owns. The researcher may carry out a ground-truthing operation and count the number of cattle himself - that would be the criterion variable. Most commonly each variable is measured in the same units, so we are looking to see whether values of the two variables are in agreement. An exception to this is if the practical variable is a proxy or surrogate variable, with quite different units. Here we are interested in whether there is a close and consistent relationship between the proxy and true values, so that we can predict one from the other. Although validation studies of nominal variables frequently appear in the literature, validation studies of measurement variables are far less common. The most widely used analytical techniques for validating measurement variables are correlation and regression. Unfortunately the correlation coefficient We give Use of the Bland-Altman plot seems not to have penetrated the ecological and wildlife literature where it is hard to find any examples of data validation! All too often it is simply assumed that the (proxy variables) trap catches or volunteer counts are meaningful measures of what they are supposed to be measuring - and this assumption is unchecked. Where validation studies are done, linear regression is by far the most commonly used method of analysis. This is perfectly acceptable if the two variables are measured in different units (for example trap catches versus population size), and the plot is being done to enable prediction of the criterion variable from a proxy variable. But where the two variables are measured in the same units there is clearly scope for more extensive use of the Bland-Altman method. Validation studies suffer from some general problems not related to the specific statistical technique used. For example validation studies using questionnaires often only validate some minor point (such as for example herd size) which can be measured easily, and then assume that this 'validity' is somehow proven for all other aspects (such as disease diagnosis). Excessively small sample size is another common problem prevalent in wildlife studies relating different methods of estimating population size. What the statisticians sayBland (2000)![]() ![]() ![]() ![]() ![]() ![]() Bland & Altman (2007) Bland & Altman (2002) Wikipedia provides sections on calibration,
|