InfluentialPoints.com
Biology, images, analysis, design...
 Use/Abuse Principles How To Related
"It has long been an axiom of mine that the little things are infinitely the most important" (Sherlock Holmes)

# Non-parametric correlation and regression

#### Worked example 1

Our worked example uses data from Luiselli (2006) who tested hypotheses on the ecological patterns of rarity using snake communities worldwide. In we looked at whether the percentage of rare species was significantly different among continents. Here we assess whether there is a relationship between the percentage of rare species in an area and species richness. Data are presented below:

 Snake data No. sp. Rank % rare Rank 8241718161215135312202866346176874332359461417181751052717 11.029.022.0 25.5 19.0 14.5 18.0 16.0 4.5 1.014.527.031.0 8.0 8.0 33.0 8.0 22.0 35.0 10.0 2.0 32.0 28.0 4.5 12.034.0 17.0 22.0 25.522.04.5 13.0 4.530.0 22.0 12.520.811.816.718.78.332023.100010250014.7023.514.714.20317.40023.921.45.938.911.8010033.329.4 19.027.017.5 23.025.014.0 26.029.0 6.06.0 6.0 15.5 32.0 6.0 6.0 21.5 6.0 30.021.5 20.0 6.0 12.0 24.0 6.0 6.031.028.013.0 35.017.5 6.015.5 6.034.0 33.0

{Fig. 1}

It is not immediately clear from this figure whether the relationship can be considered monotonic. Hence we do a plot of rank (Y) against rank (X) to try to clarify the situation:

{Fig. 2}

The plot of rank (Y) against rank (X) does indeed appear to be linear (albeit with a lot of scatter), so we can assume we are dealing with a monotonic relationship. Whether the bivariate observations are independent is less clear, as it is possible that the areas of some of the studies overlapped. With that word of caution, we proceed with estimation of the Spearman correlation coefficient.

Using

Since there are ties, we will use the full form of the equation.

 rs = 13832−(35 x 182) √[14892 − (35 x 182)] √[14798.5− (35 x 182)] = 0.7110
where
• rs is the Spearman correlation coefficient,
• R(X) and R(Y) are the ranks of the individual observations of the two variables,
• n is the number of bivariate observations

The sample size is moderate (n = 35) so we can studentize the statistic by dividing by its standard error:

 t = 0.7110 √ = 5.808 35 − 2 1 − 0.71102

The P-value for this value can be obtained from the t distribution (df=2). R gives this as 0.000009 so we accept there is a highly significant positive association between the percentage of rare species in an area and species richness.