Cronbach's alpha
Cronbach's alpha (Cronbach's <math>\alpha</math>) or coefficient alpha (coefficient <math>\alpha</math>), is a reliability coefficient and a measure of the internal consistency of tests and measures.<ref name=c1951>Template:Cite journal</ref><ref name=c1978>Template:Cite journal</ref> It was devised by the American psychometrician Lee Cronbach. Today it enjoys such wide-spread usage that numerous studies warn against using Cronbach's alpha uncritically.<ref name="Sijtsma">Template:Cite journal</ref><ref name="GY">Template:Cite journal</ref><ref name="RZ">Template:Cite journal</ref><ref name="RM">Template:Cite journal</ref><ref name="c2004">Template:Cite journal</ref>
History
In his initial 1951 publication, Lee Cronbach described the coefficient as Coefficient alpha<ref name=c1951/> and included an additional derivation.<ref name="Cronbach">Template:Cite journal</ref> Coefficient alpha had been used implicitly in previous studies,<ref name="Hoyt">Template:Cite journal</ref><ref name="Guttman">Template:Cite journal</ref><ref name="JF">Template:Cite journal</ref><ref name="Gulliksen">Template:Cite book</ref> but his interpretation was thought to be more intuitively attractive relative to previous studies and it became quite popular.<ref>Template:Cite journal</ref>
- In 1967, Melvin Novick and Charles Lewis proved that it was equal to reliability if the true scoresTemplate:Efn of the compared tests or measures vary by a constant, which is independent of the people measured. In this case, the tests or measurements were said to be "essentially tau-equivalent."<ref name="NL">Template:Cite journal</ref>
- In 1978, Cronbach asserted that the reason the initial 1951 publication was widely cited was "mostly because [he] put a brand name on a common-place coefficient."<ref name="c1978" />Template:Rp He explained that he had originally planned to name other types of reliability coefficients, such as those used in inter-rater reliability and test-retest reliability, after consecutive Greek letters (i.e., <math>\beta</math>, <math>\gamma</math>, etc.), but later changed his mind.
- Later, in 2004, Cronbach and Richard Shavelson encouraged readers to use generalizability theory rather than <math>\alpha</math>. Cronbach opposed the use of the name "Cronbach's alpha" and explicitly denied the existence of studies that had published the general formula of KR-20 before Cronbach's 1951 publication of the same name.<ref name="c2004" />
Prerequisites for using Cronbach's alpha
To use Cronbach's alpha as an accurate estimate of reliability, the following conditions must be met:<ref>Template:Cite journal</ref><ref name="Cortina1993">Template:Cite journal</ref>
- The "parts" (i.e. items, test parts, etc.) must be essentially tau-equivalent;
- Errors in the measurements are independent.
However, under the definition of CTT, the errors are defined to be independent.
This is often a source of confusion for users who might consider some aspect of the testing process to be an "error" (rater biases, examinee collusion, self-report faking). Anything that increases the covariance among the parts will contribute to greater true score variance. Under such circumstances, alpha is likely to over-estimate the reliability intended by the user.<ref name="Cortina1993"/><ref name="Sijtsma"/>
Formula and calculation
Reliability can be defined as one minus the error score variance divided by the observed score variance:
<math display="block">\rho_{XX'} = \left(1 - {\sigma^2_E \over \sigma^2_X} \right)</math>
Cronbach's alpha is best understood as a direct estimate of this definitional formula with error score variance estimated as the sum of the variances of each "part" (e.g., items or testlets):<ref name=RM/>
<math display="block">\alpha = {k \over k-1 } \left(1 - {\sum_{i=1}^k \sigma^2_{y_i} \over \sigma_X^2} \right)</math>
where:
- <math>k</math> represents the number of "parts" (items, test parts, etc.) in the measure;
- the <math>k/(k-1)</math> term causes alpha to be an unbiased estimate of reliability when the parts are parallel or essentially tau equivalent;
- <math>\sigma_{y_i}^2</math> the variance associated with each part i; and
- <math>\sigma_X^2</math> the observed score variance (the variance associated with the total test scores).
The reason that the sum of the individual part variances estimates the error score variance is because
<math display="inline">\sigma_X^2 = \sigma_T^2 + \sigma_E^2</math>
and the variance of a composite is equal to twice the sum of all covariances of the parts plus the sum of the variances of the parts: <math>\sigma^2_{X} = \sum \sum \sigma_{y_i,y_j} + \sum \sigma^2_{y_i}</math>. Therefore <math>\sum \sum \sigma_{y_i,y_j}</math> estimates <math display="inline">\sigma_T^2</math> and <math>\sum \sigma^2_{y_i}</math> estimates <math display="inline">\sigma_E^2</math>. It is much easier to compute alpha by summing the part variances (to estimate error score variance) than adding up all the unique part covariances (to estimate true score variance) .
Alternatively, alpha can be calculated through the following formula:<ref>Template:Cite AV media</ref>
- <math> \alpha = {k \bar c \over \bar v + (k - 1) \bar c} </math>
where:
- <math>\bar v</math> represents the average variance
- <math>\bar c</math> represents the average inter-item covariance (or the average covariance of "parts").
Common misconceptions
Application of Cronbach's alpha is not always straightforward and can give rise to common misconceptions.
A high value of Cronbach's alpha indicates homogeneity between the items
Many textbooks refer to <math>\alpha</math> as an indicator of homogeneity<ref>Template:Cite web</ref> between items. This misconception stems from the inaccurate explanation of Cronbach (1951)<ref name = Cronbach/> that high <math>\alpha</math> values show homogeneity between the items. Homogeneity is a term that is rarely used in modern literature, and related studies interpret the term as referring to uni-dimensionality. Several studies have provided proofs or counterexamples that high <math>\alpha</math> values do not indicate uni-dimensionality.<ref name=Cortina>Template:Cite journal</ref><ref name=GLM>Template:Cite journal</ref><ref>Template:Cite journal</ref><ref>Template:Cite journal</ref><ref name=TBC>Template:Cite journal</ref> See counterexamples below.
| <math>X_1</math> | <math>X_2</math> | <math>X_3</math> | <math>X_4</math> | <math>X_5</math> | <math>X_6</math> | |
|---|---|---|---|---|---|---|
| <math>X_1</math> | <math>10</math> | <math>3</math> | <math>3</math> | <math>3</math> | <math>3</math> | <math>3</math> |
| <math>X_2</math> | <math>3</math> | <math>10</math> | <math>3</math> | <math>3</math> | <math>3</math> | <math>3</math> |
| <math>X_3</math> | <math>3</math> | <math>3</math> | <math>10</math> | <math>3</math> | <math>3</math> | <math>3</math> |
| <math>X_4</math> | <math>3</math> | <math>3</math> | <math>3</math> | <math>10</math> | <math>3</math> | <math>3</math> |
| <math>X_5</math> | <math>3</math> | <math>3</math> | <math>3</math> | <math>3</math> | <math>10</math> | <math>3</math> |
| <math>X_6</math> | <math>3</math> | <math>3</math> | <math>3</math> | <math>3</math> | <math>3</math> | <math>10</math> |
<math>\alpha=0.72</math> in the uni-dimensional data above.
| <math>X_1</math> | <math>X_2</math> | <math>X_3</math> | <math>X_4</math> | <math>X_5</math> | <math>X_6</math> | |
|---|---|---|---|---|---|---|
| <math>X_1</math> | <math>10</math> | <math>6</math> | <math>6</math> | <math>1</math> | <math>1</math> | <math>1</math> |
| <math>X_2</math> | <math>6</math> | <math>10</math> | <math>6</math> | <math>1</math> | <math>1</math> | <math>1</math> |
| <math>X_3</math> | <math>6</math> | <math>6</math> | <math>10</math> | <math>1</math> | <math>1</math> | <math>1</math> |
| <math>X_4</math> | <math>1</math> | <math>1</math> | <math>1</math> | <math>10</math> | <math>6</math> | <math>6</math> |
| <math>X_5</math> | <math>1</math> | <math>1</math> | <math>1</math> | <math>6</math> | <math>10</math> | <math>6</math> |
| <math>X_6</math> | <math>1</math> | <math>1</math> | <math>1</math> | <math>6</math> | <math>6</math> | <math>10</math> |
<math>\alpha=0.72</math> in the multidimensional data above.
| <math>X_1</math> | <math>X_2</math> | <math>X_3</math> | <math>X_4</math> | <math>X_5</math> | <math>X_6</math> | |
|---|---|---|---|---|---|---|
| <math>X_1</math> | <math>10</math> | <math>9</math> | <math>9</math> | <math>8</math> | <math>8</math> | <math>8</math> |
| <math>X_2</math> | <math>9</math> | <math>10</math> | <math>9</math> | <math>8</math> | <math>8</math> | <math>8</math> |
| <math>X_3</math> | <math>9</math> | <math>9</math> | <math>10</math> | <math>8</math> | <math>8</math> | <math>8</math> |
| <math>X_4</math> | <math>8</math> | <math>8</math> | <math>8</math> | <math>10</math> | <math>9</math> | <math>9</math> |
| <math>X_5</math> | <math>8</math> | <math>8</math> | <math>8</math> | <math>9</math> | <math>10</math> | <math>9</math> |
| <math>X_6</math> | <math>8</math> | <math>8</math> | <math>8</math> | <math>9</math> | <math>9</math> | <math>10</math> |
The above data have <math>\alpha=0.9692</math>, but are multidimensional.
| <math>X_1</math> | <math>X_2</math> | <math>X_3</math> | <math>X_4</math> | <math>X_5</math> | <math>X_6</math> | |
|---|---|---|---|---|---|---|
| <math>X_1</math> | <math>10</math> | <math>1</math> | <math>1</math> | <math>1</math> | <math>1</math> | <math>1</math> |
| <math>X_2</math> | <math>1</math> | <math>10</math> | <math>1</math> | <math>1</math> | <math>1</math> | <math>1</math> |
| <math>X_3</math> | <math>1</math> | <math>1</math> | <math>10</math> | <math>1</math> | <math>1</math> | <math>1</math> |
| <math>X_4</math> | <math>1</math> | <math>1</math> | <math>1</math> | <math>10</math> | <math>1</math> | <math>1</math> |
| <math>X_5</math> | <math>1</math> | <math>1</math> | <math>1</math> | <math>1</math> | <math>10</math> | <math>1</math> |
| <math>X_6</math> | <math>1</math> | <math>1</math> | <math>1</math> | <math>1</math> | <math>1</math> | <math>10</math> |
The above data have <math>\alpha=0.4</math>, but are uni-dimensional.
Uni-dimensionality is a prerequisite for <math>\alpha</math>. One should check uni-dimensionality before calculating <math>\alpha</math> rather than calculating <math>\alpha</math> to check uni-dimensionality.
A high value of Cronbach's alpha indicates internal consistency
The term "internal consistency" is commonly used in the reliability literature, but its meaning is not clearly defined. The term is sometimes used to refer to a certain kind of reliability (e.g., internal consistency reliability), but it is unclear exactly which reliability coefficients are included here, in addition to <math>\alpha</math>. Cronbach (1951)<ref name = Cronbach/> used the term in several senses without an explicit definition. Cortina (1993)<ref name="Cortina"/> showed that <math>\alpha</math> is not an indicator of any of these.
Removing items using "alpha if item deleted" always increases reliability
Most psychometric software will produce a column labeled "alpha if item deleted" which is the coefficient alpha that would be obtained if an item were to be dropped. For good items, this value is lower than the current coefficient alpha for the whole scale. But for some weak or bad items, the "alpha if item deleted" value shows an increase over the current current coefficient alpha for the whole scale.
Removing an item using "alpha if item deleted" may result in 'alpha inflation,' where sample-level reliability is reported to be higher than population-level reliability.<ref name=KL>Template:Cite journal</ref> It may also reduce population-level reliability.<ref name=r2007>Template:Cite journal</ref> The elimination of less-reliable items should be based not only on a statistical basis but also on a theoretical and logical basis. It is also recommended that the whole sample be divided into two and cross-validated.<ref name=KL/>
Notes
References
External links
- Cronbach's alpha SPSS tutorial
- The free web interface and R package cocoon allow us to statistically compare two or more dependent or independent Cronbach alpha coefficients.