This paper discusses the issues involved in calculating indices of composite reliability for ‘modular’ or ‘unitised’ assessments of the kind used in GCSEs, AS- and A-level examinations in England. The increasingly widespread use of on-screen marking has meant that the item-level data required for calculating indices of reliability is now routinely available for most (but not all) units of unitised assessments.
Whilst it is relatively straightforward to obtain indices of reliability at unit level, it is far more complex to obtain indices at overall assessment level because of problems created by:
- the number of different possible ‘routes’ to the final assessment;
- the different knowledge, skills and understanding assessed in different units;
- the wide variety in item type and size within and across units;
- the fact that the item-level data required for calculating reliability indices is not available (or does not exist) for certain units; and
- the different intended weighting of different units in the composite total and the possible distortion of these weights by use of the Uniform Mark Scale.