Text Size: Default Text Size Text Size Medium Text Size Large
Block M. Skip to Main Content
University of Minnesota

What's Inside

The two most common types of grading systems used at the University of Minnesota are norm-referenced and criterion-referenced. Many professors combine elements of each of these systems for determining student grades by using a system of anchoring or by presetting grading criteria which are later adjusted based on actual student performance.

Norm-Referenced Systems

Definition

In norm-referenced systems students are evaluated in relationship to one another (e.g., the top 10% of students receive an A, the next 30% a B, etc.). This grading system rests on the assumption that the level of student performance will not vary much from class to class. In this system the instructor usually determines the percentage of students assigned each grade, although it may be determined (or at least influenced) by departmental policy.

Advantages

Norm-referenced systems are very easy for instructors to use. They work well in situations requiring rigid differentiation among students, where, for example, due to program size restrictions, only a certain percentage of the students can advance to higher level courses. They are generally appropriate in large courses which do not encourage cooperation among students.

Disadvantages

One objection to norm-referenced systems is that an individual's grade is determined not only by his/her achievements, but also by the achievements of others. In a large non selective lecture class, you can be fairly confident that the class is representative of the student population, but in small classes (under 40) the group may not be a representative sample. One student may get an A in a low-achieving section while a fellow student with the same score in a higher-achieving section gets a B.

A second objection to norm-referenced grading is that it promotes competition rather than cooperation. When students are pitted against each other for the few As to be given out, they're less likely to be helpful to each other.

Possible Modification

When using a norm-referenced system in a small class, you need to modify the allocation of grades based on the caliber of students in the class. One method of modifying a norm-referenced system is anchoring. Jacobs and Chase in Developing and Using Tests Effectively: A Guide for Faculty, 1992, describe the following ways to use an anchor:

"If instructors have taught a class several times and have used the same or an equivalent exam, then the distribution of test scores accumulated over many classes can serve as the anchor. The present class is compared with this cumulative distribution to judge the ability level of the group and the appropriate allocation of grades. Anchoring also works well in multi-section courses where the same text, same syllabus, and same examinations are used. The common examination can be used to reveal whether and how the class groups differ in achievement, and the grade in the individual sections can be adjusted accordingly.... If an instructor is teaching a class for the first time and has no other scores for comparison, a relevant and well-constructed teacher-made pretest may be used as an anchor."

Modifying the norm-referenced system by anchoring also helps mitigate feelings of competition among students since they may feel they are not directly in competition with each other.

Before deciding on a norm-referenced system, consider:

  • What is your expected class size? If your class size is smaller than 40, do not use a norm-referenced system unless you use anchoring to modify it.
  • Is it important for students to work cooperatively in this class (e.g., do you ask them to form study groups or work on projects as a group)? If the answer is yes, a norm-referenced system is not appropriate for your class.

Criterion-Referenced Systems

Definition

Norm-referenced tests measure students relative to each other. Criterion-referenced tests measure how well individual students do relative to pre-determined performance levels. Teachers use criterion-referenced tests when they want to determine how well each student has learned specific knowledge or skills.

In criterion-referenced systems students are evaluated against an absolute scale, normally a set number of points or a percentage of the total (e.g., 95-100 = A, 88-94 = B, etc.). Since the standard in this grading system is absolute, it is possible that all students could get As or all students could get Ds.

Advantages

Students are not competing with each other and are thus more likely to actively help each other learn. A student's grade is not influenced by the caliber of the class.

Disadvantages

It is difficult to set a reasonable standard for students without a fair amount of teaching experience. Most experienced faculty set criteria based on their knowledge of how students usually perform; thus, criterion-referenced systems often become fairly similar to norm-referenced systems.

Possible modifications

Instructors sometimes choose to maintain some flexibility in their grading system by telling the class in advance that the threshold for grades may be lowered if it seems appropriate. Thus, if a first exam was more difficult for students than the instructor imagined, s/he can lower the grading criteria rather than trying to compensate for the difficulty of the first exam with an easier second exam. Raising the criteria because too many students achieved As, however, is never advisable.

Another way of doing criterion-referenced grading is by listing class objectives and assigning grades based on the extent to which the student achieved them. (For example, A = Student has achieved all major and minor objectives of the course; B = Student has achieved all major objectives and several minor objectives; etc.).

Before deciding on a criterion-referenced system, consider:

  • How will you determine reasonable criteria for students? If you are teaching the class for the first time, it is advisable to maintain some flexibility.

Other Systems

Some alternate systems of grading include contract grading, peer grading, and self-evaluation by students.

In contract grading instructors list activities students can participate in or objectives they can achieve, usually attaching a specified number of points for each activity (e.g., book report = 30 points, term paper = 60 points). Students select the activities and/or objectives which will give them the grade they want and a contract is signed. It is advisable to have qualitative criteria stated in the contract in addition to listing the activities.

In some classes, a portion of a student's grade is determined by peers' evaluation of his/her performance. If students are told what to look for and how to grade, they generally can do a good job. The agreement between peer and instructor rating is about 80%. Peer grading is often used in composition classes and speech classes. If used, it should always be done anonymously.

Students can also be asked to assess their own work in the class and their assessment can be a portion of the final grade. This method has educational value since learning to assess one's own progress contributes to the university's goal of preparing students to be life-long learners.

A research analysis found that the percentages of self-assessors whose grades agree with those of faculty graders vary from 33% to 99%. Experienced students tend to rate themselves similarly to faculty while less experienced students generally give themselves higher grades than a faculty grader. Students in science classes also produced self-assessments which closely matched faculty assessment. If self-assessment is used, the instructor and student should meet to discuss the student's achievement before the self-evaluation is made.

Center for Teaching and Learning