| |
|
| |
|
|
| |
|
| |
|
| |
B. Developing Multiple-Choice Questions
|
| |
Multiple-choice questions
are selection-type items. Students are given three or more possible answers
and are asked to choose the correct answer or the "best" answer. The item
begins with an item stem, followed by alternatives (the correct answer
and distractors). Multiple-choice questions can be used to measure knowledge
recall as well as higher order thinking. They are appropriate for use
with objectives that call for the students to do such tasks as recognize,
distinguish between, select, estimate, infer, predict, relate, categorize.
Some of the most thorough
work regarding multiple-choice questions has been done by Thomas Haladyna
(1999). He presents four types of content (facts, concepts, principles,
and procedures) and five types of cognitive behaviors (recalling, understanding,
predicting, evaluating, and problem solving). While multiple-choice questions
can be used to measure all of these, he suggests the context-dependent
item set (a set of multiple-choice questions based on new content that
is presented as part of the assessment, such as a paragraph, graphic display,
etc.) or constructed response (short answer, completion, or essay questions)
as more appropriate for critical thinking and problem solving than independent
multiple-choice questions. 
Multiple-choice items
can be scored easily and quickly. They can be scored by machine and are
frequently used for standardized tests. It is possible to sample a lot
of content with multiple-choice items. Although multiple-choice tests
are sometimes called "multiple guess" tests, there is less chance of guessing
the correct answer than with true-false questions. Higher-order thinking
can be assessed with multiple-choice items. The student can be asked to
apply a rule or principle (example 1 below), to show understanding of
cause and effect (example 2), or to identify the reasoning behind a particular
choice of action (example 3).
|
| |
|
| |
| |
1. |
|
Statistical tests CANNOT show which one
of the following? |
| |
|
|
A |
Whether two sets of scores are significantly different |
| |
|
|
B |
Why two sets of scores are different |
| |
|
|
C |
How closely related are two sets of scores |
| |
|
|
|
|
| |
2. |
|
What is likely to happen to mortgage interest
rates when interest rates on savings go up? |
| |
|
|
A |
Increase |
| |
|
|
B |
Decrease |
| |
|
|
C |
No change |
| |
|
|
D |
Unpredictable |
| |
|
|
|
|
| |
3. |
|
Why did the chicken cross the road? |
| |
|
|
A |
The chicken liked walking on pavement. |
| |
|
|
B |
Corn had been spilled on the other side. |
| |
|
|
C |
There was a hunting dog chasing the chicken. |
| |
|
|
D |
The rest of the flock had already crossed. |
|
| |
|
| |
Although a multiple-choice
test may be referred to as an "objective test," no test is truly objective.
The instructor or someone else subjectively determines what content is
included
in the test, the amount of emphasis placed on various topics, and the
type(s) of questions used. Tests containing selection type questions (true-false,
multiple-choice, matching, etc.), however, can be scored objectively
because the scorer is not called upon to use his or her judgment when
scoring the questions. Also, the questions can be scored fairly quickly
using an answer key or machine scoring if it is available.
While students may be
able to guess or use the process of elimination, questions of this type
require the students to make more complex distinctions than if each item
were written as a true-false statement. Instead of choosing between two
options in example 4a, there are more options and less chance of guessing
correctly in example 4b.
|
| |
|
| |
| |
4a. |
|
True |
False |
The capital of Kentucky is Louisville. |
|
| |
|
|
|
|
|
|
| |
4b. |
|
Which city is the capital of Kentucky? |
|
| |
|
|
A |
Frankfort |
|
|
| |
|
|
B |
Lexington |
|
|
| |
|
|
C |
Louisville |
|
|
| |
|
|
D |
Paducah |
|
|
|
| |
|
| |
There
are disadvantages for both the teacher and the student in using multiple-choice
questions. It is more time-consuming for the teacher to construct good multiple-choice
items than true false or completion items. One reason for this is the difficulty
of finding suitable distractors, which are plausible (particularly at lower
grades). To be plausible, the distractor must have the potential for being
selected as the correct answer. Two distractors are as effective as three
if one of the three is not plausible (highly unlikely to be selected), as
is the case with B in example 5. |
| |
|
| |
| |
5. |
|
Which is the most important reason for
using true-false questions? |
| |
|
|
A |
The questions can be written quickly. |
| |
|
|
B |
Students prefer them. |
| |
|
|
C |
A lot of content can be covered. |
| |
|
|
D |
Students can respond to more questions in a given amount
of time. |
|
| |
|
| |
Another consideration
is trying to insure that one question or its distractors do not provide
clues to the answer of another question.
Multiple-choice questions
may not be as good for problem-solving (math) as completion/short answer
items, but if students must take standardized tests in multiple-choice
format, the students need some experience with this format.
Reading level and reading
speed of the students must be considered when constructing
the items. If the language is too difficult or if there are slow readers
among the students, some may earn lower scores because they do not finish,
rather than because they do not know the material. Each multiple-choice
question typically involves more reading than a true-false, short answer
or completion question.
"Best" answer items (measuring
understanding, application, and interpretation) are usually more difficult
for the students than "correct" answer items. Example 6 (below) asks the
student to determine the "MOST IMPORTANT" consideration. This presupposes
that your instruction has included activities in which students are called
upon to develop skills of comparing and evaluating information. However,
if the students have been instructed that a particular purpose is the
most important one, the question measures only knowledge or recall of
information.
|
| |
|
| |
| |
6. |
|
Which one of the following was the most
important consideration in locating cities during frontier times in
America? |
| |
|
|
A |
good farmland |
|
| |
|
|
B |
access to waterways |
|
| |
|
|
C |
easy to defend against attack by Indians |
|
| |
|
|
D |
moderate temperatures. |
|
|
| |
|
| |
Younger
children are usually better able to deal with direct questions in the item
stem (example 7a below) than incomplete statements (example 7b). They are
also generally easier to create. |
| |
|
| |
| |
7a. |
|
Which would be a synonym for courtesy? |
| |
|
|
A |
bow (bend forward from the waist) |
| |
|
|
B |
honesty |
| |
|
|
C |
politeness |
| |
|
|
D |
tribute |
| |
|
|
|
|
| |
7b. |
|
__________ is a synonym for courtesy. |
| |
|
|
A |
bow (bend forward from the waist) |
| |
|
|
B |
honesty |
| |
|
|
C |
politeness |
| |
|
|
D |
tribute |
|
| |
|
| |
If
an incomplete sentence is used (as in example 7b), it is preferable to put
the blank at the end of the sentence if at all possible so that the student
doesn't have to reread the stem to determine the basis on which to select
an answer. While this may not be a problem in a short stem as in example
7b, the stem in example 6 would be more problematic. |
| |
|
| |
| |
6. |
|
__________ was the most important consideration
in locating cities during frontier times in America? |
|
| |
|
| |
Haladyna
(1999) discusses two methods for using item shells when creating multiple-choice
questions. An item shell provides a basis for the stem (the question or
problem part of the item) and may be either generic or taken from questions
that have proved to be successful in the past. Some examples of generic
item shells (with the shells underlined) are presented below. Each question
would be followed by the appropriate alternatives. The shell could easily
be adapted to current content by replacing the word or phrase that is not
underlined.
(NOTE: Below are underlines and not links.) |
| |
| |
Which would be another name for
a scavenger? |
| |
|
| |
Which is an example of osmosis?
|
| |
|
| |
What is the cause (purpose, reason for)
of the appearance of the aurora borealis on some nights and not
on others? |
| |
|
| |
What would happen if there were
no sunlight for 30 days? |
| |
|
| |
What is the best (worst, most important,
least important) method of studying for essay tests? |
| |
|
| |
Which of the following would have had
the greatest influence on the thinking of average citizens in
colonial times? |
| |
|
| |
In what way are typing and
using a computer similar? |
| |
|
| |
Which question best illustrates
the application of higher level thinking? |
| |
|
| |
Which of the following best describes
the way students learn? |
|
| |
|
| |
In
Haladyna's second method, utilizing items you have had success with in the
past, the first step is to identify the key words or phrases representing
the content of the item. In example 8a, the key words or phrases are considered
to be "unit" test, "30" (minutes), and "higher levels of thinking." |
| |
|
| |
| |
8a. |
|
You are constructing a unit test
for your students. You have decided they will have 30 minutes
to complete the test. Your objectives are primarily written at higher
levels of thinking. What is the best type of test to use? |
| |
|
|
A |
True-False |
|
| |
|
|
B |
Multiple-choice |
|
| |
|
|
C |
Essay |
|
| |
|
|
D |
A combination of A and B |
|
| |
|
|
|
|
|
| |
The item shell in example 8a is as follows: |
| |
|
|
|
|
|
| |
|
|
You are constructing a __________ test
for your students. You have decided they will have ________ minutes
to complete the test. Your objectives are primarily written at ________________.
What is the best type of test to use? |
|
| |
|
| |
The
next step would be to look at ways to vary the underlined words. The unit
test could become a weekly quiz, a daily quiz, a final exam, etc. Instead
of 30 minutes we could 45 minutes, 15 minutes, or any other reasonable length
of time; and the type of objectives could be described as at the recall
level, understanding, etc. From this point, it is not difficult to construct
a new stem by substituting for one or more key words or phrases. |
| |
|
| |
| |
8b. |
|
You are constructing a final exam
for your students. You have decided they will have 60 minutes
to complete the test. Your objectives are primarily written at
the application and evaluation levels. What is the best type of
test to use? |
| |
|
|
A |
True-False |
|
| |
|
|
B |
Multiple-choice |
|
| |
|
|
C |
Essay |
|
| |
|
|
D |
A combination of A and B |
|
|
| |
|
| |
| Ebel and Frisbie (1986)
provided some suggestions for formulating distractors. First, identify
the correct answer ("deer" in example 9a) and note the class of things
to which the correct answer belongs. |
|
 |
| |
| |
9a. |
|
The grazing of which wild animal poses
problems for farmers in the Southeast? |
| |
|
|
A |
deer |
|
|
| |
|
| |
The
correct answer is an animal, and that would represent the class of
things from which we would select our distractors. We also need to relate
our distractors to other key terms or descriptors in the stem: wild
animal, one that grazes on crops, and one prevalent in the Southeast.
While we know our correct answer fits all of these descriptors, we must
insure that none of our distractors do so since we are seeking a "correct"
answer. We could select a wild animal that is carnivorous, an animal that
grazes in the Southeast but is not wild, or perhaps a wild animal in the
Southeast but one that does not pose problems for farmers, as shown below.
|
| |
|
| |
| |
9b. |
|
The grazing of which wild animal poses
problems for farmers in the Southeast? |
| |
|
|
A |
deer |
|
| |
|
|
B |
bear |
|
| |
|
|
C |
cow |
|
| |
|
|
D |
turkey |
|
|
| |
|
| |
If we were seeking the
"best" answer, one or more of our distractors could fit all the categories
in the stem but to a lesser degree than the deer. This also increases
the difficulty of the question because we have made the alternatives more
homogeneous.
|
| |
| Another strategy in developing
distractors is to use the mistakes commonly made by the students as
distractors. |
|
 |
| |
Variation One
variation of the multiple-choice question is one in which a question is
posed in which the answer is either yes or no, but a reason is attached
to the yes or no response. The student must not only know whether the response
is yes or no but also the reason behind it. |
| |
|
| |
| |
10. |
|
Do students who study Spanish generally
do better in Language Arts (English) than students who study Latin?
|
| |
|
|
A |
Yes, because they are more likely to see and hear Spanish
words. |
| |
|
|
B |
Yes, because the Spanish language is so similar to
ours. |
| |
|
|
C |
No, because many of our words are derived from Latin
words. |
| |
|
|
D |
No, because the study of Latin requires more discipline.
|
|
| |
|
| |
Another variation that links multiple-choice
and true-false questions occurs when the student has to indicate whether
each of the alternatives is true or false, or to mark those that
are true (after evaluating each alternative). This is sometimes referred
to as multiple true false. The item is structured like a multiple-choice
item with a stem and multiple responses. In actuality, the student treats
each item (A through E) as a forced choice question. |
| |
|
| |
| |
11. |
|
Which of the following are good assessment
practices? Mark "G" beside each good practice, "P" beside each
poor one. |
| |
|
_____ |
A |
assessments are matched to objectives |
| |
|
_____ |
B |
a variety of assessments are used |
| |
|
_____ |
C |
test items in the teacher's manual are ignored |
| |
|
_____ |
D |
assessments are administered frequently |
| |
|
_____ |
E |
grades are determined primarily on written tests |
|
| |
|
| |
|
| |
|
| |
| The stem should: |
| |
|
| * |
be meaningful and provide a definite problem |
| |
|
| * |
include as much of the item as possible. If, after
writing the item, you notice that each of your alternatives begins
with the same word or phrase, incorporate the word or phrase into
the stem, deleting it from each of the alternatives. |
| |
|
| * |
be stated positively if at all possible; if negatives
are included they should be highlighted in some way (in capital letters
or bold type) |
| |
|
| * |
be stated simply and not contain irrelevant information |
| |
|
| * |
NOT include the phrase "what do you think .
. ." because any answer will have to be considered correct |
| |
|
| * |
avoid negatives and other terms that might be confusing
(NOT, NEVER, EXCEPT, ONLY). If they are necessary, call attention
to them by putting them in capital letters and/or bold type. |
| |
|
| |
|
| Alternatives should: |
| |
|
| * |
fit the stem grammatically (match use of the letter
a or the word an in the stem) |
| |
|
| * |
fit the stem in number (singular/plural - do/does,
single/multiple - cause/causes) |
| |
|
| * |
all be approximately the same length |
| |
|
| * |
be brief |
| |
|
| * |
be presented in logical order (alphabetical, chronological,
magnitude, etc.) unless it is impossible to avoid a response pattern
as the questions are originally developed or through renumbering and
rearranging them |
| |
|
| * |
include only one correct or clearly best answer |
| |
|
| * |
provide three to five options, including the correct
answer |
| |
|
| * |
NOT include a key word (or words) that appear
in the only in the correct/best answer |
| |
|
| * |
use "none of the above" and "all of the above" infrequently.
"None of the above"is preferable to "all of the above." If a student
knows two of the answers are correct, s/he knows all of the above
is the answer. For "none of the above," each of the other responses
must be weighed. |
| |
|
| * |
should all be plausible |
| |
|
| * |
avoid using "Both B and D are correct" ONLY when this
is the correct answer. |
| |
|
| |
|
| Format: |
| |
|
| * |
List alternatives vertically beneath the stem, beginning
a new line with each response. |
| |
|
| * |
Have students circle the letter beside their response
to avoid scoring problems reading student handwriting. |
| |
|
| * |
Number the questions (stems), and use capital or lowercase
letters for the responses. If there are students with learning disabilities
who have difficulty distinguishing between lowercase "b" and "d" it
is preferable to use capital letters. |
| |
|
| * |
The position (A, B, C, D) of the correct answer should
vary from question to question so there is no pattern that could lead
to predicting the correct answer. |
| |
|
| * |
The stem and its alternatives (responses) should appear
on the same page. |
| |
|
| * |
When referring to a map, table, or figure, label it
for easy reference. The reference materials (maps, tables, figures,
etc.,) should precede the related items and be on the same page. |
| |
|
| * |
One question should not give a clue to the answer of
another question. |
| |
|
| * |
Directions should specify if there is only one correct/best
answer to each item or if there may be more. If some questions have
only one answer and some have more than one, the stem should specify
(which one of the following; which of the following
with directions to mark all) |
|
| |
|
| |
|
| |
THIS CONCLUDES THE INFORMATION ON MULTIPLE-CHOICE
QUESTIONS. GO TO THE NEXT SECTION TO CHECK YOUR KNOWLEDGE. |
| |
|
| |
|
|
| |
|
Copyright
© 2001-2002 Alabama Department of Education
All Rights Reserved.
|
|