Link to TOC: Current module is Multiple-Choice Questions
     
 
   
 

B. Developing Multiple-Choice Questions

 

        Multiple-choice questions are selection-type items. Students are given three or more possible answers and are asked to choose the correct answer or the "best" answer. The item begins with an item stem, followed by alternatives (the correct answer and distractors). Multiple-choice questions can be used to measure knowledge recall as well as higher order thinking. They are appropriate for use with objectives that call for the students to do such tasks as recognize, distinguish between, select, estimate, infer, predict, relate, categorize.

        Some of the most thorough work regarding multiple-choice questions has been done by Thomas Haladyna (1999). He presents four types of content (facts, concepts, principles, and procedures) and five types of cognitive behaviors (recalling, understanding, predicting, evaluating, and problem solving). While multiple-choice questions can be used to measure all of these, he suggests the context-dependent item set (a set of multiple-choice questions based on new content that is presented as part of the assessment, such as a paragraph, graphic display, etc.) or constructed response (short answer, completion, or essay questions) as more appropriate for critical thinking and problem solving than independent multiple-choice questions.  Not a link: Current module is Multiple-Choice Questions

        Multiple-choice items can be scored easily and quickly. They can be scored by machine and are frequently used for standardized tests. It is possible to sample a lot of content with multiple-choice items. Although multiple-choice tests are sometimes called "multiple guess" tests, there is less chance of guessing the correct answer than with true-false questions. Higher-order thinking can be assessed with multiple-choice items. The student can be asked to apply a rule or principle (example 1 below), to show understanding of cause and effect (example 2), or to identify the reasoning behind a particular choice of action (example 3).

   
 
  1.   Statistical tests CANNOT show which one of the following?
      A Whether two sets of scores are significantly different
      B Why two sets of scores are different
      C How closely related are two sets of scores
         
  2.   What is likely to happen to mortgage interest rates when interest rates on savings go up?
      A Increase
      B Decrease
      C No change
      D Unpredictable
         
  3.   Why did the chicken cross the road?
      A The chicken liked walking on pavement.
      B Corn had been spilled on the other side.
      C There was a hunting dog chasing the chicken.
      D The rest of the flock had already crossed.
   
 

        Although a multiple-choice test may be referred to as an "objective test," no test is truly objective. The instructor or someone else subjectively determines what content is  Not a link: Current module is Multiple-Choice Questions included in the test, the amount of emphasis placed on various topics, and the type(s) of questions used. Tests containing selection type questions (true-false, multiple-choice, matching, etc.), however, can be scored objectively because the scorer is not called upon to use his or her judgment when scoring the questions. Also, the questions can be scored fairly quickly using an answer key or machine scoring if it is available.

        While students may be able to guess or use the process of elimination, questions of this type require the students to make more complex distinctions than if each item were written as a true-false statement. Instead of choosing between two options in example 4a, there are more options and less chance of guessing correctly in example 4b.

   
 
  4a.   True False The capital of Kentucky is Louisville.  
             
  4b.   Which city is the capital of Kentucky?  
      A Frankfort    
      B Lexington    
      C Louisville    
      D Paducah    
   
          There are disadvantages for both the teacher and the student in using multiple-choice questions. It is more time-consuming for the teacher to construct good multiple-choice items than true false or completion items. One reason for this is the difficulty of finding suitable distractors, which are plausible (particularly at lower grades). To be plausible, the distractor must have the potential for being selected as the correct answer. Two distractors are as effective as three if one of the three is not plausible (highly unlikely to be selected), as is the case with B in example 5.
   
 
  5.   Which is the most important reason for using true-false questions?
      A The questions can be written quickly.
      B Students prefer them.
      C A lot of content can be covered.
      D Students can respond to more questions in a given amount of time.
   
 

        Another consideration is trying to insure that one question or its distractors do not provide clues to the answer of another question.

        Multiple-choice questions may not be as good for problem-solving (math) as completion/short answer items, but if students must take standardized tests in multiple-choice format, the students need some experience with this format.

        Reading level and reading speed of the students must be considered when  Not a link: Current module is Multiple-Choice Questions constructing the items. If the language is too difficult or if there are slow readers among the students, some may earn lower scores because they do not finish, rather than because they do not know the material. Each multiple-choice question typically involves more reading than a true-false, short answer or completion question.

        "Best" answer items (measuring understanding, application, and interpretation) are usually more difficult for the students than "correct" answer items. Example 6 (below) asks the student to determine the "MOST IMPORTANT" consideration. This presupposes that your instruction has included activities in which students are called upon to develop skills of comparing and evaluating information. However, if the students have been instructed that a particular purpose is the most important one, the question measures only knowledge or recall of information.

   
 
  6.   Which one of the following was the most important consideration in locating cities during frontier times in America?
      A good farmland  
      B access to waterways  
      C easy to defend against attack by Indians  
      D moderate temperatures.  
   
          Younger children are usually better able to deal with direct questions in the item stem (example 7a below) than incomplete statements (example 7b). They are also generally easier to create.
   
 
  7a.   Which would be a synonym for courtesy?
      A bow (bend forward from the waist)
      B honesty
      C politeness
      D tribute
         
  7b.   __________ is a synonym for courtesy.
      A bow (bend forward from the waist)
      B honesty
      C politeness
      D tribute
   
          If an incomplete sentence is used (as in example 7b), it is preferable to put the blank at the end of the sentence if at all possible so that the student doesn't have to reread the stem to determine the basis on which to select an answer. While this may not be a problem in a short stem as in example 7b, the stem in example 6 would be more problematic.
   
 
  6.   __________ was the most important consideration in locating cities during frontier times in America?
   
          Haladyna (1999) discusses two methods for using item shells when creating  Not a link: Current module is Multiple-Choice Questions multiple-choice questions. An item shell provides a basis for the stem (the question or problem part of the item) and may be either generic or taken from questions that have proved to be successful in the past. Some examples of generic item shells (with the shells underlined) are presented below. Each question would be followed by the appropriate alternatives. The shell could easily be adapted to current content by replacing the word or phrase that is not underlined.
(NOTE: Below are underlines and not links.)
 
  Which would be another name for a scavenger?
   
  Which is an example of osmosis?
   
  What is the cause (purpose, reason for) of the appearance of the aurora borealis on some nights and not on others?
   
  What would happen if there were no sunlight for 30 days?
   
  What is the best (worst, most important, least important) method of studying for essay tests?
   
  Which of the following would have had the greatest influence on the thinking of average citizens in colonial times?
   
  In what way are typing and using a computer similar?
   
  Which question best illustrates the application of higher level thinking?
   
  Which of the following best describes the way students learn?
   
          In Haladyna's second method, utilizing items you have had success with in the past, the first step is to identify the key words or phrases representing the content of the item. In example 8a, the key words or phrases are considered to be "unit" test, "30" (minutes), and "higher levels of thinking."
   
 
  8a.   You are constructing a unit test for your students. You have decided they will have 30 minutes to complete the test. Your objectives are primarily written at higher levels of thinking. What is the best type of test to use?
      A True-False  
      B Multiple-choice  
      C Essay  
      D A combination of A and B  
           
  The item shell in example 8a is as follows:
           
      You are constructing a __________ test for your students. You have decided they will have ________ minutes to complete the test. Your objectives are primarily written at ________________. What is the best type of test to use?
   
          The next step would be to look at ways to vary the underlined words. The unit test could become a weekly quiz, a daily quiz, a final exam, etc. Instead of 30 minutes we could 45 minutes, 15 minutes, or any other reasonable length of time; and the type of objectives could be described as at the recall level, understanding, etc. From this point, it is not difficult to construct a new stem by substituting for one or more key words or phrases.
   
 
  8b.   You are constructing a final exam for your students. You have decided they will have 60 minutes to complete the test. Your objectives are primarily written at the application and evaluation levels. What is the best type of test to use?
      A True-False  
      B Multiple-choice  
      C Essay  
      D A combination of A and B  
   
          
       Ebel and Frisbie (1986) provided some suggestions for formulating distractors. First, identify the correct answer ("deer" in example 9a) and note the class of things to which the correct answer belongs.
 Not a link: Current module is Multiple-Choice Questions
 
  9a.   The grazing of which wild animal poses problems for farmers in the Southeast?
      A deer  
   
          The correct answer is an animal, and that would represent the class of things from which we would select our distractors. We also need to relate our distractors to other key terms or descriptors in the stem: wild animal, one that grazes on crops, and one prevalent in the Southeast. While we know our correct answer fits all of these descriptors, we must insure that none of our distractors do so since we are seeking a "correct" answer. We could select a wild animal that is carnivorous, an animal that grazes in the Southeast but is not wild, or perhaps a wild animal in the Southeast but one that does not pose problems for farmers, as shown below.
   
 
  9b.   The grazing of which wild animal poses problems for farmers in the Southeast?
      A deer  
      B bear  
      C cow  
      D turkey  
   
 

        If we were seeking the "best" answer, one or more of our distractors could fit all the categories in the stem but to a lesser degree than the deer. This also increases the difficulty of the question because we have made the alternatives more homogeneous.

          
       Another strategy in developing distractors is to use the mistakes commonly made by the students as distractors.
 Not a link: Current module is Multiple-Choice Questions
  Variation      One variation of the multiple-choice question is one in which a question is posed in which the answer is either yes or no, but a reason is attached to the yes or no response. The student must not only know whether the response is yes or no but also the reason behind it.
   
 
  10.   Do students who study Spanish generally do better in Language Arts (English) than students who study Latin?
      A Yes, because they are more likely to see and hear Spanish words.
      B Yes, because the Spanish language is so similar to ours.
      C No, because many of our words are derived from Latin words.
      D No, because the study of Latin requires more discipline.
   
  Another variation that links multiple-choice and true-false questions occurs when the student has to indicate whether each of the alternatives is true or false, or to mark those that are true (after evaluating each alternative). This is sometimes referred to as multiple true false. The item is structured like a multiple-choice item with a stem and multiple responses. In actuality, the student treats each item (A through E) as a forced choice question.
   
 
  11.   Which of the following are good assessment practices? Mark "G" beside each good practice, "P" beside each poor one.
    _____ A assessments are matched to objectives
    _____ B a variety of assessments are used
    _____ C test items in the teacher's manual are ignored
    _____ D assessments are administered frequently
    _____ E grades are determined primarily on written tests
   
   
 
 Not a link: Current module is Multiple-Choice Questions
 
The stem should:
   
* be meaningful and provide a definite problem
   
* include as much of the item as possible. If, after writing the item, you notice that each of your alternatives begins with the same word or phrase, incorporate the word or phrase into the stem, deleting it from each of the alternatives.
   
* be stated positively if at all possible; if negatives are included they should be highlighted in some way (in capital letters or bold type)
   
* be stated simply and not contain irrelevant information
   
* NOT include the phrase "what do you think . . ." because any answer will have to be considered correct
   
* avoid negatives and other terms that might be confusing (NOT, NEVER, EXCEPT, ONLY). If they are necessary, call attention to them by putting them in capital letters and/or bold type.
   
   
Alternatives should:
   
* fit the stem grammatically (match use of the letter a or the word an in the stem)
   
* fit the stem in number (singular/plural - do/does, single/multiple - cause/causes)
   
* all be approximately the same length
   
* be brief
   
* be presented in logical order (alphabetical, chronological, magnitude, etc.) unless it is impossible to avoid a response pattern as the questions are originally developed or through renumbering and rearranging them
   
* include only one correct or clearly best answer
   
* provide three to five options, including the correct answer
   
* NOT include a key word (or words) that appear in the only in the correct/best answer
   
* use "none of the above" and "all of the above" infrequently. "None of the above"is preferable to "all of the above." If a student knows two of the answers are correct, s/he knows all of the above is the answer. For "none of the above," each of the other responses must be weighed.
   
* should all be plausible
   
* avoid using "Both B and D are correct" ONLY when this is the correct answer.
   
   
Format:
   
* List alternatives vertically beneath the stem, beginning a new line with each response.
   
* Have students circle the letter beside their response to avoid scoring problems reading student handwriting.
   
* Number the questions (stems), and use capital or lowercase letters for the responses. If there are students with learning disabilities who have difficulty distinguishing between lowercase "b" and "d" it is preferable to use capital letters.
   
* The position (A, B, C, D) of the correct answer should vary from question to question so there is no pattern that could lead to predicting the correct answer.
   
* The stem and its alternatives (responses) should appear on the same page.
   
* When referring to a map, table, or figure, label it for easy reference. The reference materials (maps, tables, figures, etc.,) should precede the related items and be on the same page.
   
* One question should not give a clue to the answer of another question.
   
* Directions should specify if there is only one correct/best answer to each item or if there may be more. If some questions have only one answer and some have more than one, the stem should specify (which one of the following; which of the following with directions to mark all)
   
   
  THIS CONCLUDES THE INFORMATION ON MULTIPLE-CHOICE QUESTIONS. GO TO THE NEXT SECTION TO CHECK YOUR KNOWLEDGE.
 
Back to Top of Page
 
Previous Section: A
Next Section: C
 
Copyright © 2001-2002 Alabama Department of Education
All Rights Reserved.