Analyzing Consumer Preferences with Paired Comparison Data
Albert C. Bemmaor

ESSEC Cergy Pontoise - France
November 2000

 

This web page makes a computer program available, PREF, to permit a reader to replicate and to extend the analysis reported in the following article:

Bemmaor, Albert C. and Udo Wagner (2000), "A Multiple-Item Model of Paired Comparisons: Separating Chance From Latent Preference," Journal of Marketing Research, 37 (November), 514-24.

PREF works under Windows 95, Windows NT and Windows XP. You can download all the DATA sets which are mentioned in this study.

You can also download a computer program in executable form and two files, HELPPREF.HTM and REFERENCES.HTM. All the files are zipped. Here is a brief description of the current state of the program:

  1. Estimation and simulation of the binomial model;
  2. Estimation and simulation of the binomial/Dirichlet model;
  3. Estimation and simulation of the multinomial model;
  4. Estimation and simulation of the multinomial/Dirichlet model.
For each model, in addition to the parameter estimates, the program computes the following statistics that are reported in the article: mean, standard deviation, R² and the test statistics (G², chi-square). There is also a graph menu which permits to plot the distribution of preferences when the model is the binomial/Dirichlet model (in the case of two items only) or the multinomial/Dirichlet model. There is a simulation menu for each of the four models. The program has been thoroughly tested; however, the user of the program is solely responsible for the results obtained. The whole package is self-contained. We do not provide a support service. The computer program requires the availability of MFC42.dll . Although this dll is available with most personal computers, you can find two versions of the program. One version called PREF_WITHOUT does not includes the dll, and another version called PREF_ WITH includes the dll. Given the fairly large size of the dll (919 kb), you can check whether this dll is available on your computer prior to loading PREF. A user can also download GRAPH_BD1.GIF; this graph gives an indication on the computation times that correspond to a set of parameters k and I of the Binomial/Dirichlet. The number of points on a scale is equal to 2k+1 and the number of items is equal to I. For example, the graph shows that when the number of points on a scale is equal to 5 (k=2) and the number of items is equal to 30, the computation times are low; this corresponds to 435 paired comparisons per subject ! Computations times become all the more taxing as k increases. (We suggest to use k = 2 for all applications as the predictive accuracy) of stated preferences is likely to be bounded upward).

  

Examples of issues that can be addressed with the use of PREF

Here is a set of issues that can be dealt with:

Some hints on the means to address the issues stated above are as follows :

This analysis is quite simple to carry out. There has to be at least two brands in the first step. Otherwise, PREF can handle "many" brands. The question to be asked for PREF can be changed to the following one:

Between the two products A and B, which one would you buy?

-2 I would buy B certainly
-1 I would buy B probably
0 I do not know which one i would buy
1 I would buy A probably
2 I would buy A certainly

For frequently packaged goods, you can measure "draws" prior to trial (tasting or use) and after trial (tasting or use). You can include a price to each product tested. In this fashion, you can test alternative formulations of the new product by assessing their corresponding «draws» before making a new product decision. PREF permits to assess the "sources of volume" of line extensions prior to launch : «A line extension … involves a different flavor or ingredient variey, a different form or size, or a different application for the (parent) brand» (Keller 2003, p. 577).

An Example:

With two brands (Classic Coke and Pepsi), the polarization index phi is equal to 0.278. (The closer phi is to 1, the more heterogeneous the respondents are. the closer it is to 0, the more homogenous, the respondents are. phi is equal to 1/(a1 + a2 + a3 + 1) in the case of two brands). you can click on "graph" and obtain a representation of the distribution of the preferences across the respondents. You obtain an inverted-J shaped curve which shows the relative homogeneity of the respondents. When you include New Coke, the polarization index shifts to 0.113. (phi is equal to 1/(a1 + a2 + a3 + a4 + 1) in the case of three brands). Due to the increased similarity between products, the preferences become more homogenous across respondents and consequently choices become potentially more "random".

Note: In an electronic message dated 7/26/2004, Professor Agresti confirmed that the test between New Coke, Classic Coke and Pepsi shown in his 1992 article was a "blind" test. In a test where the brand names appear, would you expect the results to differ? How? Why? Run a similar test as that given in this example.

Using the data available in HELPPREF.HTM, follow the methodology shown above to compute the "draws" of New Coke. Consistent with Agresti's (1992, Table 3) paper, the brands are as follows: A: New Coke; B: Classic Coke; C: Pepsi. The comparisons are given in the following order: A-B, A-C and B-C. Looking at the Table in 3.1, we find that, for example, four respondents preferred Classic Coke to New Coke strongly, 13 preferred Classic Coke to New Coke slightly, nine did not have a preference, 19 preferred New Coke to Classic Coke sligthly and eight preferred New Coke to Classic Coke strongly. It is quite easy to understand the other data. Here is the correspondence between the Bemmaor/Wagner notations and Agresti's in A-B:

Bemmaor/Wagner
Agresti
Label
-2
5
I prefer Classic Coke to New Coke strongly

-1

4
I prefer Classic Coke to New Coke slightly
0
3
I have no preference
1
2
I prefer New Coke to Classic Coke slightly
2
1
I prefer New Coke to Classic Coke strongly

Are the results consistent with the expectations?

Solution given by PREF.

Running PREF with Classic Coke and Pepsi only (The number of products is equal to two and the parameter k is equal to 2), one obtains:

mu1 (Classic Coke) = 0.362
mu2 (Pepsi) = 0.503
mu3 = 0.135
Total = 1

Reallocating the third "brand" in proportion to the shares, one obtains:

mu1R (Classic Coke) = 0.419
mu2R (Pepsi) = 0.581

These results are consistent with previous results found in a blind test between Classic Coke and Pepsi. In a blind test, more people prefer Pepsi to Classic Coke. Including New Coke into the set of brands and running PREF again (the number of products is equal to three), one obtains:

mu1 (New Coke) = 0.407
mu2 (Classic Coke) = 0.274
mu3 (Pepsi) = 0.319
Total = 1

Under the proportional draw assumption, 41.9% of the share of New Coke originates from Classic Coke and 58.1% comes from Pepsi. PREF shows that New Coke "draws" (0.419-0.274)/0.407 = 35.6% of its share from Classic Coke instead of 41.9% and 64.4% of its share from Pepsi instead of 58.1%. Therefore, it "draws" 10.8% (=(64.4 - 58.1)/58.1) more than in proportion to share from Pepsi - as intended. New Coke was sweeter than Classic Coke, and, consequently, more similar to Pepsi. Note that New Coke was preferred to Pepsi and much preferred to Classic Coke in the "blind" test. This analysis does not take account of potential inertia due to past choice behavior. As shown by the experience, this inertia may be key to the failure of a new product.

Note: The qualitative results shown here, i.e., New Coke is more preferred to Classic Coke than it is to Pepsi and Pepsi is preferred to Classic Coke, are consistent with those reported in Agresti (1992, Table 4). As in Agresti, we assume that the repeated ratings are correlated across respondents. But also, we assume that they are independent of each other at the individual level.

One possible means to take inertia into consideration consists of (1) re-running the product tests with the brand names "in the open", (2) shifting the response categories from "I prefer Product 1 to Product 2 strongly" to "I would buy Classic Coke(rather than New Coke) certainly", and (3) running PREF as before. The "draws" and the level of the preference share of New Coke can be surprising when compared with the previous analysis. As we do not have the data available, the question remains open.

This comparison across tests with versus without the brand names can bring valuable insights into the potential interaction of "draws". Again, the main advantage of the version with the brand names is that it aims to capture inertia. Consequently, we can expect the performance of the new product in the test to be less encouraging, in particular in the case of a line extension.

The soft drink data are Professor Agresti's. They were published but there is an error in the reported data (see references.htm). Professor Agresti sent the author the corrected data by (regular) mail. The analysis reported in Agresti (1992) is based upon the correct data set.

Notes:

(i) Although the soft drink data reported in Agresti (1992, Table 3) are in error, the paper is based upon the correct data set. The user of PREF can find the typewriter ribbon data of Fleckenstein and colleagues (1958) in Table 1 of the Agresti paper and he/she can compare the results obtained with PREF to those reported in Table 2 of the paper (adjacent categories logit, cumulative probit and cumulative logit). the disadvantage of this data set as compared with the soft drink data is that intuition on the results is more limited. The Fleckenstein and colleagues data are available by downloading the data set on the website, among other data sets.

(ii) In the heading of Table 1 (Agresti 1991), there is an error: The title refers to typewriter ribbons whereas the Fleckenstein and colleagues' data deal with typewriter carbon papers. You can test for the similarity of the typewriter carbon papers by (1) running PREF on four carbon papers, (2) including a fifth one and (3) running PREF again.

PREF provides a flexible methodology for analyzing paired comparison data. These comparisons can be applied to (1) the comparison of product "profiles" in conjoint analysis, (2) "blind"product tests (between New Coke, Classic Coke and Pepsi as in Agresti's 1992 article in Applied Statistics, No. 2), and (3) survey data with identified brands and products .

Due to potential biases and errors (e.g., confusion, social desirability bias) , PREF does not resort to a respondent’s memory to capture new product substitution: Stimuli are described as accurately as possible, including prices whenever appropriate, to activate all possible cues to a respondent.. A respondent may be asked to taste or to try the product (if available) before providing further comparative evaluations. This testing (tasting or use) can include the existing products. The better informed a respondent is, the more reliable the comparative evaluations are likely to be. Information provided to a respondent at the time of the study is a key ingredient to PREF methodology.

Teaching with PREF

Depending on the level of the course (doctoral seminar versus MBA course versus undergraduate course), the instructor will give the technical details of the model or he/she will not. He/she will explain the "spirit" of the model in all the cases. For your information, we tested PREF in the classroom and had students carry out projects with PREF. They did not ask a single question about the use of PREF.
Reading the JMR paper is not a requirement to understand the output of PREF. Some students can ignore the JMR paper while others can read it to understand where the output comes from. Therefore, self-selection will prevail.

Getting acquainted with PREF

It takes a maximum of five to ten minutes to get acquainted with PREF and to get it to run on a set of hypothetical data. The model which will be the most commonly used one is, by far, the Binomial/Dirichlet model. The other models are either a special case or applicable in special contexts only. The Binomal/Dirichlet is a model with scope: It can be used to address a whole series of issues, from the assessment of « draws » for a new high technology product prior to launch to the measurement of consumer preferences between two yogurts in a «blind» test. The range of applications of PREF is rather unique. (Of course, the user will change the question depending on the purpose of the analysis). It is a multi-purpose model. What makes it attractive also is the inputs, i.e , paired comparisons, which are very-easy-to-collect data. PREF is self-contained to the extent that the data can be collected and the analysis can be carried out in the classroom. Changes can easily be made in the analysis ex post with the collection of additional data and/or the analysis of different subsets of paired comparisons. The students have full control of the whole research process (the design and the analysis). .This website will show neither the names of the adopters nor the names of their universities nor any anonymous testimonial. Each individual will be influenced by his/her own judgement only. PREF relies on the JMR article only.

Keller, Kevin Lane (2003), Strategic Brand Management : Building, Measuring, and Managing Brand Equity, Second Edition, Upper Saddle River, New Jersey : Prentice Hall.

Gibson, Larry (2003), "Letter to the Editor: Why the New Coke Failed," Marketing Research, 15 (2), 52. Available in full text in EBSCO data basis (Business Source Premier).

for a definition of line extensions with examples, see:

Cegarra, Jean-Jack and Dwight Merunka (1993), "Les Extensions de Marque: Concepts et Modèles," Recherche et Applications en Marketing, 8 (1), 53-76.

for an empirical study on the determinants of the potential success of line extensions, see:

Reddy, Srinivas K., Susan L. Holak, and Subodh Bhat (1994), "To Extend or Not to Extend: Success Determinants of Line Extensions," Journal of Marketing Research, 31 (May), 243-262.

for an alternative methodology which is designed to capture the "sources of volume" of a new product, see:

Bockenholt, Ulf and William R. Dillon (1997), Some New Methods for an Old Problem: Modeling Preference Changes and Competitive Market Structure in Pretest Market Data, " Journal of Marketing Research, 34 (February) 130-142.

Fader, Peter S. and Bruce G. Hardie (1996), "Modeling Consumer Choice Among SKU's", Journal of Marketing Research,33 (November), 442-52.

for an example of ex-post measurement of cannibalization, see:

Lomax, Wendy, Kathy Hammond, Robert East, and Maria Clemente (1997), The Measurement of Cannibalization, Journal of Product and Brand Management, 6 (1), 27-39.

The use of the program requires to obtain a license number. For further information, you can contact the Center for Research in Management and Economics, at the following address:

research.center@essec.fr

The fax number is 33 1 34 43 32 11. You can look up this web page regularly as we plan to update it as feedback comes in. We would be pleased to receive feedback on (1) the course title, (2) the level of the course (undergraduate, graduate, doctoral), and (3) the topics the students dealt with (one line each), at the following address:

pref@essec.fr

We do not expect to receive any comment on the technical performance of PREF: PREF works. It was thoroughly tested in the late 90's. The chances of technical failures are 0%.

 

PS. As an extension, the binomial and binomial/Dirichlet model can be applied to absolute preference scores when there are two products in a test. In this case, letting x1 be the absolute preference score of item 1 and x2 be the absolute preference score of item 2, the user can compute a relative preference score d12 = x1x2 for each individual respondent. However, the model makes the restrictive assumption that the individual scores x1 and x2 are not observed. Only the difference d12 is observed. Further extensions of the model have not yet been implemented.

Acknowledgement: Albert Bemmaor expresses his gratitude to Professor Alan Agresti of the University of Florida in Gainesville for his constant support during the development of this website. The usual disclaimer applies.

Last modified 07/19/04