Profiting from the Decoy Effect: A Case Study of an Online Diamond Retailer
Profiting from the Decoy Effect:
A Case Study of an Online Diamond Retailer
Abstract
The decoy effect (DE), first introduced by Huber et al. (1982), has been robustly documented across dozens of product categories and choice settings using lab experiments. However, it has never been verified in a real product market in the literature. In this paper, we empirically test and quantify the DE in the sales operations of a leading online diamond retailer. We develop a diamond-level proportional hazard framework by jointly modeling market-level decoy–dominant detection probabilities and the boost in sales upon detection of dominants. Results suggest that decoy–dominant detection probabilities are low (11%–25%) in the diamond market; however, upon detection, the DE increases dominant diamonds’ sale hazards significantly (1.8–3.2 times). In terms of the managerial significance, we find that the DE substantially increases the diamond retailer’s gross profit by 14.3%. We further conduct simulation studies to understand the DE’s profit impact under various dominance scenarios.
Keywords: Decoy Effect, Attraction Effect, Asymmetric Dominance Effect, Context-Dependent Choice, Proportional Hazard Model, Diamond Pricing
1
1 Introduction
The decoy effect (Huber et al., 1982), also called attraction or asymmetric dominance effect, refers to the phenomenon of consumers having different preferences for existing choice alternatives with and without dominant (i.e., decoy) options in their choice sets. By design, these decoys are inferior to some, but not to all, existing choice options. When such decoys exist, all else equal, dominant options’ choice likelihoods get larger compared to cases when the decoys are not present. Since its introduction, the decoy effect (DE henceforth) has become one of the most popular and frequently cited context effects in the consumer-behavior literature, and it has been thoroughly examined across dozens of product categories and choice domains using lab experiments (see, for example, Huber et al., 1982; Huber and Puto, 1983; Wedell, 1991; Lehmann and Pan, 1994; Royle et al., 1999).
Despite its popularity, the DE’s practical validity has been severely challenged recently by a series of unsuccessful replication attempts that shed light on the limits and boundaries of the effect. Frederick et al. (2014) showed that the DE can be observed only in very stylized settings, such as the presentation of two products with two numerically depicted attributes. Yang and Lynn (2014) provided additional support to these findings and questioned whether the DE has any practical significance, or if it is just an experimental artifact. The lack of documentation on the practice of the DE in product markets was noted by Huber et al. (2014); and this has further put the practical validity and significance of the DE into question. In this paper, in response to these recent studies, we provide strong empirical evidence that not only validates the DE in a real product market but also illustrates its managerial significance through quantifying the substantial profit impact.
Even though it has been almost four decades since the DE was introduced, to the best of our knowledge, there has been no empirical study that tests and quantifies the DE with field data. To achieve this, one must consider and resolve a few key challenges. First, a researcher needs to calibrate decoy–dominant relationships among product alternatives. Since products typically have horizontal attributes—such as brand, taste, size, and packaging—and consumers have heterogeneous preferences for them, decoys to some consumers may not be decoys to others. Therefore, in most product markets, strict decoy–dominant relationships may not exist, let alone permit calibration. Second, consumers should be able to detect the decoy–dominant relationships. Unlike in lab experiments where alternatives with only two or three attributes are presented, choice scenarios in real life are far more complex: in a typical consumer product category, alternatives have a much larger number of attributes, e.g., brand, size, design, color, weight, packaging, content, taste, and price, to name a few. Thus, it is much harder for consumers to fully evaluate the trade-offs and detect existing decoy–dominant relationships, so that, as noted in Huber et al. (1982), “the effect may be lessened,” and the lack of detection becomes one important mitigating factor of the DE (Huber et al., 2014). For consumers to detect decoy–dominant relationships, the choice decision must be salient and require enough cognitive processing that consumers’ preferences can be constructed rather than already revealed (Huber et al., 2014). For example, for trivial decisions, consumers may just make their choices without paying much attention to the alternatives; consequently, they may not be able to detect existing decoys/dominants. Similarly, for repeat-purchase products, added decoys may not impact the choices of consumers who have already developed clear preferences for existing alternatives. From a technical perspective, Simonson (2014) further called for a systematic study separating decoy–dominant detection from the DE (i.e., sales boost in dominants upon detection). Lastly, it is quite possible that decoy pricing strategies, i.e., introducing decoy–dominant relationships by charging higher prices for the same or inferior quality products, may not generate positive profit impacts for firms, which limits the existence of the decoy pricing practice in the real world.
Due to the above-mentioned challenges, to empirically test and quantify the DE, we need data from a product category 1) with a reasonably small number of vertical product attributes; 2) that is important to consumers but not repeatedly purchased; and 3) that has the decoy pricing practice. The online diamond market is a highly appropriate case for this purpose: diamonds are commodity-type products with quality clearly defined on a few vertical attributes such as carat, color, cut, and clarity (4Cs); diamond purchases are important but not repeated lifetime decisions; and, finally, we frequently observe decoy pricing patterns in the online diamond market.
We use diamond pricing and sales data from a major U.S. online jewelry retailer to empirically test the DE’s existence, quantify its magnitude, and show its significance for firm profitability. We validate that a diamond’s value is predominantly determined by its most important vertical attributes, i.e., the 4Cs. Yet we also observe significant price variations in the market for diamonds with the same characteristics, and we construct dyadic decoy–dominant relationships among diamond pairs based on their 4Cs and prices for our analysis. Data patterns show that dominants have significantly larger sale probabilities than diamonds that are neither decoys nor dominants, while the opposite is true for decoys. The effect for dominants, interestingly, doubles that for decoys (29% vs. 14%). Sales share regressions reveal that increasing the proportion of dominants in the market would extract a dis-proportionally larger sales share from other diamonds. These data patterns are consistent with the predictions of the DE but could also be explained qualitatively by alternative mechanisms such as the reference price effect and consumer search. We develop formal statistical tests to show that, quantitatively, there are strong statistical supports favoring the DE over the reference price explanation, and that the observed price variations are consistent with the DE coexisting with consumer search but cannot be explained solely by consumer search.
Given the data providing strong evidence of the DE, we formally develop a proportional hazard framework in our empirical analysis. Modeling the impact of decoy diamonds on sales of their dominants in the diamond market requires us to separate the market-level decoy–dominant detection from the sales boost once diamonds are detected as dominants. As discussed earlier, this is especially important in a real market context because, with numerous alternatives available, consumers may search and construct consideration sets with a limited number of alternatives that have no or few decoy–dominant relationships even though such relationships exist within a broader set of options in the diamond market. To achieve that, we incorporate two critical components into our proposed hazard framework: market-level decoy–dominant detection probability and dominant boost hazard upon dominant detection. Thus, in our setting, upon the dominant detection, the dominant boost hazard component is used to test the existence and measure the magnitude of the DE.
In the estimation, we use a diamond’s characteristics, daily demand fluctuations, competition from other similar diamonds, and the observed decoy–dominant structure to control for the differences in the sale hazards of diamonds. To capture potential consumer heterogeneity in response to decoy pricing, we divide the diamonds into three price segments (low: $2K–$5K; medium: $5K–$10K; and high: $10K–$20K) and estimate segment-specific detection probabilities and sales boosts upon dominant detection. Further, to capture the unobserved heterogeneity and correlations in diamond sales, we use a diamond-level random effect specification. Results suggest that the market-level detection probability for a decoy (or dominant) diamond is quite low, ranging from 11% in the high-price segment to 25% in the low-price segment. The low decoy–dominant detection probabilities confirm that modeling the decoy–dominant detection probabilities explicitly in the diamond market is required to quantify the DE accurately. As opposed to low detection probabilities, upon detection, we find that a dominant diamond’s sale hazard gets 2.7, 1.8, and 3.2 times larger in the low-, medium-, and high-price segments, respectively. This finding validates the DE in the field in response to recent studies questioning this aspect, including Frederick et al. (2014) and Yang and Lynn (2014). Next, we quantify the profit impact of the DE and find that the DE improves the retailer’s gross profit by 14.3%. This finding shows that the DE is not only real but also highly substantive managerially. Finally, through additional simulation studies, we show that the profit impact of the DE gets larger as the number of decoys increases; as the price variations increase; and as the market-level detection probabilities remain similar, decrease, and increase in the low-, medium-, and high-price segments, respectively.
To reiterate our contribution, we advance the literature on the DE by empirically separating the market-level decoy–dominant detection from the DE boost of decoys on dominants. More importantly, for the first time in the literature, we 1) validate the existence of the DE in a real market; 2) quantify its magnitude across different segments; and 3) show its substantive profit impact. This paper thus attenuates the recent concerns (Frederick et al., 2014; Yang and Lynn, 2014) about the practical validity of this classical context effect beyond traditional lab settings.
2 Literature Review
This paper contributes to two streams of literature: the general consumer-behavior literature on context-dependent choices (in particular, the DE) and the empirical consumer choice-modeling literature in marketing and economics.
Standard rational choice models in economics and marketing are built upon the revealed preference assumption, which implicitly assumes two principles: the principle of regularity (Luce, 1977) and the principle of independence of irrelevant alternatives (IIA) (Luce, 1959). In contrast, consumer-behavior researchers adopted the notion of constructed preference (Bettman et al., 1998), and they extensively documented context effects in consumer choices (Tversky, 1972; Simonson, 1989). The DE
(Huber et al., 1982), which is a classic example of such context effects, violates both regularity and
IIA principles. The DE has been examined across dozens of product categories and choice domains
(see, for example, Huber et al., 1982; Huber and Puto, 1983; Wedell, 1991; Lehmann and Pan, 1994; Royle et al., 1999). Further, the literature features investigations of cognitive processes and mechanisms moderating the DE and related context effects (see, for example, Ratneshwar et al., 1987; Heath et al., 1995; Khan et al., 2011; Müller et al., 2014; Guo and Wang, 2016; Morewedge et al., 2018). In this domain, Khan et al. (2011) studied the influence of choice construal on context effects and found that high construal as opposed to low increases the size of the DE. Morewedge et al. (2018) demonstrated that when comparisons of alternatives for choice makers require social comparisons, the context effects get stronger. Guo and Wang (2016) studied underlying causes of context effects and found that the response time can mediate the compromise effect, but the context information cannot. Despite an ample amount of research devoted to the DE, empirical testing and quantification of the effect in a real product market has not been achieved yet. Our paper fills this important gap by validating the DE in the field.
Despite its wide acceptance, the limits and boundaries of the DE have been debated by multiple studies recently. Frederick et al. (2014) stated that the DE could only be observed in very stylized lab settings with 2×2 numerical depictions of the products (two products with two attributes, with a decoy to one product added to the choice set later). Through 38 replication attempts, their study showed that when the product attributes are depicted with perceptual representations and verbal descriptions (rather than numerical), the DE weakens, dies, or gives way to the repulsion effect. Through 91 replication attempts, Yang and Lynn (2014) also showed that replicating the DE is very difficult with verbal and pictorial depictions of product attributes. They went further and questioned the usefulness of marketing academics by accusing the studies of placing insufficient emphasis on the practical significance of the examined behavioral concepts. With the current research, we respond to concerns of Frederick et al. (2014) and Yang and Lynn (2014) by providing strong empirical evidence of the existence of the DE in a real product market.
In response to Frederick et al. (2014) and Yang and Lynn (2014), Simonson (2014) underlined the importance of recognizing the set formation, i.e., subjects being aware of decoy–dominant relationships, in being able to replicate the DE. He argued that consumers’ choices require them to make multiple trade-off contrasts simultaneously. As a result, they may not be able to make their decisions based on existing decoy–dominant configurations, especially if such configurations are difficult to detect. He thus called for a systematic study on the drivers of decoy–dominant detection. In addition, Huber et al. (2014) recognized the lack of practice of the DE in today’s product markets, noting that it is difficult to observe the DE in a real product market since the detection of decoys is typically very hard for consumers due to numerous alternatives with many attributes. With this research, we respond to Simonson’s (2014) call by explicitly modeling decoy–dominant detection in the studied online diamond market to empirically quantify the DE.
Our study is closely related to consumer choice models in economics and marketing literature. Classic multinomial logit and probit models are built upon the revealed preference assumption, thus they cannot directly account for context effects. Given the extensive documentation on the prevalence of context effects in the behavioral literature, a few empirical and analytical methods have been developed to incorporate these context effects into the choice models. Tversky (1972) formulated his well-recognized Elimination-By-Aspects (EBA) model to account for the similarity effect. Kamakura and Srivastava (1984) modified the standard multinomial probit model to account for the similarity effect by modifying the error structure through incorporating similarity-based error correlations. Kivetz et al. (2004) proposed a choice model that can account for the compromise effect. Orhun (2009) developed an analytical choice model to study the decoy and compromise effects under the loss-aversion assumption. Rooderkerk et al. (2011) proposed an empirical choice model that can incorporate decoy, compromise, and similarity effects all together. They used choice-based conjoint data to estimate their proposed model and showed that ignoring context effects significantly biases the choice model’s predictions. Our paper adopts a different approach by developing a proportional hazard model that explicitly accounts for the DE using pricing and sales data.
Since the studied online diamond retailer offers a large number of diamonds daily, as discussed earlier we separate the detection of decoys/dominants and the boost in sales upon dominant detection. Accordingly, our decoy/dominant detection component serves the role of a market-level consideration model. Due to that, our research is also related to the empirical literature that separately considers consumers’ consideration sets and choices. Existing studies in that domain use consumer-level data to model consumers’ consideration and choice decisions together. Some studies use purchase data only (see, for example, Siddarth et al., 1995; Chiang et al., 1998; Van Nierop et al., 2010), while others use purchase along with search-related data (see, for example, De los Santos et al., 2012; Honka, 2014). Unlike those studies, we don’t observe either purchase or search behaviors at the individual level. Hence, we derive a diamond-level hazard model from individual primitives, including consideration sets and choice. Consequently, our hazard specification provides a framework to separately estimate the market-level decoy/dominant detection from the DE by solely utilizing the aggregate product pricing and sales data.
In the following sections, we first describe the online diamond market, our dataset, and how we calibrate the decoy–dominant relationships, then we provide data evidence on the existence of the DE in this market. We develop our model framework in Section 5, and we present the estimation results and consider the DE’s managerial implications in the subsequent two sections. Finally, we conclude with a discussion of the current study’s limitations and offer directions for future research.
3 Data
3.1 Online Diamond Market
Several U.S. retailers emerged in the online market for diamonds and jewelry products in the past two decades. We use a panel data set from a major retailer in this market. In fiscal year 2015, the retailer reported net sales of $480 million. According to industry reports, it has approximately 50% market share of the U.S. online diamond market, with sales approximately three times greater than its closest competitor. These figures clearly indicate that the retailer is the leading player in the market.
The retailer sells a variety of jewelry products to end consumers, such as unbranded loose diamonds, gemstones, wedding rings,[1] bracelets, necklaces, and earrings. Loose diamonds account for the core part of the retailer’s business in terms of revenue contribution. According to its annual report, the retailer works with dozens of diamond suppliers worldwide under an “exclusivity” agreement, which requires suppliers to sell their diamonds only through the retailer’s online channel, and not through their own or other competing online and offline channels. For listed diamonds, the identities of the suppliers are not revealed on the retailer’s website so that consumers cannot differentiate diamonds based on the suppliers. Instead, consumers recognize only the retailer name as the diamond brand. To operate in a cost-efficient manner, the retailer uses a drop-shipping business model, i.e., the retailer, in most cases, does not physically carry inventories of loose diamonds listed on its website. Instead, it purchases diamonds from corresponding suppliers when consumers place their orders with the retailer. Unlike traditional brick-and-mortar stores, where only a limited number of diamonds are available, this drop-shipping model allows the retailer to list more than a hundred thousand diamonds every day.
In this setting, suppliers list their diamonds on the retailer’s website and establish wholesale prices. The retailer then adds a fixed percentage markup to the wholesale prices. Per the retailer’s annual report, the markup is fixed at around 18–20% for all diamonds. Because the retailer chooses a fixed margin over wholesale prices, the decoy pricing structure ultimately comes from the suppliers.2 Nevertheless, consumers are expected to respond to the decoy–dominant structure regardless of whether the retailer or the suppliers create it. That being the case, as we demonstrate later, the dominance structure still affects the retailer’s sales and ultimately its profitability to a large extent.
Since more than a hundred thousand diamonds are listed each day, the retailer provides filtering and sorting tools that help consumers to search for diamonds. On the website, a consumer can filter diamonds based on a desired range of diamond characteristics (such as price, carat, clarity, etc.). The website then returns a list of all the diamonds that fall into the filtering criteria on a single page in the default ascending price order. This one-page structure requires the consumer to scroll down to check all diamonds filtered. The web page displays each diamond in a row with its carat,
loose diamond accounts for more than 90% of the total price paid by consumers.
2We note that the supplier price variation may exist due to multiple reasons. First, due to consumer search costs, the observed prices can be the outcome of a mixed-strategy price equilibrium on the supplier side. Second, suppliers might have different costs, resulting in different pricing functions. Third, suppliers may change prices at different times. Understanding the source of the price variation is beyond the scope of the current study. Instead, we focus on quantifying the DE given the observed price variations in our data.
cut, color, clarity (4Cs henceforth), and price information. The consumer needs to further click into a diamond’s details page to check the other less-significant characteristics such as symmetry and polish. The website also allows the consumer to sort the diamonds based on any one of the 4Cs or price. Given abundant diamonds from the retailer, the consumer would still face a long list of diamonds (hundreds to thousands) even after a few rounds of filtering and sorting. This list typically includes many decoys and dominants that are not easy to detect without laborious investigation. In addition, because consumers are presented with a list of diamonds one at a time according to their set criteria, this search process resembles the non-sequential search that is consistent with our derivation of the diamond-level hazard model from individual primitives (see Web Appendix B for the details).
3.2 Data Description
We construct a panel data set of diamond prices and sales from this online retailer. We collect our daily data from the retailer’s website through a web crawler for the period from February 2011 to September 2011. For each diamond listed during our sample period, we observe the diamond’s inherent physical characteristics (its 4Cs and some other attributes such as symmetry and polish) and daily prices until the diamond is sold. In the data, diamond prices typically change over time: on average, each diamond’s price changes once every 21 days, conditional upon it being unsold. Figure 1 provides an example of price dynamics among three 1.0-carat diamonds from the day of introduction in the market till the end of the observation period.

Figure 1: Diamond Price Patterns Over Time
As seen in Figure 1, the diamond prices can go up and down, and each diamond may have a unique price pattern over time. We infer that a diamond is sold through the retailer’s website on the last day it is listed as available, based on its unique SKU number. We believe this is a reasonable approach because, as discussed earlier, the suppliers are under an exclusive channel agreement with the retailer so that the diamond sale would not have happened through other channels. On average, it takes about 50 days to sell a diamond. In our analysis, we focus specifically on round-shaped diamonds with prices ranging between $2K and $20K. Round-shaped diamonds are the most popular ones among those listed (74%) and sold (78%).[2] Diamonds in different price ranges might be more attractive to different segments of potential buyers with various budgetary constraints. To account for the potential heterogeneity in the DE across consumer segments, we further divide the diamonds into low- ($2K–$5K), medium- ($5K–$10K), and high-price ($10K–$20K) segments based on their first-day market prices.
Before calibrating the decoy–dominant structure, we first examine what determines diamond prices. We run a linear regression with (log- of) daily diamond prices as our dependent variable and the diamonds’ physical characteristics as independent variables to uncover the secret diamond-pricing formula. To control for potential demand variations across periods, we also add day fixed effects to the regression. We report the regression results in Table 1. The adjusted R-squared for the model with 4Cs, along with day fixed effects, is as high as 96.67%. Individual regressions for each day result in adjusted R-squared from 94.92% to 96.57%. The results provide strong evidence that the 4Cs are the predominant attributes in determining diamond prices.
To further check the robustness, we ran several regressions by incorporating other diamond attributes, such as symmetry and polish, into our main regression model. Overall, the R-squared measure does not improve. Moreover, the effects of these additional variables on prices are mostly insignificant, and their estimates are notably smaller in magnitude compared to the estimates of the 4Cs. For example, the implied price difference contributed by symmetry and polish turns out to be less than 0.5%. Thus, we have strong supporting figures to conclude that the quality of a diamond can be measured very precisely by simply looking at its 4Cs. Accordingly, in our analysis, we label each unique 4C combination as a grade. Characterizing a diamond as a combination of its 4Cs and price is, indeed, quite consistent with industry reports on diamond valuations and with articles educating consumers on purchasing diamonds. Even though the variation in diamonds’ physical attributes explains a large portion of the price variation, we still observe significant within-grade and within-day price variation. Specifically, in the data, the average of the ratios of price standard deviation to the mean price at each day-grade combination is 0.1, indicating a sufficiently large within-grade and within-day price variation. This variation is essential to characterize the dyadic decoy–dominant relationships among every diamond-pair, as discussed next.
Table 1: Diamond Price Regression Model: ln(price) on 4Cs and day fixed effects
Variable
Estimate
S.E.
Carat
1.768**
0.0004
Poor
0.000
Good
0.056**
0.0009
Cut
Very Good
0.114**
0.0009
Ideal
0.180**
0.0009
Signature Ideal
0.231**
0.0013
J
0.000
I
0.141**
0.0004
H
0.268**
0.0004
Color (Low to High)
G
0.361**
0.0003
F
0.455**
0.0003
E
0.503**
0.0004
D
0.583**
0.0004
SI2
0.000
SI1
0.124**
0.0003
VS2
0.274**
0.0003
VS1
0.371**
0.0003
Clarity (Low to High)
VVS2
0.455**
0.0003
VVS1
0.544**
0.0003
IF
0.619**
0.0004
FL
0.762**
0.0004
Daily Dummies
included
Adj. R-squared
96.67%
Adj. R-squared w/o daily dummies
95.27%
Adj. R-squared w/ daily separate regressions
94.92%–96.57%
Note: Estimates with ∗ and ∗∗ are significant at the 0.10 and 0.05 levels, respectively.
3.3 Dominance Construction
By definition, a diamond B is a decoy to another diamond A when B is inferior to A in at least one attribute, but has no superior attribute. In our specific setting, we define a diamond as a decoy under two conditions: 1) In terms of 4Cs, B is inferior in at least one attribute to A and has no attribute superior to A but has the same or a higher price than A; and 2) B has the same 4Cs as A but is priced at least 5% higher.[3] Under the two decoy definitions, for any two diamonds on a particular day, we define the relationship between them as follows: A dominates![]()
dominates
), and no dominance (A ∼ B).
In our data sample, every pairwise decoy–dominant relationship between all listed diamonds is constructed for each day.[4] For a particular diamond j, for each day t, we create two measures: number of diamonds that are decoys (NjtDecoy), and dominants (NjtDominant) to that diamond. The median number of decoys and dominants that a diamond has is 7 in the data sample, while the distribution is right-skewed. Since the number of diamonds varies across different grades significantly, there exist large variations in the number of decoys/dominants across grades. We normalize the two measures by dividing them by the number of diamonds in diamond j’s grade on day t, and label these grade-level percentage measures of decoys and dominants as
and RjtDominant,
respectively.
4 Data Evidence of the DE
In this section, we discuss some data patterns and reduced-form analyses that are suggestive of the DE. We show that important data patterns cannot be solely explained by alternative mechanisms such as the reference price effect or consumer search without considering the DE.
4.1 Dominance Types and Diamond Sales
Based on the dominance construction, on day t, diamond j can belong to one of the four mutually exclusive groups: 1) neither decoy nor dominant: RjtDecoy = 0 and RjtDominant = 0; 2) decoy only: RjtDecoy = 0 and RjtDominant > 0; 3) dominant only: RjtDecoy > 0 and RjtDominant = 0; and 4) both decoy and dominant: RjtDecoy > 0 and RjtDominant > 0. The middle column of Table 2 shows the count of diamond-day observations for each diamond type. Due to the significant within-grade price variations, we observe that a majority of the diamonds fall into the both decoy and dominant type. However, even in the smallest group, i.e., neither decoy nor dominant, we have a sufficient number of observations (27,077) to allow the identification of our model, as we discuss later.
We summarize the percentage of diamonds sold in the total diamond-day observations across different diamond types in the last column of Table 2. Each cell is calculated by dividing the number of diamonds sold in each diamond type by that type’s total diamond-day observations. For example, there are 404,283 decoy only diamond-day observations, out of which 6,792 were sold, yielding the average sale probability of 1.68%. Table 2 shows that decoy only diamonds have the lowest average sale probability (1.68%), while the opposite is true for dominant only diamonds (2.53%). The both type diamonds have a slightly higher average sale probability than the neither type.
Table 2: Summary Statistics Across Diamond Types
Diamond Type
Diamond-Day Observations
Daily Percentage Sales
Neither decoy nor dominant
27,077
1.96%
Decoy only
404,283
1.68%
Dominant only
340,456
2.53%
Both decoy and dominant
1,945,009
2.08%
Total
2,716,825
2.07%
In addition to the diamond-level sale probability calculations, we explore how the overall dominance structure impacts the sales shares of different diamond types using two linear regressions. We use the daily sales share of decoys (in percentage) and the share of dominants as the dependent variables and include segment dummies and percentage of decoy only and dominant only diamonds in each segment as independent variables. Results reported in Table 3 show that the percentage of decoy only diamonds significantly increases the decoys’ sales share, while the percentage of dominant only diamonds significantly increases the sales share of dominants but reduces that of decoys.
Table 3: Diamond Sales Share Regression
Decoy Sales Share Dominant Sales Share
Variable
Estimate
S.E.
Estimate
S.E.
Intercept
0.023
0.026
−0.048
0.040
Medium Segment(5K−10K)
0.006
0.016
−0.082∗∗
0.025
High Segment(10K−20K)
−0.008
0.021
−0.086∗∗
0.032
% Decoys
0.899∗∗
0.198
0.161
0.301
% Dominants
−0.312∗∗
0.155
1.972∗∗
0.236
Adj. R-squared
0.118
0.117
Note: Estimates with ∗ and ∗∗ are significant at the 0.10 and 0.05 levels, respectively.
Qualitatively, the patterns presented in Tables (2) and (3) could be explained by reference price effect or consumer search. Dominants are in general more preferred by consumers because their prices are relatively lower than comparable diamonds, whereas the opposite is the case for decoys (reference price effect). Beyond the relative price disadvantage, a decoy only diamond may also be detected as a decoy by a segment of consumers who search extensively, and it would never be purchased upon detection, leading to a further reduction in the sale probability (consumer search). Notably, the average sale probability of a decoy is still much larger than zero; thus, it is critical to control for decoy detection in our analysis. The sales share regression results in Table (3) are also directionally consistent with consumer search because having relatively more decoy only diamonds in the market would reduce the chances of consumers discovering these decoys along with their dominants. In other words, as there is a larger share of decoy only diamonds, the size of the market segment detecting them as decoys is expected to decrease, leading the sales share of decoys to increase. Similarly, as the percentage of dominant only diamonds increases, consumers are more likely to include these diamonds in their search process, leading to an increase in the sales share of dominants.
Quantitatively, however, the magnitude of differences in the average sale probabilities across the diamond types and the magnitude of the regression estimates could be suggestive of additional effects beyond reference price and consumer search. The average sale probability difference between a dominant only and a neither type diamond (0.57%) is twice the difference between a neither type and a decoy only diamond (0.28%). Since buying decoys and dominants can be seen (by consumers) as monetary losses and gains, respectively, based on the prospect theory (Tversky, 1972), reference price effect would predict the exact opposite pattern, i.e., the former probability difference is expected to be smaller than the latter. Consumer search, on the other hand, could explain the significant reduction in the average sale probability for decoys (due to the segment detecting them never purchasing them), but it could hardly explain the significant boost in the sales of dominants, especially when the effect is twice that of decoys. In addition, we would expect the sales share of dominants to increase proportionally with the increase in the percentage of dominants based on random consumer search, yet the estimated elasticity is as high as 1.97, significantly larger than 1.0, suggesting that including additional dominant only diamonds would extract a dis-proportionally larger sales share from other diamonds. All this evidence suggests that there is a significant boost effect in the sale probabilities for dominants, which is not completely explained by reference price effect or consumer search, yet is very consistent with the predictions from the DE.
We further develop formal statistical tests to show evidence consistent with the DE instead of reference price effect and beyond consumer search explanations in the following subsections.
4.2 Decoy Effect vs. Reference Price Effect
The marketing literature has extensively examined the importance of relative price comparisons in consumers’ purchase decisions (Monroe, 1973). The reference prices could be formed based on contextual factors such as the distribution of market prices (Biswas and Blair, 1991; Rajendran and Tellis, 1994), or temporal anchoring stimuli such as a product’s past prices (Lynch Jr et al., 1991; Rajendran and Tellis, 1994). In our particular context, given that diamonds are not repeat-purchase products and there are thousands of diamonds available each day, it is more likely that consumers form reference prices based on contextual factors rather than temporal anchors (Kalyanaram and Winer, 1995). When consumers use average grade-level price as the reference, the prospect theory (Kahneman, 1979) predicts that 1) consumers respond positively to dominants (in the monetary gain domain) and negatively to decoys (in the loss domain); and 2) the response to decoys would be stronger than to dominants (of the same size) due to loss aversion.
Even though the reference price effect (RPE henceforth) and decoy effect may look similar at first glance, we would like to emphasize a few notable differences that we illustrate in Figure 2. First, the RPE gradually vanishes as the monetary gains or losses approach zero. In other words, the RPE predicts the sale probability function to be continuous as the price of a diamond moves from the gain to the loss domain (i.e., around the reference price point). To the contrary, under the decoy effect, we expect a discrete jump in the sale probability function around the reference

Figure 2: Sale Probability Function According to Different Theories
point—because as a non-decoy diamond turns into a decoy one, a market segment emerges with zero probability of purchasing this decoy upon detection, resulting in a downward jump in the sale probability function around the decoy threshold price. The opposite becomes the case as a non-dominant diamond turns into a dominant one: a market segment with significantly large purchase probability emerges, and this segment induces the sale probability to jump upward around the dominant threshold price. Thus, the sale probability function is expected to be discontinuous around the reference price point with the decoy effect. Further, the magnitude of the discrete jump (around the reference price) depends on the market-level detection probabilities for decoys; and on both the market-level detection probabilities and the strength of the DE (i.e., sales boost) for dominants.
Second, the predicted relative effects across the domains (loss to gain/decoy to dominant) are different under the RPE and the DE predictions. The RPE predicts the slope of the sale probability function to be steeper for decoys than for dominants. We would argue the opposite is more likely to be the case under the decoy effect explanation, because further reducing the prices of dominants would not only increase the size of the consumer segment who detects them as dominants but also elevate the attractiveness of these dominants (i.e., the sales boost level) upon detection. However, as prices of decoys increase, it gets easier for consumers to detect them, i.e., the size of the market segment that fails to detect them decreases. Note that only this shrinking segment would respond to the increasing prices (or monetary losses) of decoys since the other segment would eliminate them upon detection irrespective of the size of the loss amount. Therefore, with the DE, the sale probability function is expected to have a flatter slope in the loss domain.
We estimate three diamond-level logistic sales regressions to test whether our data support the DE theory rather than the RPE. The results of these three specifications are reported in Table 4. In the regressions, we control for the following variables: a diamond’s 4Cs, daily demand effects such as weekday and holiday dummies, the Google search indexes for diamond-related keywords, and the number of diamonds in the same grade. We define the reference price as the average price of diamonds belonging to the same grade. Central to our test, we include separate intercepts and separate price slopes (PGain and PLoss as percentage differences relative to the reference price) for diamonds in the Gain and Loss domains in the first regression. We replace the definition of Gain and Loss with Dominant and Decoy, and the relative prices will be interacting with the Dominant and Decoy dummies in the second and third regressions. The difference between the last two specifications is that the definition of Dominant and Decoy is based on whether a diamond has any decoys or dominants in specification (I), while in specification (II) we use a stricter definition of whether a diamond is a decoy only or dominant only. The regression specifications follow the tradition in empirical tests of the RPE (Hardie et al., 1993; Bell and Lattin, 2000).
Table 4: Diamond Sales Response Function with Logistic Regressions
Variable
Gain/Loss
Decoy/Dominant(I)
Decoy/Dominant(II)
Estimate
S.E.
Estimate
S.E.
Estimate
S.E.
Controls
included
included
included
Intercept—Gain
−0.028
0.019
Intercept—Loss
−0.006
0.019
Intercept—Dominant
0.152∗∗
0.014
0.140∗∗
0.018
Intercept—Decoy
−0.186∗∗
0.014
−0.205∗∗
0.019
Slope—PGain
−2.444∗∗
0.145
Slope—PLoss
−1.733∗∗
0.156
Slope—PGain × Dominant
−1.491∗∗
0.131
−1.831∗∗
0.192
Slope—PLoss × Decoy
−1.059∗∗
0.137
−0.567∗∗
0.211
− 0.686∗∗ 0.215 0.432∗∗ 0.218 1.264∗∗ 0.292
Note: Estimates with ∗ and ∗∗ are significant at the 0.10 and 0.05 levels, respectively.
The results show a very consistent pattern with the price coefficient more negative in the gain/dominant domain than in the loss/decoy domain across the three regressions. The differences between the slopes (loss minus gain) are all significantly positive at the 0.05 level, as shown in the table. In addition, intercepts for dominants are significantly positive while those for decoys are significantly negative in the last two regressions, indicating the existence of a discrete jump around the reference price point. This is consistent with our earlier discussion about the role of the DE. Lastly, the models with the DE outperform the RPE-only model based on the AICs. These results are all robust to alternative model specifications, such as separating the effects across the three price segments and including quadratic terms for relative prices. Overall, our test results show strong statistical support favoring the DE over the RPE, as illustrated in Figure 2.
4.3 Decoy Effect vs. Consumer Search
As Simonson (2014) emphasized, consumers’ detection of decoy–dominant relationships is a precondition for the DE. Given the extremely large number of alternatives in the online diamond market, consumers rely on search to finalize their consideration sets. Thus, consumer search is an inherent component in our test for the DE (see our Model section). In this subsection, we develop a formal statistical test to check whether the data patterns can be solely explained by consumer search, or if there is evidence of the DE in addition to consumer search. Our test relies on the observed market price dispersion and sales information. We provide the details of this test in Web Appendix A. The intuition behind the test is that under pure consumer search without the DE, a supplier sets prices to maximize the expected profit of each individual diamond, and thus identical diamonds with different prices are expected to generate the same level of profit for the supplier in equilibrium (Burdett and Judd, 1983). However, if there is DE along with consumer search, a supplier needs to consider the price optimization beyond each individual diamond because decoys would serve as “loss-leaders” helping the supplier get higher expected profits from the dominants. Consequently, one would expect the profit contribution of dominants to be higher than that of decoys.
Under the hypothesis of no DE (i.e., there is only search effect), we can utilize the observed prices and sales information to recover the cost of each diamond j on day t (cjt) from the corresponding supply-side pricing optimality conditions. Recovered costs should be approximately the same for diamonds with identical 4Cs, and exactly the same for the same diamond over time. However, if there exists DE along with consumer search, because of the positive profit externality from decoys to dominants, the recovered costs from the optimality conditions will be higher for decoys and lower for dominants, as compared to their true costs. This essentially leads to a positive correlation between the diamonds’ recovered costs and their relative price levels. The test of the pure search effect versus the search effect along with the DE thus becomes the same as testing whether recovered costs are increasing with relative prices. Accordingly, we conduct two statistical tests: 1) using cross-diamond price variations (labeled as Test I); and 2) using within-diamond price variations over time (labeled as Test II). Results are provided in Table A1 in Web Appendix A. Results consistently reject the null hypothesis of a pure consumer search explanation across all three diamond price segments. All the estimated coefficients of relative price levels (rpjt) are positive and significant, and thus are directionally consistent with the hypothesis supporting the existence of the DE together with consumer search in our data.
To sum up, our data analyses yield the following results: 1) A diamond’s sale probability largely depends on whether it is a decoy and/or a dominant; 2) It is critical to account for decoy–dominant detection in validating the DE; and 3) There is suggestive evidence of the DE beyond alternative mechanisms of the RPE and consumer search. Hence, empirical investigation of the decoy phenomenon requires a deliberately developed model, which we introduce in the next section.
5 Model
We develop a diamond-level proportional hazard model to study the daily sale likelihoods of diamonds. We model a diamond’s sale hazard on a given day with two components: daily diamond sale hazard and dominance hazard. Separating a diamond’s daily sale likelihood and the DE is a challenging task since consumer-level search and purchase behaviors are unobserved. To achieve the objective, we derive our diamond-level proportional hazard from the underlying consumer primitives including consumer arrival process, search, consideration set formation, and conditional choice probabilities with the embedded DE. Please see Web Appendix B for the details of our derivation including how the DE is embedded into consumers’ conditional choice probabilities, and how the proposed hazard specification can be used to quantify the DE at the market level. Following this derivation, we use the following to denote the proportional hazard that diamond j will be sold on day t:
hj(t) = ψj(Xjt;β) · φj(Djt;γ), (1)
where ψj(·) is the daily diamond sale hazard and φj(·) is the dominance hazard. We next discuss each component with the choice and rationale of the corresponding variables. We present an overview of the variables in Table 5.
Table 5: List of Variables Used in Model Estimation
Variable Name Description
Daily Diamond Sale Hazard (Xjt)
Kj Diamond segment Segment dummies, low (2K–5K), medium (5K–10K), high (10K–20K)
Zt
Google search indexes
Daily Google search trends index of diamond-related keywords
Weekday dummies
Dummy variables of weekdays
Holiday dummies
Dummy variables for Valentine’s Day and federal holidays
Hj
Diamond characteristics
Dummy coded cut, color, and clarity of diamond j, log- of carat of diamond j, indicator of carat 1.0 for diamond j
pjt
Diamond price
Daily price of diamond j (in 1,000)
Wjt
Daily competitiveness
Log of number of diamonds of the same grade
Dominance Hazard (Djt) rpjt Relative price index
% price difference between diamond j and its grade-level average
Rjt Percentage decoys
Number of diamond j’s decoys / number of diamonds in j’s grade
Percentage dominants
Number of diamond j’s dominants / number of diamonds in j’s grade
spjt Price standard deviation
Standard deviation of prices in diamond j’s grade on day t
5.1 Daily Diamond Sale Hazard
The daily diamond sale hazard component (ψj(·)) captures the daily baseline sale likelihoods of diamonds without the DE consideration. In general, a diamond’s daily sale likelihood depends on a few essential factors. First, since diamond buyers with various budgetary constraints have different preferences for the 4Cs, diamonds at different price levels and grades are likely to have different sale likelihoods. Second, the sale likelihood of a diamond is expected to decrease as the number of similar diamonds listed increases. Third, because consumers may have different purchase intentions on different days, the sale likelihoods of diamonds may also change over time. For example, consumers may have different purchase likelihoods on different days of a week, or special occasions and holidays. Accordingly, we model the daily diamond sale hazard ψj(·) as an exponential function of 1) diamond j’s price segment (dummy coded low-, medium-, or high-price, labeled as Kj); 2) dummy-coded cut, color, and clarity, log- of carat[5] (labeled as Hj); 3) its price (labeled as pjt); 4) log- of the number of diamonds with the same 4Cs (labeled as Wjt); 5) Google search trends to capture consumer interests, weekday and holiday dummies (labeled as Zt).
With Xjt = {Kj,Hj,pjt,Wjt,Zt}, and β = {βK,βH,βp,βW ,βZ}, we define the daily diamond sale hazard as,
ψj(Xjt;β) = exp(bj + KjβK + HjβH + pjtβp + WjtβW + ZtβZ). (2)
We use a random effect specification, i.e., exp(bj) ∼ Γ(1,σ2), to allow the intercept term to be diamond-specific.[6] It helps capture unobserved correlations in sale likelihoods among diamonds. This is important in our setting, because if there exist demand shocks that affect two diamonds at the same time, without this specification the effect of one diamond on the other one’s demand would likely be inferred with biases. We adopt the Gamma distribution assumption following Lancaster (1979), in which the random effect can be analytically integrated out.
5.2 Dominance Hazard
We capture the DE by our dominance hazard component φj(·). Per our derivation in Web Appendix B, a diamond’s sale likelihood further depends on 1) the probability/size of the market segment that detects a diamond to be a decoy/dominant dominant; and 2) the sale boost for a dominant upon the detection of it having decoys. We separate these two parts as the market-level decoy–dominant detection and the dominant boost hazard.
5.2.1 Market-Level Decoy–Dominant Detection
The likelihood of detecting decoys and dominants in the diamond market may depend on various context-related factors. First of all, it should naturally depend on the decoy–dominant structure in the market. As consumers typically sample diamonds based on their desired grades using the filtering and sorting tools, a diamond with a larger percentage of decoys/dominants in its grade would have a higher chance of being included together with its decoys/dominants in the consideration sets. Second, the relative price of a diamond in its grade matters because a diamond is more likely to stand out either on the top or the bottom of the returned lists under the default price sorting design when its absolute relative price differences get larger. Third, diamond purchases are unique, first-time experiences for most consumers who have limited knowledge regarding diamond pricing. Thus, learning about the prices becomes an inherent part of their decision process. A greater variation in the prices of comparable diamonds indicates more opportunities to save, which will likely motivate consumers to spend more time on the retailer’s site searching and comparing alternatives. Consequently, they would be more likely to detect existing decoy–dominant relations. As such, we expect the grade-level price dispersion to be another moderator of dominance detection. Based on the above rationales, we model the market-level decoy–dominant detection part of our dominance hazard as a function of the following variables: 1) percentages of decoys and dominants that diamond j has (RjtDecoy and RjtDominant), 2) the relative price measurement (rpjt), and 3) the price standard deviation in diamond j’s grade (spjt).
We denote the market-level decoy and dominant detection probabilities as
and
PrjtDominant(·), respectively. Given Djt = {Kj,RjtDecoy,RjtDominant,rpjt,spjt}, we model these two terms as the following:
, (3)
where I(·) is the indicator function and VjtDominant and VjtDecoy are specified as:
VjtDominant = Kjγ0Dominant + γ1Dominant ln(RjtDecoy) + γ2DominantI(rpjt < 0)(−rpjt) + γ3Dominantspjt
.
VjtDecoy = Kjγ0Decoy + γ1Decoy ln(RjtDominant) + γ2DecoyI(rpjt > 0)(rpjt) + γ3Decoyspjt
(4)
The intercept terms (γ0Dominant and
) are modeled at each of the diamond price segments since decoy–dominant detection probabilities defined in Equation (3) might differ across various market segments with different consumer budgetary levels. The other γ s capture how the percentage measures of decoys/dominants, the relative price, and standard deviation of the grade-level prices would impact the likelihood of decoy–dominant detection probabilities.
5.2.2 Dominant Boost Hazard
Upon a diamond being detected as a dominant, the size of the boost in its sale likelihood, i.e., the DE, may also depend on a few important context-related factors. First, all else equal, dominants with more decoys are likely to become more attractive (DE is also called attraction effect) compared to diamonds with fewer decoys, especially when consumers engage in multiple comparisons. Second, based on the prospect theory (Kahneman, 1979), a consumer’s purchase decision depends on whether the price paid is perceived as fair with respect to her reference price point. Since diamonds are not repeat-purchase products, a consumer is expected to form her reference price based on the prices of available diamonds rather than the past price histories (Mazumdar et al., 2005). Thus, upon detection of a dominant, we expect its sale likelihood to increase as the size of the price gain relative to comparable diamonds increases. Lastly, following our discussion on consumer learning in dominance detection, we expect consumers to spend less time as the variation of the grade-level prices decreases. Accordingly, we expect that with a smaller price dispersion, consumers become more likely to settle down with their detected dominants rather than continuing the search for other diamonds. As such, we expect the sale boost of a detected dominant to be more substantial as the within-grade price variation decreases. Given these rationales, we include the same set of variables as in the dominance detection component. We label Djt = {Kj,RjtDecoy,RjtDominant,rpjt,spjt}, and define the dominant boost hazard Qjt as follows:

Similar to the intercept term of the market-level decoy–dominant detection probabilities, we model the intercept term
at each of the diamond price segments since the dominant boost sizes might differ across market segments. The other γs capture how the percentage measures of decoys/dominants, the relative price, and standard deviation of the grade-level prices would impact the sales boost upon dominant detection.
Given market-level detection probabilities and dominant boost hazard specifications, we operationalize the dominance hazard as follows:
1(1 − PrjtDecoy)
φj(t|·) =
(1 − PrjtDominant) + PrjtDominantQjt
if j is Neither
if j is Decoy Only
.
if j is Dominant Only
(6)
if j is Both
We would like to note that we made an implicit assumption in the derivation of the dominance hazard in Equation (6). We assume that once a consumer detects a specific diamond to be a decoy, she would never purchase it, as she can always choose the dominant one. This assumption is consistent with the existing literature. For example, Huber et al. (1982) verified that fully informed subjects would seldom make “mistakes” of choosing decoys in lab experiments.
Equation (6) shows how the dominance hazard depends on a diamond’s type: if it is neither decoy nor dominant, the DE has no impact on the sale hazard of the diamond, i.e., the dominance hazard is normalized to one. If the diamond is decoy only type, it is considered only by the consumer segment that fails to detect it as a decoy under our assumption. Further, the DE does not play a role in the sale hazard of the diamond given no detection, resulting in the overall dominance hazard being the size of this segment, i.e., φj(·) = 1−PrjtDecoy(·). For a dominant only type diamond, there exist two market segments: the segment that fails to detect the diamond as a dominant, and the segment that is able to detect it. The DE does not have any impact on the former segment (i.e., Qjt(·) = 1), while we expect a boost in sale hazard (i.e., Qjt(·) > 1) for the latter. The overall sale hazard thus becomes the expression in the third line of Equation (6). Finally, if the diamond is a both decoy and dominant type, it is considered for purchase only by the market segment that fails to detect it as a decoy, with the size of the segment being 1 − PrjtDecoy(·). Similar to the decoy only case, the remaining consumer segment never purchases it, i.e., Qjt(·) = 0. The segment that fails to detect the diamond as a decoy can be divided into two sub-segments: the sub-segment that fails to detect the diamond as a dominant and the one that is able to detect it. Similar to the dominant only case, the former sub-segment with size 1 − PrjtDominant(·) will not be impacted by DE, i.e., Qjt(·) = 1, while the sale hazard from the other sub-segment will be boosted by Qjt(·) > 1.
Combining all the scenarios, for the dominance hazard, we have the expression in the last line of
Equation (6).
5.3 Model Estimation
Based on the model components outlined above, we can further arrange the terms and derive the following log-hazard representation (see Web Appendix C for details):
lnhj(t) =
−I(Decoy)ln(1 + eDjtDecoyγDecoy)
| {z }
daily diamond sale hazard dominance hazard of a decoy
(7)
h DjtDominant(γDominant+γBoost)) − ln(1 + eDjtDominantγDominant)i,
+ I(Dominant) ln(1 + e
| {z } dominance hazard of a dominant
where DjtDecoy = {Kj,lnRjtDominant,I(rpjt > 0)rpjt,spjt} and DjtDominant = {Kj,lnRjtDecoys,I(rpjt < 0)(−rpjt),spjt}. The effects of being a decoy and being a dominant on the sale hazards are clearly outlined in the last two terms of this representation.
Denote the total number of days since diamond j (in total J diamonds) is on market to the end of our observation period as Tj, and the day diamond j is sold since its introduction as Tjs. Following the derivation in Lancaster (1979) for the random effect hazard model, the total likelihood we use for estimation becomes the following:
, (8)
where
is the survivor function and h¯j(τ) is the mean value of the hazard hj(τ) where bj = 0. Notice that
, i.e., it reduces to the case without random effects. We estimate the model by using the maximum likelihood approach.
We now discuss a few properties of our model. First, the unit of our analysis is each diamond, which is different from classic choice models. Second, in terms of how to model the DE conditional on dominant detection, we choose to use a scalar function Qjt(·). When Qjt(·) > 1, our specification becomes consistent with the DE theory, i.e., upon detection, there is a boost in sales for dominant diamonds. In other words, under our framework, testing the existence of the DE becomes the same as testing whether Qjt(·) > 1. Details of this test are provided in Web Appendix B. Third, it is quite possible that consumers are heterogeneous, so we allow our daily diamond sale hazard ψj(·) and the dominance hazard φj(·) to differ across different diamond price segments.
5.4 Model Identification
Our identification strategy relies on the fact that it takes different numbers of days to sell different types of diamonds (neither decoy nor dominant, decoy only, dominant only, or both decoy and dominant); or equivalently, there are variations in the daily sale probabilities across different types of diamonds. First, based on the normalization in the first line of Equation (6), the sale hazard for the neither type equals the baseline daily diamond sale hazard. Thus, we use the variation of the time it takes to sell neither type diamonds with different Kj, Hj, Wjt, and Zt in Xjt to identify the parameters β in the daily diamond sale hazard component. Second, conditional on the identification of β, we identify the parameters related to the detection probabilities and dominant boost hazard. Since decoy diamonds could only be purchased by the market segment that fails to detect them as decoys (see the second line of Equation (6)), we use the variation of time it takes to sell decoy only diamonds with different RjtDominant, rpjt, and spjt in Djt to identify the parameters of the market-level decoy detection probabilities, i.e., γDecoy (also see the second component of Equation (7)). Third, as seen in the third and fourth lines of Equation (6), it is not possible to separately identify the parameters of the market-level dominant detection probabilities (γDominant) and of the dominant boost hazard (γBoost) since PrjtDominant and Qjt are always bundled together in the form of (1 − PrjtDominant) + PrjtDominantQjt. To separately identify γDominant from γBoost, we make the following assumption: γDecoy = γDominant, i.e., all else equal, the market-level probability of detecting a diamond with n decoys as a dominant is identical to the probability of detecting a diamond in the same grade with n dominants as a decoy. Since the decoy–dominant relationships are calibrated at the diamond-pair level, the probability of discovering one diamond dominating another is the flip side of discovering that one is dominated by the other. Thus, it is reasonable to assume parameters quantifying the decoy and dominant detection probabilities in the diamond market to be the same.[7] Based on this symmetric market-level detection assumption, and conditional on the dominant detection parameters (γDominant) being identified, we use the variation of the time to sell dominant only and both decoy and dominant type diamonds with different Djt to identify the parameters of the dominant boost hazard, γBoost. In our empirical setting, the diamond prices change over days and subsequently the decoy–dominant structure also changes daily. This data variation empowers the identification of our model parameters (β, γDecoy, and γBoost). Finally, variations in the time it takes to sell diamonds with similar Xjt and Djt enables identification of the unobserved heterogeneity σ2.
In addition, the identification of the parameters relies on our exclusion restrictions, i.e., Xjt and Djt enter into separate model components. The variables used in Xjt (e.g., the diamond’s characteristics, price, daily demand shifters, etc.) are either directly related to diamond j itself or the overall demand in the market. Accordingly, we input Xjt into the daily diamond sale hazard component. In contrast, the variables in Djt (percentage of decoys/dominants, relative price, within-grade price variation) are context-related since they require diamond j to be considered along with other comparable diamonds. Accordingly, we include them in the dominance hazard component to quantify the studied context effect, i.e., the DE.
6 Results
6.1 Main Estimation Results
We report our estimation results in Table 6. Results suggest that the daily diamond sale hazard increases with the diamond’s carat size. In addition, 1.0-carat diamonds are significantly easier to sell. As expected, as a diamond’s price increases, its sale hazard decreases. Regarding the cut, color, and clarity attributes, we observe an inverse U-shape relationship, i.e., the daily diamond sale hazard is the largest for diamonds with moderate cut, color, and clarity levels.[8] Estimates of daily demand proxies suggest that Google search indexes for diamond-related keywords are significant proxies for the sales. The daily diamond sale hazard increases significantly when the search indexes on the keywords of “engagement ring” and the studied retailer’s name are high, whereas it decreases when the search intensity is high on the keywords of “diamond,” “diamond ring,” and the competitor’s name. The daily diamond sale hazard also differs significantly across weekdays, with Thursday being the best day for diamond sales, while Saturday and Sunday are the worst days. One explanation might be that people use weekends to do research on diamonds, possibly go to brick-and-mortar stores and educate themselves about how to pick up the right diamond, and then place their orders online during weekdays. The results also suggest that diamonds are easier to sell during holidays. Regarding the competition from other diamonds, we find that, intuitively, as the number of same-grade diamonds increases, the daily sale hazard decreases. Lastly, the estimated variance of the random effect is reasonably large, i.e., the standard deviation is around 27% of the intercepts, highlighting the importance of controlling for unobserved consumer heterogeneity in diamond sales.
Table 6: Model Estimates
Variable
Estimate
S.E.
Daily Diamond Sale Hazard
Low-Price Segment (2K−5K)
−3.349**
0.114
Medium-Price Segment (5K−10K)
−3.169**
0.121
High-Price Segment (10K−20K)
−3.312**
0.131
ln(Carat)
0.462**
0.079
Is 1.0 Carat
0.542**
0.018
Price (in 1000)
−0.057**
0.006
Cut: Poor
0.000
Cut: Good
0.325**
0.096
Cut: Very Good
0.835**
0.095
Cut: Ideal
1.227**
0.094
Cut: Signature Ideal
0.431**
0.127
Color: J
0.000
Color: I
0.075**
0.029
Color: H
0.228**
0.030
Color: G
0.248**
0.031
Color: F
0.299**
0.032
Color: E
0.065*
0.035
Color: D
0.046
0.039
Clarity: SI2
0.000
Clarity: SI1
0.166**
0.023
Clarity: VS2
0.310**
0.025
Clarity: VS1
0.248**
0.027
Clarity: VVS2
0.078**
0.031
Clarity: VVS1
−0.112**
0.036
Clarity: IF
−0.564**
0.044
Clarity: FL
−0.300
0.415
Google Search: “diamond”
−0.205*
0.110
Google Search: “diamond ring”
−0.527**
0.065
Google Search: “wedding ring”
−0.003
0.057
Google Search: “engagement ring”
0.079**
0.031
Google Search: retailer’s name
0.183**
0.020
Google Search: competitor’s name
−0.446**
0.064
Weekday: Monday
0.000
Weekday: Tuesday
−0.026*
0.014
Continued on next page
Table 6: Model Estimates
Variable
Estimate
S.E.
Weekday: Wednesday
−0.053**
0.014
Weekday: Thursday
0.052**
0.014
Weekday: Friday
−0.186**
0.015
Weekday: Saturday
−1.737**
0.026
Weekday: Sunday
−1.158**
0.021
Is Holiday
0.033**
0.013
ln(# Diamonds of the Same Grade)
−0.026**
0.011
σ2 (Variance of the Random Effect)
0.787**
0.012
Dominance Hazard—Market-Level Detection Probability

Dominance Hazard—Dominant Boost Hazard

Note: Estimates with ∗∗ and ∗ are significant at the 0.05 and 0.10 levels, respectively.
We now discuss the estimation results regarding the market-level decoy–dominant detection probabilities and the dominant boost hazard, which are the most critical components of our model for addressing the paper’s central research questions. First, our results suggest that the base market-level detection probability of a decoy (or dominant) diamond is the highest for diamonds in the low-price segment and the lowest in the high-price segment. One potential explanation is that consumers of the low-price ($2K–$5K) segment are usually on tight budgets and are more motivated to spend extra time searching for better prices, leading to larger consideration sets. As a result, they are more likely to detect existing decoy–dominant relationships. The positive significant estimate of (log- of) grade-level percentage of decoys/dominants (0.241) shows that when a larger percentage of diamonds are decoys in a diamond’s grade, it gets relatively easier for the market to detect a diamond as a dominant. The positive significant estimate of the relative price index (1.816) shows that the further a decoy/dominant diamond is priced from the average grade-level price, the higher is the probability of decoy/dominant detection in the diamond market. In addition, results suggest that as the price variation increases in a grade, the market-level detection probabilities of decoys/dominants marginally increase in that grade. This finding suggests that as the within-grade price variation gets larger, consumers search more to identify decoys (and dominants). By using our model estimates, we next calculate the market-level decoy–dominant detection probabilities. Interestingly, we find that for decoy and dominant diamonds, the market-level detection probabilities are quite low: 0.25 for the low-, 0.17 for the medium-, and 0.11 for the high-price segments. These findings show that our real-life scenario with a large number of diamonds defined on 4Cs and price with many decoys and dominants greatly contrasts with usual lab settings, in which participants are almost always aware of the decoy–dominant relationships by the construction of the experiments. Along this line, Simonson (2014) stated that consumers’ detection of decoy–dominant relationships among the existing choice alternatives is a vital precondition for decoys to increase their dominants’ sale likelihoods and called for a systematic study that accounts for decoy–dominant detection. Responding to the call, our study contributes to the literature by developing a framework to guide marketing researchers about how to quantify decoy–dominant detection probabilities in real product markets with pricing and sales information.
Second, the intercept estimates of our dominant boost hazard component are all positive, confirming that, upon dominant detection, the sale hazard would be significantly boosted. This provides direct and conclusive evidence of the existence of the DE in a real product market. The demand boost effect is lower for the medium-price segment (0.552) compared to the low- (0.965) and high-price (1.072) segments. The parameter estimate for the (log- of) percentage of decoys is positive and significant (0.072), indicating that having more decoys would increase the dominant diamonds’ sale hazards. The relative price measure is also positive and significant (1.227), suggesting that as a dominant is discounted further away from the grade-level average price, its sale likelihood is boosted more. The estimate of the grade-level price dispersion turns out to be insignificant. We next calculate the average dominant boost effect in proportional terms using the estimates. We find that, conditional on a diamond being detected as a dominant, its sale hazard increases by 2.7 times for the low-, 1.8 times for the medium-, and 3.2 times for the high-price segments, suggesting that—due to the DE—there is a substantial boost in the sale hazards of dominant diamonds.
In summary, our estimation results show that in general, consumers have a low chance of detecting
the decoy–dominant relationships in the online diamond market, especially among diamonds in the high-price segment. Thus, it is critical to model the decoy–dominant detection process in real product markets in order to correctly quantify the sales impacts of decoys on dominants. On the other hand, even though the market-level decoy–dominant detection probabilities are low, once an alternative is detected as a dominant, its sale hazard increases quite significantly, especially in the low- and high-price segments. With this finding, we not only provide strong field evidence about the existence of the DE, we also respond to Frederick et al. (2014) and Yang and Lynn (2014), who questioned the practical validity and usefulness of the DE.
6.2 Model Comparison and Robustness
In addition to our main model estimation, we conduct comparisons with two simpler modeling approaches and check the robustness of the results with three alternative specifications. We show that one would obtain worse data fits and biased inferences of the DE under these alternative approaches. We also confirm that the estimates are very robust under alternative specifications with other ways of heterogeneity control and different specifications of the reference price.
One important modeling contribution of this paper is the separation of the DE from decoy– dominant detection. To test the importance of this separation, in our first model comparison, we estimate a benchmark proportional hazard model by including the same set of controls directly in the baseline hazard. Specifically, the log-hazard specification in Equation (7) becomes the following:
ln(hj(t)) =bj + Xjtβ + DjtDecoyγDecoy + DjtDominantγDominant. (9)
The second column of Table 7 (column “No Detection”) reports the estimates of the dominance variables for decoys (decoy shrinkage hazard) and dominants (dominant boost hazard). This simple modeling approach is outperformed by our proposed model based on the AIC, suggesting that explicitly separating the market–level decoy–dominant detection from the dominant boost hazard better explains data variations. More importantly, the alternative model’s estimates have a compound effect of detection and boost on diamonds’ sale hazards, causing inaccurate inferences in understanding the magnitude of the DE if one directly uses these estimates. Results suggest that dominant boost hazards from this benchmark model are much smaller (ranging from 1.1 to 1.4 times) compared to ones from the proposed model (1.8 to 3.2 times). Thus, not explicitly controlling for the market-level detection causes a significant under-estimation of the DE’s magnitude.
One potential explanation for the observed sales patterns is heterogeneous competitive effects. Specifically, if there exist two segments with different competition intensities, diamonds in the first are not affected by competition, whereas diamonds in the second are highly affected. Further, the likelihood of being in the second segment depends on a diamond’s relative price difference in the grade, so we may expect to see a shrinkage in the sales of decoys and a boost in that of dominants. To test this explanation, we use a simple 2-segment (competitive vs. noncompetitive) latent-class model with diamond price segment dummies and absolute relative price differences determining the competitive segment membership, and a competition hazard (conditional on being in the competitive segment) that is modeled by the (log- of) number of competing diamonds in the same grade. This specification yields a much higher AIC compared to our proposed specification (516,718 vs. 515,432), strongly supporting the DE story over the heterogeneous competitive effects explanation.
Table 7: Model Comparisons and Robustness Checks
Variable No Detection Homogeneous Reference Price
Fixed Effect
Diamond Random Effects included
included
included
Day Fixed Effects

Market-Level Detection Probability/Decoy Shrinkage Hazard

included
Dominant Boost Hazard
N. parameters 50

49 50
247
Log-likelihood −257,679
−262,156 −257,669
−254,071
AIC 515,458
524,410 515,438
508,636
Note: Numbers reported are mean estimates and standard errors. Estimates with ∗∗ and ∗ are significant at the 0.05 and 0.10 levels, respectively. Other controls (diamond 4Cs, competition, etc.) are included in the models but not reported here to save space.
We also conduct a series of robustness checks with alternative model specifications and report the results in Table 7. The first specification omits the random effect in the hazard specification (column “Homogeneous”), the second adopts a RPE specification with the relative price variables entering into the daily diamond sale hazard component instead of our dominance hazard specification (column “Reference Price”), and the last specification includes day fixed effects instead of Google search indexes and weekday dummies as the control for demand effects across days (column “Fixed Effect”). Across these three specifications, we find very consistent and robust results: most of the estimates have the same direction and very similar magnitude as in our main model. One notable difference is that the relative magnitude of the DE is reversed for the low- and high-price segments under the homogeneous model. We believe our random effect specification captures more unobserved heterogeneity across diamonds and thus should be more accurate, with the log-likelihood also much improved. The change in moving the relative price variables into the diamond daily hazard component seems to make little impact on the model estimates. In addition, the estimates again show that the sales function is more responsive in the gain domain than in the loss domain, consistent with the findings in our data pattern explorations. Finally, including day fixed effects could improve the model fit but it comes at the cost of almost 200 additional parameters. Importantly, the coefficient estimates for the dominance hazard component remain almost the same. Thus, we prefer our main model specification for being parsimonious.
7 Managerial Significance
We now explore the managerial implications of our study. We first quantify the DE’s overall profit impact to the retailer using model estimates. We also explore through simulation studies how sensitive the DE’s profit impact is with respect to changes in different aspects of dominance structure. These exercises shed new light on the DE’s practical significance, showing that it is not simply an experimental artifact.
To quantify the DE’s profit impact, we start with the retailer’s profit for a given diamond j at time t that can be calculated as:
πjt = Prj(t|·) × (pjt − wjt), (10)
where pjt is the price, and wjt is the wholesale price, which can be easily calculated by subtracting out the retailer’s mark-up of 18% (provided in retailer’s annual report) from the observed daily retail prices. Prj(t|·) = 1 − exp(−hj(t|·)) is the discrete-time hazard, or the probability that diamond j would be sold on day t, conditional on not being sold until that day. The DE’s impact on profit can be quantified by calculating the differences in the sale probability Prj(t|·) with and without setting the dominant boost hazard component Qjt(·) to 1.
We present the results in Table 8. Without the DE, on average, each diamond contributes $22.89 in gross profit each day; whereas with the DE, the contribution becomes $26.16. The DE increases the retailer’s overall gross profit by 14.3%, or equivalently, the DE contributes a 12.5% share of the retailer’s gross profit. The profit increase due to the DE is the largest in the low-price segment (25.4%), whereas the opposite is the case in the medium-price segment (8.6%). Based on the financial information of the retailer, this percentage increase would translate into an annual profit increase of $9 million dollars. This result shows that even though decoy–dominant detection probabilities are low in the online diamond market, the DE still has a very substantial profit impact due to the significant boost in sale likelihoods upon dominance detection. Indeed, this profit impact is what really matters the most from the substantive point of view. This result further mitigates the concerns of Frederick et al. (2014) and Yang and Lynn (2014) about the DE’s practical significance.
Table 8: The Impact of the DE on Retailer’s Gross Profit
Effect
2K–5K
5K–10K
10K–20K
Overall
Avg Daily Profit Per Diamond W/O the DE
8.71
30.17
43.67
22.89
Avg Daily Profit Per Diamond W/ the DE
10.92
32.76
49.72
26.16
% Profit Increase due to the DE
25.37%
8.58%
13.85%
14.29%
We next investigate the sensitivity of the DE’s profit impact to different dominance configurations in the diamond market. In the first simulation study, we check how the DE’s profit impact changes when each existing decoy/dominant diamond has one more decoy and/or dominant in the market. Results of this simulation study are presented in the top portion of Table 9. We find that, intuitively, having one more decoy would increase the profit (0.37%) given the effect of decoys in boosting the demand of dominants. The opposite is true on the profit of existing diamonds when we add one more dominant (-0.24%), because the decoy detection probability would increase, leading to a decrease of sales from those decoys. The retailer’s profit from existing diamonds slightly increases (0.14%) if each decoy/dominant diamond has one more decoy and one more dominant.
Table 9: Percentage Changes in the DE’s Profit Impact as the Dominance Structure Changes
2K–5K
5K–10K 10K–20K
Overall
Changing the Number of Decoys/Dominants
Each Diamond Has 1 More Decoy
0.43%
0.25%
0.39%
0.37%
Each Diamond Has 1 More Dominant
−0.26%
−0.25%
−0.17%
−0.24%
Each Diamond Has 1 More Decoy and Dominant
0.16%
0.01%
0.22%
0.14%
Changing the Price Variation
Multiplying the Dispersion by 0.5
−0.52%
−0.28%
−1.08%
−0.60%
Multiplying the Dispersion by 0.8
−0.22%
−0.10%
−0.47%
−0.25%
Multiplying the Dispersion by 1.2
0.22%
0.21%
0.49%
0.28%
Multiplying the Dispersion by 1.5
0.58%
0.48%
1.32%
0.74%
Changing the Market-Level Detection Probabilities
Market-level Detection Probability in the Data
0.25
0.17
0.11
0.19
Market-level Detection Probability of 0.05
−3.36%
1.49%
−2.51%
−1.41%
Market-level Detection Probability of 0.10
−1.71%
0.99%
−0.25%
0.01%
Market-level Detection Probability of 0.30
−0.45%
−4.03%
2.98%
−0.04%
Market-level Detection Probability of 0.40
−2.28%
−7.65%
1.66%
−2.40%
We examine the effect of price dispersion on the DE’s profit impact in the second simulation study. Specifically, we enlarge or shrink the relative prices of each diamond by a factor. For example, think about a diamond that is priced at $11,000, with a calculated mean grade price of $10,000. By preserving the mean price level, we change the price of this diamond to $10,500 (dispersion factor 0.5), $10,800 (dispersion factor 0.8), $11,200 (dispersion factor 1.2), and $11,500 (dispersion factor
1.5) in our simulation study. The middle portion of Table 9 reports the results of this simulation. Results suggest that the DE’s profit impact increases with the increase in price dispersion in the studied ranges. Based on our model estimates, an expanded price dispersion would increase both the detection probabilities of decoys/dominants and the effect of demand boost on dominants. The profit gains from dominants (due to higher detection and sales boost) outperform the losses from decoy sales (due to higher detection), leading to an increase in the retailer’s profit.
Given that dominance detection is a critical pre-condition for the DE, we evaluate the profit impact of the DE under different dominance detection levels in our last simulation study. By changing the intercepts in the detection equation, we set the market-level detection probabilities at 5%, 10%, 30%, and 40%. The detection probabilities are between 25% for the low-price segment and 11% for the high-price segment in the data. Results reported in the bottom portion of Table 9 suggest that the retailer could gain additional profits when the detection probability is lower in the medium-price segment and higher in the high-price segment, as compared to the current levels. On the other hand, the current detection level in the low-price segment seems to be quite optimal in terms of driving profits. These patterns are driven by the fact that dominance detection has opposite impacts on the profits from decoys versus dominants. Thus, the optimal dominance detection levels critically depend on the magnitude of the DE across the diamond price segments.
In summary, we show that the DE has a substantial impact on the studied retailer’s profitability. Further, we find that the retailer could potentially gain additional profits from the DE when 1) there are more decoys in the market, 2) the within-grade price variations increase, and 3) the dominance detection probabilities either increase or decrease across different price segments. Having a better understanding of the profit impact could potentially enable the retailer to find profitable avenues by effectively utilizing the DE in its marketing and operations activities. One interesting observation is that the retailer has recently included a new section of comparable alternatives on the details pages of a selected list of diamonds, in which their decoys/dominants are displayed for some of the diamonds. This practice could likely change the chances of consumers detecting decoys and dominants.
8 Conclusions
In this research, we empirically validate the DE by using unique panel data from a leading online jewelry retailer. We estimate a proportional hazard model with embedded market-level decoy–dominant detection probabilities and the sales boost upon dominant detection. Our proposed model is derived from individual consumer primitives. We model daily sale likelihoods of diamonds as a function of their 4Cs, daily demand fluctuations, competitive effects from other diamonds, and the observed decoy–dominant structure. We find that, in general, the market-level probability of detecting a diamond as a decoy or dominant is quite low, especially in the medium- and high-price diamond segments. However, once a diamond is detected as a dominant, its sale hazard increases quite substantially (1.8 to 3.2 times). Thus, we empirically validate the existence of the DE in a real product market and show that the effect is not merely an experimental artifact as argued in recent studies. Model comparisons reveal that not controlling for decoy–dominant detection yields biased results regarding the DE. The estimates are robust under various alternative specifications.
In addition, we contribute to the substantive issue of measuring the overall profit impact, i.e., the managerial significance, of the DE. We quantify the overall profit impact of the DE using model estimates and find that it improves the retailer’s gross profit by 14.3%. We also explore how sensitive this profit impact is with respect to changes in the dominance structure, and conclude that the profit impact could potentially increase with more decoys, a larger within-grade price variation, and changes in the market-level dominance detection probabilities.
Our study is the first empirical attempt to quantify the widely documented DE in the consumer behavior literature. It is exciting to apply the well-developed context-dependent choice theory to real-life data and empirically quantify the managerial implications. Accordingly, we would like to note that the studied online diamond market is a unique setting to test the DE since diamonds have vertical attributes, and decoys and dominants are widely observed. That being said, we believe it is possible to find other market settings that also allow the DE to be tested. For example, it is common for multiple sellers to carry different price stickers when selling identical products with the same shipping/return policies and warranties in e-commerce platforms such as Amazon. Thus, the more expensive alternatives serve the role of decoys. Similarly, in used-goods platforms such as eBay, it is often possible to observe products with worse conditions from low-rating sellers to be more expensive than some of the better ones from top sellers. To substantiate the DE, the retail platforms (such as Amazon and eBay) can make decoy–dominant detection easier (or more difficult) by either changing the order of seller listings or recommending the products of some sellers. That being said, such practices may have practical limitations for platform designers due to creating concerns about the equality of sellers and discouraging the participation of smaller sellers on the platforms. In addition to above-mentioned within-product price variations on retail platforms, some producers price their product bundles to be the same as their main product to make the bundle more attractive. For example, Dyson offers the V11 Torque Drive Cord-Free Vacuum with extra Extension Hose at the same price as the Vacuum itself. Last, as opposed to these strict decoy examples, retailers may position some products as near decoys. For example, the price difference between iPhone X 512GB and 256GB is the same as the difference between the 256GB and 64GB, making upgrading to 512GB more attractive to some consumers. In the future, quantifying the DE across various product categories can potentially help researchers to better understand the DE’s limits and boundaries.
We believe there are multiple dimensions that can be pursued to extend understanding of the DE in future research. One direction is to investigate how consumers learn and respond to decoy–dominant relations in their search process. We face significant modeling challenges in the current context given that we only observe pricing and sales information. Future research could potentially address such issues when consumer-level search data are available. Another direction is to jointly model demand for and supply of diamonds. We are less concerned with the diamond suppliers’ optimal pricing decisions in our application since our focus is the DE on the demand side. Modeling the suppliers’ pricing decisions under the DE might be an avenue for future study if such supplier-level information is observed. A third direction is to consider the DE across multiple competitors because consumers might search for products from different retailers. In our setting, since the retailer captures about 50% of the market share in the U.S., this issue was not a major concern. It might be worth pursuing in a different product market when data of multiple retail outlets are available. Finally, as Bell and Lattin (2000) studied the impact of price response heterogeneity in quantifying loss aversion, it is worthwhile to investigate the same factor for the DE. We partially achieve this by allowing the DE to differ across diamond price segments. When consumer-level panel data are available, the heterogeneity in both price responses and the DE could be better formulated in the future.
References
Bell, David R, James M Lattin. 2000. Looking for loss aversion in scanner panel data: The confounding effect of price response heterogeneity. Marketing Science 19(2) 185–200.
Bettman, James R, Mary Frances Luce, John W Payne. 1998. Constructive consumer choice processes. Journal of Consumer Research 25(3) 187–217.
Biswas, Abhijit, Edward A Blair. 1991. Contextual effects of reference prices in retail advertisements. Journal of Marketing 55(3) 1–12.
Burdett, Kenneth, Kenneth L Judd. 1983. Equilibrium price dispersion. Econometrica: Journal of the Econometric Society 955–969. [[NEED VOLUME NUMBER]]
Chiang, Jeongwen, Siddhartha Chib, Chakravarthi Narasimhan. 1998. Markov chain Monte Carlo and models of consideration set and parameter heterogeneity. Journal of Econometrics 89(1-2) 223–248.
De los Santos, Babur, Ali Hortaçsu, Matthijs R Wildenbeest. 2012. Testing models of consumer search using data on web browsing and purchasing behavior. American Economic Review 102(6) 2955–2980.
Frederick, Shane, Leonard Lee, Ernest Baskin. 2014. The limits of attraction. Journal of Marketing Research 51(4) 487–507.
Guo, Liang, Wenbo Wang. 2016. Compromise effects explained by contextual deliberation reveal the rationality in preference construction. Working paper, Chinese University of Hong Kong.
Hardie, Bruce GS, Eric J Johnson, Peter S Fader. 1993. Modeling loss aversion and reference dependence effects on brand choice. Marketing Science 12(4) 378–394.
Heath, T B, S Chatterjee, K R France. 1995. Mental accounting and changes in price: The frame dependence of reference dependence. Journal of Consumer Research 22 90–97.
Hong, Han, Matthew Shum. 2006. Using price distributions to estimate search costs. The RAND Journal of Economics 37(2) 257–275.
Honka, Elisabeth. 2014. Quantifying search and switching costs in the US auto insurance industry. The RAND Journal of Economics 45(4) 847–884.
Honka, Elisabeth, Pradeep Chintagunta. 2016. Simultaneous or sequential? Search strategies in the US auto insurance industry. Marketing Science 36(1) 21–42.
Huber, J, J W Payne, C Puto. 1982. Adding asymmetrically dominated alternatives: Violations of regularity and the similarity hypothesis. Journal of Consumer Research 9(1) 90–98.
Huber, J, C Puto. 1983. Market boundaries and product choice: Illustrating attraction and substitution effects. Journal of Consumer Research 10 31–44.
Huber, Joel, John W Payne, Christopher P Puto. 2014. Let’s be honest about the attraction effect. Journal of Marketing Research 51(4) 520–525.
Kahneman, Daniel. 1979. Prospect theory: An analysis of decisions under risk. Econometrica 47 278.
Kalyanaram, Gurumurthy, Russell S Winer. 1995. Empirical generalizations from reference price research. Marketing Science 14(3_supplement) G161–G169.
Kamakura, Wagner A, Rajendra K Srivastava. 1984. Predicting choice shares under conditions of brand interdependence. Journal of Marketing Research 420–434. [[NEED VOLUME NUMBER]]
Khan, Uzma, Meng Zhu, Ajay Kalra. 2011. When trade-offs matter: The effect of choice construal on context effects. Journal of Marketing Research 48(1) 62–71.
Kivetz, Ran, Oded Netzer, V Srinivasan. 2004. Alternative models for capturing the compromise effect. Journal of Marketing Research 41(3) 237–257.
Lancaster, Tony. 1979. Econometric methods for the duration of unemployment. Econometrica: Journal of the Econometric Society 939–956. [[NEED VOLUME NUMBER]]
Lehmann, Donald R, Yigang Pan. 1994. Context effects, new brand entry, and consideration sets. Journal of Marketing Research 31(3) 364.
Luce, R Duncan. 1959. Individual Choice Behavior. John Wiley & Sons, Inc, New York.
Luce, R Duncan. 1977. The choice axiom after twenty years. Journal of Mathematical Psychology 15 215–233.
Lynch Jr, John G, Dipankar Chakravarti, Anusree Mitra. 1991. Contrast effects in consumer judgments: Changes in mental representations or in the anchoring of rating scales? Journal of Consumer Research 18(3) 284–297.
Mazumdar, Tridib, Sevilimedu P Raj, Indrajit Sinha. 2005. Reference price research: Review and propositions. Journal of Marketing 69(4) 84–102.
Monroe, Kent B. 1973. Buyers’ subjective perceptions of price. Journal of Marketing Research 10(1) 70–80.
Morewedge, Carey K, Meng Zhu, Eva C Buechel, Andrea C Morales. 2018. Hedonic contrast effects are larger when comparisons are social. Journal of Consumer Research. [[NEED VOLUME AND PAGE NUMBERS]]
Müller, Holger, Victor Schliwa, Sebastian Lehmann. 2014. Prize decoys at work—New experimental evidence for asymmetric dominance effects in choices on prizes in competitions. International Journal of Research in Marketing 31(4) 457–460.
Orhun, A Yesim. 2009. Optimal product line design when consumers exhibit choice set-dependent preferences. Marketing Science 28(5) 868–886.
Rajendran, Kharan N, Gerard J Tellis. 1994. Contextual and temporal components of reference price. Journal of Marketing 58(1) 22–34.
Ratneshwar, S, A D Shocker, D W Stewart. 1987. Toward understanding the attraction effect: The implications of product stimulus meaningfulness and familiarity. Journal of Consumer Research 13 520–533.
Rooderkerk, Robert P, Harald J Van Heerde, Tammo HA Bijmolt. 2011. Incorporating context effects into a choice model. Journal of Marketing Research 48(4) 767–780.
Royle, Jone R, David J. O’Connor, Gareth M. Reynolds, Paul A. Bottomley. 1999. The robustness of the asymmetrically dominated effect: Buying frames, phantom alternative, and in-store purchases. Psychology and Marketing 16(3) 225–243.
Siddarth, S, Randolph E Bucklin, Donald G Morrison. 1995. Making the cut: Modeling and analyzing choice set restriction in scanner panel data. Journal of Marketing Research 32(3) 255–266.
Simonson, I. 1989. Choice based on reasons: The case of attraction and compromise effects. Journal of Consumer Research 16 158–174.
Simonson, Itamar. 2014. Vices and virtues of misguided replications: The case of asymmetric dominance. Journal of Marketing Research 51(4) 514–519.
Tversky, A. 1972. Elimination by aspects: A theory of choice. Psychological Review 79 281–299.
Van Nierop, Erjen, Bart Bronnenberg, Richard Paap, Michel Wedel, Philip Hans Franses. 2010. Retrieving unobserved consideration sets from household panel data. Journal of Marketing Research 47(1) 63–74.
Wedell, Douglas H. 1991. Distinguishing among models of contextually induced preference reversals. Journal of Experimental Psychology: Learning, Memory, and Cognition 17(4) 767–778.
Yang, Sybil, Michael Lynn. 2014. More evidence challenging the robustness and usefulness of the attraction effect. Journal of Marketing Research 51(4) 508–513.
Zhang, Xing, Tat Y Chan, Ying Xie. 2017. Price search and periodic price discounts. Management Science 64(2) 495–510.
Web Appendices
A A Statistical Test for the Source of the Observed Price Variation
In this web appendix, we develop a statistical test to understand whether the observed price variations in the data can be solely explained by consumer search, or if they are suggestive of both consumer search and the DE.
Burdett and Judd (1983) proved that when consumers search for price, even for homogeneous products, price variations can arise in equilibrium. The intuition is that because of consumer search cost, consumers may not discover all options; and as a result, high-priced options may be purchased by some consumers. Further, in the mixed-strategy pricing equilibrium, each observed price point generates the same expected profit. Hong and Shum (2006) used this idea of mixed-strategy pricing equilibrium to recover consumer search-cost distribution purely from observed price variations for textbooks—a typical homogeneous good. We borrow this idea and use it to conduct our statistical test. For diamonds with identical 4Cs (i.e., the same grade), denoted by the set J, we observe two types of price variations: 1) across diamonds in the set J on day t, and 2) within the same diamond
(identified by SKU) over time. We use pjt to denote the price of diamond j on day t. We use Phj(t|pjt) to denote the sale response function conditional on price. Because the retailer has a fixed margin (1−r = 18%) in our market setting, the wholesale price of suppliers becomes wjt = r ×pjt. We denote the marginal cost for suppliers as cjt.
If there is no DE in the sale response function Phj(t|pjt) (i.e., the observed price variation is driven solely by consumer search), for a supplier, setting prices either through maximizing the total expected profit from a set of diamonds or through maximizing the expected profit for each individual diamond (in the corresponding set) will yield the same set of prices and ultimately the same total profit. However, when the DE exists along with consumer search, this no longer holds since decoys boost their dominants’ demand, i.e., decoys bring positive externality to the profitability of their dominants. Therefore, we predict the expected profit from dominants to be higher than that of decoys. In other words, if the DE exists together with consumer search, each price point in the support of the observed price distribution would no longer yield the same expected profit, i.e., dominants will have higher expected profits compared to decoys. The supplier’s expected profit from diamond j on day t is:
πjt = (r × pjt − cjt)Phj(t|pjt). (A.1)
Under the assumption of no DE, assume that suppliers have priced optimally, in which case, the following first-order condition should hold:
. Note that this optimality condition holds for any price observed in the marketplace, because under mixed pricing equilibrium, each price point generates the same optimal profit. From this optimality condition, we can invert the following marginal cost:
, (A.2)
where
is the price elasticity at price pjt.
However, when there is DE, this relationship is no longer true, because decoys serve as “loss leaders” and generate less expected profits than their dominants. Consequently, for high-priced decoys, the recovered costs under the optimality condition without the DE will be higher than the true costs; while the opposite is true for low-priced dominants. Therefore, using Equation (A.2) leads us to a relationship where the calculated cost cjt increases with the observed price pjt for identical diamonds. We use this idea in our proposed statistical tests. In the first test (labeled as Test I), we assume that the suppliers’ marginal costs of diamonds with identical attributes are the same, i.e., cjt = c,∀j ∈ J,∀t. It might be a reasonable assumption in this particular industry, because diamonds are supplied globally by a few dominant manufacturers. Further, this assumption was also used by both Burdett and Judd (1983) and Hong and Shum (2006). To conduct our test, we proceed with the following steps:
Use a proportional hazard model to fit the sale response function Phj(t|pjt) with polynomials of pjt (we use linear, quadratic, and cubic forms), diamond characteristics, and daily demand controls to control for focal and over-time demand effects.
Invert the implied cost ccjt using Equation (A.2) for each observed price point under our null hypothesis that observed price variation is driven by consumer search only.
Regress ccjt over the relative price index rpjt = (pjt − pJt)/pJt (pJt is the average price of diamonds in J on day t) and other control variables such as diamond characteristics, daily demand effects, and grade average prices.
If the estimated coefficient for rpjt is insignificant, then the test favors the null hypothesis that price dispersion could be explained based on consumer search alone; if the coefficient is positive and significant, we would have the statistical support to reject the null hypothesis, and the results would be consistent with the price variations being driven by the DE along with consumer search. In our second test (labeled as Test II), we relax the cost assumption (cjt = c,∀j ∈ J,∀t) from Test I. Instead, we impose the following assumption: For the same diamond j, the cost for the supplier would be the same over time, i.e., cjt = c,∀t. In other words, diamonds in the same grade might have different costs, but this cost is time-invariant. We use the within-diamond over time price variation to test our hypothesis. The test follows the same steps as in Test I, except that in Step 3, we run the regressions using diamond-level (i.e., SKU-level) fixed effects as controls and test whether the coefficient for rpjt is significant. We repeat Test I and II for each of the three diamond price segments (low-, medium-, and high-price) and report the results in Table A1. Both Test I and II reject the null hypothesis (i.e., the price variation arises solely from consumer search) and support the DE coexisting with consumer search.
Table A1: Test of the Source of the Observed Price Variation
Variable 2K–5K 5K–10K 10K–20K
Test I
Controls
4Cs, daily demand effects, pJt
rpjt
2.172∗∗(0.001)
5.401∗∗(0.002)
12.721∗∗(0.003)
Adj. R-squared
Test II
0.987
0.995
0.996
Controls
diamond fixed effects
rpjt
1.178∗∗(0.028)
1.917∗∗(0.052)
1.143∗∗(0.167)
Adj. R-squared
0.943
0.960
0.945
Note: Estimates with ∗∗ and ∗ are significant at the 0.05 and 0.10 levels, respectively. Dependent variable—estimated cost—is in 1000 dollars. In Test II, we randomly sample 500 diamonds in each price segment.
B Derivation of Diamond-Level Sale Hazard from Consumer Primitives
In this web appendix, we derive our diamond-level proportional hazard model (Equation 1 in the manuscript) from consumer primitives including consumer arrival process, search, consideration set formation, and conditional choice probabilities. We further discuss 1) how we embed the DE in consumers’ conditional choice probabilities, and 2) how our specification serves as a test for the DE.
B.1 Individual Primitives and Continuous-Time Diamond Hazard
We assume that potential diamond consumers arrive at random times to the retailer’s website. In each specific time τ (can be a millisecond), we assume that at most one consumer would be making a diamond purchase decision. A representative consumer i, arriving at time τi, searches the retailer’s website to form her consideration set and then decides whether to purchase one of the diamonds from that set that maximizes her utility or not to purchase any diamonds. In terms of search, we assume independence across diamonds—that is, the probability that a particular diamond is included is independent of any other diamonds being included or not.[9] We denote the consumer i’s consideration set as Mi, and the super set containing all the possible consideration sets as M.
We define the conditional choice probability of consumer i choosing a particular diamond j from her consideration set, given j has not been sold before τi, as si(j|Mi) (to be discussed in the following subsection in detail). The expected sale probability of diamond j at time τi is thus the sum of the choice probabilities over all possible consideration sets:
ωj(τi) = X Pr(Mi) × si(j|Mi). (B.1)
Mi∈M
Note that this consumer and diamond specific choice probability is equal to the sale hazard of diamond j at time τi. This is the case because it is the conditional probability that diamond j will be sold at τi conditional on it not being sold until τi, and the consumer i is the only consumer deciding whether j would be purchased at this particular time.
B.2 Embedding the DE into the Continuous-Time Diamond Hazard
For a specific diamond j, we can classify diamonds into three mutually exclusive and collectively exhaustive types based on their relationships to j. The set of Cjo contains diamonds that are neither dominants nor decoys to j; the set CjDominant contains all diamonds that are dominating j; and finally, the set CjDecoy represents the collection of j’s decoys. Denote as Mijo the set of diamonds from Cjo that are in consumer i’s consideration set, and similarly MijDominant and MijDecoy the sets of diamonds (in i’s consideration set) that are from j’s dominants set (CjDominant) and decoys set (CjDecoy). Denote Moj, MDominantj , and MDecoyj as the super sets of Mijo, MijDominant, and MijDecoy, respectively. We now can express consumer i’s consideration set Mi as a combination of j, Mijo , MijDominant , and MijDecoy and define the choice probability si(j|Mi) accordingly:[10]
s0i,(j|j ∪ Mijo ),
si(j|Mi) =
0si,(j|j ∪ Mijo ∪ MijDecoy),
if j /∈ Mi if j ∈ Mi & MijDominant = ∅ & MijDecoy = ∅ if j ∈ Mi & MijDominant 6= ∅ if j ∈ Mi & MijDecoy 6= ∅ & MijDominant = ∅.
(B.2)
In the above equation, j has a choice probability of zero in the first case simply because it is not in the consideration set. The choice probability is also zero in the third case, because we assume that consumers are rational, and once a dominant is included in the choice set, the inferior decoy diamond j will never be purchased. Also, notice that in the last case, the consumer will only choose an option from the subset, not from MijDecoy, because diamonds in MijDecoy are inferior to option j and a rational consumer would not buy those decoys. In other words, the effective choice set becomes the same as the second case. If there is no DE, si(j|j ∪ Mijo ) = si(j|j ∪ Mijo ∪ MijDecoy); however, if there is DE, we would expect the presence of diamonds from MijDecoy to increase the attractiveness and thus the choice probability of diamond j. To capture that demand boost due to the DE, we denote the relationship between the two conditional choice probabilities as follows.
si(j|j ∪ Mijo ∪ MijDecoy) = si(j|j ∪ Mijo ) × qi(j,Mijo ,MijDecoy), (B.3)
where qi(j,Mijo ,MijDecoy) is a scalar that affects the choice probability of option j and is a function of dominance relationships in the choice set. Note that qi(j,Mijo ,MijDecoy) = 1 if there is no DE; qi(j,Mijo ,MijDecoy) > 1 if there is DE. Given Equations (B.2) and (B.3), we next aggregate the choice probability over the possible consideration sets to derive the continuous-time diamond hazard from Equation (B.1) as follows: ωj(τi) = X Pr(Mi) × si(j|Mi)
Mi∈M
= X Pr(Mi = j ∪ Mijo )si(j|j ∪ Mijo )+
Mijo ∈Moj
X X Pr(Mi = j o ∪ MijDecoy)si(j|j ∪ Mijo ∪ MijDecoy) ∪ Mij
Mijo ∈Moj MijDecoy∈MDecoyj
= X Pr(Mi = j ∪ Mijo )si(j|j ∪ Mijo )+
Mijo ∈Moj
X X Pr(M = j ∪ Mo ∪ MijDecoy)si(j|j ∪ Mijo )qi(j,Mijo ,MijDecoy) . (B.4)
i ij
Mijo ∈Moj MijDecoy∈MDecoyj
=Pr(j ∈ Mi)Pr(MijDecoy = ∅)Pr(MijDominant = ∅) X Pr(Mijo ∈ Mi)si(j|j ∪ Mijo )+
Mijo ∈Moj
Pr(j ∈ Mi)Pr(MijDominant = ∅) X Pr(Mijo ∈ Mi)si(j|j ∪ Mijo )×
Mijo ∈Moj
X Decoy ∈ Mi)qi(j,Mijo ,MijDecoy) Pr(Mij
MijDecoy∈MDecoyj ,MijDecoy6=∅
In the above derivation, we first use Equation (B.2) (i.e., two non-zero choice probabilities on the second and fourth lines) to obtain the second line of Equation (B.4). Next, we use Equation (B.3) to move from the second to the third line of Equation (B.4). We then use the independence assumption to move from the third to the fourth line of Equation (B.4). Next, we define the following:
si(j|j,Moj) = X Pr(Mijo ∈ Mi)si(j|j ∪ Mijo ). (B.5)
Mijo ∈Moj
Given si(j|j,Moj) from Equation (B.5), we then define the following: qi(j,Moj,MDecoyj ) = X Pr(Mijo ∈ Mi)si(j|j ∪ Mijo )qi(j,Mijo ,MjDecoy )/si(j|j,Moj), (B.6)
Mijo ∈Moj
where 
/Pr(MijDecoy 6= ∅). Plugging si(j|j,Moj) and
from Equation (B.5) and (B.6) into
(B.4), and rearranging the terms, we can simplify Equation (B.4) as follows:
ωj(τi) = X Pr(Mi) × si(j|Mi)
Mi∈M
=Pr(j ∈ Mi)Pr(MijDecoy = ∅)Pr(MijDominant = ∅)si(j|j,Moj)+
Pr(j ∈ Mi)Pr(MijDecoy 6= ∅)Pr(MijDominant = ∅)si(j|j,Moj)qi(j,Moj,MDecoyj ) (B.7)
=Pr(j ∈ Mi)si(j|j,Moj)×
Pr(MijDominant = ∅)hPr(MijDecoy = ∅) + Pr(MijDecoy 6= ∅)qi(j,Mjo,MjDecoy )i.
Note that
is the average of qi(j,Mjo,MjDecoy) weighted by the probability of having MijDecoy; and qi(j|Moij,MDecoyj ) is the average of
weighted by the probability of having Mijo . Thus, the final qi(j,Moij,MDecoyj ) is the average of all the qi(j|.) weighted by the probability of all the possible combinations of MijDecoy and Mijo .
Equation (B.7) tells us that the aggregate sale hazard in the continuous time can be decomposed to the sum of two components: 1) the probability that no decoy–dominant relationships are included in the consideration set times the aggregate choice probability over all no dominants–no decoys sets (i.e., the baseline choice probability); and 2) the probability that a diamond’s decoys but not dominants are included in the consideration set times the baseline choice probability multiplied by an additional aggregate term q that depends on the decoy–dominant structure. From the DE theory, we know that for each specific consideration set, qi(j,Mijo ,MjDecoy) ≥ 1 , i.e., it cannot be the case that adding dominated options to the choice set would reduce the choice share of a dominant. When this q function aggregates to the market level, it is a weighted average of all the consideration set-level qs. Therefore, the aggregate qi(j,Moij,MDecoyj ) ≥ 1. As qi(j,Mijo ,MjDecoy) ≥ 1 captures the potential DE at the individual-choice level, testing whether qi(j,Moij,MDecoyj ) ≥ 1 equals testing whether on average the DE exists in individual choices, i.e., whether Eqi(j,Mijo ,MjDecoy) ≥ 1.
B.3 Daily Diamond Hazard
We now derive the aggregate-level sale hazard for diamond j at a discrete time, i.e., day t. We assume nt potential consumers arrive randomly during day t, and each consumer can be represented by consumer i[11]. By definition in survival analysis, we know the survival function for diamond j at the end of day t, thus Sj(t) is defined as:
Sj(t) = e−Hj(t), (B.8)
where
is the cumulative hazard function.
We use Phj(t) to denote the hazard in the discrete time for diamond j on day t:
(B.9)
where hj(t) = ntωj(τi), and the corresponding survival function is Sj(t) = Πtk=1e−hj(k).
Functional Form Specification
Based on our derivation of ωj(τi) in Equation (B.7), we now can write hj(t) in Equation (B.9) as follows:
hj(t) =ntωj(τi)
=ntPr(j ∈ Mi)si(j|j,Moj)× (B.10)
Pr(MijDominant = ∅)hPr(MijDecoy = ∅) + Pr(MijDecoy 6= ∅)qi(j,Moj,MjDecoy )i.
In the above equation, the ntPr(j ∈ Mi)si(j|j,Moj) component represents the daily hazard for diamond j. The second component, Pr(MijDominant = ∅)[Pr(MijDecoy = ∅) + Pr(MijDecoy 6=
, represents the combined effect of the probability of diamond dominance de-
tection[12] and the boost in sales upon dominant detection. Therefore, we operationalize the hazard hj(t) at a discrete time in Equation (B.10) as follows:
hj(t) = ψj(·)φj(·). (B.11)
Equation (B.11) is the same as Equation (1) in our manuscript. This completes our derivation of the diamond-specific sale hazard function based on individual consumer primitives.
C. Log-Hazard Derivation
Based on our model specifications in Equations (2) and (6), we can derive the log-hazard specification
as:

=−I(Decoy)ln(1 + eDjtDecoyγDecoy)
| {z }
daily diamond sale hazard dominance hazard of a decoy
+ I(Dominant)hln(1 + eDjtDominant(γDominant+γBoost)) − ln(1 + eDjtDominantγDominant)i,
| {z } dominance hazard of a dominant
(C.1)
where DjtDecoy = {Kj,lnRjtDominant,I(rpjt > 0)rpjt,spjt}, DjtDominant = {Kj,lnRjtDecoys,I(rpjt < 0)(−rpjt),spjt} and I(Decoy) and I(Dominant) are indicators for a diamond with dominants and
decoys, respectively.
Last updated
Was this helpful?