Labelled vs Unlabelled design, constraints and interactions
Posted: Tue Jan 05, 2021 10:40 am
Dear Ngene team and all,
First, I seize the opportunity to wish you happy new year full of health and happiness.
I am beginner in Ngene and although I have been reading the software’s manual and following discussions in in the forum, I apologise in advance for any silly questions and mistakes.
I am working on the generation of an efficient design, where each choice task will have two alternatives plus an opt-out option. The attributes to be included in the CE are six, out of which one (price) is quantitative and the remaining 5 are qualitative. One attribute has 6 levels, three attributes have 3 levels and two attributes have 2 levels.
The problem I face is that the definition of an attribute and consequently its levels depend on another attribute and its levels (similar to transportation mode and travel time case, I believe). More precisely, say attribute M has two levels (m1, m2). Now, attribute S is defined slightly different when attribute M takes the value m1 or m2.
I am not sure whether the CE would be best to be labelled or unlabelled (in terms of Ngene design) and am working on generating both of them. I am aware of the pros and cons for each type of design.
In the unlabelled CE I treated S attribute as having 6 levels (3 levels when attribute M takes the value of m1 and 3 levels when attribute M takes the value of m2) and introduced constraints. In the labelled CE I treated S attribute as 2 separate attributes (WS and FS) specific for each alternative and so each of them has 3 levels and did not introduce constraints.
Is my reasoning correct or it is far away from Ngene logic? The two designs can be seen below.
Your help is invaluable.
Thank you in advance.
Michail
First, I seize the opportunity to wish you happy new year full of health and happiness.
I am beginner in Ngene and although I have been reading the software’s manual and following discussions in in the forum, I apologise in advance for any silly questions and mistakes.
I am working on the generation of an efficient design, where each choice task will have two alternatives plus an opt-out option. The attributes to be included in the CE are six, out of which one (price) is quantitative and the remaining 5 are qualitative. One attribute has 6 levels, three attributes have 3 levels and two attributes have 2 levels.
The problem I face is that the definition of an attribute and consequently its levels depend on another attribute and its levels (similar to transportation mode and travel time case, I believe). More precisely, say attribute M has two levels (m1, m2). Now, attribute S is defined slightly different when attribute M takes the value m1 or m2.
I am not sure whether the CE would be best to be labelled or unlabelled (in terms of Ngene design) and am working on generating both of them. I am aware of the pros and cons for each type of design.
In the unlabelled CE I treated S attribute as having 6 levels (3 levels when attribute M takes the value of m1 and 3 levels when attribute M takes the value of m2) and introduced constraints. In the labelled CE I treated S attribute as 2 separate attributes (WS and FS) specific for each alternative and so each of them has 3 levels and did not introduce constraints.
Is my reasoning correct or it is far away from Ngene logic? The two designs can be seen below.
- Code: Select all
? Labelled Experiment
Design
;alts = W, F, none
;rows = 36
;eff = (mnl, d)
;con:
;block = 6
;model:
U(W) = bw[0] +
b2[0] * C[1,2,3,4,5,6] +
b3.dummy[0] * N[1,2] +
b4.dummy[0|0] * R[1,2,3] +
b5.dummy[0|0] * T[1,2,3] +
b6.dummy[0|0] * WS[1,2,3] +
b8[0] * N.dummy[1]*T.dummy[1] +
b9[0] * N.dummy[1]*T.dummy[2] +
b10[0] * T.dummy[1]*WS.dummy[1] +
b11[0] * T.dummy[1]*WS.dummy[2] +
b12[0] * T.dummy[2]*WS.dummy[1] +
b13[0] * T.dummy[2]*WS.dummy[2] +
b14[0] * N.dummy[1]*WS.dummy[1] +
b15[0] * N.dummy[1]*WS.dummy[2] /
U(F) = bf[0] +
b2 * C[1,2,3,4,5,6] +
b3.dummy * N[1,2] +
b4.dummy * R[1,2,3] +
b5.dummy * T[1,2,3] +
b7.dummy[0|0] * FS[1,2,3] +
b8[0] * N.dummy[1]*T.dummy[1] +
b9[0] * N.dummy[1]*T.dummy[2] +
b16[0] * T.dummy[1]*FS.dummy[1] +
b17[0] * T.dummy[1]*FS.dummy[2] +
b18[0] * T.dummy[2]*FS.dummy[1] +
b19[0] * T.dummy[2]*FS.dummy[2] +
b20[0] * N.dummy[1]*FS.dummy[1] +
b21[0] * N.dummy[1]*FS.dummy[2]
$
- Code: Select all
? Unlabelled Experiment
Design
;alts = alt1*, alt2*, none
;rows = 48
;eff = (mnl, d)
;cond:
if(alt1.M=1, alt1.S=[1,2,3]),
if(alt1.M=2, alt1.S=[4,5,6]),
if(alt2.M=1, alt2.S=[1,2,3]),
if(alt2.M=2, alt2.S=[4,5,6]),
if(alt1.M=1 and alt2.M=1, alt1.S=[1,2,3] and alt2.S=[1,2,3]),
if(alt1.M=2 and alt2.M=2, alt1.S=[4,5,6] and alt2.S=[4,5,6])
;block = 6
;model:
U(alt1) = bA[0] +
b2[0] * C[1,2,3,4,5,6] +
b3.dummy[0] * N[1,2] +
b4.dummy[0|0] * R[1,2,3] +
b5.dummy[0] * M[1,2] +
b6.dummy[0|0] * T[1,2,3] +
b7.dummy[0|0|0|0|0] * S[1,2,3,4,5,6] +
b8[0] * T.dummy[1]*S.dummy[1] +
b9[0] * T.dummy[1]*S.dummy[2] +
b10[0] * T.dummy[1]*S.dummy[3] +
b11[0] * T.dummy[1]*S.dummy[4] +
b12[0] * T.dummy[1]*S.dummy[5] +
b13[0] * T.dummy[2]*S.dummy[1] +
b14[0] * T.dummy[2]*S.dummy[2] +
b15[0] * T.dummy[2]*S.dummy[3] +
b16[0] * T.dummy[2]*S.dummy[4] +
b17[0] * T.dummy[2]*S.dummy[5] +
b18[0] * N.dummy[1]*S.dummy[1] +
b19[0] * N.dummy[1]*S.dummy[2] +
b20[0] * N.dummy[1]*S.dummy[3] +
b21[0] * N.dummy[1]*S.dummy[4] +
b22[0] * N.dummy[1]*S.dummy[5] +
b23[0] * N.dummy[2]*S.dummy[1] +
b24[0] * N.dummy[2]*S.dummy[2] +
b25[0] * N.dummy[2]*S.dummy[3] +
b26[0] * N.dummy[2]*S.dummy[4] +
b27[0] * N.dummy[2]*S.dummy[5] +
b28[0] * N.dummy[1]*T.dummy[1] +
b29[0] * N.dummy[1]*T.dummy[2] /
U(alt2) = bB[0] +
b2 * C[1,2,3,4,5,6] +
b3.dummy * N[1,2] +
b4.dummy * R[1,2,3] +
b5.dummy * M[1,2] +
b6.dummy * T[1,2,3] +
b7.dummy * S[1,2,3,4,5,6]
$
- Would it be preferred to design a labelled CE, where respondents will have to make a choice always between the two different values of attribute M and where I could see how the variation in attributes alters the choice? Or an unlabelled CE, where I reckon people will be able to compare alternatives with the same value of attribute M, which does make sense in the context of the research? And what is the difference in Ngene coding?
- Is the number of rows I specified correct in both designs?
- Are the constraints set in the unlabelled CE too many (overlapping) or incorrect?
- When I hit “run”, the labelled design runs fine, but I am not sure if it has been specified correctly, because some indicators are not defined, while I am unclear how to interpret others (e.g. D-error=0.805, B estimate=100, S estimate=0, Sp estimates=undefined, Sp t-ratios:=0)
- When I hit run, the unlabelled CE runs. However, it gives me the following message and I press stop. What does it mean and what do I need to do?
Warning: Two alternatives were specified for alternative dominance checking, but do not have the same priors, and so will not be checked. 'alt1', 'alt2'
The conditional statement nesting cluster 1 contains 36 permissible combinations of attribute levels.
The nesting cluster contains the following if statements:
* if(alt1.m=1, alt1.s=[1,2,3])
* if(alt1.m=2, alt1.s=[4,5,6])
* if(alt1.m=1 and alt2.m=1, alt1.s=[1,2,3] and alt2.s=[1,2,3])
* if(alt2.m=1, alt2.s=[1,2,3])
* if(alt2.m=2, alt2.s=[4,5,6])
* if(alt1.m=2 and alt2.m=2, alt1.s=[4,5,6] and alt2.s=[4,5,6])
An attempt will be made to balance the frequency of each level in attributes affected by constraints, however complete balance might not be possible.
Note: Defaulting to assigning blocks with the 'minsum' method.
Warning: No valid design has been found after 1000 evaluations. There may be a problem with the specification of the design. A common problem is that the choice probabilities are too extreme (close to 1 and 0), perhaps because some or all of the prior values are too large. Also, it is generally a good idea to start with a simple design (MNL, non-Bayesian), then add complexity. If you press stop, a design will be reported, which may assist in diagnosing the problem. - Are the interaction effects specified in the two models correct? Should they be specified in all (except the opt-out) utility functions or only in one/some of them? If the latter, which one(s)? Does it make sense to include them all or I could include some of them?
- Does the mode of selection affect the Ngene design or it is something the researcher may decide to include or otherwise? For example, in the case of 2 alternatives plus the opt-out option, the mode of choice could be: (a) choose one only or (b) if opt-out has been chosen, but you had to make a choice between the two offerings what would you choose. Does (b) make sense in a CE where 2 alternatives plus the opt-out exist?
- After the design is generated, will Ngene generate the formatted scenarios that will include the opt-out or this has to be manually done?
- How can one decide (based on what) for the number of blocks?
- Is there any way to insert the qualitative attributes in the model, other than introducing dummies?
Your help is invaluable.
Thank you in advance.
Michail