Endpoint Unlabeled Efficient Bayesian Design
Posted: Mon Oct 02, 2017 10:51 am
Dear All,
I am constructing choice sets for an unlabeled experiment. I have 5 attributes each with two levels.
I conducted a pilot study using an optimal design. The pilot had two alternatives and a no-choice option. The total choice sets were 16 divided into 2 blocks. So, every participant evaluated 8 choice sets.
I want to use the priors to construct a Bayesian efficient design. However, I am having trouble with some issues:
1-When I use 3 alternatives and a no-choice, with ;row= 16 Ngene cannot find a design.
2-When I use 2 alternatives and a no-choice, with ;row=16 Ngene produces a design that looks great. But as illustrated by Sándor and Wedel (2002), having 3 or 4 alternatives per choice set is far more accurate in estimating panel-Mixed-logit.
3-The only time that Ngene is able to find a design with 3 alternatives and a no-choice option is when I reduce the total number of choice sets to ;row =8. I do not know why Ngene is limiting me to just 8 choice sets for the whole experiment (I tried 12 and 10 rows but did not work). I would like to increase the number of choice sets to at least 16. I am planning to use Latent Class analysis and panel-Mixed-logit and I would like to have more combinations to be evaluated. My problem is that my sampling frame limits the participants to a small sub-sample (with restrictive inclusion criteria), where it is hard to obtain more than 300 participants.
3-a. What is the best way to deal with this issue?
3.b. Do more choice sets add to the efficiency of the design and compensate for fewer participants?
3.c. Is it better to use 16 choice sets in 2 blocks, where each choice set has 2 alternatives and no-choice option, or to use 8 choice sets in total where each choice set has 3 alternatives and a no-choice option?
4-The syntax below is what worked with Ngene using 8 choices with 3 alternatives, and 2 choice sets with 2 alternatives. As advised elsewhere in the forum, one could do an MNL Bayesian efficient design and optimize it to a RP-Panel design. How do I compare the efficiency between the two? If:
First Option: 16 choice sets, 2 blocks of 8 choices, each choice set has 2 alternatives and a no-choice
(MNL: D-error=0.31, S-estimate=179) and (RP-Panel: D-error=0.89, S-estimate=390) and (D-optimality=93.7%) although the design looks good, while iterating, Ngene shows “ERROR: Aborting the run. After approximately 10 minutes, an initial random design was not found”
Second Option: 8 choice sets in total, each choice set has 3 alternatives and a no-choice option
(MNL: D-error=0.6, S-estimate=374) and (RP-Panel: D-error= 1.46, S-estimate=686) and (D-optimality=98.5%)
Can I conclude from this that I can proceed with the design and can estimate taste heterogeneity using Latent Class and RP-Panel-Mixed-logit? What option of the two should I proceed with?
?The model that did not work
Design
;alts (model1)= alt1*, alt2*, alt3*, alt4
;alts (model2)= alt1*, alt2*, alt3*, alt4
;rows=16
;block=2
;eff = model1(mnl,d,mean)
;rdraws= gauss(3)
;bdraws= gauss(3)
;rep= 1000
;model(model1):
U(alt1) = x1[(n,-0.10,0.04)]*A1[0,1] + x2[(n,0.04,0.04)]*A2[0,1] + x3[(n,0.23,0.04)]*A3[0,1] + x4[(n,0.12,0.04)]*A4[0,1] + x5[(n,0.16,0.04)]*A5[0,1]/
U(alt2) = x1*A1 + x2*A2 + x3*A3 + x4*A4 + x5*A5/
U(alt3) = x1*A1 + x2*A2 + x3*A3 + x4*A4 + x5*A5/
U(alt4) = None[-1.8]
;model(model2):
U(alt1) = x1[n,-0.17,0.45]*A1[0,1] + x2[n,0.08,0.61]*A2[0,1] + x3[n,0.46,0.6]*A3[0,1] + x4[n,0.23,0.6]*A4[0,1] + x5[n,0.33,0.88]*A5[0,1]/
U(alt2) = x1*A1 + x2*A2 + x3*A3 + x4*A4 + x5*A5/
U(alt3) = x1*A1 + x2*A2 + x3*A3 + x4*A4 + x5*A5/
U(alt4) = None[-1.2]
$
This syntax shows this message:
“A valid initial random design could not be generated after approximately 10 seconds. In this time, of the 224299 attempts made, there were 0 row repetitions, 25729 alternative repetitions, and 198570 cases of dominance. There are a number of possible causes for this, including the specification of too many constraints, not having enough attributes or attribute levels for the number of rows required, and the use of too many scenario attributes. A design may yet be found, and the search will continue for 10 minutes. Alternatively, you can stop the run and alter the syntax.”
Any feedback you provide is greatly appreciated.
Sincerely,
Hamad
I am constructing choice sets for an unlabeled experiment. I have 5 attributes each with two levels.
I conducted a pilot study using an optimal design. The pilot had two alternatives and a no-choice option. The total choice sets were 16 divided into 2 blocks. So, every participant evaluated 8 choice sets.
I want to use the priors to construct a Bayesian efficient design. However, I am having trouble with some issues:
1-When I use 3 alternatives and a no-choice, with ;row= 16 Ngene cannot find a design.
2-When I use 2 alternatives and a no-choice, with ;row=16 Ngene produces a design that looks great. But as illustrated by Sándor and Wedel (2002), having 3 or 4 alternatives per choice set is far more accurate in estimating panel-Mixed-logit.
3-The only time that Ngene is able to find a design with 3 alternatives and a no-choice option is when I reduce the total number of choice sets to ;row =8. I do not know why Ngene is limiting me to just 8 choice sets for the whole experiment (I tried 12 and 10 rows but did not work). I would like to increase the number of choice sets to at least 16. I am planning to use Latent Class analysis and panel-Mixed-logit and I would like to have more combinations to be evaluated. My problem is that my sampling frame limits the participants to a small sub-sample (with restrictive inclusion criteria), where it is hard to obtain more than 300 participants.
3-a. What is the best way to deal with this issue?
3.b. Do more choice sets add to the efficiency of the design and compensate for fewer participants?
3.c. Is it better to use 16 choice sets in 2 blocks, where each choice set has 2 alternatives and no-choice option, or to use 8 choice sets in total where each choice set has 3 alternatives and a no-choice option?
4-The syntax below is what worked with Ngene using 8 choices with 3 alternatives, and 2 choice sets with 2 alternatives. As advised elsewhere in the forum, one could do an MNL Bayesian efficient design and optimize it to a RP-Panel design. How do I compare the efficiency between the two? If:
First Option: 16 choice sets, 2 blocks of 8 choices, each choice set has 2 alternatives and a no-choice
(MNL: D-error=0.31, S-estimate=179) and (RP-Panel: D-error=0.89, S-estimate=390) and (D-optimality=93.7%) although the design looks good, while iterating, Ngene shows “ERROR: Aborting the run. After approximately 10 minutes, an initial random design was not found”
Second Option: 8 choice sets in total, each choice set has 3 alternatives and a no-choice option
(MNL: D-error=0.6, S-estimate=374) and (RP-Panel: D-error= 1.46, S-estimate=686) and (D-optimality=98.5%)
Can I conclude from this that I can proceed with the design and can estimate taste heterogeneity using Latent Class and RP-Panel-Mixed-logit? What option of the two should I proceed with?
?The model that did not work
Design
;alts (model1)= alt1*, alt2*, alt3*, alt4
;alts (model2)= alt1*, alt2*, alt3*, alt4
;rows=16
;block=2
;eff = model1(mnl,d,mean)
;rdraws= gauss(3)
;bdraws= gauss(3)
;rep= 1000
;model(model1):
U(alt1) = x1[(n,-0.10,0.04)]*A1[0,1] + x2[(n,0.04,0.04)]*A2[0,1] + x3[(n,0.23,0.04)]*A3[0,1] + x4[(n,0.12,0.04)]*A4[0,1] + x5[(n,0.16,0.04)]*A5[0,1]/
U(alt2) = x1*A1 + x2*A2 + x3*A3 + x4*A4 + x5*A5/
U(alt3) = x1*A1 + x2*A2 + x3*A3 + x4*A4 + x5*A5/
U(alt4) = None[-1.8]
;model(model2):
U(alt1) = x1[n,-0.17,0.45]*A1[0,1] + x2[n,0.08,0.61]*A2[0,1] + x3[n,0.46,0.6]*A3[0,1] + x4[n,0.23,0.6]*A4[0,1] + x5[n,0.33,0.88]*A5[0,1]/
U(alt2) = x1*A1 + x2*A2 + x3*A3 + x4*A4 + x5*A5/
U(alt3) = x1*A1 + x2*A2 + x3*A3 + x4*A4 + x5*A5/
U(alt4) = None[-1.2]
$
This syntax shows this message:
“A valid initial random design could not be generated after approximately 10 seconds. In this time, of the 224299 attempts made, there were 0 row repetitions, 25729 alternative repetitions, and 198570 cases of dominance. There are a number of possible causes for this, including the specification of too many constraints, not having enough attributes or attribute levels for the number of rows required, and the use of too many scenario attributes. A design may yet be found, and the search will continue for 10 minutes. Alternatively, you can stop the run and alter the syntax.”
Any feedback you provide is greatly appreciated.
Sincerely,
Hamad