Beginner: review a design and some questions
Posted: Mon Jul 03, 2023 7:54 am
Dear Moderators,
I am happy to join this forum of Ngene. I am trying to ensure I am going in the right direction. Let me describe:
Three alternatives
Alt1: product (unlabeled)
Alt2: product (unlabeled)
Alt3: neither (I would not choose option A and B)
Three attributes
Att1: a generic attribute measured by percentage, two levels, [20%, 30%]
Att2: a generic discrete attribute, five levels, [0, 1, 2, 3, 4]
Att3: price, three levels, [$20, $30, $40]
Research purposes and models
Att2 is about production practices and has five levels. Among the five levels, we have an interest in two levels especially. The two levels represent two different environmentally friendly practices (two eco-labels).
The main purpose is to estimate the marginal WTP for the two eco-labels. Specifically,
(1) Estimate the WTP for the two eco-labels, including means and more information about the WTP distribution
(2) Examine how WTP for the two eco-labels varies with some respondent characteristics (for example, income, gender, age)
(3) Predict potential market shares of eco-labeled products
For (1), I plan to use a random parameter logit (i.e., mixed logit) to examine the WTP distribution, considering heterogeneous preference.
For (2), I plan to consider interaction terms between eco-label dummies and some covariates.
Procedures
Step 1. For the pilot study,
Generate efficient design with zero priors.
Step 2: Run pilot, estimate for MNL.
Step 3. Actual survey
Use priors and SE's from the pilot to design a d-efficient design.
Use the same code as the pilot except for priors.
A request
Is the procedure reasonable in general? Do I miss something? Any feedback would be very appreciated. For information, I put some comments next to each code sentence.
Questions
Question 1. Above I use ;eff = (mnl,d) for both the pilot and the actual survey, although I plan to use a MMNL (mixed logit). My decision is based on the finding of Bliemer and Rose (2010): the efficient design for the MNL model performed relatively efficient for estimating panel mixed logit models, although some efficiency will be lost. Is it a good approach?
Question 2. I put the criterion for stopping evaluations because I found evaluations took very long in some cases. I thought 10,000 iterations would be sufficient. Is it a good approach?
Question 3. I plan to consider some interaction effects between eco-labels and some economic characteristics of respondents. Respondent characteristics of interest are continuous. But I do not consider the interactions in designing experiments. Is it a good approach?
Any feedback would be very appreciated. Thank you.
I am happy to join this forum of Ngene. I am trying to ensure I am going in the right direction. Let me describe:
Three alternatives
Alt1: product (unlabeled)
Alt2: product (unlabeled)
Alt3: neither (I would not choose option A and B)
Three attributes
Att1: a generic attribute measured by percentage, two levels, [20%, 30%]
Att2: a generic discrete attribute, five levels, [0, 1, 2, 3, 4]
Att3: price, three levels, [$20, $30, $40]
Research purposes and models
Att2 is about production practices and has five levels. Among the five levels, we have an interest in two levels especially. The two levels represent two different environmentally friendly practices (two eco-labels).
The main purpose is to estimate the marginal WTP for the two eco-labels. Specifically,
(1) Estimate the WTP for the two eco-labels, including means and more information about the WTP distribution
(2) Examine how WTP for the two eco-labels varies with some respondent characteristics (for example, income, gender, age)
(3) Predict potential market shares of eco-labeled products
For (1), I plan to use a random parameter logit (i.e., mixed logit) to examine the WTP distribution, considering heterogeneous preference.
For (2), I plan to consider interaction terms between eco-label dummies and some covariates.
Procedures
Step 1. For the pilot study,
Generate efficient design with zero priors.
- Code: Select all
Design
;alts = alt1*, alt2*, alt3 ? alt1 and alt2 are unlabeled products
;rows = 60 ? 60 is needed for balanced
;eff = (mnl,d) ? Use a D-efficiency design because it is most widely used
;alg = swap(stop=total(10000 iterations)) ? Criterion for stopping evaluations
;block = 4 ? Each respondent will answer 15 choice tasks (=60/4)
;model:
U(alt1) = b11 + b2 * A[0.20,0.30] + b3.dummy[0|0|0|0] * B[0,1,2,3,4] + b4 * C[20,30,40] /
U(alt2) = b12 + b2 * A + b3 * B + b4 * C
$
Step 2: Run pilot, estimate for MNL.
Step 3. Actual survey
Use priors and SE's from the pilot to design a d-efficient design.
Use the same code as the pilot except for priors.
A request
Is the procedure reasonable in general? Do I miss something? Any feedback would be very appreciated. For information, I put some comments next to each code sentence.
Questions
Question 1. Above I use ;eff = (mnl,d) for both the pilot and the actual survey, although I plan to use a MMNL (mixed logit). My decision is based on the finding of Bliemer and Rose (2010): the efficient design for the MNL model performed relatively efficient for estimating panel mixed logit models, although some efficiency will be lost. Is it a good approach?
Question 2. I put the criterion for stopping evaluations because I found evaluations took very long in some cases. I thought 10,000 iterations would be sufficient. Is it a good approach?
Question 3. I plan to consider some interaction effects between eco-labels and some economic characteristics of respondents. Respondent characteristics of interest are continuous. But I do not consider the interactions in designing experiments. Is it a good approach?
Any feedback would be very appreciated. Thank you.