by johnr » Fri May 16, 2014 11:45 am
Hi
As you state below, the issue is that you need to specify the model in preference space, not WTP, so you need to convert the WTP estimates to the constituent components, beta_k and beta_c. Given that you have no knowledge of the price parameter, beta_c, then you proposal sounds like the most sensible approach to the issue.
In reference to your query, the theory says that (asymptotically) the design should not bias the estimates - it impacts upon the standard errors. Whilst there are some that argue the design will influence peoples decisions, if that is the case, then I would suggest we all pack up and go home - it basically says that every published paper using SP is design specific and nothing can be generalised from any published result. Of course, it is possible that if you do certain things with the design, you might induce certain effects - if you use certain designs for example, they are known to induce lexicographic behaviour for example, however these are extreme (but unfortunately common) mistakes. The real issue is that if you get the priors wrong, you get the wrong standard errors. This is why we talk about theoretical minimum sample sizes – the calculation is based on the assumption you got the priors correct. So the worst case scenario would be according to the theory (and my personal experience for what it is worth) is that you will need a larger sample size than what you thought you might have needed.
With regards to your second question, it is the unfortunate matter of how long is a piece of string. Think of it this way, given that utility = beta *X, if you fix X (say for one design iteration), but have a wide spread of betas in your distribution you are drawing from, then U will vary greatly also. So if you have two utility functions, and the betas are jumping all over the place, then their utilities will also be all over the place. The problem is that the AVC matrix is a function of the probabilities, so it will fluctuate greatly over the draws of betas. The D-error, based on the AVC matrix, will also vary greatly, and will exhibit a number of possible outliers. The Bayesian D-error, being the average over D-errors based on the set of draws, will be pulled to the outlier values (as averages are). Now if you have a narrower set of possible betas, the utilities won’t vary as much, and following through, you are less likely to have problems with the optimisation. How large is too large a range? Difficult to say, but I would start by looking at your syntax and calculating the marginal contribution to utility for each of your attributes (i.e., beta*X) across the range of values assumed at the extreme levels. For example, consider your price attribute.
Beta_lower = -0.0006
Beta_upper = -0.011
X_lower = 20
X high = 100
So if the program were to take the lowest beta draw and apply it to two alternatives, one with the lowest price level, and one with the highest, then the marginal utility contributions would be -0.012 and -0.22. Now consider the program drew the highest beta value possible. The marginal contributions to utilities would then be -0.06 and -1.1. So in effect, you are allowing the price to contribute to overall utility somewhere between -0.012 and -1.1. To get why this might be a problem, do the same exercise for the other attributes. You will see that the other attributes are contributing nearly nothing to the utility which may be the case, but it means that in order to find these effects relative to the price effect, the design (any design) will struggle – and this will come through in terms of the programs ability to locate a good design.
Hopefully, you can see why I say it is a how long is a piece of string type problem. It will depend not just on the parameter and the X, but on the other parameters and Xs also. And every problem will have different parameters and different Xs, so there is no single answer to your question other than, it depends on the problem! But you can do the type of pre-design thinking I suggest above (and have suggested in other posts) to minimise these types of issues arising.
John