by Michiel Bliemer » Wed Mar 16, 2022 4:57 pm
If J is the number of alternativesm then you can include a constant only in J-1 alternatives (generic alternatives in an unlabelled experiment typically have no constants as you mention) and you can only include socio-demographic variables as a main effect in J-1 alternatives. The reason is that choice probabilities rely on the differences in utilities between the alternatives.
For example,
U(A) = 0
U(B) = 2
U(C) = 5
will yield the same choice probabilities as
U(A) = 1
U(B) = 3
U(C) = 6
since the utility differences are the same.
If you include J constants then you overspecify the model. You need to normalise one of the constants to 0,
Similarly, if you add AGE to all alternatives, then it becomes
U(A) = 0 + beta * AGE
U(B) = 2 + beta * AGE
U(C) = 5 + beta * AGE
Since beta * AGE is the same for all alternatives, it simply drops out when computing choice probabillities (because only utility differences matter), which means that beta cannot be estimated. To ensure that all parameters are identifiable, you will need to normalise one of of them to zero, for example
U(A) = 0 + beta * AGE
U(B) = 2 + beta * AGE
U(C) = 5
However, these utility functions DO NOT MAKE SENSE in an unlabelled experiment with generic alternatives. In an unlabelled experiment, all utility fuctions need to be the same as otherwise the model cannot be interpreted. For this reason, socio-demographics should NOT be added as main effects in utility functions of generic alternatives. The only way to meaningfully include socio-demographics in an unlabelled experiment is by creating interactions, for example:
U(A) = 0 + b1 * cost + b2 * cost * AGE
U(B) = 2 + b1 * cost + b2 * cost * AGE
U(C) = 5 + b1 * cost + b2 * cost * AGE
In this case, parameter b2 describes how age influences cost sensitivity.
Note that if you have an opt-out alternative, then you can include socio-demographics as a main effect, for example:
U(A) = b1 * cost + b2 * AGE
U(B) = b1 * cost + b2 * AGE
U(C) = b1 * cost + b2 * AGE
(optout) = constant
To conclude, you need to think carefully about what utility functions are meaningful and make sure that all parameters are identifiable.
Michiel