1 Traffic Modelling, Department of Transport, Technical University of Denmark2 Department of Transport, Technical University of Denmark3 Department of Management Engineering, Technical University of Denmark
This thesis gives an overview of what has been done in the research area of passenger transport modelling, with a focus on the model type in the core of a model complex. After a formulation of the choice problem (choice probability, the set alternatives), a method for estimation and requirements for data, a literature review follows. Models applied for estimation of discrete choice models are described by properties and limitations, and relations between these are established. Model types are grouped into three classes, Hybrid choice models, Tree models and Latent class models. Relations between model, data and estimation are described, with a focus of possibilities/limitations of different techniques. Two special issues of modelling are addressed in further detail, namely data segmentation and estimation of Mixed Logit models. Both issues are concerned with whether individuals can be assumed ’homogeneous’, that is, can be described by the same model (fixed coefficients). First, a new method for data segmentation is proposed, which segments data by individual preferences. Segmentation by individual preferences will diminish the severeness of the assumed homogeneity of individuals (assumed for estimation of choice models). For application of the method an algorithm is provided with a case. Also for the second issue, estimation of Mixed Logit models, a method was proposed. The most commonly used approach to estimate Mixed Logit models, is to employ the Maximum Simulated Likelihood estimation (MSL), which simultaneously finds optimal coefficients values (utility elements) and parameter values (distributed terms) in the utility function. The shape of the distributed terms is specified prior to the estimation; hence, the validity is not tested during the estimation. The proposed method, assesses the shape of the distribution from data, by means of repetitive model estimation. In particular, one model was estimated for each sub-sample of data. The shape of distributions is assessed from between model comparisons. This is not to be regarded as an alternative to MSL estimation, rather as a complimentary test to assess the shape of distribution prior to a MSL estimation. The method is tested on synthetic data where different shapes of distribution are assumed for distributed terms. Differences in choice of alternative are asserted for different distributions and MNL estimation is seen to have some difficulty in explaining the choice of alternative. For the MSL estimation, some problems with the traditional determination of which shape of distribution to employ, are demonstrated. Further, all tested distributed terms (different shape of distribution) were significant; neither of these distributions were superior to the others. The proposed method for determination of shape of distribution, was able to recover the ’correct’ shape of distributions and to pinpoint which term had the highest level of variance and therefore seems very promising. Followingthe method was applied to ’real’ data, where distribution of coefficients were found. All the shapes of distributions found, complied with sound knowledge in terms of which should be uni-modal, sign specific and/or skewed distributions.