Calculating indicators with PythonBiogeme

The package Biogeme (biogeme.epfl.ch) is designed to estimate the parameters of various models using maximum likelihood estimation. But it can also be used to extract indicators from an estimated model. In this document, we describe how to calculate some indicators particularly relevant in the context of discrete choice models: market shares, revenues, elasticities, and willingness to pay. Clearly, the use of the software is restricted neither to these indicators nor to choice models. But these examples illustrate most of the capabilities.

1 The model

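We consider a case study involving a transportation mode choice model, using revealed preference data collected in Switzerland in 2009 and 2010 (see Atasoy et al., 2013). The model is a nested logit model with 3 alternatives: public transportation, car and slow modes. The utility functions are defined as:

$$
\begin{aligned}
V_{\text{PT}} &= \text{BETA\_TIME\_FULLTIME} \cdot \text{TimePT\_scaled} \cdot \text{fulltime} \\
&\quad + \text{BETA\_TIME\_OTHER} \cdot \text{TimePT\_scaled} \cdot \text{notfulltime} \\
&\quad + \text{BETA\_COST} \cdot \text{MarginalCostPT\_scaled}, \\
V_{\text{CAR}} &= \text{ASC\_CAR} + \text{BETA\_TIME\_FULLTIME} \cdot \text{TimeCar\_scaled} \cdot \text{fulltime} \\
&\quad + \text{BETA\_TIME\_OTHER} \cdot \text{TimeCar\_scaled} \cdot \text{notfulltime} \\
&\quad + \text{BETA\_COST} \cdot \text{CostCarCHF\_scaled}, \\
V_{\text{SM}} &= \text{ASC\_SM} + \text{BETA\_DIST\_MALE} \cdot \text{distance\_km\_scaled} \cdot \text{male} \\
&\quad + \text{BETA\_DIST\_FEMALE} \cdot \text{distance\_km\_scaled} \cdot \text{female} \\
&\quad + \text{BETA\_DIST\_UNREPORTED} \cdot \text{distance\_km\_scaled} \cdot \text{unreportedGender},
\end{aligned}
$$

where ASC_CAR, ASC_SM, BETA_TIME_FULLTIME, BETA_TIME_OTHER, BETA_DIST_MALE, BETA_DIST_FEMALE, BETA_DIST_UNREPORTED and BETA_COST are parameters to be estimated, TimePT_scaled, MarginalCostPT_scaled, TimeCar_scaled, CostCarCHF_scaled and distance_km_scaled are attributes, and fulltime, notfulltime, male, female and unreportedGender are socio-economic characteristics. The two alternatives “public transportation” and “slow modes” are grouped into a nest. The complete specification is available in the file 01nestedEstimation.py, reported in Section A.1. We refer the reader to Bierlaire (2016) for an introduction to the syntax.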
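To make the nesting structure concrete, the following minimal sketch computes nested logit choice probabilities for given utility values. It is plain Python, not PythonBiogeme: the function name `nested_logit_probabilities` and the utility values are hypothetical, and the formulation (top-level scale normalized to 1, nest parameters greater than or equal to 1) is an assumption chosen to be consistent with the bounds of NEST_NOCAR in Section A.1.

```python
import math

def nested_logit_probabilities(V, nests):
    """Return {alternative: probability} for a nested logit model.

    V     : dict mapping alternative id -> utility value
    nests : list of (mu_m, [alternative ids]) pairs; the top-level
            scale is normalized to 1 (assumption), so mu_m >= 1.
    """
    # Expected maximum utility (logsum) of each nest
    logsums = []
    for mu, alts in nests:
        logsums.append(math.log(sum(math.exp(mu * V[j]) for j in alts)) / mu)
    denom = sum(math.exp(ls) for ls in logsums)

    probs = {}
    for (mu, alts), ls in zip(nests, logsums):
        p_nest = math.exp(ls) / denom                  # P(nest)
        within = sum(math.exp(mu * V[j]) for j in alts)
        for i in alts:
            # P(i) = P(i | nest) * P(nest)
            probs[i] = p_nest * math.exp(mu * V[i]) / within
    return probs

# Hypothetical utilities for PT (0), car (1) and slow modes (2)
V = {0: -0.5, 1: 0.2, 2: -1.0}
nests = [(1.0, [1]), (1.52853, [0, 2])]   # same layout as CAR / NO_CAR
probs = nested_logit_probabilities(V, nests)
```

When every nest parameter is set to 1, the sketch reduces to the logit model, which is a convenient sanity check.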

The parameters are estimated using PythonBiogeme. Their values are reported in Table 1. A file named 01nestedEstimation_param.py is also generated. It contains the values of the estimated parameters written in PythonBiogeme syntax, as well as the code necessary to perform a sensitivity analysis. This code provides the variance-covariance matrix of the estimates.

2 Market shares and revenues

Once the model has been estimated, it can be used to derive useful indicators. PythonBiogeme provides a simulation feature for this purpose. We start by describing how to calculate market shares using sample enumeration. It is necessary to have a sample of individuals from the population. For each of them, the value of each of the variables involved in the model must be known. Note that it is possible to use the same sample as the one used for estimation, but only if it contains revealed preference data. Indeed, the calculation of indicators requires real values for the variables, not values that have been engineered for the sake of estimating parameters, as in stated preference data. Reusing the estimation sample is the procedure followed in this document.

More formally, consider a choice model P_n(i|x_n, C_n) providing the probability that individual n chooses alternative i within the choice set C_n, given the explanatory variables x_n. In order to calculate the market shares in the population of size N, a sample of N_s individuals is drawn. As it is rarely possible to draw from the population with equal sampling probability, it is assumed that stratified sampling has been used, and that each individual n in the sample is associated with a weight w_n correcting for sampling biases. The weights are normalized such that

$$ \sum_{n=1}^{N_s} w_n = N_s. \tag{1} $$

An estimator of the market share of alternative i in the population is then

$$ W_i = \frac{1}{N_s} \sum_{n=1}^{N_s} w_n P_n(i \mid x_n, \mathcal{C}_n). \tag{2} $$

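If the alternative i involves a price variable p_in, the expected revenue generated by i is

$$ R_i = \frac{N}{N_s} \sum_{n=1}^{N_s} w_n \, p_{in} \, P_n(i \mid x_n, p_{in}, \mathcal{C}_n). \tag{3} $$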

In practice, the size of the population is rarely known, and the above quantity is used only in the context of price optimization. In this case, the factor N∕N_s can be omitted.
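The two quantities (2) and (3) can be illustrated outside PythonBiogeme with a few lines of plain Python. This is only a sketch of sample enumeration: the function names and all numerical values below are made up for illustration, and the probabilities would normally come from the estimated choice model.

```python
def market_share(weights, probabilities):
    """Weighted average of the individual choice probabilities."""
    n_s = len(weights)
    return sum(w * p for w, p in zip(weights, probabilities)) / n_s

def expected_revenue(weights, probabilities, prices, scale=1.0):
    """Weighted total of price * probability; `scale` stands for N / N_s."""
    return scale * sum(w * c * p
                       for w, p, c in zip(weights, probabilities, prices))

# Hypothetical sample of four individuals; the weights sum to N_s = 4
weights = [0.5, 1.5, 1.2, 0.8]
prob_pt = [0.30, 0.10, 0.45, 0.25]   # P_n(PT | x_n, C_n), made up
price_pt = [3.0, 5.0, 2.0, 4.0]      # price paid by each individual, made up

share_pt = market_share(weights, prob_pt)
revenue_pt = expected_revenue(weights, prob_pt, price_pt)
```

Leaving `scale` at 1 corresponds to omitting the factor N∕N_s, that is, to the revenue generated by the individuals in the sample.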

To calculate (2) and (3) with PythonBiogeme, a specification file must be prepared. In our example, the file 02nestedSimulation.py, reported in Section A.2, has been produced as follows:

In the output of the estimation (see the file 01nestedEstimation.html), the sum of all weights has been calculated using the statement BIOGEME_OBJECT.STATISTICS['Sum of weights'] = Sum(Weight,'obsIter') (see Section A.1).

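The reported result is 0.814484. Therefore, in order to verify (1), the weights are rescaled using the statements theWeight = Weight * 1906 / 0.814484 and BIOGEME_OBJECT.WEIGHT = theWeight, where 1906 is the number of entries in the sample (see Section A.2).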
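The rescaling of the weights amounts to multiplying each of them by the number of observations divided by their sum. A minimal plain-Python sketch, with a hypothetical function name and made-up raw weights, is:

```python
def normalize_weights(raw_weights):
    """Rescale weights so that they sum to the number of observations."""
    n_s = len(raw_weights)
    total = sum(raw_weights)
    return [w * n_s / total for w in raw_weights]

raw = [0.0002, 0.0005, 0.0003, 0.0010]   # made-up raw sampling weights
normalized = normalize_weights(raw)
```

The relative importance of the observations is unchanged; only the overall scale is adjusted so that the weighted average and the weighted total are directly interpretable.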

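The following statements are included for the calculation of elasticities and will be used later (see Section 3 for more details): the three entries of BIOGEME_OBJECT.STATISTICS labeled 'Normalization for elasticities PT', 'Normalization for elasticities CAR' and 'Normalization for elasticities SM' in Section A.2.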

Therefore, the result of (2) is available in the row “Weighted average”. In this example, the market shares are:
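As a cross-check, the market shares can be recovered from the normalization factors hard-coded in Section A.3, under the assumption that each factor is the weighted sum of the corresponding choice probabilities over the 1906 entries, so that dividing by the number of entries gives (2). The variable names below are hypothetical.

```python
n_entries = 1906
# Values of normalization_pt, normalization_car, normalization_sm (Section A.3)
normalization = {"PT": 535.086, "CAR": 1244.77, "SM": 126.147}

shares = {alt: value / n_entries for alt, value in normalization.items()}
for alt, share in shares.items():
    print(f"{alt}: {100 * share:.1f}%")
```

With these inputs the shares are about 28.1% for public transportation, 65.3% for car and 6.6% for slow modes, and they sum to one up to rounding.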

The result of (3) is obtained in the row “Weighted total”. In this case, the expected revenue (generated by the individuals in the sample) is 3018.29 (confidence interval: [2442.87,3826.36]).

3 Elasticities

Consider now one of the variables involved in the model, for instance x_ink, the kth variable associated with alternative i for individual n. The objective is to anticipate the impact of a change of the value of this variable on the choice of individual n, and subsequently on the market share of alternative i.

3.1 Point elasticities

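If the variable is continuous, we assume that the relative (infinitesimal) change of the variable is the same for every individual in the population, that is

$$ \frac{\partial x_{ink}}{x_{ink}} = \frac{\partial x_{ipk}}{x_{ipk}} = \frac{\partial x_{ik}}{x_{ik}} \quad \text{for all individuals } n \text{ and } p, $$

where

$$ x_{ik} = \frac{1}{N_s} \sum_{n=1}^{N_s} x_{ink} \tag{15} $$

is the average value of the variable in the sample.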

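The disaggregate direct point elasticity of the model with respect to the variable x_ink is defined as

$$ E^{P_n(i)}_{x_{ink}} = \frac{\partial P_n(i \mid x_n, \mathcal{C}_n)}{\partial x_{ink}} \, \frac{x_{ink}}{P_n(i \mid x_n, \mathcal{C}_n)}. \tag{16} $$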

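The aggregate direct point elasticity of the model with respect to the average value x_ik is defined as

$$ E^{W_i}_{x_{ik}} = \frac{\partial W_i}{\partial x_{ik}} \, \frac{x_{ik}}{W_i}. $$

Using (2) and the assumption of identical relative changes, which implies that x_ik ∂x_ink∕∂x_ik = x_ink, it can be written in terms of the disaggregate elasticities:

$$ E^{W_i}_{x_{ik}} = \sum_{n=1}^{N_s} \frac{w_n P_n(i \mid x_n, \mathcal{C}_n)}{\sum_{m=1}^{N_s} w_m P_m(i \mid x_m, \mathcal{C}_m)} \, E^{P_n(i)}_{x_{ink}}. \tag{20} $$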

This equation shows that the calculation of aggregate elasticities involves a weighted sum of disaggregate elasticities. However, the weight is not w_n as for the market share, but a normalized version of w_n P_n(i|x_n, C_n).
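The aggregation rule can be verified numerically on a toy example. The sketch below uses a binary logit model instead of the nested logit of the case study, with hypothetical coefficients, weights and attribute values; it compares the weighted sum of disaggregate elasticities with a finite-difference elasticity of the market share under a uniform relative change of the variable.

```python
import math

BETA, ASC = -0.8, 0.5          # hypothetical coefficients of a binary logit

def prob(x):
    """Probability of choosing the alternative whose attribute is x."""
    return 1.0 / (1.0 + math.exp(-(ASC + BETA * x)))

def disaggregate_elasticity(x):
    """Closed form for the binary logit: dP/dx * x / P = BETA * x * (1 - P)."""
    return BETA * x * (1.0 - prob(x))

weights = [0.5, 1.5, 1.2, 0.8]     # made up, sum to N_s = 4
x = [1.0, 2.5, 0.7, 1.8]           # made-up attribute values

# Weighted sum of disaggregate elasticities, with weights w_n * P_n
norm = sum(w * prob(xn) for w, xn in zip(weights, x))
aggregate = sum(w * prob(xn) * disaggregate_elasticity(xn)
                for w, xn in zip(weights, x)) / norm

def share(scale):
    """Market share when every x_n is multiplied by `scale`."""
    return sum(w * prob(scale * xn) for w, xn in zip(weights, x)) / len(x)

# Finite-difference elasticity of the market share to a uniform relative change
h = 1e-6
numerical = (share(1 + h) - share(1 - h)) / (2 * h) / share(1.0)
```

Both `aggregate` and `numerical` approximate the same quantity, which illustrates why the weights of the aggregation are proportional to w_n P_n rather than to w_n alone.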

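The disaggregate cross point elasticity of the model with respect to the variable x_jnk is defined as

$$ E^{P_n(i)}_{x_{jnk}} = \frac{\partial P_n(i \mid x_n, \mathcal{C}_n)}{\partial x_{jnk}} \, \frac{x_{jnk}}{P_n(i \mid x_n, \mathcal{C}_n)}. \tag{21} $$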

It is called cross elasticity because it measures the sensitivity of the model for alternative i with respect to a modification of the attribute of another alternative.
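The difference between direct and cross elasticities can be seen on a small example. The sketch below is not the nested logit of the case study: it uses a logit model with three alternatives, a hypothetical cost coefficient and made-up costs, and approximates the derivatives by central finite differences.

```python
import math

BETA_COST = -0.7   # hypothetical cost coefficient

def probabilities(costs):
    """Logit probabilities when the utility is BETA_COST * cost."""
    expv = [math.exp(BETA_COST * c) for c in costs]
    total = sum(expv)
    return [e / total for e in expv]

def elasticity(costs, i, j, h=1e-6):
    """Point elasticity of P(i) with respect to the cost of alternative j,
    approximated by a central finite difference."""
    up = list(costs)
    up[j] += h
    down = list(costs)
    down[j] -= h
    dp = (probabilities(up)[i] - probabilities(down)[i]) / (2 * h)
    return dp * costs[j] / probabilities(costs)[i]

costs = [2.0, 3.5, 1.0]                 # made-up costs of three alternatives
direct = elasticity(costs, 0, 0)        # effect of the alternative's own cost
cross = elasticity(costs, 0, 1)         # effect of a competitor's cost
```

With a negative cost coefficient, `direct` is negative and `cross` is positive: an alternative loses share when its own cost increases and gains share when a competitor becomes more expensive.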

3.2 Arc elasticities

A similar derivation can be done for arc elasticities. In this case, the relative change of the variable is not infinitesimal anymore. The idea is to analyze a before/after scenario. The variable x_ink in the before scenario becomes x_ink + Δx_ink in the after scenario. As above, we assume that the relative change of the variable is the same for every individual in the population, that is

$$ \frac{\Delta x_{ink}}{x_{ink}} = \frac{\Delta x_{ipk}}{x_{ipk}} = \frac{\Delta x_{ik}}{x_{ik}} \quad \text{for all individuals } n \text{ and } p, $$

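where x_ik is defined by (15). The disaggregate direct arc elasticity of the model with respect to the variable x_ink is defined as

$$ E^{P_n(i)}_{x_{ink}} = \frac{\Delta P_n(i \mid x_n, \mathcal{C}_n)}{\Delta x_{ink}} \, \frac{x_{ink}}{P_n(i \mid x_n, \mathcal{C}_n)}. $$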

The aggregate direct arc elasticity of the model with respect to the average value x_ik is defined as

$$ E^{W_i}_{x_{ik}} = \frac{\Delta W_i}{\Delta x_{ik}} \, \frac{x_{ik}}{W_i}. $$

The two quantities are also related by (20), following the exact same derivation as for the point elasticity.
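A before/after calculation of this kind can be sketched in plain Python. The model below is a made-up binary logit for the slow modes with a hypothetical distance coefficient, and the weights and distances are invented; only the structure of the calculation (disaggregate arc elasticities, then aggregation with weights proportional to w_n P_n) follows the text.

```python
import math

BETA_DIST = -0.15   # hypothetical distance coefficient (per km)

def prob_sm(distance_km):
    """Made-up binary logit probability of choosing the slow modes."""
    return 1.0 / (1.0 + math.exp(-(1.0 + BETA_DIST * distance_km)))

def arc_elasticity(before, after, x, delta_x):
    """(after - before) / delta_x * x / before."""
    return (after - before) / delta_x * x / before

weights = [0.5, 1.5, 1.2, 0.8]          # made up, sum to N_s = 4
distances = [1.0, 4.0, 2.5, 8.0]        # made up, in km
delta = 1.0                             # uniform increase of 1 km

p_before = [prob_sm(d) for d in distances]
p_after = [prob_sm(d + delta) for d in distances]

disaggregate = [arc_elasticity(b, a, d, delta)
                for b, a, d in zip(p_before, p_after, distances)]

# Aggregation with the normalized weights w_n * P_n, as for point elasticities
norm = sum(w * b for w, b in zip(weights, p_before))
aggregate = sum(w * b * e
                for w, b, e in zip(weights, p_before, disaggregate)) / norm
```

The probabilities of the before scenario are used both in the denominator of each disaggregate elasticity and in the aggregation weights.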

3.3 Using PythonBiogeme for point elasticities

The calculation of (16) involves derivatives. For simple models such as logit, the analytical formula of these derivatives can easily be derived. However, their derivation for advanced models can be tedious. It is common to make mistakes in the derivation itself, and even more common to make mistakes in the implementation. Therefore, PythonBiogeme provides an operator that calculates the derivative of a formula. It is illustrated in the file 03nestedElasticities.py, reported in Section A.3. The statements that trigger the calculation of the elasticities use the Derive operator, for instance elas_pt_time = Derive(prob_pt,'TimePT') * TimePT / prob_pt, with one such statement per alternative and variable of interest.

The above syntax should be self-explanatory. But there is an important aspect to take into account. In the context of the estimation of the parameters of the model, the variables have been scaled in order to improve the numerical properties of the likelihood function, using statements like TimePT_scaled = DefineVariable('TimePT_scaled', TimePT / 200) (see Section A.1).

The DefineVariable operator is designed to preprocess the data file, and can be seen as a way to add another column in the data file, defining a new variable. However, the relationship between the new variable and the original one is lost. Therefore, PythonBiogeme is not able to properly calculate the derivatives. In this example, the variable of interest is TimePT, not TimePT_scaled, and their relationship must be explicitly known to correctly calculate the derivatives. Consequently, all statements such as TimePT_scaled = DefineVariable('TimePT_scaled', TimePT / 200) have been replaced by plain expressions such as TimePT_scaled = TimePT / 200 in the file 03nestedElasticities.py (see Section A.3).
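The issue can be mimicked in plain Python. The sketch below is only an analogy and does not describe the internals of PythonBiogeme: a utility that reads a precomputed column does not react to a change in the original variable, whereas a utility that recomputes the scaling does. The coefficient, the function names and the data row are hypothetical.

```python
BETA_TIME = -1.6       # hypothetical coefficient of the scaled travel time

def utility_from_column(row):
    """Uses a precomputed column: the link to TimePT is not visible here."""
    return BETA_TIME * row["TimePT_scaled"]

def utility_from_expression(row):
    """Recomputes the scaling, so the dependence on TimePT is explicit."""
    return BETA_TIME * (row["TimePT"] / 200)

def derivative(func, row, name, h=1e-4):
    """Central finite difference of func with respect to row[name]."""
    up = dict(row)
    up[name] += h
    down = dict(row)
    down[name] -= h
    return (func(up) - func(down)) / (2 * h)

row = {"TimePT": 85.0, "TimePT_scaled": 85.0 / 200}

d_column = derivative(utility_from_column, row, "TimePT")
d_expression = derivative(utility_from_expression, row, "TimePT")
```

Here `d_column` is zero, because the precomputed column is treated as independent data, while `d_expression` recovers the expected value BETA_TIME / 200.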

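The aggregate point elasticities can be obtained by aggregating the disaggregate elasticities, using (20). This requires the calculation of the normalization factors

$$ \sum_{n=1}^{N_s} w_n P_n(i \mid x_n, \mathcal{C}_n) $$

for each alternative i. They correspond to the statistics labeled 'Normalization for elasticities' in the simulation of Section 2, and are hard-coded in 03nestedElasticities.py: 535.086 for public transportation, 1244.77 for car and 126.147 for slow modes.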

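The quantities that must be calculated for each individual in order to derive the aggregate elasticities correspond to the entries of the simulate dictionary whose keys start with 'Agg. Elast.' in Section A.3, for instance elas_pt_time * prob_pt / normalization_pt.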

Note that the weights have not been included in the above formula, so that the values of the aggregate elasticities can be found in the row “Weighted total”. If the weights had instead been included explicitly in the formula, the aggregate value would have been found in the row “Total” instead of “Weighted total”. Note also that we have omitted to report the confidence intervals in this example, by commenting out the statement BIOGEME_OBJECT.VARCOVAR = vc (see Section A.3).

3.4 Using PythonBiogeme for cross elasticities

The calculation of (21) is performed in a similar way as the direct elasticities (16), using the following statements:

The corresponding aggregate elasticities are calculated exactly as in the direct case, and their values can be found in the row “Weighted total”.

Note that these values are now positive. Indeed, when the travel time or travel cost of a competing mode increases, the market share of the alternative under consideration increases.

3.5 Using PythonBiogeme for arc elasticities

Arc elasticities require a before and an after scenario. In this case, we calculate the sensitivity of the market share of the slow modes alternative when there is a uniform increase of the distance by 1 kilometer.

The “before” scenario is represented by the same model as above. The after scenario is modeled using the following statements:

The aggregate elasticity is calculated as explained above. It is equal here to -1.00708, and the confidence interval is [-1.7212,-0.562574].

4 Willingness to pay

If the model contains a cost or price variable (like in this example), it is possible to analyze the trade-off between any variable and money. This reflects the willingness of the decision maker to pay for a modification of another variable of the model. A typical example in transportation is the value of time, that is the amount of money a traveler is willing to pay in order to decrease her travel time.

Let c_in be the cost of alternative i for individual n. Let x_ink be the value of another variable of the model. Let V_in(c_in,x_ink) be the value of the utility function. Consider a scenario where the variable of interest takes the value x_ink + δ_ink^x. We denote by δ_in^c the additional cost that would achieve the same utility, that is

$$ V_{in}(c_{in} + \delta^c_{in}, x_{ink} + \delta^x_{ink}) = V_{in}(c_{in}, x_{ink}). \tag{26} $$

The willingness to pay to increase the value of x_ink is defined as the additional cost per unit of x, that is

$$ \frac{\delta^c_{in}}{\delta^x_{ink}}, $$

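and is obtained by solving Equation (26). If x_ink and c_in appear linearly in the utility function, that is if

$$ V_{in}(c_{in}, x_{ink}) = \beta_c c_{in} + \beta_x x_{ink} + \cdots, $$

then (26) reduces to β_c δ_in^c + β_x δ_ink^x = 0, and the willingness to pay is

$$ \frac{\delta^c_{in}}{\delta^x_{ink}} = -\frac{\beta_x}{\beta_c}. \tag{30} $$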

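If x_ink is a continuous variable, and if V_in is differentiable in x_ink and c_in, we can invoke Taylor’s theorem in (26):

$$ V_{in}(c_{in}, x_{ink}) = V_{in}(c_{in} + \delta^c_{in}, x_{ink} + \delta^x_{ink}) \approx V_{in}(c_{in}, x_{ink}) + \delta^c_{in} \frac{\partial V_{in}}{\partial c_{in}}(c_{in}, x_{ink}) + \delta^x_{ink} \frac{\partial V_{in}}{\partial x_{ink}}(c_{in}, x_{ink}), $$

so that the willingness to pay is

$$ \frac{\delta^c_{in}}{\delta^x_{ink}} = -\frac{\partial V_{in} / \partial x_{ink}}{\partial V_{in} / \partial c_{in}}. \tag{32} $$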

Note that if x_ink and c_in appear linearly in the utility function, (32) is the same as (30). If we consider now a scenario where the variable of interest takes the value x_ink - δ_ink^x, the same derivation leads to the willingness to pay to decrease the value of x_ink:

$$ \frac{\delta^c_{in}}{\delta^x_{ink}} = \frac{\partial V_{in} / \partial x_{ink}}{\partial V_{in} / \partial c_{in}} = \frac{\beta_x}{\beta_c}, \tag{33} $$

where the last equation assumes that V is linear in these variables, with coefficients β_x and β_c. Note that, in this special case of linear utility functions, the value of time is constant across individuals, and is also independent of δ_ink^x. This is not true in general.

The calculation of (33) involves the calculation of derivatives. It is done in PythonBiogeme using the following statements:

The full specification file can be found in Section A.6. The aggregate values are found in the “Weighted average” row of the report file: 3.95822 CHF/hour (confidence interval: [1.98696,7.81565]). Note that this value is abnormally low, which is a sign of a potential poor specification of the model. Note also that, with this specification, the value of time is the same for car and public transportation, as the coefficients of the time and cost variables are generic.

Finally, it is important to look at the distribution of the willingness to pay in the population/sample. The detailed records of the report file make this possible. It is easy to drag and drop the HTML report file into your favorite spreadsheet software in order to compute additional statistics. In this example, the value of time takes two values, depending on the employment status of the individual:
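Because the utility functions are linear in time and cost, the two values can be cross-checked by hand from the estimates listed in Section A.2 and the scaling factors of the variables. The sketch below is plain Python; it assumes that travel time is expressed in minutes and cost in CHF, and the function name is hypothetical.

```python
# Estimates copied from Section A.2
BETA_TIME_FULLTIME = -1.59709
BETA_TIME_OTHER = -0.577362
BETA_COST = -0.716192

TIME_SCALE = 200    # TimePT_scaled = TimePT / 200
COST_SCALE = 10     # MarginalCostPT_scaled = MarginalCostPT / 10

def value_of_time(beta_time, beta_cost=BETA_COST):
    """Ratio of the derivatives of the utility with respect to the unscaled
    time and cost, converted to CHF per hour (assumes time in minutes)."""
    dV_dtime = beta_time / TIME_SCALE
    dV_dcost = beta_cost / COST_SCALE
    return 60 * dV_dtime / dV_dcost

vot_fulltime = value_of_time(BETA_TIME_FULLTIME)
vot_other = value_of_time(BETA_TIME_OTHER)
print(f"full time: {vot_fulltime:.2f} CHF/hour, other: {vot_other:.2f} CHF/hour")
```

Under these assumptions the sketch gives about 6.69 CHF/hour for full-time workers and 2.42 CHF/hour for the other individuals, which brackets the weighted average of 3.95822 CHF/hour reported above.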

5 Conclusion

PythonBiogeme is a flexible tool for extracting useful indicators from complex models. In this document, we have presented how some indicators relevant for discrete choice models can be generated. Thanks to its HTML format, the report can be displayed in your favorite browser, and the generated values can be imported into a spreadsheet for further manipulation.

A Complete specification files

A.1 01nestedEstimation.py

## File 01nestedEstimation.py
## Simple nested logit model for the Optima case study
## Wed May 10 10:55:12 2017

from biogeme import *
from headers import *
from loglikelihood import *
from statistics import *
from nested import *

### Three alternatives:
# CAR: automobile
# PT: public transportation
# SM: slow mode (walking, biking)

### List of parameters to be estimated
ASC_CAR = Beta('ASC_CAR',0,-10000,10000,0)
ASC_SM = Beta('ASC_SM',0,-10000,10000,0)
BETA_TIME_FULLTIME = Beta('BETA_TIME_FULLTIME',0,-10000,10000,0)
BETA_TIME_OTHER = Beta('BETA_TIME_OTHER',0,-10000,10000,0)
BETA_DIST_MALE = Beta('BETA_DIST_MALE',0,-10000,10000,0)
BETA_DIST_FEMALE = Beta('BETA_DIST_FEMALE',0,-10000,10000,0)
BETA_DIST_UNREPORTED = Beta('BETA_DIST_UNREPORTED',0,-10000,10000,0)
BETA_COST = Beta('BETA_COST',0,-10000,10000,0)


### Definition of variables:
# For numerical reasons, it is good practice to scale the data so
# that the values of the parameters are around 1.0.

# The following statements are designed to preprocess the data.
# It is like creating a new column in the data file. This
# should be preferred to a statement like
# TimePT_scaled = TimePT / 200.0
# which will cause the division to be reevaluated again and again,
# through the iterations. For models taking a long time to
# estimate, it may make a significant difference.

TimePT_scaled = DefineVariable('TimePT_scaled', TimePT / 200 )
TimeCar_scaled = DefineVariable('TimeCar_scaled', TimeCar / 200 )
MarginalCostPT_scaled = DefineVariable('MarginalCostPT_scaled',
                                       MarginalCostPT / 10 )
CostCarCHF_scaled = DefineVariable('CostCarCHF_scaled',
                                   CostCarCHF / 10 )
distance_km_scaled = DefineVariable('distance_km_scaled',
                                    distance_km / 5 )

male = DefineVariable('male',Gender == 1)
female = DefineVariable('female',Gender == 2)
unreportedGender = DefineVariable('unreportedGender',Gender == -1)

fulltime = DefineVariable('fulltime',OccupStat == 1)
notfulltime = DefineVariable('notfulltime',OccupStat != 1)

### Definition of utility functions:
V_PT = BETA_TIME_FULLTIME * TimePT_scaled * fulltime + \
       BETA_TIME_OTHER * TimePT_scaled * notfulltime + \
       BETA_COST * MarginalCostPT_scaled
V_CAR = ASC_CAR + \
        BETA_TIME_FULLTIME * TimeCar_scaled * fulltime + \
        BETA_TIME_OTHER * TimeCar_scaled * notfulltime + \
        BETA_COST * CostCarCHF_scaled
V_SM = ASC_SM + \
       BETA_DIST_MALE * distance_km_scaled * male + \
       BETA_DIST_FEMALE * distance_km_scaled * female + \
       BETA_DIST_UNREPORTED * distance_km_scaled * unreportedGender

# Associate utility functions with the numbering of alternatives
V = {0: V_PT,
     1: V_CAR,
     2: V_SM}

# Associate the availability conditions with the alternatives.
# In this example all alternatives are available for each individual.
av = {0: 1,
      1: 1,
      2: 1}

### DEFINITION OF THE NESTS:
# 1: nests parameter
# 2: list of alternatives

NEST_NOCAR = Beta('NEST_NOCAR',1,1.0,10,0)

CAR = 1.0 , [ 1]
NO_CAR = NEST_NOCAR , [ 0, 2]
nests = CAR, NO_CAR

# All observations verifying the following expression will not be
# considered for estimation
BIOGEME_OBJECT.EXCLUDE = Choice == -1


# The choice model is a nested logit, with availability conditions
logprob = lognested(V,av,nests,Choice)

# Defines an iterator on the data
rowIterator('obsIter')

# Statistics
nullLoglikelihood(av,'obsIter')
choiceSet = [0,1,2]
cteLoglikelihood(choiceSet,Choice,'obsIter')
availabilityStatistics(av,'obsIter')

BIOGEME_OBJECT.STATISTICS['Gender: males'] = \
                    Sum(male,'obsIter')
BIOGEME_OBJECT.STATISTICS['Gender: females'] = \
                    Sum(female,'obsIter')
BIOGEME_OBJECT.STATISTICS['Gender: unreported'] = \
                    Sum(unreportedGender,'obsIter')
BIOGEME_OBJECT.STATISTICS['Occupation: full time'] = \
                    Sum(fulltime,'obsIter')
BIOGEME_OBJECT.STATISTICS['Sum of weights'] = \
                    Sum(Weight,'obsIter')

# Define the likelihood function for the estimation
BIOGEME_OBJECT.ESTIMATE = Sum(logprob,'obsIter')
BIOGEME_OBJECT.PARAMETERS['optimizationAlgorithm'] = "CFSQP"

A.2 02nestedSimulation.py

## File 02nestedSimulation.py
## Simple nested logit model for the Optima case study
## Wed May 10 11:24:32 2017

from biogeme import *
from headers import *
from statistics import *
from nested import *

### Three alternatives:
# CAR: automobile
# PT: public transportation
# SM: slow mode (walking, biking)

### List of parameters and their estimated value.
ASC_CAR = Beta('ASC_CAR',0.261291,-10000,10000,0,'ASC_CAR' )
ASC_SM = Beta('ASC_SM',0.0590204,-10000,10000,0,'ASC_SM' )
BETA_TIME_FULLTIME = \
    Beta('BETA_TIME_FULLTIME',-1.59709,-10000,10000,0,'BETA_TIME_FULLTIME' )
BETA_TIME_OTHER = \
    Beta('BETA_TIME_OTHER',-0.577362,-10000,10000,0,'BETA_TIME_OTHER' )
BETA_DIST_MALE = \
    Beta('BETA_DIST_MALE',-0.686327,-10000,10000,0,'BETA_DIST_MALE' )
BETA_DIST_FEMALE = \
    Beta('BETA_DIST_FEMALE',-0.83121,-10000,10000,0,'BETA_DIST_FEMALE' )
BETA_DIST_UNREPORTED = \
    Beta('BETA_DIST_UNREPORTED',-0.702974,-10000,10000,0,'BETA_DIST_UNREPORTED' )
BETA_COST = \
    Beta('BETA_COST',-0.716192,-10000,10000,0,'BETA_COST' )


### Definition of variables:
# For numerical reasons, it is good practice to scale the data so
# that the values of the parameters are around 1.0.

# The following statements are designed to preprocess the data. It is
# like creating a new column in the data file. This should be
# preferred to a statement like
# TimePT_scaled = TimePT / 200.0
# which will cause the division to be reevaluated again and again,
# through the iterations. For models taking a long time to estimate, it
# may make a significant difference.

TimePT_scaled = DefineVariable('TimePT_scaled', TimePT / 200 )
TimeCar_scaled = DefineVariable('TimeCar_scaled', TimeCar / 200 )
MarginalCostPT_scaled = DefineVariable('MarginalCostPT_scaled',
                                       MarginalCostPT / 10 )
CostCarCHF_scaled = DefineVariable('CostCarCHF_scaled',
                                   CostCarCHF / 10 )
distance_km_scaled = DefineVariable('distance_km_scaled',
                                    distance_km / 5 )

male = DefineVariable('male',Gender == 1)
female = DefineVariable('female',Gender == 2)
unreportedGender = DefineVariable('unreportedGender',Gender == -1)

fulltime = DefineVariable('fulltime',OccupStat == 1)
notfulltime = DefineVariable('notfulltime',OccupStat != 1)

### Definition of utility functions:
V_PT = BETA_TIME_FULLTIME * TimePT_scaled * fulltime + \
       BETA_TIME_OTHER * TimePT_scaled * notfulltime + \
       BETA_COST * MarginalCostPT_scaled
V_CAR = ASC_CAR + \
        BETA_TIME_FULLTIME * TimeCar_scaled * fulltime + \
        BETA_TIME_OTHER * TimeCar_scaled * notfulltime + \
        BETA_COST * CostCarCHF_scaled
V_SM = ASC_SM + \
       BETA_DIST_MALE * distance_km_scaled * male + \
       BETA_DIST_FEMALE * distance_km_scaled * female + \
       BETA_DIST_UNREPORTED * distance_km_scaled * unreportedGender


# Associate utility functions with the numbering of alternatives
V = {0: V_PT,
     1: V_CAR,
     2: V_SM}

# Associate the availability conditions with the alternatives.
# In this example all alternatives are available for each individual.
av = {0: 1,
      1: 1,
      2: 1}

### DEFINITION OF THE NESTS:
# 1: nests parameter
# 2: list of alternatives

NEST_NOCAR = Beta('NEST_NOCAR',1.52853,1,10,0,'NEST_NOCAR' )


CAR = 1.0 , [ 1]
NO_CAR = NEST_NOCAR , [ 0, 2]
nests = CAR, NO_CAR

# All observations verifying the following expression will not be
# considered for estimation
exclude = (Choice == -1)
BIOGEME_OBJECT.EXCLUDE = exclude

##
## This has been copied-pasted from the file 01nestedEstimation_param.py
##
## Code for the sensitivity analysis generated after the estimation of the model
names = ['ASC_CAR','ASC_SM','BETA_COST','BETA_DIST_FEMALE','BETA_DIST_MALE','BETA_DIST_UNREPORTED','BETA_TIME_FULLTIME','BETA_TIME_OTHER','NEST_NOCAR']
values = [[0.0100225,-0.0023271,0.00151986,0.00285251,0.00621963,0.00247439,0.0235929,0.0224142,-0.00807837],
          [-0.0023271,0.0469143,0.00431142,-0.0204402,-0.0223745,-0.00774278,-0.00847539,-0.00394251,0.0389318],
          [0.00151986,0.00431142,0.0191465,0.00673909,0.00559057,0.00676991,-0.000434418,-0.00579638,0.0155749],
          [0.00285251,-0.0204402,0.00673909,0.0371974,0.0156282,0.0146385,0.010273,0.00438825,0.0106748],
          [0.00621963,-0.0223745,0.00559057,0.0156282,0.0258642,0.0112879,0.0218765,0.0109824,-0.0062276],
          [0.00247439,-0.00774278,0.00676991,0.0146385,0.0112879,0.0385363,0.00725802,0.00507749,0.0131128],
          [0.0235929,-0.00847539,-0.000434418,0.010273,0.0218765,0.00725802,0.110753,0.0555677,-0.0178209],
          [0.0224142,-0.00394251,-0.00579638,0.00438825,0.0109824,0.00507749,0.0555677,0.0878987,-0.0248326],
          [-0.00807837,0.0389318,0.0155749,0.0106748,-0.0062276,0.0131128,-0.0178209,-0.0248326,0.0934272]]
vc = bioMatrix(9,names,values)
BIOGEME_OBJECT.VARCOVAR = vc



# The choice model is a nested logit
prob_pt = nested(V,av,nests,0)
prob_car = nested(V,av,nests,1)
prob_sm = nested(V,av,nests,2)

# Defines an iterator on the data
rowIterator('obsIter')

# Statistics
nullLoglikelihood(av,'obsIter')
choiceSet = [0,1,2]
cteLoglikelihood(choiceSet,Choice,'obsIter')
availabilityStatistics(av,'obsIter')

# Each weight is normalized so that the sum of weights is equal to the
# number of entries (1906).
# The normalization factor has been calculated during estimation
theWeight = Weight * 1906 / 0.814484


BIOGEME_OBJECT.STATISTICS['Gender: males'] = \
                    Sum(male,'obsIter')
BIOGEME_OBJECT.STATISTICS['Gender: females'] = \
                    Sum(female,'obsIter')
BIOGEME_OBJECT.STATISTICS['Gender: unreported'] = \
                    Sum(unreportedGender,'obsIter')
BIOGEME_OBJECT.STATISTICS['Occupation: full time'] = \
                    Sum(fulltime,'obsIter')
BIOGEME_OBJECT.STATISTICS['Sum of weights'] = \
                    Sum(Weight,'obsIter')
BIOGEME_OBJECT.STATISTICS['Number of entries'] = \
                    Sum(1-exclude,'obsIter')
BIOGEME_OBJECT.STATISTICS['Normalization for elasticities PT'] = \
                    Sum(theWeight * prob_pt ,'obsIter')
BIOGEME_OBJECT.STATISTICS['Normalization for elasticities CAR'] = \
                    Sum(theWeight * prob_car ,'obsIter')
BIOGEME_OBJECT.STATISTICS['Normalization for elasticities SM'] = \
                    Sum(theWeight * prob_sm ,'obsIter')

# Define the dictionary for the simulation.
simulate = {'Prob. car': prob_car,
            'Prob. public transportation': prob_pt,
            'Prob. slow modes':prob_sm,
            'Revenue public transportation':
                   prob_pt * MarginalCostPT}

BIOGEME_OBJECT.WEIGHT = theWeight
BIOGEME_OBJECT.SIMULATE = Enumerate(simulate,'obsIter')

A.3 03nestedElasticities.py

1## File 03nestedElasticities.py
2## Simple nested logit model for the Optima case study
3## Calculation of direct point elasticities
4## Wed May 10 12:20:59 2017
5
6from biogeme import *
7from headers import *
8from statistics import *
9from nested import *
10
11### Three alternatives:
12# CAR: automobile
13# PT: public transportation
14# SM: slow mode (walking, biking)
15
16### List of parameters and their estimated value.
17ASC_CAR = Beta(’ASC_CAR’,0.261291,-10000,10000,0,’ASC_CAR’ )
18ASC_SM = Beta(’ASC_SM’,0.0590204,-10000,10000,0,’ASC_SM’ )
19BETA_TIME_FULLTIME = \
20 Beta(’BETA_TIME_FULLTIME’,-1.59709,-10000,10000,0,’BETA_TIME_FULLTIME’ )
21BETA_TIME_OTHER = \
22 Beta(’BETA_TIME_OTHER’,-0.577362,-10000,10000,0,’BETA_TIME_OTHER’ )
23BETA_DIST_MALE = \
24 Beta(’BETA_DIST_MALE’,-0.686327,-10000,10000,0,’BETA_DIST_MALE’ )
25BETA_DIST_FEMALE = \
26 Beta(’BETA_DIST_FEMALE’,-0.83121,-10000,10000,0,’BETA_DIST_FEMALE’ )
27BETA_DIST_UNREPORTED = \
28 Beta(’BETA_DIST_UNREPORTED’,-0.702974,-10000,10000,0,’BETA_DIST_UNREPORTED’ )
29BETA_COST = \
30 Beta(’BETA_COST’,-0.716192,-10000,10000,0,’BETA_COST’ )
31
32###Definition of variables:
33# For numerical reasons, it is good practice to scale the data to
34# that the values of the parameters are around 1.0.
35
36### Warning: when calculation derivatives, the total formula must be
37### known to Biogeme. In this case, the use of
38### "DefineVariable" must be omitted, if the derivatives must be
39### calculated with respect to the original variables (as is often the
40### case)
41
42# TimePT_scaled  = DefineVariable(’TimePT_scaled’, TimePT   /  200 )
43TimePT_scaled  = TimePT   /  200
44
45#TimeCar_scaled  = DefineVariable(’TimeCar_scaled’, TimeCar   /  200 )
46TimeCar_scaled  =  TimeCar   /  200
47
48#MarginalCostPT_scaled  = DefineVariable(’MarginalCostPT_scaled’, MarginalCostPT   /  10 )
49MarginalCostPT_scaled  =  MarginalCostPT   /  10
50
51#CostCarCHF_scaled  = DefineVariable(’CostCarCHF_scaled’, CostCarCHF   /  10 )
52CostCarCHF_scaled  = CostCarCHF   /  10
53
54#distance_km_scaled  = DefineVariable(’distance_km_scaled’, distance_km   /  5 )
55distance_km_scaled  = distance_km   /  5
56
57male = DefineVariable(’male’,Gender == 1)
58female = DefineVariable(’female’,Gender == 2)
59unreportedGender  = DefineVariable(’unreportedGender’,Gender == -1)
60
61fulltime = DefineVariable(’fulltime’,OccupStat == 1)
62notfulltime = DefineVariable(’notfulltime’,OccupStat != 1)
63
64### Definition of utility functions:
65
66V_PT = BETA_TIME_FULLTIME * TimePT_scaled * fulltime + \
67       BETA_TIME_OTHER * TimePT_scaled * notfulltime + \
68       BETA_COST * MarginalCostPT_scaled
69V_CAR = ASC_CAR + \
70        BETA_TIME_FULLTIME * TimeCar_scaled * fulltime + \
71        BETA_TIME_OTHER * TimeCar_scaled * notfulltime + \
72        BETA_COST * CostCarCHF_scaled
73V_SM = ASC_SM + \
74       BETA_DIST_MALE * distance_km_scaled * male + \
75       BETA_DIST_FEMALE * distance_km_scaled * female + \
76       BETA_DIST_UNREPORTED * distance_km_scaled * unreportedGender
77
78# Associate utility functions with the numbering of alternatives
79V = {0: V_PT,
80     1: V_CAR,
81     2: V_SM}
82
83# Associate the availability conditions with the alternatives.
84# In this example all alternatives are available for each individual.
85av = {0: 1,
86      1: 1,
87      2: 1}
88
89### DEFINITION OF THE NESTS:
90# 1: nests parameter
91# 2: list of alternatives
92
93NEST_NOCAR = Beta(’NEST_NOCAR’,1.52853,1,10,0,’NEST_NOCAR’ )
94
95
96CAR = 1.0 , [ 1]
97NO_CAR = NEST_NOCAR , [ 0,  2]
98nests = CAR, NO_CAR
99
100# All observations verifying the following expression will not be
101# considered for estimation
102exclude = (Choice   ==  -1)
103BIOGEME_OBJECT.EXCLUDE =  exclude
104
105
106##
107## This has been copied-pasted from the file 01nestedEstimation_param.py
108##
109## Code for the sensitivity analysis generated after the estimation of the model
110names = [’ASC_CAR’,’ASC_SM’,’BETA_COST’,’BETA_DIST_FEMALE’,’BETA_DIST_MALE’,’BETA_DIST_UNREPORTED’,’BETA_TIME_FULLTIME’,’BETA_TIME_OTHER’,’NEST_NOCAR’]
111values = [[0.0100225,-0.0023271,0.00151986,0.00285251,0.00621963,0.00247439,0.0235929,0.0224142,-0.00807837],[-0.0023271,0.0469143,0.00431142,-0.0204402,-0.0223745,-0.00774278,-0.00847539,-0.00394251,0.0389318],[0.00151986,0.00431142,0.0191465,0.00673909,0.00559057,0.00676991,-0.000434418,-0.00579638,0.0155749],[0.00285251,-0.0204402,0.00673909,0.0371974,0.0156282,0.0146385,0.010273,0.00438825,0.0106748],[0.00621963,-0.0223745,0.00559057,0.0156282,0.0258642,0.0112879,0.0218765,0.0109824,-0.0062276],[0.00247439,-0.00774278,0.00676991,0.0146385,0.0112879,0.0385363,0.00725802,0.00507749,0.0131128],[0.0235929,-0.00847539,-0.000434418,0.010273,0.0218765,0.00725802,0.110753,0.0555677,-0.0178209],[0.0224142,-0.00394251,-0.00579638,0.00438825,0.0109824,0.00507749,0.0555677,0.0878987,-0.0248326],[-0.00807837,0.0389318,0.0155749,0.0106748,-0.0062276,0.0131128,-0.0178209,-0.0248326,0.0934272]]
112vc = bioMatrix(9,names,values)
113#BIOGEME_OBJECT.VARCOVAR = vc
114
115
116
117# The choice model is a nested logit
118prob_pt = nested(V,av,nests,0)
119prob_car = nested(V,av,nests,1)
120prob_sm = nested(V,av,nests,2)
121
122elas_pt_time = Derive(prob_pt,’TimePT’) * TimePT / prob_pt
123elas_pt_cost = Derive(prob_pt,’MarginalCostPT’) * MarginalCostPT / prob_pt
124elas_car_time = Derive(prob_car,’TimeCar’) * TimeCar / prob_car
125elas_car_cost = Derive(prob_car,’CostCarCHF’) * CostCarCHF / prob_car
126elas_sm_dist = Derive(prob_sm,’distance_km’) * distance_km / prob_sm
127
128# Defines an itertor on the data
129rowIterator(’obsIter’)
130#Statistics
131nullLoglikelihood(av,’obsIter’)
132choiceSet = [0,1,2]
133cteLoglikelihood(choiceSet,Choice,’obsIter’)
134availabilityStatistics(av,’obsIter’)
135
136# Each weight is normalized so that the sum of weights is equal to the
137# numer of entries (1906)
138# The normalization factor has been calculated during estimation
139
140theWeight = Weight * 1906 / 0.814484
141normalization_pt  = 535.086
142normalization_car = 1244.77
143normalization_sm = 126.147
144
145BIOGEME_OBJECT.STATISTICS[’Gender: males’] = \
146                    Sum(male,’obsIter’)
147BIOGEME_OBJECT.STATISTICS[’Gender: females’] = \
148                    Sum(female,’obsIter’)
149BIOGEME_OBJECT.STATISTICS[’Gender: unreported’] = \
150                    Sum(unreportedGender,’obsIter’)
151BIOGEME_OBJECT.STATISTICS[’Occupation: full time’] = \
152                    Sum(fulltime,’obsIter’)
153BIOGEME_OBJECT.STATISTICS[’Sum of weights’] = \
154                    Sum(Weight,’obsIter’)
155BIOGEME_OBJECT.STATISTICS[’Number of entries’] = \
156                    Sum(1-exclude,’obsIter’)
157BIOGEME_OBJECT.STATISTICS[’Normalization for elasticities PT’] = \
158                    Sum(theWeight * prob_pt ,’obsIter’)
159BIOGEME_OBJECT.STATISTICS[’Normalization for elasticities CAR’] = \
160                    Sum(theWeight * prob_car ,’obsIter’)
161BIOGEME_OBJECT.STATISTICS[’Normalization for elasticities SM’] = \
162                    Sum(theWeight * prob_sm ,’obsIter’)
163BIOGEME_OBJECT.STATISTICS[’Occupation: full time’] = Sum(fulltime,’obsIter’)
164
165# Define the dictionary for the simulation.
166simulate = {’Disag. Elast. PT - Time’: elas_pt_time,
167            ’Disag. Elast. PT - Cost’: elas_pt_cost,
168            ’Disag. Elast. Car - Time’: elas_car_time,
169            ’Disag. Elast. Car - Cost’: elas_car_cost,
170            ’Disag. Elast. Slow modes - Distance’: elas_sm_dist,
171            ’Agg. Elast. PT - Time’: \
172               elas_pt_time * prob_pt / normalization_pt,
173            ’Agg. Elast. PT - Cost’: \
174               elas_pt_cost * prob_pt / normalization_pt,
175            ’Agg. Elast. Car - Time’: \
176               elas_car_time * prob_car / normalization_car,
177            ’Agg. Elast. Car - Cost’: \
178               elas_car_cost * prob_car / normalization_car,
179            ’Agg. Elast. Slow modes - Distance’: \
180               elas_sm_dist * prob_sm / normalization_sm
181}
182
183BIOGEME_OBJECT.WEIGHT = theWeight
184BIOGEME_OBJECT.SIMULATE = Enumerate(simulate,’obsIter’)
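
The aggregate elasticities in the listing above are obtained by multiplying each disaggregate elasticity by the choice probability, dividing by a normalization constant, and letting the weights do the summation. The following plain-Python sketch illustrates that arithmetic outside of PythonBiogeme. The function name and the toy numbers are assumptions made for illustration; they are not part of the case study.

```python
# Sketch only: plain Python, independent of PythonBiogeme.
def aggregate_elasticity(weights, probs, elasticities):
    """Weighted aggregate elasticity from per-observation quantities."""
    # Normalization: weighted sum of the choice probabilities
    # (the role played by normalization_pt, normalization_car, ...).
    normalization = sum(w * p for w, p in zip(weights, probs))
    # Weighted sum of the disaggregate elasticities, each multiplied
    # by the choice probability and divided by the normalization.
    return sum(w * e * p / normalization
               for w, p, e in zip(weights, probs, elasticities))

# Hypothetical toy sample (not taken from the Optima data).
weights = [1.0, 2.0]
probs = [0.5, 0.25]
elasticities = [-1.0, -2.0]
aggregate = aggregate_elasticity(weights, probs, elasticities)
```

In the listing, the normalization constants are hard-coded because they come from a previous simulation run; the sketch computes them in the same pass only to stay self-contained.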

A.4 04nestedElasticities.py

1## File 04nestedElasticities.py
2## Simple nested logit model for the Optima case study
3## Calculation of cross point elasticities
4## Thu May 11 16:38:05 2017
5
6from biogeme import *
7from headers import *
8from statistics import *
9from nested import *
10
11### Three alternatives:
12# CAR: automobile
13# PT: public transportation
14# SM: slow mode (walking, biking)
15
16### List of parameters and their estimated value.
17ASC_CAR = Beta(’ASC_CAR’,0.261291,-10000,10000,0,’ASC_CAR’ )
18ASC_SM = Beta(’ASC_SM’,0.0590204,-10000,10000,0,’ASC_SM’ )
19BETA_TIME_FULLTIME = \
20 Beta(’BETA_TIME_FULLTIME’,-1.59709,-10000,10000,0,’BETA_TIME_FULLTIME’ )
21BETA_TIME_OTHER = \
22 Beta(’BETA_TIME_OTHER’,-0.577362,-10000,10000,0,’BETA_TIME_OTHER’ )
23BETA_DIST_MALE = \
24 Beta(’BETA_DIST_MALE’,-0.686327,-10000,10000,0,’BETA_DIST_MALE’ )
25BETA_DIST_FEMALE = \
26 Beta(’BETA_DIST_FEMALE’,-0.83121,-10000,10000,0,’BETA_DIST_FEMALE’ )
27BETA_DIST_UNREPORTED = \
28 Beta(’BETA_DIST_UNREPORTED’,-0.702974,-10000,10000,0,’BETA_DIST_UNREPORTED’ )
29BETA_COST = \
30 Beta(’BETA_COST’,-0.716192,-10000,10000,0,’BETA_COST’ )
31
32
33###Definition of variables:
34# For numerical reasons, it is good practice to scale the data so
35# that the values of the parameters are around 1.0.
36
37### Warning: when calculating derivatives, the total formula must be
38### known to Biogeme. In this case, the use of
39### "DefineVariable" must be omitted, if the derivatives must be
40### calculated with respect to the original variables (as is often the
41### case)
42
43# TimePT_scaled  = DefineVariable(’TimePT_scaled’, TimePT   /  200 )
44TimePT_scaled  = TimePT   /  200
45
46#TimeCar_scaled  = DefineVariable(’TimeCar_scaled’, TimeCar   /  200 )
47TimeCar_scaled  =  TimeCar   /  200
48
49#MarginalCostPT_scaled  = DefineVariable(’MarginalCostPT_scaled’, MarginalCostPT   /  10 )
50MarginalCostPT_scaled  =  MarginalCostPT   /  10
51
52#CostCarCHF_scaled  = DefineVariable(’CostCarCHF_scaled’, CostCarCHF   /  10 )
53CostCarCHF_scaled  = CostCarCHF   /  10
54
55#distance_km_scaled  = DefineVariable(’distance_km_scaled’, distance_km   /  5 )
56distance_km_scaled  = distance_km   /  5
57
58male = DefineVariable(’male’,Gender == 1)
59female = DefineVariable(’female’,Gender == 2)
60unreportedGender  = DefineVariable(’unreportedGender’,Gender == -1)
61
62fulltime = DefineVariable(’fulltime’,OccupStat == 1)
63notfulltime = DefineVariable(’notfulltime’,OccupStat != 1)
64
65### Definition of utility functions:
66
67V_PT = BETA_TIME_FULLTIME * TimePT_scaled * fulltime + \
68       BETA_TIME_OTHER * TimePT_scaled * notfulltime + \
69       BETA_COST * MarginalCostPT_scaled
70V_CAR = ASC_CAR + \
71        BETA_TIME_FULLTIME * TimeCar_scaled * fulltime + \
72        BETA_TIME_OTHER * TimeCar_scaled * notfulltime + \
73        BETA_COST * CostCarCHF_scaled
74V_SM = ASC_SM + \
75       BETA_DIST_MALE * distance_km_scaled * male + \
76       BETA_DIST_FEMALE * distance_km_scaled * female + \
77       BETA_DIST_UNREPORTED * distance_km_scaled * unreportedGender
78
79# Associate utility functions with the numbering of alternatives
80V = {0: V_PT,
81     1: V_CAR,
82     2: V_SM}
83
84# Associate the availability conditions with the alternatives.
85# In this example all alternatives are available for each individual.
86av = {0: 1,
87      1: 1,
88      2: 1}
89
90### DEFINITION OF THE NESTS:
91# 1: nests parameter
92# 2: list of alternatives
93
94NEST_NOCAR = Beta(’NEST_NOCAR’,1.52853,1,10,0,’NEST_NOCAR’ )
95
96
97CAR = 1.0 , [ 1]
98NO_CAR = NEST_NOCAR , [ 0,  2]
99nests = CAR, NO_CAR
100
101# All observations satisfying the following expression will not be
102# considered for estimation
103exclude = (Choice   ==  -1)
104BIOGEME_OBJECT.EXCLUDE =  exclude
105
106
107##
108## This has been copied-pasted from the file 01nestedEstimation_param.py
109##
110## Code for the sensitivity analysis generated after the estimation of the model
111names = [’ASC_CAR’,’ASC_SM’,’BETA_COST’,’BETA_DIST_FEMALE’,’BETA_DIST_MALE’,’BETA_DIST_UNREPORTED’,’BETA_TIME_FULLTIME’,’BETA_TIME_OTHER’,’NEST_NOCAR’]
112values = [[0.0100225,-0.0023271,0.00151986,0.00285251,0.00621963,0.00247439,0.0235929,0.0224142,-0.00807837],[-0.0023271,0.0469143,0.00431142,-0.0204402,-0.0223745,-0.00774278,-0.00847539,-0.00394251,0.0389318],[0.00151986,0.00431142,0.0191465,0.00673909,0.00559057,0.00676991,-0.000434418,-0.00579638,0.0155749],[0.00285251,-0.0204402,0.00673909,0.0371974,0.0156282,0.0146385,0.010273,0.00438825,0.0106748],[0.00621963,-0.0223745,0.00559057,0.0156282,0.0258642,0.0112879,0.0218765,0.0109824,-0.0062276],[0.00247439,-0.00774278,0.00676991,0.0146385,0.0112879,0.0385363,0.00725802,0.00507749,0.0131128],[0.0235929,-0.00847539,-0.000434418,0.010273,0.0218765,0.00725802,0.110753,0.0555677,-0.0178209],[0.0224142,-0.00394251,-0.00579638,0.00438825,0.0109824,0.00507749,0.0555677,0.0878987,-0.0248326],[-0.00807837,0.0389318,0.0155749,0.0106748,-0.0062276,0.0131128,-0.0178209,-0.0248326,0.0934272]]
113vc = bioMatrix(9,names,values)
114#BIOGEME_OBJECT.VARCOVAR = vc
115
116
117
118# The choice model is a nested logit
119prob_pt = nested(V,av,nests,0)
120prob_car = nested(V,av,nests,1)
121prob_sm = nested(V,av,nests,2)
122
123elas_car_cost = Derive(prob_car,’MarginalCostPT’) * MarginalCostPT / prob_car
124elas_car_time = Derive(prob_car,’TimePT’) * TimePT / prob_car
125elas_pt_cost = Derive(prob_pt,’CostCarCHF’) * CostCarCHF / prob_pt
126elas_pt_time = Derive(prob_pt,’TimeCar’) * TimeCar / prob_pt
127
128# Defines an iterator on the data
129rowIterator(’obsIter’)
130#Statistics
131nullLoglikelihood(av,’obsIter’)
132choiceSet = [0,1,2]
133cteLoglikelihood(choiceSet,Choice,’obsIter’)
134availabilityStatistics(av,’obsIter’)
135
136theWeight = Weight * 1906 / 0.814484
137normalization_pt  = 535.086
138normalization_car = 1244.77
139normalization_sm = 126.147
140
141BIOGEME_OBJECT.STATISTICS[’Gender: males’] = \
142                    Sum(male,’obsIter’)
143BIOGEME_OBJECT.STATISTICS[’Gender: females’] = \
144                    Sum(female,’obsIter’)
145BIOGEME_OBJECT.STATISTICS[’Gender: unreported’] = \
146                    Sum(unreportedGender,’obsIter’)
147BIOGEME_OBJECT.STATISTICS[’Occupation: full time’] = \
148                    Sum(fulltime,’obsIter’)
149BIOGEME_OBJECT.STATISTICS[’Sum of weights’] = \
150                    Sum(Weight,’obsIter’)
151BIOGEME_OBJECT.STATISTICS[’Number of entries’] = \
152                    Sum(1-exclude,’obsIter’)
153BIOGEME_OBJECT.STATISTICS[’Normalization for elasticities PT’] = \
154                    Sum(theWeight * prob_pt ,’obsIter’)
155BIOGEME_OBJECT.STATISTICS[’Normalization for elasticities CAR’] = \
156                    Sum(theWeight * prob_car ,’obsIter’)
157BIOGEME_OBJECT.STATISTICS[’Normalization for elasticities SM’] = \
158                    Sum(theWeight * prob_sm ,’obsIter’)
159BIOGEME_OBJECT.STATISTICS[’Occupation: full time’] = Sum(fulltime,’obsIter’)
160
161# Define the dictionary for the simulation.
162simulate = {’Disag. Elast. PT - Time car’: elas_pt_time,
163            ’Disag. Elast. PT - Cost car’: elas_pt_cost,
164            ’Disag. Elast. Car - Time PT’: elas_car_time,
165            ’Disag. Elast. Car - Cost PT’: elas_car_cost,
166            ’Agg. Elast. Car - Cost PT’: \
167                elas_car_cost * prob_car / normalization_car,
168            ’Agg. Elast. Car - Time PT’: \
169                elas_car_time * prob_car / normalization_car,
170            ’Agg. Elast. PT - Cost car’: \
171                elas_pt_cost * prob_pt / normalization_pt,
172            ’Agg. Elast. PT - Time car’: \
173                elas_pt_time * prob_pt / normalization_pt}
174
175# Each weight is normalized so that the sum of weights is equal to the number of entries (1906)
176BIOGEME_OBJECT.WEIGHT = theWeight
177BIOGEME_OBJECT.SIMULATE = Enumerate(simulate,’obsIter’)
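
The cross point elasticities above rely on the Derive operator applied to the nested logit probability. As a rough cross-check outside of PythonBiogeme, the sketch below evaluates a nested logit probability in plain Python and approximates the cross elasticity of the car probability with respect to the public transportation cost by central finite differences. The parameter values are those of the listing; the traveler's attributes, the function names, and the restriction to a full-time male traveler are assumptions made for illustration.

```python
import math

# Estimated parameters, copied from the listing above.
ASC_CAR, ASC_SM = 0.261291, 0.0590204
BETA_TIME_FULLTIME, BETA_COST = -1.59709, -0.716192
BETA_DIST_MALE, NEST_NOCAR = -0.686327, 1.52853

def nested_probs(V, nests):
    """Nested logit probabilities; nests is a list of (mu, [alternatives])."""
    # Sum of exp(mu * V) within each nest.
    sums = [sum(math.exp(mu * V[j]) for j in alts) for mu, alts in nests]
    denom = sum(s ** (1.0 / mu) for (mu, _), s in zip(nests, sums))
    probs = {}
    for (mu, alts), s in zip(nests, sums):
        for i in alts:
            # P(i | nest) * P(nest)
            probs[i] = (math.exp(mu * V[i]) / s) * (s ** (1.0 / mu)) / denom
    return probs

def utilities(x):
    """Utilities of a full-time male traveler (hypothetical attributes x)."""
    return {0: BETA_TIME_FULLTIME * x['TimePT'] / 200
               + BETA_COST * x['MarginalCostPT'] / 10,
            1: ASC_CAR + BETA_TIME_FULLTIME * x['TimeCar'] / 200
               + BETA_COST * x['CostCarCHF'] / 10,
            2: ASC_SM + BETA_DIST_MALE * x['distance_km'] / 5}

def cross_elasticity_car_cost_pt(x, nests, h=1e-5):
    """Elasticity of P(car) w.r.t. MarginalCostPT, by central differences."""
    up = dict(x, MarginalCostPT=x['MarginalCostPT'] + h)
    down = dict(x, MarginalCostPT=x['MarginalCostPT'] - h)
    dp = (nested_probs(utilities(up), nests)[1]
          - nested_probs(utilities(down), nests)[1]) / (2 * h)
    return dp * x['MarginalCostPT'] / nested_probs(utilities(x), nests)[1]

# Hypothetical traveler, not an observation from the data set.
traveler = {'TimePT': 60.0, 'MarginalCostPT': 5.0, 'TimeCar': 40.0,
            'CostCarCHF': 8.0, 'distance_km': 10.0}
nests = [(1.0, [1]), (NEST_NOCAR, [0, 2])]
elas_car_cost = cross_elasticity_car_cost_pt(traveler, nests)
```

Finite differences are used here only because the sketch has no symbolic derivative; in the listing, Derive provides the derivative directly.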

A.5 05nestedElasticities.py

1## File 05nestedElasticities.py
2## Simple nested logit model for the Optima case study
3## Calculation of direct arc elasticities
4## Thu May 11 16:38:05 2017
5
6from biogeme import *
7from headers import *
8from statistics import *
9from nested import *
10
11### Three alternatives:
12# CAR: automobile
13# PT: public transportation
14# SM: slow mode (walking, biking)
15
16### List of parameters and their estimated value.
17ASC_CAR = Beta(’ASC_CAR’,0.261291,-10000,10000,0,’ASC_CAR’ )
18ASC_SM = Beta(’ASC_SM’,0.0590204,-10000,10000,0,’ASC_SM’ )
19BETA_TIME_FULLTIME = \
20 Beta(’BETA_TIME_FULLTIME’,-1.59709,-10000,10000,0,’BETA_TIME_FULLTIME’ )
21BETA_TIME_OTHER = \
22 Beta(’BETA_TIME_OTHER’,-0.577362,-10000,10000,0,’BETA_TIME_OTHER’ )
23BETA_DIST_MALE = \
24 Beta(’BETA_DIST_MALE’,-0.686327,-10000,10000,0,’BETA_DIST_MALE’ )
25BETA_DIST_FEMALE = \
26 Beta(’BETA_DIST_FEMALE’,-0.83121,-10000,10000,0,’BETA_DIST_FEMALE’ )
27BETA_DIST_UNREPORTED = \
28 Beta(’BETA_DIST_UNREPORTED’,-0.702974,-10000,10000,0,’BETA_DIST_UNREPORTED’ )
29BETA_COST = \
30 Beta(’BETA_COST’,-0.716192,-10000,10000,0,’BETA_COST’ )
31
32###Definition of variables:
33# For numerical reasons, it is good practice to scale the data so
34# that the values of the parameters are around 1.0.
35
36### Warning: when calculating derivatives, the total formula must be
37### known to Biogeme. In this case, the use of
38### "DefineVariable" must be omitted, if the derivatives must be
39### calculated with respect to the original variables (as is often the
40### case)
41
42delta_dist = 1
43
44# TimePT_scaled  = DefineVariable(’TimePT_scaled’, TimePT   /  200 )
45TimePT_scaled  = TimePT   /  200
46
47#TimeCar_scaled  = DefineVariable(’TimeCar_scaled’, TimeCar   /  200 )
48TimeCar_scaled  =  TimeCar   /  200
49
50#MarginalCostPT_scaled  = DefineVariable(’MarginalCostPT_scaled’, MarginalCostPT   /  10 )
51MarginalCostPT_scaled  =  MarginalCostPT   /  10
52
53#CostCarCHF_scaled  = DefineVariable(’CostCarCHF_scaled’, CostCarCHF   /  10 )
54CostCarCHF_scaled  = CostCarCHF   /  10
55
56#distance_km_scaled  = DefineVariable(’distance_km_scaled’, distance_km   /  5 )
57distance_km_scaled  = distance_km   /  5
58distance_km_scaled_after  = (distance_km + delta_dist)   /  5
59
60male = DefineVariable(’male’,Gender == 1)
61female = DefineVariable(’female’,Gender == 2)
62unreportedGender  = DefineVariable(’unreportedGender’,Gender == -1)
63
64fulltime = DefineVariable(’fulltime’,OccupStat == 1)
65notfulltime = DefineVariable(’notfulltime’,OccupStat != 1)
66
67### Definition of utility functions:
68
69V_PT = BETA_TIME_FULLTIME * TimePT_scaled * fulltime + \
70       BETA_TIME_OTHER * TimePT_scaled * notfulltime + \
71       BETA_COST * MarginalCostPT_scaled
72V_CAR = ASC_CAR + \
73        BETA_TIME_FULLTIME * TimeCar_scaled * fulltime + \
74        BETA_TIME_OTHER * TimeCar_scaled * notfulltime + \
75        BETA_COST * CostCarCHF_scaled
76V_SM = ASC_SM + \
77       BETA_DIST_MALE * distance_km_scaled * male + \
78       BETA_DIST_FEMALE * distance_km_scaled * female + \
79       BETA_DIST_UNREPORTED * distance_km_scaled * unreportedGender
80
81V_SM_after = ASC_SM + \
82       BETA_DIST_MALE * distance_km_scaled_after * male + \
83       BETA_DIST_FEMALE * distance_km_scaled_after * female + \
84       BETA_DIST_UNREPORTED * distance_km_scaled_after * unreportedGender
85
86
87# Associate utility functions with the numbering of alternatives
88V = {0: V_PT,
89     1: V_CAR,
90     2: V_SM}
91
92V_after = {0: V_PT,
93           1: V_CAR,
94           2: V_SM_after}
95
96# Associate the availability conditions with the alternatives.
97# In this example all alternatives are available for each individual.
98av = {0: one,
99      1: one,
100      2: one}
101
102### DEFINITION OF THE NESTS:
103# 1: nests parameter
104# 2: list of alternatives
105
106NEST_NOCAR = Beta(’NEST_NOCAR’,1.52853,1,10,0,’NEST_NOCAR’ )
107
108
109CAR = 1.0 , [ 1]
110NO_CAR = NEST_NOCAR , [ 0,  2]
111nests = CAR, NO_CAR
112
113# All observations satisfying the following expression will not be
114# considered for estimation
115exclude = (Choice   ==  -1)
116BIOGEME_OBJECT.EXCLUDE =  exclude
117
118
119##
120## This has been copied-pasted from the file 01nestedEstimation_param.py
121##
122## Code for the sensitivity analysis generated after the estimation of the model
123names = [’ASC_CAR’,’ASC_SM’,’BETA_COST’,’BETA_DIST_FEMALE’,’BETA_DIST_MALE’,’BETA_DIST_UNREPORTED’,’BETA_TIME_FULLTIME’,’BETA_TIME_OTHER’,’NEST_NOCAR’]
124values = [[0.0100225,-0.0023271,0.00151986,0.00285251,0.00621963,0.00247439,0.0235929,0.0224142,-0.00807837],[-0.0023271,0.0469143,0.00431142,-0.0204402,-0.0223745,-0.00774278,-0.00847539,-0.00394251,0.0389318],[0.00151986,0.00431142,0.0191465,0.00673909,0.00559057,0.00676991,-0.000434418,-0.00579638,0.0155749],[0.00285251,-0.0204402,0.00673909,0.0371974,0.0156282,0.0146385,0.010273,0.00438825,0.0106748],[0.00621963,-0.0223745,0.00559057,0.0156282,0.0258642,0.0112879,0.0218765,0.0109824,-0.0062276],[0.00247439,-0.00774278,0.00676991,0.0146385,0.0112879,0.0385363,0.00725802,0.00507749,0.0131128],[0.0235929,-0.00847539,-0.000434418,0.010273,0.0218765,0.00725802,0.110753,0.0555677,-0.0178209],[0.0224142,-0.00394251,-0.00579638,0.00438825,0.0109824,0.00507749,0.0555677,0.0878987,-0.0248326],[-0.00807837,0.0389318,0.0155749,0.0106748,-0.0062276,0.0131128,-0.0178209,-0.0248326,0.0934272]]
125vc = bioMatrix(9,names,values)
126BIOGEME_OBJECT.VARCOVAR = vc
127
128# The choice model is a nested logit
129prob_pt = nested(V,av,nests,0)
130prob_car = nested(V,av,nests,1)
131prob_sm = nested(V,av,nests,2)
132
133prob_pt_after = nested(V_after,av,nests,0)
134prob_car_after = nested(V_after,av,nests,1)
135prob_sm_after = nested(V_after,av,nests,2)
136
137elas_sm_dist = (prob_sm_after - prob_sm) * distance_km / (prob_sm * delta_dist)
138
139# Defines an iterator on the data
140rowIterator(’obsIter’)
141#Statistics
142nullLoglikelihood(av,’obsIter’)
143choiceSet = [0,1,2]
144cteLoglikelihood(choiceSet,Choice,’obsIter’)
145availabilityStatistics(av,’obsIter’)
146
147theWeight = Weight * 1906 / 0.814484
148normalization_pt  = 535.086
149normalization_car = 1244.77
150normalization_sm = 126.147
151
152BIOGEME_OBJECT.STATISTICS[’Gender: males’] = \
153                    Sum(male,’obsIter’)
154BIOGEME_OBJECT.STATISTICS[’Gender: females’] = \
155                    Sum(female,’obsIter’)
156BIOGEME_OBJECT.STATISTICS[’Gender: unreported’] = \
157                    Sum(unreportedGender,’obsIter’)
158BIOGEME_OBJECT.STATISTICS[’Occupation: full time’] = \
159                    Sum(fulltime,’obsIter’)
160BIOGEME_OBJECT.STATISTICS[’Sum of weights’] = \
161                    Sum(Weight,’obsIter’)
162BIOGEME_OBJECT.STATISTICS[’Number of entries’] = \
163                    Sum(1-exclude,’obsIter’)
164BIOGEME_OBJECT.STATISTICS[’Normalization for elasticities PT’] = \
165                    Sum(theWeight * prob_pt ,’obsIter’)
166BIOGEME_OBJECT.STATISTICS[’Normalization for elasticities CAR’] = \
167                    Sum(theWeight * prob_car ,’obsIter’)
168BIOGEME_OBJECT.STATISTICS[’Normalization for elasticities SM’] = \
169                    Sum(theWeight * prob_sm ,’obsIter’)
170BIOGEME_OBJECT.STATISTICS[’Occupation: full time’] = Sum(fulltime,’obsIter’)
171
172
173# Define the dictionary for the simulation.
174simulate = {’Disag. Elast. SM - Distance’: elas_sm_dist,
175            ’Agg. Elast. SM - Distance’: elas_sm_dist * prob_sm / normalization_sm}
176
177# Each weight is normalized so that the sum of weights is equal to the number of entries (1906)
178BIOGEME_OBJECT.WEIGHT = theWeight
179BIOGEME_OBJECT.SIMULATE = Enumerate(simulate,’obsIter’)
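
The arc elasticity in the listing above compares the choice probability before and after a change of delta_dist in the distance. The sketch below reproduces that formula in plain Python. To remain self-contained, it replaces the nested logit by a binary logit between slow modes and a reference alternative with utility zero; this simplification, the function names, and the 10 km distance are assumptions made for illustration.

```python
import math

# Estimated parameters, copied from the listing above.
ASC_SM = 0.0590204
BETA_DIST_MALE = -0.686327

def prob_sm(distance_km, v_other=0.0):
    """Binary logit probability of slow modes for a male traveler.

    v_other is the utility of a hypothetical reference alternative.
    """
    v_sm = ASC_SM + BETA_DIST_MALE * distance_km / 5
    return 1.0 / (1.0 + math.exp(v_other - v_sm))

def arc_elasticity(prob, x, delta):
    """(P(x + delta) - P(x)) * x / (P(x) * delta), as in elas_sm_dist."""
    before = prob(x)
    after = prob(x + delta)
    return (after - before) * x / (before * delta)

# Effect of one additional kilometer on a hypothetical 10 km trip.
elas_sm_dist = arc_elasticity(prob_sm, 10.0, 1.0)
```

As delta shrinks, the arc elasticity approaches the point elasticity, which for this binary logit is (BETA_DIST_MALE / 5) * distance_km * (1 - P).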

A.6 06nestedWTP.py

1## File 06nestedWTP.py
2## Simple nested logit model for the Optima case study
3## Thu May 11 17:23:04 2017
4
5from biogeme import *
6from headers import *
7from statistics import *
8from nested import *
9
10### Three alternatives:
11# CAR: automobile
12# PT: public transportation
13# SM: slow mode (walking, biking)
14
15### List of parameters and their estimated value.
16ASC_CAR = Beta(’ASC_CAR’,0.261291,-10000,10000,0,’ASC_CAR’ )
17ASC_SM = Beta(’ASC_SM’,0.0590204,-10000,10000,0,’ASC_SM’ )
18BETA_TIME_FULLTIME = \
19 Beta(’BETA_TIME_FULLTIME’,-1.59709,-10000,10000,0,’BETA_TIME_FULLTIME’ )
20BETA_TIME_OTHER = \
21 Beta(’BETA_TIME_OTHER’,-0.577362,-10000,10000,0,’BETA_TIME_OTHER’ )
22BETA_DIST_MALE = \
23 Beta(’BETA_DIST_MALE’,-0.686327,-10000,10000,0,’BETA_DIST_MALE’ )
24BETA_DIST_FEMALE = \
25 Beta(’BETA_DIST_FEMALE’,-0.83121,-10000,10000,0,’BETA_DIST_FEMALE’ )
26BETA_DIST_UNREPORTED = \
27 Beta(’BETA_DIST_UNREPORTED’,-0.702974,-10000,10000,0,’BETA_DIST_UNREPORTED’ )
28BETA_COST = \
29 Beta(’BETA_COST’,-0.716192,-10000,10000,0,’BETA_COST’ )
30
31###Definition of variables:
32# For numerical reasons, it is good practice to scale the data so
33# that the values of the parameters are around 1.0.
34
35### Warning: when calculating derivatives, the total formula must be
36### known to Biogeme. In this case, the use of
37### "DefineVariable" must be omitted, if the derivatives must be
38### calculated with respect to the original variables (as is often the
39### case)
40
41# TimePT_scaled  = DefineVariable(’TimePT_scaled’, TimePT   /  200 )
42TimePT_scaled  = TimePT   /  200
43
44#TimeCar_scaled  = DefineVariable(’TimeCar_scaled’, TimeCar   /  200 )
45TimeCar_scaled  =  TimeCar   /  200
46
47#MarginalCostPT_scaled  = DefineVariable(’MarginalCostPT_scaled’, MarginalCostPT   /  10 )
48MarginalCostPT_scaled  =  MarginalCostPT   /  10
49
50#CostCarCHF_scaled  = DefineVariable(’CostCarCHF_scaled’, CostCarCHF   /  10 )
51CostCarCHF_scaled  = CostCarCHF   /  10
52
53#distance_km_scaled  = DefineVariable(’distance_km_scaled’, distance_km   /  5 )
54distance_km_scaled  = distance_km   /  5
55
56
57male = DefineVariable(’male’,Gender == 1)
58female = DefineVariable(’female’,Gender == 2)
59unreportedGender  = DefineVariable(’unreportedGender’,Gender == -1)
60
61fulltime = DefineVariable(’fulltime’,OccupStat == 1)
62notfulltime = DefineVariable(’notfulltime’,OccupStat != 1)
63
64### Definition of utility functions:
65V_PT = BETA_TIME_FULLTIME * TimePT_scaled * fulltime + \
66       BETA_TIME_OTHER * TimePT_scaled * notfulltime + \
67       BETA_COST * MarginalCostPT_scaled
68V_CAR = ASC_CAR + \
69        BETA_TIME_FULLTIME * TimeCar_scaled * fulltime + \
70        BETA_TIME_OTHER * TimeCar_scaled * notfulltime + \
71        BETA_COST * CostCarCHF_scaled
72V_SM = ASC_SM + \
73       BETA_DIST_MALE * distance_km_scaled * male + \
74       BETA_DIST_FEMALE * distance_km_scaled * female + \
75       BETA_DIST_UNREPORTED * distance_km_scaled * unreportedGender
76
77# It is advisable to use the Derive operator, so that the scaling
78# of the variables is taken into account automatically.
79
80WTP_PT_TIME = Derive(V_PT,’TimePT’) / Derive(V_PT,’MarginalCostPT’)
81WTP_CAR_TIME = Derive(V_CAR,’TimeCar’) / Derive(V_CAR,’CostCarCHF’)
82
83# All observations satisfying the following expression will not be
84# considered for estimation
85exclude = (Choice   ==  -1)
86BIOGEME_OBJECT.EXCLUDE =  exclude
87
88
89##
90## This has been copied-pasted from the file 01nestedEstimation_param.py
91##
92## Code for the sensitivity analysis generated after the estimation of the model
93names = [’ASC_CAR’,’ASC_SM’,’BETA_COST’,’BETA_DIST_FEMALE’,’BETA_DIST_MALE’,’BETA_DIST_UNREPORTED’,’BETA_TIME_FULLTIME’,’BETA_TIME_OTHER’,’NEST_NOCAR’]
94values = [[0.0100225,-0.0023271,0.00151986,0.00285251,0.00621963,0.00247439,0.0235929,0.0224142,-0.00807837],[-0.0023271,0.0469143,0.00431142,-0.0204402,-0.0223745,-0.00774278,-0.00847539,-0.00394251,0.0389318],[0.00151986,0.00431142,0.0191465,0.00673909,0.00559057,0.00676991,-0.000434418,-0.00579638,0.0155749],[0.00285251,-0.0204402,0.00673909,0.0371974,0.0156282,0.0146385,0.010273,0.00438825,0.0106748],[0.00621963,-0.0223745,0.00559057,0.0156282,0.0258642,0.0112879,0.0218765,0.0109824,-0.0062276],[0.00247439,-0.00774278,0.00676991,0.0146385,0.0112879,0.0385363,0.00725802,0.00507749,0.0131128],[0.0235929,-0.00847539,-0.000434418,0.010273,0.0218765,0.00725802,0.110753,0.0555677,-0.0178209],[0.0224142,-0.00394251,-0.00579638,0.00438825,0.0109824,0.00507749,0.0555677,0.0878987,-0.0248326],[-0.00807837,0.0389318,0.0155749,0.0106748,-0.0062276,0.0131128,-0.0178209,-0.0248326,0.0934272]]
95vc = bioMatrix(9,names,values)
96BIOGEME_OBJECT.VARCOVAR = vc
97
98
99# Defines an iterator on the data
100rowIterator(’obsIter’)
101
102theWeight = Weight * 1906 / 0.814484
103
104
105BIOGEME_OBJECT.STATISTICS[’Gender: males’] = \
106                    Sum(male,’obsIter’)
107BIOGEME_OBJECT.STATISTICS[’Gender: females’] = \
108                    Sum(female,’obsIter’)
109BIOGEME_OBJECT.STATISTICS[’Gender: unreported’] = \
110                    Sum(unreportedGender,’obsIter’)
111BIOGEME_OBJECT.STATISTICS[’Occupation: full time’] = \
112                    Sum(fulltime,’obsIter’)
113BIOGEME_OBJECT.STATISTICS[’Sum of weights’] = \
114                    Sum(Weight,’obsIter’)
115BIOGEME_OBJECT.STATISTICS[’Number of entries’] = \
116                    Sum(1-exclude,’obsIter’)
117
118simulate = {’PT: Time’:TimePT,
119            ’PT: Value of time (CHF/min)’: WTP_PT_TIME,
120            ’PT: Value of time (CHF/h)’: 60 * WTP_PT_TIME,
121            ’Car: Time’:TimeCar,
122            ’Car: Value of time (CHF/min)’: WTP_CAR_TIME,
123            ’Car: Value of time (CHF/h)’: 60 * WTP_CAR_TIME,
124            ’Male’:male,
125            ’Full time’:fulltime}
126
127# Each weight is normalized so that the sum of weights is equal to the
128# number of entries (1906).
129BIOGEME_OBJECT.WEIGHT = theWeight
130BIOGEME_OBJECT.SIMULATE = Enumerate(simulate,’obsIter’)
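
The willingness to pay for travel time savings is the ratio of the derivative of the utility with respect to time to its derivative with respect to cost. The sketch below mimics this for the public transportation utility in plain Python, using central finite differences in place of the Derive operator. The parameter values are those of the listing; the function names and the attribute values (60 minutes, 5 CHF) are assumptions made for illustration.

```python
# Estimated parameters, copied from the listing above.
BETA_TIME_FULLTIME = -1.59709
BETA_TIME_OTHER = -0.577362
BETA_COST = -0.716192

def v_pt(time_pt, cost_pt, fulltime):
    """Public transportation utility, with the scaling of the listing."""
    beta_time = BETA_TIME_FULLTIME if fulltime else BETA_TIME_OTHER
    return beta_time * time_pt / 200 + BETA_COST * cost_pt / 10

def derivative(f, x, h=1e-3):
    """Central finite difference approximation of f'(x)."""
    return (f(x + h) - f(x - h)) / (2 * h)

def value_of_time_pt(time_pt, cost_pt, fulltime):
    """Value of time in CHF/min: (dV/dtime) / (dV/dcost)."""
    dv_dtime = derivative(lambda t: v_pt(t, cost_pt, fulltime), time_pt)
    dv_dcost = derivative(lambda c: v_pt(time_pt, c, fulltime), cost_pt)
    return dv_dtime / dv_dcost

# Hypothetical trip: 60 minutes, 5 CHF.
vot_fulltime = value_of_time_pt(60.0, 5.0, True)
vot_other = value_of_time_pt(60.0, 5.0, False)
```

Because the derivatives are taken with respect to the original variables, the scaling factors (200 for time, 10 for cost) enter the ratio automatically, which is the reason the listing relies on Derive rather than on a direct ratio of coefficients.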