Choice Models#

import larch as lx

In this guide, we’ll take a look at building a discrete choice model using Larch. We assume you have a decent grasp of the fundamentals of choice modeling – if not, we suggest reading the Discrete Choice Modeling section of the Python for Transportation Modeling course.

Some additional advanced or detailed topics are broken out into separate guides.

The examples below work with the tiny dataset introduced in the Data Fundamentals section.

# HIDDEN
import pandas as pd

df_ca = pd.read_csv("example-data/tiny_idca.csv")
cats = df_ca["altid"].astype(pd.CategoricalDtype(["Car", "Bus", "Walk"])).cat
df_ca["altnum"] = cats.codes + 1
df_ca = df_ca.set_index(["caseid", "altnum"])
data = lx.Dataset.construct.from_idca(df_ca.sort_index(), fill_missing=0)
data = data.drop_vars("_avail_")
data["ChosenCode"] = (data["Chosen"] * data["Chosen"].altnum).sum("altnum")
data.coords["alt_names"] = lx.DataArray(
    cats.categories, dims=("altnum"), coords={"altnum": data.altnum}
)
alts = dict(zip(data["altnum"].values, data["alt_names"].values))
for i, k in alts.items():
    data[f"{k}Time"] = data["Time"].sel(altnum=i)
data
<xarray.Dataset> Size: 603B
Dimensions:     (caseid: 4, altnum: 3)
Coordinates:
  * caseid      (caseid) int64 32B 1 2 3 4
  * altnum      (altnum) int8 3B 1 2 3
    alt_names   (altnum) object 24B 'Car' 'Bus' 'Walk'
Data variables:
    altid       (caseid, altnum) object 96B 'Car' 'Bus' 'Walk' ... 'Bus' 'Walk'
    Income      (caseid) int64 32B 30000 30000 40000 50000
    Time        (caseid, altnum) int64 96B 30 40 20 25 35 0 40 50 30 15 20 10
    Cost        (caseid, altnum) int64 96B 150 100 0 125 100 ... 75 0 225 150 0
    Chosen      (caseid, altnum) int64 96B 1 0 0 0 1 0 0 0 1 0 0 1
    ChosenCode  (caseid) int64 32B 1 2 3 3
    CarTime     (caseid) int64 32B 30 25 40 15
    BusTime     (caseid) int64 32B 40 35 50 20
    WalkTime    (caseid) int64 32B 20 0 30 10
Attributes:
    _caseid_:  caseid
    _altid_:   altnum

The basic structure of a choice model in Larch is contained in the Model object.

m = lx.Model(data)

Alternatives#

The universe of possible alternatives is generally defined by the data object, not in the model itself. If the data is defined simply by a Dataset, the _altid_ attribute of that dataset indicates the name of the dimension that represents the alternatives.

data.attrs["_altid_"]
'altnum'

For convenience, Larch adds a dc accessor to Datasets for manipulating the discrete choice facets of the data. This gives access to the alternative codes (the coordinate vector for the alternatives dimension) via the altids method, and to a dictionary mapping codes to names via the alts_mapping property.

data.dc.altids()
Index([1, 2, 3], dtype='int8', name='altnum')
data.dc.alts_mapping
{np.int8(1): 'Car', np.int8(2): 'Bus', np.int8(3): 'Walk'}
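
Because the mapping keys compare equal to plain integers, it can be used directly to translate coded values into names. For example (an illustrative snippet, not a dedicated Larch API):

[data.dc.alts_mapping[int(c)] for c in data["ChosenCode"].values]
['Car', 'Bus', 'Walk', 'Walk']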

If you have a dataset that is missing alternative codes, or if you want to replace the existing alternative codes, you can use the set_altids method, which returns a modified copy of the dataset.

data.dc.set_altids([1, 2, 3, 4], dim_name="newalts")
<xarray.Dataset> Size: 635B
Dimensions:     (caseid: 4, altnum: 3, newalts: 4)
Coordinates:
  * caseid      (caseid) int64 32B 1 2 3 4
  * altnum      (altnum) int8 3B 1 2 3
    alt_names   (altnum) object 24B 'Car' 'Bus' 'Walk'
  * newalts     (newalts) int64 32B 1 2 3 4
Data variables:
    altid       (caseid, altnum) object 96B 'Car' 'Bus' 'Walk' ... 'Bus' 'Walk'
    Income      (caseid) int64 32B 30000 30000 40000 50000
    Time        (caseid, altnum) int64 96B 30 40 20 25 35 0 40 50 30 15 20 10
    Cost        (caseid, altnum) int64 96B 150 100 0 125 100 ... 75 0 225 150 0
    Chosen      (caseid, altnum) int64 96B 1 0 0 0 1 0 0 0 1 0 0 1
    ChosenCode  (caseid) int64 32B 1 2 3 3
    CarTime     (caseid) int64 32B 30 25 40 15
    BusTime     (caseid) int64 32B 40 35 50 20
    WalkTime    (caseid) int64 32B 20 0 30 10
Attributes:
    _caseid_:  caseid
    _altid_:   newalts

Choices#

The dependent variable for a discrete choice model is an array that describes the choices. In Larch, there are three different ways to indicate choices, by assigning to different attributes:

m.choice_ca_var : The choices are given by indicator values (typically but not necessarily dummy variables) in an idca variable.

m.choice_co_code : The choices are given by altid values in an idco variable. These choice codes are then converted to binary indicators by Larch.

m.choice_co_vars : The choices are given by indicator values (typically but not necessarily dummy variables) in multiple idco variables, one for each alternative.

Given the dataset (which has all these formats defined), all the following choice definitions result in the same choice representation:

m.choice_co_code = "ChosenCode"
m.choice_co_vars = {
    1: "ChosenCode == 1",
    2: "ChosenCode == 2",
    3: "ChosenCode == 3",
}
m.choice_ca_var = "Chosen"

After setting the choice definition, the loaded or computed choice array should be available as the 'ch' DataArray in the model’s dataset.

m.dataset["ch"]
<xarray.DataArray 'ch' (caseid: 4, altnum: 3)> Size: 96B
array([[1., 0., 0.],
       [0., 1., 0.],
       [0., 0., 1.],
       [0., 0., 1.]])
Coordinates:
  * caseid     (caseid) int64 32B 1 2 3 4
  * altnum     (altnum) int8 3B 1 2 3
    alt_names  (altnum) object 24B 'Car' 'Bus' 'Walk'
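
As a quick, purely illustrative cross-check (assuming numpy is available), the one-hot choice array agrees with the coded ChosenCode variable in the raw data:

import numpy as np

# The position of the 1 in each row, plus one, recovers the chosen altnum.
assert np.array_equal(m.dataset["ch"].values.argmax(axis=1) + 1, data["ChosenCode"].values)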

Availability#

In addition to the choices, we can also define an array that describes the availability of the various alternatives. Unlike the choices, availability may need to be toggled on or off for potentially every alternative in each case, so there are only two ways to define availability, by assigning to attributes:

m.availability_ca_var : The availability of alternatives is given by binary values (booleans, or equivalent integers) in an idca variable.

m.availability_co_vars : The availability of alternatives is given by binary values (booleans, or equivalent integers) in multiple idco variables, one for each alternative.

Given the dataset, both of the following availability definitions result in the same availability representation:

m.availability_ca_var = "Time > 0"
m.availability_co_vars = {
    1: True,
    2: "BusTime > 0",
    3: "WalkTime > 0",
}

After setting the availability definition, the loaded or computed availability array should be available as the 'av' DataArray in the model’s dataset.

m.dataset["av"]
<xarray.DataArray 'av' (caseid: 4, altnum: 3)> Size: 96B
array([[1, 1, 1],
       [1, 1, 0],
       [1, 1, 1],
       [1, 1, 1]])
Coordinates:
  * caseid     (caseid) int64 32B 1 2 3 4
  * altnum     (altnum) int8 3B 1 2 3
    alt_names  (altnum) object 24B 'Car' 'Bus' 'Walk'
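
The same mask can be reproduced by hand with ordinary xarray operations, which makes for a useful sanity check (illustrative only; Larch builds this array for us):

# Availability here is simply "Time > 0" evaluated on the raw data.
av_manual = (data["Time"] > 0).astype(int)
assert (av_manual.values == m.dataset["av"].values).all()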

Utility Functions#

Choice models in Larch rely on utility expressions that are linear-in-parameters functions, which combine parameters P and data X. You can attach these function expressions to the model in two ways:

m.utility_ca : A linear function containing generic expressions that are evaluated against the idca portion of the dataset. These expressions can technically also reference idco variables, but to define a well-specified choice model with identifiable parameters, each data term will need at least one idca component.

m.utility_co : A mapping of alternative-specific expressions that are evaluated against only the idco portion of the dataset. Each key gives an alternative id, and the values are linear functions.

These two utility function definitions are not mutually exclusive, and you can mix both types of functions in the same model.

from larch import P, X

m.utility_ca = P.Time * X.Time + P.Cost * X.Cost
m.utility_co = {
    1: P.Income_Car * X.Income / 1000,
    2: P.Income_Bus * X.Income / 1000,
}

The computed values for the utility function can be accessed using the utility method, which also permits the user to set new values for various model parameters.

m.utility(
    {"Time": -0.01, "Cost": -0.02, "Income_Car": 0.1},
    return_format="dataarray",
)
<xarray.DataArray (caseid: 4, nodeid: 4)> Size: 128B
array([[-0.3       , -2.4       , -0.2       ,  0.50093705],
       [ 0.25      , -2.35      ,        -inf,  0.32164469],
       [ 1.1       , -2.        , -0.3       ,  1.3559175 ],
       [ 0.35      , -3.2       , -0.1       ,  0.86063728]])
Coordinates:
  * caseid     (caseid) int64 32B 1 2 3 4
  * nodeid     (nodeid) int64 32B 1 2 3 0
    node_name  (nodeid) <U6 96B 'Car' 'Bus' 'Walk' '_root_'
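
As a sanity check, we can recompute the systematic utility of the Car alternative for the first case by hand from the raw data shown earlier (plain arithmetic, not a Larch API):

# Car, caseid 1: Time=30, Cost=150, Income=30000
u_car = -0.01 * 30 + -0.02 * 150 + 0.1 * 30000 / 1000
round(u_car, 6)
-0.3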

Data Preparation#

Larch works with two “tiers” of data:

m.datatree : A DataTree that holds the raw data used for the model. This can consist of just a single Dataset (which is automatically converted into a one-node tree when you assign it to this attribute), or multiple related datasets linked together using the sharrow library.

m.dataset : The assembled arrays actually used in calculation, stored as a Dataset that has variables for various required data elements and dimensions structured to support the model design. The dataset is wiped out when any aspect of the model structure is changed, and rebuilt as needed for computation. For particular applications that require specialized optimization, the dataset can be provided exogenously after the model structure is finalized, but generally it will be convenient for users to let Larch build the dataset automatically from a datatree.

m.datatree
<larch.dataset.DataTree>
 datasets:
 - main
 relationships: none
m.dataset
<xarray.Dataset> Size: 531B
Dimensions:    (caseid: 4, altnum: 3, var_co: 1, var_ca: 2)
Coordinates:
  * caseid     (caseid) int64 32B 1 2 3 4
  * altnum     (altnum) int8 3B 1 2 3
    alt_names  (altnum) object 24B 'Car' 'Bus' 'Walk'
  * var_co     (var_co) <U6 24B 'Income'
  * var_ca     (var_ca) <U4 32B 'Cost' 'Time'
Data variables:
    co         (caseid, var_co) float64 32B 3e+04 3e+04 4e+04 5e+04
    ca         (caseid, altnum, var_ca) float64 192B 150.0 30.0 ... 0.0 10.0
    ch         (caseid, altnum) float64 96B 1.0 0.0 0.0 0.0 ... 1.0 0.0 0.0 1.0
    av         (caseid, altnum) int64 96B 1 1 1 1 1 0 1 1 1 1 1 1
Attributes:
    _caseid_:  caseid
    _altid_:   altnum

Nesting Structures#

By default, a model in Larch is assumed to be a simple multinomial logit model, unless a nesting structure is defined. That structure is defined in a model’s graph.

m.graph
[Tree diagram: Root → Car (1), Bus (2), Walk (3)]

Adding a nest can be accomplished with the new_node method, which allows you to give a nesting node's child codes, a name, and attach a logsum parameter.

z = m.graph.new_node(parameter="Mu_Motorized", children=[1, 2], name="Motorized")
m.graph
[Tree diagram: Root → Walk (3), Motorized (4); Motorized → Car (1), Bus (2)]

The return value of new_node is the code number of the new nest. This is assigned automatically so as not to overlap with any other alternatives or nests. We can use this to develop multi-level nesting structures, by giving that new code number as a child of yet another new nest.

m.graph.new_node(parameter="Mu_Omni", children=[z, 3], name="Omni")
m.graph
[Tree diagram: Root → Omni (5); Omni → Walk (3), Motorized (4); Motorized → Car (1), Bus (2)]

Nothing in Larch prevents you from overloading the nesting structure with degenerate nests, as shown above, but you may have difficulty estimating parameters if you are not careful with such complex structures. If you need to remove a node, the remove_node method accepts its code, but you'll likely find you're better off just fixing your code and starting over, as node removal can have some odd side effects for complex structures.

m.graph.remove_node(5)
m.graph
[Tree diagram: Root → Walk (3), Motorized (4); Motorized → Car (1), Bus (2)]

Parameter Estimation#

Larch can automatically find all the model parameters contained in the model specification, so we don’t need to address them separately unless we want to modify any defaults.

We can review the parameters Larch has found, as well as the current values set for them, in the parameter frame, or pf.

m.pf
value best initvalue minimum maximum nullvalue holdfast
param_name
Cost -0.02 -0.02 0.0 -inf inf 0.0 0
Income_Bus 0.00 NaN 0.0 -inf inf 0.0 0
Income_Car 0.10 NaN 0.0 -inf inf 0.0 0
Mu_Motorized 1.00 NaN 1.0 0.01 1.0 1.0 0
Time -0.01 -0.01 0.0 -inf inf 0.0 0

To constrain certain parameters to fixed values, use the plock method. Because our sample data has so few observations, it won't be possible to estimate values for all of the parameters, so we can assert values for two of them.

m.plock({"Time": -0.01, "Cost": -0.02})
m.pf
value best initvalue minimum maximum nullvalue holdfast
param_name
Cost -0.02 -0.02 -0.02 -inf inf 0.0 1
Income_Bus 0.00 NaN 0.00 -inf inf 0.0 0
Income_Car 0.10 NaN 0.00 -inf inf 0.0 0
Mu_Motorized 1.00 NaN 1.00 0.01 1.0 1.0 0
Time -0.01 -0.01 -0.01 -inf inf 0.0 1

The default infinite bounds on the remaining parameters can be problematic for some optimization algorithms, so it’s usually good practice to set large but finite limits for those values. The set_cap method can do just that, setting a minimum and maximum value for all the parameters that otherwise have bounds outside the cap.

m.set_cap(100)
m.pf
value best initvalue minimum maximum nullvalue holdfast
param_name
Cost -0.02 -0.02 -0.02 -inf inf 0.0 1
Income_Bus 0.00 NaN 0.00 -100.00 100.0 0.0 0
Income_Car 0.10 NaN 0.00 -100.00 100.0 0.0 0
Mu_Motorized 1.00 NaN 1.00 0.01 1.0 1.0 0
Time -0.01 -0.01 -0.01 -inf inf 0.0 1

To actually develop maximum likelihood estimates for the remaining unconstrained parameters, use the maximize_loglike method.

m.maximize_loglike()

Iteration 007 [Optimization terminated successfully]

Best LL = -3.8172546401484566

value best initvalue minimum maximum nullvalue holdfast
param_name
Cost -0.020000 -0.020000 -0.02 -inf inf 0.0 1
Income_Bus 0.028418 0.028418 0.00 -100.00 100.0 0.0 0
Income_Car 0.047842 0.047842 0.00 -100.00 100.0 0.0 0
Mu_Motorized 1.000000 1.000000 1.00 0.01 1.0 1.0 0
Time -0.010000 -0.010000 -0.01 -inf inf 0.0 1
key               value
x                 Cost          -0.020000
                  Income_Bus     0.028418
                  Income_Car     0.047842
                  Mu_Motorized   1.000000
                  Time          -0.010000
logloss           0.9543136600371142
d_logloss         Cost           0.000000
                  Income_Bus    -0.000264
                  Income_Car     0.000208
                  Mu_Motorized   0.098815
                  Time           0.000000
nit               7
nfev              13
njev              7
status            0
message           'Optimization terminated successfully'
success           True
elapsed_time      0:00:00.051397
method            'slsqp'
n_cases           4
iteration_number  7
loglike           -3.8172546401484566

In a Jupyter notebook, this method displays a live-updating view of the progress of the optimization algorithm, so that the analyst can interrupt if something looks wrong.

The maximize_loglike method does not include the calculation of parameter covariance matrices, standard errors, or t-statistics. For large models, this can be a computationally expensive process, and it is often but not always necessary. Those computations are made in the calculate_parameter_covariance method instead. Once completed, things like t-statistics and standard errors are available in the parameter frame.

m.calculate_parameter_covariance()
m.pf
value best initvalue minimum maximum nullvalue holdfast
param_name
Cost -0.020000 -0.020000 -0.02 -inf inf 0.0 1
Income_Bus 0.028418 0.028418 0.00 -100.00 100.0 0.0 0
Income_Car 0.047842 0.047842 0.00 -100.00 100.0 0.0 0
Mu_Motorized 1.000000 1.000000 1.00 0.01 1.0 1.0 0
Time -0.010000 -0.010000 -0.01 -inf inf 0.0 1

Overspecification#

Overspecification in a discrete choice model occurs when the model includes more explanatory variables (independent variables) than necessary or relevant for accurately predicting choice behaviors. A particular computational flavor of overspecification is multicollinearity, which arises when independent variables are highly (or perfectly) correlated with each other. This makes it difficult to estimate the true effect of each variable on the dependent variable (choice behavior) and can lead to unstable parameter estimates. To demonstrate this, we can create a copy of the model and add an Income_Walk term to the utility function.

m2 = m.copy()
m2.utility_co[3] = P.Income_Walk * X.Income / 1000

The three Income_* terms in the model now form a closed loop: shifting all three parameters by the same amount shifts every alternative's utility equally and leaves the choice probabilities unchanged, so only the differences among these parameters are identified. The result is an overspecified model. Larch doesn't stop you from doing this, and may even estimate parameters successfully with the maximize_loglike function.

m2.maximize_loglike()

Iteration 001 [Optimization terminated successfully]

Best LL = -3.8172546356353823

value best initvalue minimum maximum nullvalue holdfast
param_name
Cost -0.020000 -0.020000 -0.02 -inf inf 0.0 1
Income_Bus 0.028418 0.028418 0.00 -100.00 100.0 0.0 0
Income_Car 0.047842 0.047842 0.00 -100.00 100.0 0.0 0
Income_Walk 0.000000 0.000000 0.00 -inf inf 0.0 0
Mu_Motorized 1.000000 1.000000 1.00 0.01 1.0 1.0 0
Time -0.010000 -0.010000 -0.01 -inf inf 0.0 1
/opt/hostedtoolcache/Python/3.10.17/x64/lib/python3.10/site-packages/larch/model/optimization.py:338: UserWarning: slsqp may not play nicely with unbounded parameters
if you get poor results, consider setting global bounds with model.set_cap()
  warnings.warn(  # infinite bounds #  )
key               value
x                 Cost          -0.020000
                  Income_Bus     0.028418
                  Income_Car     0.047842
                  Income_Walk    0.000000
                  Mu_Motorized   1.000000
                  Time          -0.010000
logloss           0.9543136589088456
d_logloss         Cost           0.000000
                  Income_Bus    -0.000264
                  Income_Car     0.000208
                  Income_Walk    0.000056
                  Mu_Motorized   0.098815
                  Time           0.000000
nit               1
nfev              1
njev              1
status            0
message           'Optimization terminated successfully'
success           True
elapsed_time      0:00:00.019119
method            'slsqp'
n_cases           4
iteration_number  1
loglike           -3.8172546356353823

However, when you attempt to calculate the standard errors of the estimates (i.e., the parameter covariance matrix), you may get infinite, NaN, or absurdly large values. Larch also may emit a warning here, to alert you to a possible overspecification problem.

m2.calculate_parameter_covariance()
m2.parameter_summary()
/tmp/ipykernel_7201/1346125230.py:1: PossibleOverspecification: Model is possibly over-specified (hessian is nearly singular).
  m2.calculate_parameter_covariance()
              Value    Std Err  t Stat  Signif  Null Value  Constrained
Parameter
Cost          -0.0200  0.00     NA              0.00        fixed value
Income_Bus     0.0284  NA       NA              0.00
Income_Car     0.0478  NA       NA              0.00
Income_Walk    0.00    NA       NA              0.00
Mu_Motorized   1.00    0.00     NA              1.00        Mu_Motorized ≤ 1.0
Time          -0.0100  0.00     NA              0.00        fixed value

If you get such a warning, you can check the model's possible_overspecification attribute, which may give you a hint of the problem. Here we see that the three Income parameters are flagged together.

m2.possible_overspecification
  (1) 0.0001057
Income_Bus 0.577350
Income_Car 0.577350
Income_Walk 0.577350
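
The usual remedy is to normalize one of the collinear terms, either by dropping it from the utility function or by locking it at a fixed value. For example (one possible fix, using the plock method introduced above):

m2.plock({"Income_Walk": 0})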

Reporting#

Larch includes a variety of pre-packaged and a la carte reporting options.

Commonly used report tables are available directly in a Jupyter notebook through a selection of reporting functions.

m.parameter_summary()
              Value    Std Err  t Stat  Signif  Null Value  Constrained
Parameter
Cost          -0.0200  0.00     NA              0.00        fixed value
Income_Bus     0.0284  0.0353   0.81            0.00
Income_Car     0.0478  0.0386   1.24            0.00
Mu_Motorized   1.00    0.00     NA              1.00        Mu_Motorized ≤ 1.0
Time          -0.0100  0.00     NA              0.00        fixed value
m.estimation_statistics()
Statistic                            Aggregate  Per Case
Number of Cases                      4
Log Likelihood at Convergence        -3.82      -0.95
Log Likelihood at Null Parameters    -4.04      -1.01
Rho Squared w.r.t. Null Parameters   0.055
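
The rho squared statistic reported above is one minus the ratio of the convergence and null log likelihoods, which we can verify by hand using the (rounded) values from the table:

1 - (-3.82) / (-4.04)  # ≈ 0.055, the rho squared w.r.t. null parameters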

To save a model report to an Excel file, use the to_xlsx method.

m.to_xlsx("/tmp/larch-demo.xlsx")