Accessibility Measures in Modal Choice Models: A Proof of Concept Applied to Freight Transport

Abstract

Modal choice models applied to interregional or international freight transportation network models are often based on rather coarse origin-destination matrices, containing annual transported tonnages between (sub)regions, for instance. Generally, only basic (sometimes constructed) independent variables (transportation costs or transit times) are used because other variables such as shipment sizes, service frequencies, etc. are not available. Using origin-destination matrices and an assignment model, it is also possible to compute spatial accessibility measures that can further be used as additional explanatory variables. Indeed, several published studies have identified network accessibility as an important element in the mode-choice decision. This paper also shows that the inclusion of an accessibility measure in the utility functions of a logit model substantially improves the performance of a transportation network model, both in the modal choice and the assignment levels of the classical four-step model. Consequently, the assignment of the estimated modal demands results in more accurate estimated traffic on the networks. The model presented in this paper is to be considered as a proof of concept because its workflow should further be streamlined to make it easily useable by modelers.

Share and Cite:

Jourquin, B. (2023) Accessibility Measures in Modal Choice Models: A Proof of Concept Applied to Freight Transport. Journal of Transportation Technologies, 13, 38-60. doi: 10.4236/jtts.2023.131003.

1. Introduction

In the framework of this paper, we are interested in the last two steps of the classical four-stage transport model: modal choice and assignment. Origin-destination (OD) matrices are thus given and used as input for the modal choice model. The specification of the latest, its performance and the measure of its impact on the assignment model are the core topics covered in this paper.

Modal choice models are mostly based on discrete choice theory and, more specifically, on the family of logit models. Stated or revealed preference (SP/RP) data can be used as input for the modal choice model, but not all modal choice models are based on SP/RP data. When applied to interregional or international areas, for example, aggregated demand data, i.e., OD matrices containing annual transported tonnages between the region’s centroids for instance, are most often used. Moreover, only basic (and sometimes constructed) independent variables (fares, costs or transit time) are used to explain the modal choice. Information about other variables that might have an impact on the mode choice decision, such as shipment sizes and service frequency, is generally lacking.

The model presented in this paper belongs to this category, in which one also finds [1] [2]. In the first paper, fares and transit time are used as explanatory variables. To avoid multicollinearity (the explanatory variables are highly correlated), the authors applied a Box-Cox transform to the independent variables. In the second paper, the authors use a weighted logit mode-choice model in which “representative agents”1 are weighted accordingly to their annual transported tonnages. Later, a weighted logit mode-choice model with Box-Cox transformation of the independent variables (transportation costs and transit times) was proposed in [3], combining the approaches of [1] and [2]. This model covers three modes: road, inland waterway (IWW) and rail transport. The model presented in this paper extends this work, including an accessibility measure as an additional variable.

Indeed, in-depth surveys carried out with shippers, carriers and receivers have concluded that network accessibility plays an important role [4] [5] [6] in mode choice. The large ECHO survey conducted in France [7] indicated that more than 30% of shippers consider network accessibility as a criterion for their modal choice. Improved accessibility and connectivity through the establishment of freight corridors are also considered a vector of economic growth [8].

While accessibility is often cited as an important variable in freight mode choice decisions, econometric models that include this variable are rare. An interesting example is the behavioral analysis proposed in [9]. Beside the transportation cost, the trip length, the weight of the commodities and their value, this model integrates the access to road-rail intermodal facilities. However, accessibility is described in this paper as a level of ease to access facilities (easy, neutral or difficult), which is a restrictive definition. Other papers (see the review in [4] ) have also defined accessibility relative to the possibility of reaching an intermodal terminal. Finally, accessibility is sometimes defined as the possibility of reaching a destination with a given transportation mode [10]. If the modal choice model is part of a transport network model, which is the case of the work presented in this paper, this type of accessibility is captured by the assignment step, as a modal route cannot always be found between all OD pairs. This is, for example, often the case for IWW transport.

Accessibility can also be defined from a spatial perspective as a measure of the capacity of a location to be reached from different locations [11]. Similarly, it can also be defined from an origin node perspective to measure the capacity to reach different destinations. Therefore, the topology of the transport networks is an important component in the determination of this type of accessibility.

Reference [12] shows that the accessibility of a location by different transportation modes influences manufacturing output and thus the demand for transport. More generally, some authors claim that “despite the importance of factors such as barriers and factor costs, accessibility remains the key linking pin between logistics and the spatial economy” [13]. They considered that the total logistics costs should include an accessibility measure to be able to describe the spatial restructuring effects of logistics trends. Other also conclude that “among the multiple factors that affect the choice of destination, aspects such as accessibility, land use, infrastructure availability and other external features of the surroundings suggest the existence of spatial interactions between the available alternatives to the shipper” [14].

There are thus enough indications that accessibility, in its spatial dimension, could have a significant impact on modal choice and that it is worth testing a model that explicitly includes some spatial accessibility measure, which is the objective of this paper.

Some attempts to incorporate a spatial dimension in a destination choice model can be found in the literature [15] [16]. Spatial metrics are for instance used to compute travel times by car in the area of Antwerp (Belgium) [16]. However, to the best of our knowledge, nothing has been published on the inclusion of such a spatial dimension in the modal choice models for freight transport using multivariate utility functions.

In this paper, the computation of a relevant accessibility measure is presented, and its possible correlation with cost and transit time is examined. Several utility functions including accessibility measures are presented, among which the one giving the best results is retained.

As the modal choice model is intended to be used in a freight transport network model, its output is used as input for an assignment. The performances of the modal choice and assignment models (with and without the accessibility measure) are compared.

The methodological aspects of the model are introduced in Section 2, while Section 3 describes the dataset used for the case study. The results and the model performance are examined in Section 4.

Note that this research was conducted using exclusively publicly available data and open-source software.

2. Methodological Discussion

Logistic modeling is the most used method to predict the choice of a transportation mode. As a reminder, the general logit formula can be written as

P m = e U m k = 1 M e U k (1)

where:

M: Total number of alternative modes

Pm: Probability of choosing mode m

Um: Utility of mode m

Uk: Utility of mode k within the set of M alternatives

First, a simple model is set up, in which the utility function contains only transportation costs C and transit times T. One could argue that the transport duration is already included in the transportation cost through some time-dependent components (staff salary costs, etc.). However, like [1] [2] we decided to keep C and T as two separated variables to capture the value of the goods during transport, knowing that the value of time is much higher than the opportunity costs [17] [18]. Moreover, the impact of time savings has some importance for policy-makers. Keeping both transportation costs and transit times as separate independent variables makes thus sense. The utility function for this first model can be written as

U m = α C m + β m T m + δ m (2)

where:

Cm = Transportation cost for mode m

Tm = Transit time for mode m

α, βm, δm: Parameters to estimate

In this specification, αis not mode specific, as it does not make sense from an economic theory perspective (it would make welfare analysis difficult). For transit times, βm is mode specific to obtain mode-specific values of time. It is expected that the αand βm coefficients have a negative sign because a higher cost or transit time makes a mode less attractive. Finally, δm are the modal specific intercepts.

Finally, as the transport demand is embedded in origin-destination (OD) matrices containing aggregated data (of the total annual transported quantities, for instance), a weighted logit is used [2] [3].

The subsequent models include one or two accessibility measure(s) in the utility function. As explained in the introduction, the accessibility of a node can be defined as the accessibility from this node to all the other nodes or to this node for commodities coming from other places in the network. As it is not immediately clear which measure is the most relevant for freight transport, three specifications are tested:

U m = α C m + β m T m + σ m A m d + δ m (3)

U m = α C m + β m T m + τ m A m o + δ m (4)

U m = α C m + β m T m + σ m A m d + δ m + τ m A m o (5)

where:

A m d : Capacity of a location d to be reached from others, using mode m (see 2.2)

A m o : Capacity of a location o to reach other locations, using mode m

σm, τm: Parameters to estimate

The accessibilities A m d and A m o are specific to each location and each mode. Therefore, the mode-specific σm and/or τm are estimated.

These models are rather simple and presented here in an Occam’s razor spirit. Special attention must, however, be paid to the correlation between Cm and Tm (multicollinearity) and to the definition of the accessibility measure. Both aspects are discussed in the next subsections.

2.1. Handling Multicollinearity

Transportation costs and transit times are most often correlated, which is problematic as the estimators may have a weak statistical power [19] [20]. They may also have unexpected signs. For instance, an estimator with a positive value for the transportation cost by truck would signify that the probability of using trucks increases with their operational costs, which is not what is expected.

Some methods can help to control multicollinearity [19] [21]. In many cases, a simple log transformation of the explanatory variables can solve the problem, but it does not always succeed, which is, for instance, the case with the dataset used in the case study presented in this paper. Therefore, we decided to apply a Box-Cox transformation [22] of the explanatory variables (Equation (6)) as this method is sometimes used to re-establish the presumed signs of the estimators [23] [24].

B C ( x , λ ) = { x λ 1 λ , if λ 0 ln ( x ) , if λ = 0 (6)

By “bending” the functional form, the used value for λoffers a significant flexibility, providing more degrees of freedom to the modeler: values of λ< 1 produce convex curves, while λ> 1 results in a concave inflection. As a result, appropriate “bendings” of the independent variables can reduce multicollinearity. To include the Box-Cox transforms of C and T, the utility functions of the models presented in Equations (2) to (5) can thus be rewritten2:

U m = α ( C m λ c 1 λ c ) + β m ( T m λ t 1 λ t ) + δ m (7)

U m = α ( C m λ c 1 λ c ) + β m ( T m λ t 1 λ t ) + σ m A m d + δ m (8)

U m = α ( C m λ c 1 λ c ) + β m ( T m λ t 1 λ t ) + τ m A m o + δ m (9)

U m = α ( C m λ c 1 λ c ) + β m ( T m λ t 1 λ t ) + σ m A m d + τ m A m o + δ m (10)

where λc and λt are the Box-Cox parameters to estimate.

2.2. Computing the Accessibility Measure

The computation of an appropriate accessibility index merits some attention. Much has been written about accessibility measures and their ability to evaluate the accessibility impacts of national land-use and transport scenarios, including the related social and economic impacts. It goes beyond the scope of this paper to present an extensive literature review on this topic. The interested reader will find a review in [25] for instance. Most publications deal with passenger transport and are often limited to urban areas. Even if a very simple accessibility measure such as the number of centroids that can be reached by each mode from a given origin can be used [26], the General Accessibility Measure [27] [28] [29] is often applied. It represents the degree of interconnection between a particular reference location and all or a set of other locations in the area [30] [31]. The General Accessibility Measure can be defined from an origin (Equation (11)) or a destination (Equation 12)) perspective:

A m d = o w o f ( L o d ) (11)

A m o = d w d f ( L o d ) (12)

where:

A m d : Capacity of a node d to be reached from other nodes by mode m

A m o : Capacity a node o to reach other nodes by mode m

wd: Weight representing the attractiveness of destination d

wo: Weight representing the attractiveness of origin o

Lod: Measure of separation (trip length) between o and d

f(…): Decay function

The value of these accessibility measures directly depends on the weight wd (transported quantities to destination d) or wo (transported tonnages coming from o) and on the trip length between the nodes. As a weighted logit is implemented, a relative accessibility index [30] is used to avoid a “double counting” of the weights. The relative accessibility indexes can be written as

Λ m d = o w o f ( L o d ) n o w o (13)

Λ m o = d w d f ( L o d ) n d w d (14)

where n is the number of centroids in the origin-destination matrix. Therefore, the utility functions presented in Equations (8) to (10) become

U m = α ( C m λ c 1 λ c ) + β m ( T m λ t 1 λ t ) + σ m Λ m d + δ m (15)

U m = α ( C m λ c 1 λ c ) + β m ( T m λ t 1 λ t ) + τ m Λ m o + δ m (16)

U m = α ( C m λ c 1 λ c ) + β m ( T m λ t 1 λ t ) + σ m Λ m d + τ m Λ m o + δ m (17)

Most authors use the negative exponential form represented by Equation (19) as a decay function, as it is considered to be closely tied to travel behavior [30], [32]. This choice is, however, seldom challenged or even discussed.

Therefore, the five functional forms presented in [31] and illustrated in Figure 1 are tested in Section 4.2:

I) Power:

f ( L o d ) = ( L o d ) γ 1 (18)

II) Exponential:

f ( L o d ) = e γ 2 L o d (19)

III) Log-normal:

f ( L o d ) = e γ 3 ( log ( L o d ) ) 2 (20)

IV) Exponential normal:

f ( L o d ) = e γ 4 ( L o d ) 2 (21)

VI) Exponential square root:

f ( L o d ) = e γ 5 L o d (22)

In these equations, γ1, γ2, γ3, γ4 and γ5 are the parameters to estimate.

Finally, if multicollinearity is undoubtedly a problem that must be tackled for C and T, one could argue that the same problem occurs between the relative accessibility measure Lo or Ld and C and T because these measures are computed using L, which is, by nature, also strongly correlated with C and T. However, the value of L is diluted in the decay function, while wd or wo plays a preponderant role. Using the dataset presented in Section 3, the correlation between the computed relative accessibility measures and C or T varies between 0.02 and 0.2, depending on the transportation mode and the type of commodities. The relative accessibility measures are thus not correlated with the two other independent variables.

3. Dataset Description

The dataset used for the case presented in this paper is gathered from the ETISPlus European FP7 Research Program [33]. Combining data, analytical modeling with maps and an online interface for accessing the data, it provides an information system useful for assessing European transport policies. Public access to several deliverables and many data is provided, including origin-destination matrices and digitized transportation networks. Unfortunately, contrary to what was initially announced, the data have never been updated.

The OD matrices for 2010 at the NUTS-2 regional level are used. They are available for road, railway, and inland waterway transport, with annual volumes (tonnages) for 10 groups of commodities (NST/R Chapters 0 to 9, Table 1). In this paper, a subset of the OD matrices is used, focused on continental European transport. It includes the countries belonging to the European Union (EU) or to the European Economic Association (EEA), apart from islands and very peripheral zones within some countries. The retained regions, with their centroid, are represented in Figure 2.

Figure 1. Shapes of the tested decay functions.

Table 1. Description of the NST/R groups of commodities.

Figure 2. NUTS2 regions and retained centroids for the case study.

As shown in Table 2, the datasets corresponding to the different groups of commodities are quite different in size (number of non-null OD cells), volume (million tons/year) and modal split (market shares). The modal choice models presented in Section 2 are solved for each group of commodities separately, as different coefficients can be expected for different commodities. This is also useful to test the robustness of the model on different datasets.

In addition to the OD matrices, digitized networks are available in the ESRI shapefile format. Even if the downloaded files can be visualized directly in GIS software, the ETISPlus networks cannot be used for assignments. They are indeed not conceived for this, and some important manipulations are needed to make them “assignment compatible” [34]. The resulting networks contain more than 58,000 road links, 1600 inland waterway (IWW) links and 10,000 railroad links.

4. Application and Model Performance

The methodology outlined in Section 2 is applied to the dataset described in Section 3, following the workflow illustrated in Figure 3. The digitized networks, the modal OD matrices and the transportation cost functions (Section 4.1) are used as input for a first assignment using the Nodus [35] open-source transportation network modeling software especially designed for multimodal and intermodal freight transport. This allows retrieving, for each OD cell and each mode, the transportation cost, the transit time and the length of the trip. The latest is further used as input to compute the accessibility measures. Furthermore, for

Table 2. Content of the OD matrices.

Figure 3. Model workflow.

each group of commodities, the modal matrices are combined, and the modal choice models are applied to the merged matrices.

Note that some nodes do not have rail or IWW access and that no modal route can therefore be found. Those cases correspond to a nonexistent cell in the modal OD matrix and can be properly handled by the Biogeme software package [36] used to estimate the logit models (Section 4.3). This also has an impact on the accessibility measures Λ m o and Λ m d (Equations (13) and (14)). Indeed, the numerator is computed using only the OD relations for which a route exists, and the denominator uses a fixed number of nodes n (the number of centroids in the area). As a result, the accessibility of a node decreases with the difference between n and the number of nodes taken into account in the numerator, which is an expected result.

The workflow presented in Figure 3 is rather complex and relies on several tools and programing languages. Therefore, the model must still be considered as a proof of concept that must further be streamlined. Ideally, only one tool should be used, and the model should be able to estimate the parameters of the decay function along with the other estimators. This work must still be done.

4.1. Computation of Costs and Transit Times

The OD matrices and the transportation networks are imported in Nodus [35] to compute the values of the explanatory variables needed for the modal choice models. The software allows retrieving, for each OD pair, each mode m and each group of commodities g, the total cost of transport C m g , the total transit time T m g and the trip length L m g . The explanatory variables are thus constructed variables retrieved from the results of an assignment of each modal OD matrix on its corresponding digitized transportation network.

The total cost C m g of a route between an origin and a destination for a vehicle of mode m transporting commodities of type g is equal to

C m g = l d _ c o s t m g + u l _ c o s t m g + l R m v _ c o s t l , m g (23)

where:

ld_cost: loading cost

ul_cost: unloading cost

mv_cost: moving cost

R: the set of successive links representing the route

The loading and unloading costs are fixed costs, although they vary with the mode and the transported goods. The transshipment costs associated with intermodal transport chains are not explicitly modeled in this paper. The loading factors of the vehicles are taken from the ECCONET research project [37]. They are exogenous but specific for each group of commodities and type of vehicle (trucks, trains or barges) included in the model. The traveling unit cost, or moving cost, mv_cost, depends on the length and on the average commercial speed for the considered mode. It considers all the carrier’s costs during transport: labor, capital invested in vehicles, fuel, maintenance, insurance, services directly linked to a transport, plus all residual indirect costs such as those of administrative services. A given link l belonging to a network of mode m is computed as

m v _ c o s t l , m g = A v e r a g e s p e e d m S p e e d l , m l e n g t h l u n i t m v _ c o s t m g (24)

As the unit mv_cost also contains time-related costs, the average speed/speed ratio allows for considering higher/lower costs than average costs on slower/faster segments of the network. Average Speedm represents the average speed for mode m on the transportation network, and Speedl,m is the average speed on link l.

Similarly, the total transit time T m g has fixed elements (the loading and unloading durations (ld_duration and ul_duration)) and a variable part (the travel duration that depends on length and speed on the successive links along the route). Thus,

T m g = l d _ d u r a t i o n m g + u l _ d u r a t i o n m g + l R m v _ d u r a t i o n l , m (25)

With

m v _ d u r a t i o n l , m = l e n g t h l / s p e e d l , m (26)

4.2. Choice of a Decay Function

The γx parameters of Equations (18) to (22) are estimated using the R “nls2” package [38], which determines the nonlinear least square estimates of the parameters of a nonlinear model. The five decay functions are estimated for the three transportation modes and the ten groups of commodities listed in Table 1. As shown in Figure 1, and unlike the other formulations, the exponential normal decay function has an S-shape. This is probably why it was considered the less efficient decay function in [31]. The conclusion drawn by the authors could be explained by the fact that they used the R2 values of the models to compare the performances of the different functional forms. However, R2 is known not to be an appropriate performance measure for nonlinear functions [39] [40]. Indeed, the following problems occur:

· R2 is consistently high for both excellent and appalling models.

· R2 will not always increase for better models.

· If R2 is used to pick the best model, it leads to the proper model only 28% - 43% of the time [40].

Therefore, another measure, such as the residual standard error, must be used instead. Table 3 compares the relative residual standard error (RSEmodel/RSEbest model) of each specification (I-V, as given by Equations (18) to (22)). It appears that the differences between the specifications are limited but that the exponential normal decay function (IV, Equation (21)) always performs best for road transport. For the other two modes, no decay function clearly outperforms the others.

Table 3. Relative residual standard error for the 5 decay functions (per mode and NST/R).

Altogether, the exponential normal formulation will be used in this paper to compute the relative accessibility measures Λ m d and Λ m o .

4.3. Logit Model Estimation

Now that all the explanatory variables can be computed, the modal choice models can be estimated. Among the three utility functions (Equations (15) to (17)), the one that includes both accessibility measures performs best, indicating that accessibility to the customers from a location and accessibility to the suppliers (sourcing choice) appear to be relevant for the modal choice. Therefore, only the model with both accessibilities is further used and discussed.

Table 4 presents the estimators for the model without accessibility (Equation (7)), while Table 5 contains those of the model with both accessibility measures (Equation (17)). The parameters are estimated using Biogeme [35], an open-source Python package designed for the maximum likelihood estimation of parametric models in general, with a special emphasis on discrete choice models. The following constraints are imposed on the models:

· αand βm must be negative (see Section 2).

· Λ m d and Λ m o must be positive, as better modal accessibility is also supposed to make a mode more attractive.

With a confidence level of 0.05, all the estimators are significant except τroad for NST/R 0 (agricultural products and live animals) and αfor NSTR/5 (metal products) in the model with accessibility measures (Table 5).

The Box-Cox parameters λc and λt for both models are lower than 1 for all groups of commodities, meaning that the negative effect of a cost increase or a longer transit time is marginally decreasing.

In most cases, σmode is lower than τmode, indicating that the capacity of a node

Table 4. Estimated parameters for the model without accessibility measure.

to be reached from other locations plays a less important role in the mode choice decision than the capacity of a location to reach the other locations in the area of interest. This is not true for road transport of NSTR/0 commodities (agricultural products and live animals), for which τroad is equal to zero. The choices of road transport for NSTR/4 (ores and metal waste) and IWW transport for NSTR/2 (solid mineral fuels) are also clearly more influenced by the accessibility of the destination from the different origin nodes. To a lesser extent, this is also true for road transport for NSTR/7 (fertilizers). The case of IWW for NSTR/2 could be explained by the fact that most of the tonnage comes from a reduced number of origins, mainly Belgian and Dutch ports. The topology of the inland waterway networks is such that relatively few European destinations are accessible by barge from these ports. We confess that we did not find an explanation for the other “outliers”.

Altogether, except for those cases, no τmode clearly outperforms σmode, indicating

Table 5. Estimated parameters for the model with accessibility measures.

that both accessibilities are important in the mode choice decision. Thus, not only are the characteristics of the possible modal itineraries between an origin and a destination (cost and transit time) taken into account, but the level of accessibility from/to the whole network at both locations is also important. As the spatial accessibility measures used in this paper are closely related to the concept of connectivity (the density of connections in path or road networks and the directness of links), one can conclude that the qualities of the network connections to the surrounding areas of the origins and destinations are important. In an interregional or international context, this means, for instance, that the will of the European Commission to finalize the TEN (Trans-European Network) makes sense, as this improves the accessibility to some regions. However, the TEN is mainly concentrated in the most important industrial areas. This undoubtedly raises the question of the accessibility of more peripheral areas, which is more the responsibility of local (national or regional) authorities.

4.4. Comparative Performances

The log-likelihood and the Akaike information criterion of the model with accessibility are better for all groups of commodities. As these performance measures do not give a comprehensive idea of the added value of the accessibility measure in the model, an analysis of the mean absolute percentage error (MAPE) between the observed and computed tonnages is presented in Table 6. This metric is commonly used in forecasting. More precisely, the weighted variant of the MAPE is used, because a prediction error for an OD relation with an important tonnage is more damageable than for a less significant OD. Such a MAPE (sometimes referred to as WMAPE in the literature) can be computed as

Table 6. MAPE between observed and estimated quantities at the OD level.

MAPE = z = 1 Z | A z F z | / z = 1 Z A z (27)

where:

Z: number of fitted points

At: actual value

Ft: forecast value

The MAPE values are obtained following the workflow presented in Figure 4. The ETISPlus “observed” modal OD matrices for a given group of commodities are merged, so the resulting matrix contains the sum of the transported quantities by each mode. The modal choice model is then applied to this matrix to obtain the computed (estimated) matrices for each mode. The MAPE can then be computed for each mode and group of commodities, comparing the values of the “observed” OD cells Qxy and their corresponding estimated values Q ¯ x y .

Table 6 shows that the mean absolute percentage error is systematically lower for all modes and groups of commodities for the model with accessibility measures. One could argue that the overall performance of the model remains rather poor, as the average errors are high and even sometimes larger than 100% for the first model. This is obviously because the model does not integrate more relevant explanatory variables (that are not available). The important conclusion is, however, that the integration of the accessibility measures in the utility function substantially improves the predictive power of the modal choice model.

The values presented in Table 6 do not take the impact of the improved modal choice model on the estimated traffic on the networks into account. This can be captured using the workflow presented in Figure 5.

Figure 4. Validation process at the OD level for one group of commodities.

As no observed count data are available along the segments of the networks, the traffic computed by separate assignments of each mode (the “observed” OD matrix of a mode assigned to its own network) is used as a proxy of the actual transported tonnage on each link Tlink. These reference tonnages are then compared to the computed T ¯ l i n k values obtained by an assignment of the estimated modal OD matrices.

Table 7 gives the resulting MAPE values3 between the Tlink and T ¯ l i n k series. The values show that the model with accessibility always performs better, especially for IWW transport. The “Aggregate” line gives the values computed using the total tonnages, including all commodities. These totals represent the traffic on the networks (at least for nonempty vehicles), which is usually of interest to policy-makers. The scatter plots of Figure 6 (in this case for IWW transport of solid mineral fuels) provide further insight into how the model with accessibility improves the predictive power of the assignment model.

Figure 5. Validation process at the network (traffic) level for one mode.

Table 7. MAPE between “observed” and computed traffic on the networks.

Figure 6. Scatter plots of actual vs. estimated tonnages (IWW, NSTR/2).

5. Conclusions and Perspectives

In this paper, a modal choice model is presented in which network accessibility is used as an additional explanatory variable, in addition to transport cost and transit time.

The concept of accessibility is defined in this paper from a spatial perspective as a measure of the capacity of a location to be reached from different locations or from an origin node perspective to measure the capacity to reach different destinations.

The model is designed to be used in a transportation network model. Therefore, its performance is measured at the modal-choice and assignment steps.

Special attention is given to two specific aspects of the model. First, multicollinearity between transport costs and transit times must be addressed. For this purpose, a Box-Cox transform is applied to both independent variables. Second, several possible functional forms of the decay function used to compute the accessibility measures are tested. If the negative exponential form is often considered as appropriate for transport over medium and long distances, as generally cited in the literature, we conclude that the exponential-normal curve fits better. It also appears that the chosen accessibility measures are not correlated with transport costs and transit times.

In most cases, the capacity of a node to be reached from other locations plays a less important role in the mode choice decision than the capacity of a location to reach the other locations in the area of interest. Both accessibility measures are, however, important and taken into account in the mode choice decision.

To assess the added value of the introduction of the (relative) accessibility measures in the modal choice model, two performance indicators are presented for a case study based on the ETISPlus European OD matrices containing annual transported tonnages for 10 groups of commodities. The first compares the mean absolute percentage errors (MAPEs) of the observed transported tonnages and those estimated by the modal choice models. The MAPE values of the model with the accessibility measures are substantially lower. A similar exercise is performed with the “observed” and computed tonnages along the roads, inland waterways, and railroads of the digitized transportation networks. Here also, the MAPE values are also systematically better for the model with accessibility measures.

While keeping the econometric model simple, the inclusion of an accessibility measure in the modal choice model module improves the predictive power of the transportation network model, which is an advantage for modelers because it increases the accuracy of cost-benefit or socioeconomic analysis, for instance. However, the workflow of the model remains rather complex as it relies on several software tools and programing languages. An additional research effort toward a simultaneous estimation of mode choice and decay function is needed to make the model more easily useable by modelers.

Acknowledgements

I would like to thank Pauline Quittelier, a student of mine, who, as part of her master dissertation, looked at the importance of spatial accessibility in a mode choice model. Although this article presents a more complex model, based on several scientific articles and further validations, here work served as an incentive for this research.

NOTES

1A combination of OD pairs and commodity groups.

2Obviously, as specified in Equation (6), ln(Cm) or ln(Tm) is used if λc or λt = 0.

3In order not to bias the calculation, the links connected to only two other links of the same mode and from which it is not possible to change direction are removed from the process. Indeed, the flow on these links is always equal to the flow on their preceding and following links.

Conflicts of Interest

The authors declare no conflicts of interest regarding the publication of this paper.

References

[1] Picard, G. and Gaudry, M. (1998) Exploration of a Box Cox Logit Model of Intercity Freight Mode Choice. Transportation Research Part E: Logistics and Transportation Review, 34, 1-12.
https://doi.org/10.1016/S1366-5545(97)00016-1
[2] Rich, J., Holmblad, P.M. and Hansen, C.O. (2009) A Weighted Logit Freight Mode-Choice Model. Transportation Research Part E: Logistics and Transportation Review, 45, 1006-1019.
https://doi.org/10.1016/j.tre.2009.02.001
[3] Jourquin, B. and Beuthe, M. (2019) Cost, Transit Time and Speed Elasticity Calculations for the European Continental Freight Transport. Transport Policy, 83, 1-12.
https://doi.org/10.1016/j.tranpol.2019.08.009
[4] Holguín-Veras, J., Kalahasthi, L., Campbell, S., González-Calderón, C.A. and Wang, X. (2021) Freight Mode Choice: Results from a Nationwide Qualitative and Quantitative Research Effort. Transportation Research Part A: Policy and Practice, 143, 78-120.
https://doi.org/10.1016/j.tra.2020.11.016
[5] Kim, H.-C., Nicholson, A. and Kusumastuti, D. (2014) Freight Transport Mode Choice and Mode Shift in New Zealand: Findings of a Revealed Preference Survey. In: Sustainable Logistics (Transport and Sustainability, Vol. 6), Emerald Group Publishing Limited, Bingley, 165-192.
https://doi.org/10.1108/S2044-994120140000006007
[6] Madaan, J., Dwivedi, A., Shardeo, V. and Patil, A. (2021) Mode Choice Problem in Context of Freight Transportation: A Systematic Review and Bibliometric Analysis. IJLSM, 1, 1.
https://doi.org/10.1504/IJLSM.2021.10039381
[7] Guilbault, M. and Soppé, M. (2009) Apports des enquêtes chargeurs: Connaissance des chaînes de transport de marchandises et de leurs déterminants logistiques. INRETS.
[8] ITF (2022) Mode Choice in Freight Transport. OECD, ITF Research Reports, Paris.
[9] Samimi, A., Kawamura, K. and Mohammadian, A. (2011) A Behavioral Analysis of Freight Mode Choice Decisions. Transportation Planning and Technology, 34, 857-869.
https://doi.org/10.1080/03081060.2011.600092
[10] de Jong, G. (2014) Mode Choice Models. In: Tavasszy, L. and de Jong, G., Eds., Modelling Freight Transport, Elsevier, Amsterdam, 117-141.
https://doi.org/10.1016/B978-0-12-410400-6.00006-9
[11] Rodrigue, J.-P. (2020) The Geography of Transport Systems. 5th Edition, Routledge, Abingdon, Oxon, New York.
[12] Yi, Y. and Kim, E. (2018) Spatial Economic Impact of Road and Railroad Accessibility on Manufacturing Output: Inter-Modal Relationship between Road and Railroad. Journal of Transport Geography, 66, 144-153.
https://doi.org/10.1016/j.jtrangeo.2017.11.017
[13] Tavasszy, L.A., Ruijgrok, K. and Davydenko, I. (2012) Incorporating Logistics in Freight Transport Demand Models: State-of-the-Art and Research Opportunities. Transport Reviews, 32, 203-219.
https://doi.org/10.1080/01441647.2011.644640
[14] Garrido, R.A. and Leva, M. (2004) Port of Destination and Carrier Selection for Fruit Exports: A Multi-Dimensional Space-Time Multi-Nomialprobit Model. Transportation Research Part B: Methodological, 38, 657-667.
https://doi.org/10.1016/j.trb.2003.10.001
[15] Cervero, R. (2002) Built Environments and Mode Choice: Toward a Normative Framework. Transportation Research Part D: Transport and Environment, 7, 265-284.
https://doi.org/10.1016/S1361-9209(01)00024-4
[16] Hammadou, H., Thomas, I., Verhetsel, A. and Witlox, F. (2008) How to Incorporate the Spatial Dimension in Destination Choice Models: The Case of Antwerp. Transportation Planning and Technology, 31, 153-181.
https://doi.org/10.1080/03081060801948126
[17] Blauwens, G. and Van de Voorde, E. (1988) The Valuation of Time Savings in Commodity Transport. International Journal of Transport Economics, 15, 77-87.
[18] De Jong, G., Bakker, S., Pieters, M. and Wortelboer, P. (2004) New Values of Time and Reliability in Freight Transport in the Netherlands. Association for European Transport.
[19] Greene, W.H. (2012) Econometric Analysis. 7th Edition, Pearson, New York.
[20] Adeboye, N.O., Fagoyinbo, I.S. and Olatayo, T.O. (2014) Estimation of the Effect of Multicollinearity on the Standard Error for Regression Coefficients. IOSRJM Journal of Mathematics, 10, 16-20.
https://doi.org/10.9790/5728-10411620
[21] Dormann, C.F., et al. (2013) Collinearity: A Review of Methods to Deal with It and a Simulation Study Evaluating Their Performance. Ecography, 36, 27-46.
https://doi.org/10.1111/j.1600-0587.2012.07348.x
[22] Box, G.E.P. and Cox, D.R. (1964) An Analysis of Transformations. Journal of the Royal Statistical Society: Series B (Methodological), 26, 211-243.
https://doi.org/10.1111/j.2517-6161.1964.tb00553.x
[23] Fridstrøm, L. and Madslien, A. (1994) Own Account or Hire Freight: A Stated Preference Analysis. 7th International Conference on Travel Behaviour, Valle Nevado, Chile.
https://www.researchgate.net/publication/268221233_Own_account_or_hire_freight_a_stated_preference_analysis
[24] Gaudry, M. (2016) Méthodes Box-Cox, algorithmes de TRIO et demande de transport: Trois consignes occamiennes pour faire rendre sens aux coefficients de régression de modèles simples et logistiques discrets ou agrégés.
[25] Geurs, K.T. and Ritsema van Eck, J.R. (2001) Accessibility Measures: Review and Applications. Evaluation of Accessibility Impacts of Land-Use Transport Scenarios, and Related Social and Economic Impact. Rijksinstituutvoor Volksgezondheid en Milieu, Bilthoven, The Netherlands.
[26] Quittelier, P. (2021) Modèles de choix modal pour le transport de marchandises, Master Thesis, UCL-Louvain School of Management, Mons.
http://hdl.handle.net/2078.1/thesis:31450
[27] Morris, J.M., Dumble, P.L. and Wigan, M.R. (1979) Accessibility Indicators for Transport Planning. Transportation Research Part A: General, 13, 91-109.
https://doi.org/10.1016/0191-2607(79)90012-8
[28] Gutiérrez, J., Monzón, A. and Piñero, J.M. (1998) Accessibility, Network Efficiency, and Transport Infrastructure Planning. Environment and Planning A: Economy and Space, 30, 1337-1350.
https://doi.org/10.1068/a301337
[29] Pooler, J.A. (1995) The Use of Spatial Separation in the Measurement of Transportation Accessibility. Transportation Research Part A: Policy and Practice, 29, 421-427.
https://doi.org/10.1016/0965-8564(95)00013-E
[30] Thomas, I., Hermia, J.-P., Vanelslander, T. and Verhetsel, A. (2003) Accessibility to Freight Transport Networks in Belgium: A Geographical Approach. Tijdschrift voor Economische en Sociale Geografie, 94, 424-438.
https://doi.org/10.1111/1467-9663.00271
[31] Reggiani, A., Bucci, P. and Russo, G. (2011) Accessibility and Impedance Forms: Empirical Applications to the German Commuting Network. International Regional Science Review, 34, 230-252.
https://doi.org/10.1177/0160017610387296
[32] Handy, S.L. and Niemeier, D.A. (1997) Measuring Accessibility: An Exploration of Issues and Alternatives. Environment and Planning A: Economy and Space, 29, 1175-1194.
https://doi.org/10.1068/a291175
[33] Szimba, E., et al. (2012) ETISplus Database Content and Methodology.
[34] Jourquin, B. (2017) Estimation of Travel Time Elasticities with a Multimodal Freight Transport Model Calibrated and Validated on Publicly Available Regional Data. 14th International Nectar Conference, Madrid.
https://www.researchgate.net/publication/317580054_Estimation_of_travel_time_elasticities_with_a_multimodal_freight_transport_model_calibrated_and_validated_on_publicly_available_regional_data
[35] Jourquin, B. (2021) Nodus, the Transportation Network Modeling Software Designed for Multimodal and Intermodal Freight Transport.
http://nodus.uclouvain.be
[36] Bierlaire, M. (2020) A Short Introduction to PandasBiogeme. TRANSP-OR 200605.
http://transp-or.epfl.ch/documents/technicalReports/Bier20.pdf
[37] Beuthe, M., Jourquin, B., Urbain, N., Lingemann, I. and Ubbels, B. (2014) Climate Change Impacts on Transport on the Rhine and Danube: A Multimodal Approach. Transportation Research Part D: Transport and Environment, 27, 6-11.
https://doi.org/10.1016/j.trd.2013.11.002
[38] Grothendieck, G. (2013) Non-Linear Regression with Brute Force.
https://cran.r-project.org/package=nls2
[39] Kvalseth, T.O. (1983) Note on the R2 Measure of Goodness of Fit for Nonlinear Models. Bulletin of the Psychonomic Society, 21, 79-80.
https://doi.org/10.3758/BF03329960
[40] Spiess, A.-N. and Neumeyer, N. (2010) An Evaluation of R2 as an Inadequate Measure for Nonlinear Models in Pharmacological and Biochemical Research: A Monte Carlo Approach. BMC Pharmacology, 10, Article No. 6.
https://doi.org/10.1186/1471-2210-10-6

Copyright © 2024 by authors and Scientific Research Publishing Inc.

Creative Commons License

This work and the related PDF file are licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.