Traffic Maps and Smartphone Trajectories to Model Air Pollution , Exposure and Health Impact

In this study, we explored to combine traffic maps and smartphone trajectories to model traffic air pollution, exposure and health impact. The approach was step-by-step modeling through the causal chain: engine emission, traffic density versus traffic velocity, traffic pollution concentration, exposure along individual trajectories, and health risk. A generic street with 100 km/h speed limit was used as an example to test the model. A single fixed-time trajectory had maximum exposure at velocity of 45 km/h at maximum pollution concentration. The street population had maximum exposure shifted to a velocity of 15 km/h due to the congestion density of vehicles. The shift is a universal effect of exposure. In this approach, nearly every modeling step of traffic pollution depended on traffic velocity. A traffic map is a super-efficient pre-processor for calculating real-time traffic pollution exposure at global scale using big data analytics.


Introduction
Traffic pollution is the dominant source of air pollution in most metropolitan areas and has major health effects.50% of the world's population lives in urban areas covering only 0.4% of the earth's surface, and 70% are projected to live in urban areas by 2050 [1].In many European cities, industrial air pollution is being replaced by traffic pollution [2].For example in Moscow, traffic pollution accounts for 93% of the air pollution [3] [4].
In most cities, air pollution levels exceed the guideline maximum levels established by the WHO (World Health Organization) to protect human health.People most exposed are those who spend a much time in heavy traffic [5] [6] or reside near heavy traffic [7] [8].For example, US EPA (United States Environmental Protection Agency) claims that 45 million people in the US are living, working, or attending school within 300 feet (91 m) of a major road, airport or railroad [9].Some cities provide air quality information to the public [10] [11], but not individualized information [12] [13].
Recent research projects (e.g., CitiSense, CITI-SENSE, EveryAware, and iS-PEX) provided air quality information at smartphones to citizens equipped with low-cost pollution sensors [14] [15] [16] [17].However, low cost sensors are typically neither stable nor accurate [18] [19] [20] [21].It is infeasible to scale up to population level due to the large cumulative cost.A study in 2013 [20] presented an alternative approach of using individual mobile phone trajectories accumulating exposure from a pollution map without personal sensors.
Since 2013, smartphones with location services has gained nearly 100% mobile phone market penetration.A second trend is that dedicated mobile networks are built to add billions of new sensors to the internet that is the Internet of Things (IoT).Indeed, 90% of all available data today were generated within the last two years.Trends in computer science include big data, data mining, advanced analytics, cognitive computing, virtual reality, and robotics.New algorithms are based on more abstract mathematics, including topology, network theory, and functional analysis.New knowledge is extracted from data lakes filled by streams of heterogeneous data.Simulators predict future states of complex systems as digital twins of the real systems.In this study, traffic emissions, pollution, exposure and health effects were modeled by traffic map and smartphone data.

Method
The modelling principles were to value simplicity, computational speed and scalability, above accuracy.Accuracy improvements can be added to the model at a later stage.The overall method is to mathematically model effects along the causal chain starting with single vehicle tailpipe emissions, ending up with health risk.Traffic map data and smartphone trajectories are used wherever possible.Figure 1 shows construction of a traffic map, an individual exposure trajectory, and up-scaled population risk distribution.Figure 2 illustrates the individual causal chain modeling steps.From the existing traffic maps, the traffic velocity can be extracted.From traffic velocity, by using the traffic engineering models, the traffic density can be calculated (see Section 2.3).Based upon the traffic density and traffic velocity, the traffic emission can be calculated (see Equation No. ( 6)).From traffic emission, by using Gaussian dispersion model, the pollutants concentration can be calculated.By combing individual smart phone based trajectory, the individual exposure can be calculated and the health risks Figure 1.Trajectory based traffic pollution system by using a traffic map with street segment "traffic velocity" or "travel time" as a super-efficient pre-processor to calculate emission, concentration fields, and then using my mobile phone location trajectory to calculate exposure.can be further estimated.The individual exposure and health risk can be scaled up to the population exposure and health risks.

Vehicle Tailpipe Emission
Vehicle tailpipe emission is the sum of emissions from an idle engine and a working engine.The idle engine gives a constant base load while the engine is turned on [22].The emission rate per unit length j q (g/(s•m)) can be given as a sum over N vehicle types: where 0, jk E is the idle engine emission factor (kg/s) of the j th pollutant of vehicle type k, 1, jk E is the working engine emission factor in (kg/m) for the j th pollutant and vehicle type k, k n is the number of vehicles of type k per length in (m), and k v is the velocity (m/s) of a vehicle of type k.On average, vehicles move with the local traffic flow velocity v, so k v v = , and Equation (1) simplifies to The change of vehicle distribution k n is slow, over years, such that the distribution of vehicle types is approximately constant in time and space at country level.Thus, a set of effective constant parameters 0, , j j E E and n can be applied Plug-in Electric Vehicles (PEVs) and Plug-in Electric Hybrid Vehicles (PEHVs) are positive for air quality, but market penetration varies among countries.For example, the share of PEVs of the new car sales in 2015 was 0.66% in the USA and 22.39% in Norway [23].Inserting Equations ( 3)- (5) in Equation (2), the emission rate for pollutant j is ( ) In the next section, traffic maps information is assessed.

Traffic Flow Velocity Measurements
A traffic map (for example from Google, Here, TomTom, Yandex, and Baidu) shows near-real-time traffic velocity or travel time by a color code of data on street segments in a street map [24].Consider a street segment i of length i s in a traffic map, and the travel time i t driving from start to end with local traffic Two velocity averages are used in traffic engineering [25] [26]: time mean velocity t v that is the time-averaged instantaneous velocity of vehicles passing a given position on the road over some time interval t with ( ) m t vehicles meas- ured by a pair of nearby induction loop detectors embedded in the road = ∑ , and space mean velocity s v that is the average velocity of m vehicles passing a fixed street segment of length s where each velocity is calculated by time intervals i t to cross street segment from start to end, for example by video camera recording the segment, and In practice, the time mean velocity is about 2% greater than the space mean velocity.Smartphone location service can be used to measure velocity by spatial difference i s in individual GPS (Global Positioning System) positions over a fixed sampling time interval t and average over vehicles on a street segment ( ) Global traffic maps are calculated by millions of GPS positions, other static and dynamic input data, filtering, position corrections, and historical data to fill in blanks [26] [27].In the next section, the vehicle density is constructed by using traffic-engineering models.

Traffic Engineering Models of Vehicle Density
In traffic engineering, the fundamental diagram for traffic flow relates traffic flux ( nv ), the number of vehicles passing a fixed point per time, to vehicle density [24] [28] [29] [30].Traffic density is related to traffic flow velocity by the van Aerde model (1995) [31] where the average headway per vehicle (street length per vehicle) 1/n is: where 0 v is the free float traffic velocity at zero density c is a constant of the term that ensures zero density as c is a constant time interval per vehicle.For safe driving in Norway, 3 3 s c ≈ .By inverting 1/n Thus, the vehicle density in Equation ( 6) can be obtained from the traffic map velocity.The maximal density max n is given for complete standstill By inserting Equation (11) in Equation ( 6), the emission rate per unit length MacNicholas (2009) [32] developed an alternative traffic model as where 0 v is the free flow velocity, max n is the maximal vehicle density, and c and α are curve-shape constants.The end-points are ( ) . The free flow velocity is based on the speed limit, and the maximum density is given by the average length of vehicles plus a safety margin.The parameters α and c are specified by curve fitting to measured data.MacNicholas (2009) [32] . Equation ( 14) in normalized velocity and density is Inserting Equation ( 15) in Equation ( 6) gives ( ) Van Aerde [31] and MacNicholas [31] ignored fluctuations.An example of a more advanced model is the three phase model of Kerner (1998) [33] with free flow, synchronized flow and a wide-moving jam.A wide-moving jam is a wide jam that has almost a step change in density at the upstream side of the jam.The step change moves upstream at a velocity of about 20 km/h as vehicles are added to the jam.The step change is similar to a solitary wave, a so-called soliton, such as a Tsunami wave, and in physics, the step-change moving jam is called a "jamiton".These effects are ignored here.In the next section, the pollution field is modeled.

Gaussian Plume and Turbulent Mixing at a Street Segment
Air dispersion is modelled by a Gaussian plume [34].The steady state concentration of the j th pollutant j c (in kg/m 3 ) at a x, y, zposition, relative to the center of line source in the downwind x, crosswind y and vertical z directions are given as [35] ( ) ( ) sin where j q is the line source strength or mass emission rate per unit length (kg/(s⋅m)), θ is the angle between the wind direction and the road in the range 0˚ -180˚, h is the effective source height, L is the line source length that is the length of a street segment, u is the average wind speed, and 0 u is the wind speed correction due to a traffic wake.The standard deviations  ( ) , and tend to unity for large x, ( ) erf 2 1 x ≥ ≈ .Atmospheric stability classes are A (very unstable), B, C and D (neutral), E and F (very stable).Consider, for simplicity, that the wind is perpendicular to the road that is 90 The two error functions model a tapering-off of the concentrations over a distance of the order of y σ at the ends of the street segment, i.e., at . For relevant x, the standard deviation y σ is small compared to the street half-length, and the ta- pering-off-effect was ignored.Both error functions are approximately equal to unity, and their sum is equal to two, and Equation ( 17) reduces to ( ) ( ) Next, the vertical standard deviation is modeled.Turbulent wakes or trailing vortices behind vehicles form at fluid mechanical Reynolds numbers Re greater than about 1000 where ρ is air density, v is traffic flow velocity, l is the size of a vehicle and µ is air dynamic viscosity.Wake turbulence mixes released pollutants [36].Air at 20˚C and atmospheric pressure has .Thus, congested traffic has turbulent wakes.Volume averaged turbulent kinetic energy increases linearly with velocity [38], while turbulent mixing by vehicle interaction increases by decreasing velocity.Immediate turbulent mixing is assumed.
Consider two cars A and B, with B in front of car A. In congestion, the distance between the inlet suction of car A and the tailpipe outlet of a car B may be one meter.The exhaust gas of car B is almost directly sucked into car A, and the people in car A are heavily exposed to pollution.At this stage, this added congestion exposure is ignored.
Turbulent mixing increases the size of the emission source by ) Empirical Pasquill Gifford sigmas [39], were made analytical by Green et al.
( ) ( ) The tailpipe and the suction inlet have small vertical positions compared to the mixing length, . Thus, the two exponential terms in Equation (19) are both approximately equal to unity and their sum is equal to two, so that: Velocity is the key variable of pollution concentration.Next, exposure is modeled.

Exposure from Traffic Map Trajectories
Human exposure is concentration times the residence time [20] [41], as follows: where i X is the total exposure for person i over a specified period, jk c is the concentration of pollutant j concentration in microenvironment or street segment k, ik t is the residence time of the person i in segment k, and K is the total number of microenvironments.Individual time-activity patterns are mapped by smartphone location service trajectories.Exposure depends on two types of trajectories: i) Fixed-time trajectory: Individual trajectory of fixed time duration, such as the working hours of taxi drivers, and people residing near a street with heavy traffic; and ii) Fixedroute trajectory: Individual who has to move from location A to B, no matter how long time it takes, such as a commuter who travels the same route from home to work every workday.Fixed time exposure is a sum over time intervals ik t up to a given total time The ( ) jk c p includes the sum of concentration contributions from all street segments and is mathematically a convolution.Residents may have a large daily time T but not directly at the peak pollution on the street.Now, consider a fixed route trajectory.The residence time ik t of exposure at street segment k is related to traffic flow velocity ik v and length ik s of the road segment as: Solved for ik t and inserted into the exposure ( ) where residence time and velocity for a given street segment are functions of time, ( ) During rush hours, the exposed time for a fixed route is longer than outside rush hours.Equation (31) , is derived by Taylor expanding [42] Equation (30) to leading order in a Taylor expansion: The exposure is given by the travel time, , and the exposure per travelled distance may become large.Hence, congestion is a high pollution exposure regime.
In the high velocity regime, the velocity effects of travel time and working engine cancels, and exposure is proportional to vehicle density ( ) where . In the limit of free flow velocity the exposure vanishes.The high-velocity regime is a low exposure regime.To the best of our knowledge, the discovered effect of velocity on exposure new.In the next section, the input parameters to the model are specified.
For the specification of input parameters to the traffic exposure model, Engine emission factors for pollutants are shown in Tables 1-3 from US EPA [22].The free float velocity 0,k v can be set equal to the speed limit.The maximal traffic density max,k n is equal to a dense packing of vehicles on the road.For example, 7 m per vehicle gives a density of 1/7 vehicle per meter, or 143 vehicles per kilo- Light-duty gasoline-fueled vehicles, up to 2722 kg (6000 lb) GVW; gasoline-fueled passenger cars.
MC 7  Motorcycles; only those certified for highway use, all are gasoline-fueled 1 Light-duty gasoline-fueled vehicles; 2 Light-duty gasoline-fueled trucks; 3 Heavy-duty gasoline-fueled vehicles; 4 Light-duty diesel vehicles; 5 Light-duty diesel trucks; 6 Heavy-duty diesel vehicles;  15), the density-velocity shape parameters are assumed to be fixed.
The traffic map provides static data such as street segment length ik s and orientation, and dynamic velocities.A smartphone location service provides trajectories.Weather conditions (e.g., wind direction, and atmospheric stability classes) can be given by a near real-time weather map layer.Currently, traffic and weather map layer data are not available for public use, so collaboration with data providers is needed.

Health Impact
A person moving through a city accumulates a dose of pollution through exposure that gives an incremental increase in health risk that is statistically reflected in the public health.Traditionally, one distinguishes between short-term (i.e.minute, hour, day) acute exposure to pollution that may result in headache/irritation or an asthma attack, and long term, years to lifetime, exposure that can lead to chronic effects including cancer, chronic obstructive pulmonary disease, and neurological problems.
The dose equals concentration times respiration rate times duration and is linear in exposure.The respiration rate, for normal adults is 12 -20 breaths per minute.Each breath volume (or tidal volume) is about 5 liters or 30 -37 ml/kg and total lung volume is about 6 liters.An average of 16 breaths per minute gives a standard deviation of ±25%.Respiration rate increases with increasing heart rate, possibly linearly.Except for runner, bikers and other high-activity persons, people in traffic are passive in a vehicle and have a heart rate close to the resting heart rate; in the range 60 -100 beats/minute.
It is assumed that the risk R (both for an individual and for population) saturates at a maximum level max R , where an increase in exposure gives no further increase in the risk.The exposure level that saturates the risk depends on the seriousness of the risk.For example, the risk of a slight headache due to traffic pollution will saturate at a small exposure, while number of years lost due to early death will saturate at an extremely high exposure.The saturation effect can be modelled by a logistic differential equation as: For small risks max R R  , the risk grows exponentially as function of exposure with a rate r.The growth rate is reduced linearly as the risk increases, and stops growing at maximum risk max R .The logistic risk differential equation can be solved analytically by partial fraction expansion after the R-terms on the right hand side of Equation ( 34) are moved to the left hand side of Equation (34).The initial condition is a background risk Consider a far-from-saturation regime, let ( ) ( ) Then the following sequence of approxima- tions is justified: Divide ( 36) by b R and then subtract unity from both sides and obtain: where b b rX α = . This approximation applies to serious health effects such as early deaths.Since the relative increase in risk is proportional to the relative increase in exposure, the exposure figures can be used as a proxy for health risk figures.
Traffic pollution's impact on health depends both on accumulated exposure (one cause) and on the vulnerability of the person.For example, children and elderly people are more vulnerable to pollution, but also less exposed in traffic.Other factors are body weight, other diseases such as asthma, and exposure to other sources of pollution.

Predictions of Future Exposure and Health Risk
Traffic maps predict one-hour or daily traffic based on historic and current traffic.Individual preferred route selection can be optimized by weighting "time to target location" versus "pollution exposure to target location".Cities have a typical daily M-shaped density peak of morning and afternoon rush hours due to the tidal flow of commuters.
Moreover, one may predict population health risk to optimize urban planning of transportation infrastructure, and residential and working areas.It may even be possible to develop urban simulators as a digital twin to the city where every person in the city has simulated trajectories and automatic collection of exposures and health risks, and used to answer "what if" questions as a valuable tool for politicians and urban planners.

Results and Discussion
The plots in Figures 1-11 are explained in Table 4.
Figure 3 shows the linear single vehicle emission 0, in Equation ( 6) for the pollutants: VOC, THC, and NOx, using the emission rate data per pollutant and type of vehicle from the US EPA [22] (see Tables 1-3).
Figure 4 compares the default van Aerde model [33] and the MacNicholas' [31] model tuned to fit the curve shape of ( ) n v .Figure 5 shows the traffic flow rate ( nv ) against the vehicle density by Equation (11).The maximum flow rate of 2047 vehicles per hour is given by a vehicle density of 40 vehicles per kilometer.Figure 6 and Figure 7 shows the inverse of vertical .Equation (20) shows that the vertical standard deviation ap-      pears as an inverse in the concentration, so the inverse standard deviation is representative of the decaying amplitude of concentration away from the street.Results showed that the calmer meteorological situation (i.e., very stable atmosphere stability class) leads to the higher pollution concentration, and verse versa.
Figure 8 shows the concentration of pollutants versus velocity.The concentration has a maximum of 1501 g/m 3 for a traffic flow velocity of 45 km/h.
Figure 9 shows pollution exposure per length as function of traffic flow velocity together with travel time (or inverse velocity) that is the limit solution for low velocities and density that is the limit solution for high velocities.It is clearly seen that the actual data looks similar to the limit solutions in the applicable Figure 10 shows concentration time's vehicle density ik ijk n c that is a meas- ure for a road segment's contribution to exposure per unit time.The highest contribution to total population exposure is at velocities about 15 km/h.The contribution to exposure at the peak for NOx is about 30 times higher than at maximum velocity, while the concentration is almost constant.This indicates that traffic velocity is an extremely important parameter for traffic pollution health risk.Figure 11 show the segment's contribution to total exposure per unit length is k ik ijk k ik s n c s v in terms of ik ijk ik n c v that is the total contribution to exposure per unit square length for time spent on the segment.Policies reflected by authority regulations are in Norway given as an annual (or winter) average maximum value of pollution concentration at 2 -3 m height above the ground.A yellow limit for NO 2 is 40 μg/m 3 winter averages and a red limit is 40 μg/m 3 annual averages [43].Comparing Figure 8 and Figure 9, it is seen that increasing the velocity or capacity of roads from congested velocities of 25 km/h to 65 -70 km/h, would keep the concentration about constant but reduce health risk by 50% and be Pareto optimal [44].Traffic map companies have developed methods to predict future or typical traffic based on current and historic traffic.The future traffic can be predicted on a short-term basis, typical one hour.By combining smartphone trajectory, this traffic prediction can be used to predict the next hour exposure for a given planned route.It would then be straight forward to compare several possible routes and make an intelligent choice based on weighting "time to target location" versus "pollution exposure to target location" and optimize the route based on individual preferences.
Traffic is well-known to display certain typical patters.Large cities have a typical daily M-shaped density peak of morning and afternoon rush hours due to the flow of commuters in and out of the city.The peak sizes vary typically with weekdays.This information can be used in projecting future long term exposure, identify high exposure groups and check if negative health impact is well-correlated to high exposure groups.
Further one may use the prediction or average maps to estimate where the fu- ture population health risk is highest and direct infrastructure investments to minimize a combination of "population travel time" and "population health risk".Our modelling of exposure showed that the high exposure at low velocity scales as 1 ĩjk ik X v − and this correlates well to the traffic map itself, since a small ik v gives both high exposure and congestion.Most people travel on the peaks of the M-shaped rush hour peaks it is clear that just reducing the size of the rush hour peaks would lead to a significantly improved population health.Exposure maps and health risk maps could be a highly useful tool for urban planning of transport infrastructure in interaction to where people live and work.Even more useful for predictions would be to develop urban simulators where every person in the city have simulated trajectories and would then get simulated exposures and health risks.An urban simulator could then be used to answer all kinds of "what if" questions.An urban simulator could be a highly valuable tool for politicians and urban planners.We predict that by 2030 urban trajectory simulators are routinely being applied in urban planning.

Conclusions
It is feasible to combine traffic maps data with smartphone location service trajectories and big data analytics to simulate near real-time traffic air pollution exposure and health risk.Advantages of the approach are: i) low cost, ii) near real-time, iii) effortless citizen participation, and iv) global scalability.
Nearly every modeling step of traffic air pollution depends on traffic velocity.A traffic map is a super-efficient pre-processor for calculating real-time traffic pollution.
Universally, the exposure and health risk has a peak at lower velocities than the peak of concentration.Congestion is a higher health risk than conventionally believed.

σ
standard deviations from the Gaussian plume model where it is used that

Figure 6 .
Figure 6.Inverse of vertical standard deviation (1/m) vs. distance by stability classes (A: very unstable, B, C and D: neutral, E and F: very stable).

Figure 7 .
Figure 7. Inverse of horizontal standard deviation (1/m) vs. distance by stability classes (A: very unstable, B, C and D: neutral, E and F: very stable).

Figure 8 .
Figure 8. Pollutants concentration amplitude as function of traffic flow velocity.
exposure is decreasing for all four types of pollutants, i.e., VOC, THC, CO and NOx with increasing velocity.

Figure 9 .
Figure 9. Pollution exposure per length as function of traffic flow velocity.

Figure 10 .
Figure 10.Plot showing contribution to exposure by a road segment.

Figure 11 .
Figure 11.Relative size of total exposure per vehicle ik ijk ik n c v .