Elsevier

Physics Reports

Volume 734, 6 March 2018, Pages 1-74
Physics Reports

Human mobility: Models and applications

https://doi.org/10.1016/j.physrep.2018.01.001Get rights and content
Full text access

Abstract

Recent years have witnessed an explosion of extensive geolocated datasets related to human movement, enabling scientists to quantitatively study individual and collective mobility patterns, and to generate models that can capture and reproduce the spatiotemporal structures and regularities in human trajectories. The study of human mobility is especially important for applications such as estimating migratory flows, traffic forecasting, urban planning, and epidemic modeling. In this survey, we review the approaches developed to reproduce various mobility patterns, with the main focus on recent developments. This review can be used both as an introduction to the fundamental modeling principles of human mobility, and as a collection of technical methods applicable to specific mobility-related problems. The review organizes the subject by differentiating between individual and population mobility and also between short-range and long-range mobility. Throughout the text the description of the theory is intertwined with real-world applications.

Keywords

Human mobility
Human dynamics
Random walks
Origin–destination matrices

Nomenclature

    ABM
    Agent-based Model
    ACS
    American Community Survey
    ATM
    Air Transport Management
    ATRS
    Air Transport Research Society
    CDF
    Complementary Cumulative Distribution Function
    CDF
    Cumulative Distribution Function
    CDR
    Call Detail Record
    CLT
    Central Limit Theorem
    CO2
    CODA
    Central Office for Delay Analysis
    CTRW
    Continuous Time Random Walk
    EPR
    Exploration–Preferential Return
    GIS
    Geographic Information System
    GLEaM
    Global Epidemic and Mobility Model
    GPS
    Global Positioning System
    GSM
    Global System for Mobile Communication
    ICT
    IRS
    Internal Revenue Service
    JATM
    Journal of Air Transport Management
    LBSN
    Location-Based Social Network
    MSD
    Mean Square Displacement
    OD
    Origin–Destination
    OSN
    Online Social Network
    PDF
    Probability Density Function
    RMSD
    Squared Root of the MSD
    RNG
    Random Number Generator
    RW
    Random Walk
    SMS
    Short Message Service
    SOI
    Statistics Income Division
    WAN
    World Airport Network

1. Introduction

1.1. Motivation and history

While the term mobility has multiple connotations, in the context of this review it refers to the movement of human beings (individuals as well as groups) in space and time and thus implicitly refers to human mobility. Indeed, from the migration of Homo sapiens out of Africa around 70,000 years ago, through the European discovery of the “New World”, to the existence of expatriate populations in contemporary times, the existence of human beings has always been inextricably linked with their movement. While earlier movement patterns were primarily driven by factors such as climate change, inhospitable landscapes, conflict and food scarcity, in modern times, socio-economic factors such as wage imbalance, differences in welfare and living conditions and globalization play an increasing role.
From hunters and gatherers of prehistoric times to present-day commuters of large metropolitan areas, humans are bound to move on a daily basis in order to earn their livelihood. However, daily trips are also undertaken to perform social and leisure activities. The temporal and spatial scales of these trips are much shorter than those of migratory flows, and they are often characterized by the regularities and periodicities that mark human lives. The daily movement of a large and growing population has important impact on the lives of the individuals and the environmental conditions. Studies conducted in Europe and the United States found that the average household spending on transportation is between 15 and 25 percent of the total expenditures, making transportation the second largest expenditure category after housing. Transportation is also the second source of greenhouse gas emissions to the atmosphere. From these few examples it should be clear that mobility has an enormous impact on human societies and an accurate quantitative description of human mobility is of fundamental importance to understand the processes related to human movement and their impact on the community and the environment. A quantitative theory of human mobility ought to be able to provide answers to relevant questions, such as, what determines the decision to start a trip? which factors determine the choice of the destination? to what extent is human movement predictable, and what is the intrinsic degree of randomness? is it possible to find general rules or laws to explain empirical patterns and regularities exhibited by travels in many diverse countries, such as the distribution of commuting times and distances, and the degree of predictability of future whereabouts?
While the study of human mobility currently spans several disciplines, arguably, geography was the first discipline to analyze mobility data and put forward corresponding theories to describe travel patterns. Indeed, the pioneers of quantitative and theoretical geography in the 50’s defined “geography as (the scientific discipline of) spatial interaction ”  [1]. Early quantitative studies of the movements of people and vehicles were held in large US metropolitan areas [2] (see the first chapters of [3] for details), and initial studies of human travel were of scales ranging from international migrations [[4], [5], [6], [7]] to journey-to-work commuting [[8], [9], [10], [11], [12]]. Indeed, the elucidation and understanding of these patterns was motivated by its relation to several real-world applications such as traffic forecasting [[3], [13], [14]], urban planning [[15], [10]], internal security [[16], [17]] and epidemic modeling [[18], [19], [20]] to name but a few.
The first systematic analysis of the concept of distance as a constraint to movement was proposed in the 19th century: in his 1965 review [21] Gunnar Olsson cites Henry C. Carey’s Principles of Social Science (1858) as the first work to explicitly make the observation about the amount of interaction between two cities being proportional to their population size and inversely proportional to the intervening distance. Few decades later, the geographer Ernst Ravenstein further developed and popularized the idea in a seminal work where he formulated his laws of migration [22]. Further refinements on this theme were made in the 1940’s by the American sociologist Samuel Stouffer [[5], [23]] in his law of intervening opportunities, and by the American linguist George Kingsley Zipf [6]. Zipf’s formulation led to what is now conventionally referred to as the gravity law. The increasing availability of datasets on population movements at various levels of granularity, coupled with the quantitative revolution in geography [[24], [25], [26]], led to the introduction of more sophisticated mathematical methods such as hidden Markov models and diffusion processes. Gender and socio-economic factors behind population movement were further analyzed thanks to richer datasets resulting from surveys and interviews [[27], [28], [29]], and through theories of labor economics focusing on wage differential between locations [30]. Thus through the 20th century, contributions to the theories of human movement were made chiefly in geography, sociology, and economics, while the scale at which this was primarily studied was at the population level.
To provide context for what is to follow, we briefly describe a selection of influential historical contributions, keeping in mind “Stigler’s law of eponymy” which states: “No scientific discovery is named after its original discoverer” [31]. However, if we restrict ourselves to the recognition of distance as a primary factor in determining movement and interactions between places, then it is reasonable to start with the work of Ernst Georg Ravenstein [22].
Laws of Migration. Ravenstein was a German–English geographer who made important contributions to cartography as well as providing one of the first rough estimates of a “maximal” global population based on resource consumption. He was also one of the first to attempt an explanation and prediction of migration patterns within and between countries. Considering the effect of distance as well as the type of migrant (male or female, old or young) as primary factors, he posited the following seven laws:
  • 1.
    Most migrants only travel short distances, and “currents of migrations” are in the direction of the great centers of commerce and industry given that these can absorb the migrants.
  • 2.
    The process of absorption occurs in the following manner: inhabitants of the areas immediately surrounding a rapidly growing town flock to it, thus leaving gaps in the rural areas that are filled by migrants of more remote districts, creating migration flows that reach to “the most remote corner of the kingdom”.
  • 3.
    The process of dispersion is inverse to that of absorption, and exhibits similar features.
  • 4.
    Each main current of migration produces a compensating counter-current.
  • 5.
    Migrants traveling long distances generally go by preference to one of the great centers of commerce or industry.
  • 6.
    The natives of towns are less migratory than those of the rural parts of the country.
  • 7.
    Females are more migratory than males.
Ravenstein added another two laws in 1889 [32]:
  • 8.
    Towns grow more by immigration than by natural increase.
  • 9.
    The volume of migration increases as transport improves and industry grows.
While the laws are non-quantitative and observational in character, Ravenstein correctly identified socio-economic factors as well as distance-constraints to be the essential ingredients behind modern population movement. Consequently, his laws stimulated an enormous volume of work, and although they have been refined and adjusted over the years, the essential ingredients of his formulation remain relevant even today.
Law of Intervening Opportunities. One of the most important refinements was made in the 1940’s by the American sociologist Samuel Stouffer [5]. Roughly speaking, Stouffer was looking to expand upon Ravenstein’s observations regarding migrants moving shorter distances and flocking to commercial centers. To account for this, he proposed that the number of people going a given distance is directly proportional to the number of opportunities at that distance and inversely proportional to the number of intervening opportunities. In other words, trips between two locations are driven primarily by relative accessibility of socio-economic opportunities that lie between those two locations. In this context, opportunity is defined as a potential destination for the termination of a traveler’s journey, whereas an intervening opportunity is one that the traveler rejects in favor of continuing on. In Stouffer’s original formulation, this can be mathematically expressed as (1)dy(r)dr1xdx(r)dr,where y(r) is the cumulative number of migrants that move a distance r from their original location, and x(r) is the cumulative number of intervening opportunities. Assuming that x itself is a continuous function x(r) of distance, then the expression above can be integrated to yield (2)y(r)=logx(r)+C,where C is some constant denoting the number of opportunities at the origin location. Thus, the relationship between mobility and distance is indirect and is established only through an auxiliary dependence via the intervening opportunities: the higher the number of intervening opportunities between two locations at distance r, the smaller the number of migrants that would travel that distance. This may explain why rural migrants may flock to urban centers over large distances, whereas those already in commercial centers are comparatively more stationary.
Distance-Decay and the Gravity Law. Around the same time as Stouffer, the Harvard philologist, George Kingsley Zipf, was expanding upon his famous observation of the rank-frequency dependence in linguistics; the eponymous Zipf’s law, where the frequency of a word ranked z – in terms of usage – has the statistical dependence fz1z [33]. Zipf found that this relation was expandable to other realms of society, specifically the size of cities [34], where the occurrence of a city with population P and consequent rank z also follows the relation (3)Pz1zα.Broadly speaking, Zipf’s argument for this relation was due to the tension between two competing factors. The first, which he called Force of Diversification relates to the likelihood of populations living close to the source of raw materials (commodities) in order to minimize the cost of transportation to production centers. The second effect, referred to as the Force of Unification, is the tendency of populations to aggregate in urban centers due to the minimization of work required to transport finished products to consumers. While the former leads to the formation of multiple urban centers (given that the commodity sources are not localized in one part of a country) with smaller populations, the latter has the competing effect of urban agglomeration in a few centers of large population. Assuming some kind of equilibrium between these quasi-forces, the rank-frequency relation of Eq. (3) with α=1 naturally follows [35]. Deviations from equilibrium, where one force dominates over the other, then leads to a change in the exponent.
Carrying the argument further, under somewhat unrealistic assumptions of equitable share of national income as well as urban centers being self-sufficient (i.e. production and consumption at equal levels), the share of any center i in the total flow of goods is proportional to its population Pi. Therefore, the flow of goods between two centers is proportional to the product of their populations. Finally, if one would like to minimize the cost and work associated with transportation of goods, this flow must be inversely proportional to the distance between centers. Putting all this together we arrive at the relation (4)wijPiPjrij,where wij represents the flow of goods between two population centers i,j and rij is the distance between these two centers. Zipf tested his theory on both freight and population movement data and got good qualitative agreement. This formulation is of course quite different form Stouffer’s explanation as the effect of distance is quite explicit in Eq. (4). Indeed, the form of Eq. (4) is such that it leads to a distance-decay effect suppressing long-range movement. Furthermore, the flow of populations and goods is seen to be as a result of some dynamic equilibrium between the cost of transportation, manufacture, and distribution of goods and services. The references to “forces” as well as its functional dependence on distance eventually led to Zipf’s formulation being dubbed the Gravity law in analogy to Newtonian mechanics.
The common theme connecting Ravenstein, Stouffer and Zipf of course is the geographical distance, though its functional effect on movement is quite different in the Intervening Opportunities and Gravity Law models. Nevertheless, both these models were quite influential on subsequent work, setting off two major strains of parallel research as well as attempts at unification.
Time geography and ICT Data. Measuring, understanding and forecasting the displacements of individuals in space and time has long been part of the program of quantitative and theoretical geography, a branch of geography “born” academically in the 1950’s [[25], [26]]. While the first efforts in capturing human displacements focused on the aggregated levels of flows between spatial units, some also focused on individual trajectories. Torsten Hägerstrand, a Swedish geographer, laid down in the early 1950’s the basis of time geography, and brought a number of conceptual and graphical tools to formalize the trajectories of individuals through space and time. His seminal work [36] remains famous for its proposal to represent individual trajectories in a cube (also known as the “space–time aquarium”), in which the horizontal plane represents the geographical space, while the vertical axis represents time (as depicted in Fig. 1). Hägerstrand proposed a number of graphical conventions (a “notation system” in his own words) to picture the constraints imposed by social life on individual daily trajectories. He also provided means to represent the co-presence and synchronization of several individuals in space, and more generally a set of (essentially graphical) conventions useful to represent the structure and behavior of individual human mobility.
Time geography was naturally invoked (and somewhat rediscovered) in the 1990’s, when the modeling of human mobility shifted towards individual-based simulation (micro/multi-agent/agent-based simulation) [37]. This can be understood in the twin contexts of increased computing power and the development of more expressive programming frameworks, allowing for semantically richer and more ambitious models of human dynamics. However, while the models increased in complexity, they were somewhat artificial; relevant data of comparable complexity and resolution was not available for their calibration. Indeed, while the models progressed, the data lagged behind and the best one had to work with was longitudinal survey data collected since the 1970’s.
  1. Download: Download high-res image (208KB)
  2. Download: Download full-size image

Fig. 1. The cubes of time geography, as first proposed by Torsten Hägerstrand in [36]. The geographical space is represented by the 2D plan, while time is figured by the vertical axis. (Left) The two curves represent the daily space–time trajectories of two individuals living in the same neighborhood and working in the same place. (Right) The geographical footprints continuously and passively produced by individuals through the use of their ICT devices allow to approximate their trajectories. While these re-constructed trajectories are partial and contain errors that might mislead the understanding of underlying trajectories, they are nonetheless more precise nowadays than they were 10 years ago, and produced by a constantly growing number of individuals worldwide.

The beginning of the 21st century saw the introduction and subsequent widespread adoption of mobile phones, as well as the pervasive usage of Global Positioning System (GPS). This led to an exponential increase in data-generation on human movement. Coupled with further progress in computing power and sophisticated data-mining methods, it enabled to capture the movement not just of populations at finer levels of spatial granularity, but potentially of individuals. In particular, the large volume and frequency of call detail records (CDRs) from mobile phones (see Section 2.3) enabled the analysis of human movement at very fine temporal scales. Thus statistical information on mobility became available, on the scale of hours to decades; from the individual, through communities to the level of country-wide populations.
This review predominantly focuses on these later developments. As we will see throughout the text, these new sources of data have rejuvenated the scientific interest for human mobility, opening the door to new questions and measures, as well as enhancing and validating the insight gained from studying traditional data sources. For reasons which are probably technical, historical, and societal, research teams in physics and computer science have broadly “invested” in these new georeferenced (meta)data resulting from human activity [38]. This is evidenced by a quick search on the online website arXiv1 for physics papers that include the terms “Human Mobility” or “Mobility Patterns”. As seen in Fig. 2, since about 2004–2005, the number of such papers displays an almost exponential increase. One must keep in mind that this is probably an incomplete sample, and is underestimating the volume of research in this field. It is likely that one would see an even more dramatic trend if a more comprehensive list of publications were to be included.
While this growing interest of the Physics community in human mobility has multiple reasons, it is somewhat to be expected given the traditional interest of statistical physicists in studying emergent collective properties of systems of many interacting particles and in describing the dynamics of randomly moving particles undergoing (anomalous) diffusion. The availability of data at multiple spatio-temporal scales is helping uncover robust statistical patterns as well as aiding the development of phenomenological theories of human mobility, that are well suited to the tools, methods and paradigms of statistical physics.
  1. Download: Download high-res image (184KB)
  2. Download: Download full-size image

Fig. 2. Fraction of papers on arXiv.org with mention to the terms “Human Mobility” or “Mobility Patterns” from 2004 to 2015. The growth in the number of published papers displays a dramatic increase around 2008. Data source: arXiv.org (last date accessed: Feb 16, 2017).

1.2. Scope and limitations

The recent pace of developments and the volume of work published are such that a comprehensive review of the findings appeared necessary. Moreover, the study of human mobility is a highly interdisciplinary endeavor and progress has occurred in parallel across different academic communities, sometimes with one unaware of the others’ works. This is reflected in different terminology of common metrics and similar results being cast in different and seemingly unrelated contexts. Consequently with this review, we also aim at (i) bringing disparate communities together by unifying the findings in a common context, and (ii) providing new researchers in the field with an accessible starting point and a minimal set of tools, metrics, concepts and models.
One must note that humans, of course, do not move in a vacuum. Just as the diffusion of molecules in materials is mediated by the structural and thermodynamic properties of the material, similarly, spatio-temporal patterns of human movement are necessarily shaped by spatial constraints and limitations of geography. Examples of this are the topography of urban centers or the pattern of roads and transportation infrastructure, the properties of which are studied under the aegis of spatial networks [39]. Yet, this survey is not about Spatial Networks, nor do we give details on the so-called “science of cities” (e.g., city forms, properties of urban transport networks, urban scaling laws, etc. [40]); these two aspects will be considered only when made necessary for the reader to understand the concepts related to human mobility.

1.3. Organization

We begin in Section 2 with a comprehensive (but by no means exhaustive) list of data sources used for empirical studies. In Section 3 we introduce key metrics, measures, spatio-temporal scales, as well as an overview of the fundamental physics behind the study of human mobility. In Section 4 we describe the state-of-the-art families of models (both generative and phenomenological) that best describe the empirical observations of human mobility. The models are categorized according to scale, starting from the level of individuals, through to the collective level of populations, and finally a mixture of the two incorporating the concept of (inter) modality. Continuing with the theme of scales, in Section 5 we describe some selected applications of the framework and models ranging from intra-urban movement to flows between countries and continents including the case of epidemic spreading, transportation systems, and a brief digression on new results on virtual mobility (web browsing). Finally, we conclude in Section 6 with challenges and future directions for the field. We also added Appendix where we provide descriptions of some basic tools and algorithms (agent-based modeling, random walkers, etc.) that may be of use to researchers making initial forays into the field.

2. Data sources

A natural starting point is to describe the nature of empirical data which has been used in mobility research. Indeed, empirical data has been vital to both aggregate and individual mobility investigation, by providing means of parameter calibration as well as model validation. In this section, we outline the main sources available for mobility research and the relevant information that can be extracted from them. The data sources are presented in (rough) chronological order of their historical availability and consequent use in research.

2.1. Census data & surveys

Census data is collected in periodical national surveys in which householders are asked questions about the socio-demographic and economic status of the household members. Of particular interest in terms of mobility are questions related to the location of the workplace, or the place of current and previous residence. Collectively, this information can then be used to estimate commuting flows or internal migration flows within a country.
Different countries introduced national censuses at different times, and the type of information collected also varies across countries and time periods. In the United Kingdom, for example, the first national census was done in 1841 and contained the names, ages, sexes, occupations, and places of birth of each individual living in a household. In 1921, the location of workplaces was added to the questions asked, and since 1961 the location of previous residence was also included. In the United States, the first population census was taken in 1790, while information on workplace and transportation activities was incorporated in 1940 [41]. Nowadays, national censuses are held in most countries typically every ten years. Below we outline some main sources of census and survey data along with the methodology used to extract commuting and migration flows.

2.1.1. Census data

The data on commuting trips between United States’ counties is available from the United States Census Bureau.2 These county-to-county work flow files are available for each state as well as at the aggregated level across the country. Furthermore, files are available in either county of residence format – containing work destinations of people who reside in each county – or county of work format—which contain the county of residence of people working in each county. The inclusion of a commuter in the data is preconditioned upon them working during the week leading up to the census. Due to the questionnaire design, any individual not satisfying this condition is precluded from entering a workplace location. A worker is defined as someone who has spent at least 1 h in paid employment or 15 h in a voluntary role. Any individual required to provide their workplace location was asked further questions about their mode of transport and travel time.
From this information, the census Bureau compiles and publishes aggregated flows corresponding to the residence and workplace location of everyone classed as a worker. In order to validate models and determine statistics such as trip distance and relationship to the origin, destination and intervening population density, further information on the spatial distribution of counties and socio-demographic data is required. This supporting data is available online.3 Among other things, such census data is invaluable for validating generative models of commuter flow and migration. For instance, Census data from the 2000s (consisting of county level work flow data, corresponding to 3141 counties and 34,116,820 commuters within the United States) was used to evaluate the so-called radiation model (described in Section 4.2.2) by Simini et al.  [42].
Commuting and migration flows data for the United Kingdom is also available online.4 The information is available at different spatial resolutions: ranging from the geographical area covered by a single postcode, to areas covered by the 7201 electoral wards. Files contain origin and destination flows divided into subsets of age and gender allowing for a deeper socio-demographic analysis if required. There have been many different countries whose commuting data has been analyzed at the level of municipalities (or smaller administrative units such as zipcodes or electoral tracks) including France5 and most European countries. In most cases, when not directly available on an open data portal, commuting data is available to researchers on request from national statistics institutes (see Fig. 3).
  1. Download: Download high-res image (564KB)
  2. Download: Download full-size image

Fig. 3. Commuting flows compiled from census data. Left panel: The state of Florida partitioned according to its counties. Right panel: Commuting flows between counties, where thickness of lines correspond to volume of flow. Data compiled from the United States Census Bureau.

2.1.2. Tax revenue data

An alternative source for estimating aggregated migration flows is the Statistics of Income Division (SOI) of the Internal Revenue Service (IRS) in the United States. The IRS maintains records of all individual income tax forms filed in each year. Using individual tax return files, it is possible to determine who has, or has not, moved residence or workplace locations in the intervening fiscal year. To do this, first, coded returns for the current filing year are matched to coded returns filed during the prior year. The mailing addresses on the two returns are then compared to one another. If the two are identical, the return is labeled a non-migrant. Any relevant information change during the fiscal year results in the return labeled as that of a migrant. US migration data from 1992–1993 to 2006–2007 is available online.6 Alongside tax returns, estimated migrations flows are also included in each Census,7 based on yearly surveys carried out by the American Community Survey (ACS).

2.1.3. Local travel surveys

An important source of data used to construct trips, and therefore flows between two locations, is local surveys. It provides an advantage over censuses as it is possible to collect a more detailed dataset that includes information such as the purpose of the trip and mode of transport used. However, the increase in accuracy, resolution, and variety of meta-data leads to a corresponding sacrifice in scale. As such, surveys typically have a much lower number of respondents compared to a census. Additionally, these typically cover smaller areas such as a single city (or indeed neighborhoods within cities), as opposed to the level of a state, or an entire country.
A representative example is provided by the household Travel Tracker Survey for the Chicago metropolitan area, conducted between 2007 and 2008, and carried out by the Chicago Metropolitan Agency for Planning.8 This survey contains information on travel activities of household members, among other details. Information was collected from a total of 10,552 households over a one or two-day period, and households were required to provide a detailed travel inventory for each member, including details such as trip purpose, transport mode, departure and arrival times, and public transport information such as boarding location, distance to final destination, and fare paid. A similar survey is available for the Los Angeles region9 and was analyzed by Liang et al.  [43], where 46,000 trips between 2017 zones within the Los Angeles county were extracted. The use of small scale surveys is often augmented by other forms of data (for example GPS tracks, see Section 2.4) in order to have accurate records of an individual’s position combined with an annotated description of the purposes of each trip [[43], [44]].
While historically most prevalent, census data provides us only with coarse-grained patterns of human migration and movement. Surveys, on the other hand, provide far more detail but on a much smaller scale, thus limiting their use in terms of statistical validation of models. While the census reveals little information about where people spend their time, or where and how they travel, at the survey level, data reliability is restricted by scale and self-reporting errors. Thus both sources lack the ability to provide a dynamic picture of human mobility [45].

2.2. Dollar bills

An unusual (and perhaps not immediately apparent) source of mobility data is related to the tracking of currency. An archetypal example of this are online bill-tracking websites, which are designed to monitor the dispersal of individual bank notes worldwide. The movement of a bank note between geographical locations occurs when it is carried by an individual, therefore data collected from such websites may be used to infer trajectories and patterns of human mobility. Brockmann et al. [46] obtained trajectories of 464,670 dollar bills from the website www.wheresgeorge.com, allowing for an analysis of bank note dispersal in the United States (excluding Hawaii and Alaska). The dataset contains entries corresponding to time-stamped reports of the geographic location of individual bills, numbering around 250 million as of 2017. All bills registered on the site are initially entered by a user which involves the input of their ZIP code and a serial number printed on the bill. While the majority of the bills are not entered more than once, around 11% of the bills are reported multiple times, with 3–5 entries per bill being fairly common (Any more than 5 entries is considered rare). Once a bill has been registered, any future hits allow for time and distance between reports to be determined and recorded in the database (see Fig. 4).
  1. Download: Download high-res image (808KB)
  2. Download: Download full-size image

Fig. 4. Trajectories of bank notes originating from four different locations. Tags indicate initial, symbols secondary report locations. Lines represent short time trajectories with traveling time T<14 days. Lines are omitted for the long time trajectories (initial entry: Omaha) with T>100 days. The inset depicts a close-up of the New York area. Pie charts indicate the relative number of secondary reports coarsely sorted by distance. The fractions of secondary reports that occurred at the initial entry location (dark), at short (0<r<50 km), intermediate (50<r<800 km) and long (r>800 km) distances are ordered by increasing brightness of hue. The total number of initial entries are N=2055 (Omaha), N=524 (Seattle), N=231 (New York), N=381 (Jacksonville).

Figure from [46].
While a convenient and large-scale source of mobility data, the use of currency to infer patterns in human mobility is problematic. For instance, bank note dispersals do not contain information on the number of individuals that have carried a given note during two instances of measurement (i.e., when it appears in the database). Consequently, the trajectories of bank notes are likely a convolution of the mobility of several individuals; coupled with the relatively few samples per dollar bill, this makes it a rather inaccurate and problematic source for inferring individual mobility patterns. Nevertheless, similar to census data, currency tracking provides a convenient, and relatively easily accessible picture of the coarse-grained patterns of human mobility.

2.3. Mobile phone records

Perhaps the most important, game-changing data of the last decade for inferring human mobility are the mobile phone CDRs. Most cell phone companies maintain detailed records of customer information related to their calls and text messages. The information contained in these CDRs include the time of the call/message as well as the company’s cell tower routing the communication. Knowing the exact geographic coordinates of the cell tower makes it possible to estimate the approximate location of the user. As mobile phones are (typically) personal devices and are mostly carried by a single person, the corresponding location data can be used to infer a single individual’s movements over the recorded period (an example is shown in Fig. 5). Unlike the census, surveys, or currency tracking, CDRs allow for the characterization of individual mobility patterns, and – depending on the dataset – at an unprecedented spatio-temporal resolution. Furthermore, due to the tremendous trend of global cell-phone adoption, one can conduct multi-scale studies ranging from the level of neighborhoods to the country-wide level (including international travel and movement across borders).
CDR data concern both phone calls and Short Message Service (SMS) exchange and always include the following information: time stamp, caller ID, recipient ID, call duration and antenna (cellular tower) code. Due to privacy concerns, customer identifiers are anonymized before the data is made available to scientists for analysis. Quite often, additional clean up of the data is required to make the data more analyzable; for instance, one typically needs to restrict the dataset to users who visit a minimum threshold number of towers, as well as make calls at a high enough frequency to maintain statistical reliability for each user.
  1. Download: Download high-res image (427KB)
  2. Download: Download full-size image

Fig. 5. Trajectories of two anonymized mobile phone users traveling in the vicinity of N=22 and 76 different cell phone towers during a 3-month-long observational period. Each dot corresponds to a mobile phone tower, and each time a user makes a call, the closest tower that routes the call is recorded, pinpointing the users approximate location. The gray lines represent the Voronoi lattice, approximating each towers area of reception. The colored lines represent the recorded movement of the user between the towers.

Figure from [47].
One of the first work using CDRs was performed by Gonzaléz et al.  [48] on two anonymized datasets provided by a major mobile operator in a large European country. The first set contained the locations of 100,000 randomly selected mobile users whose positions were recorded over a period of 6 months, each time a call or text was received or sent. The second set (collected for validation purposes) corresponded to the locations of 206 mobile users, recorded every two hours over a period of a week. From this data, traces of mobility patterns such as the distribution of displacements between the locations of consecutive calls (jump-lengths) were measured. The temporal period between consecutive calls was also measured to determine a distribution of wait times (i.e., characteristic time spent by users in a given location). The knowledge of users’ consecutive locations also enabled the calculation of so-called return time distributions, measuring the probability of a user to return to a given location within a given time. Other more complex metrics, including the radius of gyration were also extracted. (For details on metrics see Section 3.1.)
The quality of mobility information extracted from mobile phone data depends on the frequency at which the location of the user is recorded. For example, the larger dataset used in [48] displayed an irregular call pattern; many calls were placed over a short time period, followed by long periods of inactivity. Consequently, the data displayed a highly inconsistent temporal frequency, which may confound the analysis of mobility patterns. The smaller-scale validation dataset was therefore collected to account for these irregularities. Other problematic issues also arise, such as the accuracy of recorded locations. In general, it is the position of the cell tower routing the call that is recorded and not the exact location of the individual carrying the phone. Since there is significant variability in the areas covered by cell towers – with coverage ranging from tens of meters in the densest neighborhoods of urban areas, to a few kilometers in rural areas – a recorded user in a rural area may transmit all communication via a single tower during their daily routine, while moving the exact same area as that of an urban user who may ping several towers. Thus, in the former case, despite possibly significant movement on part of the user, for purposes of analysis they are considered stationary, which is obviously misleading.
We must note that obtaining CDRs for research purposes poses significant challenges. Since they are not typically publicly available, one has to directly approach a mobile phone operator. As the data contains sensitive information [49], any data collected by a specific group of researchers may not be shared with other groups, making reproducibility problematic. Yet, recent initiatives have seen some phone companies release large scale CDR data within the context of “challenges” among researchers. The purpose of making this data available is to provide researchers some material allowing them to extract useful information to address major societal challenges. Such recent datasets are those that have been provided by Orange in the framework of their Data 4 Development (D4D) challenge. These contain four anonymized tables for mobile phone users in the Ivory Coast (Côte d’Ivoire) and Senegal. For example, in the case of Ivory Coast, data contained (i) hourly antenna-to-antenna traffic; (ii) individual trajectories of 50,000 users over two weeks with antenna location information; (iii) individual trajectories of 500,000 users over an entire year with sub-prefecture location information; (iv) communication graphs for 5000 users [50]. In particular (iii) contains information regarding individual trajectories for the entire observation period at the cost of reduced spatial resolution; position is recorded as a sub-prefecture rather than antenna location. The raw data consists of a user ID, date and time stamp of the communication and the sub-prefecture number (1–255) which can be linked to a file containing the latitude and longitude of the center of each sub-prefecture [51].
Telecom Italia’s “Big Data Challenge” is another example of a large scale CDR dataset readily available to researchers. Alongside anonymized call records, additional metadata related to weather, news, Twitter activity, and electricity data from two areas in Italy (Milan and Trentino) was provided [52]. Data from this challenge has been used to investigate the relationship between individual daily trips and personal constraints [53], estimate urban population from call volume [54] or determine the relationship between urban communication and happiness [55]. The increasing availability of such large-scale, anonymized datasets for research purposes is an encouraging trend for future research on human mobility. For more detailed information on CDR sources and some associated results (including other research topics than human mobility), see the review by Blondel et al.  [56].

2.4. GPS data

The greatest level of accuracy on movement trajectories is provided by Global Positioning System (GPS) data, which is capable of providing precise information on any location covered by at least four GPS Satellites. GPS trackers are units which receive signals from GPS satellites and compute the device’s position at regular intervals. This technology allows researchers to trace the movement of individuals with a high degree of accuracy and temporal frequency, thus providing a rich source of data that can be analyzed and directly mapped to human mobility patterns (see Fig. 6).
Rhee et al.  [58] used GPS trackers with a position accuracy of less than three meters to record the movements of over 150 individuals across two University campuses, a state fair (US), Disney World, and New York City. The position of the individuals carrying the trackers was recorded every 10 s while they were in these locations resulting in daily traces that had on average a temporal scale of 9–10 h. Additionally, Microsoft has made GPS trajectories from 182 individuals freely available online via their GeoLife project.10 The dataset consists of 17,621 trajectories recorded by GPS trackers and GPS enabled mobile devices, and has a high spatial and temporal scale for over 90% of individuals with positions recorded every 1–5 s or 5–10 m for time periods that span several years. Since its release, GeoLife has been used in studies of both human mobility [[59], [60]] and social interactions [61]. For a review of potential applications see [62].
  1. Download: Download high-res image (989KB)
  2. Download: Download full-size image

Fig. 6. Aggregated GPS position data (collected from vehicles) in a part of Florence, Italy, measured during March 2008; the red dots correspond to a recorded instantaneous velocity 30 km h−1, whereas the yellow dots correspond to velocities in the interval 30–60 km h−1 and the green dots to a velocity 60 km h−1.

Figure from [57].
Transmitters attached to vehicles [[63], [64]] are another source of GPS data. These are useful for studying questions related to urban traffic: monitoring, prediction, and prevention of congestion [64]. In Italy, 2% of privately owned cars have a GPS system; each vehicle has assigned a unique identifying number, thus enabling the tracing of anonymous individual cars’ trajectories with high precision. A GPS signal is transmitted approximately every 2 km, as well as when the engine is switched on and off [63]. Each signal is converted to a data point containing information on position, velocity and distance covered. While the data suffers from inaccuracy due to variables such as satellite coverage and position precision, the temporal precision is of high quality, as is both instantaneous velocity and distance covered. From such GPS vehicle traces, path length distributions can be measured, which corresponds to the distribution of distances traveled in single trips, where a trip is defined as travel that occurs between instances of the engine being started and stopped. Activity distributions, which are equivalent to waiting time distributions in other studies, can also be determined from the elapsed time between consecutive trips. A further advantage of GPS traces over CDRs is that the locations are recorded to a much higher degree of accuracy and at a constant frequency. Yet, there are drawbacks: GPS transmitters do not work indoors and generally rely on batteries as their source of power. As a result, traces can contain periods during which no signal is received or may be terminated prematurely. Furthermore, GPS datasets typically feature a smaller number of individual users, up to several thousand, in comparison to mobile phone data which can provide information on the mobility of millions of users.

2.5. Online data

Another valuable source of location information are Online Social Network (OSN) or Location-Based Social Network (LBSN) services, that attract hundreds of millions of users worldwide. Since the introduction of GPS and wifi chips in smartphones, social network providers have been able to collect valuable data on both the social connections as well as precise geographical locations of their users. Indeed, services such as Twitter, Facebook, Foursquare and Flickr collect geotagged data every time a user enables localization for the content being posted (e.g., checking-in at a restaurant with friends); this is associated with geographic coordinates, a time stamp, and additional information relative to the location itself or the content published by the user. The mobility profiles of the users can be obtained from the list and number of locations other than their home which they have visited over the period of the study (see Fig. 7 for an example). These profiles can range from intra-urban routine trips to worldwide travel [[65], [66]]. From the geographical locations, by inferring a user’s home location, their radius of gyration can be determined using the geographical distance between the home location and reported locations. The jump length distribution can be determined by defining a jump as the geographical distance between consecutive reports for a single user and frequency of reports can be used to assess temporal patterns in human mobility [66]. Other measures such as visitation frequency and predictability are also obtainable [67].
Geotagged data can also be used to assess social interactions between individuals. Scellato et al.  [68] analyzed three online location-based social networks: Foursquare, Brightkite, and Gowalla, finding their exhibition of common properties found in most other real world complex networks, such as fat tailed degree distributions, high clustering coefficient, and short average path length between nodes, confirming the small-world nature of LBSN’s [69]. Furthermore, the probability of a social link between two users was found to exhibit a distance dependence, decreasing with geographical separation between the users. Compared to CDRs, data collected from OSNs and LBSNs, have the advantage of having more contextual information associated with the geographical positions and the users, enabling the study of mobility in a broader context. For example, data collected from Twitter users has been applied to topics such as social networks [[70], [71], [72]], evolution of moods [[73], [74], [75]], and crisis management [[76], [77], [78]] among other features.
  1. Download: Download high-res image (490KB)
  2. Download: Download full-size image

Fig. 7. Country-specific analyses of travel based on Twitter users and on the estimated total number of travelers. Panel A shows the number of Twitter users residing in a country and traveling to another while panel B shows the number of users visiting this particular country. Panels C and D represent the number of Twitter travelers normalized by the extent of Twitter usage in their home country. Finally panel E represents the yearly ratio between the estimated inflow and outflow of travelers, revealing which countries were the origin or destination of international travel.

Figure from [66].
Yet, as in all other data sources, limitations persist. Besides requiring extensive clean up (such as removing users with suspiciously high activity, unreasonably fast movement between two consecutive check-ins or incomplete data [66]), mobility and interaction data originating from online social networks needs further validation in order to be considered truly representative of the general population. Sloan et al.  [79] found that Twitter data originating from the UK has significant demographic differences compared to the wider population data from UK census. Furthermore, there appear to be demographic differences between the users who enable geotagging and those who do not (only 3% of the users were found to have geotagging enabled).

3. Metrics, physics and scales

In this section, we will discuss some of the fundamental metrics used to characterize mobility as well as the associated spatio-temporal scales at which they are relevant. We will then move onto a discussion of some of the physics associated with mobility, including the relations between distance, time and velocity. We end the section with a discussion of energy arguments and interpolation of spatio-temporal scales through the lens of multimodality.

3.1. General metrics

3.1.1. Jump lengths

A key factor in modeling human mobility is the distance an individual travels in a given time period. Measures of distance are often dependent on the source of data being used, and the terms flight length, jump length, displacement, and trip refer to different distance measures that may be extracted from data. For example, early measures of mobility leveraged information about the spatial trajectories of bank-notes (one can think of this as an aggregation of many individual trips as a given banknote necessarily changes many hands). Conventionally, the distance between two instances of the appearance of a banknote in the measured data is termed a jump length [46]. Later, higher resolution measurements of movement were provided by Call Detail Records (CDRs). Here one could reliably measure the location (and corresponding displacement) of individuals based on placement of mobile phone towers that are pinged when one makes a call. Associated with the displacement between two successive calls is a time interval [48], that provides an estimate of the stay of an individual in a location (waiting-times), although it is not possible to detect the position of the user between two consecutive calls. Even higher resolution spatio-temporal data available from Global Positioning System (GPS) allows one to make more stringent definitions of displacement. In addition to the standard definition of jump-lengths [58], one can now define a displacement between stops, i.e. the displacement of individuals between two locations, given that they spent a minimum amount of time per location [63]. Indeed, the use of the terms “stop”, “trip” or “displacement” reflects human behavioral tendencies that motivate people to go from point A to point B. It is therefore important to verify that a “stop” corresponds to an actual behavior and that it is not artificially generated by oversampled or undersampled signals [80].
Regardless of how one chooses to constrain it, the jump length, typically denoted as Δr, is defined as the euclidean distance between r(t) and r(t+dt) corresponding to locations recorded at intervals t and t+dt. Of particular interest is the distribution of Δr within a population; how likely is it that a random member of the population will travel a distance r from their origin location in a time dt? In order to measure this, when modeling human mobility it is common to consider the probability distribution function (PDF) of jump lengths, P(Δr). This may be defined as the probability of finding a displacement Δr in a short time step dt.
Brockmann et al.  [46] determined P(Δr) for the trajectories of dollar bills and the distribution of jump lengths was observed to follow a power law, P(Δr)Δr(1+β), with exponent β=0.59±0.02, independent of the size (in population) of the entry point of a bank note (see Fig. 8). The power-law behavior of P(Δr) has also been observed in the trajectories of mobile phone users [[48], [81]]. The empirical distribution of displacements obtained from mobile phone data analyzed in [48] is well approximated by a truncated power-law distribution: (5)P(Δr)=(ΔrΔr0)(1+β)exp(Δrκ)where r0 and κ represent cutoffs at small and large values of Δr. The value of the observed scaling exponent in Eq. (5) is β=0.75±0.15, not far from β=0.59±0.02 obtained from bank note dispersal, suggesting that the two distributions may capture the same fundamental mechanism driving human mobility patterns. Zhao et al.  [82] measured the distribution of jump lengths using two GPS datasets that, along with location and time stamps, also included transportation mode. From this, the jump length Δr was determined as the longest straight line distance between two locations without a change of direction. Wait times, Δt, were taken to be the time spent in a particular transportation mode. A single trip may be made up of several flights, each of which have a corresponding transportation mode taken directly from the dataset. The transportation modes were grouped into four categories; Walk/Run, Car/Bus/Taxi, Subway/Train and Bike. The distribution of jump lengths is found to be log-normal if each transportation mode is considered individually, whereas P(Δr) for all modes combined follows a truncated power-law with β=0.55 and 0.39 for the two datasets. The distribution of wait times, P(Δt), was found to be exponential, corresponding to a large number of Walk/Run flights that connect the use of other transportation modes within a single trip and therefore have only a short duration. This result provides further insight into studies using on bank note and mobile phone data [[46], [48], [81]] in that the power-law behavior of the distribution of jump lengths is the result of combining several distinct log-normal distributions, corresponding to the different transportation modes available, with exponentially distributed wait times. GPS devices installed in cars transmit the location of the car to a high degree of accuracy every few seconds when the car is in motion. The analysis of such data from over 150,000 cars in Italy over a period of one month [64] found that the probability density function, P(Δr) had two different regimes; an exponential distribution up to a characteristic distance of 20 km corresponding to inter-city travels, and a power-law with β=1.53 corresponding to intra-city movement. Here, Δr corresponds to the true length of a single trip. Trips are distinguished by records that are greater than 20 mins apart; as the GPS device only transmits when the car is moving, using a threshold of 20 mins allows for small stops, such as waiting at traffic lights or re-fueling, to be accounted for and included within a single trip. The exponent, β, is significantly higher than those obtained for bank note dispersal [46] and mobile phone data [[48], [81]]. The authors suggest this is due to the limitation in the GPS data caused by restrictions on the size of the area covered (500 km in length), and the physical limitation on the distance an individual will drive in a single trip.
The empirical characterization of the distributions of jump lengths and waiting times is particularly difficult because these distributions are very susceptible to the biases of the dataset used to measure them, namely on the spatiotemporal resolution, length of the period of observation, size and representativeness of the population, and the transportation mode considered. As a result, there are some discrepancies in the empirical estimates of these distributions when measured using different datasets. The accurate measurement of jump lengths and waiting times distributions has relevant implications on human mobility modeling and related applications, hence it is still an active area of ongoing research. Recent works using high spatiotemporal resolution data to characterize these statistical patterns include [[83], [84]].
  1. Download: Download high-res image (215KB)
  2. Download: Download full-size image

Fig. 8. The short-time dispersal kernel of bank notes. The measured probability density function P(r) of traversing a distance r in less than T=4 days is depicted in blue symbols. It is computed from an ensemble of 20,540 short-time displacements. The dashed black line indicates a power law P(r)r(1+β) with an exponent of β0.59. The inset shows P(r) for three classes of initial entry locations (black triangles for metropolitan areas, diamonds for cities of intermediate size, circles for small towns). Their decay is consistent with the measured exponent β=0.59 (dashed line).

Figure from [46].

3.1.2. Mean square displacement

Research into the distribution of jump lengths suggests that individual trajectories can be described by Lévy flights, a family of models associated with random walks (described in detail in Section 4.1). In this context, a common measure of the potential exploration of area is the Mean Square Displacement (MSD): (6)MSD(t)=(r(t)r0)2Δr(t)2.Here r0 is a vector marking the origin of an individual relative to some reference point, or in other words, the location at which a particular trajectory starts, while r(t) measures the subsequent position of that same individual at time t. The average runs over different independent trajectories or individuals. The scaling of MSD with time provides a measure of the type of diffusion of individuals relative to their starting point in a trip. The MSD(t) has the unit of an area and it corresponds to the average squared distance from the origin after a time t. In general, if the individual mobility trajectories follow a so-called Continuous Time Random Walk, (Cf. Section 4.1.3), the MSD follows the form Δr(t)2tν with ν=2αβ where α and β are the exponents of the waiting-time, i.e. the time interval between two consecutive jumps, and jump length PDFs [46]. In particular, for a random walk (ordinary diffusion) ν=1. Yet a Continuous Time Random Walk (CTRW) type process is not entirely a realistic representation of how humans actually move. In particular, a random walker will tend to drift away from the origin of its trajectory rather rapidly, the longer the elapsed time. Whereas, as one would imagine, empirical measurements indicate that people have a tendency (on average) to return home on a daily basis [48]. Furthermore, the means of transportation constrain the maximum jump length, therefore it has been observed that the PDF’s P(Δt) and P(Δr) have exponential cutoffs; this implies that their variances are finite and hence in the long-time limit the scaling of the MSD will asymptotically converge to that of Brownian motion, MSD(t)t12.
To account for this discrepancy, refinements were made in [81], where analysis was restricted to high-resolution data with a combination of mobile phone records that provided locations each time a call was made, along with location data recorded by mobile services at hourly intervals. The newer measurements revealed that human mobility apparently follows an ultra-slow diffusive process characterized by a slower than logarithmic growth of the MSD with time, MSD(t)(loglog(t))2β (see Fig. 9).
Yet, the picture is typically more complex. Indeed, a common problem arising when calculating the MSD, is that the location of an individual’s origin is often ambiguous [58]. One method of overcoming this issue is to take the average of MSD values measured by varying the origin among all locations that an individual visits [[85], [86]]. Such an approach was taken in [58] where they analyzed the GPS traces of individuals in a number of different location, including a university campus (KAIST), New York City, Disney World, and a US state fair. The averaging procedure applied to the MSD revealed two types of temporal behavior; up to a certain time (30 min) the participants moved super-diffusively (as expected in a CTRW process), whereas for periods greater than 30 minutes, the individuals moved sub-diffusively. It is here that the notion of scales becomes important. The authors explain this dichotomous behavior by showing that at short time scales, the distribution of jump lengths follow a power-law distribution, whereas at longer time scales, jump lengths are more homogeneous, following a Gaussian distribution. While the combination of power law waiting times and jump lengths lead to super-diffusive behavior at shorter spatio-temporal scales, the Gaussian distribution of jump lengths (while maintaining a power law behavior for the waiting times) leads to mobility that appears sub-diffusive [87] at larger spatio-temporal scales.
  1. Download: Download high-res image (291KB)
  2. Download: Download full-size image

Fig. 9. MSD versus time for groups of individuals with different radii of gyration. The gray line represents the analytic prediction of CTRW (Cf. Section 4.1.3) whereas the orange line represents the analytic prediction of the asymptotic behavior MSD(t)(loglog(t))2β, which more accurately reflects the movement pattern of humans.

Figure from [81].

3.1.3. Radius of gyration

The sub-diffusive behavior of humans at certain scales suggests that they tend to move a characteristic distance away from their starting locations. This distance can be quantified by the so-called, radius of gyration, rg, defined as the root mean square distance of a set of points from a given axis. Traditional applications of this measure are rooted in physics (where it is related to the mass moment of inertia) and engineering (distribution of cross sectional area in a column). A general formulation of rg is given by: (7)rg=1Ni=1N(rir0)2where ri are the coordinates of the N individual points (or N measurements of location) and r0 is the position vector of the center of mass of the set of points, rcm=i=1NriN. When applied to human mobility, the radius of gyration can be used to characterize the typical distance of an individual from the center of mass of their trajectory. The radius of gyration is not equivalent to the square root of the Mean Squared Displacement, defined in Eq. (7), although the two quantities are related because the former is a central second moment while the latter is a second moment about the origin.
Gonzalez et al.  [48] used mobile phone call records to determine the radius of gyration for two sets of users; one set with high temporal resolution (position recorded every two hours over one week), and one set with lower resolution on a longer (6 month) scale. The radius of gyration was calculated for each user, a, up to a time t using the recorded positions, ri(a), i=1,,N(a). The value of r0 was taken as the center of mass of each users trajectory; rcma=i=1N(a)ri(a)N(a). The distribution of rg, P(rg), was found to follow a truncated power law of the form: (8)P(rg)=(rgrg(0))(1+βr)exp(rgκr)where r0(0) represents the minimum radius of gyration cutoff due to the spatial sensitivity of the data, and κr represents an upper cutoff mostly due to the finite size of the study area (see Fig. 10).
  1. Download: Download high-res image (215KB)
  2. Download: Download full-size image

Fig. 10. The distribution P(rg) of the radius of gyration measured for two sets of mobile phone users labeled D1 and D2, where rg(T) was measured after 6 months of observation. The solid line represents a truncated power-law fit. The dotted, dashed, and dot-dashed curves show P(rg) obtained from Random Walk, Lévy flight, and truncated Lévy flight models respectively.

Figure from [48].
Indeed, the measured value of the scaling exponent βr=0.65±0.15 indicates a significant degree of heterogeneity in the travel habits of the observed population. Measuring the conditional jump length distribution, P(Δr|rg), revealed that users with small rg travel mostly over small distances, whereas those with large rg tend to display a combination of many small and a few larger jump sizes. Once one accounts for this heterogeneity in travel habits by rescaling the distribution with respect to rg, it leads to a collapse of the data onto a single curve thus, (9)P(Δr|rg)Δr(1+βc)F(Δrrg).Here βc=0.2±0.1 is a “universal” scaling exponent, and F(x) is a scaling function that is constant for x<1 and rapidly decreasing for x1.
  1. Download: Download high-res image (585KB)
  2. Download: Download full-size image

Fig. 11. (a) Radius of gyration rg(t) versus time for mobile phone users separated into three groups according to their final rg(T), where T=6 months. The dashed curves correspond to a logarithmic fit of the form A+Bln(t), where A and B are time-independent coefficients that depend on rg. (b) Probability density function of individual travel distances P(Δr|rg) for users with rg=4, 10, 40, 100 and 200 km. Inset Each group displays a different P(Δr|rg) distribution. Main After rescaling the distance and the distribution with rg, the different curves collapse to a power law (solid line). (c) Return probability distribution, Fpt(t). The prominent peaks capture the tendency of humans to return regularly to the locations they visited before, in contrast with the smooth asymptotic behavior 1(tln(t)2) (solid line) predicted for random walks. (d) A Zipf plot showing the frequency of visiting different locations. The symbols correspond to users that have been observed to visit nL=5,10,30, and 50 different locations. Denoting with L the rank of the location listed in the order of the visit frequency, the data are well approximated by R(L)L1. The inset is the same plot in linear scale, illustrating that 40% of the time individuals are found at their first two preferred locations; bars indicate the standard error.

Figure from [48].
It is important to note that the distributions described above are not independent but are related by the equality P(Δr)=r0P(Δr|rg)P(rg)drg, where P(Δr) is the jump-length distribution introduced previously. If P(Δr) has a power-law scaling exponent β, then we have β=βc+βr, which seems to be in good agreement with empirically measured values. This form of scaling corresponds to a type of random walk called a Lévy flight (Section 4.1.2), and the results suggest that this may be the behavior of individuals up to their associated characteristic distance rg (and beyond which saturation effects take over).
A way to uncover the saturation effect is to examine the time evolution of the radius of gyration, i.e. rg(t). In [48], it was found that the average rg of mobile phone users displays a logarithmic increase with time, rg(t)A+Blnt (see Fig. 11), in contrast to a pure random walk where the scaling is of the form rg(t)tθ. The observed saturation in rg can be attributed to the regularity in travel patterns of the individuals, and specifically to the high probability for an individual to return to a few highly-frequented locations. As pointed out before, the true picture is a mixture of behaviors. Supporting the conclusions made in [58] while measuring the MSD, Zhao et al.  [88] argue that humans engage in a mixture of superdiffusive and subdiffusive behaviors (depending on temporal scale), and the latter specifically can be explained as a consequence of the saturation of rg. This was later reproduced by Song et al.  [81] and demonstrated to be the consequence of the fat-tailed distribution of the jump lengths (Fig. 12).
The k-radius of gyration.
The radius of gyration is a measure to characterize the typical distance traveled by an individual and it depends both on the mutual distance of the locations visited and on the time spent (or the total number of visits) in each location. However, the radius of gyration does not allow us to quantify the relevance of each location in determining an individual’s characteristic mobility. Indeed, an individual who spends a majority of time in their most visited locations, e.g. home and work, will have a large rg if these two locations happen to be quite far from each other. Conversely, even if these most visited locations are close to each other, large values of rg may be reported if the individual happens to visit a number of distant locations.
  1. Download: Download high-res image (216KB)
  2. Download: Download full-size image

Fig. 12. The distribution P(rg) of the radius of gyration rg for mobile-phone users at different moments of time. The straight line, shown as a guide to the eye, represents a power-law decay with the exponent 1+α1.55.

Figure from [81].
To disentangle these effects, one can study the influence of frequency of location visits on the characteristic distance traveled, by studying the k-radius of gyration, rg(k), defined as the radius of gyration computed over an individual’s k most frequently visited locations L1,,Lk (see next section)  [89]. One can express this modified form as (10)rg(k)=1Nkj=1knj(rjrcm(k))2,where Nk is the sum of the visits to the k most frequented locations, rcm(k) is the center of mass computed on those locations and nj is the number of visits to the jth most visited location. As an example, if rg(2)rg, then the characteristic traveled distance is dominated by the two most frequented locations, whereas if rg(2)rg than the two most frequented locations do not offer an accurate characterization of the individual’s travel pattern, requiring us to consider more locations.
  1. Download: Download high-res image (230KB)
  2. Download: Download full-size image

Fig. 13. The mobility networks of returners and explorers for k=2. Nodes (circles) indicate the geographic locations visited by the individual, and each link denotes a travel observed between two locations. When the total rg is small, the two most important locations (red and blue) are close to each other for both two-explorers and two-returners. As the total radius increases the behavior of two-returners and two-explorers starts to differ; for returners, the two most important locations move away from each other; for explorers, they stay close and other clusters of locations emerge far from the center of mass (the gray cross).

Figure from [89].
The analysis of GSM and the GPS data revealed that there exist two distinct classes of individuals, returners and explorers. The characteristic distance traveled by k-returners is dominated by their recurrent movement between a few preferred locations, and their radius of gyration is well approximated by their k-radius of gyration for k2. In contrast, k-explorers have a tendency to wander between a varying number of different locations and their k-radius of gyration is very small compared to their overall rg (see Fig. 13).

3.1.4. Most frequented locations and motifs

When considering patterns in human mobility, particularly movements within a single day or week, it is essential to distinguish between locations based upon their importance. As already mentioned, people have a tendency to return home on a daily basis and therefore most daily and weekly trajectories will start and finish at the same location. One method of quantifying the importance of a location is the use of ranks; the most visited location (likely home or work) would have rank 1, a school or local shop may have rank 2 or 3, etc. In [48], the rank of a location was determined for each individual mobile user by the number of times their position was recorded in the vicinity of the cell tower covering that location. It was found that visitation frequency follows a Zipf law, that is the probability of finding a user at a location of rank L is approximately P(L)1L.
Another method of distinguishing between locations is to construct each individual’s mobility pattern as a network. Schneider et al.  [44] used data from both mobile phone users and travel survey respondents to construct weekday mobility networks for each individual. These profiles consisted of nodes to represent locations visited and directed edges to represent trips between the locations. Every daily trip network started and ended at the home location of each user, which was determined as the location they were most often recorded at between 3:00 am and 3:30 am. For mobile phone users, only days where calls were made, and therefore location was recorded, during 8 or more 30-min time slots were included. Daily networks were constructed for weekdays only in order to identify patterns in mobility during a typical day (see Fig. 14).
It was found that 90% of the recorded trips made by all users can be described with just 17 daily networks. These 17 trip patterns can be described as motifs; a sub-network within a complex network [90]. In this case, a daily network was considered to be a motif if it occurred more than 0.5% in the datasets. The result that the daily patterns of human mobility can be constructed by just 17 trip networks suggests that these motifs represent the underlying regularities that exist in our daily movements and are therefore useful for the accurate modeling and simulation of humans’ mobility patterns.
  1. Download: Download high-res image (243KB)
  2. Download: Download full-size image

Fig. 14. Daily mobility patterns for two anonymous mobile phone users over a period of 4 days. The home location of each user is highlighted (black circle) and connected over the entire observation period with a dashed line. While the aggregate mobility profiles (graph on the xy-plane) are rather diverse, the individual daily profiles (brown, red, blue and purple from top to bottom for different days) share common features (based on figure in Schneider et al.  [44].

3.1.5. Origin–Destination matrices

The Origin–Destination(OD) matrix T is the standard object in aggregated mobility studies and transport planning. It provides an estimate of the number of individuals traveling between locations in a given area, over a given period of time. More precisely, an OD matrix is a n×m matrix where n is the number of different “Origin” zones, m is the number of “Destination” zones, and Tij is the number of people traveling from zone i to zone j. More commonly, an area under study is partitioned to an equal number of origin and destination points and therefore, n=m. The size n of the OD matrix then depends on the spatial scale/resolution at which the data has been collected. Traditionally, zones are administrative units, whose size may vary from census and electoral units to entire municipalities, departments or states, depending on the question that motivated the development of the OD matrix. Obviously, the maximal spatial resolution of an OD matrix depends on the data source.
An OD matrix can be empirically derived, being estimated from household or roadside travel surveys, traffic counts, and more recently from individual digital footprints (see below). It can also be the output of a model, like in the classic 4-step transport model widely used in urban planning.11 OD matrices and the 4-steps framework have been used in transport planning since at least the middle of the 20th century, and there exists an enormous literature that covers its origins, uses, models, synthetic indicators, and so on [91].
OD matrices are of specific importance to models of aggregate flow, that is models of human mobility at the population level, rather than individual level. Examples of such models are gravity and intervening opportunities models (Section 4). In empirical OD matrices, Tij indicates the number of travelers from i to j measured during observation time window, whereas if the OD matrix is a model’s estimate or prediction Tij usually indicates the expected (average) number of travelers between the two locations. The OD matrix diagonal elements are usually 0 to indicate that people only move between distinct locations. In some studies the diagonal elements are larger than zero and indicate the number of non-traveling individuals in the origin location (or those who travel within the origin location). The fraction of travelers from the origin i to all other locations can be calculated as pij=TijTi, where Ti=jTij. In this case, the entry pij may represent the probability of an individual located at i to select location j as their destination over all other possible locations.
Until the 2000’s, the traditional approaches to estimate OD matrices consisted in relying on travel surveys or counting. The drawback of such approaches are related to their cost to setup, frequency of updates, and that they cover a limited sample of individuals (a few hundreds or thousands of households) [92]. Consequently, to circumvent these difficulties, a large body of literature has dealt with extracting OD matrices from recently available, individual digital footprints [[92], [93], [94], [95], [96], [97], [98], [99], [100]]. Early work on this topic has already been summarized in a dedicated review a few years ago [101]. Here we do not go deeply into the details of the numerous methods that have been proposed, and give only the general picture for the case of average daily journey-to-work commuting.
Starting from time-sequences of successive locations, the goal is to identify important places (“anchor points”) for each individual, such as the residence and workplace. The heuristic underlying method is straightforward: due to the circadian regular daily rhythms of human activity, one can make the reasonable assumption that for most individuals, the most frequent location of mobile devices will be very near the residence during non-working hours of weekdays, and during weekends. Similarly, during working hours on weekdays, it is likely that for most individuals their devices will be located just next to their workplace or location of primary activity (which include students, unemployed, retired, and employed individuals with a non-fixed workplace) [[20], [98], [99], [100]]. In some cases validation of methodology was conducted by asking a small group of volunteers to complete a survey, whose results were then compared to the key locations extracted from the recorded activity of their mobile devices [95].
Alexander et al.  [99] proposed a method for constructing an OD matrix from mobile phone records. Such method allows for the aggregation of data at the individual level in a way that lends itself to analysis at the population level. Implementing the proposed method, in order to infer a user’s origin and destination, clustered locations are extracted from the mobile phone record. By clustering locations in close proximity to one another into a single origin or destination, noise from the data, such as inexact triangulation, is eliminated thus allowing for a more accurate analysis. The clustered locations are then assigned a type: work, home, or other. The assignment takes into account the time of day a user is observed there, the duration of their stay, and the day of the week. By the nature of mobile phone records, location is only recorded when a user makes or receives a call, therefore the arrival times and duration of stay at a given location may not reflect the true times. To correct for this, surveys on trips may be used to assign true arrival times and duration from a probability distribution derived from the survey data. Then, trips can be constructed for every user between each location at which they are observed. In order to ensure that the final OD matrix is representative, users that do not satisfy certain conditions, such as on average of at least one visit to the home location per day, may be removed from the dataset. The daily trips are then combined for all users and locations allowing for the aggregate flow between all possible locations to be determined and an origin–destination matrix to be constructed.
Lenormand et al.  [98] compared the journey-to-work OD matrices extracted from three different data sources (census, mobile phone CDR data, and Twitter data) in Madrid and Barcelona, two of the largest European urban areas. Projecting residential and workplace locations identified in each dataset on regular square grids, in order to have a common spatial resolution for each dataset, they measured the correlation between the OD matrices estimated through each data source, as a function of the level of spatial aggregation. They found good correlation for square grids of size 2km (ρ>0.9), and very good correlation between the various sources when aggregating the individual data sources (Twitter, CDR data) on the map of administrative units (municipalities) used by the travel surveys (ρ>0.99) (here ρ is the Pearson correlation coefficient). Using a slightly different method that aggregates nearby cell towers when identifying important locations, Alexander et al.  [99] estimated OD matrices of daily average commuting trips from CDR data in the Boston area, and then performed a sensitivity analysis of the correlation between this OD and the OD issued from transport surveys at different levels of spatial aggregation. Tizzoni et al.  [20] also compared OD estimated from census and mobile phone data at different scales of spatial aggregation. They found lower correlation values, but applied a simpler method to estimate the OD from mobile phone data, when the aforementioned papers applied more stringent filters to remove outliers, fusion phone antennas close to one another, etc. Several algorithms in the literature are reviewed in [100], and further successive refinements and improvements in the methodology have been made since [[97], [92], [102], [99]].

3.2. Physics of mobility

3.2.1. Distance, travel time, and effective speed

In essence, the metrics described above, deal with the concepts of distances traveled and the time elapsed within and between journeys. Another important feature, which has not been discussed, is the notion of speed. To understand the relation between these quantities, it is important to note that mobility occurs over a large variety of distances. Framing it in the context of urban agglomerations, intra-urban mobility occurs over distances typically in the range of 1–10 kms, while inter-urban displacements occur generally over a wider range of the order of 100 kms (or more for large countries such as the US), and inter-country and intercontinental trips cover distances typically of the order of 1000 kms. While to first order, one may think of time spent on a given trip being roughly proportional to the distance traveled, one has to take into account the mode of transportation, which itself depends on the distance. For short-range travel, slow transportation modes with many stops are utilized (such as walking or public transportation), and for longer distances, one typically takes fast trains or planes with comparatively fewer stops. Thus the relation between distance, travel-times, and therefore speed, is fairly nuanced, as can be seen in Fig. 15, where we show the relation between apparent speed (i.e. the geodesic distance divided by the travel time) and the travel distance [103].
In general, a trip can consist of multiple connections and may in fact be multimodal (multiple components of travel) with the corresponding waiting-time distributions associated with the modes of transportation [104]. Also, it is observed that the apparent speed v¯ increases with travel distance according to the following power-law functional form [103] (11)v¯rβ,where β0.5. This particular dependence is primarily due to a combination of the hierarchical structure of transportation systems [84] and the fact that waiting-times (parking, take-off, landing, etc.) decrease in proportion to trip distance.
  1. Download: Download high-res image (366KB)
  2. Download: Download full-size image

Fig. 15. Apparent speed versus travel distance. The boxes represent the intervals for the different transportation modes. In the insets, the average result for cars (top left) and for plane travel (bottom right) are shown. The dashed lines represent power law fits.

Figure from [103].
For example, it has been observed for public transportation as well as personal cars [84] that a plot of the average trip velocity as a function of the trip duration reveals an effective acceleration (Fig. 16) of the form (12)v¯=v0+at.This effective acceleration is a direct consequence of the hierarchical organization of roads: the longer the trip, the more likely that higher velocity roads such as highways are used, thus trips can be decomposed into a ascending cascade to faster roads, followed by a descending cascade when approaching the target location. Eq. (12) implies that the distance evolves with trip duration as (13)rv0t+12at2,from which the relation v¯r directly follows [103].
  1. Download: Download high-res image (158KB)
  2. Download: Download full-size image

Fig. 16. Empirical average speed versus the duration of the trip obtained from GPS data for cars (blue dots). The red solid line represents a fit to the form seen in Eq. (12) with v0=17.9 km h−1 and a=16.7 km h−2. Observe the saturation at t>2 h due to the finite number of layers in the transportation hierarchy. The orange dashed lines represent the best fit to |v¯|t which corresponds to a Brownian acceleration model.

Figure from [84].
  1. Download: Download high-res image (135KB)
  2. Download: Download full-size image

Fig. 17. Variation of the total delay due to congestion with population for 97 urbanized areas in the US. A power law fit gives an exponent of 1.270±0.067.

Figure from [105].

3.2.2. Travel time budget

In the context of intra-urban mobility, Marchetti [106], based on ideas developed by Zehavi [107], proposed the idea of a travel time budget of about one hour per day irrespective of location. This implies that with improvements in transportation technology and corresponding increase in speed, a greater amount of distance is covered within the budgeted time, thus allowing for urban sprawl. This assumption can be reformulated as the rational locator hypothesis [108] which posits that individuals maintain approximately a constant journey-to-work travel times by adjusting their home and workplace. This was revisited in [108] on data for travel times in Washington DC for 1968, 1988, 1994, and Twin Cities for 1990 and 2000. The results show that there is a strong dependence on geography: for the Washington DC greater urban area, travel times are relatively stable, while the results for Twin Cities show a marked increase of the commute time. This has also been confirmed in another study [105] that discussed the impact of congestion on mobility patterns. In particular, it has been measured for US cities that typical travel delay due to congestion increases with the population as (14)ΔτP1+δwhere δ0.3, indicating that average commuting time increases with population, due to the (nonlinear) congestion effects, in sharp contrast with the travel time budget hypothesis (see Fig. 17).

3.2.3. Energy arguments

Mobility, ultimately, is about energy. Indeed, moving an object of a certain mass at a certain speed requires a given amount of energy. Consequently, one would expect to apply energy concepts to understand human travel behavior [109]. First, it has been observed that the average travel times for different modes of transportation are inversely proportional to the energy consumption rates measured for the respective physical activities (obtained from ergonomic measurements of the related O2 consumption in units of kJ/min [110]) . Second, when daily travel-time distributions of different transport modes such as walking, cycling, bus, or car travel are appropriately scaled, they turn out to have a universal functional relationship (see Fig. 18).
This corresponds to a canonical-like energy distribution (with exceptions for short trips) hinting at a law of constant average energy consumption related to daily traveling. The argument, first proposed by Kölbl et al.  [109], goes as follows. If we define the energy Ei spent per transportation mode i, the average energy consumption per day E¯ is constant and independent of the mode of transportation. The corresponding entropy is given by (15)S=P(Ei)lnP(Ei)dEi,and the constraints on the energy distribution read (16)P(Ei)dEi=1;EiP(Ei)=E¯dEi,which leads to the canonical distribution (17)P(Ei)eβEi.However, the data indicates that the probability to expend small energy is vanishing, and this was accounted for by introducing a cutoff of the form eαE¯Ei. This cutoff term is reminiscent of the so-called “Simonson effect” [111], where short trips are not very likely to be taken with a given mode if the energy spent is much less than αE¯. The complete energy distribution is then given by (18)P(Ei)eαE¯EiβEi.This finding, highlights the importance of physical concepts in understanding mobility and more generally social phenomena. In particular, it contains only physical variables such as travel times and energies, that are both measurable, in contrast with utilities introduced in classical choice modeling that describe preferences that are not measurable. Yet, as the authors point out, there are several issues that need to be accounted for, including measurement errors as well as multimodal trips that combine different transportation modes.
  1. Download: Download high-res image (261KB)
  2. Download: Download full-size image

Fig. 18. Rescaled travel time distribution for different transport modes (linear-log). Points represent different travel modes and the solid line is a fit to the rescaled energy distribution (18).

Figure from [109].

3.3. Interpolation of scales: the importance of multimodality

As discussed previously, mobility occurs over multiple spatio-temporal scales (multimodal) and thus a comprehensive picture of human mobility requires an accounting of the effects of multimodality. Indeed, the transition between different modes of transport necessarily implies a temporal cost that increases in proportion to the complexity of a trip (number of modes, or indeed level of mode). For example, in the UK, a rough estimate shows that roughly 23% of travel time is accounted for by connections made between modes in trips [104].
As showed in Fig. 19, the average structure of travel time varies with trip length and between cities. With the exception of London, short trips are composed of longer waiting times than riding times (i.e more time is spent being stationary than on the move), where the waiting times are mostly intra-layer in nature, due to bus–bus interchanges. If the transportation network is particularly multi-modal, inter-layer waiting times and walking times start to play a significant role for longer values of .
The effect of multiple scales on movement can be unpacked by defining two types of trajectories. One is the (ideal) quickest path that connects two points and neglects delays due to inter-modal connections (walking and waiting times). For this ideal path to exist, one would need perfect synchronization between modes (bus schedules aligning perfectly with train schedules for example). The other (more realistic) type of trajectories are so-called time-respecting paths that are the quickest paths accounting for inter-modal effects such as arrival, departure and connection times.
  1. Download: Download high-res image (978KB)
  2. Download: Download full-size image

Fig. 19. The anatomy of the transportation networks of selected cities in the UK. (a) London, (b) Manchester, (c) Edinburgh: total travel time in function of trip length separated by mode of travel. (d) Different colors represent different regimes versus the trip length. (i) Red: the trips are mostly done on the bus layer and display waiting times larger than riding times (a regime not present in London). (ii) Green: riding times exceed waiting times and most of the distance is covered in the bus layer. (iii) Blue: riding times exceed waiting times and most of the distance is covered in the metro and rail layers.

Figure from [104].
For OD pairs i and j, denoting the ideal path time as τm(i,j) and the corresponding time-respecting path as τt(i,j), the delay due to the lack of synchronization can be measured by (19)δ(i,j)=τt(i,j)τm(i,j)1.Averaging this ratio over all OD pairs in a city, one obtains a characteristic delay δ¯. As an example, for cities in the UK the variation of δ¯ with trip length l is unimodal, reaching its maximum δmax for short trips and then decreasing with distance according to (20)δ¯δmin+δmaxδminν,where ν0.5. This surprising collapse (see Fig. 20) suggests an underlying process describing the accumulation of waiting and walking times along time-respecting paths.
It is clear for a given trip that the number of modes and the frequency of their use play an important role in determining δ¯. To measure their effect a natural quantity is the average number of stop events per unit time (21)Ω=αCαΔtα,where Cα is the number of stop events in mode α and Δtα the duration spent in that mode.
  1. Download: Download high-res image (333KB)
  2. Download: Download full-size image

Fig. 20. Dependence of the synchronization inefficiency δ on path length l for cities in the UK. All cities appear to collapse onto a curve described by Eq. (20).

Figure from [104].
The quantity Ω can be thought of as a measure of the efficiency of transportation modes in terms of synchronization. The connection between δ̄ and Ω appears to follow a power law [104] (22)δ̄Ωμ,where μ0.3±0.1. The trend is due to the fact that larger values of Ω implies larger frequency and thus a better synchronization between modes. The observed small value of μ is however bad news in terms of efficiency: Decreasing inefficiency by a factor of 2 requires a 10-fold increase in Ω meaning essentially that the trip would need to be unimodal.
Thus as the results show, for large cities with multi-modal transportation system, the temporal cost due to interchange between layers plays a key role in the statistics of trips. Indeed, a challenge is posed due to the entanglement between the temporal and multilayer aspects of the system.

4. General mobility models

Models of human mobility can be aimed at reproducing individual mobility patterns or general population flows. In both cases one must necessarily take into account the characteristic spatial and temporal scales of the mobility process, which can vary from hundreds of meters to thousands of kilometers and from hours to years. For this reason, each of these cases have been tackled with distinct modeling frameworks. Individual mobility is subject to a certain level of uncertainty associated with free will and arbitrariness in the actions of individuals, leading to a degree of stochasticity in trip patterns. Consequently, minimal models borrow concepts and methods from random walks and Brownian motion. However, several studies highlighted that individual trajectories are far from random, possessing a high degree of regularity and predictability, which can be exploited to predict an individual’s future whereabouts and to construct realistic generative models of individual mobility. At the level of population flows, models describe the aggregate mobility of many individuals and aim to reproduce Origin–Destination (OD) matrices by estimating the average number of travelers between any two spatial regions (e.g. municipalities) per unit time (i.e. daily in the case of commuting flows, yearly in the case of migration flows). Most modeling approaches derive the mobility flows as a function of a few relevant variables of the regions considered, such as mutual distances, population levels, GDP per capita, etc. The next sections review the state-of-the-art mobility models starting from individual-like approaches, moving on to flows at the population level and concluding by the interpolation between these scales (inter-modality).

4.1. Individual-level (random walks)

A random walk is mathematically defined as a path formed by successive discrete random steps although they can be described in the continuum limit as well (Section 4.1.3). The simplest version, however, deals with spatial displacements ΔXi that are taken at discrete times ti. For the sake of simplicity, we are going to concentrate first in the case of 1D and later will discuss the generalization to higher dimensions. If x0=0 corresponds to the initial position of the walker at time ti=0 (we can arbitrarily fix the origin in the initial position of the walker), then the position after N steps is given by the random variable (23)X(tN)=i=1NΔXi,where each displacement ΔXi is a random variable extracted from a probability distribution f(Δx), and draws are assumed to be statistically independent. The probability distribution f(Δx) can be used to determine the probability density function (PDF), P(x,t) for the process to be at position x at time t which completely characterizes the nature of the walk and the related spatial and temporal measures. For example, the mean square displacement (Section 3.1.2) corresponds to the second moment MSD(t)=X(t)2, and the nth moment is obtained from (24)X(t)n=xnP(x,t)dx,where brackets indicate ensemble averages over multiple realizations of walks.
Of particular interest when analyzing models of individual mobility is the scaling of the square root of the mean squared displacement (RMSD), R(t)=MSD(t), with time t; a relationship that characterizes the speed of displacement from the origin with time. The scaling of the MSD can be used to categorize the type of diffusive motion of the random walker. Ordinary Brownian motion (Section 4.1.1) has a MSD that scales linearly with time (for any spatial dimension) therefore on average we expect that after a time t the distance of the walker from the origin is proportional to the square root of the elapsed time; R(t)t12. Taking this definition of diffusion, random walks that have displacement growing at a slower rate than t12 are said to be sub-diffusive. In contrast, if displacement grows at a rate faster than t12, the random walk is classed as super-diffusive. The models of random walks that can lead to these different behaviors are outlined below.
As an example, the most basic form of a random walk is the discrete symmetric random walk in 1D, for which ΔXi=±1 with equal probability. For this simple case after an elapsed time t=N (where N is the number of steps taken by the walker), the first two moments are XN = 0 and XN2=N.

4.1.1. Brownian motion

Brownian motion is a class of random walk originally developed to describe the motion of a particle suspended in a fluid (liquid or gas) [112]. Such a particle undergoes many rapid instantaneous collisions with much smaller particles in the medium, resulting in a trajectory characterized by a series of irregular and random displacements (see Fig. A.47 for pictorial representation of such movement). Mathematically, a 1-dimensional Brownian motion is a random walk in the space of real numbers R with independent and normally distributed increments where the probability to observe a displacement of magnitude X from the origin location after a time t is Gaussian distributed with mean zero and variance proportional to t. Brownian motion can be defined as a limit of the discrete symmetric random walk. Let us assume that the particle can take steps of length 1k to its left or right with equal probability, and that after time t the particle has taken N=tk steps. For a given k, the displacement of the particle at time t is given by (25)Xk(t)=1ki=1tkΔXi,and taking the limit k leads to Brownian motion as a consequence of the Central Limit Theorem (CLT). In fact, in the large k limit Xk(t) tends to X(t), whose PDF is (26)P(x,t)=12πtσ2e(xμkt)22tσ2,where μ=ΔX and σ2=ΔX2 are the mean and variance of the random walk displacements. For this case, the first moment is zero and σ2=1, so the MSD is simply t and the scaling of the RMSD is R(t)t12, corresponding to ordinary diffusion. Using a similar limit, it can be shown that a d-dimensional random walk converges to a d-dimensional Brownian motion, for which the probability that the walker is found at distance x from the initial position is P(x,t)=(4πt)d2ex24t, and the RMSD always scales as the square root of t, R(t)t12.

4.1.2. Lévy flight

Unlike Brownian motion, there is a class of random walks called Lévy Flights, for which one cannot use the CLT. A Lévy Flight is composed of a series of small displacements, interspersed occasionally by a very large displacement (see Fig. A.48 for a graphical representation of Lévy Flights). It is formally defined as the sum of independent identically distributed random variables whose PDF for a single jump has a divergent second moment due to a long-tailed distribution of the form (27)f(Δx)1Δx1+β,with 0<β<2. If the displacement after k steps of size 1k1β is defined as Xk(t), then the random variable ZN defined as the re-scaled sum of N=tk independent random variables distributed as Eq. (27), takes the form (28)ZN=1(tk)1βi=1tkΔXi=1t1βXk(t).The rescaled variable satisfies a generalization of the CLT, namely the Lévy–Khintchine theorem which states that the PDF of ZN in the limit N is a so called α-stable (Lévy) distribution. These distributions do not have a closed formed in real space; instead the characteristic function can be written in Fourier space, and for this particular example, the tail would show the same power law behavior as Eq. (27). Changing variables from ZN to Xk via Eq. (28) we obtain (29)ZNf(z);Xk(t)1t1βfxt1β.The MSD is equal to the second moment (30)X(t)20x21t1βfxt1βdx=t2β0y2fydyand hence RMSD for a Lévy flight scales super-diffusively: R(t)t1β.
The jumps in Lévy flights can be considerable, but occurs within the same time step as a short one (which is unrealistic), and thus Lévy flights can be seen as only a rough approximation to actual human trajectories.

4.1.3. Continuous time random walk

The random walk models discussed thus far have been discrete in time. In each time interval, an instance of a jump is dictated by the corresponding jump-length distribution. A continuous time random walk (CTRW) is a random walk in which the number of jumps made in a time interval dt is also a random variable or equivalently, the time elapsed between jumps (wait-time ΔT) is also a random variable.
If the PDF of jump-lengths is f(Δx) and that for wait-times is ϕ(Δt), and these are independent, the CTRW consists of pairwise random and independent events with ΔX and ΔT drawn from the joint PDF, P(Δx,Δt)=f(Δx)ϕ(Δt), which denotes the probability that a jump of length Δx is taken after a time Δt. According to this model, after N steps, the total displacement, XN, and the total elapsed time TN are given by: (31)XN=i=1NΔXi;TN=i=1NΔTi.
The PDF of the process, P(x,t) can be Fourier–Laplace transformed to give: (32)W(k,u)=1ϕ̃(u)u(1ϕ̃(u)f̃(k))where ϕ̃(u) and f̃(k) are the Laplace and Fourier transforms of ϕ(Δt) and f(Δx) respectively. Taking the inverse transform we get (33)P(x,t)=12π12πidudkeutikxW(k,u).This expression for P(x,t) can be analyzed according to the asymptotic behavior of distributions ϕ(Δt) and f(Δx). In particular four types of models are obtained, depending on whether none, either, or both distributions have heavy tails. The properties of these models and their application to human mobility are discussed below.
Ordinary Diffusion: If the expectation value of ϕ(Δt) and the variance of f(Δx) are both finite, i.e 1ϕ̃(u)τu and f̃(k)1(σk)2 then from Eq. (32) and Eq. (33), asymptotically: (34)P(x,t)1tex2Dtwhere D is a diffusion constant. Therefore a CTRW with well defined jump-length and waiting-time distributions, is asymptotically (with time) equivalent to Brownian Motion.
  1. Download: Download high-res image (480KB)
  2. Download: Download full-size image

Fig. 21. Schematic of the different (asymptotic) classes of CTRW defined in the text, as a function of the waiting-time and jump-length exponents 0<α<1 and 0<β<2. Lévy flights, fractional Brownian motion as well as ordinary diffusion are limiting cases of the more general class of ambivalent processes.

Figure from [46].
Lévy Flights: If f(Δx)Δx(1+β) (0<β<2), and ϕ(Δt) has a finite variance, a Lévy flight is recovered. Following the same procedure as Section 4.1.2, the PDF of the process is given by (35)P(x,t)1t1βG(xt1β),where G(xt1β) is a scaling (limiting) function. Therefore if the distribution of jump lengths is a power-law and the wait-times are well defined, R(t)t1β and a continuous time random walker will follow a super-diffusive path, equivalent to a Lévy Flight.
Fractional Brownian Motion: Conversely, if jump-lengths with a finite variance are combined with a power-law distribution of wait-times ϕ(Δt)Δt(1+α), (0<α<2), the PDF is (36)P(x,t)1tα2H(xtα2),with H being a non-Gaussian limiting function. In this case, the effect of the distribution of waiting times is to slow down the random walk. Here R(t)tα2, consequently the walk is sub-diffusive for α<1 and super-diffusive for 1<α<2.
Ambivalent Processes: The fourth variant occurs when both f(Δx) and ϕ(Δt) are heavy-tailed. In this case, R(t)tαβ, and hence the nature of the diffusive behavior is fully specified by α and β. For β<2α, the CTRW is super-diffusive and for β>2α, it is sub-diffusive. If β=2α, the random walk converges to ordinary diffusion/Brownian motion, despite the diverging moments of the respective distributions. A schematic of these limiting cases is provided in Fig. 21. Given the sensitivity of the models to the parameters of the wait times and jump lengths, the importance of their accurate measurement from data cannot be over-stated. In any event, the ambivalent process model is most often used for describing the mobility of individuals. Analysis of various data sources (GPS, CDRs, Dollar bills) has found that both jump length distributions and the distribution of wait times display power-law behavior [[46], [48], [88], [81]]. The parameter ranges measured from data correspond to α estimated from empirical data range from 0.42α0.8 [[88], [81]] and values of 0.31β0.75  [[88], [48]].

4.1.4. Preferential return

An important aspect of human behavior, missing from the models discussed thus far, is the tendency of individuals to return to one or more locations on a daily basis (so-called preferential return). In a CTRW process, the number of distinct sites S visited by a random walker in time t is given by (37)S(t)tμ,with μ=α for a CTRW and μ=1 for a Lévy Flight [113]. The analysis of CDR’s (Section 2.3), revealed a power-law distribution of jump-size with exponent α=0.8, while μ was independently measured to be μ=0.6±0.02, considerably less than the theoretical prediction. Indeed for random walks, the probability of an individual to visit any distinct location becomes asymptotically uniform, whereas analysis of visitation patterns [48] from data suggests that the rank-frequency visitation frequency follows a Zipf’s law: (38)fkkζ,where k corresponds to the rank of location according to frequency of visit. This implies the distribution of visitation frequencies follows P(f)f(1+1ζ).
Furthermore the scaling of the MSD in the CTRW model suggests that an individual will asymptotically drift away from the origin (home) in contrast for what is known from daily experience. To account for the human trait of returning to locations, Song et al.  [81] include two extensions to the CTRW model: exploration and preferential return.
Exploration is defined as the probability for an individual to move to a previously unvisited location according to (39)Pnew=ρSγ,where ρ and γ are parameters of the model and S increases by one for each new visit. Preferential return on the other hand is the probability to return to a previously visited location and is thus complementary to exploration: Pret=1ρSγ (see Fig. 22 for a schematic).
  1. Download: Download high-res image (172KB)
  2. Download: Download full-size image

Fig. 22. Schematic description of the Exploration and Preferential Return model. Starting at time t from the configuration shown in the left panel, indicating that the user visited previously S=4 locations with frequency fi that is proportional to the size of circles drawn at each location, at time t+Δt (with Δt drawn from the P(Δt) fat-tailed distribution) the user can either visit a new location at distance Δr from their present location, where Δr is chosen from the P(Δr) fat-tailed distribution (exploration; upper panel), or return to a previously visited location with probability Pret=1ρSγ, where the next location will be chosen with probability Πi=fi (preferential return; lower panel).

Figure from [81].
The probability to visit a previous location i, denoted Πi, is determined by the number of previous visits to i, i.e., Πi=fi where fi is the visitation frequency. The parameters of the model are restricted to 0<ρ<1 and γ0. Considering the total number of steps, N, an individual makes in time t, and noting that dSdN=Pnew, the number of distinct locations visited in time t is given by SN1(1+γ). For a power-law waiting time distribution ϕ(Δt), we have tn1α and hence the exponent in Eq. (37) is (40)μ=α1+γ,thus leading to slower rate of exploration than predicted by CTRW processes.
Furthermore, the number of visits to location i at step N, mi(N), increases according to (41)dmidN=Πi(1Pnew),where Πi=fi=miimi(N). As S(t), for γ>0, Pnew0 and therefore dmidN=miimi(N). Noting that the sum of visits over all locations is equivalent to the number of steps taken, imi(N)=N, the expression mi(N)=NNi is obtained, where Ni denotes the first jump to location i. It is clear that the likelihood of visiting a location increases with number of earlier visits, and as such, the rank ki of location i follows the relation ki=S(ni)ni1(1+γ). Combined with the fact that fi is proportional to mi(N) this results in the relation fkkζ with ζ=1+γ, thereby accounting for the relation in Eq. (38).
The MSD is related to S via (42)MSDβ2log(1S1ζζ1)+c,where c is a constant. Given the measured range of the parameters, this leads to three asymptotic regimes: MSD(logt)2β for ζ<1; MSD(loglogt)2β for ζ=1; and MSDXmax for ζ>1 where Xmax denotes the saturation point of the MSD. Measured values suggest that ζ=1.2±0.1 indicating a saturation of movement at long times [81]. This regime corresponds to an individual’s motion being dominated by their most visited location and is more in line with expected human behavior.

4.1.5. Recency

The concept of recency was introduced to solve discrepancies that emerges under the standard preferential return assumptions. Specifically, the fact that the earlier a location is discovered, the more visits it will receive, leading to a cumulative advantage precluding people from changing preference of location (unlike what is observed in data). Barbosa et al.  [114] proposed a model in which the exploration phase in human movements also considers recently-visited locations and not solely frequently-visited locations. Two rank variables Kf and Ks are defined to characterize, respectively, the frequency and recency of a given location in the context of an individual’s trajectories. More precisely, the rank variables can be described as:
  • Ks
    is the recency-based rank. A location with Ks=1 at time t means that it was the previous visited location. Ks=2 means that such location was the second-most-recent location visited up to time t and so on.
  • Kf
    is the frequency-based rank. A location with Kf=1 at time t means that it was the most visited location up to that point in time. Similarly, a location with Kf=2 is the second-most-visited location up to time t, and so on.
The model can be described as follows: first, a population of N agents is initialized and scattered randomly over a discrete lattice with L×L cells, each one representing a possible location. The initial position of each agent is accounted as its first visit. At each time step agents can visit a new location with probability similar to the preferential return model, Eq. (39).
The return phase happens with probability Pret analogous to Eq. (39), with the caveat that return jumps selects location i from frequently visited ones with probability α and recently visited locations with probability 1α thus: (43)Prets=(1α)PretΠiks(i)ν,Pretf=αPretΠikf(i)1γ,where ks(i) is the recency-based rank and kf(i) is the frequency-based rank of the location i. When α=1 the preferential return model is recovered. The empirical measurements for the two datasets considered had α=0.1, and ν=1.6. γ=0.6 was kept the same as [81].
As seen from Fig. 23 the preferential return model does not capture the broader distribution of p(Kf,Ks) for recently-visited locations, an effect captured by the recency-based refinements. The primary differences can be seen for the distribution of the recency rank Ks seen in log-linear scale. In Fig. A.49 one can also see the slight differences in spatial patterns generated from simulations of the preferential return and recency models.
  1. Download: Download high-res image (929KB)
  2. Download: Download full-size image

Fig. 23. Comparison between the Preferential Return (EPR) model and the recency-based (RM) model. (a) The analysis of the return ranks generated by the EPR model shows that it reproduces a pattern similar to the one observed from the empirical analysis. (b) Probability of return to recently-visited locations (i.e., low Ks). (c) Distribution of the frequency ranks, the preferential return mechanisms (labeled EPR) exhibits a power-law distribution. The activation of the recency mechanism does not affect the frequency rank distribution. (d) Ks distribution, the EPR mechanism does not capture the power-law behavior observed on the empirical data.

Figure from [114].

4.1.6. Social-based models

It is natural to assume that two individuals who have a social interaction, such as friends, family or colleagues, do not always move independently [[115], [116], [117]]. Occasionally, they will share full trips, destinations or origins. The trips can be also synchronized if the objective is to meet somewhere or go back home after a meeting. Furthermore and closing the loop, the social network of an individual typically reflects the geography of their life with tighter connections with people spatially closer or at least in clusters related to the places in which the person has previously resided. These correlations have been observed in several publications. For example, a relation between distance and “online friendships” was described in 2005 by Liben-Nowell et al.  [118] and was later confirmed using surveys [[119], [120]], social networks [[121], [122]] and mobile phone records [[123], [124], [125]]. Indeed this feature formed the basis of a generative model describing the behavior of population level aggregate economic indicators across urban systems [126]. Possible methods for exploiting this intuitive observation towards improving forecasting of individual movement has been explored in [127]. For example, nonlinear time series prediction methods based on the delay embedding theorem by Takens [128] can be used to forecast an individual’s future mobility given a detailed history of past movements and the assumption of a certain degree of determinism in mobility patterns. Furthermore, improvements in prediction accuracy by about one or two orders of magnitude has been observed when including the time series of past movements of members of the social network (individual’s acquaintances).
Conversely, online [[129], [130], [131]] social links can be inferred from co-occurrences of individuals in space and time. In particular, the probability that two users have a friendship link on the Flickr social network grows significantly with the number of distinct geographical locations that they both visited within a given time threshold, where the spatial and temporal information are extracted from the geo-tagged photos they published. The probability of being friends is higher if the size of the regions and the temporal range between the two observations decrease. This result can be explained by simple mobility models exploiting the fact that friends are more likely to spend time together in the same place, often live close to each other and that the jump-size distribution decays as a power law. Similarly, the location of a person may be predicted by those of their near contacts [132]. Two individuals with similar mobility patterns are generally in close proximity in the who-calls-whom social network of mobile phone users [133]. Indeed, strong correlations exist between various classical topological measures of the proximity of two mobile phone users in the social network, such as Adamic–Adar, Jaccard and Katz, and various measures of spatio-temporal proximity, such as co-location rates and spatial cosine similarity. Combining co-location information with information on the relative positions of users in the social network it is possible to predict the formation of new social ties with higher precision than using only information derived from the proximity in the social network.
The idea of interconnecting mobility and social interactions has been considered in different theoretical contexts. Detailed models have been proposed in the area of transportation to take into account the relation between social network and transport demand. This includes microsimulation of transport systems [[115], [116], [117]] in which agents may have communication between them and have common objectives [134]. Social groups and their sizes can impact transport demand [135], daily schedules, social relations and face-to-face interactions [[136], [137], [138], [139], [140]]. A simpler framework has been introduced from a physics perspective, as for example formulated by [141]. The system is composed of N agents within a radius r moving in a 2D square of lateral size L and periodic boundary conditions. The starting position of the agents and their directions of movement are randomly selected. All the agents have the same initial speed v0 and they travel in straight lines until they “crash”. Every crash produces a new social tie between the pair of agents involved. After every crash, the velocities of the two agents are updated randomly selecting new directions of movement and increasing their speeds according to the expression: (44)|vi(t)|=vo+cki(t),where vi(t) is the velocity of agent i at time t immediately after the crash, c is a constant and ki(t) is the number of connections of i in the social network. The number of agents N is constant, but after a certain period of time old agents are removed and substituted with new agents with speed vo and no social links, allowing the system to reach a stationary state. The characteristics of the emerging social network are analyzed. The degree distribution depends on the constant c and on the lifetime of the agents in the system. For small c, the distribution is well approximated by a Poissonian distribution, but deviates from this shape with increasing c and average degree k. Beyond the distribution, the model can reproduce other properties of real social networks such as the degree–degree correlations, the size of the large connected cluster and the abundance of cliques.
More realistic mobility models in the context of generated social networks were explored in [[142], [143]]. In particular the former proposed a model inspired by the methods described in Section 4.1.4 with important variations in the main ingredients (see Fig. 24). The initial conditions were set such that the agents are placed according to measured population density in the area considered. Then at every step, each agent decides to visit a social contact, with probability pv or, otherwise moves according to a Lévy-like flight with probability 1pv, with the destination chosen in proportion to the population density in that area. This precludes agents from moving to unphysical locations such as water-bodies, geographic barriers and the like. After movement, the agent generates a new directed social link with probability p in the neighborhood of her present position and with probability pc with a random agent. The parameter pc represents social relations that emerge online, inspired by the observations made by [118]. Note that the jump distribution can be calibrated from measurements made in [81], and the tie formation parameter p can be sampled at different scales from measurements made in [130]. This leaves only two free parameters in the model: the probability of visiting friends pv and that of creating new links regardless of the distance pc. These parameters were adjusted to reproduce an error function containing topological and geographical properties of a set of online social networks such as Twitter, Gowalla and Brightkite. With these parameters, the model reproduces the degree distributions found in [141] as well as the spatial dependence of link distribution, the probability of reciprocal ties, the overlap of the social environment, the density of triangles and the disparity of the triangles in terms of distances of their vertices. The model is also amenable to a mean-field analytical treatment that demonstrates the necessity of accounting for social effects on mobility (visiting friends for example) to obtain realistic reproductions of the behavior seen in social networks.
  1. Download: Download high-res image (214KB)
  2. Download: Download full-size image

Fig. 24. Sketch showing the main ingredients of Grabowicz’s model. The agent’s update of position and network is marked in red, while its contacts are in blue. The model has two main steps: one in which the mobility is determined in terms of either visiting a friend with probability pv or a Lévy-like jump otherwise. After this, a new social link may be established with probability p in the neighborhood of the new position or at random with pc.

Figure from [142].
The variant proposed by [143] instead focuses on behavior seen at shorter time-scales such as intra-day mobility, for which period the social network can be approximated as static. Like the preferential return model, at each step an agent decides to return to a previously visited location with probability 1ρSγ or explore a new one with ρSγ, but there is an additional “social pressure” component. This occurs with probability α where one of the agent’s contacts are selected at random with probability proportional to the co-similarity in the location visiting profiles. Consequently, the new location is chosen from the list of the contact’s previously visited locations. In the absence of social pressure, (probability 1α) the behavior is the same as Section 4.1.4. The introduction of the social pressure component does a better job at reproducing the re-visitation profiles measured in CDR data than models without this component.

4.2. Population-level

Mobility information gathered at the individual level can be aggregated to study the flows of individuals traveling from one region to another at different spatio-temporal scales. These flows can be organized in the framework of OD matrices (see Section 3.1.5). Such a format, with all possible combinations of origins and destinations for trips, is easily transformed into a directed weighted network in which nodes denote locations (for example counties or municipalities) and link weights correspond to the flow of travelers between the two locations. As discussed earlier in Section 3.1.5, OD matrices can be empirically estimated from transportation surveys, traffic counts or individuals’ geolocated ICT data. An OD matrix, or its corresponding network, provides useful information on the travel demand between the origin and destination areas, representing a valuable asset widely studied and used in Geography, Transportation research and Urban Planning. Therefore, being able to obtain accurate estimation of OD matrices is crucial for both modeling and applications, and this problem has attracted the interest of researchers and decision makers for decades.
Note that individual mobility patterns are straightforwardly aggregated into flows, however the inverse problem, i.e the disaggregation of flows, is typically not possible. Instead, one has to resort to determine dependences between mobility flows and a limited set of “static” attributes of the locations that would allow predictions on how changes in these attributes can potentially influence future travel demand. To this end, various spatial interaction models have been proposed to predict flows of individuals based on a small number of key local attributes. Considering a region of interest divided into n locations, the purpose of these models is to estimate the number of trips Tij from location i to location j from the socio-economic characteristics of the populations of i and j, and their spatial distribution. Models of spatial flows have been traditionally developed starting from the principle of entropy maximization subjected to various constraints. In the strongest version, the constraints are the number of people leaving and entering each location, but in softer ones there can be other proxy variables to represent the demand and attraction of the trips’ origins and destinations such as population levels, the finite amount of resources for travels, utility functions to describe individual choices over competing alternatives, etc [[144], [145], [146], [147], [148]].
In the simplest version, the objective is to infer flows from the product of two types of variables: one type that depends on an attribute of each single location (e.g. the population), and the other type that depends on a quantity relating a pair of locations (e.g. the distance or travel time). The differences between the various models consist of the choice of variables considered, and the specific functional forms in which these variables enter. Over the years, two main schools of thought have emerged. The first one assumes that the number of trips between two locations is a decreasing function of their distance, giving rise to the so-called gravity models [[150], [6]]; the second variant postulates that the number of intervening opportunities, defined as the number of potential destinations between two locations, determines the mobility flow between them, and models that share this assumption are called Intervening Opportunities models [5]. An illustrative example that highlights the fundamental differences between distance and intervening opportunities is shown in Fig. 25. In addition to providing the mathematical framework to model human mobility flows [[39], [151], [152]], these models have found successful applications in estimating other spatial flows, including cargo shipping volume [153] and social interactions from inter-city phone calls [[124], [154]].
  1. Download: Download high-res image (387KB)
  2. Download: Download full-size image

Fig. 25. Differences between distance-based and intervening opportunity models. (a) The radiation model uses distance as a search criterion. (b) The cost-based radiation model uses network travel cost as a search criterion, which usually has a heterogeneous distribution. (c) The flow Tij through edge (i,j) is the sum of contributions from all those mobility fluxes ϕab whose minimal cost paths ωab contain (i, j).

Figure from [149].

4.2.1. Gravity models

Preceded conceptually by the work of H.C. Carey in 1858 about land use [150] and the retailing models of W. J. Reilly in 1931 [155], George K. Zipf proposed in 1946 an equation to calculate mobility flows inspired by Newton’s law of gravitation [6].12 In his work, Zipf highlights the importance of the distance for human migration patterns where the magnitude Tij of a migratory flow between two communities i and j can be approximated by (45)TijPiPjrij,where Pi and Pj are the respective populations and rij the distance between i and j.
The basic assumptions of this model are that the number of trips leaving i is proportional to its population, the attractivity of j is also proportional to Pj, and finally, that there is a cost effect in terms of distance traveled. These ideas can be generalized assuming a relation of the type: (46)Tij=Kmimjf(rij),where K is a constant, the masses mi and mj relate to the number of trips leaving i or the ones attracted by j, and f(rij), called a “friction factor” or “deterrence function”, is a decreasing function of distance. As in the original version of the model, the masses usually are functions of population (not necessarily linear); common functional forms used in the literature are mi=Piα or mj=Pjγ [91]. Unlike the original version, however, other variables, such as gdp-per-capita, may factor in the definition of the masses [[156], [157]]. The distance function f(rij) is commonly modeled with a power-law or an exponential form, although more complicated functions can be considered, such as a combination of the two, (47)f(rij)=αrijβerijrc.Indeed, the optimal form of the function may change according to the purpose of the trips, the spatial granularity of the locations, and the transportation mode [39]. For example, in the case of commuting flows, the value of the exponent is highly correlated with the scale [[158], [159]] as can be seen in Fig. 26. In other applications, the distance may not be the appropriate variable to quantify the cost of travel between two locations, and other variables such as travel time or economic (i.e. monetary) cost of a trip may offer better characterizations.
The ability to estimate, even if as a crude approximation, trip-flows, and consequently, traffic demand between two different locations as a function of their local properties, has made the gravity model widely popular in transport planning [[160], [91]], in studies of geography [161] and spatial economics [[162], [163]]. It has been also used in situations where the knowledge of mobility flows is essential but the empirical data is not available as in the case of the characterization and modeling of epidemic spreading patterns [[164], [165], [166], [167], [157]]. Despite its widespread use and historical popularity, one must keep in mind that the gravity model is a gross simplification of travel flows, and in many cases, falls far short of capturing actual empirical observations [[42], [168], [159]]. Furthermore, the model requires the estimation of a number of free parameters, making it rather sensitive to fluctuations or incompleteness in data [[152], [42]].
  1. Download: Download high-res image (381KB)
  2. Download: Download full-size image

Fig. 26. The distance exponent β (Eq. (47)) as a function of average unit surface area. (a) Normalized gravity laws with an exponential distance decay function. (b) Normalized gravity laws with a power distance decay function. (c) Schneider’s intervening opportunities law. (d) Extended radiation law.

Figure from [159].
Constrained gravity models. Some of the limitations apparent in the gravity model, can be resolved via certain constrained versions. For example, one may hold the number of people originating from a location i to be a known quantity Oi, and the gravity model is then used to estimate the destination, constituting a so-called singly-constrained gravity model of the form, (48)Tij=KiOimjf(rij)=Oimjf(rij)kmkf(rik).As can be seen, in this formulation, the proportionality constants Ki depend on the location of the origin and its distance to the other places considered. One can go further and fix now also the total number of travelers arriving at a destination j as Dj=iTij, leading to a doubly-constrained gravity model. For each pair in the OD, the flow is calculated as (49)Tij=KiOiLjDjf(rij),where there are now two flavors of proportionality constants, (50)Ki=jLjDjf(rij)1,Lj=iKiOif(rij)1,usually calibrated with an Iterative Proportional Fitting procedure [169].
The use of singly-, doubly- or non-constrained models depends on the amount of information available and on the pursued objective. If the aim is to approximate the mobility flows and transport demand from indirect socio-economic variables of different geographical areas, then one employs non-constrained models. On the other hand, if out-going or in-going flows are empirically measured quantities, and the objective is to estimate the elements of the OD matrix Tij, then one employs constrained models.
Maximum entropy derivation of gravity models. While the gravity model may seem rather ad hoc in terms of the form of Eq. (46), an argument in favor of this functional form was provided by Alan Wilson [170] in the framework of classical transportation theory combined with entropy maximization. Essentially, the argument posits that in the absence of detailed information, out of all possible variants of OD matrices, the most probable ones are those that can be obtained with the highest number of trip configurations under opportune global and local constraint satisfaction. The objective is thus to find the set of flows {Tij} that maximizes the number of possible configurations of trips associated with it, respecting the possible constraints.
Let Ω({Tij}) be the number of distinct arrangements of individuals (configurations) that give rise to the set of flows {Tij}, corresponding to the number of ways in which T11 individuals can be selected from the total number of travelers T=ijTij; T12 from the remaining TT11 and so on and so forth. Then we have that (51)Ω({Tij})=T!T11!(TT11)!(TT11)!T12!(TT11T12)!=T!ijTij!.One then determines the maximum of Ω using Lagrange multipliers, subject to the constraints: (52)jTij=OiiTij=DjijTijCij=C,where Cij is the cost of travel from location i to location j. The first two constraints ensure that trips originating and terminating in every location are equal to their observed values, whereas the third constraint fixes a total cost for all trips, C. In the limit of a large number of trips, T, the configuration that maximizes Ω is by far the most probable, and hence dominates over all other configurations. The resulting doubly-constrained gravity model takes the form (53)Tij=KiOiLjDjeβCij,where the values of Ki and Lj are set in order to fulfill the first two constraints in Eq. (52). The Lagrange multiplier β appearing in Eq. (53) controls the effect of cost on flows, and its value is empirically calibrated. It is worth noting that one can introduce a power law distance decay f(rij)=rijβ by considering a cost function of the form Cijln(rij). These arguments have been further developed in a series of Refs. [[147], [148]] that also includes the distinguishable nature of the travelers and therefore necessarily modifying the statistics.
Using gravity models to estimate mobility flows. In order to calibrate and apply gravity models to estimate mobility flows within a given region the following procedure is usually adopted.
  • 1.
    First, depending on the objectives, a flavor of gravity model is selected from either the unconstrained version, Eq. (46), or one of the constrained versions, i.e Eq. (48), or Eq. (49).
  • 2.
    Second, the set of independent variables, population size, gdp or gdp-per-capita, distance, etc., as well as their relation with the local outflow, the attractiveness and the travel cost must be established. Although the choice of functions are somewhat arbitrary, common forms are power laws for the origin and destination populations, and exponential or power laws for the distance dependence. These particular functional forms are chosen to enable a fast and accurate calibration of the model, as it ensures that the logarithm of the flow depends linearly on some functions of the populations and the distance, allowing researchers to apply linear regression methods to determine the parameter values.
  • 3.
    Third, the parameter values are selected in order to maximize the fit between the flows estimated by the gravity model and the empirical flows observed in the region of interest. The best fit values of the parameters are determined using an optimization algorithm that either minimizes some error function between the model’s estimates and the observed data [171], or maximizes the likelihood function of the observed data given the model’s parameters [172]. Generalized Linear Models (GLM) [173] are a generalization of linear regression that are usually applied to fit the parameters of globally and singly constrained gravity models. GLM methods are more adapt than Ordinary Linear Regression (OLM) as it allows for the use of a wider and more realistic range of probabilistic models to capture fluctuations in flow estimates.
Both the model training and parameter calibration steps can be employed either on the entire dataset or on subsamples, particularly if the aim is to validate the performance of the model in relation to the remaining set.

4.2.2. Intervening opportunities models

Along with the gravity model, one of the first attempts to provide a conceptual and formal model of human mobility was introduced in 1940 by Stouffer [5]. Contradicting a long tradition in the social sciences – that distance is the central factor in determining mobility – Stouffer proposed a conceptual framework in which distance and mobility are not directly related. Instead, Stouffer suggested that what plays the key role in determining migration is the number of intervening opportunities or the cumulative number of opportunities between the origin and the destination. In the paper, Stouffer does not provide a precise definition for “opportunities”, leaving it to be defined depending on the social phenomena under investigation.
The basic idea behind the intervening opportunities (IO) model is that the decision to make a trip is not explicitly related to the distance between origin and destination, but to the relative accessibility of opportunities for satisfying the objective of the trip. An opportunity is a destination that a trip-maker considers as a possible termination point for their journey, and an intervening opportunity is a location that is closer to the trip maker than the final destination but is rejected by the trip maker. The law of intervening opportunities as proposed by Stouffer in 1940 states “The number of persons going a given distance is directly proportional to the number of opportunities at that distance and inversely proportional to the number of intervening opportunities”. Stouffer used this model to estimate migration patterns between services and residences [5]. The traditional form of the intervening opportunities model is usually given by Schneider’s version of Stouffer’s original model [174]. Schneider’s hypothesis states that “The probability that a trip ends in a given location is equal to the probability that this location offers an acceptable opportunity times the probability that an acceptable opportunity in another location closer to the origin of the trips has not been chosen”. More formally, the flow Tij from the origin location i to the jth location ranked by travel cost from i is given by: (54)Tij=OieLVij1eLVij1eLVin.Here, Oi is the total number of trips originating from i and the second term represents the probability that one of these trips ends in location j. The denominator is a normalization factor ensuring that the probabilities sum to 1 (i.e. jTij=Oi). This probability depends on Vij, the cumulative number of opportunities up to the jth location ranked by travel cost from the origin location i (and n is the total number of locations in the region considered). Usually, the population, mj, or the total number of arrivals, Dj=iTij, are assumed to be proportional to the number of “real opportunities” in location j. The value of the parameter L can be seen as the constant probability of accepting an opportunity destination. As in the case of the gravity model, the value of L is adjusted in order to obtain simulated flows as close as possible to observed data.
Several studies and variants of intervening opportunities models have been developed on this concept [[161], [175], [176], [177], [178], [179]]. The gravity and the intervening opportunities models have been compared several times during the second half of the twentieth century, showing that generally both models perform comparably [[180], [181], [182], [183]]. In fact one can cast the intervening opportunities model as a special variant of the gravity model with the friction factor f(rij) replaced by a function of the number of opportunities between the two locations, f(Vij)=eLVij1 [[184], [183]]. Consequently, the intervening opportunities model can also be derived from entropy maximization methods as discussed earlier in the context of the constrained gravity model. Indeed, some authors have proposed hybrid gravity-opportunities models taking into account both the effect of distance and the number of opportunities between locations [[185], [186]].
However, unlike the gravity approach and despite relative good performances, the intervening opportunities model has suffered a growing loss of popularity. This is mainly due to the lack of research effort into the implementation and calibration of the model, attributable to the fact that the theoretical and practical advantages of opportunities models over the gravity ones are not overwhelming [91].
  1. Download: Download high-res image (1MB)
  2. Download: Download full-size image

Fig. 27. Schematic of the radiation model. (a) Commuting flows in two pairs of counties, one in Utah (UT) and the other in Alabama (AL), with similar origin (m, blue) and destination (n, green) populations and comparable distance r between them (see bottom left table). Number of travelers in the data, as predicted by the gravity model and finally for the radiation model shown as upper right inset. The definition of the radiation model: (b) An individual (e.g. living in Saratoga County, NY) applies for jobs in all counties and collects potential employment offers. The number of job opportunities in each county is chosen to be proportional to the resident population. Each offers attractiveness (benefit) is represented by a random variable with distribution p(z), the numbers placed in each county representing the best offer among the jobs in that area. Each county is marked in green (red) if its best offer is better (lower) than the best offer in the home county. (c) An individual accepts the closest job that offers better benefits than his home county.

Figure from [42].

4.2.3. The radiation model

The concept of intervening opportunities and Schneider’s hypothesis has recently triggered a renewed interest thanks to the recently proposed radiation model [42]. The radiation model assumes that the choice of a traveler’s destination consists of two steps. First, each opportunity in every location is assigned a fitness represented by a number, z, chosen from some distribution p(z), whose value represents the quality of the opportunity for the traveler. Second, the traveler ranks all opportunities according to their distances from the origin location and chooses the closest opportunity with a fitness higher than the traveler’s fitness threshold, which is another random number extracted from the fitness distribution p(z) (see Fig. 27). As a result, the average number of travelers from location i to location j, Tij, takes the form: (55)Tij=Oi11miMmimj(mi+sij)(mi+mj+sij).Here again, the destination of the Oi trips originating in i is sampled from a distribution of probabilities that a trip originating in i ends in location j. This probability depends on the number of opportunities at the origin mi, at the destination mj and the number of opportunities sij in a circle of radius rij centered in i (excluding the source and destination). This conditional probability needs to be normalized so that the probability that a trip originating in the region of interest ends in this region is equal to 1. In case of a finite system it is possible to show that this is equal to 1miM where M=imi is the total number of opportunities [168]. In the original version of the radiation model, the number of opportunities is approximated by the population, but the total inflows Dj to each destination can also be used [[158], [159], [168]]. The great advantage of the radiation model compared with other spatial interaction models is the absence of a parameter to calibrate with observed data. In particular, the flows defined in Eq. (55) are independent of the fitness distribution p(z). However, this advantage represents also a limitation since the model does not seem to be very robust to changes in the spatial scale [[158], [168], [43], [159]]. To overcome this drawback, a radiation model with opportunities’ selection [187] and an extended radiation model [188] have been proposed. In this extended version, the conditional probability to perform a trip between two locations according to the spatial distribution of opportunities is derived under the survival analysis framework introducing a parameter α to control the effect of the number of opportunities between the source and the destination on the location choice. The addition of this scaling parameter seems to greatly improve the performance of the model, and similar to the gravity model, its value can be inferred from the scale of the study region according to the homogeneity of the opportunities’ spatial distribution [[188], [159]].
An interpretation of the radiation model in the context of job search and, consequently, for the formation of commuting flow networks has been given in [189]. The basic components are individuals residing within a demarcated geographical area, who are seeking employment. Job locations are uniformly distributed in space and characterized by a fitness parameter z, which itself is drawn from a distribution that includes aspects such as wage and worker-convenience. Individuals have a certain tolerance level, z, and will search in a progressively increasing radius from their residence, with the search terminating at the first instance of the condition z>z. Using extreme value statistics, it is possible to determine the distance distribution, P(r), between residence and job places, which is independent of the distribution of z (for non-pathological cases), and was calculated to be of the form (56)P(r)=2ρπr(1+ρπr2)2,where ρ is the density of jobs. The model satisfactorily reproduces some of the empirical features of the travel distance distributions in commuting data as can be seen in Fig. 28.
  1. Download: Download high-res image (517KB)
  2. Download: Download full-size image

Fig. 28. The closest opportunity model (a) Sketch with the main ingredients of the model: the residence place of the agents and the opportunities with their correspondent quality until the last one at distance r is selected. In (b), (c) and (d) rescaled distributions of commuting distances. In blue, the empirical data for several years in the three countries, in dark blue the averaged empirical distributions, and superimposed in red the model fits using a single parameter.

Figure from [189].

4.2.4. Comparison between models

The models described so far provide theoretical expectations for the OD matrices in terms of the flow values Tij between geographical regions i and j. Before employing these for practical applications, it is essential to validate the expected (calculated) flows against empirical evidence, that may be limited in the sense of being restricted in space compared to regions of interest, or indeed to shorter time windows than the temporal period of interest. By evaluating the performance of each model, one selects the one that most closely matches the empirical data (if available). The accuracy, in general, will depend on the spatio-temporal scale as well as on available (meta) information and the extent of missing data. It is crucial that comparison between models is done on an equal footing, i.e using the same extent of available data and in equal spatio-temporal resolutions. As discussed, the model output is the value of Tij, subject to constraints, which range from Eq. (46), where only the total sum of the flows is imposed via the parameter K, the singly constrained framework models based on intervening opportunities or the closest opportunity case, which is production constrained, in the sense that outflows Oi are fixed  a priori and must be preserved for every location, and finally to the doubly-constrained models, where both the outflows Oi and the inflows Di in every region i are preserved.
In general, all of the described models share some common assumptions:
  1. Download: Download high-res image (504KB)
  2. Download: Download full-size image

Fig. 29. Performance comparison of the different models described in the text using the CPC metric defined in Eq. (57) applied to census commuting data from England and Wales (E&A), France (FRA), Italy (ITA), Mexico (MEX), Spain (SPA), USA. Additional census data at the city-scale was gathered from London (LON) and Paris (PAR). The different symbols and colors represent the different flavors of the model. The short-forms exp (exponential) and pow (power-law) refer to the functional forms of distance dependence.

Figure from [159].
  • 1.
    The trip distribution Pij that generate the flows Tij is independent of the trip production Oi.
  • 2.
    In both the unconstrained and singly-constrained models, choice of travel destinations are statistically independent (i.e there are no memory effects).
  • 3.
    Flows are estimated as a product of variables related to opportunities and distance (i.e. variables are “separable”).
Under these assumptions, it is possible to draw statistical laws that govern the distribution of travels Pij, and from thereon, build models with varying levels of constraints. This is important to adapt the models to the available information and to fairly compare between them.
One such comparison was carried out in [159], where commuting data was obtained from the census offices of England and Wales (E&A), France (FRA), Italy (ITA), Mexico (MEX), Spain (SPA), USA and then in shorter spatial scales, from the cities of London (LON) and Paris (PAR). Several variants of the gravity, intervening opportunities and radiation models were considered with multiple levels of constraints. The predicted flows Tij were compared with the empirical data using two related metrics: The common part of commuters (CPC) for all location pairs i,j with positive flows both in the empirical dataset and in the model prediction, defined as (57)CPC=i,jmin(Tijm,Tije)T=112i,j|TijmTije|T.Here T=i,jTije is the total number of commuters, Tijm the model prediction for the flow, and Tije the empirical value. The CPC is one, if the agreement is perfect and zero if there is no overlap between data and model. The ability of the models to recover the topological structure of the original network was assessed through the second metric, termed the common part of links (CPL) and defined as (58)CPL=2i,j1Ti,jm>01Ti,jei,j1Ti,jm>0+i,j1Ti,je>0,where 1Ti,j>0 is the Heaviside function. The CPL measures the proportion of links in common between the simulated and the observed networks, it is null if there is no link in common and one if both networks are topologically equivalent. The results in terms of the CPC are shown in Fig. 29. For this choice of datasets, the gravity model is seen to moderately outperform the other models. One must note of course, that this might change across different datasets.

4.3. Intermodality

A complete description of mobility must take into account the mutlimodal structure of transport and the transitions between the different modes as alluded to in Section 3.3. A good framework for carrying out such an analysis has been developed in recent years in the context of multilayer or multiplex networks [[190], [191]]. These are networks in which the nodes can be present in one or multiple layers, and each layer by itself contains a set of links or node–node interactions; if a large fraction of nodes in the network are present in all apparent layers, then it is termed multiplex. On the other hand if one finds different flavors of nodes restricted within their own putative layers, then the network is dubbed a multilayered one. Indeed, each layer of the network constitutes a separate network in its own right depending on the context of its interactions. For example, in social networks, layers may correspond to different groupings such as coworkers, family, different cohorts of friends and so on, all of which have within-layer interactions. Information diffusion in the network, however, occurs through each layer and across the full multilayer structure. Translating this to the case of mobility and transportation networks, layers may correspond to different transportation modes (road, subway, airline), while connections between layers constitute the interchanges between these modes. Within this context, the standard way to construct multilayer networks is to associate locations to nodes and flows, and frequency of travel (or simply unweighted simple connections) to links between different transportation modes (see, for example Fig. 30).
Traditionally, single-mode transport networks have been studied using mono-layers. Examples include metro networks in Beijing [193], Boston [194], London [[195], [196]], Montreal [197], Paris [195] and Seoul [198]; bus networks in China [[199], [200]] and Poland [201]; train networks in Boston, Vienna [202] and in India [203]; and combined public transportation networks in 14 cities around the world [204]. In most of these cases, the transport network was generated using different perspectives. One such perspective is the L-space representation, where nodes correspond to stations in the transportation map, and are linked together if they are consecutive stops in a transportation line. Similarly, the network can be built in the P-space framework, if nodes are stations linked together if they correspond to stops in the same line (irrespective of sequence). This latter representation corresponds to a one-mode projection of a bipartite network [69] consisting of stations and lines. (For a graphic explanation of this notation see [204].) In addition to geographical aspects, the link weights can also incorporate information on the number of passengers in the same route, among other features.
  1. Download: Download high-res image (871KB)
  2. Download: Download full-size image

Fig. 30. An example of a multilayered network approach to mobility. Public transport networks in the London metro area, separated into multiple layers consisting of the bus, subway and rail networks. In this particular instance, one can see that the London bus network is the most “used” layer.

Figure from [192].
The fact that a single transport network can be represented in multiple ways, naturally paved the way for the use of multilayer structures [[205], [206]]. In this case, each layer contains the expression of the network in a different space, and the multilayer allows for a deeper study of the network accessibility for travelers. As mentioned before, a multilayer network can be also employed to condense the transport information coming from different modes. An example for this can be seen in Fig. 30 which shows elements of the transportation network in London, combining the bus, train and metro networks, whereas additional elements including air and ferry can also be incorporated [192]. The more intricate topology of multilayer networks necessarily affects the spreading processes taking place on the system (diffusion and navigation) [[207], [208], [191]] that are modeled using the methods discussed in Sections 4.1 Individual-level (random walks), 4.2 Population-level. This leads to new features such as congestion [209], change in the navigability of the network subject to disruptions [[210], [211]], produces new dynamic regimes that are absent in monolayers [212] and abrupt reactions to minimal topological changes [213].
A particular type of multilayer representation of transport networks is worth a separate discussion. In this case, nodes still correspond to locations, and links are assigned according to one of the space-frameworks (L or P), but the weight of the links now account for the temporal duration of trips between the nodes. This network configuration, which some authors refer to as a PT-space, appear in each of the layers, where each layer usually corresponds to a single transport mode. With such a representation, the optimal time for a traveler traversing between a given OD pair, can be calculated using optimal path algorithms across the multilayer structure, containing the full transport network and taking into account intermodality. These types of multilayers have been studied, for example, in British [104], French [214] and Colombian [215] cities. In the British case, one detects a shift in preference for different transportation modes as well as the overlap between temporally and spatially optimal paths as a function of distance. In the French case, a few particular (temporally optimal) connections were singled out, so-called efficient connections, and their occurrence (and hence influence) in the general mobility modes were studied across a number of cities. Using a similar framework, with two layers corresponding to road and metro networks in the cities of London and New York, congestion effects as a function of varying the speed of metro lines was studied [216]. Finally, it is worth mentioning the point raised in [217], where the authors discuss the differences between building the network by considering a layer as a single transport mode or as a single line. The latter description provide a deeper view of the inefficiencies due to line transfer of the passengers, while the former is well-suited to study topological aspects and geographical coverage.

5. Selected domains of application

Here we discuss some selected applications of the frameworks, concepts, models and datasets introduced thus far. We organize this section by scale, ranging from single scale (mostly dealing with transportation modes), multi-scale applications with a focus on movement in cities and epidemic spreading, and finally end with some new developments related to virtual scales, i.e mobility patterns seen in online activity.

5.1. Single-scale

5.1.1. Pedestrian movement

Large concentrations of people such as those occurring during festivals, sport events or religious ceremonies can present a challenge in terms of service and transport demand as well as public safety. Avalanches in stadiums, processions and even subway stations have occurred in recent decades leading in high casualties [218]. In the face of this, strategies need to be devised for ensuring prompt evacuation in cases of emergency when considering the configuration of new buildings, public spaces and transport systems. Consequently, understanding the empirical patterns governing pedestrian movement and being able to build corresponding models, has the potential to benefit a wide range of disciplines including architecture, urban planning and public service stakeholders among others. Given the broad range of interest, a large amount of research has been devoted to this sphere, over the past few decades. Indeed, several reviews exist on this topic [[218], [219], [220], [221], [222], [223]], so instead of providing a detailed retrospect, we provide here an overview of the most recent developments.
Modeling frameworks on pedestrian movement can be broadly classified according to the approach used to consider individual pedestrians. For example, when the aim is crowd control, then usually the number of individuals considered is quite large, and the metrics of interest are crowd velocities or pressures. In such a case, one can define fields at every point in space and time to characterize the state of the system. These fields usually include the density of pedestrians ρ(x,t), the local velocity v(x,t) or the flow across an exit, J(x,t). In analogy with fluid dynamics, one sees a continuity equation of the form, (59)ρ(x,t)t=(ρ(x,t)v(x,t)),stemming from the conservation number of people in the crowd. The motion of pedestrians can be assumed to minimize the time spent in the system. This condition is included via the introduction of a field ϕ(x,t) coupled with the density in such a way that the local velocity shows a relation (60)v(x,t)=f(ρ)ϕϕ,where the gradient of ϕ determines the direction of motion and f(ρ) is a simple function of the density ensuring that the pedestrians speed decreases as the density increases. Possible choices are a linear relation, f(ρ)=ρρmax, keeping ρ<ρmax or a non linear one, f(ρ)=(ρρmax)2, among others [[224], [225], [226]]. The connection with ϕ is established through the function f(ρ) thus, (61)ϕ=1f(ρ),leading to the so-called Hughes model. In a scenario with individuals exiting a room, it produces two regimes in the pedestrian mobility depending on the global density, one with free flow and another with congestion. Furthermore, it turns out that shock-waves and moving congestion fronts emerge due to the narrowing of the crowd close to the door, leading to intermittent exit flow, avalanches and stop-and-go effects [225].
An alternative approach is to use agent based modeling where the state of every individual (agent) i is simulated in detail. This framework is especially applicable to simulations of heterogeneous populations in which the agents may show different features. The most popular model falling into this category is the so-called social force model proposed by Helbing and Molnár [[228], [229]]. The idea behind this model is to apply the analogues of Newton’s laws of motion to each pedestrian i. If the agent’s velocity is vi, the location ri at time t, then the acceleration can be expressed as (62)dvidt=fid+fia+firb+jifijr+η(r,t).The first term on the right hand side accounts for the tendency of the individuals to walk at a certain desired speed vi0. It is usually written as (63)fid=vi0viτ,which ensures the recovery of the desired speed in a characteristic reaction time τ. The second term fia is an attraction force depending on the agent’s position introduced to guarantee that walkers are compelled to a certain target or direction. The third force, firb, is repulsive depending on the agent’s location and takes into account the effects of the boundaries and other obstacles on the agent’s trajectory. The next term represents a repulsive force felt by the presence of other agents j different from i. Finally, η(r,t), is an uncorrelated noise term that introduces low levels of random fluctuations to avoid deadlocks.
  1. Download: Download high-res image (467KB)
  2. Download: Download full-size image

Fig. 31. Social force model simulation of a crowd trying to leave a room through a narrow door. (a) a representative configuration. (b), the sequence of leaving times of the agents as a function of v0. In (c) and (d), the total evacuation time and the average flow of people as a function of v0.

Figure from [227].
The social force model reproduces several phenomena observed in empirical circumstances. For example, when two groups of pedestrians moving in opposite directions meet in a corridor, unidirectional stripes are formed, or when they meet in a door, the flow displays intermittence due to clogging [229] (Fig. 31). Variations of this model have been used to analyze escape patterns from a building in low visibility conditions [[227], [218]], to assess evacuation plans in skyscrapers [[231], [232]], ships [233] and aircraft [234]. In relation to crowd control, they have also been used to analyze how to mitigate avalanches in the Hajj religious gathering in Saudi Arabia [[230], [235]], in the Notting Hill Carnival in London [236] or, along with mobility models for other transport media, to study evacuation of cities and populated areas in case of natural disasters such as flooding  [[237], [238], [239]]. Additionally, these models provide the basic tool to model scenarios such as the formation of Mosh and circle pits as a mark of collective motion in heavy music concerts [240], as well as the risk of disease propagation in massive gatherings [241].
  1. Download: Download high-res image (629KB)
  2. Download: Download full-size image

Fig. 32. Analysis of an avalanche event during the Hajj pilgrimage of 2006. (a) Representative trajectories of the laminar flow, the stop-and-go and the turbulent regimes for individual movements. (b) The velocity in the turbulent regime. (c) The “pressure” as a function of time. (d) Different distributions of speed increments in the two regimes. (e), Distribution of displacements between consecutive stops. (f) The structure function in the turbulent regime.

Figure from [230].
In the last decade, there has been a large volume of research related to the constituent terms of Eq. (62), including empirical calibration of the different coefficients as well as the introduction of a cognitive basis for the different forces [[220], [235]]. In high density regimes, such as those occurring in videos taken of the disaster of the Jamarat Bridge in the Hajj pilgrimage of 2006, the trajectories of single pedestrians become turbulent [[230], [235]] (Fig. 32). The original version of the social force model is unable to reproduce this turbulent regime and consequently modifications to the force terms in high density regimes, were introduced. One of these modifications is the centrifugal force model, so-called because of its similarity with the classical centrifugal force in rotating bodies, and involves a modification of the mutual repulsion term fijr thus, (64)fijr=miKijVij2rijeij.Here mi is the mass of pedestrian i, Kij is a coefficient enforcing the influence only of pedestrians in front of i on its movements and eij is the unit vector pointing from i to j [[242], [243]] . This modification has the effect of introducing both laminar as well as turbulent regimes, a feature missing in earlier versions of the social force model. Further refinements also included the interactions of individuals based on specific boundaries and shapes of obstacles [244].
A second modification to the social force model deals with the cognitive capacities and limitations of pedestrians and how they can affect the force terms of the model [[245], [246]]. Examples include analysis of the time scales regulating pedestrian interactions [247], the addition of a field to take into account the (partial) global perception of individuals and the environment under consideration [[248], [249]], or the role of leaders [250]. The capacity of the pedestrians to foresee the movements of others and the dangers of obstacles have been modeled within this framework in [[245], [251]]. The model has also been adapted to agents with an anisotropic perception of their environment [252] and its instabilities in a context of stop-and-go oscillations have been studied in [[253], [254]]. A systematic stability analysis of its numerical solutions has been provided in [255] together with a version of the model that notably increases its computational performance.
In addition to this, there exist a family of models that lie between the macroscopic and microscopic approaches, characterized by discretization of space into cells, and the specification of rules on how agents navigate between these cells. Such cellular automata models have the advantage of computational scalability, as for example, in [256] where simulations of crowds of size 105, were conducted with running time scaling linearly with the number of individuals. The price to pay for this increased computational performance is the loss of local details within each cell. These models have been used to study clogging in exits, and due to obstacles by adding a “friction term” between agents [[234], [257]], pedestrian flow through subsequent bottlenecks [258] and the formation of stripes in junctions with agents moving in different directions [259]. Some modifications to these models include, for example, behavioral factors of the agents such as path selection based on the time necessary to reach the destination [260], heterogeneous agents [261] and long-range spatial awareness [262].
While some of the models described thus far have been inspired by empirical data obtained from crowds in different contexts, challenges remain in actually monitoring crowds to get accurate and precise measurements on individual movement (see [222] for a recent review on methods used for this). One way to calibrate models is to take a controlled approach, i.e conduct experiments on a smaller scale, in which groups of people are directed to move under specific constraints. This controlled environment can provide insight into questions such as how pedestrians walk in social groups [266], the quantification of pressure and density in different types of bottlenecks and junctions [267], the perception of personal space [268], the clogging effects and instabilities in narrow areas [[269], [270]] and the asymmetric flow produced by stairs [271]. One factor that is common to such experiment-based research is the generation and analysis of “fundamental diagrams” [272] (Fig. 33), that represent the relationship between the average velocity v and the density ρ of pedestrians, as well the flux through a surface J (an exit) among other parameters. In the free flow regime, with no obstacles and low density, pedestrians tend to move at an optimal velocity. As the density increases, the velocity decreases until the crowd stops. Similarly, the flux through an exit also decreases with increasing pedestrian concentrations. The relations between these quantities are typically non-linear and are of great relevance for the design of buildings and urban spaces. Due to this, a large effort has been invested in the analysis of these diagrams. The results suggest while qualitatively the functional forms of v(ρ) and J(ρ) have common features across different settings, quantitative differences emerge based on extraneous factors. Some of these are cultural perception of personal space (a comparison between India and Germany was performed in [263]), the presence of T-junctions [273], the individual’s perception of open versus closed spaces [274] and the bidirectional flow of agents [275].
  1. Download: Download high-res image (284KB)
  2. Download: Download full-size image

Fig. 33. Fundamental diagram with the speed as a function of the density of pedestrians for several experiments in a closed loop circuit (ovoid-like). On the left, experiments in India with different loop lengths, plot originally taken from [263 ]. On the right, experiments in Germany with two sets of people: students and soldiers, figure originally taken from [264]. The composite figure comes from [265].

Here it is worth mentioning two works that taking advantage of new data sources coming from mobile applications have studied the mobility of people running and, in general, doing sport. The first one [276] by Althoff et al. analyzes sport habits for 700,000 people in 111 countries around the world. The authors find a relation between within country inequality in the exercise habits and the levels of obesity. Such inequality is connected, among other things, with the cities “walkability” and the presence of areas to run, which decreases the gender gap at the hour of running and contributes to reduce obesity. In the second [277], a similar dataset is employed to study how the social network affects sport patterns. The effects of gender, activity as well as redundancy of running habits in the contacts of the ego networks are considered in terms of on how much the individuals are influenced to run more frequently and longer.

5.1.2. Air transportation

On the opposite end of the spectrum from pedestrian mobility (which is necessarily short-range) is long-range transportation over thousands of kilometers, which is primarily represented by air transportation. Air transport is of paramount importance for modern global connectivity, playing a central role in the world economy and in the interchange of people, ideas and, unfortunately, also the propagation of diseases. While the study of such systems has a long tradition in the engineering community, primarily related to aeronautics and air traffic management [[279], [280]], recent years has provided further impetus to this line of research with the development of tools based on network science [69]. In this setting, initial studies were conducted from a static perspective, where airports are the nodes of the network, with a link between a pair of airports if a direct flight exists between them, discarding any directional information as well as temporal dynamics. The basic statistical features of such networks were analyzed, for instance, in [[281], [282], [278]] for the World Airport Network (WAN), in  [283] for the Chinese network and in [284] for Brazil. Fig. 34 shows the location of the high-degree nodes (maximum traffic) and those with the highest betweenness centrality (bottlenecks) in the WAN. Statistical distributions of the degree (number of destinations of the airports), the traffic (number of passengers per route or per airport) and betweenness were found to be typically heavy-tailed.
  1. Download: Download high-res image (471KB)
  2. Download: Download full-size image

Fig. 34. (a) The relation between node degree (traffic) and betweenness centrality (measure of load on a node) in the world airport network (WAN). (b) The location of the 25 most connected cities. (c) The top 25 cities in terms of the load of traffic (betweenness centrality).

Figure from [278].
Furthermore, it was found that the networks can be divided into modules with a certain level of self-contained traffic [278], although this division is not seasonably stable when the focus is set on smaller geographical scales such as the US network [285]. The average number of passengers per route wij was determined to be a non-linear function of the number of connections of the airports with the form (65)wijkikjθ,where ki and kj are the degree (number of connections) of the origin and destination airports, respectively, and θ12 [282]. An explanation of these measured topological properties has been proposed by simple models based on cumulative advantage [[281], [282]]. The robustness of these networks to disruptions has been considered in terms of a unipartite framework [286] as well as multiplex-like approaches [[287], [288]], where the network of each airline forms a layer and the airports appearing in the different layers connect the structure. A short review of network analysis on air transportation can be found in [289].
Of course the static topological framework does not take into account the dynamics occurring on the network. Indeed a key component of air transportation is that goods and people arrive in their destination in time. In particular logistical failures can propagate in the network where travel schedules are not met, with the delay propagating to effect a significant fraction of the system and paralyzing airport operations. This idea was first explored in the context of air transportation in [291], where the concept of delay multipliers was introduced. This can be seen as a combination of a delayed flight not only affecting the departure of other flights in the same airport, but also affecting connecting flights down the transportation chain, leading to the formation of avalanches or “delay-trees” [290] as shown in Fig. 35. Direct costs of flight delays in Europe amount to more than one billion euros [292], while in the US direct and indirect costs go beyond 40 billion dollars [293]. Delays directly affect airlines since they increase operational costs, but as an indirect factor they also bring associated reputational costs [[294], [295]]. Passengers, on the other hand, undergo a direct loss of time, which can be further exasperated by lost connections leading to missed business and leisure opportunities. Finally, efforts to recover delays airborne usually imply excess fuel consumption and larger CO2 emissions.
  1. Download: Download high-res image (147KB)
  2. Download: Download full-size image

Fig. 35. A typical configuration of a delay tree. Each node represents an airport and the links represent connections between them. The link weights denote temporal delays (in minutes) and the tree is composed of different levels or generations. Each node has a certain branching number (ρ) for the next level.

Figure from [290].
Given these challenges, it is important to characterize the sources of initial (primary) delays [[296], [297]] and devise mitigating strategies. Unfortunately this is a highly complex problem with the involvement of numerous (and sometimes unrelated) factors such as weather conditions, technical failures in the aircraft, aircraft rotations, crew and passenger connections, organizational issues in airports, and the mechanisms through which these delays are transferred and amplified in subsequent operations [[298], [291], [299]]. Airline schedules typically include a buffer time to absorb these delays [300], but there are frequent scenarios when this is not nearly enough. The Central Office for Delay Analysis of Eurocontrol (CODA) in Europe and its counterpart in the US, the Bureau of Transportation Statistics of the Department of Transportation, release monthly reports on flight performance including statistics on the major causes of delays and the airport affected. Based on these reports, studies have been conducted in single hubs such as Newark [301] as well as parts of the European [[280], [302]] and US [[295], [303], [290]] networks.
There have been primarily two approaches considered to model such effects. One class of models probe the network structure to search for weak points, analyzing jamming and congestion phenomena, especially trying to isolate those airports that cause cascades of delays that span a finite-fraction of the network [[305], [306], [307], [308]]. This approach has also been extended to multiplex networks of airports and airlines also attempting to identify carriers that are most responsible for delays [287]. In addition to these more-stylized approaches, there exist other families of models whose objectives are to provide realistic predictions that can be implemented in real-time. Such models are primarily agent-based and incorporate fine-grained details of air operations while sacrificing analytical tractability. Indeed, this has been the main approach adopted in the area of transport engineering related to air transport management (ATM). There are hundreds of contributions on this line mainly presented in sectoral conferences such as the USA/Europe ATM Seminar, the Air Transport Research Society (ATRS) Conference, ICRAT, the SESAR Innovation Days, etc., whose proceedings are accessible online as well as publications in journals such as Journal of Air Transport Management (JATM). The advances in this area cover topics such as the optimization of runway management, the effects of bad weather, sector congestion, how the advent of drone technology affects commercial air transportation, among many other minute details. While a comprehensive listing of all the findings would require a review by itself, we mention a few that are directly related to delay propagation in networks. Micro-modeling of the connection between air-transport logistics and delays was first explored in [[309], [310]]. Later a similar framework was used to assess the economic impact of a major disruption in a European hub [311]. Also in Europe, within the umbrella of the WPE of the Joint SESAR Undertaking, several projects as NEWO, TREE and POEM have studied the modeling of delay propagation from different perspectives: the first two from the network and airline managers’ and the latter from the passenger’s point of view.
  1. Download: Download high-res image (508KB)
  2. Download: Download full-size image

Fig. 36. Comparison between the congested cluster size as a function of time measured from empirical data. Congested airports are defined as those with an average delay per departing flight of over 29 min in intervals of one hour. The congested cluster is obtained from the largest connected component of the network formed by congested airports connected with direct flights during the day considered. The model has been tested with all the ingredients working or only with some of them to check their importance in delay propagation. (a)–(d) refer to variants of the model taking into account all or some of the factors.

Figure from [304].
In the same context, data-driven models have been developed to reproduce the delay propagation patterns in the US [312]. These models were validated against empirical performance data [313], showing an acceptable level of accuracy and precision in the prediction of delayed flights, identification of airports displaying major departure-delays, network-wide congestion and generalized bad weather conditions [304]. The models are built with an agent-based approach applied at the level of aircraft, with delays appearing at any point in the operations due to technical, airport organizational or weather-related issues incorporated as initial conditions to the model. The airport management is simulated with a system of queues that depend on the specific network considered (US or Europe). One of the primary finding of such analysis is that the mechanism with the strongest potential to effect delay is passenger and crew connections (Fig. 36).

5.1.3. Sea networks

Another example of long-range transportation are sea transportation networks, which have received comparatively less attention than their air traffic counterparts. While historically, sea transportation was the main mode of long-range human mobility, in recent times they mostly involve movement of cargo. Still, for sake of completeness we provide here a brief description on studies conducted on such networks. Typically such networks are studies as representing the ports as nodes, with links connecting them whenever major cargo liners travel from one to other [314]. The links typically are given weights that correspond to the number of ships (trips) along the route. The resulting network spans the globe, connecting different continents as shown in Fig. 37. Similar to that seen in air transportation networks, statistical distributions of typical topological properties are heavy-tailed, with only a few number of ports accounting for a majority of global traffic, with the majority playing a peripheral role [153]. The centrality of ports was studied in [315], where it was found that Singapore, Antwerp, Bhushan and Rotterdam top the list in terms of both traffic and load. A similar relationship between connectivity and traffic as in Eq. (65) was noted with an exponent of 1.46±0.1 [153]. Furthermore, the network can be segmented by the type of ship (containers, oil tankers, etc.) in terms of a multiplex network framework [316]. A systematic analysis on the similarities between the world airport network and the boat network was performed in [317], including a study of the resilience of both networks. From a modeling perspective certain aspects of the flows can be reproduced with a gravity model (Section 4.2.1), although there are some strong limitations. An important additional application of this network, besides the transport of goods, is the study of the invasion of species brought by carriers from different parts of the world [[318], [319]].
  1. Download: Download high-res image (864KB)
  2. Download: Download full-size image

Fig. 37. (a) The global boat cargo network consisting of ports as nodes, and connections between them when boats navigate from one to the other. The links are colored according to the volume of traffic and their shape is constructed from the geodesic distance between ports. (b) A map of the top 50 ports in terms of their betweenness centrality. Also listed are the top 20.

Figure from [153].

5.2. Multi-scale

5.2.1. Intra-urban mobility

One of the more important and much studied applications of mobility is that seen within cities. Indeed, there are good reasons for this. One is just the availability of more data. With rapid urbanization, an increasing part of the global population is living in urban areas. Large cities are early adopters of new technologies have large populations and consequently more mobile phone users and high densities of phone antennas. Consequently the spatial resolution of CDR data (Section 2.3) collected in cities is order of magnitude higher when compared to rural/non-urban areas. The same considerations hold for the data generated from social networking applications, such as Twitter or Foursquare. Indeed, the usage of these applications has been shown to have socially and spatially biases; they are more likely to be used by urban residents, educated individuals, young people and middle age employed adults [320].
The other of course is the unique set of challenges that abound from large agglomerations of people resulting in high population density. One of these is congestion that has an associated energy cost as measured for example by CO2 emissions. As a large part of human movement occurs in individual vehicles, accurate measurements of transportation flows are needed to infer how congestion affects carbon emissions. For example, it is known that CO2 emissions mainly depend on the time spent traveling, and in places of high density such as cities where congestion appears, traffic jams are responsible for carbon emissions that scale super-linearly with the population size [[321], [105]]. Yet another reason for studying intra urban mobility is related to that fact that cities typically have high levels of socioeconomic inequality among their residents, with residential segregation that is amplified by other forms of segregation, perceived in mobility practices and facilities that may differ depending on the neighborhood of residence [322]. Additionally, the multi-modal nature of transport networks in urban areas (which include privately owned vehicles, buses, metros, taxis, tramways, bike sharing systems, and pedestrians) gives rise to specific questions related to the navigability of these multi-layer networks. For example, the combination of the regularity of daily circadian rhythms and smart transport card technology provides data that allows for the study of intriguing phenomena, such as familiar strangers [323], and can be used to uncover the properties of “hidden” temporal networks of the city, formed by individuals that temporarily co-locate in the same spaces, but do not know each other (see Fig. 38).
There is an important body of mixed literature, quantitative and qualitative, on intra-urban mobility that relies upon longitudinal travel surveys, started more than 60 years ago. Covering it is beyond the scope of this review, and possible starting points for those interested may be found in  [[324], [325], [326]], which focus on North-American metropolises. Ref. [327] was one of the first papers to explicitly discuss the possibilities offered by individual digital footprints for urban analysis. It underlined that such data could be used to gain understanding of a number of phenomena associated to intra-urban mobility. Following this, a series of papers highlighted possibilities of real-time visualization and monitoring of displacements in cities [[328], [329], [96]].
  1. Download: Download high-res image (1MB)
  2. Download: Download full-size image

Fig. 38. Statistics of the co-location times for “familiar strangers”. In (a), typical contact network. In (b), inter-encounter distribution times. In (c), joint probability distribution of inter-encounter times. In (d), distribution time between encounters for groups.

Figure from [323].
The first studies that utilized metadata produced by handheld devices carried by individuals focused on a single city, and essentially consisted of visualizations and simple aggregated measures, providing information on the density of individuals using a device in various areas of the city and at various times of the day. Metadata provided by mobile phone operators contains information on the nationality of the phone number, which is also common on social networking user profiles. Consequently one of the first uses of such databases of individual geotagged data was to model the movements of foreign visitors in cities. [330] studied the location and mobility patterns of tourists in Rome, and subsequent studies highlighted the specific travel patterns of visitors, when compared to the movements of city residents, notably in Paris [[331], [332]], Jakarta and Singapore [333].
Another popular case study are ones which focus on the influence an area of residence has on intra-urban mobility patterns. Most of the first intra-urban studies using ICT data in these categories have focused on the organization of mobility in one or two large cities. One of the very first in this category was  [95], which used mobile phone CDR data recorded during a period D of 62 consecutive days in New York and Los Angeles. They compared aggregated mobility statistics; the individuals’ mean daily range – defined for each individual as the mean of the maximal distance traveled each day, dDmaxij(d(i,j))|D| – and the individuals’ maximal daily range – defined as maxD(maxij(d(i,j))) (see Fig. 39). They showed that residents of Los Angeles globally commute over further distances, but also highlighted heterogeneities; some residents in Manhattan commute farther than their counterparts in Los Angeles and the neighborhood of residence has an influence on the average journey-to-work commuting distance. Combining mobile phone data and odometer data (a distance measuring device for bikes and automobiles), [334] performed similar comparative measures between the two data sources, highlighting the influence of a number of built environmental factors on the travel distances of Boston residents. [335] went further by combining mobile phone data and census data which provided information on the proportion of racial groups in each of the cities’ census tracks. They showed that residential segregation was amplified by a mobility segregation in the urban areas of Boston and Los Angeles.
  1. Download: Download high-res image (232KB)
  2. Download: Download full-size image

Fig. 39. Boxplot with the daily mobility ranges in Los Angeles and New York.

Figure from [95].
Focusing on a single city, some authors have investigated the relation between morphology [336] and mobility patterns: [337] used smart cards data collected over several weeks in the London subway to provide measures linking the polycentric spatial structure of London and the organization of mobility patterns in the city. Relying upon mobile phone data collected during 2 months in 31 Spanish urban areas, [338] showed how the spatial organization of hotspots and the shape of cities evolve during the course of a typical weekday, as residents move inside the cities, highlighting different typical city forms (see Fig. 40). Thanks to a large Foursquare dataset that records GPS positioning data, [65] compared the distribution of jump lengths in 34 different cities worldwide. Fitting the distribution in each city, they conclude that at the city scale, human displacements cannot be properly fitted by a power-law distribution, a result in agreement with several other case studies held for particular cities, which found that the distribution of travel distances inside an urban area is more properly fitted by an exponential distribution P(Δr)exp(Δrr0). They propose an individual model derived from Stouffer’s intervening opportunities, that states that the important parameter for explaining destination choice in cities is not the distance by itself, but rather the density of possible alternative destinations in a perimeter of distance d. Given a set U of places in a city, the probability of moving from place uU to a place vU is formally defined as (66)Pr[uw]1ranku(v)a,where ranku(v)|{w:d(u,w)<d(u,v)}|, which can be seen as an urban counterpart to the radiation model proposed in the same year by [42] (see Section 4).
One of the most debated questions in the recent literature on human mobility has been the shape of the statistical distributions of the distances traveled by individuals P(Δr), and their travel times at various spatial and temporal scales and in different environments. Providing a neat characterization of these distributions is of extreme importance, and a prerequisite to modeling. Indeed, the distribution of travel distances P(Δr) is a key ingredient that any reasonable model should be able to reproduce and explain. [339] contains a very clear recap of the efforts of the last decade on this question, and a table summarizing the parameters values of the fitting distributions that have been proposed. These distributions have been characterized thanks to various sources of individual data (dollar bills, mobile phones, private car GPS, taxis, longitudinal travel surveys, and geolocated tweets, among others). In most cases, the fit of P(Δr) is consistent with the general form of a truncated power law distribution (67)P(Δr)(Δr+r0)βexpΔrκ,with κ=0 corresponding to the case of a non-truncated power-law, and β=0 corresponding to an exponential distribution of scale κ. The majority of papers propose truncated power law fits for P(Δr), and argue that the distribution of displacements follows an underlying Lévy-flight process.
  1. Download: Download high-res image (196KB)
  2. Download: Download full-size image

Fig. 40. Scatter plot with the number of hotspots detected as a function of the population of the cities. The colors of the curves and the points correspond to two different ways of defining the threshold marking an activity cell as hotspot.

Figure from [338].
The studies compared in [339] provided very different results for the parameters values β, r0 and κ of the general form above, and consequently they proposed different underlying processes for explaining the shape of this distribution. To settle the discrepancies, a convincing modeling framework, able to reproduce a number of stylized facts, and P(Δr) is required, and more research and cross-checks should be performed. Data accessibility is always an issue in this type of research, so one workaround passes is repeating the same measures on many different datasets, to enable a systematic comparison of the empirical results as done by [339].
Apart from this, some of the many aspects studied on intra-urban mobility relate to the typical distance scales of movement [63]; measures of travel duration [340]; effects of population size on the structure and dynamics of mobility [[105], [341]]; the effect of city topology on the displacements of its inhabitants (in terms of travel time, distance, and spatial organization of flows) [342] optimal ‘strategies’ for navigating multimodal transport networks [104], the efficiency of transport itineraries [343]; efficiencies in transportation based on ride-sharing and route-planning [[344], [345], [346]]; the level of complexity of transportation maps, which might exceed cognitive limits, and therefore explain sub optimal navigation by city residents [347].

5.2.2. Epidemic spreading

Perhaps one of the most important applications of studies on human mobility and transportation systems relates to epidemic spreading. While epidemiology as a discipline has several hundred years of tradition, only recently has it been possible to model the global spreading patterns of diseases in a realistic and reasonably accurate way. Of course, every disease has its own peculiarities, and requires tailored modeling with the inclusion of the specific mechanisms associated with it. Consequently, the literature in this area is extensive, with dedicated monographs and a number of reviews  [[348], [349], [350], [351]]. Here we discuss how recent knowledge of human mobility has influenced the latest generation of epidemiological models.
Whilst it has been known for some time that air transportation plays a major role in the global spreading of diseases [[353], [354]], the implementation of air traffic data in epidemic models has only been possible in recent times with the advances in information technologies and computing resources over the past decade or so [[355], [356], [357]], with the majority of studies focusing on the spread of influenza. The flu is a paradigmatic example of contact diseases that spreads annually cross the globe, and for which contagion is airborne with a patient clinical process that allows for relatively simple modeling. Indeed, improved surveillance systems have brought new insights on the temporal and spatial patterns of its propagation [[165], [358]].
  1. Download: Download high-res image (1MB)
  2. Download: Download full-size image

Fig. 41. In (a), shortest (most-likely) disease propagation pathways for an epidemic starting in Hong Kong. Deff stands for the “radial distance from the disease origin as defined in [352]”. In (b), time evolution of a simulated pandemic starting in Hong Kong. In (c), epidemic arrival time as a function of Deff in the simulation. In (d) and (e), same analysis but with data for the 2009 H1N1 flu pandemic and for the 2003 SARS outbreak.

Figure from [352].
Of course, new modeling frameworks are not restricted to the flu but can be also extended to other contact diseases such as SARS [359], mosquito-driven infections like malaria [360] or yellow fever [241], or even Ebola and other emerging health threats [[361], [362]]. In many such cases, the first instance of the disease in a particular location occurs through the arrival of infected humans from the source location via air transportation. The world airport network (WAN) can thus be considered as the backbone through which new emerging diseases arrive where connections with larger flows of passengers marking the most likely pathways for propagation [352] (see Fig. 41). Indeed, within the framework of the WAN (as discussed in Section 5.1.2), it is possible to estimate the potential of every airport as a source for a pandemic [363].
Concerning realistic models, two main frameworks have been used in the literature: agent-based modeling and metapopulations. The difference between the two lies in the level of information needed. Agent-based models attempt to generate synthetic population mimicking at an individual (agent) level the realistics aspects related to disease propagation. A detailed knowledge is therefore required of features such as location of residential areas, businesses and schools, leisure activities, population divisions by age, gender and sizes of households as well as individual mobility. These are typically sourced from census-like surveys and then a synthetic population is created assigning each individual to a household, a work/school place and a characteristic mobility profile. The earliest variants of such models were developed in the 1990’s within the Los Alamos project Transportation Analysis Simulation System or TRANSIMS. The objective of TRANSIMS was to reproduce via agent-based modeling the daily life of Portland, Oregon, including population, mobility, income levels and other details sourced from the US census [365]. This was later adapted to study the evolution of epidemics, called Epidemic Disease Simulation System or EPISIMS, which was used to study different scenarios to prevent the propagation of smallpox [366]. Similar models were applied to study the propagation of a strain of avian flu, H5N1, in South East Asia, where for instance, epidemic spreading was simulated with the source centered in Thailand (see Fig. 42), and including neighboring countries at a range of 100 kms containing about 80 million inhabitants (agents) [[364], [367]]. This was primarily done to assess the effect of several intervention measures (distribution of antivirals and reducing the social contacts) to halt the progress of the disease. Similar models have been applied to even larger populations in the US and UK [[368], [369], [370], [371]]. The level of granularity present in such models, enables minute analysis such as the effect of closing specific schools, employing different vaccination protocols (say by age) on disease propagation [[369], [370]]. The arrival of the 2009 flu pandemic provided further impetus to such modeling approaches and agent-based models have been now developed for most of Western European countries [[372], [373], [374], [375]] where a dazzling amount of scenarios have been exhaustively simulated including the risk of spread of a newly created virus strain from a lab following the case of the creation of a new H5N1 influenza strain [376]. One must note that the mobility aspect in such models relates to the movement of the individuals (agents) and is explicitly implemented both at long and short ranges involving multiple models of (simulated) transportation.
  1. Download: Download high-res image (895KB)
  2. Download: Download full-size image

Fig. 42. In (a), time evolution of a flu epidemic with R0=1.5 in the Southeast Asia. Infectious individuals are shown in red, while green ones are those already recovered or removed. In (b), daily incidence of the simulation on average in dark blue, several realizations in different colors and in gray the 95% confidence interval. In (c), distance from the origin of the disease. In (d), the proportion of infected people by age group. In (e), distribution of number of secondary cases produced per infectious individual in the early stages of the outbreak.

Figure from [364].
While agent-based models provide the best approximation to reality, they are severely limited by the fact that they require highly detailed input information, the data for which in many cases is noisy, unreliable or just simply unavailable. Furthermore, given the sheer complexity of such models, an analytical treatment is nearly impossible. On the other hand, the metapopulation framework (first introduced in the context of ecology [[377], [378]]), balances granularity with analytic tractability, and has easier to obtain data as input parameters [[353], [379], [380]]. In this model, the population of interest is divided into subpopulations, within which contacts are modeled by a fully-mixed mean-field approach, whereas contacts between subpopulations occur through the (measured) mobility networks. One of the earliest such models is the so-called Global Epidemic and Mobility model or GLEaM [381]. GLEaM is a global simulation platform that divides the globe in cells of 15 × 15 min of arc, which are grouped in basins of attraction around the major transportation hubs, which in most cases are airports. The mobility is then simulated at two levels: long range air transport, which is calibrated from the world airport network, and short-range mobility obtained from either census commuting data or estimated using the gravity law [167]. Fig. 43 displays the different components of the model.
Most notably, GLEaM was used to study the propagation patterns of the 2009 H1N1 influenza pandemic in real time. The pandemic started in a small population of Mexico early in that year [382], and its unfolding received attention of mass media as well as the scientific community, which brought unprecedented details on the characteristics of the first imported cases in several countries. This information allowed for the calibration of the infectivity and the seasonality in GLEaM in early May 2009 [381]. Once calibrated, the model predicted the disease propagation patterns: essentially, the arrival and prevalence peak times for the rest of the countries of the world, predictions that were later validated, after the full surveillance reports were available in 2012 [358]. Additionally, the model helped estimate the actual initial number of cases present in Mexico in the early stages of the disease [383]. Ever since, the model has become an important tool for subsequent studies that takes its results as a baseline (see, for example [[352], [363]]) and has been also used to assess the risk of spreading of new emerging health threats [[384], [362], [385]].
  1. Download: Download high-res image (819KB)
  2. Download: Download full-size image

Fig. 43. In (a), air transport network zoomed in for the US. In (b), commuting network in the same area. In both cases, the link weights are plotted using a heatmap code. In (c)–(f), comparison between flows coming from commuting data wD and those produced with a gravity model wM with the aim of extrapolating short-range mobility in zones where the data was not available.

Figure from [166].
The flexibility of the multilevel framework inherent in the model allows for the quantitative assessment of the differences between short-range and long-range traveling on the disease propagation. In particular it was determined that travel restrictions via decrease of the air traffic (or temporary quarantines) can at best delay the disease spread by a few days or weeks. Unless one totally shuts down the network, or severely restricts it (which of course carries a massive economic cost) global propagation is almost guaranteed [[386], [385]]. Newer versions of such models factor in facets of human behavior, such as the tendency of people to restrict contacts and travel when made aware of the existence of a dangerous disease [387], as well as heterogeneity in the population (age, gender, socioeconomic indicators) and its effect on travel patterns [388]. Possible future directions regarding these models include the possibility of feeding them with real-time ICT collected data on mobility [[389], [390], [20], [391]] or hybrid frameworks that merge aspects of agent-based and metapopulation modeling [372].

5.3. Virtual-scale

5.3.1. Web (online) mobility

Ostensibly, physical human mobility and virtual navigation are indeed activities of very distinct nature, yet there are a number of reasons why the latter can be cast in the context of the former. Indeed, many daily activities that occurred in specific physical locations, such as work, study, leisure, financial transactions, purchasing, are now increasingly migrating towards being conducted online. Moreover, the world wide web (www) has a humanitarian and social role, in the sense that it helps individuals and populations with limited mobility to have access (albeit virtually) to locations, people and information hitherto unreachable. For instance, the disabled [[392], [393], [394]] and the elderly [[395], [396], [397]], at an individual level, for whom the www represents a means of overcoming their physical limitations, or even entire populations, whose mobility is restricted for political or geographical reasons, for which the www is their only access to the outside world [398]. In this section we present an overview of the recent empirical evidence on the statistical similarities between virtual and physical navigation.
In the context of this parallel between virtual and physical mobility, we differentiate two classes of activities that although related, have practical and functional distinctions, that is, mobility in virtual spaces (for example, World of Warcraft, Second Life and The Sims) and navigation in virtual spaceless environments (for example, browsing the WWW or navigating through the interface of a device). The difference between these two classes of activities is given by the fact that in the first there is an explicit notion of space such that displacements between different locations come with a temporal cost, often proportional to some distance measure between them [400].
  1. Download: Download high-res image (172KB)
  2. Download: Download full-size image

Fig. 44. Mobility activities within the Pardus MMOG. The Mean Squared Displacement (main panel) suggests a subdiffusive process with exponent ν=0.26, whereas the return probabilities in number of discrete jumps τ (inset) behave as a power law with exponent α1.3.

(adapted from Szell et al.  [399]).
An empirical analysis of the behaviors and trajectories in the virtual space of the online game Pardus suggests a subdiffuse behavior in online mobility [399], similar to that observed in physical mobility [[81], [48]] (Fig. 44). Additionally, it appears that the return time (in number of discrete steps) of users of the Pardus game follows a power-law distribution with an exponential cut-off (inset of Fig. 44). These similarities between virtual and physical mobility were later empirically corroborated by other similar datasets. A comparative analyses between traces of World of Warcraft (WoW) players and GPS traces [401] revealed that the distributions of jump lengths and waiting times are both approximated by lognormal distributions (Fig. 45).
Furthermore, the analyses of CDRs along with the Web activities of 20,000 mobile phone users [402] suggest that visitation frequencies in both activities have a stretched exponential distribution (see Fig. 46). However, Web traces from mobile phone users tend to overrepresent a particular set of activities such as fact finding, information gathering and communication [403]. More recently, the analyses of complete Web browsing activities of 582 users, has revealed that it is possible to profile Web users based on their exploratory vs. exploitative behaviors in their browsing patterns [404]. More surprisingly, however, is the fact that the distribution of returners and explorers profiles are remarkably similar to the one observed in human mobility [89].
  1. Download: Download high-res image (822KB)
  2. Download: Download full-size image

Fig. 45. Jump lengths (top) and waiting-times (bottom) distributions within the virtual environment of World of Warcraft (left) and physical mobility from GPS traces (right).

(adapted from [400]).
The observed similarities between physical and virtual navigation is beginning to draw increased attention from the scientific community. Of course the analogy can onlybe taken so far, as compared to physical movement, virtual activity is relatively recent (on an evolutionary time scale) and much remains to be understood. It is reasonable, however, to believe that online activity is driven by the same psychological mechanism as off-line behaviors, now adapted to new environments or contexts. In fact, such intuition has increasingly found scientific support, both from a theoretical and empirical point of view, such as evidence of foraging behaviors in online shopping [405] and information consumption on the Web [[406], [407]], similar to that observed in human resource acquisition behaviors [408].
  1. Download: Download high-res image (337KB)
  2. Download: Download full-size image

Fig. 46. Visitation frequencies distribution in online (a) and physical (b) spaces.

(adapted from Zhao et al.  [402]).

6. Conclusions

In this work we discussed the state-of-the art in the field of human mobility. After a short historical description of the field, we focused on current work starting with the type of data utilized in most research projects, followed by the description of metrics and models of individual mobility and population mobility. We illustrated a list of selected applications of human mobility modeling, demonstrating the insights the field can bring to the solution of real-world problems.
It should be clear from the survey that the study, understanding, and modeling of human mobility is of primacy to many areas of society. At a deeper level the current research raises a fundamental question regarding human cognitive limitation. Among the many features described here, one in particular stands out: Why are we likely to return to frequently-visited locations or recently-visited locations? Does it relate to our ability to remember where we have been? Is it a innate characteristic of individuals or something that can be learned (i.e. nature vs. nurture)? The same can be said about regularities whereby human behavior is divided between those that are explorers or returners in their mobility pattern [89]. Indeed, this begs the interesting question, as to whether this observed division is an inherent property of society, say acting as a stabilizing influence? Indeed, this dichotomy is also found in other context, such as social learning, where it has been suggested that a population cooperating to solve a complex task can attain a better performance if it is composed by a mixture of explorers, individuals who try new solutions, and exploiters, individuals that have less propensity to abandon the optimally extant solutions [409]. More generally, the exploration vs exploitation dilemma is a fundamental issue in optimal decision-making and is analyzed in economics (multi-armed bandits) and computer science (reinforcement learning) and finds natural applications in marketing and advertisement, where recommendation algorithms determine whether people are likely to adopt familiar or unfamiliar products. All these problems require an understanding of human dynamics and many of these areas can possibly be studied using approaches inspired from research in human mobility. With the data we continue to collect as a society, and given the methodology developed to explain regularities in human mobility, we now have an additional set of mathematical tools to characterize the regularities in human activities more broadly.
A different structural dimension linked to human mobility relates to the invariant that humans are linked to each other forming social networks. Studies of social networks aim at grasping the multidimensional properties of the structures formed by entities (e.g., persons and organizations) and the ties connecting them together [410]. In fact, the intersection between human mobility and social networks goes beyond the obvious analogy that humans are social and their movement may be related to them being social [[411], [412], [413], [414], [415]]. The field of Social Network research is a huge enterprise in its own right and elucidating the connection between social networks and human movement is a nascent effort that should be explored in more depth.
An interesting research frontier in human mobility in the near future will likely concern the shift from traditional vehicles to autonomous, self-driving, vehicles. The diffusion of autonomous vehicles will transform private and public transportation, with sweeping consequences for society, the economy and the environment. How exactly will these transformations happen and how will mobility habits change in response to these new technologies? How can such vehicles be controlled and how does one plan routes in order to optimize fuel and time consumption, reducing congestion and pollution? Answers to these important questions, may well arise from a fuller understanding of human mobility and behavior at the individual and population (collective) levels.
Finally, it should be clear from this survey that interdisciplinarity and cross-fertilization is a condicio sine qua non to the success of the field. Computer scientists, physicists, social scientists, environmentalists, engineers, government officials and many other professions need to work together to develop and implement solutions to many of the problems society faces today that could benefit from understanding mobility (and dynamics) such as crime, urban planning, energy consumption, social integration, to name but a few.

Acknowledgments

HB, GG, RM and MT were partially supported by the US Army Research Office under Agreement Number W911NF-17-1-0127. FS is supported by EPSRC First Grant EP/P012906/1. JJR received funding from the Spanish Ministry of Economy, Industry and Competitiveness (MINEICO) and FEDER (EU) under the project ESOTECOS (FIS2015-63628-C2-2-R), and also from the European Commission H2020 project BigData4ATM (699260) under the SESAR Joint Undertaking.

Appendix. Modeling frameworks and algorithms

As we advance our understanding of the fundamental mechanisms governing human traveling behaviors, mobility models become increasingly more refined. Therefore, implementing those models requires more specialized tools such as efficient graph libraries, advanced spatial capabilities (GIS) and statistical routines, to name a few. Luckily, libraries for such purposes are widely available for the most popular languages, allowing researchers to focus their efforts (and grant resources) onto more relevant parts of their contributions. In this section we present several practical aspects related to the modeling and simulation of human traveling behaviors. Such details do not correspond necessarily to implementation details but rather to many of the basic building blocks from which we can construct all of the mobility models we presented in this survey.

A.1. Modeling frameworks

Computational models are becoming increasingly important in contemporary research, with applications ranging widely in scope, technique, and goals. Strictly speaking, any simulation model implemented and executed on a computer can be considered a computational model, from numerical models of complex dynamical systems to recurrent neural networks in modern applications of artificial intelligence. What all computer models have in common is the fact that they are described in such a way that can be unambiguously translated to low-level code to run on a computer. Methodological and implementation details vary in several dimensions and must be carefully defined considering the characteristics of the system or process being modeled, the objectives of the experiments, and the expected results.
It is important to emphasize that a mobility model can be composed of smaller modeling blocks such as random number generators, optimizers, individual-level dynamics to name but a few. The actual macrolevel dynamics results therefore from the combination of these smaller processes. For instance, a CTRW is composed of waiting-time and jump-length distribution models. Often times, more complex mobility models incorporate ingredients such as social interactions [[142], [143]] and environmental features [[184], [160], [186], [42]]. The soundness of a mobility model will thus rely heavily on the validity of those sub-components in representing the processes being modeled. Not surprisingly, many of the most groundbreaking models were based, at one level or another, on physical mechanisms and first principle formulations. However, what all those mobility models have in common is the fact that they are based on Monte Carlo methods, in the sense that they rely on generating repeated random samples as an approximation for the behavior of the system (in this case, human displacements).
Computational models can come in different flavors, varying in many aspects such as modeling approach and complexity, and are used to simulate human mobility to validate the analytical models by comparing data generated from the simulation with real world traces. Simulations can be grouped into three major modeling approaches: numerical, particle-based and agent-based models (ABMs). Also, there are simulation of discrete events and simulation based on system dynamics [416]. The former consists in conducting entities through processes or queues in order to analyze the general behavior exhibited by a system over time. In the latter, simulations are performed by analyzing the behavior of variables in systems of differential equations [417]. Historically, the first simulations were numerical in nature and although effective, in many cases the results of the simulations of the models did not have good adherence to reality, especially in cases where many abstractions are needed (i.e., intentional disregard of aspects of systems to simplify the model) [418]. The rise in popularity of ABMs, especially to model human mobility and social phenomenas, is due to the fact that they are more suitable when the simplicity of a Monte Carlo method is not sufficient or appropriate to represent more complex entities such as adaptive processes or local interactions. Also ABMs are especially useful when second-order dynamics or emergent phenomena are involved. In the context of human mobility, elements such as past-events memory, social networks and spatial capabilities make ABMs the natural modeling approach.
Agent-based models are a class of computational models that emphasizes interactions of entities to assess their effects on a system. The key point is that simple rules and micro-level interactions lead to the emergence of complex macroscopic regularities and organizations that represent the observable outcomes to allow the study of the dynamics of individual behavior. Computational agents, according to Wooldridge and Jennings [419], normally have the following characteristics:
    Autonomy
    The agents’ operation must take place without the direct control of its actions or internal states;
    Social behavior
    Agents must interact with other agents through some kind of standard communication;
    Reactivity
    Agents must be able to understand the environment and respond to it;
    Initiative
    In addition to reacting to the environment, agents are also able to take the initiative of an action, changing their behavior so as to fulfill a certain purpose.
Based on these aspects, simulations carried out with agent-based models allow for a deep investigation on the system from microlevel behaviors to macrolevel phenomena. Thus, agent-based models, depending on the kind of model to be designed, may show several benefits over other modeling techniques. According to Bonabeau [420], they may be summarized as follows:
    Production of emergent phenomena
    resulting from the interactions of individual entities.
    Possibility to offer a natural way to describe systems
    that are composed by interacting entities (e.g., stock markets).
    Be flexible
    because agents can be modeled from different levels and approaches, endowing them with different kinds of behaviors, degrees of rationality or learning skills.

A.2. Algorithms

In this sections we provide the pseudo-code of the algorithms required to implements some of the mobility models. We also provide pseudo code to generate random numbers sampled from specific probability distributions. The general framework we use to model individual mobility is agent based as introduced in Appendix A.1.

A.2.1. Individual mobility

The ubiquity of cellular phones leads to the availability of large datasetscapturing spatio-temporal traces of human trajectories. In the context of mobility modeling, the mathematical framework of Random Walks (RW), a class of stochastic random path models, has been extremely successful at reproducing some of the statistical properties of movements at a long-term limit. The mechanism at the core of random walks consists of a series of successive random events alternating displacements (i.e. movements) and changes of direction (Section 4.1).
Random walk.
A simple random walk model has been described in Section 4.1.1. A way to simulate a random walk on a computer is to let the agent move to a Ddimensional lattice. We can simulate a simple random walk of N steps on a 2-dimensional lattice (see Fig. A.47) by starting from the origin (0,0) of the space, and then iteratively choose one of the four possible directions available (up, down, left, right), and move of one unit of length in that direction. The process stops when we iterated N times. The procedure is illustrated in Algorithm 1. Note that It is crucial to choose each direction with the same chance or the random walk will be biased to some direction. For such purpose, modern computing devices and operating systems provides specific functions. In our case, we assume we have a function RANDOM(min,max) that is able to generate a uniformly distributed random number in the range [min,max). Such function is trivial enough to be implemented on any platform or any programming language.
  1. Download: Download high-res image (110KB)
  2. Download: Download full-size image
Lévy flight.
Lévy Flight have been described in details in Section 4.1.2. The main difference with the simple random walk is the type of distribution used to generate the jump length at each step, which follows a heavy-tailed distribution (e.g., power law). Therefore the Lévy flight is a succession of short jumps and some occasional long jumps (Fig. A.48). When trying to implement a Lévy flight on a computer there are several issue to take in account:
  1. Download: Download high-res image (131KB)
  2. Download: Download full-size image

Fig. A.47(a). (a) 5000 steps.

  1. Download: Download high-res image (183KB)
  2. Download: Download full-size image

Fig. A.47(b). (b) 25,000 steps.

Fig. A.47. Illustration of random walks in two dimensions. In A.47(a), we have 5000 discrete steps while in A.47(b), 25,000 discrete steps. In the figure, more traversed lines are represented by darker lines.

  1. Download: Download high-res image (70KB)
  2. Download: Download full-size image

Fig. A.48(a). (a) 5000 steps.

  1. Download: Download high-res image (122KB)
  2. Download: Download full-size image

Fig. A.48(b). (b) 25000 steps.

Fig. A.48. Sample trajectories of Lévy flights in two dimensions for β=0.6 and step length Δr1. In A.48(a), 5000 steps while in A.48(b), 25,000 discrete steps.

  • If there are RNG functions available to draw a number from a distribution, usually the power law distribution is not supported.
  • For a power law distribution we need to specify α and xmin values.
  • Continuous power law distribution is easy to deal with, but the discretization process may introduce an error.
To generate a random number drawn from a power law distribution we can use the transformation method [421]. Usually, on a computer we have a source of random reals r uniformly distributed in the interval [0,1) that are provided by a pseudo-random number generator. In this case, we can obtain a random number x drawn from a continuous probability density p(x) with a cumulative distribution function (CDF) P by obtaining the functional inverse of the CDF [422]: (A.1)x=P1(1r)where (A.2)P(x)=xxminα+1for a continuous power law, and (A.3)P(x)=ζ(α,x)ζ(α,xmin)for a discrete power law. Therefore we obtain (A.4)x=xmin(1r)1(α1)for the continuous case, which is straightforward to implement in any programming languages. However, the discrete form of P(x) cannot be inverted in closed form, therefore we solve P(x)=x=x+p(x)=1r instead by a combination of doubling up starting from x=xmin, which ensure that 1r[P(x),P(2x)), and binary search (see Algorithm 2). Note that we need to evaluate the generalized Riemann zeta function ζ to compute P(x) which might be computationally expensive, and it is usually done by using third party scientific libraries. When speed is important, techniques like dynamic programming can greatly improve algorithm speed by storing the result of previous evaluations that therefore need not to be computed again. Furthermore, if accuracy is not of great importance, which is the case for most applications where xmin>5, we can approximate the discrete power law to a continuous one. We first generate power law distributed numbers yxmin12 and then round it off to the nearest integer x=y+12 [422]: (A.5)x=xmin12(1r)1(α1)+12.It is straightforward to notice that the approximation is bigger for smaller value of x and therefore it is maximum when x=xmin. The error between the approximated distribution and the discrete one when x=xmin can be computed as: (A.6)Δp=1xmin+12xmin12α+1xminζ(α,xmin)
  1. Download: Download high-res image (131KB)
  2. Download: Download full-size image
Once we are able to generate power law distributed random numbers, we can modify the algorithm of the simple random walk to implement a Lévy walk by choosing a direction randomly and uniformly, an angle θ, and the length r of the jump (in this case from a continuous power law distribution). In this case we move away from a simple lattice to let the coordinates to be in the Euclidean plane R2 (Algorithm 3).
  1. Download: Download high-res image (78KB)
  2. Download: Download full-size image
Continuous-time random walk (CTRW).
A limitation of the RW models introduced so far is that time intervals between two successive steps are constant (Section 4.1.3). In order to simulate a CTRW we can modify Algorithm 3 to include the wait time τ between jumps (Algorithm 4). Notice that the wait time and jump length need not to be drawn from power law distributions, but CTRW with power laws has been shown to be useful to model human mobility [46]. Moreover, the power laws can have different parameters.
  1. Download: Download high-res image (126KB)
  2. Download: Download full-size image
Exploration–preferential return (EPR).
Although there is an extensive literature supporting the idea that human trajectories indeed follow reproducible scaling laws, the predictions of Lévy flight and CTRW models systematically deviate from empirical results. Song et al.  [81] proposed two key mechanisms, unique to human trajectories, that were missing from existing models: Exploration and Preferential Return (see Section 4.1.4). Another aspect that differs from previous study is the use of power law with exponential cutoff for the jump length and wait time (Fig. A.49(a)). Power law with cutoff does not have a closed-form expression. However, we can generate a random number drawn from a power law with parameters xmin,α and an exponential cutoff K with the following process [422]: generate an exponentially distributed random number x=xmin1λln1r, where r is a uniformly distributed number in the interval [0,1). Then, accept or reject it with probability p or 1p respectively, where p=xxminα. Repeating the process until a number x is accepted will guarantee that it will follow the desired distribution. This procedure is illustrated by Algorithm 5. Note that in the pseudo-code x and y are lists, therefore the operator +has the meaning of concatenation of the two operands. Finally, if we define S as the set of previously visited locations, Fi the number of times (frequency) that the location i has been visited, and ρ and γ the parameters that control the probability Pnew of an exploration jump then the EPR random walk is implemented by Algorithm 6. Note that while the x and y coordinates vary continuously, the space is actually divided in discrete “locations”, like Voronoi patches or squares of a grid. Therefore, different points in the plane may be part of the same location; to this purpose we use the function CURRENT-LOCATION(x,y) to obtain the location associated with the current coordinates of the agent.
  1. Download: Download high-res image (275KB)
  2. Download: Download full-size image
  1. Download: Download high-res image (285KB)
  2. Download: Download full-size image
Recency.
A further improvement over the EPR model has been introduced by Barbosa et al.  [114]. The main observation is that frequently visited locations are also recently visited locations (Fig. A.49(b)). However, recently visited locations might not always be frequently visited, because it might be that a location has been visited for the first time (Section 4.1.5). The algorithm to implement the recency mobility model is exactly the same as for EPR, but when deciding for a return jump we can choose either a frequency based return with probability α or a recency based return with probability 1α, where α is a parameter extracted from empirical data (Eq. (43), Algorithm 7).
  1. Download: Download high-res image (122KB)
  2. Download: Download full-size image
Social models.
It is intuitive to understand that the mobility of socially related individuals is correlated; for example friends often hang out and visit places together. Therefore, mobility models that include social network links and proximity based interactions were developed (see Section 4.1.6). Two prominent examples are the models described in [142] and [143], which can be seen as variations of the preferential return model (Section 4.1.4), where the preferential return is governed by the social strength between the users instead of the visitation frequency of the locations. Given the previously presented building blocks and algorithms, it should be trivial at this point to implement these models, therefore the explicit algorithms are omitted.
  1. Download: Download high-res image (151KB)
  2. Download: Download full-size image

Fig. A.49(a). (a) Individual mobility model.

  1. Download: Download high-res image (111KB)
  2. Download: Download full-size image

Fig. A.49(b). (b) Recency model.

Fig. A.49. The darker is the color of the dot, the more the location has been visited. A.49(a) In the Individual Mobility (IM) model proposed by Song et al. , a user goes back to frequently visited locations with increasing probability, therefore the color of the visited locations is mostly black. Furthermore, most of the visited locations are very close to the initial location from where the user movement started. A.49(b) In the recency model, a user can decide to go back to a recently visited location even though it has been visited only once or few times, therefore the visitation frequency is more evenly distributed. Furthermore, the spatial pattern shows several clusters further from the starting location, similarly to what happens with Lévy flights.

In [142], at every step each agent chooses to visit a social contact (i.e., another agent with whom the agent has a link) with probability pv. Otherwise, it chooses a new location with the complementary probability 1pv following a Lèvy-like flight. After the movement, the agent creates a new link with probability p in the “neighborhood”, that is the nearby agents, and probability pc with a random agent. An agent can easily keep track of the social links with the other agents by updating a list of the agent’s ids it meets.
In [143], the first part of the mobility model is identical to the preferential return model, where the agent returns to a previously visited location with probability 1ρSγ or visit a new location with probability ρSγ. However, it adds an additional “social” step. The actual choice of the location to visit, either new or returning, is made based on the social influence with probability α or, with probability 1α, just the individual preference (i.e., proportionally to the location visitation frequency). In case the location is chosen according to a social contact, the social contact j is chosen with a probability directly proportional to the current mobility similarity between the two agents. The mobility similarity between two agents i and j is defined as the cosine similarity in the location visiting profiles: cosθi,j=vivj|vi||vj|where vi and vj represent the total visits made by the agents to each location in the simulation space.

A.2.2. Population mobility

A simple way to formalize the flow of individuals from one location to another is to divide the area of interest into different zones labeled as i=1,,n and to count the number of individuals going from location i to location j, i. These numbers Tij are the elements of the so called Origin–Destination (OD) matrix (Section 3.1.5) (A.7)OD(t)=T11T12T1mT21T22T2mTn1Tn2Tnm.Such matrix defines a directed and weighted network, and in the general case is time-dependent. Note that usually n=m and Tij=0,i=j. The OD matrix is very different from a “segment” measurement, where we can easily count the number of individuals going through a link of the transportation system under consideration (e.g., one of the flights of the airline network, a segment of road, etc.). In contrast, the OD matrix is usually extremely difficult and costly to obtain and measure. There are obviously many factors which control the origin–destination matrix: land use, location of industries and residential areas, accessibility, etc.
In order to simulate the OD matrix using the multi-agent modeling framework we need to define a process that drives the movement of the agents from one location to another. Let us consider a discrete-time agent simulation for which at every time tick t we know the elements Tij of the matrix OD(t). Tij can be considered as a frequency count, therefore we can transform the elements of the OD matrix by computing the relative frequency Pij: Pij=TijijTij Pij represents the probability of an agent to move from location i to location j at a specific instant in time t. If we want to run a simulation for an amount of time τ, for each time instant t the simulation algorithm normalizes the matrix OD(t), then loop through each element Pij of the normalized matrix OD and move an agent, MOVE-AGENT(i,j), from i to j with the probability Pij (Algorithm 8). This framework is general and applies to any mobility model that uses origin–destination matrices such as the gravity models family (Section 4.2.1), intervening opportunities, and radiation model. However each model may be different in the way that the elements Tij are computed. It is also interesting to note that if we represent the movement of the agents over e.g., a map layer, then the flux of agents over the connection between two locations should represent and estimate of the intensity of the traffic.
  1. Download: Download high-res image (165KB)
  2. Download: Download full-size image

References

View Abstract