
A Multi-Scale Analysis of 27,000 Urban Street Networks: Every US City, Town, Urbanized Area, and Zillow Neighborhood

Geoff Boeing

29 Mar 2017 - arXiv: Social and Information Networks

TL;DR: This study illustrates the use of OSMnx and OpenStreetMap to consistently conduct street network analysis with extremely large sample sizes, with clearly defined network definitions and extents for reproducibility, and using nonplanar, directed graphs.


Abstract: OpenStreetMap offers a valuable source of worldwide geospatial data useful to urban researchers. This study uses the OSMnx software to automatically download and analyze 27,000 US street networks from OpenStreetMap at metropolitan, municipal, and neighborhood scales - namely, every US city and town, census urbanized area, and Zillow-defined neighborhood. It presents empirical findings on US urban form and street network characteristics, emphasizing measures relevant to graph theory, transportation, urban design, and morphology such as structure, connectedness, density, centrality, and resilience. In the past, street network data acquisition and processing have been challenging and ad hoc. This study illustrates the use of OSMnx and OpenStreetMap to consistently conduct street network analysis with extremely large sample sizes, with clearly defined network definitions and extents for reproducibility, and using nonplanar, directed graphs. These street networks and measures data have been shared in a public repository for other researchers to use.


Summary


Introduction

Cross-sectional analysis of American urban form can reveal these artifacts and histories through street networks at metropolitan, municipal, and neighborhood scales.
Second, reproducibility has been difficult when the dozens of decisions that go into analysis—such as spatial extents, topological simplification and correction, definitions of nodes and edges, etc.—are ad hoc or only partly reported (e.g. Porta et al., 2006; Strano et al., 2013).
First, it describes and demonstrates a new methodology for easily and consistently acquiring, constructing, and analyzing large samples of street networks as nonplanar directed graphs.
Third, it investigates with large sample sizes some previous smaller-sample findings in the research literature.

Methodology

Street networks can be conceptualized as primal, directed, nonplanar graphs.
Planar graphs may reasonably model the street networks of old European town centers, but poorly model the street networks of modern autocentric cities like Los Angeles or Shanghai with many grade-separated expressways, bridges, and underpasses (Boeing, 2018b).
OSMnx is a Python-based research tool that easily downloads OpenStreetMap data for any place name, address, or polygon in the world, then constructs it into a spatially-embedded graph-theoretic object for analysis and visualization (Boeing, 2017).
The authors retain only the urbanized areas subset of these data (i.e. areas with greater than 50,000 population), discarding the small urban clusters subset.
The second set of geometries defines their municipal-scale study sites using 51 separate TIGER/Line shapefiles (again, 2016) of US Census Bureau places within all 50 states plus DC.

Street network measures

The network’s average node degree quantifies connectedness in terms of the average number of edges incident to its nodes.
It measures the average number of physical streets that emanate from each node (i.e. intersection or dead-end).
In total, this study cross-sectionally analyzes 27,009 networks: 497 urbanized areas’ street networks, 19,655 cities’ and towns’ street networks, and 6857 neighborhoods’ street networks.
These sample sizes are larger than those of any previous similar study.

Metropolitan-scale street networks

Table 1 presents summary statistics for the entire data set of 497 urbanized areas.
The gridlike San Angelo, TX urbanized area has the most streets per node (3.2) on average, and (outside of Puerto Rico, which contains the seven lowest urbanized areas) the sprawling, disconnected Lexington Park, MD urbanized area has the fewest (2.2).
The relationship between fine-grained networks and connectedness/gridness is not, however, clear-cut: intersection density has only a weak, positive linear relationship with the proportion of four-way intersections in the urbanized area (r² = 0.17).
Densities and average distances such as intersection density and the average street segment length exhibit only moderate heterogeneity.
Due to the substantial variation in urbanized area size, from 25 to 9000 km2, the preceding analysis covers a wide swath of metropolitan types.

Municipal-scale street networks

Table 3 presents summary statistics of street network characteristics across the entire data set of 19,655 cities and towns—every incorporated city and town in the US.
The latter’s small sample size may limit the generalizability of this finding.
These distributions comprise the lognormal, Gumbel, gamma, exponentiated Weibull, Fréchet, power-law, uniform, and exponential distributions.
An exception to this general pattern, of course, lies in consistently-sized orthogonal grids filling a city’s incorporated spatial extents.
The authors find that such cities are not uncommon in the US, particularly between the Mississippi River and the Rocky Mountains: the Great Plains states are characterized by a unique street network form that is both orthogonal and reasonably dense.

Neighborhood-scale street networks

The authors have thus far examined every urban street network in the US at the metropolitan and municipal scales.
While the metropolitan scale captures the emergent character of the wider region’s complex system, and the municipal scale captures planning decisions made by a single city government, the neighborhood best represents the scale of individual urban design interventions into the urban form.
A few neighborhoods have no intersections within their Zillow-defined boundaries, resulting in a minimum intersection density of 0 across the data set.
Nationwide, the typical neighborhood averages 2.9 streets per intersection, reflecting the prevalence of three-way intersections in the US, discussed earlier.
Some central San Francisco orthogonal grid networks with many four-way intersections—such as Downtown, Chinatown, and the Financial District—have surprisingly low ANCs: 1.5, 1.3, and 1.6, respectively.

Discussion

These findings suggest the influence of planning eras, design paradigms, transportation technologies, topography, and economics on US street network density, resilience, and connectedness.
The median average circuity is lower across the neighborhoods data set than across the municipal set, which in turn is lower than across the urbanized areas set.
This analysis finds a strong linear relationship, invariant across scales, between total street length and the number of nodes in a network.
The spatial signatures of the Homestead Act, successive land use regulations, urban design paradigms, and planning instruments remain clearly etched in these cities’ urban forms and street networks today.

Conclusion

First, it presented empirical urban morphological findings from metric and topological analyses of the street networks of every US city/town, urbanized area, and Zillow neighborhood—particularly focusing on density, connectedness, and resilience.
Second, its methods demonstrate the use of OSMnx as a new street network research toolkit, suggesting to urban planners and scholars new methods for acquiring and analyzing data consistently and at scale.
Third, it built on past findings about the distribution of street segment lengths and the relationship between the total street length and the number of nodes in a network.
This study has made all of these network datasets (for 497 urbanized areas, 19,655 cities and towns, and 6857 neighborhoods), along with all of their attribute data and morphological measures, available in an online public repository for other researchers to study and repurpose.


Figures (10)

Table 1. Central tendency and statistical dispersion for selected measures of all US urbanized areas’ street networks: μ is the mean, σ is the standard deviation, and D is the dispersion index σ²/μ.

Figure 3. The linear relationship between total street length and number of nodes in the street networks of every US urbanized area, city/town, and Zillow neighborhood.

Table 5. Central tendency and statistical dispersion for selected measures of all the neighborhood-scale street networks: μ is the mean, σ is the standard deviation, and D is the dispersion index σ²/μ.

Figure 1. Intersection density and average streets per node per urbanized area in the contiguous US.

Table 3. Central tendency and statistical dispersion for selected measures of all incorporated cities and towns in the US: μ is the mean, σ is the standard deviation, and D is the dispersion index σ²/μ.

Figure 4. Contiguous US states by median of mean streets per node and by median of mean street segment length in municipal street networks.

Table 4. Median values, aggregated by state plus DC, of selected measures of the municipal-scale street networks for every city and town in the US.

Figure 2. Distribution of node types in 9 urbanized areas, with number of streets emanating from the node on the x-axis and proportion of nodes of this type on the y-axis.

Table 2. Selected measures of the 30 largest (by land area) urbanized areas’ street networks.

Figure 5. Square-mile comparisons of central cities and their suburbs. Left: top, downtown Philadelphia, PA; bottom, its suburb, King of Prussia. Middle: top, downtown Portland, OR; bottom, its suburb, Beaverton. Right: top, downtown San Francisco, CA; bottom, its suburb, Concord.


UC Berkeley

UC Berkeley Previously Published Works

Title

A multi-scale analysis of 27,000 urban street networks: Every US city, town, urbanized area, and Zillow neighborhood

Permalink

https://escholarship.org/uc/item/80n7572n

Journal

Environment and Planning B: Urban Analytics and City Science, 47(4)

ISSN

2399-8083 2399-8091

Author

Boeing, Geoff

Publication Date

2018-08-08

DOI

10.1177/2399808318784595

Data Availability

The data associated with this publication are available at:

https://dataverse.harvard.edu/dataverse/osmnx-street-networks

Peer reviewed



Geoff Boeing

University of California, USA

Abstract

OpenStreetMap offers a valuable source of worldwide geospatial data useful to urban researchers. This study uses the OSMnx software to automatically download and analyze 27,000 US street networks from OpenStreetMap at metropolitan, municipal, and neighborhood scales: namely, every US city and town, census urbanized area, and Zillow-defined neighborhood. It presents empirical findings on US urban form and street network characteristics, emphasizing measures relevant to graph theory, transportation, urban design, and morphology such as structure, connectedness, density, centrality, and resilience. In the past, street network data acquisition and processing have been challenging and ad hoc. This study illustrates the use of OSMnx and OpenStreetMap to consistently conduct street network analysis with extremely large sample sizes, with clearly defined network definitions and extents for reproducibility, and using nonplanar, directed graphs. These street networks and measures data have been shared in a public repository for other researchers to use.

Keywords

GIS, network analysis, OpenStreetMap, street networks, urban form, urban morphology

Corresponding author:
Geoff Boeing, School of Public Policy and Urban Affairs, Northeastern University, 360 Huntington Ave, 310 Renaissance Park, Boston, MA 02115, USA.
Email: g.boeing@northeastern.edu

EPB: Urban Analytics and City Science, 2020, Vol. 47(4) 590–608. © The Author(s) 2018.

Introduction

On 20 May 1862, Abraham Lincoln signed the Homestead Act into law, making land across the United States’ Midwest and Great Plains available for free to applicants (Porterfield, 2005). Under its auspices over the next 70 years, the federal government distributed 10% of the entire US landmass to private owners in the form of 1.6 million homesteads (Lee, 1979; Sherraden, 2005). New towns with gridiron street networks sprang up rapidly across the Great Plains and Midwest, due to both the prevailing urban design paradigm of the day and the standardized rectilinear town plats used repeatedly to lay out instant new cities (Southworth and Ben-Joseph, 1997). Through path dependence, the spatial signatures of these land use laws, design paradigms, and planning instruments can still be seen today in these cities’ urban forms and street networks. Cross-sectional analysis of American urban form can reveal these artifacts and histories through street networks at metropolitan, municipal, and neighborhood scales.

Network analysis is a natural approach to the study of cities as complex systems (Masucci et al., 2009). The empirical literature on street networks is growing ever richer, but suffers from some limitations, discussed in detail in Boeing (2017) and summarized here. First, sample sizes tend to be fairly small due to data availability, gathering, and processing constraints: most studies in this literature that conduct topological or metric analyses tend to have sample sizes ranging around 10 to 50 networks (Barthelemy and Flammini, 2008; Buhl et al., 2006; Cardillo et al., 2006; Strano et al., 2013), which may limit the generalizability and interpretability of findings. Second, reproducibility has been difficult when the dozens of decisions that go into analysis (such as spatial extents, topological simplification and correction, definitions of nodes and edges, etc.) are ad hoc or only partly reported (e.g. Porta et al., 2006; Strano et al., 2013). Third, and related to the first two, studies frequently oversimplify to planar or undirected primal graphs for tractability (e.g. Barthelemy and Flammini, 2008; Buhl et al., 2006; Cardillo et al., 2006; Masucci et al., 2009), or use dual graphs despite the loss of geographic, metric information (Batty, 2005; Crucitti et al., 2006a, 2006b; Jiang and Claramunt, 2002; Ratti, 2004).

This study addresses these limitations by conducting a morphological analysis of urban street networks at multiple scales, with large sample sizes, with clearly defined network definitions and extents for reproducibility, and using nonplanar, directed graphs. In particular, it examines 27,000 urban street networks (represented as primal, nonplanar, weighted multidigraphs with possible self-loops) at multiple overlapping scales across the US, focusing on structure, connectedness, centrality, and resilience. It examines the street networks of every incorporated city and town, census urbanized area, and Zillow-defined neighborhood in the US. To do so, it uses OSMnx, a new street network research toolkit (Boeing, 2017), to download, model, and analyze these street networks at metropolitan, municipal, and neighborhood scales. These street networks and measures data sets have been compiled and shared in a public repository at the Harvard Dataverse for other researchers to use.

The purpose of this paper is threefold. First, it describes and demonstrates a new methodology for easily and consistently acquiring, constructing, and analyzing large samples of street networks as nonplanar directed graphs. Second, it presents empirical findings of descriptive urban morphology for the street networks of every US city, urbanized area, and Zillow neighborhood. Third, it investigates with large sample sizes some previous smaller-sample findings in the research literature. This paper is organized as follows. In the next section, it discusses the data sources, tools, and methods used to collect, model, and analyze these street networks. Then, it presents findings of the analyses at metropolitan, municipal, and neighborhood scales. Finally, it concludes with a discussion of these findings and their implications for street network analysis, urban morphology, and city planning.

Methodology

A network (also called a graph in mathematics) comprises a set of nodes connected to one another by a set of edges. Street networks can be conceptualized as primal, directed, nonplanar graphs. A primal street network represents intersections as nodes and street segments as edges. A directed network has directed edges: that is, edge uv points one-way from node u to node v, but there need not exist a reciprocal edge vu. A planar network can be represented in two dimensions with its edges intersecting only at nodes (O’Sullivan, 2014; Viana et al., 2013). Most street networks are nonplanar (due to grade-separated expressways, overpasses, bridges, tunnels, etc.), but most quantitative studies of urban street networks represent them as planar (e.g. Barthelemy and Flammini, 2008; Buhl et al., 2006; Cardillo et al., 2006; Masucci et al., 2009; Strano et al., 2013) for tractability because bridges and tunnels are uncommon in some cities. Planar graphs may reasonably model the street networks of old European town centers, but poorly model the street networks of modern autocentric cities like Los Angeles or Shanghai with many grade-separated expressways, bridges, and underpasses (Boeing, 2018b).
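
The planarity point can be illustrated with a small sketch. It is not taken from the paper or from OSMnx: the node names, coordinates, and helper functions below are hypothetical, and edges are assumed to be straight lines in a flat coordinate system. The sketch finds pairs of edges that cross in the drawing without sharing a node, which is what an overpass looks like in a spatially embedded graph.

```python
from itertools import combinations

def orientation(a, b, c):
    """Return +1, -1, or 0 for a left turn, right turn, or collinear triple a -> b -> c."""
    cross = (b[0] - a[0]) * (c[1] - a[1]) - (b[1] - a[1]) * (c[0] - a[0])
    return (cross > 0) - (cross < 0)

def edges_cross(edge1, edge2, coords):
    """True if two straight edges cross at a point that is not a shared node."""
    if set(edge1) & set(edge2):
        return False  # edges that meet at a common node are fine in a planar drawing
    a, b = coords[edge1[0]], coords[edge1[1]]
    c, d = coords[edge2[0]], coords[edge2[1]]
    # A proper crossing puts c and d on opposite sides of ab, and a and b on opposite sides of cd.
    return (orientation(a, b, c) * orientation(a, b, d) < 0
            and orientation(c, d, a) * orientation(c, d, b) < 0)

# Hypothetical toy network: an expressway (w -> e -> x) passing over a local street (s -> n).
coords = {"w": (0, 1), "e": (2, 1), "x": (3, 1), "s": (1, 0), "n": (1, 2)}
edges = [("w", "e"), ("e", "x"), ("s", "n")]

crossings = [(e1, e2) for e1, e2 in combinations(edges, 2) if edges_cross(e1, e2, coords)]
print(crossings)
```

A planar model of this toy network would have to place a node where the expressway crosses the street, creating an intersection that does not exist on the ground. Keeping the two edges separate avoids that.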

Study sites and data acquisition

This study uses OSMnx to download, construct, correct, analyze, and visualize street network graphs at metropolitan, municipal, and neighborhood scales. OSMnx is a Python-based research tool that easily downloads OpenStreetMap data for any place name, address, or polygon in the world, then constructs it into a spatially-embedded graph-theoretic object for analysis and visualization (Boeing, 2017). OpenStreetMap is a collaborative worldwide mapping project that makes its spatial data available via various APIs (Corcoran et al., 2013; Jokar Arsanjani et al., 2015). These data are of high quality and compare favorably to CIA World Factbook estimates and US Census TIGER/Line data (Frizzelle et al., 2009; Haklay, 2010; Maron, 2015; Over et al., 2010; Wu et al., 2005; Zielstra and Hochmair, 2011). In 2007, OpenStreetMap imported the TIGER/Line roads (2005 vintage) and since then, many community-led corrections and improvements have been made (Willis, 2008). Many of these additions go beyond TIGER/Line’s scope, including passageways between buildings, footpaths through parks, bike routes, and detailed feature attributes such as finer-grained street classifiers, speed limits, etc.

To define the study sites and their spatial boundaries, we use three sets of geometries. The first defines the metropolitan-scale study sites using the 2016 TIGER/Line shapefile of US Census Bureau urban areas. Each census-defined urban area comprises a set of tracts that meet a minimum density threshold (US Census Bureau, 2010). We retain only the urbanized areas subset of these data (i.e. areas with greater than 50,000 population), discarding the small urban clusters subset. The second set of geometries defines our municipal-scale study sites using 51 separate TIGER/Line shapefiles (again, 2016) of US Census Bureau places within all 50 states plus DC. We discard the subset of census-designated places (i.e. small unincorporated communities) in these data, while retaining every US city and town. The third set of geometries defines the neighborhood-scale study sites using 42 separate shapefiles from Zillow, a real estate database company. These shapefiles contain neighborhood boundaries for major cities in 41 states plus DC. This fairly new data set comprises nearly 7000 neighborhoods, but as Schernthanner et al. (2016) point out, Zillow does not publish the methodology used to construct these boundaries. However, despite its newness it already has a track record in the academic literature: Besbris et al. (2015) use Zillow boundaries to examine neighborhood stigma and Albrecht and Abramovitz (2014) use them to study neighborhood-level poverty in New York.

For each of these geometries, we use OSMnx to download the (drivable, public) street network within it, a process described in detail in Boeing (2017) and summarized here. First OSMnx buffers each geometry by 0.5 km, then downloads the OpenStreetMap "nodes" and "ways" within this buffer. Next, it constructs a street network from these data, corrects the topology, calculates street counts per node, then truncates the network to the original, desired polygon. OSMnx saves each of these networks to disk as GraphML and shapefiles. Finally, it calculates metric and topological measures for each network, summarized below. Such measures extend the toolkit commonly used in urban form studies (Ewing and Cervero, 2010; Talen, 2003).
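
The order of the steps above (buffer, count streets per node, then truncate) can be sketched in plain Python. This is a simplified illustration, not OSMnx code. It assumes a rectangular study boundary in planar kilometre coordinates instead of a real polygon, and every name and coordinate is hypothetical. It shows one plausible benefit of counting before truncating: a node just inside the boundary keeps the streets that lead out of the study area.

```python
def in_box(point, box):
    """True if an (x, y) point lies inside an (xmin, ymin, xmax, ymax) box."""
    x, y = point
    xmin, ymin, xmax, ymax = box
    return xmin <= x <= xmax and ymin <= y <= ymax

def clip(nodes, edges, box):
    """Keep nodes inside the box and edges whose two endpoints both survive."""
    kept = {n: xy for n, xy in nodes.items() if in_box(xy, box)}
    return kept, [(u, v) for u, v in edges if u in kept and v in kept]

def street_counts(nodes, edges):
    """Count physical streets per node, treating u->v and v->u as one street."""
    counts = {n: 0 for n in nodes}
    for u, v in {tuple(sorted(e)) for e in edges}:
        counts[u] += 1
        counts[v] += 1
    return counts

study_box = (0.0, 0.0, 10.0, 10.0)  # hypothetical study site, in km
buffer_km = 0.5                     # buffer distance stated in the text
buffered_box = (study_box[0] - buffer_km, study_box[1] - buffer_km,
                study_box[2] + buffer_km, study_box[3] + buffer_km)

# Node "b" sits just inside the boundary; its neighbor "c" sits just outside.
nodes = {"a": (5.0, 5.0), "b": (9.8, 5.0), "c": (10.3, 5.0),
         "d": (9.8, 9.0), "e": (9.8, 1.0)}
edges = [("a", "b"), ("b", "a"), ("b", "c"), ("c", "b"), ("b", "d"), ("b", "e")]

# Buffered order: count on the buffered network, then truncate to the study box.
buf_nodes, buf_edges = clip(nodes, edges, buffered_box)
counts = street_counts(buf_nodes, buf_edges)
final_nodes, final_edges = clip(buf_nodes, buf_edges, study_box)
buffered_counts = {n: counts[n] for n in final_nodes}

# Naive order: truncate first, then count, which undercounts node "b".
naive_nodes, naive_edges = clip(nodes, edges, study_box)
naive_counts = street_counts(naive_nodes, naive_edges)

print(buffered_counts["b"], naive_counts["b"])
```

In this toy case the buffered order records node "b" as a four-way intersection, while the naive order would report it as three-way.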

Street network measures

Brief descriptions of these OSMnx-calculated measures are discussed here, but extended technical definitions and algorithms can be found in e.g. (Albert and Barabási, 2002; Barthelemy, 2011; Brandes and Erlebach, 2005; Costa et al., 2007; Cranmer et al., 2017; Dorogovtsev and Mendes, 2002; Newman, 2003, 2010; Trudeau, 1994). The average street segment length is a linear proxy for block size and specifies the network’s grain. Node density divides the node count by the network’s area, while intersection density excludes dead-ends to represent the density of street junctions. Edge density divides the total directed network length by area, while street density does the same for an undirected representation of the graph (to not double-count bidirectional streets). Average circuity measures the ratio of edge lengths to the great-circle distances between the nodes these edges connect, indicating the street pattern’s curvilinearity (cf. Boeing, 2018a).
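
As a rough illustration of the circuity measure, the sketch below computes great-circle distances with the haversine formula and divides total edge length by total great-circle distance. Aggregating as a ratio of sums is an assumption here, as are the Earth radius constant, the function names, and the toy coordinates. None of this is the paper's code.

```python
import math

EARTH_RADIUS_M = 6_371_009  # assumed mean Earth radius in meters

def great_circle_m(lat1, lon1, lat2, lon2):
    """Haversine great-circle distance in meters between two lat/lon points in degrees."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlam = math.radians(lon2 - lon1)
    h = math.sin(dphi / 2) ** 2 + math.cos(phi1) * math.cos(phi2) * math.sin(dlam / 2) ** 2
    return 2 * EARTH_RADIUS_M * math.asin(math.sqrt(h))

def average_circuity(nodes, edges):
    """Total edge length divided by total great-circle distance between edge endpoints."""
    total_length = sum(length for _, _, length in edges)
    total_straight = sum(great_circle_m(*nodes[u], *nodes[v]) for u, v, _ in edges)
    return total_length / total_straight

# Hypothetical nodes as (lat, lon) and edges as (u, v, length in meters).
nodes = {"a": (37.00, -122.00), "b": (37.00, -121.99), "c": (37.01, -121.99)}
straight_ab = great_circle_m(*nodes["a"], *nodes["b"])
straight_bc = great_circle_m(*nodes["b"], *nodes["c"])
edges = [
    ("a", "b", straight_ab),        # a perfectly straight street
    ("b", "c", 1.5 * straight_bc),  # a winding street, 50% longer than the straight line
]

circuity = average_circuity(nodes, edges)
print(round(circuity, 3))
```

A value of 1 would mean every street runs straight between its endpoints. Larger values indicate a more curvilinear pattern.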

The network’s average node degree quantifies connectedness in terms of the average number of edges incident to its nodes. The average streets per node adapts this for physical form rather than directed circulation. It measures the average number of physical streets that emanate from each node (i.e. intersection or dead-end). The distribution and proportion of streets per node characterize the type, pervasiveness, and spatial dispersal of network connectedness and dead-ends. Connectivity represents the fewest number of nodes or edges that will disconnect the network if they are removed and is thus an indicator of resilience. A network’s average node connectivity (ANC), the mean number of internally node-disjoint paths between each pair of nodes, more usefully represents how many nodes must be removed on average to disconnect a randomly selected pair of nodes (Beineke et al., 2002; Dankelmann and Oellermann, 2003). Brittle points of vulnerability characterize networks with low average connectivity.
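
The difference between average node degree and average streets per node can be seen on a toy directed network. The sketch below is a simplified illustration with hypothetical node names. It assumes that a two-way street is stored as two reciprocal directed edges and that parallel edges between the same pair of nodes collapse into one physical street. OSMnx's own implementation may differ.

```python
def average_node_degree(nodes, edges):
    """Mean of in-degree plus out-degree; each directed edge adds one to both endpoints."""
    return 2 * len(edges) / len(nodes)

def streets_per_node(nodes, edges):
    """Count physical streets touching each node, ignoring edge direction."""
    counts = {n: 0 for n in nodes}
    for u, v in {tuple(sorted(e)) for e in edges}:
        counts[u] += 1
        counts[v] += 1
    return counts

# Hypothetical four-way intersection "x" with four dead-end stubs.
nodes = ["x", "n", "s", "e", "w"]
edges = [
    ("x", "n"), ("n", "x"),  # two-way street
    ("x", "s"), ("s", "x"),  # two-way street
    ("x", "e"),              # one-way street leaving the intersection
    ("w", "x"),              # one-way street entering the intersection
]

degree = average_node_degree(nodes, edges)
counts = streets_per_node(nodes, edges)
avg_streets = sum(counts.values()) / len(nodes)
dead_end_share = sum(1 for c in counts.values() if c == 1) / len(nodes)

print(degree, avg_streets, dead_end_share)
```

The directed degree counts each two-way street twice, so it is larger than the streets-per-node figure. The latter describes the physical form: one four-way intersection and four dead-ends.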

A node’s clustering coefficient represents the ratio between its neighbors’ links and the maximum number of links that could exist between them (Jiang and Claramunt, 2004; Opsahl and Panzarasa, 2009). The weighted clustering coefficient weights this by edge length and the average clustering coefficient is the mean of the clustering coefficients of all the nodes. Betweenness centrality evaluates how many of the network’s shortest paths pass through some node (or edge) to indicate its importance (Barthelemy, 2004; Huang et al., 2016; Zhong et al., 2017). A network’s maximum betweenness centrality (MBC) measures the share of shortest paths that pass through the network’s most important node: higher maximum betweenness centralities suggest networks more prone to inefficiency if this important choke point should fail. Finally, PageRank ranks nodes based on the structure of incoming links and the rank of the source node (Agryzkov et al., 2012; Brin and Page, 1998; Chin and Wen, 2015; Gleich, 2015; Jiang, 2009).
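
The unweighted clustering coefficient described above can be sketched for a small undirected graph. The adjacency data are hypothetical, and assigning a coefficient of 0 to nodes with fewer than two neighbors is an assumed convention, not one stated in the text. The length-weighted variant is not shown.

```python
from itertools import combinations

def clustering(adj, node):
    """Links among a node's neighbors divided by the maximum possible number of such links."""
    neighbors = adj[node]
    k = len(neighbors)
    if k < 2:
        return 0.0  # assumed convention: no neighbor pairs means a coefficient of 0
    links = sum(1 for a, b in combinations(neighbors, 2) if b in adj[a])
    return links / (k * (k - 1) / 2)

def average_clustering(adj):
    """Mean of the clustering coefficients of all nodes."""
    return sum(clustering(adj, n) for n in adj) / len(adj)

# Hypothetical graph: a triangle a-b-c with a dead-end d attached to a.
adj = {
    "a": {"b", "c", "d"},
    "b": {"a", "c"},
    "c": {"a", "b"},
    "d": {"a"},
}

print(clustering(adj, "a"), average_clustering(adj))
```

Node "a" has three neighbors and only one of the three possible links among them exists, so its coefficient is 1/3. Nodes "b" and "c" each score 1 and "d" scores 0, which gives an average of 7/12.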

In total, this study cross-sectionally analyzes 27,009 networks: 497 urbanized areas’ street networks, 19,655 cities’ and towns’ street networks, and 6857 neighborhoods’ street networks. These sample sizes are larger than those of any previous similar study. The following section presents the findings of these analyses at metropolitan, municipal, and neighborhood scales.

Results

Metropolitan-scale street networks

Table 1 presents summary statistics for the entire data set of 497 urbanized areas. These urbanized areas span a wide range of sizes, from the Delano, CA Urbanized Area’s 26 km


