Collection of data sources for the CorrelAid Urban Planning X Spatial Data Hackathon with City Interaction Lab in 2021.
Origin-destination analysis is important for a deeper understanding of mobility and of many processes in the city. However, just a pure origin-destination matrix analysis is not sufficient since it does not provide us enough information about the trip itself. By using additional information, which can be extracted from the openstreetmaps API we can detect and perform a more in-depth origin-destination matrix analysis.
This repo contains the following data sets:
Thanks to the Deutsche Bahn Open Data Portal, we can access a data set of bikesharing trip start and end from 2014-2017 in several German cities, originating from the station-based bike rental "Call A Bike".
For this hackathon an extract of the first two weeks of August 2016 in Hamburg is available ("bikeshare_trips_hh.csv"), containing trips and 208 stations in Hamburg. The data is already cleaned and contains only relevant variables:
- city_rental_zone: name of the city
- trip_id: unique identfier for each trip between two stations
- bike_id: unique identifier for a rental bike
- customer_id: unique identifier for a customer (no further information about users is provided)
- start_station_id: unique identifier of station where bike was rented from
- end_station_id: unique identifier of station where bike was returned (loop trips are possible)
- datetime_start: start time and date of rental
- datetime_end: end time and date of rental
Additionally, a data set of all 208 stations, their unique identifier, their name and latitude and longitude can be found in "bikeshare_stations_hh.csv".
More information about the original data set can be found here (German only).
R: osmdata
osmdata is a useful R-package for simple osm-queries. Check out the code below for some examples (see the documentation linked above for more info).
library(osmdata)
library(dplyr)
library(sf)
library(leaflet)
library(ggplot2)
# check available tags (sub categories) inside keys ----------------------
available_features()
available_tags("amenity")
available_tags("shop")
available_tags("highway")
# get all traffic lights in Hamburg --------------------------------------
## highway also includes bus stops, for example
query_traffic_signal <- getbb('Hamburg') %>%
opq() %>%
add_osm_feature(key = 'highway', value = 'traffic_signals') %>%
osmdata_sf()
# extract points only
traffic_lights <- query_traffic_signal$osm_points
# plot
ggplot(landuse_sf) +
geom_sf()
# get all cinemas in Hamburg ----------------------------------------------
query_cinema <- opq(bbox = 'Hamburg') %>%
add_osm_feature(key = 'amenity', value = 'cinema') %>%
osmdata_sf()
# extract points only
cinemas <- query_cinemas$osm_points
# plot
ggplot(cinemas) +
geom_sf()
# get all supermarkets in Hamburg ----------------------------------------
query_supermarket <- opq(bbox = 'Hamburg') %>%
add_osm_feature(key = 'shop', value = 'supermarket') %>%
osmdata_sf()
# extract points only
supermarkets <- query_supermarket$osm_points
# plot
ggplot(supermarkets) +
geom_sf()
The data for landuse in Hamburg is provided as a shapefile in the repo as a shapefile available on the geofabrik download server. For R users, it is also available as a sf-dataframe (landuse.rds).
# read in landuse for Hamburg
landuse_sf <- readRDS("data/landuse_sf.rds")
ggplot(landuse_sf) +
geom_sf(aes(color=fclass))
Pyrosm and osmnx are both Python libraries that can be used to extract openstreetmap data (and do other things).
Pyrosm makes use of .pbf-files it gets from places like geofabrik. In its docs there is quite a few examples how to get different kinds of data.
Here are a few examples for Hamburg:
import geopandas as gpd
from pyrosm import OSM, get_data
# Download data for Hamburg
fp = get_data("Hamburg")
# need to do this too to actually download it
print("\nDownload will happen:")
fp = get_data("Hamburg", update=True)
print(fp)
# initialize query to data
osm = OSM(fp)
# EXAMPLES
## get bus stops in Hamburg
busstop_filter = {"highway": ["bus_stop"]}
busstops = osm.get_pois(custom_filter=busstop_filter)
# get all amenities and shops (like in pyrosm docs)
custom_filter = {'amenity': True, "shop": True}
pois = osm.get_pois(custom_filter=custom_filter)
# check which amenity types there are
pois.amenity.unique()
# check which shop types there are
pois.shop.unique()
# one of the amenities available are restaurants! Filter geodataframe for restaurants
restaurants = pois[pois.amenity == "restaurant"]
# plot on a map
ax = restaurants.iloc[0:100,].plot(markersize=3, figsize=(12,12), legend=True, legend_kwds=dict(loc='upper left', ncol=5, bbox_to_anchor=(1, 1)))
osmnx and its "geometries_from_place" - function also connects directly to the osm API and lets you extract points of interest.
If those libraries don't work for you, geofabrik has all the data for Hamburg in one file in different formats (shp for example) that can be read in Python using geopandas. dj