EUBUCCO

European building stock characteristics in a common and open database for 322+ million individual buildings.

EUBUCCO v0.2 Now Available!
100% Attribute Completeness
>50% Footprint Coverage Improvement
Parquet Release (S3 Compatible)

Dataset overview

One harmonized database, 55 open sources

EUBUCCO unifies governmental cadastres, OpenStreetMap, and Microsoft footprints into a single, consistent, analysis-ready database — every building carries the same schema, with its data source and attribute provenance recorded.

322M+individual buildings
30countries · EU-27 + UK, NO, CH
55open datasets harmonized
5core attributes per building
  • Government registries 62.2%
  • Microsoft footprints 20.4%
  • OpenStreetMap 17.4%
Preparing map…
Open the coverage explorer →

Buildings by country

Stacked by data source · all 30 countries (scroll)

Attribute coverage

Share of buildings carrying each attribute, by provenance

Type100%
Subtype100%
Height100%
Floors100%
Constr. year15.9%
  • Ground truth
  • Merged
  • ML estimated

Ground truth from official registries; merged from other footprint datasets; the remainder inferred with machine learning.

A building record

See full data schema in docs →

Identifiers
idunique ID
region_idNUTS3 code
city_idLAU code
Attributes
typebinary use type
subtypedetailed use
heightmeters
floorscount
construction_yearyear
Geometry
geometryWKB, EPSG:3035
Provenance
<attr>_sourceorigin dataset
<attr>_source_idssource records
<attr>_confidenceuncertainty

From the continent down to the building

Every footprint carries its height and use type information. Here illustrated for Zürich.  Open the map explorer →

Preparing 3D view…

Why it exists

Built for urban-scale research

EUBUCCO gives researchers a centralized, harmonized basis for high-resolution urban sustainability studies across scales — from continental comparisons down to single neighborhoods. It underpins use cases such as energy-system modelling, climate and natural-hazard risk assessment, and urban morphology.

How it was made

This dataset is described in a companion Data Descriptor published in Scientific Data, which details the methodology and technical validation. All code used to generate the dataset is openly available through the EUBUCCO GitHub organization.

If you use the data for your project, please cite:

BibTeX

People

Hosted at the Potsdam Institute for Climate Impact Research (PIK) and the Technical University Berlin, in the research group headed by Prof. Dr. Felix Creutzig. Development since v0.2 is led by Florian Nachtigall, with Nikola Milojevic-Dupont and Felix Wagner previously serving as project leads.

Nikola Milojevic-Dupont

Nikola Milojevic-Dupont

Previously Project Lead

Florian Nachtigall

Florian Nachtigall

Project Lead

Felix Wagner

Felix Wagner

Previously Project Lead

Funding & contact

Funded by the Climate Change Center Berlin Brandenburg and the CircEUlar project of the European Union's Horizon Europe research and innovation programme (grant agreement 101056810).

For any question or suggestion, please open a GitHub issue or write to nachtigall(at)tu-berlin.de.