Google AI Releases The Open Buildings Dataset, A New Open-Source Dataset Containing The Locations And Footprints Of >500M Buildings Across Africa

Google uses artificial intelligence technology to find millions of buildings on the satellite map that were previously difficult to locate. These can now be used for humanitarian aid or other purposes. Google utilized its building detection model (“Continental-Scale Building Detection from High Resolution Satellite Imagery) to create an Open Buildings dataset, containing locations and footprints of 516 million buildings with coverage across most African continent countries.

In this data set, there are millions of buildings that have not been discovered in the past. These newly-discovered building materials will help the outside world understand African populations and where they live, facilitating health care services such as education or vaccination to their communities.

Google’s team of developers built a training set for their building detection model by manually labeling 1.75 million buildings in 100k images to make the most accurate identification possible, even when dealing with rural or urban environments that have vastly different properties and features. The need to identify what kind of dwelling place is being captured was especially difficult during scoping missions in remote areas where natural landmarks were plentiful. At the same time, dense surroundings made it hard to differentiate between multiple structures on an aerial image at once.

Google researchers relied on a bottom-up strategy to train their model. They first classified each pixel as either building or nonbuilding and then grouped these pixels into individual instances before training the U-Net architecture, which is commonly used in satellite image analysis. The advantage of this type of system is its compact design for handling large quantities of imaging data without placing too much strain on computing power. This was important since the application of this algorithm to continental-scale satellite imagery requires a tremendous amount of computing power.

The data set provides the exact location, polygon outline, size, and confidence score of each building as a building. Furthermore, it includes information on open location code but does not include any identifying information such as street address or type. Google also excludes sensitive areas in their dataset to protect vulnerable ethnic groups. 

This research work is part of our AI for Social Good efforts and was led by Google Research, Ghana.




A message from Asif Razzaq, Co-founder of Marktechpost:

Show your support for our mission ‘making AI understandable for all’ by joining/connecting through our 34k+ FB GroupLinkedIn Page and Quora AI Group.

Advertisement/Sponsored Post:

If you are a company looking to promote your product/webinar/conference/service, feel free to reach out via email to We offer sponsored posts and advertisements.

Asif Razzaq is the CEO of Marktechpost Media Inc.. As a visionary entrepreneur and engineer, Asif is committed to harnessing the potential of Artificial Intelligence for social good. His most recent endeavor is the launch of an Artificial Intelligence Media Platform, Marktechpost, which stands out for its in-depth coverage of machine learning and deep learning news that is both technically sound and easily understandable by a wide audience. The platform boasts of over 2 million monthly views, illustrating its popularity among audiences.

🚀 LLMWare Launches SLIMs: Small Specialized Function-Calling Models for Multi-Step Automation [Check out all the models]