This project aims at predicting human density combining Landsat-8 satellite images and Machine Learning.
Images are taken directly from U.S. Geological Survey website. One can query images with criteria like :
Here after, a screenshot of France query
and the 70 resulting datasets (thumbnails) projected, queried with minimum possible cloud covering (<20%), day acquisition and between May to September 2013.
Each dataset is composed of 11 bands (1 pixel = 30x30 meters). NDVI (Normalized Difference Vegetal Indice) is a combination of band 4 (red) and band 5 (red-edge) that make vegetation in evidence according to (values span from -1 and 1) :
We then expect high values (rich vegetation) to explain low density areas, while low values (poor vegetation) should explain high densities. Below, RGB and NDVI images for different densities:
Histogram of NDVI (1024 bins) is then finally taken as city descriptor. Superposition of various histograms shows clearly a relation between NDVI behavior and density :
Precise densities can be taken from French official institute census (INSEE). This census is from 2013, which perfectly matches our Landsat-8 datasets. We categorize densities in order to deal with a classification problems (6 categories)
We can now go through supervised learning. France data has been used for training/validation. Belgium, Netherlands and Switzerland have been used for testing model generalization.
{:class=”img-responsive
Code is avalaible in my github at following link. There is a complete tutorial on how to make a density prediction of any geographic area. Japan is taken as example.