
calculate green view index for each image and attach score as attribute to point #8

Closed
danbjoseph opened this issue Feb 5, 2024 · 7 comments

@danbjoseph (Member) commented Feb 5, 2024

from the Treepedia project, the green view index is calculated in GreenView_Calculate.py
see also this paper: li2015.pdf

please see the previous steps as described in the README and follow those conventions for reading data in and out. we want the process to be geo-file agnostic to the extent possible. and modular so that in the future we can add or modify steps, etc.

the geofile used in the readme example can be downloaded here: https://drive.google.com/file/d/1fpI4I5KP2WyVD5PeytW_hoXZswOt0dwA/view?usp=sharing

we want to read in the main output file (if following the example in the readme, it will be a geopackage), which will have a collection of points and associated metadata.
we want to use the image_path to read in the image associated with each point, if available - note that some points will have a "None" value for image_path and "NULL" for image_id (however, please use a more robust check than watching for a string value of "None", as the indicator of no match may change in the future). calculate the GVI and then store the value back into the dataset in a new column.
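One way to implement that "more robust check" is a small predicate that handles both real missing values (None/NaN as read from the geopackage) and string sentinels. This is a sketch; `has_image` is a hypothetical helper name, not part of the existing codebase:

```python
import pandas as pd

def has_image(image_path):
    """Return True only if image_path looks like a usable path.

    Catches real missing values (None, NaN from the geopackage) as well
    as string sentinels like "None"/"NULL", so the check keeps working
    if the no-match indicator changes representation in the future.
    """
    if pd.isna(image_path):  # catches None and NaN
        return False
    path = str(image_path).strip()
    return path not in ("", "None", "NULL", "null", "nan")

print(has_image(None))             # False
print(has_image(float("nan")))     # False
print(has_image("None"))           # False
print(has_image("images/abc.jpg")) # True
```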

not needed if you're using the readme data, but some examples of images we want to use include these attached photos:

[attached example image 1]

[attached example image 2]

[attached example image 3]

@banjtheman

Was able to use the Claude Sonnet Vision API to provide a score; here is an example output:

{
"gvi": 10,
"reason": "The image shows a residential area with buildings, fences, and a road. There are some small trees and shrubs visible along the road and around the buildings, but overall, the green vegetation coverage appears to be minimal, likely around 10% of the visible area in the image."
}

@banjtheman

If we have some images with "known" GVI, I'd like to baseline it against the vision API.

@danbjoseph

Hey @banjtheman, thanks for exploring options! I should add some notes to the readme - we want to prioritize things that are transparent, efficient, open, low or no cost, and can be run locally if possible.

  • The GVI in Treepedia is a researched methodology - see this published paper - that will produce repeatable results. My understanding is that the same prompt to an LLM will not necessarily result in the same output every time. Additionally, in your result we can guess that the "10%" mentioned in the reason field is linked to the "10" in the gvi field, but that is only an assumption and not a documented process.
  • Access to the Claude Sonnet Vision API may have a free tier or be otherwise subsidized, but training and running LLMs has been noted as quite costly and I don't trust these companies to not raise prices in the near future.
  • Users of our tool may have slow and/or expensive internet. If we can avoid additional bandwidth requirements, it will make it easier for them.
  • A more specialized image segmentation model could possibly do the same (or better) analysis more efficiently and possibly be something that could run on a laptop.

For the MVP, I would like to match what the Treepedia project did. There are plans to add other analysis options into the tool chain. If you think the Claude Sonnet Vision API offers unique advantages, please post your thoughts in #10.

@banjtheman

Thanks for the added context!!!! Will add the Claude stuff to #10

Hmm, I couldn't access the paper - do you have a copy somewhere? Would want to see if we can codify it.

@danbjoseph

li2015.pdf

@banjtheman

banjtheman commented Mar 7, 2024

Was able to convert the old GreenView_Calculate.py to use some modern frameworks that all run locally.
Gets this for the first image

Green view score:
17.177629470825195

import cv2
import numpy as np
from skimage.filters import threshold_otsu


def get_gvi_score(image_path):
    """
    Calculate the Green View Index (GVI) for a given image file.

    Args:
        image_path (str): Path to the image file.

    Returns:
        float: The Green View Index (GVI) score for the given image.
    """
    # Load the image (cv2.imread returns None if the file can't be read)
    original_image = cv2.imread(image_path)
    if original_image is None:
        raise FileNotFoundError(f"Could not read image: {image_path}")

    # Convert to RGB color space
    rgb_image = cv2.cvtColor(original_image, cv2.COLOR_BGR2RGB)

    # Calculate ExG (Excess Green)
    r, g, b = cv2.split(rgb_image.astype(np.float32) / 255)
    exg = 2 * g - r - b

    # Apply Otsu's thresholding on ExG
    threshold = threshold_otsu(exg)
    green_pixels = (exg > threshold).sum()
    total_pixels = original_image.shape[0] * original_image.shape[1]

    # Calculate the Green View Index (GVI)
    gvi_score = (green_pixels / total_pixels) * 100

    return gvi_score


print("Green view score:")
print(get_gvi_score("example_greenview_image.jpg"))

@ioalexei
Contributor

Using the above sample I fleshed this out - had to do it on a sample of 10 points as I don't have enough space to download all the mapillary images for the dataset. Is this in the right area of what you're after?

import pandas as pd
import geopandas as gpd 
import os
import cv2
import numpy as np
from skimage.filters import threshold_otsu

def get_gvi_score(image_path):
    """
    Calculate the Green View Index (GVI) for a given image file.

    Args:
        image_path (str): Path to the image file.

    Returns:
        float: The Green View Index (GVI) score for the given image.
    """
    # Load the image (cv2.imread returns None if the file can't be read)
    original_image = cv2.imread(image_path)
    if original_image is None:
        raise FileNotFoundError(f"Could not read image: {image_path}")

    # Convert to RGB color space
    rgb_image = cv2.cvtColor(original_image, cv2.COLOR_BGR2RGB)

    # Calculate ExG (Excess Green)
    r, g, b = cv2.split(rgb_image.astype(np.float32) / 255)
    exg = 2 * g - r - b

    # Apply Otsu's thresholding on ExG
    threshold = threshold_otsu(exg)
    green_pixels = (exg > threshold).sum()
    total_pixels = original_image.shape[0] * original_image.shape[1]

    # Calculate the Green View Index (GVI)
    gvi_score = (green_pixels / total_pixels) * 100

    return gvi_score

# Set the directory with the mapillary images 
img_dir = "./data/raw/mapillary" # replace with path to mapillary images 

# Make an empty dataframe to hold the data
df = pd.DataFrame({"filename": [], "gvi_score": []})

# Loop through each image in the Mapillary folder and get the GVI score 
for i in os.listdir(img_dir):
    gvi_score = get_gvi_score(os.path.join(img_dir, i))

    temp_df = pd.DataFrame({"filename": [i], "gvi_score": [gvi_score]})

    print(i, "\t", str(gvi_score))

    df = pd.concat([df, temp_df], ignore_index=True)

# Create an image ID from the file name (strip the extension), to match to the point dataset
df['image_id'] = df['filename'].apply(lambda f: os.path.splitext(f)[0])

# Open the interim point data
gdf = gpd.read_file("./data/interim/sample10_images.gpkg") # replace with path to interim gpkg

# Join the GVI score to the interim point data using the `image_id` attribute
gdf = gdf.merge(df, how='left', on='image_id')

# Export as GPKG
gdf.to_file("./data/processed/sample10_images.gpkg", layer="gvi_scores")
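Since only a sample of images was downloaded, it may be worth checking how many points ended up without a score after the left join; pandas' merge with indicator=True makes that straightforward. The frames below are toy data for illustration:

```python
import pandas as pd

# Toy stand-ins for the point dataset and the GVI scores (hypothetical data)
points = pd.DataFrame({"image_id": ["a", "b", "c"]})
scores = pd.DataFrame({"image_id": ["a", "c"], "gvi_score": [17.2, 33.1]})

# indicator=True adds a _merge column marking rows that found no match
merged = points.merge(scores, how="left", on="image_id", indicator=True)
unmatched = int((merged["_merge"] == "left_only").sum())
print(unmatched)  # 1 -> point "b" has no GVI score
```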
