Add docs for accessing cogs with terra #125

moradology · 2025-01-13T18:08:11Z

This PR adds some documentation about accessing data within cogs in R with terra without loading the whole file. Some (as yet unclear) combination of terra and the r kernel's behaviors prevented the kind of byte-range logging in #124, so this short guide sticks to demonstrating the API and describing its cloud-optimized behaviors

review-notebook-app · 2025-01-13T18:08:17Z

Check out this pull request on

See visual diffs & provide feedback on Jupyter Notebooks.

Powered by ReviewNB

review-notebook-app · 2025-01-14T00:59:04Z

View / edit / reply to this conversation on ReviewNB

wildintellect commented on 2025-01-14T00:59:03Z
----------------------------------------------------------------

Be clear the NO SIGN is particular to this dataset that's in a public bucket.

Should we also add

terra::setGDALconfig("GDAL_DISABLE_READDIR_ON_OPEN", "EMPTY_DIR") which is a speed up to keep GDAL from looking for things like TFW and other external files?

review-notebook-app · 2025-01-14T00:59:04Z

View / edit / reply to this conversation on ReviewNB

wildintellect commented on 2025-01-14T00:59:04Z
----------------------------------------------------------------

this is an interesting approach, nice because it operates with only 1 file, but a little weird at the same time in terms of arbitrarily calculating some neighboring extents based on the image.

I wonder if it's better to have a known bbox and two images to pull partials. I don't really want to complicate it by adding STAC (which we have some example to borrow from)

Maybe this would make sense with a plot of the full extent that has a box from the crop extents to be used.

moradology · 2025-01-14T03:55:42Z

I agree the 1/100th square is pretty arbitrary. Perhaps it would make sense to build out and draw geojson for the extents to grab and the geotiff tiles to motivate the role of tiling in reducing necessary requests?

wildintellect · 2025-01-14T16:11:25Z

I also realized that this example only shows AWS, we probably need a note that this method is valid for all "vsi" methods in gdal, and that the stars package can also be used. TBD link to the page about vectors which doesn't exist yet.

moradology · 2025-01-14T21:27:05Z

I included a little note about VSI and a reference to the docs that explain such filesystems. I've also replaced the 1/100th (arbitrary) read with a couple of reads that specifically articulate the impact of internal tile structure on how (and how many) requests get made

moradology · 2025-01-14T21:29:10Z

So far, there's no stars coverage but this is only because it appears stars requires a version of sf we don't have and which appears to not be installed (non zero exit) via either install.packages("stars") or, more specifically install.packages("stars") in the MAAP jupyter R kernel

review-notebook-app · 2025-01-15T16:17:27Z

View / edit / reply to this conversation on ReviewNB

zacdezgeo commented on 2025-01-15T16:17:26Z
----------------------------------------------------------------

maybe we can link the https://guide.cloudnativegeo.org/ in the introduction?

review-notebook-app · 2025-01-15T18:14:17Z

View / edit / reply to this conversation on ReviewNB

jsignell commented on 2025-01-15T18:14:17Z
----------------------------------------------------------------

I think in the very first sentence you should say what COGs are (a file format for storing raster data with a 2D aligned grid?)

The key benefits section feels like some double counting. I think you should remove "Subset Access" and "Cloud-Native Efficiency" those feel like more the mechanisms that enable "Cost Savings" and "Fast Workflows". Also don't COGs enable pyramiding? That feels like it could be mentioned somehow as well.

moradology commented on 2025-01-15T19:55:36Z
----------------------------------------------------------------

Good call. I even explicitly use overviews later on so I should probably acknowledge that

review-notebook-app · 2025-01-15T18:14:18Z

View / edit / reply to this conversation on ReviewNB

jsignell commented on 2025-01-15T18:14:18Z
----------------------------------------------------------------

This is kind of a meta comment, but I tend to glaze over inline comments in notebooks. Maybe just put these each in their own cell with markdown cells describing what is happening.

moradology commented on 2025-01-15T19:54:49Z
----------------------------------------------------------------

Wondered about that. They're so nice for getting the information right up next to the place where it has context but I think I do the same...

review-notebook-app · 2025-01-15T18:14:19Z

View / edit / reply to this conversation on ReviewNB

jsignell commented on 2025-01-15T18:14:18Z
----------------------------------------------------------------

The tile information is block size and layout right?

Statistics are block level right? Might be useful to make that explicit.

review-notebook-app · 2025-01-15T18:14:20Z

View / edit / reply to this conversation on ReviewNB

jsignell commented on 2025-01-15T18:14:19Z
----------------------------------------------------------------

what are these axes? I guess they are meters in EPSG 9807 or something, but good to make it clear

review-notebook-app · 2025-01-15T18:14:20Z

View / edit / reply to this conversation on ReviewNB

jsignell commented on 2025-01-15T18:14:20Z
----------------------------------------------------------------

Just a question I had in reading this: can you read the header for the COG without downloading the file?

Also: is there a way to prove that only one tile was downloaded?

wildintellect commented on 2025-01-15T21:52:42Z
----------------------------------------------------------------

Reading the header before download is effectively what rast does before crop
The only way I've ever been able to tell is external byte/network logging OR how fast it is. You can easily Time loading a crop vs loading the whole thing, and see just how much faster a partial read is.

moradology commented on 2025-01-22T16:17:51Z
----------------------------------------------------------------

In the python docs, I have some custom logging that demonstrates the byte-ranges being read. Unfortunately, not so easily done with terra's use of GDAL

review-notebook-app · 2025-01-15T18:14:21Z

View / edit / reply to this conversation on ReviewNB

jsignell commented on 2025-01-15T18:14:21Z
----------------------------------------------------------------

Line #6.    block_size_y <- block_size_pixels * r_res[2]

Do you think it would be clearer to just give a bounding box without showing how it is created? If we think about a real world scenario it doesn't matter that people know how to create the box. Actually maybe the whole code cell here should just be hidden since the visual serves a didactic purpose. Same with the other plot that shows gridlines.

moradology commented on 2025-01-15T20:06:31Z
----------------------------------------------------------------

I think I disagree on this point because it might be good to show how these APIs for pixel and geospatial coordinates work. I'll change things over if your take here is seconded though - I can totally see your point

wildintellect commented on 2025-01-16T00:10:50Z
----------------------------------------------------------------

the question is who is the target audience? I think the initial target is data users who almost always just need a bbox (or polygon) of data. I'm ok hiding some of that magic from them, maybe a collapsed cell block is the right answer.

moradology · 2025-01-15T19:54:50Z

Wondered about that. They're so nice for getting the information right up next to the place where it has context but I think I do the same...

View entire conversation on ReviewNB

moradology · 2025-01-15T19:55:37Z

Good call. I even explicitly use overviews later on so I should probably acknowledge that

View entire conversation on ReviewNB

moradology · 2025-01-15T20:06:32Z

I think I disagree on this point because it might be good to show how these APIs for pixel and geospatial coordinates work. I'll change things over if your take here is seconded though - I can totally see your point

View entire conversation on ReviewNB

wildintellect · 2025-01-15T21:52:43Z

Reading the header before download is effectively what rast does before crop
The only way I've ever been able to tell is external byte/network logging OR how fast it is. You can easily Time loading a crop vs loading the whole thing, and see just how much faster a partial read is.

View entire conversation on ReviewNB

wildintellect · 2025-01-16T00:10:51Z

the question is who is the target audience? I think the initial target is data users who almost always just need a bbox (or polygon) of data. I'm ok hiding some of that magic from them, maybe a collapsed cell block is the right answer.

View entire conversation on ReviewNB

moradology · 2025-01-22T16:17:53Z

In the python docs, I have some custom logging that demonstrates the byte-ranges being read. Unfortunately, not so easily done with terra's use of GDAL

View entire conversation on ReviewNB

wildintellect · 2025-01-24T17:27:27Z

@smk0033 and @hrodmn you probably want to check this one too

smk0033 · 2025-01-29T21:51:33Z

Additional thought: is it worth adding/possible to add a TL;DR section for this notebook similar to the Python version?

It might also be worth linking and mentioning the Python version in this notebook and vice versa. Other than that I don't think I have any additional comments - I really like the detail and explanations in both notebooks!

review-notebook-app · 2025-02-25T23:09:21Z

View / edit / reply to this conversation on ReviewNB

wildintellect commented on 2025-02-25T23:09:21Z
----------------------------------------------------------------

Link to STARS documentation https://r-spatial.github.io/stars/

cloud-optimized-geotiffs/accessing-cogs-in-r-terra.ipynb

wildintellect · 2025-04-23T23:34:48Z

@moradology it's looking pretty good. Can you close the resolved comments so we know what remains?

wildintellect · 2025-04-23T23:35:52Z

Minor note to check what the Navigation text will be for this tutorials with a local render or wait until we open the PR to main

smk0033 · 2025-06-02T14:08:13Z

@moradology it's looking pretty good. Can you close the resolved comments so we know what remains?

Hey @moradology! I know you've been busy with Black Marble, have you had a chance to close out resolved comments for this PR yet?

moradology · 2025-06-02T23:37:17Z

Nope, this slipped from my agenda. I'll aim to get this taken care of before standup this week

moradology · 2025-06-04T18:32:29Z

OK, that was much easier than I'd expected. Was afraid that there were new tasks I'd missed but it appears all comments had been resolved already 😅

smk0033 · 2025-06-04T19:11:52Z

Great, thanks! Since Zac is out, adding Henry as an additional reviewer

smk0033 · 2025-06-09T16:36:30Z

Thanks @jsignell! Would this be good to merge now? @wildintellect

wildintellect

We'll need to update the Branch, and then plan to run this PR after merging some others to main first. cc: @jsignell

netlify · 2025-06-09T20:52:48Z

✅ Deploy Preview for harmonious-cajeta-5542ab ready!

Name	Link
🔨 Latest commit	`b20b9ec`
🔍 Latest deploy log	https://app.netlify.com/projects/harmonious-cajeta-5542ab/deploys/6847498d299a970008e67584
😎 Deploy Preview	https://deploy-preview-125--harmonious-cajeta-5542ab.netlify.app
📱 Preview on mobile	Toggle QR Code... Use your smartphone camera to open QR code link.

To edit notification comments on pull requests, go to your Netlify project configuration.

* Add docs for accessing cogs with terra * Update docs to better illustrate internal tiling * Add more information on gdal configs * Add dependencies for stars and implement stars guide for R

wildintellect requested review from wildintellect and zacdezgeo January 13, 2025 23:32

wildintellect reviewed Feb 25, 2025

View reviewed changes

smk0033 requested a review from wildintellect April 15, 2025 20:20

smk0033 requested a review from hrodmn June 4, 2025 19:08

jsignell force-pushed the staging branch from 8184234 to b0924a1 Compare June 5, 2025 17:30

wildintellect approved these changes Jun 9, 2025

View reviewed changes

moradology added 7 commits June 9, 2025 22:52

Add docs for accessing cogs with terra

ff7186b

Update docs to better illustrate internal tiling

8b163d9

Add more information on gdal configs

bc2ea4f

Address review comments

610df0b

Add dependencies for stars and implement stars guide for R

63ff059

Update quarto yml

d96619d

Address review comments

b20b9ec

jsignell force-pushed the feature/r-cog-access branch from 6f3536a to b20b9ec Compare June 9, 2025 20:52

jsignell merged commit 7e253a0 into cloudnativegeo:staging Jun 9, 2025
6 of 8 checks passed

Add docs for accessing cogs with terra #125

Add docs for accessing cogs with terra #125

Uh oh!

Conversation

moradology commented Jan 13, 2025

Uh oh!

review-notebook-app bot commented Jan 13, 2025

Uh oh!

review-notebook-app bot commented Jan 14, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

review-notebook-app bot commented Jan 14, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

moradology commented Jan 14, 2025

Uh oh!

wildintellect commented Jan 14, 2025

Uh oh!

moradology commented Jan 14, 2025

Uh oh!

moradology commented Jan 14, 2025

Uh oh!

review-notebook-app bot commented Jan 15, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

review-notebook-app bot commented Jan 15, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

review-notebook-app bot commented Jan 15, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

review-notebook-app bot commented Jan 15, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

review-notebook-app bot commented Jan 15, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

review-notebook-app bot commented Jan 15, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

review-notebook-app bot commented Jan 15, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

moradology commented Jan 15, 2025

Uh oh!

moradology commented Jan 15, 2025

Uh oh!

moradology commented Jan 15, 2025

Uh oh!

wildintellect commented Jan 15, 2025

Uh oh!

wildintellect commented Jan 16, 2025

Uh oh!

moradology commented Jan 22, 2025

Uh oh!

wildintellect commented Jan 24, 2025

Uh oh!

smk0033 commented Jan 29, 2025

Uh oh!

review-notebook-app bot commented Feb 25, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

wildintellect commented Apr 23, 2025

Uh oh!

wildintellect commented Apr 23, 2025

Uh oh!

smk0033 commented Jun 2, 2025

Uh oh!

moradology commented Jun 2, 2025

Uh oh!

moradology commented Jun 4, 2025

Uh oh!

smk0033 commented Jun 4, 2025

Uh oh!

smk0033 commented Jun 9, 2025

review-notebook-app bot commented Jan 14, 2025 •

edited

Loading

review-notebook-app bot commented Jan 14, 2025 •

edited

Loading

review-notebook-app bot commented Jan 15, 2025 •

edited

Loading

review-notebook-app bot commented Jan 15, 2025 •

edited

Loading

review-notebook-app bot commented Jan 15, 2025 •

edited

Loading

review-notebook-app bot commented Jan 15, 2025 •

edited

Loading

review-notebook-app bot commented Jan 15, 2025 •

edited

Loading

review-notebook-app bot commented Jan 15, 2025 •

edited

Loading

review-notebook-app bot commented Jan 15, 2025 •

edited

Loading

review-notebook-app bot commented Feb 25, 2025 •

edited

Loading

netlify bot commented Jun 9, 2025 •

edited

Loading