Compute area-weighted statistics by vincentsarago · Pull Request #640 · cogeotiff/rio-tiler

vincentsarago · 2023-09-20T22:03:23Z

This PR adds a coverage option to the get_array_statistics function to enable area-weighted statistics. It does not take care of creating the coverage array, which should be done by the client application (maybe with some helper in rio-tiler).
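To make the idea concrete, here is a minimal NumPy sketch of a coverage-weighted mean and standard deviation. It is not the code from this PR: the helper name `coverage_weighted_stats` and the sample arrays are assumptions made for illustration, and the only library call relied on is `numpy.ma.average` with its `weights` argument.

```python
import numpy as np


def coverage_weighted_stats(values, coverage):
    """Return (mean, std) of `values`, weighting each pixel by its coverage fraction.

    Hypothetical helper for illustration only, not part of rio-tiler.
    """
    values = np.ma.asarray(values, dtype="float64")
    coverage = np.asarray(coverage, dtype="float64")

    # numpy.ma.average ignores masked pixels and normalizes by the sum of weights
    mean = np.ma.average(values, weights=coverage)
    variance = np.ma.average((values - mean) ** 2, weights=coverage)
    return float(mean), float(np.sqrt(variance))


# 2x2 array of pixel values and the fraction of each pixel covered by a shape
data = np.ma.array([[1, 2], [3, 4]])
coverage = np.array([[0.5, 0.0], [1.0, 0.25]])

mean, std = coverage_weighted_stats(data, coverage)
```

A pixel with a coverage of 0 contributes nothing to either statistic, while a fully covered pixel counts with full weight.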

cc @kylebarron @j08lue

To Do

more tests
docs
validate the logic (why some values are defined using weight or not)
add coverage utility functions

vincentsarago · 2023-09-20T22:07:31Z

-                "minority": float(keys[counts.tolist().index(counts.min())].tolist())
-                if valid_pixels
-                else numpy.nan,
+                "mean": float(array.mean()),


we don't use the weighted array for the min and max (like https://github.com/isciences/exactextract#supported-statistics)

vincentsarago · 2023-09-20T22:08:48Z

+                "majority": majority,
+                "minority": minority,
                "unique": float(counts.size),
                **dict(zip(percentiles_names, percentiles_values)),


🤔 maybe the percentiles should use the weighted array?
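If percentiles were to use the coverage, one possible definition is the inverted weighted CDF: sort the values, accumulate the coverage, and pick the first value whose cumulative share reaches the requested percentile. The sketch below is only one of several possible definitions, is not what this PR implements, and `weighted_percentile` is a hypothetical name.

```python
import numpy as np


def weighted_percentile(values, weights, q):
    """Smallest value whose cumulative weight share is >= q percent.

    Hypothetical helper (inverted-CDF definition, no interpolation).
    """
    values = np.asarray(values, dtype="float64").ravel()
    weights = np.asarray(weights, dtype="float64").ravel()

    # pixels with zero coverage cannot be selected
    keep = weights > 0
    values, weights = values[keep], weights[keep]

    order = np.argsort(values)
    values, weights = values[order], weights[order]

    # cumulative share of the total weight at or below each sorted value
    cumulative = np.cumsum(weights) / weights.sum()
    index = np.searchsorted(cumulative, q / 100.0, side="left")

    # clamp to guard against floating-point round-off at q == 100
    return float(values[min(index, len(values) - 1)])


median = weighted_percentile([1, 2, 3, 4], [0.5, 0.0, 1.0, 0.25], 50)
```

Because this definition does not interpolate, its result with uniform weights can differ from the default linear interpolation of `numpy.percentile`.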

kylebarron · 2023-09-20T22:09:04Z

+    # 3, 4
+    data = np.ma.array((1, 2, 3, 4)).reshape((1, 2, 2))
+
+    # Coverage Array


Maybe we should call this the weight array instead of coverage? coverage doesn't seem to make the most sense to me here.

In https://github.com/isciences/exactextract, weights are really weights, while the coverage is called cell coverage fractions. I didn't want to confuse people 🤷

I see, coverage came from exactextract because their weights are usually derived from the coverage of each polygon in the cell. I think weight is more general than coverage though. There could be use cases for weighted zonal stats that aren't partial-coverage

coverage_weights ?

Would we ever have both coverage and weights?

It looks like exactextract does support both coverage and weight in effect, because it allows weight as a parameter and it computes coverage itself

Would we ever have both coverage and weights?

Not planned right now but I don't want to be blocked in the future. I agree that coverage is not a perfect name but I don't want to use weights because it's not weights but spatial fraction. I'm open to changing to a better name if you have ideas :D
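For reference, when both a coverage fraction and a separate weight are available, the exactextract documentation defines the weighted mean as the sum of value × coverage × weight divided by the sum of coverage × weight. A minimal sketch of that formula follows; the function name and the `population` weighting array are illustrative assumptions and nothing like this is part of the PR.

```python
import numpy as np


def coverage_and_weight_mean(values, coverage, weights):
    """Mean of `values` weighted by coverage fraction times a second weight.

    Hypothetical helper showing how the two concepts could be combined.
    """
    values = np.asarray(values, dtype="float64")
    # effective per-pixel weight: spatial fraction times the external weight
    effective = np.asarray(coverage, dtype="float64") * np.asarray(
        weights, dtype="float64"
    )
    return float((values * effective).sum() / effective.sum())


values = np.array([[1.0, 2.0], [3.0, 4.0]])
coverage = np.array([[0.5, 0.0], [1.0, 0.25]])  # spatial fraction per pixel
population = np.array([[10.0, 10.0], [20.0, 40.0]])  # made-up weighting raster

result = coverage_and_weight_mean(values, coverage, population)
```

Keeping the two inputs as separate arguments is what would leave room for supporting both later, which is the concern raised above.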

vincentsarago · 2023-09-21T19:50:40Z

+            (self.height, cover_scale, self.width, cover_scale)
+        ).astype("float32")
+
+        return cover_array.sum(-1).sum(1) / (cover_scale**2)


slightly modified version of perrygeo/python-rasterstats#136

@sgoodm I know ☝️ is a 7-year-old PR (😅) but I hope you don't mind that I reused some of the code here 🙏

Not at all, @jacobwhall and I have been pushing use of COGs in our work at @aiddata and are fans of these projects, so happy to share some code to support 👍 👍
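The reshape-and-sum step in the diff above can be illustrated with plain NumPy. The sketch assumes a binary mask has already been rasterized at `cover_scale` times the resolution of the target array (that rasterization step is not shown), and the function name `coverage_from_fine_mask` is hypothetical.

```python
import numpy as np


def coverage_from_fine_mask(fine_mask, cover_scale):
    """Aggregate a supersampled binary mask into per-pixel coverage fractions.

    Hypothetical helper; `fine_mask` has shape
    (height * cover_scale, width * cover_scale).
    """
    fine_mask = np.asarray(fine_mask)
    rows, cols = fine_mask.shape
    if rows % cover_scale or cols % cover_scale:
        raise ValueError("mask shape must be a multiple of cover_scale")

    height, width = rows // cover_scale, cols // cover_scale

    # split each axis into (coarse index, sub-pixel index)
    blocks = fine_mask.reshape((height, cover_scale, width, cover_scale)).astype(
        "float32"
    )

    # count covered sub-pixels per coarse pixel, then normalize to 0..1
    return blocks.sum(-1).sum(1) / (cover_scale**2)


# 4x4 fine mask for a 2x2 target array: the shape covers the three left columns
fine = np.zeros((4, 4), dtype="uint8")
fine[:, :3] = 1

coverage = coverage_from_fine_mask(fine, cover_scale=2)
```

Here the left coarse pixels are fully covered (1.0) and the right ones are half covered (0.5). A larger `cover_scale` gives finer fractions at the cost of a bigger intermediate mask.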

vincentsarago · 2023-09-22T05:39:03Z

Note: we don't have a BaseReader method that returns statistics for a GeoJSON Feature, but this is what it could look like in the client (e.g. TiTiler):

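from rio_tiler.constants import WGS84_CRS
from rio_tiler.io import Reader

# `path` (dataset URL) and `shape` (GeoJSON Feature) are provided by the caller
with Reader(path) as src:
    data = src.feature(
        shape,
        shape_crs=WGS84_CRS,
    )

    # fraction of each pixel covered by the shape
    coverage_array = data.get_coverage_array(
        shape, shape_crs=WGS84_CRS
    )

    stats = data.statistics(coverage=coverage_array)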

add fractional coverage statistics

8ea1f6a

Comment thread rio_tiler/utils.py

kylebarron approved these changes Sep 20, 2023

add helper method to create coverage array

5f4eb8d

update type hint

3775316

vincentsarago requested a review from kylebarron September 25, 2023 08:50

vincentsarago marked this pull request as ready for review September 25, 2023 19:16

kylebarron approved these changes Sep 27, 2023

update docs

22c80c6

vincentsarago merged commit 7bed9de into main Sep 27, 2023

vincentsarago deleted the CoverageStatistics branch September 27, 2023 18:33

j08lue mentioned this pull request Oct 5, 2023

Compute area-weighted averages NASA-IMPACT/veda-backend#226

Closed

2 tasks
