Custom aggregation functions (or: NA handling in aggregation functions) #97

hol430 · 2025-03-07T04:38:39Z

I have a field which was created by reading from multiple sources and then calling copyLayers(..., keep.all.to = TRUE). My sources cover (somewhat) different time periods, so the end result is that on some dates, I have NA values for one field. I then call aggregateYears() (though I suppose this applies to other aggregation functions as well), which returns NA values for days which contain at least one missing value.

What would be nice would be if aggregateYears() and friends accepted an na.rm argument or similar, though I'm not sure if that argument is supported by all of the currently-implemented aggregators. Otherwise, if we could pass in a custom aggregation function, that would solve the issue as well. I'm not sure how easy that would be to implement - if it's too hard, then having another aggregation method called something like "mean_na_rm" would be easy to implement and would solve the problem.

I have a workaround, so it's not really urgent or anything, but I think this use case (comparing the seasonality of two not-quite-temporally-overlapping layers) is not totally far-fetched and it would be nice if the package was able to handle this for us.
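The behaviour described above can be reproduced without DGVMTools. The following is a minimal base R sketch, not the package's internals: the data frame and its column names (`Year`, `Day`, `layer_a`) are made up for illustration, and `tapply()` stands in for whatever the aggregation does internally.

```r
# Toy stand-in for one layer of a Field (hypothetical column names).
# The layer only has data for 2000-2002, so 2003 is NA on every day.
dt <- expand.grid(Year = 2000:2003, Day = 1:3)
dt$layer_a <- ifelse(dt$Year <= 2002, dt$Day * 1.0, NA)

# Mean over years for each day: a single NA poisons the whole group.
strict <- tapply(dt$layer_a, dt$Day, mean)

# Same aggregation, but extra arguments are forwarded to mean(),
# so the missing year is dropped before averaging.
lenient <- tapply(dt$layer_a, dt$Day, mean, na.rm = TRUE)

print(strict)
print(lenient)
```

With the default `mean()`, every day comes back as `NA` because each day has one missing year; with `na.rm = TRUE`, the mean is taken over the years that are present.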

MagicForrest · 2025-03-07T07:46:58Z

Hi Drew. I see the problem. I think something can be done. Exactly which option will take a little thinking.

One thing to be aware of: DGVMTools wasn't really designed for combining objects with different time periods with copyLayers() like that. Rather, two sources with different time periods can be kept separate but plotted together by bundling them in a list. Or, if you want to compare them directly, compareLayers() is the tool for that. Exactly what processing do you need to do on the two datasets? Is there a way to do it without combining data from multiple Sources in one Field? Also, would it work to apply aggregateYears() before combining the layers?

But, yes, you are absolutely right. It would be good to have something that also worked for this use case. I think that adding na.rm would work and be a very decent fix, and totally consistent with the normal R conventions. So I'll probably look at that.
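One way the two ideas in this thread (an na.rm flag and a user-supplied aggregation function) could fit together is sketched below. This is only an illustration in base R: `aggregate_by_day()` is a hypothetical helper and not part of DGVMTools, and it says nothing about how aggregateYears() is actually implemented. It drops NAs before calling the aggregator, so it also works with functions that have no na.rm argument of their own.

```r
# Hypothetical helper (not a DGVMTools function): aggregate `values`
# within each group of `day` using an arbitrary function `method`.
aggregate_by_day <- function(values, day, method = mean, na.rm = FALSE) {
  vapply(split(values, day), function(x) {
    # Remove NAs here so `method` does not need its own na.rm argument.
    if (na.rm) x <- x[!is.na(x)]
    method(x)
  }, numeric(1))
}

values <- c(1, NA, 3, 4, 5, NA)
day    <- c(1, 1, 1, 2, 2, 2)

with_na    <- aggregate_by_day(values, day, mean)
without_na <- aggregate_by_day(values, day, mean, na.rm = TRUE)

# A custom aggregator (range width) that has no na.rm argument.
spread <- aggregate_by_day(values, day, function(x) max(x) - min(x), na.rm = TRUE)
```

Note that if a group consists entirely of NAs, the aggregator receives an empty vector, so `mean()` returns `NaN` and `max()` warns and returns `-Inf`; a real implementation would need to decide how to treat that case.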

hol430 added a commit to hie-dave/lpjg-output-analysis that referenced this issue Mar 17, 2025

Fixed subannual plots for fields with non-overlapping layers

7ad95e1

This is a common use case when plotting layers which cover different time periods. See: MagicForrest/DGVMTools#97
