Resample raster

Tags:

r

resolution

gis

raster

I am trying to resample a high-resolution (25 meters) forest cover raster with categorical data (1 to 13) to a new RasterLayer with a lower resolution (~1 km). My idea is to combine the forest cover data with other lower-resolution raster data:

I tried raster::resample(), but since the data is categorical I lost a lot of information:

```
summary(as.factor(df$loss_year_mosaic_30m))
  0       1   2   3  4   5   6   7  8   9   10  11   12  13
3777691  65  101 50 151 145 159 295 291 134 102 126 104  91 
```
As you can see, the new raster has the desired resolution but has lots of zeros as well. I suppose that is normal since I used the `ngb` option in resample.

The second strategy was using raster::aggregate(), but I find it difficult to define an integer factor since the change of resolution is not straightforward (it is not a simple doubling of the cell size or the like).

My high-resolution raster has the following resolution, and I want to aggregate it to a resolution of 0.008333333, 0.008333333 (x, y) over the same extent.

```
loss_year
class       : RasterLayer 
dimensions  : 70503, 59566, 4199581698  (nrow, ncol, ncell)
resolution  : 0.00025, 0.00025  (x, y)
extent      : -81.73875, -66.84725, -4.2285, 13.39725  (xmin, xmax, ymin, ymax)
coord. ref. : +proj=longlat +datum=WGS84 +no_defs +ellps=WGS84 +towgs84=0,0,0 
data source : /Volumes/LaCie/Deforestacion/Hansen/loss_year_mosaic_30m.tif 
names       : loss_year_mosaic_30m 
values      : 0, 13  (min, max)
```
I have tried a factor of ~33.33 following the description of the aggregate help: "The number of cells is the number of cells of x divided by fact*fact (when fact is a single number)." Nonetheless, the resulting raster data do not seem to have the same number of rows and columns as my other low-resolution rasters.

I have never used data at this resolution before, and I am also computationally limited (some of these commands can be parallelized using clusterR, but sometimes they take as long as the non-parallelized commands, especially since parallelization does not work for nearest neighbor calculations).

I am short of ideas; maybe I can try layerize to obtain a count raster, but then I still have to `aggregate` and the factor problem arises again. Since these operations are taking me days to run, I want to know the most efficient way to create a lower-resolution raster without losing much information.

A reproducible example could be the following:

```
r_hr <- raster(nrow=70, ncol=70) #High resolution raster with categorical data
set.seed(0)
r_hr[] <- round(runif(1:ncell(r_hr), 1, 5))
r_lr <- raster(nrow=6, ncol=6) #Low resolution raster
```

First strategy: loss of information

```
r <- resample(r_hr, r_lr, method = "ngb") #The raster data is categorical
```

Second strategy: difficult to define an aggregate factor

```
r <- aggregate(r_hr, factor) #How to define a factor to get exactly the same number of cells as r_lr?
```

Another option: layerize

```
r_brick <- layerize(r_hr)
aggregate(r_brick, factor) #How to define factor to coincide with the r_lr dimensions?
```

Thanks for your help!


asked Jun 21 '16 23:06

topcat

2 Answers

```
r_hr <- raster(nrow=70, ncol=70) #High resolution raster with categorical data
set.seed(0)
r_hr[] <- round(runif(1:ncell(r_hr), 1, 5))
r_lr <- raster(nrow=6, ncol=6)

r_hr
#class       : RasterLayer 
#dimensions  : 70, 70, 4900  (nrow, ncol, ncell)
#resolution  : 5.142857, 2.571429  (x, y)
#extent      : -180, 180, -90, 90  (xmin, xmax, ymin, ymax)
#coord. ref. : +proj=longlat +datum=WGS84 +ellps=WGS84 +towgs84=0,0,0 
#data source : in memory
#names       : layer 
#values      : 1, 5  (min, max)

r_lr
#class       : RasterLayer 
#dimensions  : 6, 6, 36  (nrow, ncol, ncell)
#resolution  : 60, 30  (x, y)
#extent      : -180, 180, -90, 90  (xmin, xmax, ymin, ymax)
#coord. ref. : +proj=longlat +datum=WGS84 +ellps=WGS84 +towgs84=0,0,0
```

Direct aggregate is not possible, because 70/6 is not an integer.

```
dim(r_hr)[1:2] / dim(r_lr)[1:2]
#[1] 11.66667 11.66667
```
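
To see what rounding the factor does to the grid, the arithmetic can be sketched in base R. This is only an illustration: `agg_dims` is a hypothetical helper, and it assumes that `aggregate` with its default `expand=TRUE` rounds the number of output cells up, as the raster documentation describes.

```r
# Hypothetical helper: expected rows/cols after aggregate(x, fact),
# assuming the default expand=TRUE (partial blocks at the edge are kept).
agg_dims <- function(n_hr, fact) ceiling(n_hr / fact)

n_hr <- c(70, 70)   # rows, cols of r_hr
n_lr <- c(6, 6)     # rows, cols of r_lr

ratio     <- n_hr / n_lr      # 11.66667, not an integer
fact_down <- floor(ratio)     # 11
fact_up   <- ceiling(ratio)   # 12

dims_down <- agg_dims(n_hr, fact_down)  # 7 x 7: one row/col too many
dims_up   <- agg_dims(n_hr, fact_up)    # 6 x 6, but spanning 72 input cells

# Input cells of padding needed beyond the original extent for fact = 12
overshoot <- dims_up * fact_up - n_hr   # 2 rows and 2 cols
```

With a factor of 12 the cell counts match, but the aggregated cells are slightly larger than those of `r_lr` and the grid extends past the original extent, which is why a final `resample` onto `r_lr` is still needed.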

Nearest neighbor resampling is not a good idea either as the results would be arbitrary.

Here is the layer-by-layer approach that you suggested and that dww has also shown.

```
b <- layerize(r_hr)
fact <- round(dim(r_hr)[1:2] / dim(r_lr)[1:2])
a <- aggregate(b, fact)
x <- resample(a, r_lr)
```

Now you have proportions. If you want a single class you could do

```
y <- which.max(x)
```
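
One caveat when moving to the real data: `which.max` returns the index of the winning layer, not the class value, and the two only coincide when the classes happen to be 1, 2, 3, and so on. A minimal sketch of mapping indices back to class values, assuming that `layerize` creates its layers in the order given by its `classes` argument (the 0-based toy raster `r0` and the names `cls` and `y_class` are illustrative, not from the question):

```r
library(raster)

# Toy raster whose classes start at 0, like the loss-year data
r0 <- raster(nrow = 70, ncol = 70)
set.seed(0)
r0[] <- round(runif(ncell(r0), 0, 4))
r_lr <- raster(nrow = 6, ncol = 6)

cls  <- sort(unique(values(r0)))          # class values present: 0..4
b    <- layerize(r0, classes = cls)       # one 0/1 layer per class
fact <- round(dim(r0)[1:2] / dim(r_lr)[1:2])
x    <- resample(aggregate(b, fact), r_lr) # class proportions on the r_lr grid

y       <- which.max(x)                   # layer index, starting at 1
y_class <- calc(y, function(i) cls[i])    # corresponding class value
```

For the toy data in this answer the classes are 1 to 5, so the index and the class value are the same and this step can be skipped.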

In that case, another approach would be to aggregate the classes

```
ag <- aggregate(r_hr, fact, modal) 
agx <- resample(ag, r_lr, method='ngb')
```

Note that agx and y are the same. But they can both be problematic as you might have cells with 5 classes with each about 20%, making it rather unreasonable to pick one winner.
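
One way to gauge how serious this is for a given data set is to look at the proportion held by the winning class in each cell. The following is a sketch rather than part of the approach above; the 0.5 cutoff and the names `p_max` and `y_clear` are arbitrary choices for illustration.

```r
library(raster)

r_hr <- raster(nrow = 70, ncol = 70)
set.seed(0)
r_hr[] <- round(runif(ncell(r_hr), 1, 5))
r_lr <- raster(nrow = 6, ncol = 6)

fact <- round(dim(r_hr)[1:2] / dim(r_lr)[1:2])
x <- resample(aggregate(layerize(r_hr), fact), r_lr)

y     <- which.max(x)            # index of the most frequent class
p_max <- calc(x, fun = max)      # proportion held by that class

# Keep the winner only where it covers at least half of the cell;
# everything else is set to NA as "no clear winner".
y_clear <- overlay(y, p_max,
                   fun = function(cl, p) ifelse(p >= 0.5, cl, NA_real_))
```

With uniformly random toy data like this, no class is expected to come close to half of a cell, so most or all of `y_clear` ends up `NA`, which is exactly the situation described above. On real land cover data the share of cells that survive the cutoff gives a rough idea of how much a single-class map hides.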

answered Sep 19 '22 10:09

Robert Hijmans

It is pretty standard practice to aggregate land cover maps into layers of %cover, i.e. you should aim to produce 13 layers, each holding something like the %cover of one class in each grid cell. Doing this allows you to reduce the resolution while retaining as much information as possible. N.B. if you require a different summary statistic than %, it should be easy to adapt the following method to whatever statistic you want by changing the fun = function in aggregate.

The following method is pretty fast (it takes just a few seconds on my laptop to process a raster with 100 million cells):

First, let's create some dummy rasters to use

```
Nhr <- 1e4 # number of rows/columns of high-res raster
Nlr <- 333 # number of rows/columns of low-res raster
r.hr <- raster(ncols=Nhr, nrows=Nhr)
r.lr <- raster(ncols=Nlr, nrows=Nlr)
r.hr[] <- sample(1:13, Nhr^2, replace=TRUE)
```

Now, we begin by aggregating the high-res raster to almost the same resolution as the low-res one (using an integer number of cells). Each resulting layer N contains, for every aggregated cell, the fraction of its area in which the original raster has the value N.

```
Nratio <- as.integer(Nhr/Nlr) # ratio of high to low resolutions, truncated to an integer for aggregation
layer1 <- aggregate(r.hr, Nratio, fun=function(x, na.rm=TRUE) {mean(x==1, na.rm=na.rm)})
layer2 <- aggregate(r.hr, Nratio, fun=function(x, na.rm=TRUE) {mean(x==2, na.rm=na.rm)})
```

And finally, resample the aggregated layers onto the grid of the low-res raster

```
layer1 <- resample(layer1, r.lr, method = "ngb") 
layer2 <- resample(layer2, r.lr, method = "ngb")
```

Repeat for each layer, and combine your layers into a stack or a multi-band raster.
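
The repetition over classes can be written as a loop. The following is a minimal sketch of that idea on a much smaller dummy raster so it runs quickly; the names `classes`, `layers` and `cover` and the layer names `class_1`, `class_2`, ... are illustrative choices.

```r
library(raster)

# Small dummy rasters so the sketch runs quickly
Nhr <- 120
Nlr <- 13
r.hr <- raster(ncols = Nhr, nrows = Nhr)
r.lr <- raster(ncols = Nlr, nrows = Nlr)
set.seed(1)
r.hr[] <- sample(1:13, Nhr^2, replace = TRUE)

Nratio  <- as.integer(Nhr / Nlr)   # truncates 120/13 to 9
classes <- 1:13

# One fraction-of-cover layer per class
layers <- lapply(classes, function(k) {
  aggregate(r.hr, Nratio,
            fun = function(x, na.rm = TRUE, ...) mean(x == k, na.rm = na.rm))
})

# Combine the layers and move them onto the low-res grid in one call
cover <- stack(layers)
names(cover) <- paste0("class_", classes)
cover <- resample(cover, r.lr, method = "ngb")
```

Because every layer is taken from the same aggregated cell by the nearest neighbor resampling, the fractions in each output cell should still sum to 1, which makes a convenient sanity check (for example with `calc(cover, sum)`).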

answered Sep 21 '22 10:09

dww
