How to join data from multiple netCDF files with xarray in Python?

Tags:

I'm trying to open multiple netCDF files with xarray in Python. The files have data with same shape and I want to join them, creating a new dimension.

I tried to use concat_dim argument for xarray.open_mfdataset(), but it doesn't work as expected. An example is given below, which open two files with temperature data for 124 times, 241 latitudes and 480 longitudes:

DS = xr.open_mfdataset( 'eraINTERIM_t2m_*.nc', concat_dim='cases' )
da_t2m = DS.t2m

print( da_t2m )

With this code, I expect that the result data array will have a shape like (cases: 2, time: 124, latitude: 241, longitude: 480). However, its shape was (cases: 2, time: 248, latitude: 241, longitude: 480). It creates a new dimension, but also sums the leftmost dimension: 'time' dimension of two datasets. I was wondering whether it's an error from 'xarray.open_mfdateset' or it's an expected behavior because 'time' dimension is UNLIMITED for both datasets.

Is there a way to join data from these files directly using xarray and get the above expected return?

Thank you.

Mateus

336

asked Apr 01 '19 14:04

Mateus da Silva Teixeira

Video Answer

1 Answers

Extending from my comment I would try this:

def preproc(ds):
    ds = ds.assign({'stime': (['time'], ds.time)}).drop('time').rename({'time': 'ntime'})
    # we might need to tweak this a bit further, depending on the actual data layout
    return ds

DS = xr.open_mfdataset( 'eraINTERIM_t2m_*.nc', concat_dim='cases', preprocess=preproc)

The good thing here is, that you keep the original time coordinate in stime while renaming the original dimension (time -> ntime).

If everything works well, you should get resulting dimensions as (cases, ntime, latitude, longitude).

Disclaimer: I do similar in a loop with a final concat (wich works very well), but did not test the preprocess-approach.

154

answered Oct 19 '22 05:10

kmuehlbauer

Related questions
                            
                                ValueError: The computed initial MA coefficients are not invertible You should induce invertibility
                            
                                How can I plot a heatmap on a sphere given a list of latitudes and longitudes?
                            
                                Configuring Visual Studio Code for remote Python interpreter via SSH
                            
                                The axis argument to unique is not supported for dtype object
                            
                                How to make a tkinter canvas background transparent?
                            
                                Is there a builtin way to define a function that takes either 1 argument or 3?
                            
                                multivariable linearization in python: 'Pow' object has no attribute 'sqrt'
                            
                                How to make Pycharm run all python unit tests recursively from tests folder
                            
                                I can't import tensorflow-gpu
                            
                                Find the current line number of a running python process
                            
                                Airflow : ExternalTaskSensor doesn't trigger the task
                            
                                Python sum list of dicts by key with nested dicts
                            
                                Efficiently aggregate a resampled collection of datetimes in pandas
                            
                                Loading hdf5 files into python xarrays
                            
                                How can i use tensorflow object detection to only detect persons?
                            
                                Why is cross_val_predict not appropriate for measuring the generalisation error?
                            
                                Does Buildout support value substitution in the extends option?
                            
                                Storing RTSP stream as video file with OpenCV VideoWriter
                            
                                How to configure Python to ignore the hostname verification?
                            
                                Run command from one container to another

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How to join data from multiple netCDF files with xarray in Python?

Tags:

python

concatenation

netcdf

python-xarray

Mateus da Silva Teixeira

People also ask

Video Answer

1 Answers

kmuehlbauer

Recent Activity

Donate For Us