to_dataframe: no valid index for a 0-dimensional object #4228

ghislainp · 2020-07-15T15:58:43Z

What happened:
xr.DataArray([1], coords=[('onecoord', [2])]).sel(onecoord=2).to_dataframe(name='name') raises ValueError: no valid index for a 0-dimensional object

What you expected to happen:

the same behavior as: xr.DataArray([1], coords=[('onecoord', [2])]).to_dataframe(name='name')

Anything else we need to know?:

I see that the array no longer has any "dims" after the selection, and this is what causes the error. But it still has one entry in "coords", which is confusing. Is there any documentation about this difference?

Environment:

INSTALLED VERSIONS
------------------
commit: None
python: 3.7.6 | packaged by conda-forge | (default, Jun 1 2020, 18:57:50) [GCC 7.5.0]
python-bits: 64
OS: Linux
OS-release: 4.19.0-9-amd64
machine: x86_64
processor:
byteorder: little
LC_ALL: None
LANG: en_US.UTF-8
LOCALE: en_US.UTF-8
libhdf5: 1.10.5
libnetcdf: 4.7.4

xarray: 0.15.1
pandas: 1.0.4
numpy: 1.18.5
scipy: 1.4.1
netCDF4: 1.5.3
pydap: None
h5netcdf: None
h5py: 2.10.0
Nio: None
zarr: 2.4.0
cftime: 1.1.3
nc_time_axis: None
PseudoNetCDF: None
rasterio: None
cfgrib: None
iris: None
bottleneck: 1.3.2
dask: 2.18.1
distributed: 2.18.0
matplotlib: 3.2.1
cartopy: None
seaborn: 0.10.1
numbagg: None
setuptools: 47.3.1.post20200616
pip: 20.1.1
conda: 4.8.3
pytest: 5.4.3
IPython: 7.15.0
sphinx: 3.1.1

dcherian · 2020-07-15T17:02:14Z

You need

xr.DataArray([1], coords=[('onecoord', [2])]).sel(onecoord=[2]).to_dataframe(name='name')

The difference is that using onecoord=2 gives a scalar

>>> xr.DataArray([1], coords=[('onecoord', [2])]).sel(onecoord=2)
<xarray.DataArray ()>
array(1)
Coordinates:
    onecoord  int64 2

while using onecoord=[2] gives a 1-element vector

>>> xr.DataArray([1], coords=[('onecoord', [2])]).sel(onecoord=[2])
<xarray.DataArray (onecoord: 1)>
array([1])
Coordinates:
  * onecoord  (onecoord) int64 2

And to_dataframe cannot handle scalars.

I am not sure that there is a sensible way to convert a scalar DataArray to a DataFrame but we should throw a more informative error in any case.
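The two cases above can be sketched side by side. This is a minimal illustration, not an official recommendation: it assumes that expand_dims promotes the leftover scalar coordinate back to a length-1 dimension, which is one possible way to recover a 1-D array when the scalar selection has already happened.

```python
import xarray as xr

da = xr.DataArray([1], coords=[("onecoord", [2])])

# Selecting with a scalar label drops the dimension: the result is 0-dimensional
# and "onecoord" survives only as a scalar (non-index) coordinate.
scalar = da.sel(onecoord=2)

# Selecting with a list keeps a length-1 dimension, so to_dataframe has an index.
df_list = da.sel(onecoord=[2]).to_dataframe(name="name")

# Assumed workaround: turn the scalar coordinate back into a length-1 dimension.
df_expanded = scalar.expand_dims("onecoord").to_dataframe(name="name")
```

Both dataframes hold the single value 1, indexed by onecoord.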

ghislainp · 2020-07-15T17:54:57Z

Thanks for the very clear response. The behavior makes sense.

In fact, I should have explained what I'm trying to achieve, as this is a kind of "take". I have a dict like this:

{'label1' : dict(coord1=1, coord2=4), 
'label2' : dict(coord1=5, coord2=6),
'label3' : dict(coord1=4, coord2=2),
}

and I want to build an xarray object (and then a dataframe) in which coord1 and coord2 are replaced by a new dimension with values 'label1', 'label2', 'label3'.

I've done that by iterating over the dict, selecting with sel using the dict values, converting each result to a dataframe, and then concatenating the dataframes: pd.concat([x.sel(**d[k]).to_dataframe() for k in d])

A better option would be to do this "sel" or "take" with xarray only.
Do you have an idea how to do it with existing xarray methods?

dcherian · 2020-07-15T18:32:36Z

You could do it with "advanced indexing" by providing a dataarray to the .sel or .isel methods: https://xarray.pydata.org/en/stable/indexing.html#more-advanced-indexing

import xarray as xr

da = xr.DataArray(
    [[1, 2, 3], [4, 5, 6]],
    dims=["coord1", "coord2"],
    coords={"coord2": [10, 20, 30], "coord1": [1, 2]},
)

# Positional indexers that share a new dimension "z"; its labels become the index.
i1 = xr.DataArray([1, 0], dims=["z"], coords={"z": ["label1", "label2"]})
i2 = xr.DataArray([2, 1], dims=["z"], coords={"z": ["label1", "label2"]})

da.isel(coord1=i1, coord2=i2, drop=True).to_dataframe(name="asd")

        asd
z          
label1    6
label2    2
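The same idea can be sketched with label-based .sel, building the indexers directly from a dict of the kind described in the question. This is only an illustration: the dict named points and its values are hypothetical, chosen to match the small array above, and drop=True is assumed to discard the indexed coord1/coord2 coordinates as it does in the isel example.

```python
import xarray as xr

da = xr.DataArray(
    [[1, 2, 3], [4, 5, 6]],
    dims=["coord1", "coord2"],
    coords={"coord1": [1, 2], "coord2": [10, 20, 30]},
)

# Hypothetical lookup table: one set of coordinate labels per output label.
points = {
    "label1": dict(coord1=2, coord2=30),
    "label2": dict(coord1=1, coord2=20),
}
labels = list(points)

# One indexer per dimension, all sharing the new dimension "z".
indexers = {
    dim: xr.DataArray(
        [points[k][dim] for k in labels], dims=["z"], coords={"z": labels}
    )
    for dim in da.dims
}

# Vectorized (pointwise) selection: one value per label, not an outer product.
df = da.sel(indexers=indexers, drop=True).to_dataframe(name="asd")
```

Because every indexer shares the dimension z, the selection is pointwise and the resulting dataframe is indexed by the labels, with no Python loop or pd.concat needed.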

dcherian added error reporting good first issue labels Jul 15, 2020

dcherian changed the title ~~no valid index for a 0-dimensional object~~ to_dataframe: no valid index for a 0-dimensional object Jul 15, 2020
