Pandas interpolate NaNs based on different column

Tags:

I have the following DataFrame (extract)

data = pd.DataFrame([[0., -10.88948939, 74.22099994, 1.5, "NW", 0], [0.819377018, -10.88948939, 74.22099994, 1.5, "NW", 1], [8.47965933, -10.88948939, 74.22099994, 1.5, "NW", 10], [15.38036833, -10.88948939, 74.22099994, 1.5, "NW", 20]], columns=["Velocity", "X", "Y", "Z", "wind_direction", "wind_speed"])

Velocity  X      Y     Z  wind_direction wind_speed
0        -10.88 74.22 1.5 NW             0
0.82     -10.89 74.22 1.5 NW             1
8.48     -10.89 74.22 1.5 NW             10
15.38    -10.89 74.22 1.5 NW             20

It represents the results of a CFD simulation for a specific coordinate (X, Y, Z) and two boundary conditions (wind_direction and wind_speed).

I would like to estimate Velocity for the same point (X, Y, Z), same wind_direction, but intermediate wind_speed, say 4.6. I have this additional row in my dataframe

NaN -10.89 74.22 1.5 NW 4.6

Now I would like to interpolate to fill the NaN based on the wind_speed. For the example above I would expect to get 6.643773541

The number comes from the linear interpolation:

0.82 + (4.6 - 1)/(10 - 1) * (8.48 - 0.82)

Any idea? Thanks

UPDATE

I have found a solution to the issue above. The trick is to use groupby and define a function that interpolates over the dataframe that is created by groupby and passed to apply(). In my case, this is the function

def interp(x, wind_speed):
    g = interpolate.interp1d(np.array(x["wind_speed"]), np.array(x["Velocity"]))
    return g(wind_speed)

and this is my groupby

group = df.groupby("point").apply(interp, wind_speed)

The function interp has to be called with a parameter that represents the point where to perform the interpolation.

I wonder whether there is a better way to do it.

725

asked Dec 01 '14 18:12

Rojj

1 Answers

def interp(x, wind_speed):
    g = interpolate.interp1d(np.array(x["wind_speed"]), np.array(x["Velocity"]))
    return g(wind_speed)

and this is my groupby

group = df.groupby("point").apply(interp, wind_speed)

The function interp has to be called with a parameter that represents the point where to perform the interpolation.

I wonder whether there is a better way to do it.

152

answered Dec 13 '22 05:12

Rojj

Related questions
                            
                                mutagen: how to detect and embed album art in mp3, flac and mp4
                            
                                Python splitext
                            
                                Sqlite load_extension fail for spatialite in Python
                            
                                Why do python exceptions typically not print offending values?
                            
                                segfault using numpy's lapack_lite with multiprocessing on osx, not linux
                            
                                Omit (or format) the value of a variable when documenting with Sphinx
                            
                                IOError: [Errno 22] Invalid argument when reading/writing large bytestring
                            
                                How to find leaks in Python ctypes libraries
                            
                                Parallel many dimensional optimization
                            
                                Add custom Django admin action
                            
                                How can I use Django Social Auth to connect with Twitter?
                            
                                Flask User Management : How to make Stateless Server using better authentication ways?
                            
                                How to speed up Levenshtein distance calculation
                            
                                Django - Distinguish different types of IntegrityError
                            
                                matplotlib exit after animation
                            
                                How to verify a .__getitem__() call in a Mock mock_calls list during unit testing
                            
                                Capture 192 kHz audio using Python 3
                            
                                How to close the browser after completing a download?
                            
                                How to make Sphinx Respect Importing Classes Into Package with __init__.py
                            
                                Are python 3.x venv environments relocatable?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Pandas interpolate NaNs based on different column

Tags:

python

pandas

interpolation

Rojj

People also ask

1 Answers

Rojj

Recent Activity

Donate For Us