Creating a new column in pandas by using a lambda function on two existing columns

I am able to add a new column in pandas by defining a user function and then using apply. However, I want to do this with a lambda; is there a way to do that?

For example, df has two columns, a and b. I want to create a new column c that holds, for each row, the length of the longer of the two strings in a and b.

Something like:

df['c'] = df.apply(lambda x, len(df['a']) if len(df['a']) > len(df['b']) or len(df['b']) )

One approach:

import pandas as pd

df = pd.DataFrame({'a': ['dfg', 'f', 'fff', 'fgrf', 'fghj'],
                   'b': ['sd', 'dfg', 'edr', 'df', 'fghjky']})

df['c'] = df.apply(lambda x: max([len(x) for x in [df['a'], df['b']]]))
print(df)

      a       b   c
0   dfg      sd NaN
1     f     dfg NaN
2   fff     edr NaN
3  fgrf      df NaN
4  fghj  fghjky NaN

asked Nov 12 '15 20:11

piyush sharma

1 Answer

You can compute the string lengths with map and choose between them with np.where:

import numpy as np

print(df)
#     a     b
#0  aaa  rrrr
#1   bb     k
#2  ccc     e

# if the condition is True, take the length of column a, else the length of column b
df['c'] = np.where(df['a'].map(len) > df['b'].map(len), df['a'].map(len), df['b'].map(len))
print(df)
#     a     b  c
#0  aaa  rrrr  4
#1   bb     k  2
#2  ccc     e  3
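As a self-contained sketch of the np.where approach, here it is applied to the sample data from the question. The names len_a and len_b are introduced only for illustration, so the lengths are computed once instead of twice:

```python
import numpy as np
import pandas as pd

# Sample data taken from the question.
df = pd.DataFrame({'a': ['dfg', 'f', 'fff', 'fgrf', 'fghj'],
                   'b': ['sd', 'dfg', 'edr', 'df', 'fghjky']})

# Element-wise string lengths of each column (illustrative helper names).
len_a = df['a'].map(len)
len_b = df['b'].map(len)

# Where a is longer, take its length; otherwise take the length of b.
df['c'] = np.where(len_a > len_b, len_a, len_b)
print(df)
```

With this data, column c should come out as 3, 3, 3, 4, 6.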

Another solution is to use apply with the parameter axis=1:

axis = 1 or ‘columns’: apply function to each row

df['c'] = df.apply(lambda x: max(len(x['a']), len(x['b'])), axis=1)
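A runnable sketch of the lambda version on the question's data may help. The parameter name row is just a readability choice here; it plays the same role as x in the line above:

```python
import pandas as pd

# Sample data taken from the question.
df = pd.DataFrame({'a': ['dfg', 'f', 'fff', 'fgrf', 'fghj'],
                   'b': ['sd', 'dfg', 'edr', 'df', 'fghjky']})

# With axis=1 the lambda receives one row at a time as a Series,
# so row['a'] and row['b'] are single strings and len() measures each string.
df['c'] = df.apply(lambda row: max(len(row['a']), len(row['b'])), axis=1)
print(df)
```

Without axis=1, apply passes whole columns to the lambda, and the resulting Series is indexed by column name rather than by row label. Assigning it to df['c'] then finds no matching index labels, which is why the attempt in the question fills c with NaN.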

answered Sep 22 '22 22:09

jezrael

Tags: python, pandas, lambda, multiple-columns, calculated-columns
