Create new column with data that has same column

I have a DataFrame similar to this. How do I add a new column with the names of the rows that have the same value in one of the columns? For example, I have this:

  name  building 
  a     blue
  b     white
  c     blue
  d     red
  e     blue
  f     red

How to get this?

  name  building  in_building_with
  a     blue      [c, e]
  b     white     []
  c     blue      [a, e]
  d     red       [f]
  e     blue      [a, c]
  f     red       [d]


asked Dec 07 '20 12:12

cvakodobro

2 Answers

This is the only approach I can think of (and probably the worst one):

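# map each building to a {row index: name} dictionary
r = df.groupby('building')['name'].agg(dict)
# for each row, collect the names stored under the other indices of its building
df['in_building_with'] = df.apply(lambda x: [r[x['building']][i] for i in (r[x['building']].keys() - [x.name])], axis=1)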

df:

  name building in_building_with
0    a     blue           [c, e]
1    b    white               []
2    c     blue           [a, e]
3    d      red              [f]
4    e     blue           [a, c]
5    f      red              [d]

Approach:

1. Make a dictionary for each building that maps the row indices where the building occurs to the names at those rows:

building
blue     {0: 'a', 2: 'c', 4: 'e'}
red              {3: 'd', 5: 'f'}
white                    {1: 'b'}
dtype: object

2. Subtract the index of the current row from the keys of its building's dictionary, since you only want the rows other than the current one. This leaves the indices of the rows that share the building:

r[x['building']].keys()-[x.name]

3. Get the values at those indices and make them into a list.
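The three steps can also be written out one at a time, which may be easier to follow than the one-line lambda. This is only a sketch: the helper name `others_in_building` is made up for illustration, the per-building dictionaries are built with a plain dict comprehension rather than `agg(dict)`, and the indices are sorted so the result comes out in row order.

```python
import pandas as pd

df = pd.DataFrame({
    "name": ["a", "b", "c", "d", "e", "f"],
    "building": ["blue", "white", "blue", "red", "blue", "red"],
})

# Step 1: one {row index: name} dictionary per building
r = {building: names.to_dict() for building, names in df.groupby("building")["name"]}

def others_in_building(idx, building):
    """Hypothetical helper: names of the other rows that share `building`."""
    members = r[building]
    # Step 2: remove the current row's index from the building's indices
    other_idx = sorted(members.keys() - {idx})
    # Step 3: look up the names stored at the remaining indices
    return [members[i] for i in other_idx]

df["in_building_with"] = [
    others_in_building(idx, building) for idx, building in df["building"].items()
]
print(df)
```

Building the column with a list comprehension instead of `df.apply(..., axis=1)` is a design choice made here, not part of the approach above; both produce one list per row.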


answered Sep 24 '22 04:09

Pygirl

If order is not important, you could do:

# create groups
groups = df.groupby('building').transform(dict.fromkeys).squeeze()

# remove value from each group
df['in_building_with'] = [list(group.keys() - (e,)) for e, group in zip(df['name'], groups)]

print(df)

Output

  name building in_building_with
0    a     blue           [e, c]
1    b    white               []
2    c     blue           [e, a]
3    d      red              [f]
4    e     blue           [a, c]
5    f      red              [d]
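If order does matter, one possible variant is to keep a list of names per building and filter out the current name. This is a sketch under the assumption that the values in `name` are unique (a repeated name would be dropped from its own group as well); the variable name `names_by_building` is made up for illustration.

```python
import pandas as pd

df = pd.DataFrame({
    "name": ["a", "b", "c", "d", "e", "f"],
    "building": ["blue", "white", "blue", "red", "blue", "red"],
})

# names per building, kept in their original row order
names_by_building = df.groupby("building")["name"].agg(list)

# for each row, keep every name in the same building except the row's own
# (assumes names are unique)
df["in_building_with"] = [
    [other for other in names_by_building[building] if other != name]
    for name, building in zip(df["name"], df["building"])
]
print(df)
```

Because lists are used instead of sets, each result follows the order in which the names appear in the frame.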

answered Sep 25 '22 04:09

Dani Mesejo

