I have a data frame in Python/PySpark with columns id, time, city, zip and so on. Now I have added a new column, name, to this data frame, and I need to arrange the columns so that the name column comes right after id.
I have done it like below:

change_cols = ['id', 'name']
cols = ([col for col in change_cols if col in df] +
        [col for col in df if col not in change_cols])
df = df[cols]
I am getting this error:

pyspark.sql.utils.AnalysisException: u"Reference 'id' is ambiguous, could be: id#609, id#1224.;"

Why is this error occurring, and how can I rectify it?
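The two references in the message (id#609 and id#1224) indicate that the DataFrame carries two columns named id, which typically happens after a join. A quick way to check for duplicate names before reordering; this snippet is only an illustrative sketch, not from the original post:

from collections import Counter

# List every column name that appears more than once in the schema
duplicate_names = [name for name, count in Counter(df.columns).items() if count > 1]
print(duplicate_names)  # e.g. ['id'] if two 'id' columns exist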
You can use select to change the order of the columns:

df.select("id","name","time","city")
If you're working with a large number of columns:
df.select(sorted(df.columns))
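If you want name to sit immediately after id without listing every column by hand, you can also build the column order programmatically and pass it to select. This is a minimal sketch, assuming df already contains both id and name and has no duplicate column names:

cols = [c for c in df.columns if c != 'name']   # remove 'name' from its current position
insert_at = cols.index('id') + 1                # slot directly after 'id'
new_order = cols[:insert_at] + ['name'] + cols[insert_at:]
df = df.select(new_order)                       # select() accepts a list of column names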