I am using the Python API of Spark version 1.4.1.
My row object looks like this:
row_info = Row(name="Tim", age=5, is_subscribed=False)
How can I get a list of the object's attribute names as a result?
Something like: ["name", "age", "is_subscribed"]
You can find all column names and data types (DataType) of a PySpark DataFrame by using df.dtypes and df.schema, and you can also retrieve the data type of a specific column with df.schema["name"].
The col() function from the pyspark.sql.functions module can be used to refer to particular columns.
A PySpark Row is a class that represents a single record of a DataFrame. Row objects can be created with keyword arguments; since Row extends tuple, variable arguments are accepted when constructing one, and the data can be retrieved from the resulting Row object.
If you don't care about the order, you can simply extract the names from a dict:
list(row_info.asDict())
Otherwise the only option I am aware of is using __fields__ directly:
row_info.__fields__