How to create a sample single-column Spark DataFrame in Python?

Tags:

I want to create a sample single-column DataFrame, but the following code is not working:

df = spark.createDataFrame(["10","11","13"], ("age"))  ## ValueError ## ... ## ValueError: Could not parse datatype: age

The expected result:

Click to copy

age 10 11 13

964

asked Dec 06 '17 12:12

Ajish Kb

2 Answers

the following code is not working

With single element you need a schema as type

Click to copy

spark.createDataFrame(["10","11","13"], "string").toDF("age")

or DataType:

Click to copy

from pyspark.sql.types import StringType  spark.createDataFrame(["10","11","13"], StringType()).toDF("age")

With name elements should be tuples and schema as sequence:

Click to copy

spark.createDataFrame([("10", ), ("11", ), ("13",  )], ["age"])

answered Sep 23 '22 19:09

Alper t. Turker

Well .. There is some pretty easy method for creating sample dataframe in PySpark

Click to copy

>>> df = sc.parallelize([[1,2,3], [2,3,4]]).toDF() >>> df.show() +---+---+---+ | _1| _2| _3| +---+---+---+ |  1|  2|  3| |  2|  3|  4| +---+---+---+

to create with some column names

Click to copy

>>> df1 = sc.parallelize([[1,2,3], [2,3,4]]).toDF(("a", "b", "c")) >>> df1.show() +---+---+---+ |  a|  b|  c| +---+---+---+ |  1|  2|  3| |  2|  3|  4| +---+---+---+

In this way, no need to define schema too.Hope this is the simplest way

answered Sep 23 '22 19:09

Sarath Chandra Vema

Related questions
                            
                                Printing on the same line on a jupyter notebook
                            
                                Python pandas: mean and sum groupby on different columns at the same time
                            
                                Django: Does unique_together imply db_index=True in the same way that ForeignKey does?
                            
                                Fit a gaussian function
                            
                                "SSL: certificate_verify_failed" error when scraping https://www.thenewboston.com/
                            
                                Remove non-business days rows from pandas dataframe
                            
                                Failing to import itertools in Python 3.5.2
                            
                                How to drop column according to NAN percentage for dataframe?
                            
                                Numpy import error Python3 on Raspberry Pi?
                            
                                SQLAlchemy Relationship Filter?
                            
                                matplotlib matshow labels
                            
                                Multiprocessing a function with several inputs
                            
                                Deriving a class from TestCase throws two errors
                            
                                Trying to parse JSON in Python. ValueError: Expecting property name [duplicate]
                            
                                flask-sqlalchemy - PostgreSQL - Define specific schema for table?
                            
                                Setting Different error bar colors in bar plot in matplotlib
                            
                                Specify which python version pylint should evaluate for
                            
                                How to 'update' or 'overwrite' a python list
                            
                                Get all combinations of elements from two lists?
                            
                                Linear Regression on Pandas DataFrame using Sklearn ( IndexError: tuple index out of range)

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How to create a sample single-column Spark DataFrame in Python?

Tags:

python

apache-spark

apache-spark-sql

pyspark

Ajish Kb

People also ask

2 Answers

Alper t. Turker

Sarath Chandra Vema

Recent Activity

Donate For Us