
Unable to install PySpark

Tags:

python

pyspark

I am trying to install PySpark like this:

python setup.py install

I get this error:

Could not import pypandoc - required to package PySpark

pypandoc is already installed.

Any ideas how I can install PySpark?

user1471980 asked Jul 24 '18 13:07



People also ask

How do I know if PySpark is installed?

To test whether your installation was successful, open Command Prompt, change to the SPARK_HOME directory and type bin\pyspark. This should start the PySpark shell, which can be used to work with Spark interactively. The last message in the startup output gives a hint on how to work with Spark in the PySpark shell using the sc or sqlContext names.
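If PySpark was installed with pip rather than from a Spark download, a quick check from Python itself may be easier than starting the shell. The sketch below is only an illustration: the helper names is_installed and installed_version are hypothetical, and it assumes Python 3.8 or newer so that importlib.metadata is available.

```python
import importlib.util
from importlib import metadata


def is_installed(module_name):
    """Return True if the current interpreter can locate module_name."""
    return importlib.util.find_spec(module_name) is not None


def installed_version(dist_name):
    """Return the installed distribution's version string, or None if absent."""
    try:
        return metadata.version(dist_name)
    except metadata.PackageNotFoundError:
        return None


if __name__ == "__main__":
    # Reports on the interpreter running this script, not on any other Python.
    print("pyspark importable:", is_installed("pyspark"))
    print("pyspark version:", installed_version("pyspark"))
```

Run it with the same python executable you intend to use for Spark jobs, since each interpreter has its own set of installed packages.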


2 Answers

I faced the same issue and solved it by installing pypandoc before installing pyspark:

pip install pypandoc
pip install pyspark
viraj ghorpade answered Sep 18 '22 15:09



Try installing pypandoc for Python 3 by running pip3 install pypandoc.
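One possible reason for "pypandoc is installed already" combined with an import failure is that pip installed it for a different interpreter than the one running setup.py. That is an assumption about the asker's setup, not something the question confirms. A minimal sketch for checking it follows; the helper names report and pip_install_command are hypothetical.

```python
import importlib.util
import sys


def report(module_name):
    """Describe whether module_name is visible to the running interpreter."""
    found = importlib.util.find_spec(module_name) is not None
    status = "found" if found else "missing"
    return f"{module_name}: {status} for {sys.executable}"


def pip_install_command(package):
    """Build a pip command bound to the interpreter running this script."""
    return [sys.executable, "-m", "pip", "install", package]


if __name__ == "__main__":
    print(report("pypandoc"))
    # Print the command instead of running it, so nothing is installed by surprise.
    print(" ".join(pip_install_command("pypandoc")))
```

Invoking pip as "python -m pip" ties the installation to that specific interpreter, which avoids guessing whether pip or pip3 points at the Python you are actually using.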

kubote answered Sep 19 '22 15:09
