I am trying to install the pyspark using pip install like below. But I got the following errors.
(python_virenv)edamame$ pip install pyspark
Collecting pyspark
Could not find a version that satisfies the requirement pyspark (from versions: )
No matching distribution found for pyspark
Does anyone have any idea? Thanks!
As of Spark 2.2, PySpark is now available in PyPI.
pip install pyspark
As of Spark 2.1, PySpark is pip installable but not yet from PyPI, which is under consideration for 2.2 in this ticket. To install PySpark you now just need download Spark 2.1+ and run setup.py:
cd spark-2.1/python/
pip install -e .
Big thanks to @Holden!
pyspark is not in PyPI so you could not directly use pip install to install it.
Instead you could download a proper version of Spark here: http://spark.apache.org/downloads.html, and you will get a compressed TAR file. Then unpack it and pyspark is in its python folder.
To open the Python version of the Spark shell, you could go into your Spark directory and type:
bin/pyspark
or
bin\pyspark
in Windows.
pyspark doesn't even exist in PyPI as you can see from https://pypi.python.org/pypi?%3Aaction=search&term=pyspark&submit=search, that's why pip is telling you it can't find it.
If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!
Donate Us With