Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

Python support for SparkRunner in Apache Beam

Is there support for running python programs using Apache beam and SparkRunner?

The documentation doesn't seem to have it: https://beam.apache.org/get-started/wordcount-example/#apache-spark-runner

And when I look at the API reference https://beam.apache.org/documentation/sdks/pydoc/0.6.0/apache_beam.runners.html I don't find any mention of SparkRunner in there.

There is mention and support for Java I believe, but I'm wondering if the python support is there?

like image 528
Rohan Jain Avatar asked Jun 11 '26 18:06

Rohan Jain


1 Answers

There's no support for running pipelines built with Apache Beam's Python SDK on the Apache Spark at the moment. However, this work is in progress, embodied in the Apache Beam portability framework.

Stay tuned -- this is something that should be available relatively soon!

like image 87
Davor Bonaci Avatar answered Jun 13 '26 09:06

Davor Bonaci



Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!