Skip to content Skip to sidebar Skip to footer

Error With H20context Running Pysparkling With Spark 2.1

I'm getting this error when trying to run a Pysparkling script on an AWS EMR cluster. I can get everything to work when downloading Sparkling water 2.1.8 and running it from a pysp

Solution 1:

You do not need to attach Sparkling Water packages (option --packages), but you need to provide pysparkling Python package (it contains all necessary binary dependencies internally).

The best way is to download binary distribution of Sparkling Water from http://h2o.ai/download and use bin/pysparkling script or use spark directly:

$SPARK_HOME/bin/pyspark --py-files h2o_pysparkling_2.1-2.1.8.zip

Post a Comment for "Error With H20context Running Pysparkling With Spark 2.1"