I'm not able to run a simple Spark job in Scala IDE (a Maven Spark project) installed on Windows 7. The Spark core dependency has been added.
val conf = new SparkConf().setAppName("DemoDF").setMaster("local")
val sc = new SparkContext(conf)
val logData = sc.textFile("File.txt")
logData.count()
Error:
16/02/26 18:29:33 INFO SparkContext: Created broadcast 0 from textFile at FrameDemo.scala:13
16/02/26 18:29:34 ERROR Shell: Failed to locate the winutils binary in the hadoop binary path
java.io.IOException: Could not locate executable null\bin\winutils.exe in the Hadoop binaries.
    at org.apache.hadoop.util.Shell.getQualifiedBinPath(Shell.java:278)
    at org.apache.hadoop.util.Shell.getWinUtilsPath(Shell.java:300)
    at org.apache.hadoop.util.Shell.<clinit>(Shell.java:293)
    at org.apache.hadoop.util.StringUtils.<clinit>(StringUtils.java:76)
    at org.apache.hadoop.mapred.FileInputFormat.setInputPaths(FileInputFormat.java:362)
    at org.apache.spark.SparkContext$$anonfun$hadoopFile$1$$anonfun$33.apply(SparkContext.scala:1015)
    at org.apache.spark.SparkContext$$anonfun$hadoopFile$1$$anonfun$33.apply(SparkContext.scala:1015)
    at org.apache.spark.rdd.HadoopRDD$$anonfun$getJobConf$6.apply(HadoopRDD.scala:176)
    at org.apache.spark.rdd.HadoopRDD$$anonfun$getJobConf$6.apply(HadoopRDD.scala:176)
    at scala.Option.map(Option.scala:145)
    at org.apache.spark.rdd.HadoopRDD.getJobConf(HadoopRDD.scala:176)
    at org.apache.spark.rdd.HadoopRDD.getPartitions(HadoopRDD.scala:195)
    at org.apache.spark.rdd.RDD$$anonfun$partitions$2.apply(RDD.scala:239)
    at org.apache.spark.rdd.RDD$$anonfun$partitions$2.apply(RDD.scala:237)
    at scala.Option.getOrElse(Option.scala:120)
    at org.apache.spark.rdd.RDD.partitions(RDD.scala:237)
    at org.apache.spark.rdd.MapPartitionsRDD.getPartitions(MapPartitionsRDD.scala:35)
    at org.apache.spark.rdd.RDD$$anonfun$partitions$2.apply(RDD.scala:239)
    at org.apache.spark.rdd.RDD$$anonfun$partitions$2.apply(RDD.scala:237)
    at scala.Option.getOrElse(Option.scala:120)
    at org.apache.spark.rdd.RDD.partitions(RDD.scala:237)
    at org.apache.spark.SparkContext.runJob(SparkContext.scala:1929)
    at org.apache.spark.rdd.RDD.count(RDD.scala:1143)
    at com.org.SparkDF.FrameDemo$.main(FrameDemo.scala:14)
    at com.org.SparkDF.FrameDemo.main(FrameDemo.scala)
What does Spark need WinUtils for? To run Apache Spark locally on Windows, Spark relies on a piece of the Hadoop code base called 'WinUtils'. It lets Hadoop manage the POSIX-style file-system permissions that HDFS expects of the local file system.
Here is a good explanation of the problem, along with the solution.
Download the version of winutils.exe that matches your Hadoop build from https://github.com/steveloughran/winutils.
Set your HADOOP_HOME environment variable, either at the OS level or programmatically:
System.setProperty("hadoop.home.dir", "full path to the folder that contains bin\winutils.exe");
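For reference, here is a minimal sketch of the whole fix in Scala, assuming winutils.exe has been placed in a hypothetical folder C:\hadoop\bin (note that hadoop.home.dir points at the parent of bin, not at bin itself):

import org.apache.spark.{SparkConf, SparkContext}

object FrameDemo {
  def main(args: Array[String]): Unit = {
    // Assumption: winutils.exe lives at C:\hadoop\bin\winutils.exe,
    // so hadoop.home.dir must point at C:\hadoop (the parent of bin).
    // Setting HADOOP_HOME=C:\hadoop at the OS level has the same effect.
    // Set this before the SparkContext is created, so Hadoop picks it up.
    System.setProperty("hadoop.home.dir", "C:\\hadoop")

    val conf = new SparkConf().setAppName("DemoDF").setMaster("local")
    val sc = new SparkContext(conf)

    val logData = sc.textFile("File.txt")
    println(s"Line count: ${logData.count()}")

    sc.stop()
  }
}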
Enjoy