I'm using Linux with Hadoop, Cloudera and HBase. Could you tell me how to correct this error? Error: <code>could to find or load main class org.apache.nutch.crawl.InjectorJob</code> The following command gave me the error: <pre class="prettyprint"><code>src/bin/nutch inject crawl/crawldb dmoz/ </code></pre> if you need any other information ask for me.

I think you probably missed a step or two. Please confirm: <ol> <li>Did you install Apache ANT and then navigate to the nutch folder and type in "ant"?</li> <li>Did you set the environment variables: <ul> <li>NUTCH_JAVA_HOME: The java implementation to use. Overrides <code>JAVA_HOME</code>.</li> <li>NUTCH_HEAPSIZE: The maximum amount of heap to use, in MB. Default is 1000.</li> <li>NUTCH_OPTS: Extra Java runtime options.Multiple options must be separated by white space.</li> <li>NUTCH_LOG_DIR: Log directory <code>(default: $NUTCH_HOME/logs)</code> </li> <li>NUTCH_LOGFILE: Log file <code>(default: hadoop.log)</code> </li> <li>NUTCH_CONF_DIR: Path(s) to configuration files <code>(default: $NUTCH_HOME/conf)</code>. Multiple paths must be separated by a colon ':'.</li> <li>JAVA_HOME</li> <li>NUTCH_JAVA_HOME </li> <li>NUTCH_HOME </li> </ul> </li> </ol> If you install using "ant", then you will get a new folder in <code>/nutch called /nutch/runtime/local</code> and this is from where you must actually run nutch. Tip: Try reading this page.

could to find or load main class org.apache.nutch.crawl.InjectorJob

Tags:

solr

hadoop

nutch

I'm using Linux with Hadoop, Cloudera and HBase.

Could you tell me how to correct this error?

Error: could to find or load main class org.apache.nutch.crawl.InjectorJob

The following command gave me the error:

src/bin/nutch inject crawl/crawldb dmoz/

if you need any other information ask for me.

810

asked Mar 09 '15 09:03

orilion

1 Answers

I think you probably missed a step or two. Please confirm:

Did you install Apache ANT and then navigate to the nutch folder and type in "ant"?
Did you set the environment variables:
- NUTCH_JAVA_HOME: The java implementation to use. Overrides JAVA_HOME.
- NUTCH_HEAPSIZE: The maximum amount of heap to use, in MB. Default is 1000.
- NUTCH_OPTS: Extra Java runtime options.Multiple options must be separated by white space.
- NUTCH_LOG_DIR: Log directory (default: $NUTCH_HOME/logs)
- NUTCH_LOGFILE: Log file (default: hadoop.log)
- NUTCH_CONF_DIR: Path(s) to configuration files (default: $NUTCH_HOME/conf). Multiple paths must be separated by a colon ':'.
- JAVA_HOME
- NUTCH_JAVA_HOME
- NUTCH_HOME

If you install using "ant", then you will get a new folder in /nutch called /nutch/runtime/local and this is from where you must actually run nutch.

Tip: Try reading this page.

answered Oct 25 '22 00:10

coderama

Related questions
                            
                                Hadoop and Stata
                            
                                How to interpret MapReduce Performance Counters
                            
                                How to set up Hadoop in Docker Swarm?
                            
                                pyspark : how to check if a file exists in hdfs
                            
                                "LOST" node in EMR Cluster
                            
                                Making spark use /etc/hosts file for binding in YARN cluster mode
                            
                                Passing additional parameters to dbConnect function for JDBCDriver in R
                            
                                Not able to install hadoop using Cloudera Manager
                            
                                Why would Spark choose to do all work on a single node?
                            
                                HBASE 0.94.1 compatibility with hadoop
                            
                                Hadoop Ports Clarification

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With