How to run different Apache Nutch jobs in parallel

Tags: java, apache, web-crawler, nutch

I am using Nutch 2.3. All jobs run one after another: first generate, then fetch, parse, index, and so on. I want to run some jobs simultaneously. I know some jobs cannot run in parallel, but others can; e.g. the parse, dbupdate, and index jobs should be able to run alongside fetch.

Is this possible? My basic objective is to keep the fetcher job running all the time. I suppose this could be done with different timestamps. Can anyone point me to the proper way?

asked May 05 '15 06:05

Hafiz Muhammad Shafiq

1 Answer

If you look at the Nutch web app server, you will find that it can execute multiple crawl jobs in parallel. You should check out the Nutch 2.3 source code for the webapp (NutchUiServer). Hope this helps.
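
To make the overlap the question asks for concrete, here is a minimal Java sketch of the scheduling idea only. It is not taken from the Nutch source: `runJob` is a hypothetical stand-in that just records which job ran on which batch, and the `batch-N` identifiers are made up for illustration. The assumption is that each round works on its own batch, so the fetch of one batch can run while an earlier batch is still being parsed, updated, and indexed.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.CopyOnWriteArrayList;
import java.util.concurrent.ExecutionException;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;

class PipelinedCrawl {
    // Hypothetical stand-in for launching one Nutch job on one batch.
    // A real version would start the actual job here; this one only logs.
    static void runJob(String job, String batchId, List<String> log) {
        log.add(job + ":" + batchId);
    }

    static List<String> crawl(int rounds) {
        List<String> log = new CopyOnWriteArrayList<>();
        // One worker keeps fetching batch after batch; a second worker
        // handles parse/updatedb/index for batches that are already fetched.
        ExecutorService fetcher = Executors.newSingleThreadExecutor();
        ExecutorService postProcessor = Executors.newSingleThreadExecutor();
        List<Future<?>> pending = new ArrayList<>();
        try {
            for (int i = 0; i < rounds; i++) {
                String batchId = "batch-" + i; // made-up ID, one per round
                runJob("generate", batchId, log);
                Future<?> fetch = fetcher.submit(() -> runJob("fetch", batchId, log));
                pending.add(postProcessor.submit(() -> {
                    fetch.get(); // wait only for this batch's fetch
                    runJob("parse", batchId, log);
                    runJob("updatedb", batchId, log);
                    runJob("index", batchId, log);
                    return null;
                }));
            }
            for (Future<?> f : pending) {
                f.get(); // surface failures from any round
            }
        } catch (InterruptedException | ExecutionException e) {
            throw new IllegalStateException("crawl round failed", e);
        } finally {
            fetcher.shutdown();
            postProcessor.shutdown();
        }
        return log;
    }

    public static void main(String[] args) {
        crawl(3).forEach(System.out::println);
    }
}
```

Within a batch the order generate, fetch, parse, updatedb, index is preserved, while the fetch worker is free to move on to the next batch. Note that this sketch generates later batches without waiting for the previous updatedb, so whether that is acceptable for a real crawl is something you would have to check against your setup.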

answered Nov 15 '22 17:11

Mubin Shrestha
