I am new to both Python and to threads. I have written Python code that acts as a web crawler and searches sites for a specific keyword. My question is: how can I use threads to run three different instances of my class at the same time? When one of the instances finds the keyword, all three must close and stop crawling the web. Here is some code:
class Crawler:
    def __init__(self):
        # the actual code for finding the keyword
        pass

def main():
    Crawl = Crawler()

if __name__ == "__main__":
    main()
How can I use threads to have Crawler do three different crawls at the same time?
With POSIX threads, a thread automatically terminates when it returns from its entry-point routine. A thread can also explicitly terminate itself, or terminate any other thread in the process, using a mechanism called cancellation.
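The same is true of Python's threading module: a thread ends as soon as its target function returns. A minimal sketch of that behaviour:

import threading

def worker():
    print("doing some work")
    # the thread terminates as soon as this function returns
    return

t = threading.Thread(target=worker)
t.start()
t.join()   # returns once worker() has finished and the thread has ended
print("thread has terminated")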
In .NET, the Thread.Abort(Object) method raises a ThreadAbortException in the thread on which it is invoked, to begin the process of terminating the thread while also providing exception information about the termination. Generally, this method is used to terminate the thread.
In C, when the main thread returns (i.e., you return from the main function), it terminates the entire process, including all other threads. The same thing happens when you call exit. You can avoid this by calling pthread_exit from the main thread instead.
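For what it's worth, CPython behaves differently by default: the interpreter waits for non-daemon threads before exiting. The closest analogue to the behaviour described above is a daemon thread, which is killed when the main thread exits. A minimal sketch:

import threading
import time

def background():
    while True:
        time.sleep(1)   # pretend to crawl forever

t = threading.Thread(target=background)
t.daemon = True         # daemon threads are killed when the main thread exits
t.start()
print("main thread exiting; the daemon thread dies with the process")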
There doesn't seem to be a (simple) way to terminate a thread in Python.
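The usual workaround is cooperative shutdown: each thread periodically checks a shared flag and returns on its own. Here is a rough sketch for the original question, assuming each crawler can check the flag between page fetches; the site lists and the keyword check below are placeholders for the real crawling logic:

import threading

stop_event = threading.Event()

def crawl(pages, keyword):
    for page in pages:              # placeholder: iterate over fetched pages
        if stop_event.is_set():
            return                  # another crawler already found the keyword
        if keyword in page:         # placeholder keyword check
            stop_event.set()        # tell the other crawlers to stop
            return

# three crawls running at the same time, each over its own (placeholder) pages
sites = [["spam", "keyword"], ["eggs"], ["ham"]]
threads = [threading.Thread(target=crawl, args=(pages, "keyword")) for pages in sites]
for t in threads:
    t.start()
for t in threads:
    t.join()
print("all crawlers stopped; keyword found:", stop_event.is_set())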
Here is a simple example of running multiple HTTP requests in parallel:
import threading
import urllib2

def crawl():
    data = urllib2.urlopen("http://www.google.com/").read()
    print "Read google.com"

threads = []
for n in range(10):
    thread = threading.Thread(target=crawl)
    thread.start()
    threads.append(thread)

# wait until all the threads have finished
print "Waiting..."
for thread in threads:
    thread.join()
print "Complete."
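The example above targets Python 2 (urllib2 and print statements). Under Python 3 the same idea would look roughly like this, using urllib.request instead:

import threading
import urllib.request

def crawl():
    data = urllib.request.urlopen("http://www.google.com/").read()
    print("Read google.com")

threads = []
for n in range(10):
    thread = threading.Thread(target=crawl)
    thread.start()
    threads.append(thread)

# wait until all the threads have finished
print("Waiting...")
for thread in threads:
    thread.join()
print("Complete.")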
At the cost of some additional overhead, you can use a multiprocessing approach, which is more powerful and allows you to terminate the worker processes from the outside.
I've extended the example above to use multiprocessing. I hope this will be helpful to you:
import multiprocessing
import urllib2

def crawl(result_queue):
    data = urllib2.urlopen("http://news.ycombinator.com/").read()
    print "Requested..."
    if "result found (for example)":   # placeholder condition (always true); put the real keyword check here
        result_queue.put("result!")
    print "Read site."

processes = []
result_queue = multiprocessing.Queue()

for n in range(4):                     # start 4 processes crawling for the result
    process = multiprocessing.Process(target=crawl, args=[result_queue])
    process.start()
    processes.append(process)

print "Waiting for result..."
result = result_queue.get()            # blocks until one of the processes has `.put()` a result

for process in processes:              # then kill them all off
    process.terminate()

print "Got result:", result
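To tie this back to the original question, here is a hypothetical Python 3 adaptation in which each process crawls a different start URL for a keyword and the first hit shuts everything down. The URLs, the keyword, and the single-page "crawl" are placeholders for the real Crawler logic:

import multiprocessing
import urllib.request

KEYWORD = "python"                      # placeholder keyword
URLS = ["https://www.python.org/",      # placeholder start URLs, one per process
        "https://docs.python.org/3/",
        "https://pypi.org/"]

def crawl(url, result_queue):
    # placeholder for the real crawl: fetch one page and look for the keyword
    data = urllib.request.urlopen(url).read().decode("utf-8", errors="replace")
    if KEYWORD in data.lower():
        result_queue.put(url)           # report which site contained the keyword

if __name__ == "__main__":
    result_queue = multiprocessing.Queue()
    processes = []
    for url in URLS:
        p = multiprocessing.Process(target=crawl, args=(url, result_queue))
        p.start()
        processes.append(p)

    # blocks until one process reports a hit; note it would wait forever
    # if no process ever finds the keyword
    found_at = result_queue.get()
    for p in processes:                 # then kill the remaining crawlers
        p.terminate()
    print("Keyword found at:", found_at)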