How to handle output with Luigi

Tags:

luigi

I'm trying to grasp how Luigi works, and I get the idea, but the actual implementation is a bit harder ;) This is what I have:

import luigi


class MyTask(luigi.Task):

    x = luigi.IntParameter()

    def requires(self):
        return OtherTask(self.x)

    def run(self):
        print(self.x)

class OtherTask(luigi.Task):

    x = luigi.IntParameter()

    def run(self):
        y = self.x + 1
        print(y)

And this fails with RuntimeError: Unfulfilled dependency at run time: OtherTask_3_5862334ee2. I've figured out that I need to produce output using def output(self): to work around this issue/feature. But I can't figure out how to produce reasonable output without writing to a file, say:

def output(self):
    return luigi.LocalTarget('words.txt')

def run(self):

    words = [
            'apple',
            'banana',
            'grapefruit'
            ]

    with self.output().open('w') as f:
        for word in words:
            f.write('{word}\n'.format(word=word))

I've tried reading the documentation, but I can't understand the concept behind output at all. What if I need to output to the screen only? What if I need to output an object to another task? Thanks!


asked Sep 14 '16 16:09

4c74356b41

1 Answer

What if I need to output an object to another task?

Luigi tasks can run in different processes. To hand an object produced by one task to another, you therefore usually have to write it to disk, a database, a pickle file, or some other external store that lets the processes exchange data and whose existence can be verified.

Instead of writing an output() method, which requires a target, you can also override the complete() method and put any custom logic there that decides whether the task is considered complete.


answered Oct 14 '22 01:10

MattMcKnight
