How do you send arguments to a generator function using tf.data.Dataset.from_generator()?

I would like to create several tf.data.Dataset objects using the from_generator() function, and I would like to pass an argument to the generator function (raw_data_gen). The idea is that the generator function will yield different data depending on the argument it receives, so that raw_data_gen can provide either training, validation or test data.

training_dataset = tf.data.Dataset.from_generator(raw_data_gen, (tf.float32, tf.uint8), ([None, 1], [None]), args=([1]))

validation_dataset = tf.data.Dataset.from_generator(raw_data_gen, (tf.float32, tf.uint8), ([None, 1], [None]), args=([2]))

test_dataset = tf.data.Dataset.from_generator(raw_data_gen, (tf.float32, tf.uint8), ([None, 1], [None]), args=([3]))

The error message I get when I try to call from_generator() in this way is:

TypeError: from_generator() got an unexpected keyword argument 'args'

Here is the raw_data_gen function, although I'm not sure you will need it, as my hunch is that the problem is with the call to from_generator():

def raw_data_gen(train_val_or_test):

    if train_val_or_test == 1:        
        #For every filename collected in the list
        for filename, lab in training_filepath_label_dict.items():
            raw_data, samplerate = soundfile.read(filename)
            try: #assume the audio is stereo, ready to be sliced
                raw_data = raw_data[:,0] #raw_data is a np.array, just take first channel with slice
            except IndexError:
                pass #this must be mono audio
            yield raw_data, lab

    elif train_val_or_test == 2:
        #For every filename collected in the list
        for filename, lab in validation_filepath_label_dict.items():
            raw_data, samplerate = soundfile.read(filename)
            try: #assume the audio is stereo, ready to be sliced
                raw_data = raw_data[:,0] #raw_data is a np.array, just take first channel with slice
            except IndexError:
                pass #this must be mono audio
            yield raw_data, lab

    elif train_val_or_test == 3:
        #For every filename collected in the list
        for filename, lab in test_filepath_label_dict.items():
            raw_data, samplerate = soundfile.read(filename)
            try: #assume the audio is stereo, ready to be sliced
                raw_data = raw_data[:,0] #raw_data is a np.array, just take first channel with slice
            except IndexError:
                pass #this must be mono audio
            yield raw_data, lab

    else:
        print("generator function called with an argument not in [1, 2, 3]")
        raise ValueError()

asked Sep 21 '18 11:09

michael_question_answerer

1 Answer

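The TypeError means that from_generator() in your installed TensorFlow version has no args parameter (later releases added one). A workaround that does not depend on the version is to define a new function based on raw_data_gen that doesn't take any arguments. You can use a lambda expression to do this.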

training_dataset = tf.data.Dataset.from_generator(lambda: raw_data_gen(train_val_or_test=1), (tf.float32, tf.uint8), ([None, 1], [None]))
...

Now, we are passing a function to from_generator that doesn't take any arguments, but that will simply act as raw_data_gen with the argument set to 1. You can use the same scheme for the validation and test sets, passing 2 and 3 respectively.
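
To illustrate the wrapping idea without needing TensorFlow, here is a minimal sketch in plain Python. The names toy_gen, train_fn, val_fn, late_bound and early_bound are made up for this example, and toy_gen is only a stand-in for raw_data_gen. It shows that functools.partial gives the same kind of zero-argument callable as the lambda. It also shows a pitfall to watch for if you build the three datasets in a loop: a lambda looks up the loop variable when it is called, not when it is defined.

```python
from functools import partial


def toy_gen(split):
    """Hypothetical stand-in for raw_data_gen: yields (sample, label) pairs."""
    splits = {
        1: [("train_a", 0), ("train_b", 1)],
        2: [("val_a", 0)],
        3: [("test_a", 1)],
    }
    if split not in splits:
        raise ValueError(f"split must be 1, 2 or 3, got {split!r}")
    yield from splits[split]


# Two equivalent ways to get a callable that takes no arguments.
train_fn = lambda: toy_gen(1)
val_fn = partial(toy_gen, 2)

# Pitfall: these lambdas all read `s` when they are called, after the
# comprehension has finished, so every one of them sees s == 3.
late_bound = [lambda: toy_gen(s) for s in (1, 2, 3)]

# partial stores the value of `s` at creation time, so each callable
# keeps its own split.
early_bound = [partial(toy_gen, s) for s in (1, 2, 3)]

print(list(train_fn()))        # training pairs
print(list(late_bound[0]()))   # test pairs, probably not what was intended
print(list(early_bound[0]()))  # training pairs
```

Either train_fn or val_fn could be passed as the first argument to from_generator() in place of the lambda shown above. If you prefer to stay with lambdas inside a loop, binding the value as a default argument (lambda s=s: toy_gen(s)) avoids the same problem.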

answered Oct 12 '22 13:10

xdurch0

Tags: python, python-3.x, tensorflow, tensorflow-datasets
