Let's say I have defined a dataset in this way:
filename_dataset = tf.data.Dataset.list_files("{}/*.png".format(dataset))
How can I get the number of elements in the dataset (that is, the number of individual elements that make up a single epoch)?
I know that tf.data.Dataset already knows the size of the dataset, because the repeat() method allows repeating the input pipeline for a specified number of epochs. So there must be a way to get this information.
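For example, repeat() can take an explicit epoch count (the 10 here is just a placeholder value):

# Repeat the file list for a fixed number of epochs.
filename_dataset = filename_dataset.repeat(10)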
len(list(dataset))
works in eager mode, although that's obviously not a good general solution.
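For example, in TF 2.x with eager execution this materializes every element just to count them (the glob pattern below is a placeholder):

import tensorflow as tf

# Placeholder pattern; point it at your own directory of PNG files.
filename_dataset = tf.data.Dataset.list_files("images/*.png")

# Works in eager mode, but iterates the whole dataset just to count it.
num_elements = len(list(filename_dataset))
print(num_elements)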
Take a look here: https://github.com/tensorflow/tensorflow/issues/26966
It doesn't work for TFRecord datasets, but it works fine for other types.
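When it doesn't work, one possible fallback (just a sketch, not from the linked issue) is to count elements with a full pass over the data using Dataset.reduce, where dataset is any tf.data.Dataset:

import tensorflow as tf

# Counts elements by iterating the whole dataset once; slow, but works even
# when the size is not statically known (e.g. TFRecord datasets).
count = dataset.reduce(tf.constant(0, dtype=tf.int64), lambda acc, _: acc + 1)
print(count.numpy())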
TL;DR:
num_elements = tf.data.experimental.cardinality(dataset).numpy()
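A slightly fuller sketch, assuming TF 2.x in eager mode (the .numpy() call requires eager execution); note that cardinality can also report unknown or infinite, which is worth checking for:

import tensorflow as tf

filename_dataset = tf.data.Dataset.list_files("images/*.png")  # placeholder pattern

cardinality = tf.data.experimental.cardinality(filename_dataset)
if cardinality == tf.data.experimental.UNKNOWN_CARDINALITY:
    print("Cardinality could not be determined statically")
elif cardinality == tf.data.experimental.INFINITE_CARDINALITY:
    print("Dataset is infinite (e.g. repeat() without a count)")
else:
    print("Number of elements:", cardinality.numpy())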