Python Azure SDK: Using list_blobs to get more than 5,000 results

Tags: python, azure, azure-blob-storage

I'm having trouble with the Python Azure SDK and haven't found anything on either Stack Overflow or the MSDN forums.

I want to use the Azure SDK's list_blobs() to get a list of blobs, but there are more than 5,000 of them (5,000 being the maximum for maxresults).

If I take a look at the code in the SDK itself then I see the following:

    def list_blobs(self, container_name, prefix=None, marker=None,
                   maxresults=None, include=None, delimiter=None):

The description of 'marker' is:

    marker:
        Optional. A string value that identifies the portion of the list
        to be returned with the next list operation. The operation returns
        a marker value within the response body if the list returned was
        not complete. The marker value may then be used in a subsequent
        call to request the next set of list items. The marker value is
        opaque to the client.

My problem is that I don't know how to use the marker to get the next set of 5,000 results. If I try something like this:

    blobs = blobservice.list_blobs(target_container, prefix=prefix)
    print(blobs.marker)

then the marker is always empty, which I assume is because list_blobs() already parses the blobs out of the response.

But if that is the case then how do I actually use the marker in a meaningful way?

I'm sorry if this is a stupid question but this actually is the first one that I didn't find an answer for, even after searching extensively.

Cheers!


asked Jun 19 '14 08:06

user3755680

1 Answer

The SDK returns the continuation token in an attribute called next_marker on the result of list_blobs(). You should pass that value as the marker in the next call to get the next set of blobs. See the code below as an example. Here I'm listing 100 blobs from a container at a time:

    from azure.storage import BlobService

    blob_service = BlobService(account_name='<accountname>', account_key='<accountkey>')

    # Start without a marker; each response supplies the marker for the next call.
    next_marker = None
    while True:
        blobs = blob_service.list_blobs('<containername>', maxresults=100, marker=next_marker)
        next_marker = blobs.next_marker
        print(next_marker)
        print(len(blobs))
        # Stop when there is no continuation token (None or an empty string).
        if not next_marker:
            break
    print("done")

P.S. The code above throws an exception on the last iteration. Not sure why. But it should give you an idea.
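
If you want all blobs as one stream instead of handling pages by hand, the same loop can be wrapped in a generator. This is only a sketch, not something from the SDK: iter_all_blobs, FakePage and FakeBlobService are hypothetical names I made up so the example runs without a storage account. It assumes the result of list_blobs() is iterable and has a next_marker attribute that is None or empty on the last page.

```python
class FakePage(list):
    """Hypothetical stand-in for the SDK result: a list with a next_marker."""

    def __init__(self, items, next_marker):
        super().__init__(items)
        self.next_marker = next_marker


class FakeBlobService:
    """Hypothetical in-memory service that mimics marker-based paging."""

    def __init__(self, names):
        self._names = sorted(names)

    def list_blobs(self, container_name, prefix=None, marker=None,
                   maxresults=None):
        names = [n for n in self._names
                 if prefix is None or n.startswith(prefix)]
        # The fake marker is just an offset; real markers are opaque strings.
        start = int(marker) if marker else 0
        end = start + (maxresults or 5000)
        next_marker = str(end) if end < len(names) else ''
        return FakePage(names[start:end], next_marker)


def iter_all_blobs(service, container_name, prefix=None, page_size=5000):
    """Yield every blob by following next_marker until it is empty."""
    marker = None
    while True:
        page = service.list_blobs(container_name, prefix=prefix,
                                  marker=marker, maxresults=page_size)
        for blob in page:
            yield blob
        marker = page.next_marker
        if not marker:  # None or '' means there are no more pages
            break


if __name__ == "__main__":
    service = FakeBlobService("blob%05d" % i for i in range(12))
    for name in iter_all_blobs(service, "<containername>", page_size=5):
        print(name)
```

With a real service object you would pass blob_service in place of the fake one; the loop inside the generator is the same as in the example above.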

answered Sep 28 '22 21:09

Gaurav Mantri
