Is there a way to do symbolic links to the blob data when using Azure Storage to avoid duplicate blobs?

Tags:

I have a situation where a user is attaching files within an application, these files are then persisted to Azure Blob storage, there is a reasonable likelihood that there are going to be duplicates and I want to put in place a solution where duplicate blobs are avoided.

My first thought is to just name the blob as filename_hash but that only captures a subset of duplicates, then filesize_hash was then next thought.

In doing this though it seems like I am losing some of the flexibility of the blob storage to represent the position in a hierarchy of the file, see: Windows Azure: How to create sub directory in a blob container

So I was looking to see if there was a way to create a blob that referenced the blob data i.e. some for of symbolic link but couldn't find what I wanted.

Am I missing something or should I just go with filesize_hash method and store my hierarchy using an alternative method.

853

asked Oct 03 '11 22:10

JTew

1 Answers

No, there's no symbolic links (source: http://social.msdn.microsoft.com/Forums/vi-VN/windowsazuredata/thread/6e5fa93a-0d09-44a8-82cf-a3403a695922).

A good solution depends on the anticipated size of the files and the number of duplicates. If there aren't going to be many duplicates, or the files are small, then it may actually be quicker and cheaper to live with it - $0.15 per gigabyte per month is not a great deal to pay, compared to the development cost! (That's the approach we're taking.)

If it was worthwhile to remove duplicates I'd use table storage to create some kind of redirection between the file name and the actual location of the data. I'd then do a client-side redirect to redirect the client's browser to download the proper version.

If you do this you'll want to preserve the file name (as that will be what's visible to the user) but you can call the "folder" location what you want.

133

answered Nov 15 '22 10:11

Jeremy McGee

Related questions
                            
                                MSBuild Tools 2017 with Azure SDK 2.9.6
                            
                                Azure App Service: How can I determine which process is consuming high CPU?
                            
                                What is the clean up mechanism for the blobs that WebJobs SDK creates in the AzureWebJobsDashboard connection?
                            
                                Azure functions in sub folders
                            
                                Azure SQL Creating Database Scoped Credential
                            
                                Can't connect from azure resource to Azure database for postgres server
                            
                                How to deploy Angular 6 with .NET Core 2.0 Web API Application to Microsoft Azure?
                            
                                Get Webhook url of a Function App in ARM to use for Event Grid Subscription
                            
                                Logic App : Finding element in Json Object array (like XPath fr XML)
                            
                                Can you set metadata on an Azure CloudBlockBlob at the same time as uploading it?
                            
                                Downloading files to Azure function app to manipulate
                            
                                Starting Azure Service Bus Trigger Function throws InvalidOperationException for "Host not yet started"
                            
                                How to use azure-sqldb-spark connector in pyspark
                            
                                Using AddAzureKeyVault makes my application 10 seconds slower
                            
                                How to install several .NET Core SDK versions on Azure Devops
                            
                                Azure ARM template ResourceNotFound error when referencing managed identity in key vault access policy
                            
                                How to associate an Azure app service with an application insights resource (new or existing) using terraform?
                            
                                Stream Bytes chunks to csv rows in python
                            
                                Windows Azure - Transferring .NET Web Application to Azure
                            
                                Is Windows Azure Storage (Blob, Table, Queue) optimized for access from Windows Azure Roles?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Is there a way to do symbolic links to the blob data when using Azure Storage to avoid duplicate blobs?

Tags:

duplicates

symlink

storage

blob

azure

JTew

People also ask

1 Answers

Jeremy McGee

Recent Activity

Donate For Us