Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

What happens to a file in Azure Storage V2 after 30 days in cool tier and 180 days in archive tier?

I am new to Azure Storage account. While studying I found following link:

https://docs.microsoft.com/en-us/azure/storage/blobs/storage-blob-storage-tiers#archive-access-tier-preview

It says following about available access tiers:

  • Hot - Optimized for storing data that is accessed frequently.

  • Cool - Optimized for storing data that is infrequently accessed and stored for at least 30 days.

  • Archive - Optimized for storing data that is rarely accessed and stored for at least 180 days with flexible latency requirements (on the order of hours).

Here I am little confused. Cool and archive tiers are used to store my data which is not accessed so frequently but I need after long time.

But according to what I understand from above link, file will be deleted from cool tier after 30 days and from archive tier after 180 days.

So, If it is true that If any file is not accessed for 30 days in cool tier and 180 days in archive tier then that will be deleted and never be accessible?

And if not that what do they mean by "Optimized for storing data that is infrequently accessed and stored for at least 30 days"?

like image 580
Yash Avatar asked Apr 29 '19 13:04

Yash


People also ask

What storage tier is used for a blob that is uploaded once every 30 days on average?

Cool tier - An online tier optimized for storing data that is infrequently accessed or modified. Data in the cool tier should be stored for a minimum of 30 days. The cool tier has lower storage costs and higher access costs compared to the hot tier.

Which term describes the process of moving data from an archive tier to a cool or hot tier?

To read data in archive storage, you must first change the tier of the blob to hot or cool. This process is known as rehydration and can take up to 15 hours to complete for blobs less than 50 GB. Additional time required for larger blobs varies with the blob throughput limit.

What are the 3 tiers for Azure storage?

Azure offers three storage tiers to store data in blob storage: Hot Access tier, Cool Access tier, and Archive tier.

How long should data remain archived before incurring early rehydration fees?

Blobs in the Archive tier should be stored for a minimum of 180 days. Deleting or changing the tier of an archived blob before the 180-day period elapses incurs an early deletion fee.


1 Answers

But according to what I understand from above link, file will be deleted from cool tier after 30 days and from archive tire after 180 days.

No, Microsoft will definitely not automagically delete your data, if your subscription is in good standing. They want you to store as much data for as long as possible - it's more profitable for them :) (Although note that there is an explicit Blob Lifecycle Management feature in the works)

What MS mean by this is that Cool and Archive storage is cost-optimized for a period of at least 30 and 180 days, respectively. In essence, you'll be 'penalized' for deleting data before 30 / 180 days by paying in the same amount as if you had left the file there for 30 or 180 days.

If you scan the pricing pages for block blobs

Cool and Archive early deletion In addition to the per-GB, per-month charge, any blob that is moved to Archive is subject to an Archive early deletion period of 180 days. Additionally, for General Purpose v2 accounts, any blob that is moved to Cool is subject to a Cool early deletion period of 30 days. This charge is prorated. For example, if a blob is moved to Archive and then deleted or moved to the Hot tier after 45 days, the customer is charged an early deletion fee equivalent to 135 (180 minus 45) days of storing that blob in Archive.

FWIW my 10 cents:

Hot Tier - You pay a bit more (~20%) to store the data per GB per month, but you can store it for as short a period as you like, and you aren't charged bandwidth to retrieve the data. A common use case here would be for user-uploaded data which would typically be shared or published and frequently read.

Cold Tier - slightly cheaper storage per GB per month, but retrieval isn't free. The use case here would be for data which you don't necessarily expect to read frequently, such as log data, or short term backups, but if you do need to read it, it's still quick to get to the data.

The read and write performance of hot and cold should be similar

If in doubt, I would start with hot, and then only once I understood the typical lifespan and access needs to my data would I start changing things up.

Premium Tier - Stored on SSD's - storage will cost considerably more, but performance will be better. For small files which need to be retrieved quickly, I would also consider looking at a secondary caching tech like Azure Redis Cache - that way you can persist data in cheaper hot storage, and cache in Redis.

Archive Tier - Although the storage is insanely cheap, it should be regarded as offline. You'll pay a hefty $5 fee for your first 10k read operations from Archive, and the SLA on read latency is several hours. Use cases for Archive could be for old backup archives, or bulk storage for corporate governance or statutory reasons (e.g. 5 year's worth of financial records), i.e. data which you hope to never need again, but will gladly pay for if you do.

like image 153
StuartLC Avatar answered Sep 30 '22 02:09

StuartLC