Chunk Size Issues in Google Filesystem

Tags:

filesystems

Google File System Paper -

Chunk size is one of the key design parameters. We have chosen 64 MB, which is much larger than typical file sys- tem block sizes. Each chunk replica is stored as a plain Linux file on a chunkserver and is extended only as needed. Lazy space allocation avoids wasting space due to internal fragmentation, perhaps the greatest objection against such a large chunk size.

What is lazy space allocation and how is it going to solve the internal fragmentation problem?

A small file consists of a small number of chunks, perhaps just one. The chunkservers storing those chunks may become hot spots if many clients are accessing the same file ... We fixed this problem by storing such executables with a higher replication factor and by making the batch- queue system stagger application start times.

What is staggering application start times and how does it avoid chunk-servers from becoming hot-spots?

726

asked Apr 22 '11 19:04

1 Answers

Lazy space allocation means the filesystem doesn't actually give the file space before it's written. They're commonly referred to as sparse files. For example, if only the first 2MB of the 64MB chunk file is used, only 2MB will actually be used on disk.

Staggering application start times just means that they don't start everything at once. If every application needs to read a few configuration files stored in GFS upon startup, if they all start at the same time, there will be load problems. Spreading out the startup times alleviates this.

193

answered Oct 28 '22 23:10

rmmh

Related questions
                            
                                Logging and Configuration Systems: Circular Dependency
                            
                                Optimization of consecutive map/filter/fold calls
                            
                                Find subset of points whose distance among each other is a multiple of a number
                            
                                What is a priority queue and what is it useful for
                            
                                Minimum cost path from (0,0) to (N,N) on 2D grid
                            
                                Are there viable and type safe alternatives to the 1:1 type/type-class-instance relation?
                            
                                Finding clusters of mass in a matrix/bitmap
                            
                                Open a new tab in firefox and keep ff in the background
                            
                                Flexible, Solid and Portable Service Discovery
                            
                                Searching algorithm
                            
                                Prim's MST algorithm in O(|V|^2)
                            
                                Algorithm to find fewest number of tags that encompass all items?
                            
                                I want to write a tool without usage entry barriers. Do I have to write it in C? [closed]
                            
                                Are there any benefits to following the open/closed principle when using BDD?
                            
                                How to discover if some day is a workday
                            
                                How could random functions be really random?
                            
                                Efficiently get sorted sums of a sorted list
                            
                                How are J/K/APL classified in terms of common paradigms?
                            
                                Radix Sort for Negative Integers
                            
                                Second max in BST

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Chunk Size Issues in Google Filesystem

Tags:

language-agnostic

filesystems

Vaibhav Bajpai

People also ask

1 Answers

rmmh

Recent Activity

Donate For Us