Google File System (GFS) Design and Architecture
This paper presents Google File System, a scalable distributed file system designed for large data-intensive applications on commodity hardware. It discusses assumptions, relaxed consistency models, architecture with a single master and many chunkservers, replication, metadata management, atomic record append, and fault tolerance in GFS.