Hadoop comes with a distributed filesystem called HDFS, which stands for Hadoop Distributed Filesystem inspired by the Google File System (GFS). HDFS is a highly fault tolerant, distributed, reliable, scalable file system for storing very large files running on cluster of commodity hardware. Commodity hardware is cheap and less reliable. HDFS has master/slave [...]