Abstract: With the rapid advance of technology, massive amounts of data are generated every minute of every day. Social networking sites, mobile devices, sensors, and enterprise and scientific applications all contribute to this growth, and the volume of data has increased sharply in recent years. Data at this scale is called Big Data. Big Data has the potential to benefit many organizations, businesses, and public-sector bodies, and to contribute to economic development. HDFS, the Hadoop Distributed File System, was developed to support distributed file system designs. It is written in Java and provides scalable, reliable data storage. HDFS is designed to run on low-cost hardware while remaining highly fault tolerant. Data in HDFS is replicated across the cluster, so if a single process fails, computation continues without interruption. HDFS imposes no restrictions on the datasets it stores, and it supports both structured and schema-less data.