HDFS is the Hadoop Distributed File System. Whatever kind of file you load into it (for example images such as .jpg, .gif, or .png, media such as MP4 or MP3, text files, or compressed files), Hadoop treats it simply as a file. The extension does not matter: any data set loaded into HDFS is handled as a plain file.
Here we load test1.mp4 and see how HDFS splits and stores it.
- Format the NameNode with the command > hadoop namenode -format
- Start all services. This is Hadoop 1, so all services are started with the start-all.sh command
- Load test1.mp4 (file size 85 MB) into HDFS with the hadoop fs -put command
- HDFS splits the 85 MB test1.mp4 into two blocks (one of 64 MB, the default block size in Hadoop 1, and another of 21 MB), and both are stored on the DataNode.
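
The block arithmetic in the last step can be sketched in a few lines of Python. This is only an illustration of the sizes involved, not how HDFS is implemented; the function name split_into_blocks is made up for this example, and the 64 MB default is assumed to match the Hadoop 1 setup described above.

```python
def split_into_blocks(file_size_mb, block_size_mb=64):
    """Return the sizes (in MB) of the blocks a file would occupy.

    Hypothetical helper for illustration only: it mirrors the arithmetic
    of fixed-size block splitting, assuming a 64 MB block size.
    """
    if file_size_mb < 0 or block_size_mb <= 0:
        raise ValueError("file size must be >= 0 and block size must be > 0")
    # Number of completely filled blocks, plus whatever is left over.
    full_blocks, remainder = divmod(file_size_mb, block_size_mb)
    blocks = [block_size_mb] * full_blocks
    if remainder:
        # The last block only holds the remaining data, not a full 64 MB.
        blocks.append(remainder)
    return blocks


print(split_into_blocks(85))  # [64, 21]
```

The last block is only as large as the data left over, which is why the second block of test1.mp4 is 21 MB rather than a full 64 MB.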