{"id":960,"date":"2017-06-07T12:02:39","date_gmt":"2017-06-07T12:02:39","guid":{"rendered":"https:\/\/www.bsetec.com\/blog\/?p=960"},"modified":"2025-02-18T10:28:28","modified_gmt":"2025-02-18T10:28:28","slug":"hadoop-distributed-file-system-hdfs","status":"publish","type":"post","link":"https:\/\/www.bsetec.com\/blog\/hadoop-distributed-file-system-hdfs\/","title":{"rendered":"Hadoop Distributed File System (HDFS)"},"content":{"rendered":"<div class=\"wp-block-image\">\n<figure class=\"aligncenter\"><a href=\"https:\/\/www.bsetec.com\/blog\/wp-content\/uploads\/2017\/06\/HDFS.png\"><img fetchpriority=\"high\" decoding=\"async\" width=\"554\" height=\"310\" src=\"https:\/\/www.bsetec.com\/blog\/wp-content\/uploads\/2017\/06\/HDFS.png\" alt=\"HDFS\" class=\"wp-image-961\" srcset=\"https:\/\/www.bsetec.com\/blog\/wp-content\/uploads\/2017\/06\/HDFS.png 554w, https:\/\/www.bsetec.com\/blog\/wp-content\/uploads\/2017\/06\/HDFS-300x168.png 300w\" sizes=\"(max-width: 554px) 100vw, 554px\" \/><\/a><\/figure><\/div>\n\n\n<ul class=\"wp-block-list\" style=\"font-size:16px\">\n<li>HDFS runs on a cluster, which means you don&#8217;t have to think about filling this or that server anymore.<\/li>\n\n\n\n<li>HDFS scales horizontally.<\/li>\n\n\n\n<li>HDFS works great with very big files.<\/li>\n\n\n\n<li>HDFS splits big files into chunks, so storing a 10+TB database is easy.<\/li>\n\n\n\n<li>HDFS is object storage, so you can easily run mysqldump | xbstream -c | hdfs\u200a\u2014\u200ato store large MySQL databases.<\/li>\n\n\n\n<li>Because you&#8217;re running on a bunch of servers at the same time, you solve the I\/O problems.<\/li>\n\n\n\n<li>HDFS manages replication. No more backups lost because a single server crashed.<\/li>\n\n\n\n<li>HDFS is perfect for JBOD. 
No more RAID, which costs money and I\/O.<\/li>\n\n\n\n<li>You can use small machines with just a bunch of 4 to 6 TB spinning disks and let the magic happen.<\/li>\n<\/ul>\n\n\n\n<h2 class=\"wp-block-heading\" style=\"font-size:16px\"><i>Once again there are a few cons:<\/i><\/h2>\n\n\n\n<ul class=\"wp-block-list\" style=\"font-size:16px\">\n<li>HDFS is not so good at managing a gazillion small files.<\/li>\n\n\n\n<li>Unlike ZFS \/ rsnapshot, HDFS does not handle file deduplication natively (but space is cheap).<\/li>\n\n\n\n<li>Complexity: you need a full HDFS cluster with name nodes, journal nodes, etc.<\/li>\n\n\n\n<li>The HDFS client requires the whole Java stack, which you don&#8217;t want to install everywhere.<\/li>\n<\/ul>\n\n\n\n<h4 class=\"wp-block-heading\"><b><i>Implementation<\/i><\/b><\/h4>\n\n\n\n<p style=\"font-size:16px\">We started to work on a quick and dirty POC to provide an HDFS-backed backup system.<\/p>\n\n\n\n<ul class=\"wp-block-list\" style=\"font-size:16px\">\n<li>It uses a lightweight HDFS client written in Go.<\/li>\n\n\n\n<li>It manages backup rotation with variable retention (hourly \/ daily \/ weekly \/ monthly).<\/li>\n\n\n\n<li>It runs parallel backups.<\/li>\n<\/ul>\n\n\n\n<p style=\"font-size:16px\">We started to test it on a small HDFS cluster:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>2 small $20\/month servers.<\/li>\n\n\n\n<li>4 * 4 TB JBOD spinning disks.<\/li>\n<\/ul>\n\n\n\n<p style=\"font-size:16px\">For directories full of small files like \/etc\/, the throughput is about 30% lower than with a simple rsync.<\/p>\n\n\n\n<p style=\"font-size:16px\">For large files, the throughput is 20% higher than with rsync because we&#8217;re limited by the network.<\/p>\n\n\n\n<p style=\"font-size:16px\">The good point: restoring a file is not about looking for a needle in a haystack anymore. All my prerequisites are satisfied.<\/p>\n\n\n\n<p style=\"font-size:16px\">The bad point: complexity. 
Building even a small HDFS cluster is overkill for a home backup. But for professional use, it works like a charm.<\/p>\n\n\n\n<p style=\"font-size:16px\">HDFS stands for Hadoop Distributed File System. It is a file system designed to handle the very large data sets that present-day technology produces in immense volumes. Individual files stored in HDFS can range from gigabytes to terabytes, and sometimes even larger.<\/p>\n\n\n\n<h4 class=\"wp-block-heading\"><b>NameNodes:<\/b><\/h4>\n\n\n\n<p style=\"font-size:16px\">The NameNode holds the file system metadata: it maps each file to its blocks and records which of the DataNodes under it hold each block.<\/p>\n\n\n\n<h4 class=\"wp-block-heading\"><b>DataNodes:<\/b><\/h4>\n\n\n\n<p style=\"font-size:16px\">DataNodes are where the file data is actually stored. Numerous DataNodes are linked to one NameNode.<\/p>\n\n\n\n<p style=\"font-size:16px\">Each DataNode sends a periodic status message to the NameNode, every 3 seconds by default (the interval is configurable). This message is called a Heartbeat. A DataNode whose Heartbeat has reached the NameNode is known to be alive and active. 
<\/p>\n\n\n\n<p style=\"font-size:16px\">When a DataNode misses a Heartbeat, the NameNode notices the inactivity. If the inactivity continues for about 10 minutes, the DataNode is declared dead: from then on no I\/O is sent to that node, the blocks it held are re-replicated to other DataNodes from the surviving copies, and the NameNode\u2019s mappings are updated to reflect the change.<\/p>\n\n\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter\"><a href=\"https:\/\/www.bsetec.com\/blog\/wp-content\/uploads\/2017\/06\/replicated.png\"><img decoding=\"async\" width=\"512\" height=\"247\" src=\"https:\/\/www.bsetec.com\/blog\/wp-content\/uploads\/2017\/06\/replicated.png\" alt=\"replicated\" class=\"wp-image-963\" srcset=\"https:\/\/www.bsetec.com\/blog\/wp-content\/uploads\/2017\/06\/replicated.png 512w, https:\/\/www.bsetec.com\/blog\/wp-content\/uploads\/2017\/06\/replicated-300x145.png 300w, https:\/\/www.bsetec.com\/blog\/wp-content\/uploads\/2017\/06\/replicated-510x247.png 510w\" sizes=\"(max-width: 512px) 100vw, 512px\" \/><\/a><\/figure><\/div>\n\n\n<h2 class=\"wp-block-heading\"><b>Racks and Replications:<\/b><\/h2>\n\n\n\n<p style=\"font-size:16px\">Files are stored as a series of blocks, and all blocks of a file are the same size except for the last one. The DataNodes that hold these blocks are grouped into <i>Racks<\/i>.<\/p>\n\n\n\n<p style=\"font-size:16px\">Block replicas are distributed across the racks, and the NameNode makes these placement decisions.<\/p>\n\n\n\n<p style=\"font-size:16px\">Replicas of a file are placed on different racks to guard against data loss if an entire rack fails. 
Writing replicas to different racks costs more in write traffic, but after a rack failure that cost is far preferable to losing all of the data.<\/p>\n\n\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter\"><a href=\"https:\/\/www.bsetec.com\/blog\/wp-content\/uploads\/2017\/06\/1-N0X_nXWbjVHtoCnFVhpqNw.jpeg.jpg\"><img decoding=\"async\" width=\"380\" height=\"304\" src=\"https:\/\/www.bsetec.com\/blog\/wp-content\/uploads\/2017\/06\/1-N0X_nXWbjVHtoCnFVhpqNw.jpeg.jpg\" alt=\"1-N0X_nXWbjVHtoCnFVhpqNw.jpeg\" class=\"wp-image-964\" srcset=\"https:\/\/www.bsetec.com\/blog\/wp-content\/uploads\/2017\/06\/1-N0X_nXWbjVHtoCnFVhpqNw.jpeg.jpg 380w, https:\/\/www.bsetec.com\/blog\/wp-content\/uploads\/2017\/06\/1-N0X_nXWbjVHtoCnFVhpqNw.jpeg-300x240.jpg 300w\" sizes=\"(max-width: 380px) 100vw, 380px\" \/><\/a><\/figure><\/div>\n\n\n<h2 class=\"wp-block-heading\"><b>EditLog:<\/b><\/h2>\n\n\n\n<p style=\"font-size:16px\">New files and directories are constantly being created in the file system. Each such change to the metadata is recorded in a log called the EditLog and later merged into the metadata stored by the NameNode.<\/p>\n\n\n\n<p style=\"font-size:16px\">HDFS is a mature, widely used distributed file system that keeps evolving as the surrounding technologies advance. It answers a major need of the Big Data generation that\u2019s shaping up in front of our very eyes.<\/p>\n\n\n\n<p class=\"has-background\" style=\"background:linear-gradient(135deg,rgb(255,245,203) 0%,rgb(182,227,212) 100%,rgb(51,167,181) 100%);font-size:16px\">Did you find this article useful? 
Let us know by leaving a comment below, or join us on <strong><a href=\"https:\/\/twitter.com\/BSEtech\" target=\"_blank\" rel=\"noreferrer noopener\">Twitter<\/a><\/strong> and <strong><a href=\"https:\/\/www.facebook.com\/bsetec\" target=\"_blank\" rel=\"noreferrer noopener\">Facebook<\/a><\/strong>.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Once again there are a few cons: Implementation We started to work on a quick and dirty POC to provide a HDFS backed backup system. We started to test it on a small HDFS cluster: For directories full of small files like \/etc\/, the throughput is about 30% slower than a simple rsync. For large [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":961,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[1],"tags":[],"class_list":["post-960","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-uncategorized"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v25.4 - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>Hadoop Distributed File System (HDFS) | BSEtec<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/www.bsetec.com\/blog\/hadoop-distributed-file-system-hdfs\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Hadoop Distributed File System (HDFS) | BSEtec\" \/>\n<meta property=\"og:description\" content=\"Once again there are a few cons: Implementation We started to work on a quick and dirty POC to provide a HDFS backed backup system. We started to test it on a small HDFS cluster: For directories full of small files like \/etc\/, the throughput is about 30% slower than a simple rsync. 
For large [&hellip;]\" \/>\n<meta property=\"og:url\" content=\"https:\/\/www.bsetec.com\/blog\/hadoop-distributed-file-system-hdfs\/\" \/>\n<meta property=\"og:site_name\" content=\"BSEtec\" \/>\n<meta property=\"article:publisher\" content=\"https:\/\/www.facebook.com\/bsetec\/\" \/>\n<meta property=\"article:published_time\" content=\"2017-06-07T12:02:39+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2025-02-18T10:28:28+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/www.bsetec.com\/blog\/wp-content\/uploads\/2017\/06\/HDFS.png\" \/>\n\t<meta property=\"og:image:width\" content=\"554\" \/>\n\t<meta property=\"og:image:height\" content=\"310\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/png\" \/>\n<meta name=\"author\" content=\"BSEtec\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:creator\" content=\"@BSEtech\" \/>\n<meta name=\"twitter:site\" content=\"@BSEtech\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"BSEtec\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. 
reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"4 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"Article\",\"@id\":\"https:\/\/www.bsetec.com\/blog\/hadoop-distributed-file-system-hdfs\/#article\",\"isPartOf\":{\"@id\":\"https:\/\/www.bsetec.com\/blog\/hadoop-distributed-file-system-hdfs\/\"},\"author\":{\"name\":\"BSEtec\",\"@id\":\"https:\/\/www.bsetec.com\/blog\/#\/schema\/person\/24a8ed4eefa5e9bf112e896653ca21c4\"},\"headline\":\"Hadoop Distributed File System (HDFS)\",\"datePublished\":\"2017-06-07T12:02:39+00:00\",\"dateModified\":\"2025-02-18T10:28:28+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\/\/www.bsetec.com\/blog\/hadoop-distributed-file-system-hdfs\/\"},\"wordCount\":738,\"commentCount\":0,\"publisher\":{\"@id\":\"https:\/\/www.bsetec.com\/blog\/#organization\"},\"image\":{\"@id\":\"https:\/\/www.bsetec.com\/blog\/hadoop-distributed-file-system-hdfs\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/www.bsetec.com\/blog\/wp-content\/uploads\/2017\/06\/HDFS.png\",\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"CommentAction\",\"name\":\"Comment\",\"target\":[\"https:\/\/www.bsetec.com\/blog\/hadoop-distributed-file-system-hdfs\/#respond\"]}]},{\"@type\":\"WebPage\",\"@id\":\"https:\/\/www.bsetec.com\/blog\/hadoop-distributed-file-system-hdfs\/\",\"url\":\"https:\/\/www.bsetec.com\/blog\/hadoop-distributed-file-system-hdfs\/\",\"name\":\"Hadoop Distributed File System (HDFS) | 
BSEtec\",\"isPartOf\":{\"@id\":\"https:\/\/www.bsetec.com\/blog\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\/\/www.bsetec.com\/blog\/hadoop-distributed-file-system-hdfs\/#primaryimage\"},\"image\":{\"@id\":\"https:\/\/www.bsetec.com\/blog\/hadoop-distributed-file-system-hdfs\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/www.bsetec.com\/blog\/wp-content\/uploads\/2017\/06\/HDFS.png\",\"datePublished\":\"2017-06-07T12:02:39+00:00\",\"dateModified\":\"2025-02-18T10:28:28+00:00\",\"breadcrumb\":{\"@id\":\"https:\/\/www.bsetec.com\/blog\/hadoop-distributed-file-system-hdfs\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/www.bsetec.com\/blog\/hadoop-distributed-file-system-hdfs\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/www.bsetec.com\/blog\/hadoop-distributed-file-system-hdfs\/#primaryimage\",\"url\":\"https:\/\/www.bsetec.com\/blog\/wp-content\/uploads\/2017\/06\/HDFS.png\",\"contentUrl\":\"https:\/\/www.bsetec.com\/blog\/wp-content\/uploads\/2017\/06\/HDFS.png\",\"width\":554,\"height\":310},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/www.bsetec.com\/blog\/hadoop-distributed-file-system-hdfs\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/www.bsetec.com\/blog\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Hadoop Distributed File System (HDFS)\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/www.bsetec.com\/blog\/#website\",\"url\":\"https:\/\/www.bsetec.com\/blog\/\",\"name\":\"BSEtec\",\"description\":\"Exploring the World of Tech, One Byte at a 
Time\",\"publisher\":{\"@id\":\"https:\/\/www.bsetec.com\/blog\/#organization\"},\"alternateName\":\"BSEtec\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/www.bsetec.com\/blog\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-US\"},{\"@type\":\"Organization\",\"@id\":\"https:\/\/www.bsetec.com\/blog\/#organization\",\"name\":\"BSEtec\",\"url\":\"https:\/\/www.bsetec.com\/blog\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/www.bsetec.com\/blog\/#\/schema\/logo\/image\/\",\"url\":\"https:\/\/www.bsetec.com\/blog\/wp-content\/uploads\/2023\/01\/fav.ico\",\"contentUrl\":\"https:\/\/www.bsetec.com\/blog\/wp-content\/uploads\/2023\/01\/fav.ico\",\"width\":1,\"height\":1,\"caption\":\"BSEtec\"},\"image\":{\"@id\":\"https:\/\/www.bsetec.com\/blog\/#\/schema\/logo\/image\/\"},\"sameAs\":[\"https:\/\/www.facebook.com\/bsetec\/\",\"https:\/\/x.com\/BSEtech\"]},{\"@type\":\"Person\",\"@id\":\"https:\/\/www.bsetec.com\/blog\/#\/schema\/person\/24a8ed4eefa5e9bf112e896653ca21c4\",\"name\":\"BSEtec\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/www.bsetec.com\/blog\/#\/schema\/person\/image\/\",\"url\":\"https:\/\/secure.gravatar.com\/avatar\/20fcfaf426a285886f813fd3e9e0ad48f22440b11201e9a669807c088bfdac8e?s=96&d=mm&r=g\",\"contentUrl\":\"https:\/\/secure.gravatar.com\/avatar\/20fcfaf426a285886f813fd3e9e0ad48f22440b11201e9a669807c088bfdac8e?s=96&d=mm&r=g\",\"caption\":\"BSEtec\"}}]}<\/script>\n<!-- \/ Yoast SEO plugin. 
-->","yoast_head_json":{"title":"Hadoop Distributed File System (HDFS) | BSEtec","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/www.bsetec.com\/blog\/hadoop-distributed-file-system-hdfs\/","og_locale":"en_US","og_type":"article","og_title":"Hadoop Distributed File System (HDFS) | BSEtec","og_description":"Once again there are a few cons: Implementation We started to work on a quick and dirty POC to provide a HDFS backed backup system. We started to test it on a small HDFS cluster: For directories full of small files like \/etc\/, the throughput is about 30% slower than a simple rsync. For large [&hellip;]","og_url":"https:\/\/www.bsetec.com\/blog\/hadoop-distributed-file-system-hdfs\/","og_site_name":"BSEtec","article_publisher":"https:\/\/www.facebook.com\/bsetec\/","article_published_time":"2017-06-07T12:02:39+00:00","article_modified_time":"2025-02-18T10:28:28+00:00","og_image":[{"width":554,"height":310,"url":"https:\/\/www.bsetec.com\/blog\/wp-content\/uploads\/2017\/06\/HDFS.png","type":"image\/png"}],"author":"BSEtec","twitter_card":"summary_large_image","twitter_creator":"@BSEtech","twitter_site":"@BSEtech","twitter_misc":{"Written by":"BSEtec","Est. 
reading time":"4 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/www.bsetec.com\/blog\/hadoop-distributed-file-system-hdfs\/#article","isPartOf":{"@id":"https:\/\/www.bsetec.com\/blog\/hadoop-distributed-file-system-hdfs\/"},"author":{"name":"BSEtec","@id":"https:\/\/www.bsetec.com\/blog\/#\/schema\/person\/24a8ed4eefa5e9bf112e896653ca21c4"},"headline":"Hadoop Distributed File System (HDFS)","datePublished":"2017-06-07T12:02:39+00:00","dateModified":"2025-02-18T10:28:28+00:00","mainEntityOfPage":{"@id":"https:\/\/www.bsetec.com\/blog\/hadoop-distributed-file-system-hdfs\/"},"wordCount":738,"commentCount":0,"publisher":{"@id":"https:\/\/www.bsetec.com\/blog\/#organization"},"image":{"@id":"https:\/\/www.bsetec.com\/blog\/hadoop-distributed-file-system-hdfs\/#primaryimage"},"thumbnailUrl":"https:\/\/www.bsetec.com\/blog\/wp-content\/uploads\/2017\/06\/HDFS.png","inLanguage":"en-US","potentialAction":[{"@type":"CommentAction","name":"Comment","target":["https:\/\/www.bsetec.com\/blog\/hadoop-distributed-file-system-hdfs\/#respond"]}]},{"@type":"WebPage","@id":"https:\/\/www.bsetec.com\/blog\/hadoop-distributed-file-system-hdfs\/","url":"https:\/\/www.bsetec.com\/blog\/hadoop-distributed-file-system-hdfs\/","name":"Hadoop Distributed File System (HDFS) | 
BSEtec","isPartOf":{"@id":"https:\/\/www.bsetec.com\/blog\/#website"},"primaryImageOfPage":{"@id":"https:\/\/www.bsetec.com\/blog\/hadoop-distributed-file-system-hdfs\/#primaryimage"},"image":{"@id":"https:\/\/www.bsetec.com\/blog\/hadoop-distributed-file-system-hdfs\/#primaryimage"},"thumbnailUrl":"https:\/\/www.bsetec.com\/blog\/wp-content\/uploads\/2017\/06\/HDFS.png","datePublished":"2017-06-07T12:02:39+00:00","dateModified":"2025-02-18T10:28:28+00:00","breadcrumb":{"@id":"https:\/\/www.bsetec.com\/blog\/hadoop-distributed-file-system-hdfs\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/www.bsetec.com\/blog\/hadoop-distributed-file-system-hdfs\/"]}]},{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/www.bsetec.com\/blog\/hadoop-distributed-file-system-hdfs\/#primaryimage","url":"https:\/\/www.bsetec.com\/blog\/wp-content\/uploads\/2017\/06\/HDFS.png","contentUrl":"https:\/\/www.bsetec.com\/blog\/wp-content\/uploads\/2017\/06\/HDFS.png","width":554,"height":310},{"@type":"BreadcrumbList","@id":"https:\/\/www.bsetec.com\/blog\/hadoop-distributed-file-system-hdfs\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/www.bsetec.com\/blog\/"},{"@type":"ListItem","position":2,"name":"Hadoop Distributed File System (HDFS)"}]},{"@type":"WebSite","@id":"https:\/\/www.bsetec.com\/blog\/#website","url":"https:\/\/www.bsetec.com\/blog\/","name":"BSEtec","description":"Exploring the World of Tech, One Byte at a 
Time","publisher":{"@id":"https:\/\/www.bsetec.com\/blog\/#organization"},"alternateName":"BSEtec","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/www.bsetec.com\/blog\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"},{"@type":"Organization","@id":"https:\/\/www.bsetec.com\/blog\/#organization","name":"BSEtec","url":"https:\/\/www.bsetec.com\/blog\/","logo":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/www.bsetec.com\/blog\/#\/schema\/logo\/image\/","url":"https:\/\/www.bsetec.com\/blog\/wp-content\/uploads\/2023\/01\/fav.ico","contentUrl":"https:\/\/www.bsetec.com\/blog\/wp-content\/uploads\/2023\/01\/fav.ico","width":1,"height":1,"caption":"BSEtec"},"image":{"@id":"https:\/\/www.bsetec.com\/blog\/#\/schema\/logo\/image\/"},"sameAs":["https:\/\/www.facebook.com\/bsetec\/","https:\/\/x.com\/BSEtech"]},{"@type":"Person","@id":"https:\/\/www.bsetec.com\/blog\/#\/schema\/person\/24a8ed4eefa5e9bf112e896653ca21c4","name":"BSEtec","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/www.bsetec.com\/blog\/#\/schema\/person\/image\/","url":"https:\/\/secure.gravatar.com\/avatar\/20fcfaf426a285886f813fd3e9e0ad48f22440b11201e9a669807c088bfdac8e?s=96&d=mm&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/20fcfaf426a285886f813fd3e9e0ad48f22440b11201e9a669807c088bfdac8e?s=96&d=mm&r=g","caption":"BSEtec"}}]}},"blog_post_layout_featured_media_urls":{"thumbnail":["https:\/\/www.bsetec.com\/blog\/wp-content\/uploads\/2017\/06\/HDFS-150x150.png",150,150,true],"full":["https:\/\/www.bsetec.com\/blog\/wp-content\/uploads\/2017\/06\/HDFS.png",554,310,false]},"categories_names":{"1":{"name":"Uncategorized","link":"https:\/\/www.bsetec.com\/blog\/category\/uncategorized\/"}},"tags_names":[],"comments_number":"0","wpmagazine_modules_lite_featured_media_urls":{"thumbnail":["https:\/\/www.bsetec.c
om\/blog\/wp-content\/uploads\/2017\/06\/HDFS-150x150.png",150,150,true],"cvmm-medium":["https:\/\/www.bsetec.com\/blog\/wp-content\/uploads\/2017\/06\/HDFS.png",300,168,false],"cvmm-medium-plus":["https:\/\/www.bsetec.com\/blog\/wp-content\/uploads\/2017\/06\/HDFS.png",305,171,false],"cvmm-portrait":["https:\/\/www.bsetec.com\/blog\/wp-content\/uploads\/2017\/06\/HDFS.png",400,224,false],"cvmm-medium-square":["https:\/\/www.bsetec.com\/blog\/wp-content\/uploads\/2017\/06\/HDFS.png",554,310,false],"cvmm-large":["https:\/\/www.bsetec.com\/blog\/wp-content\/uploads\/2017\/06\/HDFS.png",554,310,false],"cvmm-small":["https:\/\/www.bsetec.com\/blog\/wp-content\/uploads\/2017\/06\/HDFS.png",130,73,false],"full":["https:\/\/www.bsetec.com\/blog\/wp-content\/uploads\/2017\/06\/HDFS.png",554,310,false]},"_links":{"self":[{"href":"https:\/\/www.bsetec.com\/blog\/wp-json\/wp\/v2\/posts\/960","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.bsetec.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.bsetec.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.bsetec.com\/blog\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.bsetec.com\/blog\/wp-json\/wp\/v2\/comments?post=960"}],"version-history":[{"count":3,"href":"https:\/\/www.bsetec.com\/blog\/wp-json\/wp\/v2\/posts\/960\/revisions"}],"predecessor-version":[{"id":9297,"href":"https:\/\/www.bsetec.com\/blog\/wp-json\/wp\/v2\/posts\/960\/revisions\/9297"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.bsetec.com\/blog\/wp-json\/wp\/v2\/media\/961"}],"wp:attachment":[{"href":"https:\/\/www.bsetec.com\/blog\/wp-json\/wp\/v2\/media?parent=960"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.bsetec.com\/blog\/wp-json\/wp\/v2\/categories?post=960"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.bsetec.com\/blog\/wp-json\/wp\/v2\/tags?post=960"}],"curies":[{"name":
"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}