What is the difference between Hadoop HDFS and HBase?

Asked by AyushiKhatri in Python on Apr 24, 2021

I am new to Hadoop as well as NoSQL, and I am confused about HDFS and HBase. Can anyone help me figure out the differences between HBase and HDFS?

Here are the main differences between HBase and HDFS:

HBase provides low-latency access to small amounts of data within large data sets, while HDFS is optimized for high-throughput access to large files and has higher latency for individual operations.
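To make the latency point concrete, here is a toy Python sketch (it is not HBase or HDFS code). It assumes a hypothetical data set of 100,000 records and counts how many records each access pattern has to touch to find one value: a sequential scan, which is the pattern HDFS is built for, versus a lookup over keys kept in sorted order, which is roughly how HBase locates a row by its row key. The names `scan_lookup` and `indexed_lookup` are made up for illustration.

```python
# Hypothetical data set: 100,000 (row_key, value) records, sorted by key.
records = [(f"user{i:06d}", i * 2) for i in range(100_000)]
keys = [key for key, _ in records]


def scan_lookup(target):
    """Read records one by one until the key is found (file-scan style)."""
    touched = 0
    for key, value in records:
        touched += 1
        if key == target:
            return value, touched
    return None, touched


def indexed_lookup(target):
    """Binary-search the sorted keys (keyed-store style)."""
    lo, hi, touched = 0, len(keys), 0
    while lo < hi:
        mid = (lo + hi) // 2
        touched += 1  # one key comparison per step
        if keys[mid] < target:
            lo = mid + 1
        else:
            hi = mid
    if lo < len(keys) and keys[lo] == target:
        return records[lo][1], touched
    return None, touched
```

Looking up the last key with `scan_lookup` touches every record, while `indexed_lookup` needs only a handful of comparisons. Real systems add disk and network costs on top of this, so treat the counts as an illustration of the access pattern only.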

HBase supports random reads and writes, while HDFS follows a WORM model (write once, read many): files are written sequentially and are not updated in place.
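The difference in write behaviour can be sketched with two toy Python classes. Both classes, `WriteOnceFile` and `KeyedTable`, are hypothetical stand-ins invented for this example; they only mimic the semantics described above and do not use any Hadoop or HBase API.

```python
class WriteOnceFile:
    """Toy stand-in for an HDFS-style file: written sequentially, then read many times."""

    def __init__(self):
        self._lines = []
        self._closed = False

    def append(self, line):
        if self._closed:
            raise ValueError("file is closed; its contents cannot be changed")
        self._lines.append(line)

    def close(self):
        self._closed = True

    def read_all(self):
        # Reads are sequential: the whole file is streamed from the start.
        return list(self._lines)


class KeyedTable:
    """Toy stand-in for an HBase-style table: any row can be read or written by key."""

    def __init__(self):
        self._rows = {}

    def put(self, row_key, value):
        self._rows[row_key] = value  # overwriting an existing row is allowed

    def get(self, row_key):
        return self._rows.get(row_key)


log = WriteOnceFile()
log.append("user1,alice")
log.append("user2,bob")
log.close()  # from here on the file can only be read

table = KeyedTable()
table.put("user1", "alice")
table.put("user2", "bob")
table.put("user1", "alicia")  # random write: change a single row by key
```

To change one record in the write-once file you would have to write a new file, whereas the table lets you update or fetch a single row directly.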

HDFS is primarily accessed through batch jobs such as MapReduce (it can also be used directly through its file system shell and API), while HBase is accessed through shell commands, the Java API, or the REST, Avro or Thrift APIs.

HDFS is a Java-based distributed file system that allows you to store large data sets across multiple nodes in a Hadoop cluster, whereas HBase is a NoSQL database that runs on top of HDFS. The relationship is similar to that between NTFS (a file system) and MySQL (a database).

HDFS stores large data sets in a distributed environment and is suited to batch processing of that data. HBase stores data in a column-oriented manner: columns are grouped into column families, and the data of each family is stored together, so reading related columns is faster and real-time access becomes practical.
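As a rough illustration of the column-oriented layout, here is a toy Python model of a table whose data is grouped by column family. The class `ColumnFamilyTable`, the family names `info` and `activity`, and the sample values are all assumptions made up for this sketch; it is not the HBase client API.

```python
from collections import defaultdict


class ColumnFamilyTable:
    """Toy table that keeps one separate store per column family."""

    def __init__(self, families):
        # family name -> {row_key -> {qualifier -> value}}
        self._stores = {family: defaultdict(dict) for family in families}

    def put(self, row_key, family, qualifier, value):
        self._stores[family][row_key][qualifier] = value

    def get(self, row_key, family):
        # Only the requested family's store is consulted; other families
        # are never touched by this read.
        return dict(self._stores[family].get(row_key, {}))


users = ColumnFamilyTable(["info", "activity"])
users.put("user1", "info", "name", "alice")
users.put("user1", "info", "city", "Pune")
users.put("user1", "activity", "last_login", "2021-04-24")
```

Calling `users.get("user1", "info")` returns only the `info` columns for that row. Keeping each family in its own store is what lets a read skip data it does not need.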


