Today's Offer - Hadoop Certification Training - Enroll at Flat 10% Off.

- Hadoop Blogs -

Hive Interview Question And Answers

Hive Interview Question And Answers

Under the guidance of the expert team of Janbask, We have listed some important Hive Interview Questions and with their answers. The questions are framed in such a way that the candidates will be able to understand the concept of Hive along with the understanding of question pattern.

Hive Interview questions for Fresher

Hive Interview questions for experienced

Hive interview questions and answers (Freshers)

The Hive is an is an open-source-software tool used in ETL and Data warehousing, developed on top of Hadoop Distributed File System (HDFS). The Hive is mainly used while making data warehouse applications and while dealing with static data instead of dynamic data

Q1). Explain what is Hive?

The Hive is an is an open-source-software tool used in ETL and Data warehousing, developed on top of Hadoop Distributed File System (HDFS). Hive is a data warehouse framework, that is used for querying and analysis of data is stored in HDFS.

Q2). When is Hive used?

  • Used while making data warehouse applications.
  • While dealing with static data instead of dynamic data
  • On high latency application.
  • When maintaining a large dataset
  • Instead of scripting when queries are used

Q3). Differentiate modes of Hive?

These modes of Hive are,

  • Local mode
  • Map reduce mode
  • The Hive can operate in the above modes depending on the size of data nodes in Hadoop.

Q4). When is Map reduce mode?

The Map reduce mode is used when,

  • The large amount of data sets and query both are executed in parallel.
  • Hive is used in this mode when there are multiple data nodes in Hadoop, and when data is distributed across different nodes.
  • When large data sets with high performance are aimed at.

Q5). What are the key components of Hive Architecture?

Key components of Hive Architecture include,

  • User Interface
  • Compiler
  • Metascore
  • Driver
  • Execute Engine

Q6). Mention what are the different types of tables available in Hive?

There are mainly two types of tables available in Hive.

  • Managed table: both the data and schema are controlled by Hive
  • External table: only the schema is controlled by Hive.

Q7). What in Hive composed of?

Hive is composed of,

Read: Big Data Hadoop Developer Career Path & Future Scope
  • Clients
  • Services
  • Storage and Computing

Q8). Why is Hive not suitable for OLTP systems?

As the insert and update function at the row level is not provided, Hive becomes is not suitable for OLTP systems because

Q9). Mention what is the Object Inspector functionality in Hive?

In Hive the analysis of the internal structure of the columns, rows, and complex objects are done using Object Inspector functionality. Object Inspector functionality creates accessibility to the internal fields present inside the objects.

Q10). What is (HS2) Hive Server2?

Hive Server2 is a server interface. The following functions are performed by HiveServer2.

  • Works against Hive by allowing remote clients to execute queries.
  • The results of queries mentioned are retrieved

Advanced features:

  • Multi-client concurrency
  • Authentication

Hive interview questions and answers (Experienced)

Q11). What is the role of the Hive query processor?

In order to execute the jobs in the order of dependencies, Hive query processor with the execution time framework, convert the graph of MapReduce jobs.

Q12). List the components of a Hive query processor?

The components of a Hive query processor are,

  • Logical Plan Generation
  • Physical Plan Generation
  • Execution Engine
  • Operators
  • UDF's and UDAF's
  • Optimizer
  • Parser
  • Semantic Analyzer
  • Type Checking

Q13). Mention what are Partitions in Hive?

  • Partitions are a way of dividing tables into different parts based on partition keys.
  • Partition is used when the table has one or more Partition keys.
  • Partition act as basic key elements that determine how the data is stored in the table.

Q14). When is "Internal Table" and "External Table" in Hive chosen?

Internal table,

  • While processing the available data in the local file system
  • When Hive is expected to manage the complete lifecycle of data including the deletion

External table,

  • When the processing data is available in HDFS
  • When the files are being used outside of Hive

Q15). Is it possible to name view same as the name of a Hive table?

No, it isn't possible to name view same as the name of a Hive table. When compared to all the other tables, The name of a view has to be unique and as views present in the same database.

Read: Hadoop HDFS Commands Cheat Sheet

Q16). Mention what are views in Hive?

Views are Similar to tables In Hive, They are generated based on the requirements.

  • Any results can be saved as set data as a view in Hive
  • Similar to views used in SQL in usage.
  • All types of DML operations can be performed on a view.

Q17). In Hive, how can you enable buckets?

By using "set.hive.enforce.bucketing=true;" command buckets can be enabled.

Q18). In Hive, can you overwrite Hadoop MapReduce configuration in Hive?

Yes, overwriting Hadoop MapReduce configuration is possible in Hive.

Q19). Explain how can you change a column data type in Hive?

By using the command, "ALTER TABLE table_name CHANGE column_name column_name new_datatype;" one can change a column data type in Hive.

Q20). Mention what is the difference between order by and sort by in Hive?

SORT BY: Sorts the data within each reducer. A lot of reducers for SORT BY operation. ORDER BY: Sorts all the data together, that passes through one reducer. Thus, ORDER BY uses a single in Hive.

Q21). Explain when to use explode in Hive?

Hive uses explode when an array is taken as input and converted into a separate table row in order to convert complex data types into desired table formats.

Q22). Mention how can you stop a partition form being queried?

Stopping a partition form being queried by using the "ENABLE OFFLINE" clause with the ALTER TABLE statement.


    Janbask Training

    JanBask Training is a leading Global Online Training Provider through Live Sessions. The Live classes provide a blended approach of hands on experience along with theoretical knowledge which is driven by certified professionals.


Trending Courses

AWS

  • AWS & Fundamentals of Linux
  • Amazon Simple Storage Service
  • Elastic Compute Cloud
  • Databases Overview & Amazon Route 53

Upcoming Class

2 days 14 Dec 2019

DevOps

  • Intro to DevOps
  • GIT and Maven
  • Jenkins & Ansible
  • Docker and Cloud Computing

Upcoming Class

3 days 15 Dec 2019

Data Science

  • Data Science Introduction
  • Hadoop and Spark Overview
  • Python & Intro to R Programming
  • Machine Learning

Upcoming Class

3 days 15 Dec 2019

Hadoop

  • Architecture, HDFS & MapReduce
  • Unix Shell & Apache Pig Installation
  • HIVE Installation & User-Defined Functions
  • SQOOP & Hbase Installation

Upcoming Class

4 days 16 Dec 2019

Salesforce

  • Salesforce Configuration Introduction
  • Security & Automation Process
  • Sales & Service Cloud
  • Apex Programming, SOQL & SOSL

Upcoming Class

-1 day 11 Dec 2019

QA

  • Introduction and Software Testing
  • Software Test Life Cycle
  • Automation Testing and API Testing
  • Selenium framework development using Testing

Upcoming Class

8 days 20 Dec 2019

Business Analyst

  • BA & Stakeholders Overview
  • BPMN, Requirement Elicitation
  • BA Tools & Design Documents
  • Enterprise Analysis, Agile & Scrum

Upcoming Class

4 days 16 Dec 2019

SQL Server

  • Introduction & Database Query
  • Programming, Indexes & System Functions
  • SSIS Package Development Procedures
  • SSRS Report Design

Upcoming Class

4 days 16 Dec 2019

Comments

Search Posts

Reset

Receive Latest Materials and Offers on Hadoop Course

Interviews