Hive Interview Questions and Answers
Under the guidance of the expert team at Janbask, we have listed some important Hive interview questions with their answers. The questions are framed so that candidates can understand the concepts of Hive as well as the pattern of questions asked.
Hive interview questions for freshers
Hive interview questions for experienced candidates
Hive interview questions and answers (Freshers)
Hive is an open-source software tool used for ETL and data warehousing, built on top of the Hadoop Distributed File System (HDFS). It is mainly used for building data warehouse applications and for working with static data rather than dynamic data.
Q1). Explain what is Hive?
Hive is an open-source software tool used for ETL and data warehousing, built on top of the Hadoop Distributed File System (HDFS). It is a data warehouse framework used for querying and analyzing data stored in HDFS.
Q2). When is Hive used?
- When building data warehouse applications
- When dealing with static data rather than dynamic data
- For applications that can tolerate high latency
- When maintaining a large data set
- When queries are preferred over scripting
Q3). What are the different modes of Hive?
Hive can operate in two modes:
- Local mode
- MapReduce mode
The mode Hive operates in depends on the number of data nodes in Hadoop and the size of the data.
Q4). When is MapReduce mode used?
MapReduce mode is used when:
- Large data sets have to be processed and queries are executed in parallel.
- Hadoop has multiple data nodes and the data is distributed across them.
- Better performance on large data sets is the goal.
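As a rough illustration, the mode can be chosen per session with configuration properties. This is a minimal sketch, assuming a Hadoop 2 style setup; the table name page_views is hypothetical.

```sql
-- Let Hive decide: small jobs run locally, larger ones run on the cluster.
SET hive.exec.mode.local.auto=true;

-- Force local execution for this session (useful for small test data).
SET mapreduce.framework.name=local;

-- Switch back to cluster execution on YARN for large data sets.
SET mapreduce.framework.name=yarn;

-- Hypothetical table, used only for illustration.
SELECT COUNT(*) FROM page_views;
```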
Q5). What are the key components of Hive Architecture?
Key components of Hive Architecture include,
- User Interface
- Compiler
- Metastore
- Driver
- Execution Engine
Q6). Mention what are the different types of tables available in Hive?
There are mainly two types of tables available in Hive.
- Managed table: both the data and schema are controlled by Hive
- External table: only the schema is controlled by Hive.
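The difference shows up most clearly when a table is dropped. The following is a minimal sketch; the table names, columns, and the HDFS path are hypothetical.

```sql
-- Managed table: Hive controls both the schema and the data files.
CREATE TABLE employees_managed (
  id     INT,
  name   STRING,
  salary DOUBLE
)
ROW FORMAT DELIMITED FIELDS TERMINATED BY ',';

-- External table: Hive controls only the schema; the files stay in place.
-- The LOCATION below is a hypothetical HDFS directory.
CREATE EXTERNAL TABLE employees_external (
  id     INT,
  name   STRING,
  salary DOUBLE
)
ROW FORMAT DELIMITED FIELDS TERMINATED BY ','
LOCATION '/data/employees';

-- Dropping the managed table removes its data as well as its metadata.
DROP TABLE employees_managed;

-- Dropping the external table removes only the metadata; the files remain.
DROP TABLE employees_external;
```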
Q7). What is Hive composed of?
Hive is composed of,
- Clients
- Services
- Storage and Computing
Q8). Why is Hive not suitable for OLTP systems?
Hive is designed for batch-oriented analytical queries and, in its classic form, does not provide insert and update operations at the row level. For this reason, Hive is not suitable for OLTP systems.
Q9). Mention what is the Object Inspector functionality in Hive?
In Hive, the Object Inspector is used to analyze the internal structure of columns, rows, and complex objects. It also provides access to the internal fields present inside those objects.
Q10). What is (HS2) Hive Server2?
Hive Server2 is a server interface. The following functions are performed by HiveServer2.
- Allows remote clients to execute queries against Hive.
- Retrieves the results of those queries.
Advanced features:
- Multi-client concurrency
- Authentication
Hive interview questions and answers (Experienced)
Q11). What is the role of the Hive query processor?
The Hive query processor converts a query into a graph of MapReduce jobs and, together with the execution-time framework, ensures that the jobs are executed in the order of their dependencies.
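One way to look at the plan the query processor produces is the EXPLAIN statement, which lists the stages of a query and the dependencies between them. The table and column names below are hypothetical.

```sql
-- Hypothetical table: employees(id, name, department, salary).
-- EXPLAIN prints the stage dependencies and the plan of each stage
-- instead of running the query.
EXPLAIN
SELECT department, COUNT(*) AS headcount
FROM employees
GROUP BY department;
```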
Q12). List the components of a Hive query processor?
The components of a Hive query processor are,
- Logical Plan Generation
- Physical Plan Generation
- Execution Engine
- Operators
- UDFs and UDAFs
- Optimizer
- Parser
- Semantic Analyzer
- Type Checking
Q13). Mention what are Partitions in Hive?
- Partitions are a way of dividing tables into different parts based on partition keys.
- Partitioning is used when the table has one or more partition keys.
- Partition keys are the basic elements that determine how the data is stored in the table.
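The points above can be sketched with a small partitioned table. All table names, columns, and values here are hypothetical.

```sql
-- Hypothetical table partitioned by country and date.
CREATE TABLE page_views (
  user_id INT,
  url     STRING
)
PARTITIONED BY (country STRING, view_date STRING);

-- Each partition is stored in its own subdirectory, for example
-- .../page_views/country=US/view_date=2023-06-01/
-- staging_page_views is a hypothetical source table.
INSERT INTO TABLE page_views PARTITION (country = 'US', view_date = '2023-06-01')
SELECT user_id, url FROM staging_page_views;

-- Filtering on the partition keys lets Hive read only the matching directories.
SELECT COUNT(*)
FROM page_views
WHERE country = 'US' AND view_date = '2023-06-01';

-- List the partitions that currently exist.
SHOW PARTITIONS page_views;
```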
Q14). When are "Internal Table" and "External Table" chosen in Hive?
Internal table,
- When the data to be processed is available in the local file system
- When Hive is expected to manage the complete lifecycle of the data, including its deletion
External table,
- When the data to be processed is available in HDFS
- When the files are also used outside of Hive
Q15). Is it possible to give a view the same name as a Hive table?
No, a view cannot have the same name as a Hive table. The name of a view must be unique among all the other tables and views present in the same database.
Q16). Mention what are views in Hive?
Views are similar to tables in Hive. They are generated based on the requirements.
- Any result set can be saved as a view in Hive.
- Views are used in much the same way as views in SQL.
- Views are read-only: they can be queried like tables, but they cannot be the target of LOAD or INSERT statements.
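A minimal sketch of creating and querying a view follows. The base table employees and its columns are hypothetical.

```sql
-- Hypothetical base table: employees(id, name, department, salary).
-- Save a result set as a view.
CREATE VIEW high_earners AS
SELECT id, name, department
FROM employees
WHERE salary > 100000;

-- A view is queried in the same way as a table.
SELECT department, COUNT(*) AS cnt
FROM high_earners
GROUP BY department;

-- Dropping the view does not touch the underlying table.
DROP VIEW high_earners;
```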
Q17). In Hive, how can you enable buckets?
Buckets can be enabled by using the command "set hive.enforce.bucketing=true;".
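As a sketch of how the setting is typically combined with a bucketed table: the table names and the bucket count are hypothetical, and it is an assumption that your Hive release still accepts this property (newer releases are understood to enforce bucketing by default).

```sql
-- Needed on older Hive releases; assumption: newer releases enforce
-- bucketing automatically and may no longer recognise this property.
SET hive.enforce.bucketing=true;

-- Hypothetical table whose rows are hashed on user_id into 4 buckets.
CREATE TABLE users_bucketed (
  user_id INT,
  name    STRING
)
CLUSTERED BY (user_id) INTO 4 BUCKETS;

-- users_raw is a hypothetical source table.
INSERT OVERWRITE TABLE users_bucketed
SELECT user_id, name FROM users_raw;
```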
Q18). Can you override the Hadoop MapReduce configuration in Hive?
Yes, the Hadoop MapReduce configuration can be overridden in Hive.
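For example, Hadoop properties can be set from within a Hive session with the SET command. The values below are illustrative only, not recommendations.

```sql
-- Override Hadoop MapReduce settings for the current session only.
SET mapreduce.job.reduces=4;
SET mapreduce.map.memory.mb=2048;

-- SET with only a property name prints its current value.
SET mapreduce.job.reduces;
```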
Q19). Explain how can you change a column data type in Hive?
By using the command, "ALTER TABLE table_name CHANGE column_name column_name new_datatype;" one can change a column data type in Hive.
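A concrete instance of the command, assuming a hypothetical table employees(id INT, name STRING, salary INT):

```sql
-- Keep the column name and widen its type from INT to BIGINT.
ALTER TABLE employees CHANGE salary salary BIGINT;

-- The same statement can also rename a column while keeping its type.
ALTER TABLE employees CHANGE name full_name STRING;

-- Check the resulting schema.
DESCRIBE employees;
```

This changes only the table metadata, so the new type needs to be compatible with the data already stored.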
Q20). Mention what is the difference between order by and sort by in Hive?
SORT BY: Sorts the data within each reducer. Any number of reducers can be used for a SORT BY operation, so the output is ordered only within each reducer. ORDER BY: Sorts all the data together by passing it through one reducer. Thus, ORDER BY uses a single reducer in Hive.
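The difference can be sketched with two queries over a hypothetical employees table. The last query adds DISTRIBUTE BY, which is not covered above but is often paired with SORT BY.

```sql
-- ORDER BY: a total order over the whole result, produced by one reducer.
SELECT name, salary
FROM employees
ORDER BY salary DESC;

-- SORT BY: each reducer sorts only its own rows, so with several reducers
-- the result is ordered per reducer, not globally.
SET mapreduce.job.reduces=3;
SELECT name, salary
FROM employees
SORT BY salary DESC;

-- DISTRIBUTE BY sends all rows of a department to the same reducer,
-- and SORT BY then orders the rows inside each reducer.
SELECT name, department, salary
FROM employees
DISTRIBUTE BY department
SORT BY salary DESC;
```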
Q21). Explain when to use explode in Hive?
explode is used when an array is taken as input and each of its elements has to be returned as a separate table row, in order to convert complex data types into the desired table format.
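A short sketch of explode on an array column follows. The orders table and its columns are hypothetical.

```sql
-- Hypothetical table with an array column.
CREATE TABLE orders (
  order_id INT,
  items    ARRAY<STRING>
);

-- explode() on its own returns one row per array element.
SELECT explode(items) AS item
FROM orders;

-- LATERAL VIEW keeps the other columns next to each exploded element.
SELECT o.order_id, t.item
FROM orders o
LATERAL VIEW explode(o.items) t AS item;
```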
Q22). Mention how you can stop a partition from being queried?
A partition can be stopped from being queried by using the "ENABLE OFFLINE" clause with the ALTER TABLE statement.
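A sketch of the statement follows, using a hypothetical partitioned table page_views. It assumes a Hive release that still supports partition protection; this feature is understood to have been removed in later releases, so check the documentation for your version.

```sql
-- Take one partition offline so that queries against it fail.
ALTER TABLE page_views
PARTITION (country = 'US', view_date = '2023-06-01')
ENABLE OFFLINE;

-- Make the partition queryable again.
ALTER TABLE page_views
PARTITION (country = 'US', view_date = '2023-06-01')
DISABLE OFFLINE;
```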