Hive is a prominent open source data warehouse built on Hadoop’s Distributed File System (HDFS). It is used for – data storage, data summarization, data query and analysis on large data systems. Hive is easy to use for novice, as you need to simply write SQL like query. Hive converts SQL queries into Mapreduce / Tez / Spark job depending on admin configured settings.