Category Archives: Big Data

Big Data


Uses of HIVE 1. The Apache Hive distributed storage. 2. Hive provides tools to enable easy data extract/transform/load (ETL) 3. It provides the structure on a variety of data formats. 4. By using Hive, we can access files stored in Hadoop Distributed File System (HDFS is used to querying and managing large datasets residing in) or in other… Read More »

What is Hive in Hadoop?

Apache Hive is a data warehouse system for Hadoop, which enables data summarization, querying, and analysis of data by using HiveQL (a query language similar to SQL). Hive can be used to interactively explore your data or to create reusable batch processing jobs. After you define the structure, you can use Hive to query that data without knowledge… Read More »