Hive WordCount hiveQL Execution
Apache Hive is a data warehouse infrastructure built on top of Hadoop for providing data summarization, query, and analysis. Hive gives an SQL-like interface to query data stored in various databases and file systems that integrate with Hadoop. The traditional SQL queries must be implemented in the MapReduce Java API to execute SQL applications and queries over a distributed data. Hive provides the necessary SQL abstraction to integrate SQL-like Queries (HiveQL) into the underlying Java API without the need to implement queries in the low-level Java API. Since most of the data warehousing application work with SQL based querying language, Hive supports easy portability of SQL-based application to Hadoop.
1) A machine with Ubuntu 14.04 LTS operating system
2) Apache Hadoop 2.6.4 pre installed (How to install Hadoop on Ubuntu 14.04)
3) Apache Hive 2.1.0 pre installed (How to Install Hive on Ubuntu 14.04)
Hive WordCount hiveQL Example
Step 1 - Change the directory to /usr/local/hadoop/sbin
Step 2 - Start all hadoop daemons.
Step 3 - Create employee.txt file.
Step 4 - Add these following lines to employee.txt file. Save and close.
Step 5 - Copy employee.txt from local file system into HDFS.
Step 6 - Change the directory to /usr/local/hive/bin
Step 7 - Create wordcount hive query file. The file should have .hql extension.
Step 8 - Add thses following lines to wordcount.hql Save and close.
Step 9 - Execute wordcount.hql hiveQL
Step 10 - Execute select hiveQL
Set these Hive Execution Parameters in hive-site.xml
Please share this blog post and follow me for latest updates on
Labels : Hive Installation With Derby Database Metastore Hive Installation With MySQL Database Metastore Beeline Client Usage hiveserver2 and Web UI usage Hive Metastore Configuration Hive Command Line Interface Hive Shell Commands usage Hive Distributed Cache HDFS and Linux Commands in hive shell Customizing hive logs Database Commnds Usage Table Commands Usage Hive Partitioning Configuration Hive Bucketing Configuration UDFs Java Example UDAFs Java Example UDTF Java Example Hive JDBC client Java Example Hive Web Interface (HWI) HiveQL Examples