Hive Installation on Ubuntu 14.04 With Pre Built Derby Database
Apache Hive is a data warehouse infrastructure built on top of Hadoop that provides data summarization, query, and analysis. Hive gives an SQL-like interface for querying data stored in the various databases and file systems that integrate with Hadoop. Without Hive, SQL-style queries over distributed data have to be implemented by hand in the MapReduce Java API. Hive provides the SQL abstraction needed to run SQL-like queries (HiveQL) on top of that Java API, so queries do not have to be written in the low-level API. Because most data warehousing applications work with SQL-based query languages, Hive makes it easier to port SQL-based applications to Hadoop.
1) A machine with Ubuntu 14.04 LTS operating system
2) Apache Hadoop 2.6.4 pre installed (How to install Hadoop on Ubuntu 14.04)
3) Apache Hive 2.1.0 Software (Download Here)
Hive Installation With Pre Built Derby Database
Hive versions 1.2 onward require Java 1.7 or newer. Hive versions 0.14 to 1.1 work with Java 1.6 as well.
Hadoop 2.x is preferred; Hadoop 1.x is not supported by Hive 2.0.0 onward. Hive versions up to 0.13 also supported Hadoop 0.20.x and 0.23.x.
Hive Installation Steps
Step 1 - Create the Hive directory. Open a new terminal (CTRL + ALT + T) and enter the following command.
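One way to do this, assuming the install directory is /usr/local/hive (the path named in Step 2) and that your account has sudo rights:

```shell
# Assumed install location, taken from Step 2 of this guide
HIVE_DIR=/usr/local/hive
# -p avoids an error if the directory already exists
sudo mkdir -p "$HIVE_DIR"
```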
Step 2 - Change the ownership and permissions of the directory /usr/local/hive. Here 'hduser' is an Ubuntu username.
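A minimal sketch of this step. The mode 755 is an assumption (owner may write, everyone may read and traverse); replace hduser with your own username if it differs.

```shell
HIVE_DIR=/usr/local/hive
HIVE_USER=hduser   # your Ubuntu username
# Hand the directory tree to the user that will run Hive
sudo chown -R "$HIVE_USER" "$HIVE_DIR"
# Assumed permissions: rwx for the owner, r-x for group and others
sudo chmod -R 755 "$HIVE_DIR"
```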
Step 3 - Switch user. The su command lets a user execute commands with the privileges of another user account; use it here to switch to hduser.
Step 4 - Change the directory to /home/hduser/Desktop. In my case the downloaded apache-hive-2.1.0-bin.tar.gz file is in the /home/hduser/Desktop folder. On your machine it might be in a different folder, such as your downloads folder, so check where it was saved.
Step 5 - Untar the apache-hive-2.1.0-bin.tar.gz file.
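A sketch of the extraction, assuming the archive sits in the Desktop folder from Step 4:

```shell
cd /home/hduser/Desktop                # download location from Step 4; adjust if yours differs
ARCHIVE=apache-hive-2.1.0-bin.tar.gz
# x = extract, z = decompress gzip, f = read from the named file
tar -xzf "$ARCHIVE"
```

This should leave a folder named apache-hive-2.1.0-bin next to the archive.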
Step 6 - Move the contents of apache-hive-2.1.0-bin folder to /usr/local/hive
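For example, assuming the folder was extracted on the Desktop as in Step 5:

```shell
SRC=/home/hduser/Desktop/apache-hive-2.1.0-bin   # folder created by the extraction
DEST=/usr/local/hive                             # directory created in Step 1
# Move everything inside the extracted folder, not the folder itself
mv "$SRC"/* "$DEST"/
```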
Step 7 - Edit the $HOME/.bashrc file to add the Hive path. Open the file in a text editor and add the following lines.
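The lines below are a likely form of the addition, assuming Hive lives in /usr/local/hive as in the earlier steps. HIVE_HOME is the variable name the Hive scripts conventionally read.

```shell
# Hive install directory from Step 6
export HIVE_HOME=/usr/local/hive
# Make the hive and schematool launchers available from any directory
export PATH="$PATH:$HIVE_HOME/bin"
```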
Step 8 - Reload your changed $HOME/.bashrc settings by running source $HOME/.bashrc.
Step 9 - Change the directory to /usr/local/hive/conf
Step 10 - Copy the default hive-env.sh.template to hive-env.sh
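A sketch of the copy, using absolute paths so it works from any directory:

```shell
CONF_DIR=/usr/local/hive/conf   # directory from Step 9
# Keep the template untouched and edit the copy
cp "$CONF_DIR/hive-env.sh.template" "$CONF_DIR/hive-env.sh"
```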
Step 11 - Edit hive-env.sh file.
Step 12 - Add the below lines to hive-env.sh file. Save and Close.
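The exact lines depend on where Hadoop is installed. A plausible version, assuming Hadoop is in /usr/local/hadoop (the parent of the sbin directory used in Step 16):

```shell
# Assumed Hadoop install location; Hive uses it to find the Hadoop launcher
export HADOOP_HOME=/usr/local/hadoop
# Optional: tell Hive where its configuration files live
export HIVE_CONF_DIR=/usr/local/hive/conf
```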
Step 13 - Copy the default hive-default.xml.template to hive-site.xml
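As with hive-env.sh, this can be done with a plain copy:

```shell
CONF_DIR=/usr/local/hive/conf
# hive-site.xml overrides the built-in defaults; the template lists every property
cp "$CONF_DIR/hive-default.xml.template" "$CONF_DIR/hive-site.xml"
```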
Step 14 - Edit hive-site.xml file.
Step 15 - Add or update below properties in hive-site.xml file. Save and Close.
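Which properties need changing depends on your setup. One adjustment that is commonly needed when hive-site.xml is copied from the template: several values contain ${system:java.io.tmpdir} and ${system:user.name} placeholders, which Hive 2.x may fail to resolve at startup (reported as a "Relative path in absolute URI" error). The sketch below replaces them with concrete values. The helper name fix_hive_site, the directory /tmp/hive, and the user hduser are assumptions for illustration, not values from this guide.

```shell
# Hypothetical helper: replace the ${system:...} placeholders in the given file
fix_hive_site() {
  sed -i \
    -e 's|\${system:java.io.tmpdir}|/tmp/hive|g' \
    -e 's|\${system:user.name}|hduser|g' \
    "$1"
}

SITE=/usr/local/hive/conf/hive-site.xml
# Only touch the file if it exists
[ -f "$SITE" ] && fix_hive_site "$SITE"
```

Editing the same values by hand in a text editor has the same effect; the sed form just avoids searching a very long file.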
Step 16 - Change the directory to /usr/local/hadoop/sbin
Step 17 - Start all hadoop daemons.
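A sketch using the two start scripts shipped with Hadoop 2.x (start-all.sh in the same directory does the same but is marked deprecated). jps is the JDK tool that lists running Java processes.

```shell
HADOOP_SBIN=/usr/local/hadoop/sbin   # directory from Step 16
"$HADOOP_SBIN/start-dfs.sh"          # HDFS daemons: NameNode, DataNode, SecondaryNameNode
"$HADOOP_SBIN/start-yarn.sh"         # YARN daemons: ResourceManager, NodeManager
# List the running Java processes to confirm the daemons are up
jps
```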
Step 18 - Before you can create a table in Hive, you must use the HDFS commands below to create /tmp and /user/hive/warehouse (aka hive.metastore.warehouse.dir) and make them group-writable (chmod g+w).
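These commands follow the form given in the Hive getting-started documentation, assuming Hadoop is installed in /usr/local/hadoop and its daemons are running:

```shell
HADOOP_HOME=/usr/local/hadoop          # assumed Hadoop install location
WAREHOUSE_DIR=/user/hive/warehouse     # default hive.metastore.warehouse.dir
# -p also creates the missing parents /user and /user/hive
"$HADOOP_HOME/bin/hadoop" fs -mkdir -p /tmp
"$HADOOP_HOME/bin/hadoop" fs -mkdir -p "$WAREHOUSE_DIR"
# Give the owning group write access to both directories
"$HADOOP_HOME/bin/hadoop" fs -chmod g+w /tmp
"$HADOOP_HOME/bin/hadoop" fs -chmod g+w "$WAREHOUSE_DIR"
```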
Step 19 - Change the directory to /usr/local/hive/bin
Step 20 - We need to run the schematool command below as an initialization step. For example, we can use "derby" as db type.
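A sketch of the initialization for the embedded Derby database:

```shell
cd /usr/local/hive/bin      # directory from Step 19
DB_TYPE=derby
# Creates the metastore schema for the chosen database type
./schematool -dbType "$DB_TYPE" -initSchema
```

With the default embedded Derby settings, the metastore_db folder is created in the directory where the command runs, so start Hive from that same directory unless hive-site.xml sets an absolute database path.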
Step 21 - Start the Hive command line interface (CLI) from the shell by running the hive command.
Step 22 - List all the tables present in the Derby database.
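Inside the interactive shell, type SHOW TABLES; at the hive> prompt. The same statement can also be run non-interactively with the CLI's -e option, as in this sketch:

```shell
QUERY='SHOW TABLES;'
# -e runs the quoted HiveQL statement and then exits
hive -e "$QUERY"
```

On a fresh installation no tables exist yet, so the statement is mainly a check that the metastore is reachable.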