1. Installation environment
- CentOS Linux release 7.1.1503
- jdk1.7.0_67
- hadoop2.7.1
- hive1.2.1
- hbase1.0.1.1
- zookeeper3.4.6
2. Cluster node role planning
- 192.168.200.118/namenode: NameNode, SecondaryNameNode, ResourceManager, HBase, Hive metastore, Impala Catalog, Impala statestore, zookeeper
- 192.168.200.119/datanode1: DataNode, NodeManager, HBase, Impala Server, Impala shell, zookeeper
- 192.168.200.120/datanode2: DataNode, NodeManager, HBase, Impala Server, Impala shell, zookeeper
- 192.168.200.121/datanode3: DataNode, NodeManager, HBase, Impala Server, Impala shell, zookeeper
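The role plan refers to each machine by host name, so every node needs to resolve those names. A minimal sketch of the matching /etc/hosts entries, assuming names are resolved through /etc/hosts rather than DNS (the original notes do not say which):

```shell
#!/usr/bin/env bash
# Print /etc/hosts entries for the four nodes in the role plan.
# Assumption: hosts are resolved through /etc/hosts (not stated in the notes).
cluster_hosts() {
  cat <<'EOF'
192.168.200.118 namenode
192.168.200.119 datanode1
192.168.200.120 datanode2
192.168.200.121 datanode3
EOF
}

# Review the output, then append it as root on every node, e.g.:
#   cluster_hosts >> /etc/hosts
cluster_hosts
```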
3. Install the following Impala packages on the namenode node (the host planned as namenode in this cluster). Note: the commands below must be run as root.
RPM download: http://archive.cloudera.com/cdh5/redhat/6/x86_64/cdh/5.5.2/RPMS/x86_64/
- rpm -ivh --nodeps impala-2.3.0+cdh5.5.2+0-1.cdh5.5.2.p0.12.el6.x86_64.rpm
- rpm -ivh impala-state-store-2.3.0+cdh5.5.2+0-1.cdh5.5.2.p0.12.el6.x86_64.rpm
- rpm -ivh impala-catalog-2.3.0+cdh5.5.2+0-1.cdh5.5.2.p0.12.el6.x86_64.rpm
- rpm -ivh impala-udf-devel-2.3.0+cdh5.5.2+0-1.cdh5.5.2.p0.12.el6.x86_64.rpm
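The four installs above can be wrapped in one script. This is only a sketch: it assumes wget is available and that the archive URL is still reachable, and the helper names (`run`, `install_namenode_rpms`) and the `DRY_RUN` switch are made up for illustration. By default it prints the commands instead of executing them.

```shell
#!/usr/bin/env bash
# Sketch: download and install the namenode-side Impala RPMs.
# DRY_RUN=1 (the default here) only prints the commands;
# set DRY_RUN=0 and run as root to actually execute them.
BASE_URL="http://archive.cloudera.com/cdh5/redhat/6/x86_64/cdh/5.5.2/RPMS/x86_64"
SUFFIX="2.3.0+cdh5.5.2+0-1.cdh5.5.2.p0.12.el6.x86_64.rpm"
DRY_RUN="${DRY_RUN:-1}"

# Print the command in dry-run mode, otherwise execute it.
run() {
  if [ "$DRY_RUN" = "1" ]; then
    echo "$*"
  else
    "$@"
  fi
}

install_namenode_rpms() {
  local pkg
  for pkg in impala impala-state-store impala-catalog impala-udf-devel; do
    # -nc: skip the download if the file is already present
    run wget -nc "$BASE_URL/$pkg-$SUFFIX"
  done
  # Same rpm options as in the list above: --nodeps only for the base package
  run rpm -ivh --nodeps "impala-$SUFFIX"
  run rpm -ivh "impala-state-store-$SUFFIX"
  run rpm -ivh "impala-catalog-$SUFFIX"
  run rpm -ivh "impala-udf-devel-$SUFFIX"
}

install_namenode_rpms
```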
4. Install the following Impala packages on every datanode node. Note: the commands below must be run as root.
- rpm -ivh --nodeps --force impala-2.3.0+cdh5.5.2+0-1.cdh5.5.2.p0.12.el6.x86_64.rpm
- rpm -ivh --nodeps --force impala-shell-2.3.0+cdh5.5.2+0-1.cdh5.5.2.p0.12.el6.x86_64.rpm
- rpm -ivh --force impala-udf-devel-2.3.0+cdh5.5.2+0-1.cdh5.5.2.p0.12.el6.x86_64.rpm
- rpm -ivh --force impala-server-2.3.0+cdh5.5.2+0-1.cdh5.5.2.p0.12.el6.x86_64.rpm
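Since the same four commands run on all three datanodes, they can be driven from one machine. The sketch below assumes root can ssh to each datanode and that the RPMs were already copied to a directory on every datanode; neither is stated in the original notes, and `RPM_DIR`, `DRY_RUN`, and the function names are hypothetical. By default it only prints the ssh commands.

```shell
#!/usr/bin/env bash
# Sketch: run the datanode-side installs on each datanode over ssh.
# Assumptions (not from the notes): root ssh access to every datanode,
# and the RPM files already present in RPM_DIR on each of them.
SUFFIX="2.3.0+cdh5.5.2+0-1.cdh5.5.2.p0.12.el6.x86_64.rpm"
RPM_DIR="${RPM_DIR:-/root/impala-rpms}"   # hypothetical location
DRY_RUN="${DRY_RUN:-1}"

datanode_install_cmd() {
  # Same packages, order, and rpm options as the list above
  echo "cd $RPM_DIR && rpm -ivh --nodeps --force impala-$SUFFIX" \
    "&& rpm -ivh --nodeps --force impala-shell-$SUFFIX" \
    "&& rpm -ivh --force impala-udf-devel-$SUFFIX" \
    "&& rpm -ivh --force impala-server-$SUFFIX"
}

install_on_datanodes() {
  local host
  for host in datanode1 datanode2 datanode3; do
    if [ "$DRY_RUN" = "1" ]; then
      # Dry run: show what would be executed on this host
      echo "ssh root@$host \"$(datanode_install_cmd)\""
    else
      ssh "root@$host" "$(datanode_install_cmd)"
    fi
  done
}

install_on_datanodes
```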
5. Configure Impala
Copy the configuration files core-site.xml, hbase-site.xml, hdfs-site.xml, and hive-site.xml into the /etc/impala/conf directory.
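The copy step might look like the sketch below. The three source directories are assumptions, because the notes do not say where Hadoop, HBase, and Hive are installed; override the variables to match your own layout. `copy_impala_confs` is a hypothetical helper name.

```shell
#!/usr/bin/env bash
# Sketch: copy the four client config files into Impala's config directory.
# The source directories below are hypothetical defaults, not from the notes.
HADOOP_CONF_DIR="${HADOOP_CONF_DIR:-/usr/local/hadoop/etc/hadoop}"
HBASE_CONF_DIR="${HBASE_CONF_DIR:-/usr/local/hbase/conf}"
HIVE_CONF_DIR="${HIVE_CONF_DIR:-/usr/local/hive/conf}"
IMPALA_CONF_DIR="${IMPALA_CONF_DIR:-/etc/impala/conf}"

copy_impala_confs() {
  local src
  for src in \
    "$HADOOP_CONF_DIR/core-site.xml" \
    "$HADOOP_CONF_DIR/hdfs-site.xml" \
    "$HBASE_CONF_DIR/hbase-site.xml" \
    "$HIVE_CONF_DIR/hive-site.xml"; do
    if [ -f "$src" ]; then
      cp -v "$src" "$IMPALA_CONF_DIR/"
    else
      # Report missing files instead of stopping, so all gaps show up at once
      echo "missing: $src" >&2
    fi
  done
}

# Run as root on every node that has Impala installed:
#   copy_impala_confs
```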
(1) Add the following to hdfs-site.xml:
<property>
  <name>dfs.client.read.shortcircuit</name>
  <value>true</value>
</property>
<property>
  <name>dfs.domain.socket.path</name>
  <value>/var/run/hdfs-sockets/dn</value>
</property>
<property>
  <name>dfs.client.file-block-storage-locations.timeout.millis</name>
  <value>10000</value>
</property>
(2) Add the following to core-site.xml:
<property>
  <name>dfs.client.read.shortcircuit</name>
  <value>true</value>
</property>
(3) On each datanode, add the following to hdfs-site.xml:
<property>
  <name>dfs.client.use.legacy.blockreader.local</name>
  <value>true</value>
</property>
<property>
  <name>dfs.datanode.data.dir.perm</name>
  <value>750</value>
</property>
<property>
  <name>dfs.block.local-path-access.user</name>
  <value>impala</value>
</property>
<property>
  <name>dfs.client.file-block-storage-locations.timeout.millis</name>
  <value>10000</value>
</property>
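After editing several XML files by hand on several nodes, it is easy to miss a property. The sketch below checks that a file sets a property to the expected value. It uses plain text matching rather than a real XML parser, and `has_property` and `check_impala_hdfs_site` are hypothetical helper names.

```shell
#!/usr/bin/env bash
# Sketch: check that a Hadoop-style XML file sets a property to a given value.
# Plain text matching, not an XML parser; dots in the name match any character.
has_property() {
  local file="$1" name="$2" value="$3"
  # Join all lines so <name> and <value> may sit on separate lines
  tr -d '\n' < "$file" |
    grep -Eq "<name>${name}</name>[[:space:]]*<value>${value}</value>"
}

# Check the three hdfs-site.xml properties from step (1).
check_impala_hdfs_site() {
  local file="${1:-/etc/impala/conf/hdfs-site.xml}"
  local status=0 entry
  for entry in \
    "dfs.client.read.shortcircuit=true" \
    "dfs.domain.socket.path=/var/run/hdfs-sockets/dn" \
    "dfs.client.file-block-storage-locations.timeout.millis=10000"; do
    if has_property "$file" "${entry%%=*}" "${entry#*=}"; then
      echo "ok:      ${entry%%=*}"
    else
      echo "missing: ${entry%%=*}"
      status=1
    fi
  done
  return "$status"
}

# Usage on a node:
#   check_impala_hdfs_site /etc/impala/conf/hdfs-site.xml
```

The same `has_property` call can be pointed at each datanode's hdfs-site.xml for the properties in step (3), for example `has_property hdfs-site.xml dfs.block.local-path-access.user impala`.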
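6. Replace the Impala-related jars under /usr/lib/impala/lib
After Impala is installed, most of the Hadoop-related jars it depends on cannot be found, so the corresponding jars under /usr/lib/impala/lib have to be replaced. With cdh5.5.2, the replacement jars were still not found when I linked them in with symlinks; copying them in works. Each copied jar must be renamed to the name of the original jar in /usr/lib/impala/lib.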
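A small helper can make the copy-and-rename step less error-prone. This sketch assumes the missing jars appear in /usr/lib/impala/lib as dangling symlinks, which the notes do not confirm; `list_broken_jars` and `replace_impala_jar` are hypothetical names, and the jar paths in the usage comment are placeholders.

```shell
#!/usr/bin/env bash
# Sketch: find unresolved jars in Impala's lib directory and replace each one
# with a real copy stored under the file name Impala expects.
IMPALA_LIB="${IMPALA_LIB:-/usr/lib/impala/lib}"

# List jar entries that are symlinks whose target does not exist.
# Assumption: missing jars show up as dangling symlinks.
list_broken_jars() {
  local f
  for f in "$IMPALA_LIB"/*.jar; do
    if [ -L "$f" ] && [ ! -e "$f" ]; then
      echo "${f##*/}"
    fi
  done
}

# Copy (not link) a jar into IMPALA_LIB under the original jar's name.
replace_impala_jar() {
  local src="$1" expected_name="$2"
  local dest="$IMPALA_LIB/$expected_name"
  [ -f "$src" ] || { echo "source jar not found: $src" >&2; return 1; }
  # Remove the existing entry first; it may be a dangling symlink,
  # which cp would not replace with a regular file.
  rm -f "$dest"
  cp "$src" "$dest"
}

# Example with placeholder names (substitute the jars from your own install):
#   list_broken_jars
#   replace_impala_jar /path/to/hadoop-common-2.7.1.jar hadoop-common.jar
```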
