1.安装环境
- CentOS Linux release 7.1.1503
- jdk1.7.0_67
- hadoop2.7.1
- hive1.2.1
- hbase1.0.1.1
- zookeeper3.4.6
2.集群各节点角色规划
- 192.168.200.118/namenode NameNode、SecondaryNameNode、ResourceManager、HBase、Hive metastore、Impala Catalog、Impala statestore、zookeeper
- 192.168.200.119/datanode1 DataNode、NodeManager、HBase、NodeManager、Impala Server、Impala shell、zookeeper
- 192.168.200.120/datanode2 DataNode、NodeManager、HBase、NodeManager、Impala Server、Impala shell、zookeeper
- 192.168.200.121/datanode3 DataNode、NodeManager、HBase、NodeManager、Impala Server、Impala shell、zookeeper
3.在namenode节点(本集群中规划的是namenode)上安装以下impala安装包,注意:以下安装包必须在root用户下执行。
rpm下载:http://archive.cloudera.com/cdh5/redhat/6/x86_64/cdh/5.5.2/RPMS/x86_64/
- rpm -ivh --nodeps impala-2.3.0+cdh5.5.2+0-1.cdh5.5.2.p0.12.el6.x86_64.rpm
- rpm -ivh impala-state-store-2.3.0+cdh5.5.2+0-1.cdh5.5.2.p0.12.el6.x86_64.rpm
- rpm -ivh impala-catalog-2.3.0+cdh5.5.2+0-1.cdh5.5.2.p0.12.el6.x86_64.rpm
- rpm -ivh impala-udf-devel-2.3.0+cdh5.5.2+0-1.cdh5.5.2.p0.12.el6.x86_64.rpm
4.在每一个datanode节点安装一下impala安装包,注意:以下安装包必须在root用户下执行。
- rpm -ivh --nodeps --force impala-2.3.0+cdh5.5.2+0-1.cdh5.5.2.p0.12.el6.x86_64.rpm
- rpm -ivh --nodeps --force impala-shell-2.3.0+cdh5.5.2+0-1.cdh5.5.2.p0.12.el6.x86_64.rpm
- rpm -ivh --force impala-udf-devel-2.3.0+cdh5.5.2+0-1.cdh5.5.2.p0.12.el6.x86_64.rpm
- rpm -ivh --force impala-server-2.3.0+cdh5.5.2+0-1.cdh5.5.2.p0.12.el6.x86_64.rpm
5.配置Impala
复制hadoop配置文件core-site.xml 、hbase-site.xml 、hdfs-site.xml 、hive-site.xml到/etc/impala/conf目录下
1、hdfs-site.xml添加配置
<property> <name>dfs.client.read.shortcircuit</name> <value>true</value> </property> <property> <name>dfs.domain.socket.path</name> <value>/var/run/hdfs-sockets/dn</value> </property> <property> <name>dfs.client.file-block-storage-locations.timeout.millis</name> <value>10000</value> </property>
2、core-site.xml添加配置
<property> <name>dfs.client.read.shortcircuit</name> <value>true</value> </property>
3、每个datanode的hdfs-site.xml添加配置
<property> <name>dfs.client.use.legacy.blockreader.local</name> <value>true</value> </property> <property> <name>dfs.datanode.data.dir.perm</name> <value>750</value> </property> <property> <name>dfs.block.local-path-access.user</name> <value>impala</value> </property> <property> <name>dfs.client.file-block-storage-locations.timeout.millis</name> <value>10000</value> </property>
6、修改/usr/lib/impala/lib下的impala相关jar包
impala安装后hadoop相关jar包大多找不到,修改/usr/lib/impala/lib下的相关jar包,cdh5.5.2下我替换的jar包使用软连接仍然找不到,复制过去可以,复制过去的jar包必须改名为/usr/lib/impala/lib原来的jar包名称