HA集群中namenode连接不上journalnode,导致namenode启动不了


查看日志发现一下的错误:

2018-10-08 15:29:26,373 FATAL org.apache.hadoop.hdfs.server.namenode.FSEditLog: Error: recoverUnfinalizedSegments failed for required journal (JournalAndStream(mgr=QJM to [192.168.135.71:8485, 192.168.135.72:8485, 192.168.135.73:8485], stream=null)) org.apache.hadoop.hdfs.qjournal.client.QuorumException: Got too many exceptions to achieve quorum size 2/3. 3 exceptions thrown: 192.168.135.72:8485: Call From mini2/192.168.135.72 to mini2:8485 failed on connection exception: java.net.ConnectException: 拒绝连接; For more details see: http://wi ki.apache.org/hadoop/ConnectionRefused 192.168.135.71:8485: Call From mini2/192.168.135.72 to mini1:8485 failed on connection exception: java.net.ConnectException: 拒绝连接; For more details see: http://wi ki.apache.org/hadoop/ConnectionRefused 192.168.135.73:8485: Call From mini2/192.168.135.72 to mini3:8485 failed on connection exception: java.net.ConnectException: 拒绝连接; For more details see: http://wi ki.apache.org/hadoop/ConnectionRefused

解决方法:

方法一:首先手动启动journalnode,再手动启动namenode

方法二:修改core-site.xml中的ipc参数

<property>
<name> ipc.client.connect.max.retries</name>
<value> 100</value>
<description>
Indicates the number of retries a client will make to establisha server connection.
</description>
</property>
<property>
<name>ipc.client.connect.retry.interval</name>
<value>10000</value>
<description>Indicates the number of milliseconds a client will wait for
before retrying to establish a server connection.
</description>
</property>


免责声明!

本站转载的文章为个人学习借鉴使用,本站对版权不负任何法律责任。如果侵犯了您的隐私权益,请联系本站邮箱yoyou2525@163.com删除。



 
粤ICP备18138465号  © 2018-2025 CODEPRJ.COM