HA集群中namenode連接不上journalnode,導致namenode啟動不了


查看日志發現一下的錯誤:

2018-10-08 15:29:26,373 FATAL org.apache.hadoop.hdfs.server.namenode.FSEditLog: Error: recoverUnfinalizedSegments failed for required journal (JournalAndStream(mgr=QJM to [192.168.135.71:8485, 192.168.135.72:8485, 192.168.135.73:8485], stream=null)) org.apache.hadoop.hdfs.qjournal.client.QuorumException: Got too many exceptions to achieve quorum size 2/3. 3 exceptions thrown: 192.168.135.72:8485: Call From mini2/192.168.135.72 to mini2:8485 failed on connection exception: java.net.ConnectException: 拒絕連接; For more details see: http://wi ki.apache.org/hadoop/ConnectionRefused 192.168.135.71:8485: Call From mini2/192.168.135.72 to mini1:8485 failed on connection exception: java.net.ConnectException: 拒絕連接; For more details see: http://wi ki.apache.org/hadoop/ConnectionRefused 192.168.135.73:8485: Call From mini2/192.168.135.72 to mini3:8485 failed on connection exception: java.net.ConnectException: 拒絕連接; For more details see: http://wi ki.apache.org/hadoop/ConnectionRefused

解決方法:

方法一:首先手動啟動journalnode,再手動啟動namenode

方法二:修改core-site.xml中的ipc參數

<property>
<name> ipc.client.connect.max.retries</name>
<value> 100</value>
<description>
Indicates the number of retries a client will make to establisha server connection.
</description>
</property>
<property>
<name>ipc.client.connect.retry.interval</name>
<value>10000</value>
<description>Indicates the number of milliseconds a client will wait for
before retrying to establish a server connection.
</description>
</property>


免責聲明!

本站轉載的文章為個人學習借鑒使用,本站對版權不負任何法律責任。如果侵犯了您的隱私權益,請聯系本站郵箱yoyou2525@163.com刪除。



 
粵ICP備18138465號   © 2018-2025 CODEPRJ.COM