YARN error
2017-08-25 03:51:58,815 WARN org.apache.hadoop.ipc.Server: Large response size 4739374 for call org.apache.hadoop.yarn.api.ApplicationClientProtocolPB.getApplications from 10.135.8.101:38352 Call#33361 Retry#0
2017-08-25 03:53:39,255 WARN org.apache.hadoop.ipc.Server: Large response size 4739374 for call org.apache.hadoop.yarn.api.ApplicationClientProtocolPB.getApplications from 10.135.8.101:38456 Call#33364 Retry#0
2017-08-25 03:55:19,700 WARN org.apache.hadoop.ipc.Server: Large response size 4739374 for call org.apache.hadoop.yarn.api.ApplicationClientProtocolPB.getApplications from 10.135.8.101:38556 Call#33367 Retry#0
2017-08-25 03:57:00,262 WARN org.apache.hadoop.ipc.Server: Large response size 4739374 for call org.apache.hadoop.yarn.api.ApplicationClientProtocolPB.getApplications from 10.135.8.101:38674 Call#33370 Retry#0
2017-08-25 03:58:40,687 WARN org.apache.hadoop.ipc.Server: Large response size 4739374 for call org.apache.hadoop.yarn.api.ApplicationClientProtocolPB.getApplications from 10.135.8.101:38804 Call#33373 Retry#0
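Solution:
1. Add the following parameter to hdfs-site. The value (5242880 bytes, 5 MiB) is above the 4739374-byte responses reported in the log: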
<property>
  <name>ipc.server.max.response.size</name>
  <value>5242880</value>
</property>
2. Responses this large may also cause OOM problems.
Increase the JVM -Xmx (maximum heap size) parameter.
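To pick a value for ipc.server.max.response.size in step 1, it can help to pull the reported sizes out of the log first. The following is a minimal sketch, not part of Hadoop: the function name `largest_responses` and the sample list are made up for illustration, and the regular expression assumes the warning format shown in the log lines above.

```python
import re

# Matches the byte count and RPC method in a "Large response size" warning.
LARGE_RESPONSE = re.compile(r"Large response size (\d+) for call (\S+)")


def largest_responses(log_lines):
    """Return {rpc_method: largest response size in bytes} seen in the log."""
    largest = {}
    for line in log_lines:
        for size, method in LARGE_RESPONSE.findall(line):
            largest[method] = max(largest.get(method, 0), int(size))
    return largest


# One warning copied from the log above.
sample = [
    "2017-08-25 03:51:58,815 WARN org.apache.hadoop.ipc.Server: "
    "Large response size 4739374 for call "
    "org.apache.hadoop.yarn.api.ApplicationClientProtocolPB.getApplications "
    "from 10.135.8.101:38352 Call#33361 Retry#0",
]

configured_threshold = 5242880  # value from the property in step 1

for method, size in largest_responses(sample).items():
    status = "below" if size < configured_threshold else "at or above"
    print(f"{method}: {size} bytes, {status} the configured threshold")
```

If the largest size reported for a call is still at or above the configured value, the value would need to be raised further.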
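Other issues
Normally, these IPC responses are returned at intervals on the order of 10 s to 1 min. If they are returned much more frequently than that, the RM (ResourceManager) may run out of memory (OOM).
This needs a deeper look at the source code. I will update this post once I have a conclusion.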
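In the meantime, one rough way to see how often these large responses are being returned is to measure the gaps between consecutive warnings. The sketch below is only an illustration under the same assumption about the log format. `warning_intervals` is a hypothetical helper, not a Hadoop tool.

```python
import re
from datetime import datetime

# Captures the timestamp of each "Large response size" warning.
WARNING_TIME = re.compile(
    r"(\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2},\d{3}) WARN \S+ Large response size"
)


def warning_intervals(log_text):
    """Return the gaps, in seconds, between consecutive warnings."""
    times = [
        datetime.strptime(stamp, "%Y-%m-%d %H:%M:%S,%f")
        for stamp in WARNING_TIME.findall(log_text)
    ]
    return [(later - earlier).total_seconds()
            for earlier, later in zip(times, times[1:])]


# The first three warnings from the log above.
call = ("org.apache.hadoop.yarn.api.ApplicationClientProtocolPB"
        ".getApplications")
sample_log = "\n".join([
    "2017-08-25 03:51:58,815 WARN org.apache.hadoop.ipc.Server: Large "
    f"response size 4739374 for call {call} from 10.135.8.101:38352 "
    "Call#33361 Retry#0",
    "2017-08-25 03:53:39,255 WARN org.apache.hadoop.ipc.Server: Large "
    f"response size 4739374 for call {call} from 10.135.8.101:38456 "
    "Call#33364 Retry#0",
    "2017-08-25 03:55:19,700 WARN org.apache.hadoop.ipc.Server: Large "
    f"response size 4739374 for call {call} from 10.135.8.101:38556 "
    "Call#33367 Retry#0",
])

for gap in warning_intervals(sample_log):
    print(f"{gap:.3f} s since the previous large response")
```

For the log shown at the top, the warnings are roughly 100 s apart. A much shorter gap would point to the frequent-response case described above.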
Links
2. https://issues.apache.org/jira/browse/HADOOP-14858
3. https://mapr.com/community/s/question/0D50L00006BIt35SAD/why-yarn-crashes-
4. https://issues.apache.org/jira/browse/YARN-7150
5. https://www.jianshu.com/p/ce998c10b471