Dubbo Failed to save registry store file, cause: Can not lock the registry cache file
啟動的Dubbo 服務的時候報錯,異常信息如下:
2016-08-22 16:44:40.588 | DubboSaveRegistryCache-thread-1 | WARN | com.alibaba.dubbo.common.logger.log4j.Log4jLogger:Log4jLogger.java(78) | [DUBBO] Failed to save registry store file, cause: Can not lock the registry cache file /root/.dubbo/dubbo-registry-10.141.4.168.cache, ignore and retry later, maybe multi java process use the file, please config: dubbo.registry.file=xxx.properties, dubbo version: 2.8.3, current host: 127.0.0.1 java.io.IOException: Can not lock the registry cache file /root/.dubbo/dubbo-registry-10.141.4.168.cache, ignore and retry later, maybe multi java process use the file, please config: dubbo.registry.file=xxx.properties at com.alibaba.dubbo.registry.support.AbstractRegistry.doSaveProperties(AbstractRegistry.java:193) ~[dubbo-2.8.3.jar:2.8.3] at com.alibaba.dubbo.registry.support.AbstractRegistry$SaveProperties.run(AbstractRegistry.java:150) [dubbo-2.8.3.jar:2.8.3] at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145) [na:1.7.0_60] at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615) [na:1.7.0_60] at java.lang.Thread.run(Thread.java:745) [na:1.7.0_60]
- 1
- 2
- 3
- 4
- 5
- 6
- 7
報錯的大概意思是 Dubbo在保存服務列表時失敗,Can not lock the registry cache file /root/.dubbo/dubbo-registry-10.141.4.168.cache,拿不到文件鎖,無法保存服務列表。
錯誤原因
出現這個的原因是服務向ZK注冊的同時,會緩存Consumer的列表,寫入user.home/.dubbo/dubbo-registry-” + url.getHost() + “.cache 這個文件,當在同一個機器上啟動多個Provider的時候,就會出現文件鎖爭用的問題,報上面這個錯誤。
解決辦法
既然是由於競爭文件鎖導致的,那么讓服務模塊各自緩存自己的cache文件就可以避免這樣的問題了。
具體做法是:在provider的xml配置文件中加入 file=”${catalina.home}/dubbo-registry/dubbo-registry.properties” ,如下:
<dubbo:registry id="zkcenter" protocol="zookeeper" address="${dubbo.zk_address}" file="${catalina.home}/dubbo-registry/dubbo-registry.properties"/>
- 1
這樣就會在catalina.home目錄下生成dubbo-registry這個目錄,cache文件就緩存在這個里邊了。
參考資料
AbstractRegistry lock problem:https://github.com/alibaba/dubbo/issues/81
另一個方法:
我們使用的Dubbo最近老是遇到WARN [DubboSaveRegistryCache-thread-1] (AbstractRegistry.java:221) method:doSaveProperties - [DUBBO] Failed to save registry store file, cause: Can not lock the registry cache file /home/newad/.dubbo/dubbo-registry-*.*.*.*.cache, ignore and retry later, maybe multi java process use the file, please config: dubbo.registry.file=xxx.properties, dubbo version: 2.5.3, current host: *.*.*.*
java.io.IOException: Can not lock the registry cache file /home/newad/.dubbo/dubbo-registry-*.*.*.*.cache, ignore and retry later, maybe multi java process use the file, please config: dubbo.registry.file=xxx.properties
從異常中很清楚的看到,Dubbo在保存服務列表時失敗,失敗的原因也很簡單,異常里面都說得很清楚了,Can not lock the registry cache file /home/newad/.dubbo/dubbo-registry-*.*.*.*.cache,拿不到文件鎖,無法保存服務列表。
Dubbo通過注冊中心發現服務,發現的服務Dubbo同時也會保存到本地緩存一份,緩存的好處有很多,比如不需要每次使用的時候都通過注冊中心獲取,注冊中心不可用了,不影響消費端的調用,因為本地緩存了一份服務提供者列表。Dubbo本地緩存默認采用的文件,會根據注冊中心自動在當前用戶目錄下生成一個緩存文件,類似/home/newad/.dubbo/dubbo-registry-*.*.*.*.cache,星號表示注冊中心的IP地址,當同一台機器上同時啟動多個進程,就會出現多個進程爭奪此文件的寫入權限,觖此問題的方法也很簡單,日志里面都說了重新配置一下這個緩存文件就。
主要在啟動腳本里面添加配置: -Ddubbo.registry.file=/home/newad/.dubbo/dubbo-registry-Order-0.cache
- #!/bin/sh
- CU=/home/www/WEB-INF/release/
- #LANG="zh_CN"
- #export LANG
- CP=$CU":./"
- LIB=$CU"lib/*.jar"
- for i in $LIB
- do
- CP="$i:$CP"
- done
- export CP
- JAVA=/home/www/jvm/jdk1.7.0_02/bin/java
- export JAVA
- cd "$CUorder"
- lock=./lock
- if [ ! -f "$lock" ]
- then
- touch "$lock"
- echo "classpath:" $CP
- $JAVA -server -Xms1024m -Xmx1024m -XX:PermSize=256m -DOrder=Order-0 -Dlog4j.configuration=file:/home/www/WEB-INF/release/order/log4j.properties -Ddubbo.registry.file=/home/newad/.dubbo/dubbo-registry-Order-0.cache -cp $CP com.product.PServer
- rm "$lock"
- else
- echo " already startup!"
- fi


