原帖:https://blog.csdn.net/Damonhaus/article/details/76572971
問題:協同過濾 ALS算法。在測試過程中遇到了內存溢出的錯誤
解決辦法1:降低迭代次數,20次 -> 10次
val model = new ALS().setRank(10).setIterations(20).setLambda(0.01).setImplicitPrefs(false) .run(alldata)
以上改成 .setIterations(10)
解決辦法2:checkpoint機制
/** * 刪除checkpoint留下的過程數據 */ val path = new Path(HDFSConnection.paramMap("hadoop_url")+"/checkpoint"); //聲明要操作(刪除)的hdfs 文件路徑 val hadoopConf = spark.sparkContext.hadoopConfiguration val hdfs = org.apache.hadoop.fs.FileSystem.get(new URI(HDFSConnection.paramMap("hadoop_url")+"/checkpoint"),hadoopConf) if(hdfs.exists(path)) { //需要遞歸刪除設置true,不需要則設置false hdfs.delete(path, true) //這里因為是過程數據,可以遞歸刪除 } /** * 設置 CheckpointDir */ spark.sparkContext.setCheckpointDir(HDFSConnection.paramMap("hadoop_url")+"/checkpoint")
/** * Set period (in iterations) between checkpoints (default = 10). Checkpointing helps with * recovery (when nodes fail) and StackOverflow exceptions caused by long lineage. It also helps * with eliminating temporary shuffle files on disk, which can be important when there are many * ALS iterations. If the checkpoint directory is not set in [[org.apache.spark.SparkContext]], * this setting is ignored. */ val model = new ALS().setCheckpointInterval(2).setRank(10).setIterations(20).setLambda(0.01).setImplicitPrefs(false) .run(alldata)