剛准備敲鍵盤,突然想起今天已經星期五了,有點小興奮,一周又這么愉快的結束,又可以休息了,等等。。我好像是來寫Java博客的,怎么變成了寫日記,好吧,言歸正傳。
不知道大家有沒有遇到過這樣的需求:統計一遍文章的瀏覽量並且分析出詳情,即QQ,微信,微博,網頁,網站APP,其他的訪問比率,我們公司前一段時間遇到了這樣的需求,那么我接下來講解一下它的具體實現。
首先需要統計一篇文章的訪問量,這里的訪問量是精確訪問量(登陸的用戶多次訪問,記多次算一人,游客根據ip統計,同一ip算一人),下面是獲取ip的代碼實現:
public static String getIpAddr(Request request) {
/* String ip = request.getHeader( " x-forwarded-for " );
if (ip == null || ip.length() == 0 || " unknown " .equalsIgnoreCase(ip)) {
ip = request.getHeader( " Proxy-Client-IP " );
}
if (ip == null || ip.length() == 0 || " unknown " .equalsIgnoreCase(ip)) {
ip = request.getHeader( " WL-Proxy-Client-IP " );
}
if (ip == null || ip.length() == 0 || " unknown " .equalsIgnoreCase(ip)) {
ip = request.getRemoteAddr();
}
if(ip.contains(",")){
ip = ip.substring(0, ip.indexOf(","));
}
return ip; */
String ipAddress = null;
//ipAddress = this.getRequest().getRemoteAddr();
ipAddress = request.getHeader("x-forwarded-for");
if (ipAddress == null || ipAddress.length() == 0 || "unknown".equalsIgnoreCase(ipAddress)) {
ipAddress = request.getHeader("Proxy-Client-IP");
}
if (ipAddress == null || ipAddress.length() == 0 || "unknown".equalsIgnoreCase(ipAddress)) {
ipAddress = request.getHeader("WL-Proxy-Client-IP");
}
if (ipAddress == null || ipAddress.length() == 0 || "unknown".equalsIgnoreCase(ipAddress)) {
ipAddress = request.remoteAddress();
if (ipAddress.equals("127.0.0.1")) {
//根據網卡取本機配置的IP
InetAddress inet = null;
try {
inet = InetAddress.getLocalHost();
} catch (Exception e) {
e.printStackTrace();
}
ipAddress = inet.getHostAddress();
}
}
//對於通過多個代理的情況,第一個IP為客戶端真實IP,多個IP按照','分割
if (ipAddress != null && ipAddress.length() > 15) { //"***.***.***.***".length() = 15
if (ipAddress.indexOf(",") > 0) {
ipAddress = ipAddress.substring(0, ipAddress.indexOf(","));
}
}
return ipAddress;
}
第二步,是在保存每條訪問記錄時進行訪問來源的插入,比如是qq還是微信等等,如下是獲取它的訪問來源的具體代碼實現:
/**
* 獲取瀏覽端的類型,0-網頁,1;//微博 2;//qq 5;//微信 6;//APP 7;//其他
*@author LongJin
* @return
*/
public static int getFromPc(String client) {
Http.Context context = ctx();
Http.Request request = context.request();
String head = request.getHeader("user-agent").toLowerCase();
play.Logger.info("head = " + request.getHeader("user-agent"));
int from = 0;//pc端
boolean isPc = HttpRequestDeviceUtils.isPcDevice(request);//判斷是否來源於pc端
if (!isPc) {
if (head.contains("weibo")) {
from = 1;//微博
} else if (getQq(head)) {//qq單獨寫了個判斷方法
from = 2;//qq
} else if (head.contains("micromessenger")) {
from = 5;//微信
} else if (StringUtils.isNotBlank(client)) {
from = 6;//APP
} else {
from = 7;//其他
}
}
return from;
}
判斷是否是來源於QQ:
/**
* 獲取QQ
*
* @param str
* @return
*/
public static boolean getQqCount(String str) {
Pattern p = Pattern.compile("\\bqq\\b", Pattern.CASE_INSENSITIVE);
Matcher m = p.matcher(str);
boolean isQq= false;
while (m.find()) {
isQq = true;
}
return isQq;
}
其中的HttpRequestDeviceUtils.isPcDevice(request)判斷是否來源pc端實現如下:
/**Wap網關Via頭信息中特有的描述信息*/
private static String mobileGateWayHeaders[]=new String[]{
"ZXWAP",//中興提供的wap網關的via信息,例如:Via=ZXWAP GateWayZTE Technologies,
"chinamobile.com",//中國移動的諾基亞wap網關,例如:Via=WTP/1.1 GDSZ-PB-GW003-WAP07.gd.chinamobile.com (Nokia WAP Gateway 4.1 CD1/ECD13_D/4.1.04)
"monternet.com",//移動夢網的網關,例如:Via=WTP/1.1 BJBJ-PS-WAP1-GW08.bj1.monternet.com. (Nokia WAP Gateway 4.1 CD1/ECD13_E/4.1.05)
"infoX",//華為提供的wap網關,例如:Via=HTTP/1.1 GDGZ-PS-GW011-WAP2 (infoX-WISG Huawei Technologies),或Via=infoX WAP Gateway V300R001 Huawei Technologies
"XMS 724Solutions HTG",//國外電信運營商的wap網關,不知道是哪一家
"wap.lizongbo.com",//自己測試時模擬的頭信息
"Bytemobile",//貌似是一個給移動互聯網提供解決方案提高網絡運行效率的,例如:Via=1.1 Bytemobile OSN WebProxy/5.1
};
/**電腦上的IE或Firefox瀏覽器等的User-Agent關鍵詞*/
private static String[] pcHeaders=new String[]{
"Windows 98",
"Windows ME",
"Windows 2000",
"Windows XP",
"Windows NT",
"Ubuntu",
"Macintosh",
"Linux"
};
/**手機瀏覽器的User-Agent里的關鍵詞*/
private static String[] mobileUserAgents=new String[]{
"Nokia",//諾基亞,有山寨機也寫這個的,總還算是手機,Mozilla/5.0 (Nokia5800 XpressMusic)UC AppleWebkit(like Gecko) Safari/530
"SAMSUNG",//三星手機 SAMSUNG-GT-B7722/1.0+SHP/VPP/R5+Dolfin/1.5+Nextreaming+SMM-MMS/1.2.0+profile/MIDP-2.1+configuration/CLDC-1.1
"MIDP-2",//j2me2.0,Mozilla/5.0 (SymbianOS/9.3; U; Series60/3.2 NokiaE75-1 /110.48.125 Profile/MIDP-2.1 Configuration/CLDC-1.1 ) AppleWebKit/413 (KHTML like Gecko) Safari/413
"CLDC1.1",//M600/MIDP2.0/CLDC1.1/Screen-240X320
"SymbianOS",//塞班系統的,
"MAUI",//MTK山寨機默認ua
"UNTRUSTED/1.0",//疑似山寨機的ua,基本可以確定還是手機
"Windows CE",//Windows CE,Mozilla/4.0 (compatible; MSIE 6.0; Windows CE; IEMobile 7.11)
"iPhone",//iPhone是否也轉wap?不管它,先區分出來再說。Mozilla/5.0 (iPhone; U; CPU iPhone OS 4_1 like Mac OS X; zh-cn) AppleWebKit/532.9 (KHTML like Gecko) Mobile/8B117
"iPad",//iPad的ua,Mozilla/5.0 (iPad; U; CPU OS 3_2 like Mac OS X; zh-cn) AppleWebKit/531.21.10 (KHTML like Gecko) Version/4.0.4 Mobile/7B367 Safari/531.21.10
"Android",//Android是否也轉wap?Mozilla/5.0 (Linux; U; Android 2.1-update1; zh-cn; XT800 Build/TITA_M2_16.22.7) AppleWebKit/530.17 (KHTML like Gecko) Version/4.0 Mobile Safari/530.17
"BlackBerry",//BlackBerry8310/2.7.0.106-4.5.0.182
"UCWEB",//ucweb是否只給wap頁面? Nokia5800 XpressMusic/UCWEB7.5.0.66/50/999
"ucweb",//小寫的ucweb貌似是uc的代理服務器Mozilla/6.0 (compatible; MSIE 6.0;) Opera ucweb-squid
"BREW",//很奇怪的ua,例如:REW-Applet/0x20068888 (BREW/3.1.5.20; DeviceId: 40105; Lang: zhcn) ucweb-squid
"J2ME",//很奇怪的ua,只有J2ME四個字母
"YULONG",//宇龍手機,YULONG-CoolpadN68/10.14 IPANEL/2.0 CTC/1.0
"YuLong",//還是宇龍
"COOLPAD",//宇龍酷派YL-COOLPADS100/08.10.S100 POLARIS/2.9 CTC/1.0
"TIANYU",//天語手機TIANYU-KTOUCH/V209/MIDP2.0/CLDC1.1/Screen-240X320
"TY-",//天語,TY-F6229/701116_6215_V0230 JUPITOR/2.2 CTC/1.0
"K-Touch",//還是天語K-Touch_N2200_CMCC/TBG110022_1223_V0801 MTK/6223 Release/30.07.2008 Browser/WAP2.0
"Haier",//海爾手機,Haier-HG-M217_CMCC/3.0 Release/12.1.2007 Browser/WAP2.0
"DOPOD",//多普達手機
"Lenovo",// 聯想手機,Lenovo-P650WG/S100 LMP/LML Release/2010.02.22 Profile/MIDP2.0 Configuration/CLDC1.1
"LENOVO",// 聯想手機,比如:LENOVO-P780/176A
"HUAQIN",//華勤手機
"AIGO-",//愛國者居然也出過手機,AIGO-800C/2.04 TMSS-BROWSER/1.0.0 CTC/1.0
"CTC/1.0",//含義不明
"CTC/2.0",//含義不明
"CMCC",//移動定制手機,K-Touch_N2200_CMCC/TBG110022_1223_V0801 MTK/6223 Release/30.07.2008 Browser/WAP2.0
"DAXIAN",//大顯手機DAXIAN X180 UP.Browser/6.2.3.2(GUI) MMP/2.0
"MOT-",//摩托羅拉,MOT-MOTOROKRE6/1.0 LinuxOS/2.4.20 Release/8.4.2006 Browser/Opera8.00 Profile/MIDP2.0 Configuration/CLDC1.1 Software/R533_G_11.10.54R
"SonyEricsson",// 索愛手機,SonyEricssonP990i/R100 Mozilla/4.0 (compatible; MSIE 6.0; Symbian OS; 405) Opera 8.65 [zh-CN]
"GIONEE",//金立手機
"HTC",//HTC手機
"ZTE",//中興手機,ZTE-A211/P109A2V1.0.0/WAP2.0 Profile
"HUAWEI",//華為手機,
"webOS",//palm手機,Mozilla/5.0 (webOS/1.4.5; U; zh-CN) AppleWebKit/532.2 (KHTML like Gecko) Version/1.0 Safari/532.2 Pre/1.0
"GoBrowser",//3g GoBrowser.User-Agent=Nokia5230/GoBrowser/2.0.290 Safari
"IEMobile",//Windows CE手機自帶瀏覽器,
"WAP2.0"//支持wap 2.0的
};
/**
* 判斷是否為pc端
* @param request
* @return
*/
public static boolean isPcDevice(Request request){
boolean b = false;
boolean pcFlag = false;
boolean mobileFlag = false;
String via = request.getHeader("Via");
String userAgent = request.getHeader("user-agent");
//String xMsisdn = request.getHeader("X-MSISDN");
/* Map<String, String[]> headers = request.headers();
play.Logger.info("xM = "+headers);*/
for (int i = 0; userAgent!=null && !userAgent.trim().equals("") && i < pcHeaders.length; i++) {
if(userAgent.contains(pcHeaders[i])){
pcFlag = true;
break;
}
}
for (int i = 0; via!=null && !via.trim().equals("") && i < mobileGateWayHeaders.length; i++) {
if(via.contains(mobileGateWayHeaders[i])){
mobileFlag = true;
break;
}
}
for (int i = 0;!mobileFlag && userAgent!=null && !userAgent.trim().equals("") && i < mobileUserAgents.length; i++) {
if(userAgent.contains(mobileUserAgents[i])){
mobileFlag = true;
break;
}
}
if(mobileFlag==false && pcFlag==true){
b=true;
}
return b;
}
至此,訪問量的分類就基本算是完成了,可以結合訪問量的統計實現原創文章排行榜功能並且統計詳情做成餅狀圖,寫着寫着突然斷電了,可能是要放假了,激動了。
ps:這里只是講解一些核心部分,至於具體詳細實現沒做講解。在我實現的整個過程中,存在一個統計量精度問題,即統一局域網下外網一樣,這樣導致了在統計訪問量時存在同一局域網下訪問只算一人,望大神指點。
