最近無意翻開4年前做過的一個功能,就是搜集全國各城市各個區(縣)的路(XX路、XX道、XX街、XX鎮、XX鄉、XX屯、XX村、XX社)。眾所周知,我們都可以在網上找到省、市、區(縣)這三級聯動的數據,可是就並沒有關於某個城市的某些區(縣)下所對應的路(以下所有的路,道,街,鎮,鄉,屯,村,社統稱為路)的數據,不過我們可以找到一些有地址的網站,例如大眾點評網,里面就有很多一些飲食店等的具體地址。可以寫個爬蟲程序,把所有的詳細地址先寫進數據庫的某個表中,然后再執行算法,把地址中有包含路、道、街、鎮、鄉、屯、村、社等屬於第四級的數據寫進對應的數據表(當然必須把第四級的數據對應到第三級區(縣))。
1、首先需要建五張表(一個用來存放地址的表Address,一個用來存放全國的所有的省Province,一個用來存放屬於省的城市City,一個用來存放屬於城市的區(縣)District,一個用來存放屬於區的路Road)。下面是數據表關系圖:
所有的表中的FID是外鍵表的ID,其中省Province的FID暫時沒用到可忽略,其中flag是一個標志,如0表示新加,1表示修改,2表示刪除等,shid和shengId是對應到其它表的,如某個城市的郵政編碼,主要是用來搜集它們的一些對應的數據而已,這里可不考慮。下面是數據庫查詢的數據展示圖:
2.其次是寫一個根據輸入的地址,返回省市區路四級聯動的數據,然后把第四級路寫進Road這個表並且把它的外鍵FID對應到第三級District區的主鍵ID
這里使用存儲過程來處理,好了,那么問題就來了,執行的地址有好幾種情況,在插入路這個數據之前需要對地址進行查找,有五種查找方式:
需要注意城市是否為直轄市(目前直轄市有四個:重慶市,北京市,天津市,上海市)
1)查找方式——省市區路,這個是最理想的,因為地址中已有前面三級的數據,首先找到省,然后找市,再找區,最后找路,如果路不存在就把路插入到Road表,返回省市區路
2)查找方式——省區路,這里只有省和區,首先找出省,然后找區,再根據區找到其所對應的城市,最后找路,如果路不存在就把路插入到Road表,返回省市區路
3)查找方式——省市路,這里只有省和市,於上面兩種方式來說是比較復雜,首先找出省,然后找市,再根據路來找出對應的區,如果路不存在就把路插入到Road表,返回省市路
如果存在,則可找出區,返回省市區路。不存在時只能手工更正。
4)查找方式——市區路,這里只有市和區,首先找出市,然后根據市再找省,最后找路,如果路不存在就把路插入到Road表,返回省市區路
5)查找方式——市路,這里只有市,於上面四種方式是最復雜的,首先找出市,如果不是直轄市就找省,然后根據路來找區,如果路不存在就把路插入到Road表,返回省市路
如果存在,則可找出區,返回省市區路。不存在時只能手工更正。
以上是返回省市區路的五種查找方式,下面是存儲過程:
GetAddress是查找方式的存儲過程,返回省市區路,代碼:

1 ALTER pROCEDURE [dbo].[GetAddress] 2 @address varchar(100), 3 @NProvinceName varchar(30) OUTPUT, 4 @NCityName varchar(30) OUTPUT, 5 @NDistrictName varchar(30) OUTPUT, 6 @NRoadName varchar(50) OUTPUT, 7 @remain varchar(50) OUTPUT, 8 @PostCode varchar(7) OUTPUT, 9 @Road varchar(100) output, --新增,要插入road的路名 10 @method int output, 11 @number int output, 12 @insert bit =1 --新增,是否插入 13 AS 14 BEGIN 15 16 set @address=replace(@address,' ',''); --空格 17 set @address=replace(@address,' ',''); --制表符 18 set @address=replace(@address,'0','0'); --數字 19 set @address=replace(@address,'1','1'); --數字 20 set @address=replace(@address,'2','2'); --數字 21 set @address=replace(@address,'3','3'); --數字 22 set @address=replace(@address,'4','4'); --數字 23 set @address=replace(@address,'5','5'); --數字 24 set @address=replace(@address,'6','6'); --數字 25 set @address=replace(@address,'7','7'); --數字 26 set @address=replace(@address,'8','8'); --數字 27 set @address=replace(@address,'9','9'); --數字 28 29 set @method=1 30 set @number=0 31 32 create table #temp(NProvinceName varchar(30),NCityName varchar(30),NDistrictName varchar(30), 33 NRoadName varchar(50),remain varchar(50),PostCode varchar(7) ,Road varchar(100),method int,number int) 34 35 while @method<=5 36 begin 37 exec ResAddress @address,@NProvinceName output,@NCityName output, 38 @NDistrictName output,@NRoadName output,@remain output,@PostCode output,@Road output,@method output,@number output 39 40 insert into #temp values (@NProvinceName,@NCityName,@NDistrictName,@NRoadName,@remain,@PostCode,@Road,@method,@number) 41 42 select @NProvinceName=null,@NCityName=null,@NDistrictName=null,@NRoadName=null,@remain=null,@PostCode=null,@number=0,@Road=null 43 set @method=@method+1 44 end 45 46 /* 47 select top 1 @NProvinceName=NProvinceName,@NCityName=NCityName,@NDistrictName=NDistrictName, 48 @NRoadName=NRoadName,@remain=remain,@PostCode=PostCode ,@method=method,@number=number,@Road=Road from #temp order by number desc 49 */ 50 51 select top 1 @NProvinceName=dbo.F_Convert(NProvinceName,0),@NCityName=dbo.F_Convert(NCityName,0),@NDistrictName=dbo.F_Convert(NDistrictName,0), 52 @NRoadName=dbo.F_Convert(NRoadName,0),@remain=dbo.F_Convert(remain,0),@PostCode=dbo.F_Convert(PostCode,0) , 53 @method=dbo.F_Convert(method,0),@number=dbo.F_Convert(number,0),@Road=dbo.F_Convert(Road,0) from #temp order by number desc 54 55 if (@Road is not null) and (@insert=1) 56 begin 57 if not exists (select * from road where fid=left(@road,36) and title=RIGHT(@road,LEN(@ROAD)-36) and text=RIGHT(@road,LEN(@ROAD)-36)) 58 insert into Road(fid,title,flag,text) values(left(@road,36),RIGHT(@road,LEN(@ROAD)-36),2,RIGHT(@road,LEN(@ROAD)-36)) 59 end 60 drop table #temp 61 END
ResAddress是查找方式中需要調用的存儲過程,其中有number這個返回一個數字,數字越大表示查找的難度就越大,代碼:

1 ALTER pROCEDURE [dbo].[ResAddress] 2 3 @address varchar(100), 4 @NProvinceName varchar(30) OUTPUT, 5 @NCityName varchar(30) OUTPUT, 6 @NDistrictName varchar(30) OUTPUT, 7 @NRoadName varchar(50) OUTPUT, 8 @remain varchar(50) OUTPUT, 9 @PostCode varchar(7) OUTPUT, 10 @Road varchar(100) output, --新增,要插入road的路名 11 @method int output, 12 @number int output 13 14 AS 15 BEGIN 16 17 DECLARE @id varchar(36) 18 declare @fid varchar(36) 19 declare @text varchar(30) 20 21 22 declare @NAddress varchar(200) 23 set @NAddress=@address 24 25 26 if @method=1 --查找方式 省市區路 27 begin 28 select top 1 @NProvinceName=title ,@id=id,@text=text from province where left(@Address,len(text)) like '%'+text+'%' order by len(text) desc 29 if (@NProvinceName is not null) 30 begin 31 set @number=@number+1 --找到省計數加1 32 select @id=id from province where title=@NProvinceName and shengid is not null 33 set @Address=stuff(@Address,charindex(@text,@Address),LEN(@text),'') 34 end 35 36 37 select top 1 @NCityName=title,@id=id,@fid=fid ,@text=text from city where left(@Address,len(text)) like '%'+text+'%' and city.fid=@id order by len(text) desc 38 if (@NCityName is not null) 39 begin 40 set @number=@number+1 41 select @id=id from city where title=@NCityName and shiid is not null 42 set @Address=stuff(@Address,charindex(@text,@Address),LEN(@text),'') 43 end 44 45 select top 1 @NDistrictName=title,@id=id,@fid=fid,@text=text from district where left(@Address,len(text)) like '%'+text+'%' and district.fid=@id order by len(text) desc 46 if (@NDistrictName is not null) 47 begin 48 set @number=@number+1 49 select @id=id from district where title=@NDistrictName and shiid is not null 50 set @Address=stuff(@Address,charindex(@text,@Address),LEN(@text),'') 51 end 52 53 select top 1 @NRoadName=title,@fid=fid,@id=id ,@PostCode=PostCode,@text=text from Road where left(@Address,len(text)) like '%'+text+'%' and road.fid=@id order by len(text) desc 54 if (@NRoadName is not null) 55 begin 56 set @Address=stuff(@Address,charindex(@text,@Address),LEN(@text),'') 57 set @number=@number+1 58 end 59 60 if @text is not null 61 set @remain=right(@NAddress,len(@NAddress)-charindex(@text,@NAddress)-len(@text)+1) 62 else 63 set @remain=@Address 64 65 if (@NRoadName is null) and (@NDistrictName is not null) 66 goto GetRoad 67 else 68 goto finished 69 end 70 71 if @method=2 --查找方式 省區路 72 begin 73 select top 1 @NProvinceName=title ,@id=id,@text=text from province where left(@Address,len(text)) like '%'+text+'%' order by len(text) desc 74 if (@NProvinceName is not null) 75 begin 76 set @number=@number+1 --找到省計數加1 77 select @id=id from province where title=@NProvinceName and shengid is not null --找省的標准ID 78 set @Address=stuff(@Address,charindex(@text,@Address),LEN(@text),'') 79 end 80 81 82 select top 1 @NDistrictName=title,@id=id,@fid=fid,@text=text from district where left(@Address,len(text)) like '%'+text+'%' 83 and district.fid in (select id from city where city.fid=@id) order by len(text) desc 84 if (@NDistrictName is not null) 85 begin 86 select @id=id from district where title=@NDistrictName and shiid is not null --找區的標准ID 87 set @number=@number+1 88 set @Address=stuff(@Address,charindex(@text,@Address),LEN(@text),'') 89 end 90 91 SELECT @NCityName=title FROM City where id=@fid --返回查找市名 92 if (@NCityName is not null) 93 set @number=@number+1 94 95 96 select top 1 @NRoadName=title,@fid=fid,@id=id ,@PostCode=PostCode,@text=text from Road where left(@Address,len(text)) like '%'+text+'%' 97 and Road.fid=@id order by len(text) desc 98 if (@NRoadName is not null) 99 begin 100 set @number=@number+1 101 set @Address=stuff(@Address,charindex(@text,@Address),LEN(@text),'') 102 end 103 104 if @text is not null 105 set @remain=right(@NAddress,len(@NAddress)-charindex(@text,@NAddress)-len(@text)+1) 106 else 107 set @remain=@Address 108 109 if (@NRoadName is null) and (@NDistrictName is not null) 110 goto GetRoad 111 else 112 goto finished 113 end 114 115 if @method=3 --查找方式 省市路 116 begin 117 select top 1 @NProvinceName=title ,@id=id,@text=text from province where left(@Address,len(text)) like '%'+text+'%' order by len(text) desc 118 if (@NProvinceName is not null) 119 begin 120 set @number=@number+1 --找到省計數加1 121 select @id=id from province where title=@NProvinceName and shengid is not null --找省的標准ID 122 set @Address=stuff(@Address,charindex(@text,@Address),LEN(@text),'') 123 end 124 125 if @NProvinceName in ('重慶市','北京市','天津市','上海市') --新增這個判斷 2009-08-25 126 begin 127 select top 1 @NCityName=title,@id=id,@fid=fid,@text=text from city where city.fid=@id 128 set @number=@number+1 129 end 130 else 131 begin 132 select top 1 @NCityName=title,@id=id,@fid=fid,@text=text from city where left(@Address,len(text)) like '%'+text+'%' 133 and city.fid=@id order by len(text) desc 134 if (@NCityName is not null) 135 begin 136 set @number=@number+1 137 select @id=id from city where title=@NCityName and shiid is not null --找市的標准ID 138 set @Address=stuff(@Address,charindex(@text,@Address),LEN(@text),'') 139 end 140 end 141 142 select top 1 @NRoadName=title,@fid=fid,@id=id ,@PostCode=PostCode,@text=text from Road where left(@Address,len(text)) like '%'+text+'%' 143 and road.fid in (select id from district where district.fid=@id) order by len(text) desc 144 if (@NRoadName is not null) 145 begin 146 set @Address=stuff(@Address,charindex(@text,@Address),LEN(@text),'') 147 set @number=@number+1 148 end 149 150 select @NDistrictName=title from district where id=@fid --返回查找區名 151 if (@NDistrictName is not null) 152 set @number=@number+1 153 154 if @text is not null 155 set @remain=right(@NAddress,len(@NAddress)-charindex(@text,@NAddress)-len(@text)+1) 156 else 157 set @remain=@Address 158 if (@NRoadName is null) and (@NDistrictName is not null) 159 goto GetRoad 160 else 161 goto finished 162 end 163 164 if @method=4 --查找方式 市區路 165 begin 166 select top 1 @NCityName=title,@id=id,@fid=fid ,@text=text from city where left(@Address,len(text)) like '%'+text+'%' order by len(text) desc 167 if (@NCityName is not null) 168 begin 169 select @id=id from city where title=@NCityName and shiid is not null --找市的標准ID 170 set @Address=stuff(@Address,charindex(@text,@Address),LEN(@text),'') 171 set @number=@number+1 172 end 173 174 select @NProvinceName=title from Province where id=@fid --返回找省名 175 if (@NProvinceName is not null) 176 begin 177 set @number=@number+1 178 end 179 180 select top 1 @NDistrictName=title,@id=id,@fid=fid, @text=text from district where left(@Address,len(text)) like '%'+text+'%' 181 and district.fid=@id order by len(text) desc 182 if (@NDistrictName is not null) 183 begin 184 set @number=@number+1 185 select @id=id from district where title=@NDistrictName and shiid is not null --找區的標准ID 186 set @Address=stuff(@Address,charindex(@text,@Address),LEN(@text),'') 187 end 188 189 190 select top 1 @NRoadName=title,@fid=fid,@id=id ,@PostCode=PostCode,@text=text from Road where left(@Address,len(text)) like '%'+text+'%' 191 and road.fid=@id order by len(text) desc 192 if (@NRoadName is not null) 193 begin 194 set @Address=stuff(@Address,charindex(@text,@Address),LEN(@text),'') 195 set @number=@number+1 196 end 197 198 if @text is not null 199 set @remain=right(@NAddress,len(@NAddress)-charindex(@text,@NAddress)-len(@text)+1) 200 else 201 set @remain=@Address 202 203 if (@NRoadName is null) and (@NDistrictName is not null) 204 goto GetRoad 205 else 206 goto finished 207 end 208 209 if @method=5 --查找方式 :市-路 210 begin 211 select top 1 @NCityName=title,@id=id,@fid=fid , @text=text from city where left(@Address,len(text)) like '%'+text+'%' order by len(text) desc 212 if (@NCityName is not null) 213 begin 214 set @number=@number+1 215 select @id=id from city where title=@NCityName and shiid is not null --找市的標准ID 216 set @Address=stuff(@Address,charindex(@text,@Address),LEN(@text),'') 217 end 218 select @NProvinceName=title from Province where id=@fid --返回找省名 219 if (@NProvinceName is not null) 220 set @number=@number+1 221 222 select top 1 @NRoadName=title,@fid=fid,@id=id ,@PostCode=PostCode ,@text=text from Road where left(@Address,len(text)) like '%'+text+'%' 223 and road.fid in (select id from district where district.fid=@id) order by len(text) desc 224 if (@NRoadName is not null) 225 begin 226 set @Address=stuff(@Address,charindex(@text,@Address),LEN(@text),'') 227 set @number=@number+1 228 end 229 230 select @NDistrictName=title from district where id=@fid --返回查找區名 231 if (@NDistrictName is not null) 232 set @number=@number+1 233 234 if @text is not null 235 set @remain=right(@NAddress,len(@NAddress)-charindex(@text,@NAddress)-len(@text)+1) 236 else 237 set @remain=@Address 238 239 if (@NRoadName is null) and (@NDistrictName is not null) 240 goto GetRoad 241 else 242 goto finished 243 end 244 245 246 GetRoad: --新增將返回@road 247 if charindex('路',@Address)>0 248 set @Road=left(@address,charindex('路',@Address)) 249 else if charindex('道',@Address)>0 250 set @Road=left(@address,charindex('道',@Address)) 251 else if charindex('街',@Address)>0 252 set @Road=left(@address,charindex('街',@Address)) 253 else if charindex('鎮',@Address)>0 254 set @Road=left(@address,charindex('鎮',@Address)) 255 else if charindex('鄉',@Address)>0 256 set @Road=left(@address,charindex('鄉',@Address)) 257 else if charindex('屯',@Address)>0 258 set @Road=left(@address,charindex('屯',@Address)) 259 else if charindex('村',@Address)>0 260 set @Road=left(@address,charindex('村',@Address)) 261 else if charindex('社',@Address)>0 262 set @Road=left(@address,charindex('社',@Address)) 263 264 set @road=@id+@road 265 finished: 266 END
execAddress是執行存儲過程,根據輸入的地址返回詳細的地址,代碼:

1 ALTER Procedure [dbo].[execAddress] 2 @inputaddress varchar(100) 3 AS 4 DECLARE @return_value int, 5 @NProvinceName varchar(30), 6 @NCityName varchar(30), 7 @NDistrictName varchar(30), 8 @NRoadName varchar(50), 9 @remain varchar(50), 10 @PostCode varchar(7), 11 @Road varchar(100), 12 @method int, 13 @number int 14 15 EXEC @return_value = [dbo].[GetAddress] 16 @address = @inputaddress, 17 @NProvinceName = @NProvinceName OUTPUT, 18 @NCityName = @NCityName OUTPUT, 19 @NDistrictName = @NDistrictName OUTPUT, 20 @NRoadName = @NRoadName OUTPUT, 21 @remain = @remain OUTPUT, 22 @PostCode = @PostCode OUTPUT, 23 @Road = @Road OUTPUT, 24 @method = @method OUTPUT, 25 @number = @number OUTPUT
以下是測試,根據輸入的地址返回省市區路的詳細地址,這里就只截圖:
1.輸入重慶新華路41-43號,點轉換,執行存儲過程execAddress,把返回的地址顯示在輸出地址里,如下圖
這里是第五種查找方式(市路)也可能是第三種查找方式(省市路)。
2.輸入廣州海珠區廣州大道客村墩和,返回廣東省廣州市海珠區廣州大道客村墩和,如下圖:
這里是第四種查找方式(市區路)。
以上就是介紹查找地址的方法,還沒用到address表,這個表是前面所說的用來搜集有詳細地址的,具體需要寫一個網頁爬蟲的程序,去爬爬有地址的網站,如大眾點評網,這里就介紹這么多了,有寫得不好的還望各位手下留情~