Python-requests请求的超时时间

本文转载自查看原文 2019-12-13 10:53 1389 python爬虫

python程序根据url从互联网上批量下载图片时，设置HTTP或Socket超时，来防止爬虫爬取某个页面时间过长，导致程序卡置不前。

一种解决方案是全局设置：

import socket
socket.setdefaulttimeout(t)
t：代表经过t秒后，如果还未下载成功，自动跳入下一次操作，此次下载失败

另外一种解决方案是:

使用timeout 参数可以设定等待连接的秒数，如果等待超时，Requests会抛出异常

>>> requests.get('http://github.com', timeout=0.001) Traceback (most recent call last): File "<stdin>", line 1, in <module> requests.exceptions.Timeout: HTTPConnectionPool(host='github.com', port=80): Request timed out. (timeout=0.001)
>>> requests.get('https://www.baidu.com',timeout=0.5)
<Response [200]>

timeout 仅对连接过程有效，与响应体的下载无关。 timeout 并不是整个下载响应的时间限制，而是如果服务器在 timeout 秒内没有应答，将会引发一个异常（更精确地说，是在 timeout 秒内没有从基础套接字上接收到任何字节的数据时)。

第三种

import time

import requests

from requests.adapters import HTTPAdapter

s = requests.Session()

s.mount( 'http://' , HTTPAdapter(max_retries = 3 ))

s.mount( 'https://' , HTTPAdapter(max_retries = 3 ))

print (time.strftime( '%Y-%m-%d %H:%M:%S' ))

try :

r = s.get( 'http://www.google.com.hk' , timeout = 5 )

return r.text

except requests.exceptions.RequestException as e:

print (e)

print (time.strftime( '%Y-%m-%d %H:%M:%S' ))

max_retries 为最大重试次数，重试3次，加上最初的一次请求，一共是4次，所以上述代码运行耗时是20秒而不是15秒

第四种：捕获请求异常：

 
           def 
           gethtml(url): 
          
           i  
           = 
           0 
          
           while 
           i <  
           3 
           : 
          
           try 
           : 
          
           html  
           = 
           requests.get(url, timeout 
           = 
           5 
           ).text 
          
           return 
           html 
          
           except 
           requests.exceptions.RequestException: 
          
           i  
           + 
           = 
           1

免责声明！

本站转载的文章为个人学习借鉴使用，本站对版权不负任何法律责任。如果侵犯了您的隐私权益，请联系本站邮箱yoyou2525@163.com删除。

猜您在找 处理 Python-requests请求的超时时间 Python-requests设置请求的超时时间 python-requests包请求响应时间 httpclient: 设置请求的超时时间，连接超时时间等 python之requests获取响应时间(elapsed)与超时时间设置（timeout） Python-requests模块 python-requests python-requests模块 python-requests向服务器发送请求（post/get） GuzzleHttp 请求设置超时时间