python网页请求urllib2模块简单封装代码


Posted in Python onFebruary 07, 2014

对python网页请求模块urllib2进行简单的封装。

例子:

#!/usr/bin/python
#coding: utf-8
import base64
import urllib
import urllib2
import time
class SendRequest:
  '''
  This class use to set and request the http, and get the info of response.
  e.g. set Authorization Type, request tyep..
  e.g. get html content, state code, cookie..
  SendRequest('http://10.75.0.103:8850/2/photos/square/type.json', 
              data='source=216274069', type='POST', auth='base',
     user='zl2010', password='111111')
  '''
  def __init__(self, url, data=None, type='GET', auth=None, user=None, password=None, cookie = None, **header):
    '''
    url:request, raise error if none
    date: data for post or get, must be dict type
    type: GET, POST
    auth: option, if has the value must be 'base' or 'cookie'
    user: user for auth
    password: password for auth
    cookie: if request with cookie 
    other header info: 
    e.g. referer='www.sina.com.cn'    
    '''
    self.url = url
    self.data = data
    self.type = type
    self.auth = auth
    self.user = user
    self.password = password 
    self.cookie = cookie 
    if 'referer' in header:
      self.referer = header[referer]
    else:
      self.referer = None
    if 'user-agent' in header:
      self.user_agent = header[user-agent]
    else:
      self.user_agent = None
    self.setup_request()
    self.send_request() 
  def setup_request(self):
    '''
    setup a request 
    '''
    if self.url == None or self.url == '':
      raise 'The url should not empty!'
    # set request type 
    #print self.url
    #print self.type
    #print self.data
    #print self.auth
    #print self.user
    #print self.password  
    if self.type == 'POST': 
      self.Req = urllib2.Request(self.url, self.data)
    elif self.type == 'GET':
      if self.data == None:
          self.Req = urllib2.Request(self.url)
      else:
        self.Req = urllib2.Request(self.url + '?' + self.data)
    else:
      print 'The http request type NOT support now!'
    ##set auth type 
    if self.auth == 'base':
      if self.user == None or self.password == None:
        raise 'The user or password was not given!'
      else:
        auth_info = base64.encodestring(self.user + ':' + self.password).replace('\n','')
        auth_info = 'Basic ' + auth_info 
        #print auth_info  
        self.Req.add_header("Authorization", auth_info)
    elif self.auth == 'cookie':
      if self.cookie == None:
        raise 'The cookie was not given!'
      else:
        self.Req.add_header("Cookie", self.cookie) 
    else:
      pass    ##add other auth type here 
    ##set other header info 
    if self.referer:
      self.Req.add_header('referer', self.referer)
    if self.user_agent:
      self.Req.add_header('user-agent', self.user_agent)
      
  def send_request(self):  
    '''
    send a request 
    '''
    # get a response object 
    try:
      self.Res = urllib2.urlopen(self.Req)
      self.source = self.Res.read()
      self.goal_url = self.Res.geturl()
      self.code = self.Res.getcode()
      self.head_dict = self.Res.info().dict
      self.Res.close()
    except urllib2.HTTPError, e:
      self.code = e.code
      print e
        
  def get_code(self):
    return self.code
  def get_url(self):
    return self.goal_url
  def get_source(self):        
    return self.source
  def get_header_info(self):
    return self.head_dict
  def get_cookie(self):
    if 'set-cookie' in self.head_dict:
      return self.head_dict['set-cookie']
    else:
      return None    
  def get_content_type(self):
    if 'content-type' in self.head_dict:
      return self.head_dict['content-type']
    else:
      return None
  def get_expires_time(self):
    if 'expires' in self.head_dict:
      return self.head_dict['expires']
    else:
      return None    
  def get_server_name(self):
    if 'server' in self.head_dict:
      return self.head_dict['server']
    else:
      return None   
  def __del__(self):
    pass   
__all__ = [SendRequest,]
if __name__ == '__main__':
  '''
  The example for using the SendRequest class 
  '''
  value = {'source':'216274069'}
  data = urllib.urlencode(value)
  url = 'http://10.75.0.103:8850/2/photos/square/type.json'
  user = 'wz_0001'
  password = '111111'
  auth = 'base'
  type = 'POST'
  t2 = time.time()
  rs = SendRequest('http://www.google.com')
  #rs = SendRequest(url, data=data, type=type, auth=auth, user=user, password=password)
  print 't2: ' + str(time.time() - t2)
  print '---------------get_code()---------------'
  print rs.get_code()
  print '---------------get_url()---------------'
  print rs.get_url()
  print '---------------get_source()---------------'
  print rs.get_source()
  print '---------------get_cookie()---------------'
  print rs.get_cookie()
  rs = None
Python 相关文章推荐
一个计算身份证号码校验位的Python小程序
Aug 15 Python
用Python抢过年的火车票附源码
Dec 07 Python
Python tkinter事件高级用法实例
Jan 31 Python
django1.11.1 models 数据库同步方法
May 30 Python
flask session组件的使用示例
Dec 25 Python
Python这样操作能存储100多万行的xlsx文件
Apr 16 Python
opencv python在视屏上截图功能的实现
Mar 05 Python
python argparse模块通过后台传递参数实例
Apr 20 Python
python中os包的用法
Jun 01 Python
Python绘制组合图的示例
Sep 18 Python
python 如何区分return和yield
Sep 22 Python
Django配置跨域并开发测试接口
Nov 04 Python
python解析xml模块封装代码
Feb 07 #Python
python 解析XML python模块xml.dom解析xml实例代码
Feb 07 #Python
python合并文本文件示例
Feb 07 #Python
python实现哈希表
Feb 07 #Python
python处理cookie详解
Feb 07 #Python
urllib2自定义opener详解
Feb 07 #Python
python解析html开发库pyquery使用方法
Feb 07 #Python
You might like
抓取YAHOO股票报价的类
2009/05/15 PHP
Windows下利用Gvim写PHP产生中文乱码问题解决方法
2011/04/20 PHP
PHP 线程安全与非线程安全版本的区别深入解析
2013/08/06 PHP
php页面防重复提交方法总结
2013/11/25 PHP
CodeIgniter实现更改view文件夹路径的方法
2014/07/04 PHP
PHP5全版本绕过open_basedir读文件脚本漏洞详细介绍
2015/01/20 PHP
php设置页面超时时间解决方法
2015/09/22 PHP
javascript 写类方式之十
2009/07/05 Javascript
Javascript操作URL函数修改版
2013/11/07 Javascript
JavaScript设计模式之单例模式实例
2014/09/24 Javascript
《JavaScript DOM 编程艺术》读书笔记之JavaScript 简史
2015/01/09 Javascript
AngularJS表达式讲解及示例代码
2016/08/16 Javascript
Vue.js仿微信聊天窗口展示组件功能
2017/08/11 Javascript
AjaxUpLoad.js实现文件上传功能
2018/03/02 Javascript
vue 解决addRoutes动态添加路由后刷新失效问题
2018/07/02 Javascript
关于微信小程序bug记录与解决方法
2018/08/15 Javascript
JavaScript对象原型链原理详解
2020/02/05 Javascript
[01:02:07]Liquid vs Newbee 2019国际邀请赛小组赛 BO2 第一场 8.15
2019/08/16 DOTA
总结Python编程中函数的使用要点
2016/03/20 Python
Python利用matplotlib生成图片背景及图例透明的效果
2017/04/27 Python
关于python pyqt5安装失败问题的解决方法
2017/08/08 Python
浅谈Django自定义模板标签template_tags的用处
2017/12/20 Python
python+numpy实现的基本矩阵操作示例
2019/07/19 Python
python中字典按键或键值排序的实现代码
2019/08/27 Python
python读取文件指定行内容实例讲解
2020/03/02 Python
Python Tornado之跨域请求与Options请求方式
2020/03/28 Python
Python实现GIF图倒放
2020/07/16 Python
matplotlib运行时配置(Runtime Configuration,rc)参数rcParams解析
2021/01/05 Python
美国网上眼镜商城:Zenni Optical
2016/11/20 全球购物
HolidayLettings英国:预订最好的度假公寓、别墅和自助式住宿
2019/08/27 全球购物
工程造价管理专业大专生求职信
2013/10/06 职场文书
计算机数据库专业职业生涯规划书
2014/02/08 职场文书
财务主管岗位职责
2014/02/28 职场文书
毕业论文评语大全
2014/04/29 职场文书
Python基础之Socket通信原理
2021/04/22 Python
Python手拉手教你爬取贝壳房源数据的实战教程
2021/05/21 Python