编程 Python

python实现下载指定网址所有图片的方法

Posted in Python onAugust 08, 2015

本文实例讲述了python实现下载指定网址所有图片的方法。分享给大家供大家参考。具体实现方法如下：

#coding=utf-8
#download pictures of the url
#useage: python downpicture.py www.baidu.com
import os
import sys
from html.parser import HTMLParser
from urllib.request import urlopen
from urllib.parse import urlparse
def getpicname(path):
  '''  retrive filename of url    '''
  if os.path.splitext(path)[1] == '':
    return None
  pr=urlparse(path)
  path='http://'+pr[1]+pr[2]
  return os.path.split(path)[1]
def saveimgto(path, urls):
  '''
  save img of url to local path
  '''
  if not os.path.isdir(path):
    print('path is invalid')
    sys.exit()
  else:
    for url in urls:
      of=open(os.path.join(path, getpicname(url)), 'w+b')
      q=urlopen(url)
      of.write(q.read())
      q.close()
      of.close()
class myhtmlparser(HTMLParser):
  '''put all src of img into urls'''
  def __init__(self):
    HTMLParser.__init__(self)
    self.urls=list()
    self.num=0
  def handle_starttag(self, tag, attr):
    if tag.lower() == 'img':
      srcs=[u[1] for u in attr if u[0].lower() == 'src']
      self.urls.extend(srcs)
      self.num = self.num+1
if __name__ == '__main__':
  url=sys.argv[1]
  if not url.startswith('http://'):
    url='http://' + sys.argv[1]
  parseresult=urlparse(url)
  domain='http://' + parseresult[1]
  q=urlopen(url)
  content=q.read().decode('utf-8', 'ignore')
  q.close()
  myparser=myhtmlparser()
  myparser.feed(content)
  for u in myparser.urls:
    if (u.startswith('//')):
      myparser.urls[myparser.urls.index(u)]= 'http:'+u
    elif u.startswith('/'):
      myparser.urls[myparser.urls.index(u)]= domain+u
  saveimgto(r'D:\python\song', myparser.urls)
  print('num of download pictures is {}'.format(myparser.num))

运行结果如下：

num of download pictures is 19

希望本文所述对大家的Python程序设计有所帮助。

python实现下载指定网址所有图片的方法

- Author -

皮蛋

声明：登载此文出于传递更多信息之目的，并不意味着赞同其观点或证实其描述。

Python 相关文章推荐

详解Python中的__init__和__new__

Mar 12 Python

Python学习pygal绘制线图代码分享

Dec 09 Python

使用apidocJs快速生成在线文档的实例讲解

Feb 07 Python

Python基于递归算法求最小公倍数和最大公约数示例

Jul 27 Python

Python3 Post登录并且保存cookie登录其他页面的方法

Dec 28 Python

python3+selenium实现126邮箱登陆并发送邮件功能

Jan 23 Python

Django中ajax发送post请求报403错误CSRF验证失败解决方案

Aug 13 Python

如何基于python操作json文件获取内容

Dec 24 Python

Python列表切片常用操作实例解析

Mar 10 Python

对python中各个response的使用说明

Mar 28 Python

python 实现图片裁剪小工具

Feb 02 Python

Python可变集合和不可变集合的构造方法大全

Dec 06 Python

Python实现多线程抓取妹子图

Aug 08 #Python

通过Python来使用七牛云存储的方法详解

Aug 07 #Python

Python爬虫框架Scrapy实战之批量抓取招聘信息

Aug 07 #Python

深入理解Python中命名空间的查找规则LEGB

Aug 06 #Python

举例详解Python中yield生成器的用法

Aug 05 #Python

Python中return语句用法实例分析

Aug 04 #Python

python函数形参用法实例分析

Aug 04 #Python