Example: Fetching Images with a Python Crawler and Saving Them Locally

Posted in Python on June 01, 2018

1. Scrape the images posted on Jandan (煎蛋网, jandan.net).

2. The code is as follows:

import os
import urllib.request


def url_open(url):
    """Fetch a URL and return the raw response bytes."""
    req = urllib.request.Request(url)
    # Send a browser-like User-Agent with the request.
    req.add_header('User-Agent', 'Mozilla/5.0 (Windows NT 6.3; WOW64; rv:51.0) Gecko/20100101 Firefox/51.0')
    # Pass the Request object to urlopen so that the header is sent.
    with urllib.request.urlopen(req) as response:
        return response.read()


def get_page(url):
    """Return the current page number shown on the page, as a string such as '2356'."""
    html = url_open(url).decode('utf-8')
    # Skip 23 characters from the start of 'current-comment-page' to land just after the '['.
    a = html.find('current-comment-page') + 23
    b = html.find(']', a)
    return html[a:b]


def find_imgs(url):
    """Return a list of the .jpg image addresses found on the page."""
    html = url_open(url).decode('utf-8')
    img_addrs = []
    a = html.find('img src=')
    while a != -1:
        # Look for '.jpg' within the next 255 characters; find() returns -1 if it is absent.
        b = html.find('.jpg', a, a + 255)
        if b != -1:
            # a + 9 skips 'img src="'. The code assumes protocol-relative
            # addresses ('//host/path.jpg'), so it prepends 'http:'.
            img_addrs.append('http:' + html[a + 9:b + 4])
        else:
            b = a + 9
        a = html.find('img src=', b)
    return img_addrs


def save_imgs(folder, img_addrs):
    """Download every image into folder, named after the last segment of its address."""
    for each in img_addrs:
        filename = each.split('/')[-1]
        img = url_open(each)
        with open(os.path.join(folder, filename), 'wb') as f:
            f.write(img)


def download_mm(folder='mm', pages=10):
    # Create the target folder; do nothing if it already exists.
    os.makedirs(folder, exist_ok=True)
    url = 'http://jandan.net/ooxx/'
    page_num = int(get_page(url))

    for i in range(pages):
        # Step back one page per iteration, starting from the newest page.
        page_url = url + 'page-' + str(page_num - i) + '#comments'
        img_addrs = find_imgs(page_url)
        save_imgs(folder, img_addrs)


if __name__ == '__main__':
    download_mm()
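
The string searches in find_imgs depend on the exact spelling `img src="` and on fixed character offsets, so a small change in the page markup can break them. As a possible alternative (not the author's method), the sketch below collects image addresses with the standard library's html.parser module. The names ImgSrcParser and find_imgs_in_html and the sample HTML are made up for illustration, and the sketch assumes, as the code above does, that protocol-relative addresses should get an `http:` prefix.

```python
from html.parser import HTMLParser


class ImgSrcParser(HTMLParser):
    """Hypothetical helper: collect the src of every <img> tag that ends in .jpg."""

    def __init__(self):
        super().__init__()
        self.img_addrs = []

    def handle_starttag(self, tag, attrs):
        if tag != 'img':
            return
        for name, value in attrs:
            if name == 'src' and value and value.endswith('.jpg'):
                # Assumption: protocol-relative addresses ('//host/a.jpg')
                # should be fetched over http, as in the article's code.
                if value.startswith('//'):
                    value = 'http:' + value
                self.img_addrs.append(value)


def find_imgs_in_html(html):
    """Return the .jpg addresses found in an HTML string."""
    parser = ImgSrcParser()
    parser.feed(html)
    return parser.img_addrs


# Made-up sample markup; no network access is needed to try it.
sample = ('<p><img src="//example.com/a.jpg">'
          '<img alt="x" src="//example.com/b.png">'
          '<img src="http://example.com/c.jpg"></p>')
print(find_imgs_in_html(sample))
```

To try it with the script above, decode the bytes returned by url_open(page_url) and pass the resulting string to find_imgs_in_html. Whether the live page still uses plain `img` tags with `.jpg` addresses is not something this sketch checks.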

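That is everything the editor has to share about this example of fetching images with a Python crawler and saving them locally. We hope it serves as a useful reference, and we hope you will continue to support 三水点靠木.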

- Author -

钏的博客
