
Exporting data to an Excel file and making it downloadable in a Django project

Posted in Python on March 13, 2020

Dependencies

Install xlwt: pip install xlwt

Backend

view.py

from io import BytesIO

import xlwt
from django.http import HttpResponse

from .models import place  # model from the original project (import path assumed)


# Export an Excel file
def export_excel(request):
    """Export the rows for the requested city as an .xls download."""
    city = request.POST.get('city', '')
    list_obj = place.objects.filter(city=city)
    # Set the content type and download file name of the HttpResponse
    response = HttpResponse(content_type='application/vnd.ms-excel')
    response['Content-Disposition'] = 'attachment;filename=' + city + '.xls'
    if list_obj:
        # Create the workbook
        ws = xlwt.Workbook(encoding='utf-8')
        # Add the first worksheet, named "sheet1"
        w = ws.add_sheet('sheet1')
        # Write the header row
        w.write(0, 0, 'Place')
        w.write(0, 1, 'Count')
        w.write(0, 2, 'Longitude')
        w.write(0, 3, 'Latitude')
        # Write the data, one worksheet row per object
        excel_row = 1
        for obj in list_obj:
            w.write(excel_row, 0, obj.place)
            w.write(excel_row, 1, obj.sum)
            w.write(excel_row, 2, obj.lng)
            w.write(excel_row, 3, obj.lat)
            excel_row += 1
        # Save the workbook to an in-memory buffer
        output = BytesIO()
        ws.save(output)
        # Copy the buffer contents into the response body
        response.write(output.getvalue())
    return response
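The view above concatenates `city` straight into the Content-Disposition header, which may produce a garbled file name when the city contains non-ASCII characters (likely for Chinese place names). Below is a minimal sketch of one way to build the header value, assuming the client understands the `filename*` parameter from RFC 6266. The helper name `build_content_disposition` is hypothetical and is not part of Django or xlwt.

```python
from urllib.parse import quote


def build_content_disposition(name, extension=".xls"):
    """Return a Content-Disposition value that tolerates non-ASCII names.

    Hypothetical helper for illustration only.
    """
    full_name = name + extension
    # ASCII-only fallback: characters that cannot be encoded become "_"
    fallback = full_name.encode("ascii", "replace").decode("ascii")
    fallback = fallback.replace("?", "_").replace('"', "_")
    # Percent-encoded UTF-8 form for the filename* parameter
    encoded = quote(full_name, safe="")
    return "attachment; filename=\"%s\"; filename*=UTF-8''%s" % (fallback, encoded)


print(build_content_disposition("beijing"))
print(build_content_disposition("北京"))
```

In the view, the result would be assigned to `response['Content-Disposition']` in place of the plain string concatenation.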

Frontend

<button id="export_excel" type="button" class="btn btn-primary col-sm-5" style="margin-left: 10px" >Export to Excel</button>

$("#export_excel").click(function () {
    var csrf = $('input[name="csrfmiddlewaretoken"]').val();
    const req = new XMLHttpRequest();
    req.open('POST', '/export_excel/', true);
    req.responseType = 'blob';
    // Set the request header for form-encoded data
    req.setRequestHeader('Content-Type', 'application/x-www-form-urlencoded');
    // Register the handler before sending the request
    req.onload = function () {
        const blob = new Blob([req.response]);
        const blobUrl = window.URL.createObjectURL(blob);
        download(blobUrl);
    };
    // Request parameters: URL-encoded and joined with a single "&"
    req.send('city=' + encodeURIComponent($('#city').val()) +
             '&csrfmiddlewaretoken=' + encodeURIComponent(csrf));
});

function download(blobUrl) {
    // Available for building the file name, e.g. city + '.xls'
    var city = $("input[name='city']").val();
    const a = document.createElement('a');
    a.style.display = 'none';
    a.download = '<file name>'; // placeholder: replace with the desired file name
    a.href = blobUrl;
    // The link has to be in the document, otherwise removeChild below throws
    document.body.appendChild(a);
    a.click();
    document.body.removeChild(a);
}

Supplementary notes: downloading millions or tens of millions of MySQL rows with Python Django, and avoiding MemoryError and nginx timeouts

Background

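Django projects often need to offer file downloads, and Django provides several response classes for this: FileResponse, StreamingHttpResponse and HttpResponse. FileResponse and StreamingHttpResponse both produce their data by consuming an iterator, so they suit large transfers. HttpResponse holds the entire payload in memory before returning it to the user, so it is prone to MemoryError and nginx timeouts (fetching and returning too much data at once exhausts memory or exceeds the nginx timeout). The Django documentation describes all three clearly: https://docs.djangoproject.com/en/1.11/ref/request-response/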

In practice we normally use FileResponse and StreamingHttpResponse: because they stream from an iterator, the data goes back to the client piece by piece instead of being assembled in memory first.
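To make the iterator idea concrete, here is a minimal plain-Python sketch with no Django involved. The names `produce_rows` and `log` are made up for illustration; the point is only that a generator builds each piece when the consumer asks for it, which is the behaviour a streaming response relies on.

```python
def produce_rows(total, log):
    """Yield CSV-like lines one at a time, recording when each is built."""
    for i in range(total):
        log.append(i)  # record that row i was actually generated
        yield "row-%d\n" % i


log = []
stream = produce_rows(1_000_000, log)

# Nothing has been generated yet: the generator body has not started running
print(len(log))

# Consuming three items generates exactly three rows, not a million
first_three = [next(stream) for _ in range(3)]
print(first_three, len(log))
```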

FileResponse and StreamingHttpResponse

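FileResponse, as the name suggests, opens a file and transfers it, and the size of the chunk sent at a time can be specified. It therefore suits returning large files from the server. Its drawback is that it cannot pull content from the database on the fly and send it to the client. For example: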

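from django.http import FileResponse


def download(request):
    # FileResponse closes the file itself once the response has been sent
    file = open('path/demo.py', 'rb')
    response = FileResponse(file)
    response['Content-Type'] = 'application/octet-stream'
    response['Content-Disposition'] = 'attachment;filename="demo.py"'
    return response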
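The chunked transfer mentioned above can be pictured with a small sketch. This is not Django's implementation, only an illustration of reading a file-like object in fixed-size pieces; `iter_chunks` and the 4096-byte default are assumptions chosen for the example.

```python
import io


def iter_chunks(file_obj, chunk_size=4096):
    """Yield successive chunks from a binary file-like object."""
    # iter(callable, sentinel) keeps calling read() until it returns b""
    for chunk in iter(lambda: file_obj.read(chunk_size), b""):
        yield chunk


# In-memory stand-in for a file opened with open(path, 'rb')
fake_file = io.BytesIO(b"x" * 10000)
sizes = [len(chunk) for chunk in iter_chunks(fake_file)]
print(sizes)
```

Only one chunk is held in memory at a time, which is why this style copes with files far larger than the available RAM.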

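As shown above, the opened file is passed to FileResponse and then the response headers are set. This is clearly not convenient for sending database content, though, so StreamingHttpResponse is the recommended approach here.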

Here PyMySQL fetches the data, which is then returned as CSV. The code is as follows:

# Fetch the data with pymysql
import pymysql
from django.http import StreamingHttpResponse

# Maps MySQL type codes to type names for the column lookup below; most entries are omitted here
field_types = {
    1: 'tinyint',
    2: 'smallint',
    3: 'int'}


# The view name is illustrative; `sql` and `out_file_name` come from project code not shown here
def download_csv(request):
    conn = pymysql.connect(host='127.0.0.1', port=3306, database='demo', user='root', password='root')
    cursor = conn.cursor(cursor=pymysql.cursors.DictCursor)
    cursor.execute(sql)
    # Fetch all rows
    data = cursor.fetchall()
    cols = {}
    # Collect every column; a repeated name is prefixed with its index
    for i, row in enumerate(cursor.description):
        if row[0] in cols:
            cols[str(i) + row[0]] = field_types.get(row[1], str(row[1]))
        else:
            cols[row[0]] = field_types.get(row[1], str(row[1]))
    cursor.close()
    conn.close()

    # Return the data as CSV through StreamingHttpResponse
    response = StreamingHttpResponse(get_result_fromat(data, cols))
    response['Content-Type'] = 'application/octet-stream'
    response['Content-Disposition'] = 'attachment;filename="{0}"'.format(out_file_name)
    return response

# Loop over all rows and emit them under the column names; note that this must be a generator
def get_result_fromat(data, cols):
    tmp_str = ""
    # First line of the file: the column names
    for col in cols:
        tmp_str += '"%s",' % (col)
    yield tmp_str.strip(",") + "\n"
    # One line per data row
    for row in data:
        tmp_str = ""
        for col in cols:
            tmp_str += '"%s",' % (str(row[col]))
        yield tmp_str.strip(',') + "\n"

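That is the whole code. It has roughly three parts: fetching the data from MySQL, formatting it into the desired format (Excel, CSV, TXT and so on; CSV is used here, and if you are interested in other formats, leave a comment), and finally returning it through StreamingHttpResponse.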
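One caveat with the hand-built `'"%s",'` formatting is that a double quote inside a value is not escaped, so such a value can break the CSV. A possible alternative, sketched below, is to let the standard `csv` module do the quoting while still yielding one line at a time. It follows the write-through pseudo-buffer pattern described in Django's documentation for streaming large CSV files; the names `Echo` and `iter_csv_lines` and the sample row are illustrative assumptions.

```python
import csv


class Echo:
    """File-like object whose write() returns the value instead of storing it."""

    def write(self, value):
        return value


def iter_csv_lines(rows, cols):
    """Yield one properly escaped CSV line per row, header first."""
    writer = csv.writer(Echo(), quoting=csv.QUOTE_ALL)
    yield writer.writerow(cols)
    for row in rows:
        yield writer.writerow([row[col] for col in cols])


# Sample data containing a quote and a comma inside the values
rows = [{"name": 'He said "hi"', "note": "a,b"}]
lines = list(iter_csv_lines(rows, ["name", "note"]))
print(lines)
```

A generator like this could be passed to StreamingHttpResponse in the same way as the hand-written one.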

Downloading millions of rows

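The code above can handle downloads of tens of thousands of rows, even a hundred thousand or more, but beyond 200,000 rows it struggles. With about 1 GB of free memory on my machine, a MemoryError was raised at roughly 150,000 rows. The cause is fetchall: although StreamingHttpResponse sends the rows one by one, the data is still fetched from the database in a single batch!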

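How can this be solved? Here is my approach and reasoning:

Replace fetchall with fetchone, and yield the result of each fetchone call

This still raised MemoryError, because execute loads the whole result set at once. It turned out that a streaming (unbuffered) cursor can replace the regular one, that is, SSDictCursor instead of DictCursor

So the parts of the code that need to change are: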

cursor = conn.cursor(cursor=pymysql.cursors.DictCursor) ===>
cursor = conn.cursor(cursor=pymysql.cursors.SSDictCursor)

data = cursor.fetchall() ===>
row = cursor.fetchone()  # now called repeatedly inside the generator below

def get_result_fromat(data, cols):
    tmp_str = ""
    # First line of the file: the column names
    for col in cols:
        tmp_str += '"%s",' % (col)
    yield tmp_str.strip(",") + "\n"
    for row in data:
        tmp_str = ""
        for col in cols:
            tmp_str += '"%s",' % (str(row[col]))
        yield tmp_str.strip(',') + "\n"

    =====>

# Call it as get_result_fromat(cursor, cols). The cursor and the connection must
# stay open until the last row has been streamed, so close them only afterwards.
def get_result_fromat(cursor, cols):
    tmp_str = ""
    for col in cols:
        tmp_str += '"%s",' % (col)
    yield tmp_str.strip(",") + "\n"
    while True:
        # Fetch the next row from the server; None means no rows are left
        row = cursor.fetchone()
        if row is None:
            break
        tmp_str = ""
        for col in cols:
            tmp_str += '"%s",' % (str(row[col]))
        yield tmp_str.strip(',') + "\n"

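As you can see, the while True loop keeps fetching rows as the download proceeds. This avoids both the out-of-memory error from pulling everything out of MySQL at once and the nginx timeout caused by a fetch that takes too long!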
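A variation on the row-by-row loop is to pull rows in batches with the DB-API `fetchmany` call, which may reduce per-call overhead while still keeping memory bounded. The sketch below uses an in-memory `sqlite3` database purely as a stand-in so that it runs without a MySQL server; the helper name `iter_rows`, the batch size and the table layout are assumptions for illustration. With PyMySQL the same generator would be given the SSDictCursor after `execute`.

```python
import sqlite3


def iter_rows(cursor, batch_size=1000):
    """Yield rows from an executed cursor, fetching batch_size rows at a time."""
    while True:
        batch = cursor.fetchmany(batch_size)
        if not batch:
            break
        for row in batch:
            yield row


# sqlite3 in-memory database used only as a stand-in for MySQL
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE place (name TEXT, total INTEGER)")
conn.executemany("INSERT INTO place VALUES (?, ?)",
                 [("p%d" % i, i) for i in range(2500)])
cursor = conn.execute("SELECT name, total FROM place ORDER BY total")

# Consume the generator; at most one batch is held in memory at a time
count = sum(1 for _ in iter_rows(cursor, batch_size=1000))
print(count)
conn.close()
```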

Summary

That is all on downloads. It is fairly simple. Thanks for reading, and I hope it serves as a useful reference.


- Author -

matrix_theOne
