Python 中pandas.read_excel详细介绍


Posted in Python onJune 23, 2017

Python 中pandas.read_excel详细介绍

#coding:utf-8
import pandas as pd
import numpy as np

filefullpath = r"/home/geeklee/temp/all_gov_file/pol_gov_mon/downloads/1.xls"
#filefullpath = r"/home/geeklee/temp/all_gov_file/pol_gov_mon/downloads/26368f3a-ea03-46b9-8033-73615ed07816.xls"
df = pd.read_excel(filefullpath,skiprows=[0])
#df = pd.read_excel(filefullpath, sheetname=[0,2],skiprows=[0])
#sheetname指定为读取几个sheet,sheet数目从0开始
#如果sheetname=[0,2],那代表读取第0页和第2页的sheet
#skiprows=[0]代表读取跳过的行数第0行,不写代表不跳过标题
#df = pd.read_excel(filefullpath, sheetname=None ,skiprows=[0])

print df
print type(df)
#若果有多页,type(df)就为<type 'dict'>
#如果就一页,type(df)就为<class 'pandas.core.frame.DataFrame'>
#{0:dataframe,1:dataframe,2:dataframe}
pandas.read_excel(io, sheetname=0, header=0, skiprows=None, skip_footer=0,
 index_col=None, names=None, parse_cols=None, parse_dates=False, date_parser=None,
 na_values=None, thousands=None, convert_float=True, has_index_names=None, converters=None,
 engine=None, squeeze=False, **kwds)

Read an Excel table into a pandas DataFrame

参数解析:

io : string, path object (pathlib.Path or py._path.local.LocalPath),

  file-like object, pandas ExcelFile, or xlrd workbook. The string could be a URL. Valid URL schemes include http, ftp, s3, and file. For file URLs, a host is expected. For instance, a local file could be file://localhost/path/to/workbook.xlsx

sheetname : string, int, mixed list of strings/ints, or None, default 0

  Strings are used for sheet names, Integers are used in zero-indexed sheet positions.

  Lists of strings/integers are used to request multiple sheets.

  Specify None to get all sheets.

  str|int -> DataFrame is returned. list|None -> Dict of DataFrames is returned, with keys representing sheets.

  Available Cases

    Defaults to 0 -> 1st sheet as a DataFrame
    1 -> 2nd sheet as a DataFrame
    “Sheet1” -> 1st sheet as a DataFrame
    [0,1,”Sheet5”] -> 1st, 2nd & 5th sheet as a dictionary of DataFrames
    None -> All sheets as a dictionary of DataFrames

header : int, list of ints, default 0

  Row (0-indexed) to use for the column labels of the parsed DataFrame. If a list of integers is passed those row positions will be combined into a MultiIndex

skiprows : list-like

  Rows to skip at the beginning (0-indexed)

skip_footer : int, default 0

  Rows at the end to skip (0-indexed)

index_col : int, list of ints, default None

  Column (0-indexed) to use as the row labels of the DataFrame. Pass None if there is no such column. If a list is passed, those columns will be combined into a MultiIndex

names : array-like, default None

  List of column names to use. If file contains no header row, then you should explicitly pass header=None

converters : dict, default None

  Dict of functions for converting values in certain columns. Keys can either be integers or column labels, values are functions that take one input argument, the Excel cell content, and return the transformed content.

parse_cols : int or list, default None

    If None then parse all columns,
    If int then indicates last column to be parsed
    If list of ints then indicates list of column numbers to be parsed
    If string then indicates comma separated list of column names and column ranges (e.g. “A:E” or “A,C,E:F”)

squeeze : boolean, default False

  If the parsed data only contains one column then return a Series

na_values : list-like, default None

  List of additional strings to recognize as NA/NaN

thousands : str, default None

  Thousands separator for parsing string columns to numeric. Note that this parameter is only necessary for columns stored as TEXT in Excel, any numeric columns will automatically be parsed, regardless of display format.

keep_default_na : bool, default True

  If na_values are specified and keep_default_na is False the default NaN values are overridden, otherwise they're appended to

verbose : boolean, default False

  Indicate number of NA values placed in non-numeric columns

engine: string, default None

  If io is not a buffer or path, this must be set to identify io. Acceptable values are None or xlrd

convert_float : boolean, default True

  convert integral floats to int (i.e., 1.0 ?> 1). If False, all numeric data will be read in as floats: Excel stores all numbers as floats internally

has_index_names : boolean, default None

  DEPRECATED: for version 0.17+ index names will be automatically inferred based on index_col. To read Excel output from 0.16.2 and prior that had saved index names, use True.

return返回的结果

parsed : DataFrame or Dict of DataFrames

  DataFrame from the passed in Excel file. See notes in sheetname argument for more information on when a Dict of Dataframes is returned.

感谢阅读,希望能帮助到大家,谢谢大家对本站的支持!

Python 相关文章推荐
requests和lxml实现爬虫的方法
Jun 11 Python
在Python中执行系统命令的方法示例详解
Sep 14 Python
解决Django migrate No changes detected 不能创建表的问题
May 27 Python
Python pycharm 同时加载多个项目的方法
Jan 17 Python
详解Python3之数据指纹MD5校验与对比
Jun 11 Python
如何使用python传入不确定个数参数
Feb 18 Python
Python实现动态给类和对象添加属性和方法操作示例
Feb 29 Python
python导入库的具体方法
Jun 18 Python
python 判断一组数据是否符合正态分布
Sep 23 Python
如何利用Matlab制作一款真正的拼图小游戏
May 11 Python
Python中requests做接口测试的方法
May 30 Python
Python实现Hash算法
Mar 18 Python
python3.4用函数操作mysql5.7数据库
Jun 23 #Python
Python实现树的先序、中序、后序排序算法示例
Jun 23 #Python
详解python中 os._exit() 和 sys.exit(), exit(0)和exit(1) 的用法和区别
Jun 23 #Python
Python数据操作方法封装类实例
Jun 23 #Python
Python守护线程用法实例
Jun 23 #Python
python使用fcntl模块实现程序加锁功能示例
Jun 23 #Python
Django自定义认证方式用法示例
Jun 23 #Python
You might like
星际争霸兵种名称对照表
2020/03/04 星际争霸
PHP实现的简单三角形、矩形周长面积计算器分享
2014/11/18 PHP
php实现通过soap调用.Net的WebService asmx文件
2017/02/27 PHP
Javascript的一种模块模式
2010/09/08 Javascript
jQuery EasyUI API 中文文档 - ValidateBox验证框
2011/10/06 Javascript
js 动态加载事件的几种方法总结
2013/12/25 Javascript
浅谈JSON和JSONP区别及jQuery的ajax jsonp的使用
2014/11/23 Javascript
javascript中利用柯里化函数实现bind方法
2016/04/29 Javascript
javascript实现任务栏消息提示的简单实例
2016/05/31 Javascript
点击按钮出现60秒倒计时的简单js代码(推荐)
2016/06/07 Javascript
jQuery UI结合Ajax创建可定制的Web界面
2016/06/22 Javascript
一步一步封装自己的HtmlHelper组件BootstrapHelper(二)
2016/09/14 Javascript
Jquery针对tr td的一些实用操作方法(必看篇)
2016/10/05 Javascript
Bootstrap源码解读下拉菜单(4)
2016/12/23 Javascript
浅谈DOM的操作以及性能优化问题-重绘重排
2017/01/08 Javascript
Angular使用ControlValueAccessor创建自定义表单控件
2019/03/08 Javascript
详解Vue中的scoped及穿透方法
2019/04/18 Javascript
Vue开发之封装上传文件组件与用法示例
2019/04/25 Javascript
原生js实现碰撞检测
2020/03/12 Javascript
Python中属性和描述符的正确使用
2016/08/23 Python
简单了解Python中的几种函数
2017/11/03 Python
Python Matplotlib库安装与基本作图示例
2019/01/09 Python
python自动化测试无法启动谷歌浏览器问题
2019/10/10 Python
Python搭建HTTP服务过程图解
2019/12/14 Python
简单几步用纯CSS3实现3D翻转效果
2019/01/17 HTML / CSS
北美领先的牛仔品牌:Buffalo David Bitton
2017/05/22 全球购物
基督教卡片、励志礼品、家居装饰等:DaySpring
2018/10/12 全球购物
写出程序把一个链表中的接点顺序倒排
2014/04/28 面试题
股份合作协议书
2014/09/10 职场文书
2014国庆节幼儿园亲子活动方案
2014/09/16 职场文书
党的群众路线对照检查材料(个人)
2014/09/24 职场文书
小学五年级语文上册教学计划
2015/01/22 职场文书
班主任高考寄语
2015/02/26 职场文书
二审答辩状格式
2015/05/22 职场文书
看雷锋电影观后感
2015/06/10 职场文书
公司员工培训管理制度
2015/08/04 职场文书