Python之讀寫Excel


 

現有的Excel分為兩種格式:xls(Excel 97-2003)和xlsx(Excel 2007及以上)。

Python處理Excel文件主要是第三方模塊庫xlrd、xlwt、pyexcel-xls、xluntils和pyExcel-erator等,此外Pandas中也帶有可以讀取Excel文件的模塊(read_excel)。

基於擴展知識的目的,本文使用xlrd模塊讀取Excel數據。

 

[安裝]

# 讀取
pip install xlrd
# 寫入
pip install xlwt

 

xlrd使用:

import xlrd  # 導入庫
# 打開文件
xlsx = xlrd.open_workbook('demo.xlsx')
# 查看所有sheet列表
print('All sheets: %s' % xlsx.sheet_names())

如果只有一個sheet的話, 會輸出:
All sheets: ['Sheet1']


查看sheet中數據
sheet1 = xlsx.sheets()[0]    # 獲得第1張sheet,索引從0開始
sheet1_name = sheet1.name    # 獲得名稱
sheet1_cols = sheet1.ncols   # 獲得列數
sheet1_nrows = sheet1.nrows  # 獲得行數
print('Sheet1 Name: %s\nSheet1 cols: %s\nSheet1 rows: %s' % (sheet1_name, sheet1_cols, sheet1_nrows))
輸出:
sheet1 = xlsx.sheets()[0]    # 獲得第1張sheet,索引從0開始
sheet1_name = sheet1.name    # 獲得名稱
sheet1_cols = sheet1.ncols   # 獲得列數
sheet1_nrows = sheet1.nrows  # 獲得行數
print('Sheet1 Name: %s\nSheet1 cols: %s\nSheet1 rows: %s' % (sheet1_name, sheet1_cols, sheet1_nrows))


查看sheet每行數據明細:
for i in range(sheet1_nrows):  # 逐行打印sheet1數據
    print(sheet1.row_values(i))

輸出:

['ID_number', 'Status', 'Create_Time', 'Business_City']
['431381198109106573', '有效', 42725.0, '深圳市']
['431381198809122734', '有效', 42725.0, '深圳市']
…
['431381198901176911', '有效', 42725.0, '深圳市']
['43138119870827275X', '有效', 42725.0, '深圳市']

  上述操作只是將數據從Excel中讀取出來,將讀取的數據轉換為數組便可以進行矩陣計算。由於矩陣計算大多是基於數值型數據實現的,因此上述數據將無法適用於大多數科學計算場景,這點需要注意。

 

 

xlwt使用:

import xlwt
# 創建一個workbook 設置編碼
workbook = xlwt.Workbook(encoding = 'utf-8')
# 創建一個worksheet
worksheet = workbook.add_sheet('My Worksheet')

# 寫入excel
# 參數對應 行, 列, 值
worksheet.write(1,0, label = 'this is test')

# 保存
workbook.save('Excel_test.xls')

  更多demo:

import xlwt

workbook = xlwt.Workbook(encoding = 'ascii')
worksheet = workbook.add_sheet('My Worksheet')
style = xlwt.XFStyle() # 初始化樣式
font = xlwt.Font() # 為樣式創建字體
font.name = 'Times New Roman' 
font.bold = True # 黑體
font.underline = True # 下划線
font.italic = True # 斜體字
style.font = font # 設定樣式
worksheet.write(0, 0, 'Unformatted value') # 不帶樣式的寫入

worksheet.write(1, 0, 'Formatted value', style) # 帶樣式的寫入

workbook.save('formatting.xls') # 保存文件




設置單元格寬度:

import xlwt

workbook = xlwt.Workbook()
worksheet = workbook.add_sheet('My Sheet')
worksheet.write(0, 0,'My Cell Contents')

# 設置單元格寬度
worksheet.col(0).width = 3333
workbook.save('cell_width.xls')



輸入日期到單元格:
import xlwt
import datetime
workbook = xlwt.Workbook()
worksheet = workbook.add_sheet('My Sheet')
style = xlwt.XFStyle()
style.num_format_str = 'M/D/YY' # Other options: D-MMM-YY, D-MMM, MMM-YY, h:mm, h:mm:ss, h:mm, h:mm:ss, M/D/YY h:mm, mm:ss, [h]:mm:ss, mm:ss.0
worksheet.write(0, 0, datetime.datetime.now(), style)
workbook.save('Excel_Workbook.xls')


向單元格添加公式:
import xlwt
workbook = xlwt.Workbook()
worksheet = workbook.add_sheet('My Sheet')
worksheet.write(0, 0, 5) # Outputs 5
worksheet.write(0, 1, 2) # Outputs 2
worksheet.write(1, 0, xlwt.Formula('A1*B1')) # Should output "10" (A1[5] * A2[2])
worksheet.write(1, 1, xlwt.Formula('SUM(A1,B1)')) # Should output "7" (A1[5] + A2[2])
workbook.save('Excel_Workbook.xls')



單元格添加超鏈接:
import xlwt
workbook = xlwt.Workbook()
worksheet = workbook.add_sheet('My Sheet')
worksheet.write(0, 0, xlwt.Formula('HYPERLINK("http://www.google.com";"Google")')) # Outputs the text "Google" linking to http://www.google.com
workbook.save('Excel_Workbook.xls')

合並列和行:
import xlwt
workbook = xlwt.Workbook()
worksheet = workbook.add_sheet('My Sheet')
worksheet.write_merge(0, 0, 0, 3, 'First Merge') # Merges row 0's columns 0 through 3.
font = xlwt.Font() # Create Font
font.bold = True # Set font to Bold
style = xlwt.XFStyle() # Create Style
style.font = font # Add Bold Font to Style
worksheet.write_merge(1, 2, 0, 3, 'Second Merge', style) # Merges row 1 through 2's columns 0 through 3.
workbook.save('Excel_Workbook.xls')


設置單元格內容的對齊方式:
import xlwt
workbook = xlwt.Workbook()
worksheet = workbook.add_sheet('My Sheet')
alignment = xlwt.Alignment() # Create Alignment
alignment.horz = xlwt.Alignment.HORZ_CENTER # May be: HORZ_GENERAL, HORZ_LEFT, HORZ_CENTER, HORZ_RIGHT, HORZ_FILLED, HORZ_JUSTIFIED, HORZ_CENTER_ACROSS_SEL, HORZ_DISTRIBUTED
alignment.vert = xlwt.Alignment.VERT_CENTER # May be: VERT_TOP, VERT_CENTER, VERT_BOTTOM, VERT_JUSTIFIED, VERT_DISTRIBUTED
style = xlwt.XFStyle() # Create Style
style.alignment = alignment # Add Alignment to Style
worksheet.write(0, 0, 'Cell Contents', style)
workbook.save('Excel_Workbook.xls')

為單元格添加邊框:
import xlwt
workbook = xlwt.Workbook()
worksheet = workbook.add_sheet('My Sheet')
borders = xlwt.Borders() # Create Borders
borders.left = xlwt.Borders.DASHED 
    DASHED虛線
    NO_LINE沒有
    THIN實線
    
# May be: NO_LINE, THIN, MEDIUM, DASHED, DOTTED, THICK, DOUBLE, HAIR, MEDIUM_DASHED, THIN_DASH_DOTTED, MEDIUM_DASH_DOTTED, THIN_DASH_DOT_DOTTED, MEDIUM_DASH_DOT_DOTTED, SLANTED_MEDIUM_DASH_DOTTED, or 0x00 through 0x0D.
borders.right = xlwt.Borders.DASHED
borders.top = xlwt.Borders.DASHED
borders.bottom = xlwt.Borders.DASHED
borders.left_colour = 0x40
borders.right_colour = 0x40
borders.top_colour = 0x40
borders.bottom_colour = 0x40
style = xlwt.XFStyle() # Create Style
style.borders = borders # Add Borders to Style
worksheet.write(0, 0, 'Cell Contents', style)
workbook.save('Excel_Workbook.xls')


單元格設置背景色:

import xlwt
workbook = xlwt.Workbook()
worksheet = workbook.add_sheet('My Sheet')
pattern = xlwt.Pattern() # Create the Pattern
pattern.pattern = xlwt.Pattern.SOLID_PATTERN # May be: NO_PATTERN, SOLID_PATTERN, or 0x00 through 0x12
pattern.pattern_fore_colour = 5 # May be: 8 through 63. 0 = Black, 1 = White, 2 = Red, 3 = Green, 4 = Blue, 5 = Yellow, 6 = Magenta, 7 = Cyan, 16 = Maroon, 17 = Dark Green, 18 = Dark Blue, 19 = Dark Yellow , almost brown), 20 = Dark Magenta, 21 = Teal, 22 = Light Gray, 23 = Dark Gray, the list goes on...
style = xlwt.XFStyle() # Create the Pattern
style.pattern = pattern # Add Pattern to Style
worksheet.write(0, 0, 'Cell Contents', style)
workbook.save('Excel_Workbook.xls')

  

pyexcel-xls (https://pypi.org/project/pyexcel-xls/)

 

pyexcel-xls 以 OrderedDict 結構處理數據,將整個excel文件轉化為一個OrderedDict (有序字典)結構:每個key就是一個子表(Sheet)。

每個子表(Sheet),轉化為一個列表結構:很像二維數組,第一層列表為行(Row),行的下標為列(Column),對應的值為單元格的值。

編碼為 unicode,如果有中文必須進行轉換。

[安裝]

pip install pyexcel-xls

 

[使用]

from collections import OrderedDict
from pyexcel_xls import save_data, get_data
import json


# 讀取文件
def read_xls_file():
    data = get_data(r'./clubs.xlsx')
    json_data = json.dumps(data, ensure_ascii=False)  # key為sheet名稱 value為數據
    print(type(data), json_data)
    for sheet in data.keys():
        print(sheet, ':', data[sheet])


# 寫入文件
def write_xls_file():
    data = OrderedDict()
    sheet1 = []
    row1_data = ['id', 'name', 'level']
    row2_data = [1, 'lx', 'high']
    sheet1.append(row1_data)
    sheet1.append(row2_data)
    data.update({'Sheet1': sheet1})
    save_data('./writefile.xls', data)

 


免責聲明!

本站轉載的文章為個人學習借鑒使用,本站對版權不負任何法律責任。如果侵犯了您的隱私權益,請聯系本站郵箱yoyou2525@163.com刪除。



 
粵ICP備18138465號   © 2018-2025 CODEPRJ.COM