用python讀寫excel(xlrd、xlwt)


最近需要從多個excel表里面用各種方式整理一些數據,雖然說原來用過java做這類事情,但是由於最近在學python,所以當然就決定用python嘗試一下了。發現python果然簡潔很多。這里簡單記錄一下。(由於是用到什么學什么,所以不算太深入,高手勿噴,歡迎指導)

一、讀excel表

讀excel要用到xlrd模塊,官網安裝(http://pypi.python.org/pypi/xlrd)。然后就可以跟着里面的例子稍微試一下就知道怎么用了。大概的流程是這樣的:

1、導入模塊

      import xlrd

2、打開Excel文件讀取數據

       data = xlrd.open_workbook('excel.xls')

3、獲取一個工作表

1  table = data.sheets()[0]          #通過索引順序獲取
2  table = data.sheet_by_index(0) #通過索引順序獲取
3  table = data.sheet_by_name(u'Sheet1')#通過名稱獲取
4、獲取整行和整列的值(返回數組)
         table.row_values(i)
         table.col_values(i)
5、獲取行數和列數 
        table.nrows
        table.ncols
6、獲取單元格
  table.cell(0,0).value
     
table.cell(2,3).value
就我自己使用的時候覺得還是獲取cell最有用,這就相當於是給了你一個二維數組,余下你就可以想怎么干就怎么干了。得益於這個十分好用的庫代碼很是簡潔。但是還是有若干坑的存在導致話了一定時間探索。現在列出來供后人參考吧:
1、首先就是我的統計是根據姓名統計各個表中的信息的,但是調試發現不同的表中各個名字貌似不能夠匹配,開始懷疑過編碼問題,不過后來發現是因為  空格。因為在excel中輸入的時候很可能會順手在一些名字后面加上幾個空格或是tab鍵,這樣看起來沒什么差別,但是程序處理的時候這就是兩個完全  不同的串了。我的解決方法是給每個獲取的字符串都加上strip()處理一下。效果良好
2、還是字符串的匹配,在判斷某個單元格中的字符串(中文)是否等於我所給出的的時候發現無法匹配,並且各種unicode也不太奏效,百度過一些解決  方案,但是都比較復雜或是沒用。最后我采用了一個比較變通的方式:直接從excel中獲取我想要的值再進行比較,效果是不錯就是通用行不太好,個  呢不能問題還沒解決。
二、寫excel表
  寫excel表要用到xlwt模塊,官網下載(http://pypi.python.org/pypi/xlwt)。大致使用流程如下:
1、導入模塊
  import xlwt
2、創建workbook(其實就是excel,后來保存一下就行)
  workbook = xlwt.Workbook(encoding = 'ascii')
3、創建表
  worksheet = workbook.add_sheet('My Worksheet')
4、往單元格內寫入內容
  worksheet.write(0, 0, label = 'Row 0, Column 0 Value')
5、保存
  workbook.save('Excel_Workbook.xls')
由於我的需求比較簡單,所以這上面沒遇到什么問題,唯一的就是建議還是用ascii編碼,不然可能會有一些詭異的現象。
當然xlwt功能遠遠不止這些,他甚至可以設置各種樣式之類的。附上一點例子
  1 Examples Generating Excel Documents Using Python’s xlwt
  2 
  3 Here are some simple examples using Python’s xlwt library to dynamically generate Excel documents.
  4 
  5 Please note a useful alternative may be ezodf, which allows you to generate ODS (Open Document Spreadsheet) files for LibreOffice / OpenOffice. You can check them out at:http://packages.python.org/ezodf/index.html
  6 
  7 The Simplest Example
  8 import xlwt
  9 workbook = xlwt.Workbook(encoding = 'ascii')
 10 worksheet = workbook.add_sheet('My Worksheet')
 11 worksheet.write(0, 0, label = 'Row 0, Column 0 Value')
 12 workbook.save('Excel_Workbook.xls')
 13 
 14 Formatting the Contents of a Cell
 15 import xlwt
 16 workbook = xlwt.Workbook(encoding = 'ascii')
 17 worksheet = workbook.add_sheet('My Worksheet')
 18 font = xlwt.Font() # Create the Font
 19 font.name = 'Times New Roman'
 20 font.bold = True
 21 font.underline = True
 22 font.italic = True
 23 style = xlwt.XFStyle() # Create the Style
 24 style.font = font # Apply the Font to the Style
 25 worksheet.write(0, 0, label = 'Unformatted value')
 26 worksheet.write(1, 0, label = 'Formatted value', style) # Apply the Style to the Cell
 27 workbook.save('Excel_Workbook.xls')
 28 
 29 Attributes of the Font Object
 30 font.bold = True # May be: True, False
 31 font.italic = True # May be: True, False
 32 font.struck_out = True # May be: True, False
 33 font.underline = xlwt.Font.UNDERLINE_SINGLE # May be: UNDERLINE_NONE, UNDERLINE_SINGLE, UNDERLINE_SINGLE_ACC, UNDERLINE_DOUBLE, UNDERLINE_DOUBLE_ACC
 34 font.escapement = xlwt.Font.ESCAPEMENT_SUPERSCRIPT # May be: ESCAPEMENT_NONE, ESCAPEMENT_SUPERSCRIPT, ESCAPEMENT_SUBSCRIPT
 35 font.family = xlwt.Font.FAMILY_ROMAN # May be: FAMILY_NONE, FAMILY_ROMAN, FAMILY_SWISS, FAMILY_MODERN, FAMILY_SCRIPT, FAMILY_DECORATIVE
 36 font.charset = xlwt.Font.CHARSET_ANSI_LATIN # May be: CHARSET_ANSI_LATIN, CHARSET_SYS_DEFAULT, CHARSET_SYMBOL, CHARSET_APPLE_ROMAN, CHARSET_ANSI_JAP_SHIFT_JIS, CHARSET_ANSI_KOR_HANGUL, CHARSET_ANSI_KOR_JOHAB, CHARSET_ANSI_CHINESE_GBK, CHARSET_ANSI_CHINESE_BIG5, CHARSET_ANSI_GREEK, CHARSET_ANSI_TURKISH, CHARSET_ANSI_VIETNAMESE, CHARSET_ANSI_HEBREW, CHARSET_ANSI_ARABIC, CHARSET_ANSI_BALTIC, CHARSET_ANSI_CYRILLIC, CHARSET_ANSI_THAI, CHARSET_ANSI_LATIN_II, CHARSET_OEM_LATIN_I
 37 font.colour_index = ?
 38 font.get_biff_record = ?
 39 font.height = 0x00C8 # C8 in Hex (in decimal) = 10 points in height.
 40 font.name = ?
 41 font.outline = ?
 42 font.shadow = ?
 43 
 44 Setting the Width of a Cell
 45 import xltw
 46 workbook = xlwt.Workbook()
 47 worksheet = workbook.add_sheet('My Sheet')
 48 worksheet.write(0, 0, 'My Cell Contents')
 49 worksheet.col(0).width = 3333 # 3333 = 1" (one inch).
 50 workbook.save('Excel_Workbook.xls')
 51 
 52 Entering a Date into a Cell
 53 import xlwt
 54 import datetime
 55 workbook = xlwt.Workbook()
 56 worksheet = workbook.add_sheet('My Sheet')
 57 style = xlwt.XFStyle()
 58 style.num_format_str = 'M/D/YY' # Other options: D-MMM-YY, D-MMM, MMM-YY, h:mm, h:mm:ss, h:mm, h:mm:ss, M/D/YY h:mm, mm:ss, [h]:mm:ss, mm:ss.0
 59 worksheet.write(0, 0, datetime.datetime.now(), style)
 60 workbook.save('Excel_Workbook.xls')
 61 
 62 Adding a Formula to a Cell
 63 import xlwt
 64 workbook = xlwt.Workbook()
 65 worksheet = workbook.add_sheet('My Sheet')
 66 worksheet.write(0, 0, 5) # Outputs 5
 67 worksheet.write(0, 1, 2) # Outputs 2
 68 worksheet.write(1, 0, xlwt.Formula('A1*B1')) # Should output "10" (A1[5] * A2[2])
 69 worksheet.write(1, 1, xlwt.Formula('SUM(A1,B1)')) # Should output "7" (A1[5] + A2[2])
 70 workbook.save('Excel_Workbook.xls')
 71 
 72 Adding a Hyperlink to a Cell
 73 import xlwt
 74 workbook = xlwt.Workbook()
 75 worksheet = workbook.add_sheet('My Sheet')
 76 worksheet.write(0, 0, xlwt.Formula('HYPERLINK("http://www.google.com";"Google")')) # Outputs the text "Google" linking to http://www.google.com
 77 workbook.save('Excel_Workbook.xls')
 78 
 79 Merging Columns and Rows
 80 import xlwt
 81 workbook = xlwt.Workbook()
 82 worksheet = workbook.add_sheet('My Sheet')
 83 worksheet.write_merge(0, 0, 0, 3, 'First Merge') # Merges row 0's columns 0 through 3.
 84 font = xlwt.Font() # Create Font
 85 font.bold = True # Set font to Bold
 86 style = xlwt.XFStyle() # Create Style
 87 style.font = font # Add Bold Font to Style
 88 worksheet.write_merge(1, 2, 0, 3, 'Second Merge', style) # Merges row 1 through 2's columns 0 through 3.
 89 workbook.save('Excel_Workbook.xls')
 90 
 91 Setting the Alignment for the Contents of a Cell
 92 import xlwt
 93 workbook = xlwt.Workbook()
 94 worksheet = workbook.add_sheet('My Sheet')
 95 alignment = xlwt.Alignment() # Create Alignment
 96 alignment.horz = xlwt.Alignment.HORZ_CENTER # May be: HORZ_GENERAL, HORZ_LEFT, HORZ_CENTER, HORZ_RIGHT, HORZ_FILLED, HORZ_JUSTIFIED, HORZ_CENTER_ACROSS_SEL, HORZ_DISTRIBUTED
 97 alignment.vert = xlwt.Alignment.VERT_CENTER # May be: VERT_TOP, VERT_CENTER, VERT_BOTTOM, VERT_JUSTIFIED, VERT_DISTRIBUTED
 98 style = xlwt.XFStyle() # Create Style
 99 style.alignment = alignment # Add Alignment to Style
100 worksheet.write(0, 0, 'Cell Contents', style)
101 workbook.save('Excel_Workbook.xls')
102 
103 Adding Borders to a Cell
104 # Please note: While I was able to find these constants within the source code, on my system (using LibreOffice,) I was only presented with a solid line, varying from thin to thick; no dotted or dashed lines.
105 import xlwt
106 workbook = xlwt.Workbook()
107 worksheet = workbook.add_sheet('My Sheet')
108 borders = xlwt.Borders() # Create Borders
109 borders.left = xlwt.Borders.DASHED # May be: NO_LINE, THIN, MEDIUM, DASHED, DOTTED, THICK, DOUBLE, HAIR, MEDIUM_DASHED, THIN_DASH_DOTTED, MEDIUM_DASH_DOTTED, THIN_DASH_DOT_DOTTED, MEDIUM_DASH_DOT_DOTTED, SLANTED_MEDIUM_DASH_DOTTED, or 0x00 through 0x0D.
110 borders.right = xlwt.Borders.DASHED
111 borders.top = xlwt.Borders.DASHED
112 borders.bottom = xlwt.Borders.DASHED
113 borders.left_colour = 0x40
114 borders.right_colour = 0x40
115 borders.top_colour = 0x40
116 borders.bottom_colour = 0x40
117 style = xlwt.XFStyle() # Create Style
118 style.borders = borders # Add Borders to Style
119 worksheet.write(0, 0, 'Cell Contents', style)
120 workbook.save('Excel_Workbook.xls')
121 
122 Setting the Background Color of a Cell
123 import xlwt
124 workbook = xlwt.Workbook()
125 worksheet = workbook.add_sheet('My Sheet')
126 pattern = xlwt.Pattern() # Create the Pattern
127 pattern.pattern = xlwt.Pattern.SOLID_PATTERN # May be: NO_PATTERN, SOLID_PATTERN, or 0x00 through 0x12
128 pattern.pattern_fore_colour = 5 # May be: 8 through 63. 0 = Black, 1 = White, 2 = Red, 3 = Green, 4 = Blue, 5 = Yellow, 6 = Magenta, 7 = Cyan, 16 = Maroon, 17 = Dark Green, 18 = Dark Blue, 19 = Dark Yellow , almost brown), 20 = Dark Magenta, 21 = Teal, 22 = Light Gray, 23 = Dark Gray, the list goes on...
129 style = xlwt.XFStyle() # Create the Pattern
130 style.pattern = pattern # Add Pattern to Style
131 worksheet.write(0, 0, 'Cell Contents', style)
132 workbook.save('Excel_Workbook.xls')
133 
134 TODO: Things Left to Document
135 - Panes -- separate views which are always in view
136 - Border Colors (documented above, but not taking effect as it should)
137 - Border Widths (document above, but not working as expected)
138 - Protection
139 - Row Styles
140 - Zoom / Manification
141 - WS Props?
142 Source Code for reference available at: https://secure.simplistix.co.uk/svn/xlwt/trunk/xlwt/


免責聲明!

本站轉載的文章為個人學習借鑒使用,本站對版權不負任何法律責任。如果侵犯了您的隱私權益,請聯系本站郵箱yoyou2525@163.com刪除。



 
粵ICP備18138465號   © 2018-2025 CODEPRJ.COM