xml文件內容如代碼所示存入的名字為login.xml:
<?xml version="1.0" encoding="utf-8"?> <info> <explain>126</explain> <url>http://www.126.com</url> <null username="" password="">請先輸入您的郵箱帳號</null> <pawd_null username="testingwtb" password=""></pawd_null> <user_null username="" password="a123456"> </user_null> <error username="xxx" password="xxx"></error> </info>
Python源代碼代碼本身是沒有錯誤的:
#coding =utf-8 import xml.dom.minidom dom=xml.dom.minidom.parse('D:\Python27\lianxidanma\login.xml') root = dom.documentElement logins=root.getElementsByTagName('null') username=logins[0].getAttribute("username") password=logins[0].getAttribute("password") prompt_info = logins[0].firstChild.data print username print prompt_info
使用xml.dom.mindom庫解析xml文件時,報如下錯誤:
Traceback (most recent call last): File "D:\Python27\lianxidanma\xml11.py", line 4, in <module> dom=xml.dom.minidom.parse('D:\Python27\lianxidanma\login.xml') File "D:\Python27\lib\xml\dom\minidom.py", line 1918, in parse return expatbuilder.parse(file) File "D:\Python27\lib\xml\dom\expatbuilder.py", line 924, in parse result = builder.parseFile(fp) File "D:\Python27\lib\xml\dom\expatbuilder.py", line 207, in parseFile parser.Parse(buffer, 0) ExpatError: not well-formed (invalid token): line 5, column 36
其實報這個錯誤主要還是“轉碼”的問題,如果xml文件中沒有中文,自然能夠輸入所需要的數據,但是現在xml文件中有中文。
一般情況我們在做自動化測試的時候,習慣用txt來編輯xml文件進行數據保存,但是在用txt編輯完xml文件后,都習慣性的直接點擊保存,默認保存的編碼方式是ANSI
問題就出在編碼方式,如果我們用UTF-8的編碼方式保存后,重新執行腳本,那么程序執行成功,正確輸出中文:
Python 2.7.9 (default, Dec 10 2014, 12:28:03) [MSC v.1500 64 bit (AMD64)] on win32 Type "copyright", "credits" or "license()" for more information. >>> ================================ RESTART ================================ >>> Traceback (most recent call last): File "D:\Python27\lianxidanma\xml11.py", line 4, in <module> dom=xml.dom.minidom.parse('D:\Python27\lianxidanma\login.xml') File "D:\Python27\lib\xml\dom\minidom.py", line 1918, in parse return expatbuilder.parse(file) File "D:\Python27\lib\xml\dom\expatbuilder.py", line 924, in parse result = builder.parseFile(fp) File "D:\Python27\lib\xml\dom\expatbuilder.py", line 207, in parseFile parser.Parse(buffer, 0) ExpatError: not well-formed (invalid token): line 5, column 36 >>> ================================ RESTART ================================ >>> 請先輸入您的郵箱帳號 >>>
