通過python將xml文件轉換成html文件

本文轉載自查看原文 2018-11-23 17:39 2318

#數據類型的轉換

def main():
    maxwidth = 100  #用於規范字段的長度
    print_start()
    count=0
    while True:
        try:
            line =input()
            if count == 0:
                color = 'lightgreen'
            elif count % 2: #取余
                color = 'white'
            else:
                color = 'lightyellow'
            print_line(line,color,maxwidth)
            count += 1
        except EOFError:
            break
    print_end()

maxwidth 用於規范字段的長度，一旦比這個長度長的字段，我們可以通過用省略號來替代后面的內容
count 用於對文件中的顏色的改變，斑馬紋的實現。
上面的代碼中的print_start(),print_line(),print_end()三個函數是我們自己的設定的函數，代碼在后面
print_start() 函數用於輸入開頭
print_end() 函數用於輸入結尾
print_line() 將該行以html的形式輸出

def print_start():
    print("<table border='1'>")
#用於文件的開始頭拼接

def print_end():
    print("</table>")
#用於文件的結尾拼接

上面兩個是用來減少函數之間的關聯性，雖然在函數中添加print(…)也可以，
但是這樣可以更加規范，以后修改也比較容易，對之后的運維提供極大的方便，
通過修改函數，可以將所有的頭尾一起修改。

def print_line(line,color,maxwidth):
    print("<tr bgcolor='{0}'>".format(color))
    fields = extrace_fields(line)
    for field in fields:
        if not field:
            print("<td></td>")
        else:
            number = field.replace(",","")
            #這里把”，“改成”“的意義是為了將數值1,000轉換為1000的格式
        try:
            x = float(number)
            print("<td align='right'>{0}</td>33".format(round(x)))
        except ValueError:
            field =field.title() 
            '''
            用於注釋
            title函數是用來將字符串的首字母大寫處理            
            str = "this is string example from runoob....wow!!!"
            請注意，非字母后的第一個字母將轉換為大寫字母：
            txt = "hello b2b2b2 and 3g3g3g"
            print(txt.title())    #Hello B2B2B2 And 3G3G3G
            '''
            field = field.replace('And','and')
            if len(field) <= maxwidth:
                field = escape_html(field)
            else:
                field = "{0}...".format(
                escape_html(field[:maxwidth]))
            print("<td>{0}</td>".format(field))
    print("</tr>")

這段程序是將通過for遍歷文件，提取出里面的值，將里面的值進行規范化然后通過需要的html格式通過format拼接，最后顯示出來。
通過try的異常捕捉，我們可以將文件中的數字與字符串分開處理，數字通過flaot進行小數格式化，字符串通過title格式化
這又體現了python語言通過try捕獲異常的靈活性
為什么不再讀取的時候直接通過replace進行分割字符串？
因為這是為了防止出現，分號中間有”，“ 使文件不正確分割，導致程序出現錯誤，所以，我們要在print_line中在進行分割，減少錯誤的產生
extrace_fields(line)是自定義函數，函數的作用是將字段串進行分割

def extrace_fields(line):
    fields =[]
    field = ''
    quote = None
    for c in line:
        if c in "\"":    
            if quote is None:  #start of quoted string
                quote = c
            elif quote == c: #end of quoted string
                quote = None
            else:
                field += c #other quote inside quoted string
            continue 
        if quote is None and c == ",": #end of a field
            fields.append(field)
            field =''
        else:
            field += c  #accumulating a field
    if field:
        fields.append(field)#adding the last field
    return fields

def escape_html(text):
    text = text.replace('&','&amp;')
    text = text.replace('<',"&lt;")
    text = text.replace(">","&gt;")
    return text

通過替換函數將'<','>'替換成html可識別的實體字符，不替換的話 html會將'<','>'大於小於符號識別為尖括號<>

免責聲明！

本站轉載的文章為個人學習借鑒使用，本站對版權不負任何法律責任。如果侵犯了您的隱私權益，請聯系本站郵箱yoyou2525@163.com刪除。

猜您在找 Excel轉換成xml文件 html文件轉換成excel CHM文件命令轉換成html python csv文件轉換成xml，構建新xml文件如何將XML轉換成XSD(XML Schema)文件 python py文件轉換成exe python 將txt文件轉換成字典 python，多圖片轉換成pdf文件 Python-目標檢測-將xml文件轉換成.txt文件 Python: 把txt文件轉換成csv