按照視頻課程,從Github上下載文件:https://github.com/alibaba/DataX
然后將下載的壓縮包解壓即可,不過需要的前提Python環境是要求python2,於是在python3的基礎上,又安裝了python2,配置了環境
打開cmd,進入到解壓的目錄bin目錄下例如我的:
發現會有以下幾個python文件
繼續在命令行輸入命令:
python2 datax.py ../job/csv_mysql.json
其中,用python2對datax.py進行編譯,后面加上需要運行的json文件,代碼如下:
{ "job": { "setting": { "speed": { "channel": 1 } }, "content": [ { "reader": { "name": "txtfilereader", "parameter": { "path": ["D:/yq.csv"], "encoding": "gbk", "column": [ { "index": 0, "type": "string" }, { "index": 1, "type": "string" }, { "index": 2, "type": "string" }, { "index": 3, "type": "string" } ], "fieldDelimiter": ",", "skipHeader": "true" } }, "writer": { "name": "mysqlwriter", "parameter": { "writeMode": "insert", "username": "root", "password": "root", "column": [ "pro", "yes", "dead", "ys" ], "session": [], "connection": [ { "jdbcUrl": "jdbc:mysql://localhost:3306/fy?useUnicode=true&characterEncoding=utf8&useSSL=false&serverTimezone=GMT%2B8", "table": ["yq2"] } ] } } } ] } }
在運行之前,需要在數據庫中提前建好數據表,然后再執行。
需要注意的是,我在運行的時候出現了亂碼現象,最終找到的解決方法是,在命令行(cmd)先輸入CHCP 65001 然后就解決了亂碼問題,運行結果如下: