測試時使用三種文件格式:
ISO-8859
Netpbm PBM image
ASCII
if [ $(file $filename|grep -c "ISO-8859") -gt 0 ]
then
echo "ISO-8859" psql -c "copy $schemaname.$tbname from '$dirname/$filename' with(format 'csv', delimiter ', encoding 'ISO-8859-1')" $dbname elif [ $(file $filename|grep -c "ASCII") -gt 0 ]
then
echo $format psql -c "copy $schemaname.$tbname from '$dirname/$filename' with(format 'csv', delimiter ', encoding 'UTF-8')" $dbname else psql -c "copy $schemaname.$tbname from '$dirname/$filename' with(format 'csv', delimiter ', encoding 'ISO-8859-1')" $dbname
這種處理方式在文件格式為ASCII時,copy中途仍然會出現編碼錯誤提示。
ERROR: invalid byte sequence for encoding "UTF8": 0xb3
最后摸索發現不管什么格式,都指定為ISO-8859-1就能處理。
全包容的編碼格式,都能處理:
psql -c "copy $schemaname.$tbname from '$dirname/$filename' with(format 'csv', delimiter ', encoding 'ISO-8859-1')" $dbname