Encoding - 'utf-8' codec can't decode byte: invalid start byte in Python 3.6?

Question

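In Python 3.6, parsing the content of a web page fails. I have tried many encodings, and the page itself is declared as utf-8. The error message is: 'utf-8' codec can't decode byte 0x8b in position 1: invalid start byte. Also, what the first print outputs in the terminal is unic...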

阿神 · Answer

I checked, and this site returns gzip-compressed data, so the body has to be decompressed before it is decoded:

# coding=utf-8
from io import BytesIO
import gzip
import urllib.request

url = ('http://wthrcdn.etouch.cn/weather_mini?city=%E4%B8%8A%E6%B5%B7')
resp = urllib.request.urlopen(url)
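content = resp.read()  # content is still gzip-compressed bytes

buff = BytesIO(content)  # wrap the bytes in a file-like object
f = gzip.GzipFile(fileobj=buff)  # decompress on read
res = f.read().decode('utf-8')  # only now decode the text as UTF-8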
print(res)
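
The byte 0x8b at position 1 in the error message is a strong hint by itself: every gzip stream starts with the two magic bytes 0x1f 0x8b. 0x1f is a valid one-byte UTF-8 character, but 0x8b can only appear as a continuation byte, so decoding stops at position 1. Here is a minimal offline sketch that reproduces this; the payload is a made-up stand-in, not the real response from the site:

```python
import gzip

# Hypothetical payload standing in for the server's JSON response.
payload = '{"city": "上海"}'.encode('utf-8')
compressed = gzip.compress(payload)

# Every gzip stream starts with the magic bytes 0x1f 0x8b.
magic = compressed[:2]

# Decoding the compressed bytes directly fails at the second byte.
try:
    compressed.decode('utf-8')
    error = None
except UnicodeDecodeError as exc:
    error = exc

# Decompressing first and decoding afterwards recovers the text.
text = gzip.decompress(compressed).decode('utf-8')

print(magic)
print(error)
print(text)
```

If you get this error from some other URL, checking whether the body starts with b'\x1f\x8b' is a quick way to tell a compression problem from a real charset problem.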

伊谢尔伦 · Answer

Wouldn't requests be easier to use?

伊谢尔伦 · Answer

I recommend using requests, which decompresses gzip responses automatically. The code is as follows:

import requests

r = requests.get('http://wthrcdn.etouch.cn/weather_mini?city=%E4%B8%8A%E6%B5%B7')
print(r.text)

阿神 · Answer

This is not a character-encoding problem. Look at the response headers of your request:



    Status Code: 200 OK
    Access-Control-Allow-Headers: *
    Access-Control-Allow-Methods: *
    Access-Control-Allow-Origin: *
    Cache-Control: must-revalidate, max-age=300
    Connection: Keep-Alive
    Content-Encoding: gzip
    Content-Length: 443
    Date: Fri, 10 Mar 2017 03:20:46 GMT
    Fw-Cache-Status: hit
    Fw-Via: HTTP MISS from 58.59.19.99, DISK HIT from 183.131.161.27
    Server: Tengine/2.1.2

The body is gzip-compressed (Content-Encoding: gzip). If you use the standard library, you have to decompress the gzip data yourself before decoding it.
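
With the standard library you can also let the Content-Encoding header decide whether to decompress, instead of always assuming gzip. This is only a rough sketch: `decode_body` and `fetch_text` are helper names I made up for illustration, and it assumes the server sends either gzip or no compression at all.

```python
import gzip
import urllib.request
from typing import Optional


def decode_body(raw: bytes, content_encoding: Optional[str],
                charset: str = 'utf-8') -> str:
    """Decompress raw if the header says gzip, then decode it as text."""
    if content_encoding and content_encoding.strip().lower() == 'gzip':
        raw = gzip.decompress(raw)
    return raw.decode(charset)


def fetch_text(url: str) -> str:
    """Fetch url and return its body as text (hypothetical helper)."""
    with urllib.request.urlopen(url) as resp:
        # resp.headers.get returns None when the header is absent.
        return decode_body(resp.read(), resp.headers.get('Content-Encoding'))
```

Usage would be something like `print(fetch_text('http://wthrcdn.etouch.cn/weather_mini?city=%E4%B8%8A%E6%B5%B7'))`. A body that is not compressed goes straight to `decode`, so the same helper works for both cases.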
