web - python requests库登录网站脚本登录失败

Question

想写一个自动登录脚本，拿V2EX做实验。首先分析了下登录提交的表单： 需要分析登陆界面中的html取出next,once,next值，分别为input_next_value_pre、input_once_value、input_next_value_post, 然后用requests请...

大家讲道理 · Answer

V2EX上答案：

可能是cookie的问题，在post之前，先GET一次，把cookie存下来，再post，或者用requests的Session。

另外POST的Referer要带上。

修改后代码：

headers = {"User-Agent": user_agent,
           "Referer": "http://www.v2ex.com/signin"}
v2ex_session = requests.Session()
signin_req = v2ex_session.post(signin_url,
                               data=logininfo,
                               headers=headers,
                               )

logininfo不用改动即可。

参考： python登录V2EX失败

伊谢尔伦 · Answer

logininfo 转成json格式看一下。你抓包的时候看一下他是什么格式。至少我之前遇到过这类的问题。

黄舟 · Answer

用requests.Session()吧

迷茫 · Answer

刚实现了这个脚本，跟LZ的情况一样，本来登录不上，后来添加了headers内容，现在可以登录了。
主要是headers中Origin, Referer和Host这3个字段。实现后代码如下：

python
#coding=utf-8
import requests
from bs4 import BeautifulSoup as bs

s = requests.Session()
headers = {
    'User-Agent': 'Mozilla/5.0 (Windows NT 6.3; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/39.0.2171.95 Safari/537.36',
    'Origin': 'http://www.v2ex.com',
    'Referer': 'http://www.v2ex.com/signin',
    'Host': 'www.v2ex.com',
}
r = s.get('http://www.v2ex.com/signin', headers=headers)
soup = bs(r.content)
once = soup.find('input', {'name': 'once'})['value']

login_data = {'u': '***', 'p': '***', 'once': once, 'next': '/'}

s.post('http://www.v2ex.com/signin', login_data, headers=headers)

f = s.get('http://www.v2ex.com/settings', headers=headers)
print f.content

黄舟 · Answer

请问楼主我在打开f12的时候并没有出现那个Form Data是怎么回事。。

web - python requests库登录网站脚本 登录失败

Antworte allen(5)Ich werde antworten

web - python requests库登录网站脚本登录失败