
php - How can Python capture the UA from a log file?

The log format is as follows:
`61.182.137.6 - - [21/Apr/2017:00:00:37 +0800] 0 "HEAD / HTTP/1.1" 200 - "-" "Baidu-YunGuanCe-SLABot(ce.baidu.com)"
123.125.71.89 - - [21/Apr/2017:00:00:38 +0800] 0 "GET /article/515140 HTTP/1.1" 200 10315 "-" "Mozilla/5.0 (compatible; Baiduspider/2.0; +http://www.baidu.com/search/s...)"
216.244.66.229 - - [21/Apr/2017:00:00:39 +0800] 0 "GET /article/330012 HTTP/1.1" 200 29593 "-" "Mozilla/5.0 (compatible; DotBot/1.1; http://www.opensiteexplorer.o... help@moz.com)"`

In Python 3.5, how can I capture the spider types Baiduspider and DotBot from the UA?
How should I write it?

淡淡烟草味 · 2683 days ago · 413

All replies (1)

  • 阿神

    阿神 2017-05-16 13:12:35

    import re

    filename = 'xxx.log'
    with open(filename) as f:
        for line in f:
            # Capture the name after "compatible; ", up to the "/" before the version
            print(re.findall(r'(?<=compatible; )([^/]+)', line))
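The lookbehind above prints an empty list for any line whose UA has no "compatible; " part, such as the first sample line. A slightly more defensive sketch is below. It assumes the UA is always the last double-quoted field and that spiders identify themselves as "compatible; Name/version", as in the sample lines. The names `UA_RE`, `BOT_RE`, `spider_name` and `count_spiders` are made up for illustration, not part of any library.

```python
import re
from collections import Counter

# Assumption: the user agent is the last double-quoted field on each line,
# as in the three sample lines from the question.
UA_RE = re.compile(r'"([^"]*)"\s*$')
# Assumption: spiders announce themselves as "compatible; Name/version".
BOT_RE = re.compile(r'compatible;\s*([^/;\s]+)/')


def spider_name(line):
    """Return the spider name found in one log line, or None."""
    ua = UA_RE.search(line)
    if ua is None:
        return None
    bot = BOT_RE.search(ua.group(1))
    return bot.group(1) if bot else None


def count_spiders(lines):
    """Count spider names over an iterable of log lines."""
    names = (spider_name(line) for line in lines)
    return Counter(name for name in names if name is not None)


# The three sample lines from the question, copied as-is (truncated URLs included).
SAMPLE = [
    '61.182.137.6 - - [21/Apr/2017:00:00:37 +0800] 0 "HEAD / HTTP/1.1" 200 - "-" "Baidu-YunGuanCe-SLABot(ce.baidu.com)"',
    '123.125.71.89 - - [21/Apr/2017:00:00:38 +0800] 0 "GET /article/515140 HTTP/1.1" 200 10315 "-" "Mozilla/5.0 (compatible; Baiduspider/2.0; +http://www.baidu.com/search/s...)"',
    '216.244.66.229 - - [21/Apr/2017:00:00:39 +0800] 0 "GET /article/330012 HTTP/1.1" 200 29593 "-" "Mozilla/5.0 (compatible; DotBot/1.1; http://www.opensiteexplorer.o... help@moz.com)"',
]

print(count_spiders(SAMPLE))
```

For a real log, passing the open file object should work the same way, e.g. `with open('xxx.log') as f: print(count_spiders(f))`. Bots that do not use the "compatible;" convention (like the first sample line) are skipped here and would need their own pattern.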
