search

Home  >  Q&A  >  body text

php - How to capture UA in log files in python

The log format is as follows
`61.182.137.6 - - [21/Apr/2017:00:00:37 +0800] 0 "HEAD / HTTP/1.1" 200 - "-" "Baidu-YunGuanCe-SLABot( ce.baidu.com)"
123.125.71.89 - - [21/Apr/2017:00:00:38 +0800] 0 "GET /article/515140 HTTP/1.1" 200 10315 "-" "Mozilla/5.0 (compatible; Baiduspider/2.0; +http://www.baidu.com/search/s...)"
216.244.66.229 - - [21/Apr/2017:00:00:39 +0800] 0 "GET /article/330012 HTTP/1.1" 200 29593 "-" "Mozilla/5.0 (compatible; DotBot/1.1; http://www.opensiteexplorer.o... help@moz.com)"`

How can python3.5 capture the spider types Baiduspider and DotBot in UA
How to write

淡淡烟草味淡淡烟草味2866 days ago486

reply all(1)I'll reply

  • 阿神

    阿神2017-05-16 13:12:35

    filename = 'xxx.log'
    with open(a) as f:
        for i in f:
            print(re.findall('(?<=compatible; )([^/]+)', i))

    reply
    0
  • Cancelreply