Home >Java >javaTutorial >JAVA crawler realizes automatic login to Taobao_java

JAVA crawler realizes automatic login to Taobao_java

不言
不言Original
2018-04-02 09:16:122988browse

给大家分享一个关于JAVA爬虫的相关知识点,通过代码实现自动登录淘宝网,有兴趣的朋友测试下。

目的

想通过JAVA代码实现淘宝网的自动登录,通过获取设置的登录信息自动填写并提交。目前这个代码是小编测试过的,可以通过,后期不知道淘宝会不会有相应的封堵策略。

代码分享:

package util;
import org.openqa.selenium.By;
import org.openqa.selenium.WebElement;
import org.openqa.selenium.firefox.FirefoxDriver;
import org.openqa.selenium.firefox.FirefoxOptions;
import org.openqa.selenium.firefox.FirefoxProfile;
import java.io.File;
import java.util.Random;
public class TestCase2 {
  public static void main(String[] args)
  {
    System.setProperty("webdriver.firefox.bin","C:\\Program Files\\Mozilla Firefox\\firefox.exe");
    System.setProperty("webdriver.gecko.driver","C:\\Users\\18431\\IdeaProjects\\SeleniumDemo\\bin\\geckodriver.exe");
    FirefoxOptions options = new FirefoxOptions();
    FirefoxProfile profile = new FirefoxProfile(new File("C:\\Users\\18431\\AppData\\Roaming\\Mozilla\\Firefox\\Profiles\\efzu2oem.default"));
    options.setProfile(profile);
    FirefoxDriver driver = new FirefoxDriver();
    driver.get("https://login.m.taobao.com/login.htm");
    //下面开始完全模拟正常人的操作,所以你会看到很多 sleep 操作
    WebElement usernameElement = driver.findElement(By.id("username"));
    //模拟用户点击用户名输入框
    usernameElement.click();
    String username = "18588260144";//你的手机号
    String password = "xxxxxxxxxxx";//你的密码
    Random rand = new Random();
    try {
      for (int i = 0; i <username.length() ; i++) {
        Thread.sleep(rand.nextInt(1000));//随机睡眠0-1秒
        //逐个输入单个字符
        usernameElement.sendKeys(""+username.charAt(i));
      }
      WebElement passwordElement = driver.findElement(By.id("password"));
      passwordElement.click();
      //输入完成用户名后,随机睡眠0-3秒
      Thread.sleep(rand.nextInt(3000));
      for (int i = 0; i <password.length() ; i++) {
        Thread.sleep(rand.nextInt(1000));
        passwordElement.sendKeys(""+password.charAt(i));
      }
      driver.findElement(By.id("btn-submit")).click();
    } catch (Exception e){
      e.printStackTrace();
    }
    try {
      Thread.sleep(300000);
    }catch (InterruptedException ie){
      ie.printStackTrace();
    }
    driver.quit();
  }
}

总结

可以看出来,万变不离其宗,再难的模拟登录都是可以完全模拟人类的操作习惯去实现反爬虫的,好吧,全都告诉你了,PHP 爬虫技术不打算继续写下去了,感觉还是用 PHP 适合它做的事情比较好,PHP 写的爬虫段位太低,还是python 和 java 更好些。

相关推荐:

记录一次简单的Python爬虫实例

python动态爬虫的实例分享

如何完整写一个爬虫框架


The above is the detailed content of JAVA crawler realizes automatic login to Taobao_java. For more information, please follow other related articles on the PHP Chinese website!

Statement:
The content of this article is voluntarily contributed by netizens, and the copyright belongs to the original author. This site does not assume corresponding legal responsibility. If you find any content suspected of plagiarism or infringement, please contact admin@php.cn