Home >Backend Development >PHP Tutorial >Using CURL to simulate login and obtain data instances in PHP_PHP tutorial

Using CURL to simulate login and obtain data instances in PHP_PHP tutorial

WBOY
WBOYOriginal
2016-07-13 10:26:27745browse

cURL is a powerful PHP library. Using PHP's cURL library can simply and effectively crawl web pages and collect content. Set cookies to simulate logging in to web pages. Curl provides a wealth of functions. Developers can learn from the PHP manual Get more information about cURL. This article takes simulated login to open source China (oschina) as an example to share with you the use of cURL.
PHP's curl() is relatively efficient in crawling web pages and supports multi-threading, while file_get_contents() is slightly less efficient. Of course, you need to enable the curl extension when using curl.

Code practice

Let’s look at the login code first:

Copy code The code is as follows:

//Simulate login
function login_post($url, $cookie, $post) {
$curl = curl_init();//Initialize curl module
curl_setopt($curl, CURLOPT_URL, $url);//Login submitted address
curl_setopt($curl, CURLOPT_HEADER, 0);//Whether to display header information
curl_setopt($curl, CURLOPT_RETURNTRANSFER, 0);//Whether to automatically display the returned information
curl_setopt($curl, CURLOPT_COOKIEJAR, $cookie); //Set Cookie information to be saved in the specified file
curl_setopt($curl, CURLOPT_POST, 1);//Submit via post
curl_setopt($curl, CURLOPT_POSTFIELDS, http_build_query($post));//Information to be submitted
curl_exec($curl);//Execute cURL
curl_close($curl);//Close cURL resources and release system resources
}

The function login_post() first initializes curl_init(), and then uses curl_setopt() to set relevant option information, including the URL address to be submitted, the saved cookie file, post data (user name and password, etc.), whether to return information, etc. Wait, then curl_exec executes curl, and finally curl_close() releases the resources. Note that PHP's own http_build_query() can convert arrays into connected strings.
Next, if the login is successful, we need to obtain the page information after the login is successful.

Copy code The code is as follows:

//Get data after successful login
function get_content($url, $cookie) {
$ch = curl_init();
curl_setopt($ch, CURLOPT_URL, $url);
curl_setopt($ch, CURLOPT_HEADER, 0);
curl_setopt($ch, CURLOPT_RETURNTRANSFER, 1);
curl_setopt($ch, CURLOPT_COOKIEFILE, $cookie); //Read cookie
$rs = curl_exec($ch); //Execute cURL to capture page content
curl_close($ch);
Return $rs;
}

The function get_content() also initializes curl first, then sets relevant options, executes curl, and releases resources. Among them, we set CURLOPT_RETURNTRANSFER to 1 to automatically return information, and CURLOPT_COOKIEFILE can read the cookie information saved when logging in, and finally return the page content.

Our ultimate goal is to obtain the information after simulated login, which is useful information that can only be obtained after successful normal login. Next, we take logging into the mobile version of Open Source China as an example to see how to capture the information after successful login.

Copy code The code is as follows:

//Set post data
$post = array (
'email' => 'oschina account',
'pwd' => 'oschina password',
'goto_page' => '/my',
'error_page' => '/login',
'save_login' => '1',
'submit' => 'Log in now'
);

//Login address
$url = "http://m.jb51.net/action/user/login";
//Set cookie saving path
$cookie = dirname(__FILE__) . '/cookie_jb51.txt';
//The address to obtain information after logging in
$url2 = "http://m.jb51.net/my";
//Simulate login
login_post($url, $cookie, $post);
//Get login page information
$content = get_content($url2, $cookie);
//Delete cookie file
@ unlink($cookie);
//Matching page information
$preg = "/(.*)/i";
preg_match_all($preg, $content, $arr);
$str = $arr[1][0];
//Output content
echo $str;

After running the above code, we will see that the avatar image of the logged in user is finally obtained.

Usage summary:
1. Initialize curl;
2. Use curl_setopt to set the target url and other options;
3. curl_exec, execute curl;
4. After execution, close curl;
5. Output data.

www.bkjia.comtruehttp: //www.bkjia.com/PHPjc/824702.htmlTechArticlecURL is a powerful PHP library. Using PHP's cURL library can easily and effectively crawl web pages and collect them. Content, set cookies to complete the simulated login web page, curl provides a wealth of functions, open...
Statement:
The content of this article is voluntarily contributed by netizens, and the copyright belongs to the original author. This site does not assume corresponding legal responsibility. If you find any content suspected of plagiarism or infringement, please contact admin@php.cn