search
HomeBackend DevelopmentPHP TutorialShort URL exploration, URL exploration_PHP tutorial
Short URL exploration, URL exploration_PHP tutorialJul 13, 2016 am 10:04 AM
httpundercommonplaceintroductionExploreshortURL

Short URL exploration, URL exploration

Introduction

Short URLs are common, such as the following

 http://dwz.cn/CSW6Y => http://www.cnblogs.com/iforever/p/4313704.html,

 http://dwz.cn/CSWuP => http://www.cnblogs.com/iforever/p/4279006.html, etc.

When accessing these URLs, the front and back pairs point to the same page. I am just giving an example. These short URLs are often seen in weibo or similar social networks. Since the original link address is particularly long, this time the short URL is short. The URL will show its power and be short and easy to remember, but it also has its shortcomings. For example, short URLs may not directly see some information in the URL (some information can be seen in long URLs).

I have been exposed to this all day long, so that before I really understand its principle, it has imprinted "nothing" in my mind. This has missed some knowledge. This situation should be called in psychology. I'm thinking, does anyone know that this is the case?

Principle

First of all, I guess this is achieved through redirection. dwz.cn is a server used to receive some short URLs. These short URLs are processed by dwz.cn and have corresponding short URLs in dwz.cn. The record of the long URL, then obtain the long URL corresponding to the short URL from the database, and then redirect with 302. See if this is the case.

Visit http://dwz.cn/CSW6Y through the browser and analyze the request (use Chrome’s debugging tool here, or you can use tools such as wireshark), and you can get the following data:

Short URL exploration, URL exploration_PHP tutorialname: abit version: 1 handle: - rewrite: if (!-d && !-f && path ~ "/(.*)$") goto "do.php?url=$1&act=out"

Handle: The following are the rewrite rules. For the specific detailed rules of Sina Cloud, please see here http://sae.sina.com.cn/doc/php/runtime.html#php-app-config. It is very simple. The configurations of nginx and apache are also similar.

Another thing to note after the redirection is completed is that urlencode must be performed when passing parameters, and urldecode must be used before redirection. When url type parameters are passed without using urlencode, part of the information may be lost when obtained. Therefore, when passing parameters before generating a short link, the URL must be escaped, the special string must be encoded, and the short link must be processed when accessing. It is necessary to urldecode the encoded URL and restore it to a normal link. Otherwise, the link will not be regarded as a normal URL when the header jumps. After the jump, the URL will be appended to the host of the previous page, similar to http://abit.sinaapp.com/www.cnblogs.com, errors may occur, so please pay special attention here.

There will be a problem when sae redirects, double backslashes will be automatically filtered into one, for example from http://abit.sinaapp.com/ to http:/abit.sinaapp.com/, please note , there is a missing backslash here, you should pay special attention to this when processing, otherwise you may encounter unnecessary trouble.

Encoding

Main processing part

<?<span>php
</span><span>class</span><span> snapshotUrl{
    </span><span>//</span><span>进行编码的数据库,没6位二进制数对应一个字符,一共需要64位,因此选取
    //52+10+2个特殊字符</span>
    <span>private</span> <span>static</span> <span>$basedb</span> = <span>array</span><span>(
        </span>'(',')','a','b','c','d',
        'e','f','g','h','i','j',
        'k','l','m','n','o','p',
        'q','r','s','t','u','v',
        'w','x','y','z','A','B',
        'C','D','E','F','G','H',
        'I','J','K','L','M','N',
        'O','P','Q','R','S','T',
        'U','V','W','X','Y','Z',
        '0','1','2','3','4','5',
        '6','7','8','9',<span>
    );

    </span><span>private</span> <span>function</span> long2short(<span>$url</span><span>){
        </span><span>$hex</span> = <span>md5</span>(<span>$url</span><span>);
        </span><span>$out</span> = ''<span>;
        </span><span>$hex</span> = 0x7FFFFFFF & (1 * ('0x'.<span>substr</span>(<span>$hex</span>, 0, 8<span>)));
        </span><span>for</span>(<span>$i</span>=0; <span>$i</span><5; <span>$i</span>++<span>){
            </span><span>$index</span> = 0x3f & <span>$hex</span><span>;
            </span><span>$out</span> .= self::<span>$basedb</span>[<span>$index</span><span>];
            </span><span>$hex</span> = <span>$hex</span>>>6<span>;
        }
        </span><span>return</span> <span>$out</span><span>;
    }

    </span><span>public</span> <span>function</span> retJson(<span>$arr</span><span>){
        </span><span>return</span> json_encode(<span>$arr</span><span>);
    }

    </span><span>//</span><span>对url进行映射保存</span>
    <span>public</span> <span>function</span> dispose(<span>$url</span>, <span>$act</span><span>){
        </span><span>$mysql</span> = <span>new</span><span> SaeMysql();
        </span><span>switch</span> (<span>$act</span><span>) {
            </span><span>case</span> 'in':
                <span>$short</span> = <span>$this</span>->long2short(<span>$url</span><span>);
                </span><span>$url</span> = <span>addslashes</span>(<span>$url</span><span>);
                </span><span>$sql</span> = "insert into `tiny_url`(`short`,`long`) values ('{<span>$short</span>}','{<span>$url</span>}')"<span>;
                </span><span>$mysql</span>->runSql(<span>$sql</span><span>);
                </span><span>if</span>(<span>$mysql</span>->errno() != 0<span>){
                    </span><span>echo</span> "生成失败"<span>;
                }</span><span>else</span><span>{
                    </span><span>echo</span> "http://abit.sinaapp.com/{<span>$short</span>}"<span>;
                }
                </span><span>break</span><span>;
            </span><span>case</span> 'out':
                <span>if</span>(<span>strlen</span>(<span>$url</span>) > 5<span>)
                    </span><span>echo</span> <span>$this</span>->retJson(<span>array</span>("code"=>"-1","msg"=>"没有这条记录"<span>));
                </span><span>$sql</span> = "select * from `tiny_url` where `short`='{<span>$url</span>}' limit 1"<span>;
                </span><span>$data</span> = <span>$mysql</span>->getData(<span>$sql</span><span>);
                </span><span>if</span>(!<span>$data</span><span>) {
                    </span><span>echo</span> <span>$this</span>->retJson(<span>array</span>("code"=>"-1","msg"=>"没有这条记录"<span>));
                }</span><span>else</span><span>{
                    </span><span>$location</span> = <span>urldecode</span>(<span>$data</span>[0]['long'<span>]);
                    </span><span>header</span>("Location: {<span>$location</span>}"<span>);
                    </span><span>exit</span><span>();
                }
                </span><span>break</span><span>;
            
            </span><span>default</span>:
                <span>#</span><span> code...</span>
                <span>break</span><span>;
        }
    }    
}

</span><span>$url</span> = <span>isset</span>(<span>$_GET</span>['url']) ? <span>$_GET</span>['url'] : <span>null</span><span>;
</span><span>$act</span> = <span>isset</span>(<span>$_GET</span>['act']) ? <span>$_GET</span>['act'] : <span>null</span><span>;
</span><span>$snapshotUrl</span> = <span>new</span><span> snapshotUrl();
</span><span>if</span>(<span>$url</span> === <span>null</span> || <span>$act</span> === <span>null</span><span>)
    </span><span>echo</span> <span>$snapshotUrl</span>->retJson(<span>array</span>("code"=>"-1","msg"=>"参数错误"<span>));

</span><span>$snapshotUrl</span>->dispose(<span>$url</span>, <span>$act</span>);

Results

I made a small webpage that can be tested:

 http://abit.sinaapp.com/ If you are interested, you can try it

The copyright of this article belongs to the author iforever (luluyrt@163.com). Any form of reprinting is prohibited without the author’s consent. After reprinting the article, the author and the original text link must be given in an obvious position on the article page. Otherwise, we reserve the right to pursue legal liability.

www.bkjia.comtruehttp: //www.bkjia.com/PHPjc/965023.htmlTechArticleShort URL Research, URL Research Introduction Short URLs are common, such as the following http://dwz.cn/CSW6Y =http://www.cnblogs.com/iforever/p/4313704.html, http://dwz.cn/CSWuP=http:/...
Statement
The content of this article is voluntarily contributed by netizens, and the copyright belongs to the original author. This site does not assume corresponding legal responsibility. If you find any content suspected of plagiarism or infringement, please contact admin@php.cn
如何在steam内输入网址?steam打开网站的方法如何在steam内输入网址?steam打开网站的方法Mar 14, 2024 pm 12:10 PM

  如果想要在steam里查阅一些相关的信息或者和朋友分享一些网站链接,要怎么操作呢?steam能输入网址吗?当然是可以的,下面小编就来教教大家如何在Steam内打开网址的方法。  具体方法:  1、首先,我们打开steam。  2、点击设置选项。  3、然后切换到界面标签。  4、勾选显示地址栏。  5、然后再地址栏内输入网址即可使用浏览器。

php检测网址是什么意思php检测网址是什么意思Jul 18, 2023 am 11:02 AM

php检测网址是指使用PHP编程语言来验证输入的字符串是否符合网址的格式。检测网址的方法:1、使用正则表达式进行网址校验,可以使用“preg_match”函数来进行正则匹配,如果URL符合模式;2、使用内置函数进行网址校验,使用“filter_var”函数和“FILTER_VALIDATE_URL”过滤器来验证。

Go语言简介:探究Go就是Golang吗?Go语言简介:探究Go就是Golang吗?Feb 28, 2024 am 11:09 AM

Go语言简介:探究Go就是Golang吗?Go语言(也被称为Golang)是由谷歌(Google)开发的一种开源编程语言,于2007年开始设计,2009年正式发布,旨在提高程序员的工作效率和编程快乐度。尽管很多人称其为Golang,但其官方名称仍是Go语言。那么,Go和Golang究竟是同一种语言吗?为了解答这个问题,让我们深入探究一下这门语言的背景、特点和

PHP数组的性能优化技巧探究PHP数组的性能优化技巧探究Mar 13, 2024 pm 03:03 PM

PHP数组是一种非常常见的数据结构,在开发过程中经常会用到。然而,随着数据量的增加,数组的性能可能会成为一个问题。本文将探讨一些PHP数组的性能优化技巧,并提供具体的代码示例。1.使用合适的数据结构在PHP中,除了普通数组外,还有一些其他数据结构,如SplFixedArray、SplDoublyLinkedList等,它们在特定情况下可能比普通数组性能更好

教你如何利用PHP去除网址路径的后缀教你如何利用PHP去除网址路径的后缀Mar 21, 2024 pm 03:39 PM

教你如何利用PHP去除网址路径的后缀在网站开发中,经常会遇到需要去除网址路径后缀的需求,以实现更加美观和规范的URL。今天我们将来学习如何利用PHP去除网址路径的后缀,让我们一同探讨这个问题。首先,我们需要明确一下我们想要实现的效果。通常,网址路径后缀指的是URL中的文件扩展名,比如.php、.html等。我们的目标是在用户访问带有后缀的URL时,可以自动去

PHP魔法函数探究:__clone()PHP魔法函数探究:__clone()Jun 19, 2023 pm 10:28 PM

在PHP的面向对象编程中,除了常规的用于创建对象的构造函数(__construct),还有很多针对对象操作的特殊函数,这些被称为“魔法函数”。其中,一个非常重要的魔法函数就是__clone()。在本文中,我们将对此进行探究。一.__clone()是什么__clone()是PHP中一个特殊的函数,用于在对象被复制时调用。它的作用等同于对象的克隆,也就是复制一

Golang程序能否被反编译探究与解析Golang程序能否被反编译探究与解析Mar 18, 2024 pm 09:42 PM

【反编译Golang程序:探究与解析】近年来,随着Golang(Go语言)在软件开发领域的广泛应用,人们也越来越关注Golang程序的安全性。其中一个重要的安全考量就是程序的反编译问题。在实际应用中,有些开发者会担心自己编写的Golang程序是否容易被反编译,进而泄露代码或关键信息。本文将探究Golang程序被反编译的实际情况,并通过具体的代码示例展示相关技

PHP函数探究——array_key_first()PHP函数探究——array_key_first()Jun 21, 2023 pm 12:41 PM

PHP函数探究——array_key_first()在PHP7.3中,官方新增了一个数组函数——array_key_first()。这个函数能够返回数组中第一个键名。在本文中,我们将深入探究这个函数的用法和场景。语法array_key_first(array$array):mixed说明array_key_first()函数接收一个数组参数,并返回

See all articles

Hot AI Tools

Undresser.AI Undress

Undresser.AI Undress

AI-powered app for creating realistic nude photos

AI Clothes Remover

AI Clothes Remover

Online AI tool for removing clothes from photos.

Undress AI Tool

Undress AI Tool

Undress images for free

Clothoff.io

Clothoff.io

AI clothes remover

AI Hentai Generator

AI Hentai Generator

Generate AI Hentai for free.

Hot Article

Repo: How To Revive Teammates
1 months agoBy尊渡假赌尊渡假赌尊渡假赌
R.E.P.O. Energy Crystals Explained and What They Do (Yellow Crystal)
2 weeks agoBy尊渡假赌尊渡假赌尊渡假赌
Hello Kitty Island Adventure: How To Get Giant Seeds
1 months agoBy尊渡假赌尊渡假赌尊渡假赌

Hot Tools

SublimeText3 English version

SublimeText3 English version

Recommended: Win version, supports code prompts!

Safe Exam Browser

Safe Exam Browser

Safe Exam Browser is a secure browser environment for taking online exams securely. This software turns any computer into a secure workstation. It controls access to any utility and prevents students from using unauthorized resources.

Zend Studio 13.0.1

Zend Studio 13.0.1

Powerful PHP integrated development environment

DVWA

DVWA

Damn Vulnerable Web App (DVWA) is a PHP/MySQL web application that is very vulnerable. Its main goals are to be an aid for security professionals to test their skills and tools in a legal environment, to help web developers better understand the process of securing web applications, and to help teachers/students teach/learn in a classroom environment Web application security. The goal of DVWA is to practice some of the most common web vulnerabilities through a simple and straightforward interface, with varying degrees of difficulty. Please note that this software

mPDF

mPDF

mPDF is a PHP library that can generate PDF files from UTF-8 encoded HTML. The original author, Ian Back, wrote mPDF to output PDF files "on the fly" from his website and handle different languages. It is slower than original scripts like HTML2FPDF and produces larger files when using Unicode fonts, but supports CSS styles etc. and has a lot of enhancements. Supports almost all languages, including RTL (Arabic and Hebrew) and CJK (Chinese, Japanese and Korean). Supports nested block-level elements (such as P, DIV),