


produced | 51CTO Technology Stack (WeChat ID: blog51cto)
Google is having a bit of a bad year.
In the past two days, search engines have provided information about the "AI Overviews" feature that often provides seriously incorrect search results information, such as absurdly suggesting that users use glue to Prevents cheese from sliding off pizza. In this regard, CEO Pichai also had to admit that this was caused by the illusion of the large language model, and there is currently no solution.
An internal document of Google search engine was recently leaked, which may show the operating mechanism of Google search engine to the public for the first time. This article was first published here Google has yet to issue an official response to the leak and has not disputed the authenticity of the documents.
The details of how Google, the most famous search engine on the Internet today, ranks websites have long been a mystery. This exposure provides a new perspective, giving us a glimpse into Google's highly confidential search algorithm system, and how its operating mechanisms complement Google's previous public statements.
1.2500 pages of leaked documents
Google’s search algorithm is perhaps the most influential system on the Internet. It determines the survival of websites and the presentation of online content. However, the specific details of how Google ranks websites have always been a "black box". Although there have been various speculations by the media, researchers, and people engaged in search engine optimization, these are just blind people trying to figure out the elephant. We never see the complete puzzle.
Now, according to foreign media The Verge, this explosive leak seems to have unveiled the mystery behind the search function for the first time, and hints that Google has not been completely honest over the years. publicly disclose how it operates. Google has so far not responded to multiple requests for comment about the authenticity of the documents.
Rand Fishkin, who has been working in SEO for more than ten years, is the protagonist of this incident. He revealed that a source shared 2,500 pages of documents with him in the hope of exposing Google’s external “lies” about how its search algorithm works.
According to Fishkin, the documents outline Google's search API and break down the information provided to employees. The details Fishkin shares are complex and technical, and may be easier for developers and SEO experts to understand than the average person.
Leaks by themselves do not necessarily prove that Google uses specific data and signals for search rankings. Instead, the leaked documents outline what data Google collects from web pages, sites and searchers, and indirectly provide SEO experts with clues about Google's focus.
2.Contradicts Google’s public statements
"It may be too serious to say 'lying,' but in this case, it is the most appropriate term," Mike King expressed it this way: "I understand that Google's public relations people are trying to What I can’t accept is that they demean those who find and question Google in the fields of marketing, technology and journalism. has not yet responded to The Verge’s request for comment involving the documents, which included a direct request to rebut the authenticity of the documents. Fishkin said in an email to The Verge that Google did not dispute the authenticity of the leak, but that an employee asked him to change some of the wording in his post about an incident.
Google’s secretive search algorithm has spawned an industry of marketers who follow Google’s public guidelines and implement SEO strategies for millions of companies around the world. However, these widely used methods have gradually made people generally feel that Google's search results are deteriorating and full of spam information.
Website operators feel compelled to produce this type of content in order to get their sites seen. But in the face of such doubts, Google's external spokesperson will always come up with a familiar set of rhetoric: Our guidelines do not indicate this.
But some details in the leaked documents cast doubt on the accuracy of Google’s public statements about how its search feature works.
#One example cited by Fishkin and Mike King is whether Google uses Chrome data in rankings. Google representatives have repeatedly stated that Chrome data is not used to rank pages, but Chrome is specifically mentioned in a section about how sites appear in searches.
Picture
In the screenshot above, according to the document, below the main vogue.com URL Some of the links that appear may have been created using Chrome data.
Another issue that has attracted attention is the role that E-A-T (expertise, authority and trustworthiness) plays in the rankings. As we all know, E-A-T has been the cornerstone of Google’s search quality assessment guidelines for many years.
Google representatives have previously stated that E-A-T is not a ranking factor. Fishkin noted that he didn't find many direct references to E-A-T in the documents.
Also, Google representatives have previously insisted that attribution is something website owners should do for readers, not Google, because it doesn't affect rankings. But that doesn't seem to be the case.
Mike King detailed how Google collects author data for pages, noting that there is a field in the file used to identify whether an entity is an author, although this field is mainly designed for news articles , but also covers other content such as scientific articles. While this doesn't confirm that attribution is an explicit ranking factor, it does suggest that Google is at least tracking this attribute closely.
3. Search algorithm innovation, the Internet ecosystem has "changed" since then
Although these documents are not conclusive evidence, they provide an in-depth and unfiltered The perspective allows us to get a glimpse of this highly confidential black box system.
In fact, in the past two years, Google search has experienced a series of major updates, some of which are even unprecedented disruptive updates. For example, mentioned at the beginning of this article, the much-criticized “AI Overview” function is one of the most representative innovations.
At the beginning of the change, Pichai, the head of Google, said that in the future, Google search will provide self-generated AI answers to many of your questions, and expressed strong support for this product function. confidence.
A Google spokesperson told the BBC that the company will only roll out search changes after rigorous testing to confirm that the changes will benefit users, and that the company provides help to website owners. , resources and the opportunity for feedback on their search rankings.
But reality always deviates from the ideal.
Whether it is the "fatal hallucination" about the AI overview function or the "inconsistent" information conveyed in this suspected leaked document, it is causing people to have doubts about Google. Search with suspicion and vigilance.
Looking back at the entire history of the development of the Internet, no company has changed the way most people on this blue star obtain information like Google, but has also reshaped the way content is created and distributed. pattern.
Using generative AI to support search as an example, Google seems to be aiming to connect users and information more efficiently through these technological innovations and improve the overall quality of the search experience.
But in fact, as critics say, this shift may exacerbate information homogeneity and reduce the depth and breadth of users exploring the web as they increasingly rely on Google to directly The short answer provided instead of visiting the source website yourself. This may not only weaken the visibility and profit model of independent websites and blogs, but may also affect the health and diversity of the online ecosystem, limiting users’ opportunities for exposure to diverse viewpoints and in-depth analysis.
For search players as powerful as Google, perhaps the only way to ensure that search algorithm optimization can not only serve the public but not destroy the ecological cornerstones that contribute high-quality content to the Internet is It is the foundation for long-term development.
Reference link:
https://www.theverge.com/2024/5/28/24166177/google-search-ranking-algorithm-leak-documents -link-seo
https://www.php.cn/link/c30ca4400db3c72274c8ad819f688c21
To learn more about AIGC, please visit:
51CTO AI.x Community
https://www.51cto.com/aigc/
The above is the detailed content of 2,500 pages of algorithm documents leaked! The most powerful black box in search history is exposed, will Google overturn and upgrade again?. For more information, please follow other related articles on the PHP Chinese website!

谷歌三件套指的是:1、google play商店,即下载各种应用程序的平台,类似于移动助手,安卓用户可以在商店下载免费或付费的游戏和软件;2、Google Play服务,用于更新Google本家的应用和Google Play提供的其他第三方应用;3、谷歌服务框架(GMS),是系统软件里面可以删除的一个APK程序,通过谷歌平台上架的应用和游戏都需要框架的支持。

中国不卖google手机的原因:谷歌已经全面退出中国市场了,所以不能在中国销售,在国内是没有合法途径销售。在中国消费市场中,消费者大都倾向于物美价廉以及功能实用的产品,所以竞争实力本就因政治因素大打折扣的谷歌手机主体市场一直不在中国大陆。

虽然谷歌早在2020年,就在自家的数据中心上部署了当时最强的AI芯片——TPU v4。但直到今年的4月4日,谷歌才首次公布了这台AI超算的技术细节。论文地址:https://arxiv.org/abs/2304.01433相比于TPU v3,TPU v4的性能要高出2.1倍,而在整合4096个芯片之后,超算的性能更是提升了10倍。另外,谷歌还声称,自家芯片要比英伟达A100更快、更节能。与A100对打,速度快1.7倍论文中,谷歌表示,对于规模相当的系统,TPU v4可以提供比英伟达A100强1.

2015 年,谷歌大脑开放了一个名为「TensorFlow」的研究项目,这款产品迅速流行起来,成为人工智能业界的主流深度学习框架,塑造了现代机器学习的生态系统。从那时起,成千上万的开源贡献者以及众多的开发人员、社区组织者、研究人员和教育工作者等都投入到这一开源软件库上。然而七年后的今天,故事的走向已经完全不同:谷歌的 TensorFlow 失去了开发者的拥护。因为 TensorFlow 用户已经开始转向 Meta 推出的另一款框架 PyTorch。众多开发者都认为 TensorFlow 已经输掉

前几天,谷歌差点遭遇一场公关危机,Bert一作、已跳槽OpenAI的前员工Jacob Devlin曝出,Bard竟是用ChatGPT的数据训练的。随后,谷歌火速否认。而这场争议,也牵出了一场大讨论:为什么越来越多Google顶尖研究员跳槽OpenAI?这场LLM战役它还能打赢吗?知友回复莱斯大学博士、知友「一堆废纸」表示,其实谷歌和OpenAI的差距,是数据的差距。「OpenAI对LLM有强大的执念,这是Google这类公司完全比不上的。当然人的差距只是一个方面,数据的差距以及对待数据的态度才

由于可以做一些没训练过的事情,大型语言模型似乎具有某种魔力,也因此成为了媒体和研究员炒作和关注的焦点。当扩展大型语言模型时,偶尔会出现一些较小模型没有的新能力,这种类似于「创造力」的属性被称作「突现」能力,代表我们向通用人工智能迈进了一大步。如今,来自谷歌、斯坦福、Deepmind和北卡罗来纳大学的研究人员,正在探索大型语言模型中的「突现」能力。解码器提示的 DALL-E神奇的「突现」能力自然语言处理(NLP)已经被基于大量文本数据训练的语言模型彻底改变。扩大语言模型的规模通常会提高一系列下游N

让一位乒乓球爱好者和机器人对打,按照机器人的发展趋势来看,谁输谁赢还真说不准。机器人拥有灵巧的可操作性、腿部运动灵活、抓握能力出色…… 已被广泛应用于各种挑战任务。但在与人类互动紧密的任务中,机器人的表现又如何呢?就拿乒乓球来说,这需要双方高度配合,并且球的运动非常快速,这对算法提出了重大挑战。在乒乓球比赛中,首要的就是速度和精度,这对学习算法提出了很高的要求。同时,这项运动具有高度结构化(具有固定的、可预测的环境)和多智能体协作(机器人可以与人类或其他机器人一起对打)两大特点,使其成为研究人

ChatGPT在手,有问必答。你可知,与它每次对话的计算成本简直让人泪目。此前,分析师称ChatGPT回复一次,需要2美分。要知道,人工智能聊天机器人所需的算力背后烧的可是GPU。这恰恰让像英伟达这样的芯片公司豪赚了一把。2月23日,英伟达股价飙升,使其市值增加了700多亿美元,总市值超5800亿美元,大约是英特尔的5倍。在英伟达之外,AMD可以称得上是图形处理器行业的第二大厂商,市场份额约为20%。而英特尔持有不到1%的市场份额。ChatGPT在跑,英伟达在赚随着ChatGPT解锁潜在的应用案


Hot AI Tools

Undresser.AI Undress
AI-powered app for creating realistic nude photos

AI Clothes Remover
Online AI tool for removing clothes from photos.

Undress AI Tool
Undress images for free

Clothoff.io
AI clothes remover

AI Hentai Generator
Generate AI Hentai for free.

Hot Article

Hot Tools

PhpStorm Mac version
The latest (2018.2.1) professional PHP integrated development tool

DVWA
Damn Vulnerable Web App (DVWA) is a PHP/MySQL web application that is very vulnerable. Its main goals are to be an aid for security professionals to test their skills and tools in a legal environment, to help web developers better understand the process of securing web applications, and to help teachers/students teach/learn in a classroom environment Web application security. The goal of DVWA is to practice some of the most common web vulnerabilities through a simple and straightforward interface, with varying degrees of difficulty. Please note that this software

SecLists
SecLists is the ultimate security tester's companion. It is a collection of various types of lists that are frequently used during security assessments, all in one place. SecLists helps make security testing more efficient and productive by conveniently providing all the lists a security tester might need. List types include usernames, passwords, URLs, fuzzing payloads, sensitive data patterns, web shells, and more. The tester can simply pull this repository onto a new test machine and he will have access to every type of list he needs.

Safe Exam Browser
Safe Exam Browser is a secure browser environment for taking online exams securely. This software turns any computer into a secure workstation. It controls access to any utility and prevents students from using unauthorized resources.

MinGW - Minimalist GNU for Windows
This project is in the process of being migrated to osdn.net/projects/mingw, you can continue to follow us there. MinGW: A native Windows port of the GNU Compiler Collection (GCC), freely distributable import libraries and header files for building native Windows applications; includes extensions to the MSVC runtime to support C99 functionality. All MinGW software can run on 64-bit Windows platforms.
