At a developer meeting at Facebook headquarters, engineers from the social networking giant revealed that they are using a new self-developed query engine, Presto, to conduct interactive analysis on the existing massive 250PB data warehouse.
According to engineer Martin Traverso, more than 850 Facebook engineers use it to scan more than 320TB of data every day. In the past, our scientists and analysts have relied on Hive for data analysis. But Hive is designed for batch processing. But with more and more data, Hive can no longer meet our needs. While we have other tools that are faster than Hive, they are either limited in functionality or too simple to operate our massive data warehouse. And over the past few months, we've been using Presto to fill this gap.
Hive is a data warehouse tool created by Facebook specifically for Hadoop a few years ago. Because it mainly relies on MapReduce for operation, as it ages, its speed can no longer meet the growing data requirements. Browsing through a complete data set could take anywhere from minutes to hours, which is simply impractical.
Traverso also said that simple queries with Presto only take a few hundred milliseconds, and even very complex queries only take minutes to complete. It runs in memory and does not write to disk.
While it may look like Presto is Facebook's version of the Cloudera Impala SQL query engine, or similar to what Hortonworks is doing with Project Stinger, this is a version customized for faster operations at Facebook's scale. Presto won't compete with other commercial products, but it will soon shake up the big data industry. And Facebook plans to release Presto as open source this fall.
Ravi Murthy, engineering manager at Facebook, said that as the number of users continues to grow, the data warehouse is also growing rapidly. It is 4,000 times larger than four years ago. Murthy also said that in the next few years, data will reach exabytes. So in order to accommodate this kind of data scale, we had to rethink a lot of things.
Presto is one of them. In addition to improving query speed, this engine is 7 times more efficient than Hive in terms of CPU usage efficiency. Another ongoing project is shrinking the analytics space in Facebook's data centers.
What do the experts on Weibo think of Presto, the latest query engine launched by Facebook?
Big Data Pi Dong, former head of the Big Data Laboratory of EMC China Research Institute : Facebook’s latest interactive big data query system Presto, similar to Cloudera’s Impala and Hortonworks’ Stinger, solves Facebook’s rapidly expanding massive data warehouse Quickly check requirements. Facebook is developing a new generation of big data system for Exabyte scale data. Presto is one of the data warehouse interactive query systems and should also have a mass storage system. At this level, there's a lot of design to consider!
Sina CTO and Co-President Jack Xu Liangjie: Social networks and social media have given birth to a real big data (Big Data) platform. Sina Weibo is no exception...
vinW, a computer and linguistics researcher at the University of Leeds, UK, and a postdoctoral researcher on the search project: 1. Presto will be open source in the autumn; 2. Seven times faster than hive; 3. Based on memory
Launch_Bruce: FaceBook is not a search engine and has higher requirements for real-time performance. Even if Hive was initially launched, it could only be a temporary measure. This is the gene of Hadoop. Hadoop will definitely make many projects that are launched blindly without in-depth thinking difficult in the end. But obviously Hadoop's successful ecosystem will also harm many people.
TeslaElon: Come on! Big Data will generate many business opportunities. In particular, potential cooperation with Alibaba, the largest e-commerce platform, and YOKU, the largest video platform, are worth looking forward to. In addition, Sina has invested in many popular applications on Weibo and has many opportunities. We will see how Sina does well in R&D, management and sales later.
Henry, who carries big data: We were doing big data analysis about five years ago, and our MPP product already had these strategies. At that time, the biggest problem was big data in the Internet, but these star companies did not like to spend money to buy but only loved to build wheels. It's better for telecom customers, who are willing to spend money to purchase rather than reinvent the wheel.
English from: gigaom.com

Facebook是一个全球知名的社交媒体平台,它为用户提供了一个连接和交流的平台。成立于2004年,由马克·扎克伯格等人创建。它是一个在线社交网络,用户可以在上面与朋友、家人和同事分享信息、照片和视频,并与他们进行互动,它的影响力不仅限于个人用户,还扩展到了企业和新闻领域。

facebook的意思是一个美国的互联网公司,也指的是该公司开发的社交网络服务网站;facebook创立于2004年2月4日,是一款世界著名的社交通讯类应用,用户能够使用这款应用与世界各地的人们进行交流。

近期,作者发现了Facebook安卓APP应用的一个深度链接漏洞,利用该漏洞,可以把用户手机上安装的Facebook安卓APP应用转变成后门程序(Backdoor),实现后门化。另外,利用该漏洞还可以重打包FacebookAPP,并将其发送给特定目标受害者安装使用。下面就来看看作者对该漏洞的发现过程,以及如何通过Payload构造,最终将其转化为FacebookAPP实际生产环境中的安全隐患。漏洞发现通常做众测时,我会先认真了解目标系统的应用机制。在我的上一篇博客中,我已经分享了通过解析Face

正文Facebook嵌入式视频播放器API是JavaScript版FacebookSDK提供的客户端功能。可以在自己网站上播放Facebook视频。开始使用先引入FacebookSDK封装成组件FacebookPlayerimport{onMounted,onBeforeUnmount}from"vue";constprops=defineProps({id:{type:String,default:""},src:{type:String,require

随着Web开发技术的不断发展,开发人员也面临着越来越复杂的业务场景和需求。例如,高并发、大量请求处理、异步任务处理等问题都需要使用高性能的工具和技术来解决。在这种情况下,Swoole成为了一种越来越重要的解决方案。Swoole是一种基于PHP语言的高性能异步网络通信框架。它提供了一些非常有用的功能和特性,例如异步IO、协程、进程管理、定时器和异步客户端,使得

当尝试在浏览器中使用Facebook Messenger进行视频通话时,有时会抱怨硬件访问错误。弹出错误,说“您需要允许在PC上进行视频通话的麦克风和摄像头访问”。这可能是因为您可能希望在计算机上启用摄像头和麦克风访问,或者因为过时的驱动程序、冲突的设备、站点权限问题等。在本文中,我们有不同的解决方案来解决此错误。让我们开始吧!方法 1:允许桌面应用访问你的相机和麦克风步骤1:使用Windows + I键一起打开Windows设置第 2 步:点击隐私第三步: 在左侧,向下滚动并单击麦克风步骤4:

Swoole中如何高效使用协程?协程是一种轻量级的线程,可以在同一个进程内并发执行大量的任务。Swoole作为一个高性能的网络通信框架,对协程提供了支持。Swoole的协程不仅仅是简单的协程调度器,还提供了很多强大的功能,如协程池、协程原子操作,以及各种网络编程相关的协程封装等等,这些功能都可以帮助我们更高效地开发网络应用。在Swoole中使用协程有很多好处

facebook的canvas优势有高度可定制、强大的功能、良好的性能、跨平台兼容性、丰富的生态系统等等。详细介绍:1、高度可定制,Facebook Canvas允许开发人员完全自定义应用程序的外观和行为,包括界面、导航栏、状态栏等;2、强大的功能,Facebook Canvas提供了许多内置功能,如访问Facebook用户数据、使用Facebook登录、分享内容等等。


Hot AI Tools

Undresser.AI Undress
AI-powered app for creating realistic nude photos

AI Clothes Remover
Online AI tool for removing clothes from photos.

Undress AI Tool
Undress images for free

Clothoff.io
AI clothes remover

AI Hentai Generator
Generate AI Hentai for free.

Hot Article

Hot Tools

MantisBT
Mantis is an easy-to-deploy web-based defect tracking tool designed to aid in product defect tracking. It requires PHP, MySQL and a web server. Check out our demo and hosting services.

Atom editor mac version download
The most popular open source editor

Dreamweaver Mac version
Visual web development tools

Notepad++7.3.1
Easy-to-use and free code editor

SublimeText3 English version
Recommended: Win version, supports code prompts!
