


Amazon is making every effort to defend its leadership in cloud computing. On the one hand, they upgraded their own cloud chips and launched Amazon's version of GPT, an artificial intelligence chatbot; on the other hand, they also deepened their cooperation with NVIDIA, launched new services based on NVIDIA chips, and jointly developed them with NVIDIA supercomputer
Dave Brown, vice president of AWS, said that by focusing the design of self-developed chips on actual workloads that are important to customers, AWS can provide them with the most advanced cloud infrastructure. The Graviton 4 launched this time is the fourth generation chip product within five years. As people’s interest in generative AI rises, the second generation AI chip Trainium 2 will help customers train themselves faster at lower cost and higher energy efficiency. machine learning model.
Graviton4 computing performance is improved by up to 30% compared to the previous generation
On Tuesday, November 28th, Eastern Time, Amazon’s cloud computing business AWS announced the launch of a new generation of AWS self-developed chips. Among them, the computing performance of the general-purpose chip Graviton4 is up to 30% higher than the previous generation Graviton3, with a 50% increase in cores and a 75% increase in memory bandwidth, thus providing the highest cost performance and energy utilization on the Amazon cloud server hosting service Amazon Elastic Compute Cloud (EC2) Effect.
Graviton4 improves security with full encryption of all high-speed physical hardware interfaces. AWS said Graviton4 will be available on memory-optimized Amazon EC2 R8g instances to help customers improve the execution of high-performance database, in-memory cache, and big data analytics workloads. R8g instances offer larger instance sizes with up to three times more vCPUs and three times more memory than previous generation R7g instances
In the next few months, it is planned to launch computers equipped with Graitons4. AWS said that in the five years since the launch of the Garviton project, more than 2 million Garviton processors have been produced, and the first 100 users of AWS EC2 have chosen to use Graviton
Trainium2 is four times faster and can train models with trillions of parameters
AWS has launched a new generation of AI chips called Trainium2, which is four times faster than the previous generation Trainium1. Trainium2 can deploy up to 100,000 chips in EC2 UltraCluster, enabling users to train base models (PM) and large language models (LLM) with trillions of parameters in a short time. Compared with the previous generation, Trainium2’s energy utilization has increased by two times
Trainium2 will be used on Amazon EC2 Trn2 instances, each containing 16 Trainium chips. Trn2 instances are designed to help customers scale the number of chip applications in next-generation EC2 UltraCluster, up to 100,000 Trainium2 chips, and provide up to 65 Execute computing power through petabyte-scale network connections through AWS Elastic Fabrication Adapters (EFA)
According to AWS, Trainium2 will be used to support new services starting next year
The first major customer, DGX Cloud, uses the upgraded version of Grace Hopper GH200 NVL32, which is the fastest GPU-driven AI supercomputer
During the annual conference re:Invent, AWS and NVIDIA announced on Tuesday an expanded strategic cooperation to provide state-of-the-art infrastructure, software and services to promote customers' generative AI innovation. This cooperation not only involves self-developed chips, but also includes cooperation in other fields
AWS will become the first cloud service provider to use the new multi-node NVLink technology NVIDIA H200 Grace Hopper super chip in the cloud. In other words, AWS will become the first important customer of the upgraded version of Grace Hopper
NVIDIA’s H200 NVL32 multi-node platform uses 32 Grace Hopper chips with NVLink and NVSwitch technology in a single instance. The platform will be used on Amazon EC2 instances connected to Amazon Network EFA and is powered by advanced virtualization (AWS Nitro System) and ultra-scale clusters (Amazon EC2 UltraClusters), allowing joint Amazon and Nvidia customers to scale deployments into the thousands. Designed H200 chip
NVIDIA and AWS will collaborate to host NVIDIA’s AI training-as-a-service DGX Cloud on AWS. This will be the first DGX cloud to feature the GH200 NVL32, providing developers with a single instance with maximum shared memory. AWS’s DGX Cloud will advance cutting-edge generative AI and training of large language models with over 1 trillion parameters
Nvidia and AWS are collaborating on a project called Ceiba to design the world’s fastest GPU-powered AI supercomputer. Powered by GH200 NVL32 and Amazon EFA's interconnect technology, this computer is a massive system. It is equipped with 16,384 GH200 super chips and has 65 exaflops of AI processing power. NVIDIA plans to use it to drive the next wave of generative AI innovation
The preview version of Amazon Q, the enterprise customer robot, is now online and can help developers develop applications on AWS
In addition to providing chips and cloud services, AWS also released a preview version of an AI chatbot called Amazon Q. Amazon Q is a new type of digital assistant that uses generative AI technology to work based on the business needs of enterprise customers. It helps enterprise customers search for information, write code and review business metrics
Q has received some training on code and documentation within AWS, which can be used by developers in the AWS cloud.
Developers can use Q to create applications on AWS, research best practices, correct errors, and get help writing new features for applications. Users can interact with Q through conversational Q&A to learn new knowledge, research best practices, and understand how to build applications on AWS without leaving the AWS console
Amazon will add Q to programs for enterprise intelligence software, call center workers and logistics management. AWS says customers can customize Q based on company data or personal profiles
Conversational Q&A is currently available in preview in all enterprise regions provided by AWS
The above is the detailed content of Amazon strives to defend its cloud status, upgrades its self-developed AI chips, releases chat robot Q, and is the first to use Nvidia's new generation super chip. For more information, please follow other related articles on the PHP Chinese website!

周一接受VentureBeat采访时,亚马逊AWS数据与人工智能副总裁斯瓦米·西瓦苏布拉曼尼亚表示,他负责监管AWS所有的数据库、分析、机器学习和人工智能服务,并简述了周三上午的主旨演讲和周二上午AWS首席执行官亚当·塞利普斯基的主旨演讲他说,围绕GenAI的主要主题是,企业希望有灵活性和选择来使用来自不同供应商的不同模型,而不是被锁定在单一供应商或平台上,然而,他补充说,这些车型本身可能不足以提供竞争优势,因为随着时间的推移,它们可能会变得商品化,因此,企业的关键优势将是他们自己的专有数据,以

最近,国外几家久负盛名的科技巨头展示了他们的AI雄心。例如苹果举办WWDC23,微软召开Build23,就连谷歌也在2月份举办了搜索业务大会。这些巨头们的动作,无疑彰显了生成式人工智能(AIGC)的崛起,也带动了一批此前对人工智能不感兴趣的团队、机构。现在这些大型科技公司全力押注人工智能,值得注意的几个标志是:GoogleAI、MicrosoftCopilot、Apple机器学习以及OpenAI追求通用人工智能。苹果的机器学习苹果公司似乎对人工智能这个词“不感冒”。在今年的WWDC上,只字未提“

本站10月9日消息,亚马逊云科技全球销售、市场和服务高级副总裁MattGarman对内宣布了大中华区领导人变更,储瑞松将接任张文翊担任亚马逊全球副总裁、亚马逊云科技大中华区执行董事,张文翊将会有新的职务任命。储瑞松公开资料显示,加入亚马逊云科技之前,储瑞松担任百度集团副总裁接近四年之久,是百度高层管理团队之一,负责领导百度阿波罗智能汽车业务,于今年7月份离职。在此之前,他的大部分职业生涯都在SAP度过,担任过工程、战略、业务开发等多个领导职位,并最终负责S/4HANA云产品全球研发,领导遍布德国

本站11月21日消息,顺丰推出旺季“晚到必赔”增值服务,主要面向使用亚马逊线上“购买配送”服务的卖家,11月15日-12月31日活动期间使用顺丰国际电商专递-优先CD或国际电商专递-CD产品,由中国发至美国流向即可享受9折运费优惠,并承诺分别于10个工作日和13个工作日内送达,晚到必赔,按比例赔付运费,单票最高赔偿金额为300元人民币。我们注意到,快递的揽收城市包括深圳、佛山、广州、东莞、厦门、福州、泉州、杭州、上海、苏州、合肥、莆田、新乡、南京、南昌国际电商专递-CD(E-CommerceEx

9月6日消息,亚马逊近日宣布了一项重要的决定,将在今年10月停止支持其在苹果macOS平台上的Kindle应用,并计划推出一款全新的Kindle应用程序。这一决策引起了广泛关注,因为它标志着亚马逊将迈向一个全新的阅读应用时代。亚马逊的原始Kindle应用已经在苹果macOS平台上存在了整整8年,为Mac用户提供了便捷的电子书阅读体验。然而,随着技术的不断发展,亚马逊决定升级他们的阅读应用,以适应现代需求。根据亚马逊的官方消息,新的Kindle应用将基于ReactNative开发,这将为用户带来更

北京时间8月4日早间消息,亚马逊今日发布了截至6月底的2023年第二季度财报。报告显示,亚马逊第二季度净销售额为1344亿美元(当前约9649.92亿元人民币),与上年同期的1212亿美元相比增长11%。净利润为67.50亿美元(当前约484.65亿元人民币),而上年同期净亏损20.28亿美元。相比之下,37位分析师平均预计,亚马逊第二季度营收将达到1315亿美元。财报显示,亚马逊第二季度营收为1344亿美元,高于分析师预期。36位分析师平均预计,亚马逊第二季度每股摊薄收益将达到0.35美元。财

亚马逊打不开的原因有网络连接问题、DNS解析问题、防火墙或代理问题、浏览器问题、地理位置限制、亚马逊服务器问题等。其解决办法:1、检查您的网络连接是否正常,确认您的设备已连接到互联网;2、可以尝试刷新DNS缓存或更改使用其他DNS服务器来解决此问题;3、尝试禁用防火墙或代理,或者调整其设置以允许访问亚马逊;4、尝试清除浏览器缓存、禁用插件或尝试使用其他浏览器来访问亚马逊等等。

亚马逊是一家网络电商平台,该公司位于华盛顿州的西雅图,是网络上最早开始经营电子商务的公司之一;亚马逊是贝佐斯在1995年创立的,亚马逊平台主要营收是通过收取15%的交易佣金,亚马逊在平台上制定各种规则与制度都是为了提高交易量。


Hot AI Tools

Undresser.AI Undress
AI-powered app for creating realistic nude photos

AI Clothes Remover
Online AI tool for removing clothes from photos.

Undress AI Tool
Undress images for free

Clothoff.io
AI clothes remover

AI Hentai Generator
Generate AI Hentai for free.

Hot Article

Hot Tools

EditPlus Chinese cracked version
Small size, syntax highlighting, does not support code prompt function

VSCode Windows 64-bit Download
A free and powerful IDE editor launched by Microsoft

Dreamweaver Mac version
Visual web development tools

MinGW - Minimalist GNU for Windows
This project is in the process of being migrated to osdn.net/projects/mingw, you can continue to follow us there. MinGW: A native Windows port of the GNU Compiler Collection (GCC), freely distributable import libraries and header files for building native Windows applications; includes extensions to the MSVC runtime to support C99 functionality. All MinGW software can run on 64-bit Windows platforms.

PhpStorm Mac version
The latest (2018.2.1) professional PHP integrated development tool