3-minute overview of Huang Renxun's GTC speech: The strongest AI chips, NIM microservices, etc
我放心你带套猛
发表于 2024-3-19 22:40:31
239
0
0
On Tuesday morning Beijing time, the most important agenda item at NVIDIA GTC 2024 conference, the keynote speech by founder and CEO Huang Renxun, had just ended. As expected by the market, the global capital market has just seen new computing chips/servers, as well as a bunch of AI software applications.
As usual, as of the time of publication, Nvidia has released 40 press releases since Huang Renxun began speaking. This article will focus on summarizing some key developments this morning for investors to refer to.
Larger GPU - Blackwell architecture as scheduled
Although the entire market knew beforehand that a new flagship computing power GPU would be introduced today, Huang Renxun did not explain the name clearly in his speech - only stating the launch of a larger and stronger Blackwell architecture GPU, which caused media chaos at one point. But according to the data on the official website, today Lao Huang should be holding a B200 chip, and the website also lists the existence of a B100 chip in the Blackwell architecture. Nvidia has not disclosed the selling price, only stating that it will ship to its partners within the year.
Nvidia has disclosed that the new B200 chip has 208 billion transistors and is manufactured using TSMC's customized 4NP process. It is worth mentioning that this chip connects two dies into a unified GPU, and the communication speed between dies can reach 10TB/s. As expected, this chip uses 192GB of HBM3E memory.
The GB200 Grace Blackwell superchip is a combination of two B200 chips (four dies) and a Grace CPU. Compared to H100, the performance of the large language model has been improved by 30 times, while the energy consumption is only one 25th.
Lao Huang gave an example in his speech that to train a GPT model with 1.8 trillion parameters, it would require 8000 Hopper GPUs, consume 15 megawatts of electricity, and run continuously for 90 days. But if using the GB200 Blackwell GPU, only 2000 cards are needed, and running for 90 days also consumes only a quarter of the electricity. Not only training, but also the cost of generating tokens will be significantly reduced.
In conjunction with this new set of chips, Nvidia has also launched the fifth generation of new NVLink chips, as well as a series of products such as GB200 NVL72 servers, X800 series network switches, and the next-generation artificial intelligence supercomputer NVIDIA DGX SuperPOD.
New way to develop software: NIM microservices
After discussing the hardware updates, Huang Renxun also devoted the remaining time to the software ecosystem. In addition to the digital twin of Earth's climate and pharmaceutical development AI, Nvidia has also launched a series of "microservices" in AI Enterprise 5.0, including simplifying NIM for enterprises to deploy AI models into production environments.
Huang Renxun said, "In the future, companies will no longer need to write software, but will assemble AI models, present tasks to them, provide examples of work products, review plans, and intermediate results."
Nvidia stated that NIM microservices simplify the deployment process of AI models by packaging algorithms, optimizing systems and operations, and adding industry standard APIs. This allows developers to integrate NIM into existing applications and infrastructure without the need for extensive customization or expertise.
Digital twin support for Vision Pro
Nvidia also announced on Monday that Omniverse Cloud now allows developers to stream their industrial scenes from content creation applications to Nvidia's Graphics Delivery Network (GDN), allowing advanced 3D experiences to be transmitted to Apple Vision Pro.
This new workflow combines the high-resolution display of Apple Vision Pro with Nvidia's cloud rendering to provide a spatial computing experience with only devices and Internet connections.
There are also many scattered official announcements
In the semiconductor field, Nvidia announced that TSMC and Synopsys will invest Nvidia's computing lithography platform CuLitho in the production of advanced chips.
In the telecommunications field, Huang Renxun announced a research cloud called NVIDIA 6G, which is a platform driven by generative artificial intelligence and Omniverse technology, aimed at promoting the development of the next generation of communication.
In the field of transportation, BYD, the world's largest electric vehicle company, will adopt NVIDIA's centralized in vehicle computing platform DRIVE Thor to develop the next generation of electric vehicles. In addition, BYD will also use Nvidia's infrastructure for autonomous driving model training, as well as Nvidia Isaac to design/simulate intelligent factory robots.
Robots are also the final stage of the entire speech. Huang Renxun announced multiple software programs to assist in the development of robot technology. This includes the Isaac Perceptor software development toolkit, which involves multi camera visual mileage measurement, 3D reconstruction, and depth perception. There is also Isaac Manipulator - a library for robot arm perception, path planning, and kinematic control. Finally, he also announced a project called GR00T, which is a universal foundational model for humanoid robots aimed at driving the company's breakthroughs in robotics technology and embodied intelligence.
Accompanied by a pair of Disney robots Orange and Green using Nvidia Jetson chips, the entire press conference came to an end.
CandyLake.com 系信息发布平台,仅提供信息存储空间服务。
声明:该文观点仅代表作者本人,本文不代表CandyLake.com立场,且不构成建议,请谨慎对待。
声明:该文观点仅代表作者本人,本文不代表CandyLake.com立场,且不构成建议,请谨慎对待。
猜你喜欢
- US media reports: Biden administration considers further measures to regulate Chinese chips, expected to be announced as early as next week
- Microchip Technology suspends application for chip bill related subsidies
- Gurman announces Apple's' revolutionary 'breakthrough, with self-developed modem chip to be released next year
- It is reported that TSMC's 2nm chip production yield has reached over 60%, and it is expected to enter the large-scale production stage next year
- TSMC reports that N2P and N2X IP are ready for customers to design performance enhanced 2nm chips
- Google launches breakthrough quantum chip
- Chip giant, skyrocketing by 24%!
- The controversial chip license case between Arm and Qualcomm has begun
- Can Broadcom's customized AI chip challenge Nvidia with a market value exceeding trillions of dollars?
- Texas Instruments receives $1.6 billion in chip subsidies from the United States
-
隔夜株式市場 世界の主要指数は金曜日に多くが下落し、最新のインフレデータが減速の兆しを示したおかげで、米株3大指数は大幅に回復し、いずれも1%超上昇した。 金曜日に発表されたデータによると、米国の11月のPC ...
- SNT
- 前天 12:48
- 支持
- 反对
- 回复
- 收藏
-
長年にわたって、昔の消金大手の捷信消金の再編がようやく地に着いた。 天津銀行の発表によると、同行は京東傘下の2社、対外貿易信託などと捷信消金再編に参加する。再編が完了すると、京東の持ち株比率は65%に達し ...
- SNT
- 前天 12:09
- 支持
- 反对
- 回复
- 收藏
-
【GPT-5屋台で大きな問題:数億ドルを燃やした後、OpenAIは牛が吹くのが早いことを発見した】OpenAIのGPT-5プロジェクト(Orion)はすでに18カ月を超える準備をしており、関係者によると、このプロジェクトは現在進 ...
- SNT
- 半小时前
- 支持
- 反对
- 回复
- 收藏
-
【ビットコインが飛び込む!32万人超の爆倉】データによると、過去24時間で世界には32万7000人以上の爆倉があり、爆倉の総額は10億ドルを超えた。
- 断翅小蝶腥
- 3 天前
- 支持
- 反对
- 回复
- 收藏