Robin Lee's internal speech exposes that the open source model is not efficient enough to solve the problem of computing power
白云追月素
发表于 2024-9-11 17:15:52
197
0
0
"There are quite a lot of misunderstandings about the big model outside." Recently, an internal speech by Robin Lee was exposed. Robin Lee believes that the gap between big models may become larger and larger in the future. He further explained that the "ceiling" of the large model is very high, and it is still far from the ideal situation, so the model needs to be constantly iterated, updated, and upgraded quickly; We need to invest continuously for several years or even decades to meet user needs, reduce costs, and increase efficiency.
Robin Lee gave a different view on the industry's statement that "there is no barrier to the ability of big models": "Every time a new model is released, it should be compared with GPT-4o, saying that my score is almost the same as it, and even some individual scores have exceeded it, but this does not mean that there is no gap with the most advanced models."
He said that many models, in order to prove themselves, will go to the leaderboard after release, guessing test questions and answering skills. From the leaderboard, perhaps the models' abilities are already very close, "but in practical applications, there is still a significant gap in strength.
Robin Lee pointed out that the gap between models is multi-dimensional. The industry often focuses more on the gap in understanding, generation, logic, memory, and other abilities, but neglects dimensions such as cost and reasoning speed. Some models can achieve the same effect, but their cost is high and reasoning speed is slow, which is still not as good as advanced models.
Robin Lee also said that "before the era of big model, people were used to open source, which means free and low cost". He explained that, for example, open-source Linux is free to use because computers already exist. But these are not valid in the era of big models. Big model inference is expensive, and open-source models do not provide computing power. You have to buy your own equipment, which cannot achieve efficient utilization of computing power.
Open source models are not effective in terms of efficiency, "he said." Closed source models should be accurately called commercial models, which are machine resources and GPUs used by countless users to share research and development costs and inference. GPU usage efficiency is the highest, with Baidu Wenxin Big Model 3.5 and 4.0 having GPU usage rates of over 90%
Robin Lee believes that the open source model is valuable in teaching and scientific research; But in the business world, when pursuing efficiency, effectiveness, and lowest cost, open source models have no advantages.
At the application level of the big model, Robin Lee believes that Copilot is the first one to assist people; Next is the Agent intelligent agent, which has a certain degree of autonomy and can use tools, reflect, and evolve on its own; If this level of automation continues to develop, it will become an AI worker capable of independently completing various tasks.
He also stated that although "many people are optimistic about the development direction of intelligent agents, so far, intelligent agents are not a consensus, and there are not many companies like Baidu that regard intelligent agents as the most important strategy and development direction for large models.
Robin Lee believes that the threshold for agents is really low. Many people do not know how to turn a large model into an application. Agents are a very direct, efficient and simple way. It is very convenient to build agents on top of models.
CandyLake.com 系信息发布平台,仅提供信息存储空间服务。
声明:该文观点仅代表作者本人,本文不代表CandyLake.com立场,且不构成建议,请谨慎对待。
声明:该文观点仅代表作者本人,本文不代表CandyLake.com立场,且不构成建议,请谨慎对待。
猜你喜欢
- Alibaba CEO Wu Yongming: AI development requires a batch of open-source models of different scales and fields
- Baidu's Q3 core net profit increased by 17%, exceeding expectations. Wenxin's large model daily usage reached 1.5 billion
- The delivery fee pricing has been lowered to 6 yuan, and McDonald's has adjusted the McDonald's delivery fee model
- Ideal Automobile implements a limited time zero interest policy for all models for the first time
- OpenAI launches full health version of the o1 big model and $200 per month ChatGPT Pro
- Open source securities: AI leads the rapid development of the education industry
- OpenAI has Rocket again! Officially launched Sora, an AI video generation model
- Google releases its most powerful model to attack OpenAI, shifting focus to AI agents
- Challenge OpenAI, Google's new move! Significantly updated generative AI, launching video model VEO 2 and the latest version Imagen3
- Is it increasingly difficult to distinguish between truth and falsehood? Google launches new generation video generation model Veo 2
-
生成式人工知能(AI)が巻き起こす技術の波の中で、電力会社は意外にも資本市場の寵児になった。 今年のスタンダード500割株の上昇幅ランキングでは、Vistraなどの従来の電力会社が注目を集め、株価が2倍になってリ ...
- xifangczy
- 3 天前
- 支持
- 反对
- 回复
- 收藏
-
隔夜株式市場 世界の主要指数は金曜日に多くが下落し、最新のインフレデータが減速の兆しを示したおかげで、米株3大指数は大幅に回復し、いずれも1%超上昇した。 金曜日に発表されたデータによると、米国の11月のPC ...
- SNT
- 前天 12:48
- 支持
- 反对
- 回复
- 收藏
-
長年にわたって、昔の消金大手の捷信消金の再編がようやく地に着いた。 天津銀行の発表によると、同行は京東傘下の2社、対外貿易信託などと捷信消金再編に参加する。再編が完了すると、京東の持ち株比率は65%に達し ...
- SNT
- 前天 12:09
- 支持
- 反对
- 回复
- 收藏
-
グーグルは現地時間12月19日、新しい「推理」モデルとしてGemini 2.0 Flash Thinkingを発売すると発表した。紹介によると、このモデルはまだ実験段階であり、訓練を経た後、モデルが反応を起こした時に経験した「思 ...
- 地下水
- 3 天前
- 支持
- 反对
- 回复
- 收藏