JD technical leader: Large models will become smaller and even finer down to the scene
四夜父脚群
发表于 2024-7-31 19:01:30
1229
0
0
General big models rely on computing power to build, while enterprise big models rely on business to run out
On July 30th, at the JD Cloud Summit held in Shanghai, Cao Peng, Chairman of the Technical Committee of JD Group and President of JD Cloud Business Unit, expressed the above views. According to his understanding, for large models, data is nourishment and scenarios are training grounds.
Over the past year, there has been a sustained craze for big models, and the industry has experienced a 'thousand model war'. According to statistics from the China Academy of Information and Communications Technology, there are currently over 1000 basic large-scale models worldwide, with China accounting for 35% of the global total.
Although the performance of basic models is constantly improving, in the personal user end, large models have not yet achieved true super applications. Instead, in many enterprise scenarios, they have gradually been deployed based on applications.
At the summit, JD Cloud showcased the latest practices of JD Yanxi's big model landing industry and released eight products including JD Cloud Enterprise Big Model Service, Yanxi Intelligent Agent Platform, Intelligent Programming Assistant JoyCoder, and Yanxi Digital Person 3.0.
According to data provided by JD.com, as of now, JD's big model has been implemented in over a hundred scenarios, covering different industries such as healthcare, e-commerce live streaming, logistics, and finance. Many of JD's own delivery personnel, merchants, doctors, procurement and sales operations, and R&D personnel have received support from the big model application.
For example, the "Jingyi Qianxun" service that serves medical scenarios, according to the head of JD Health Intelligent Algorithm Department, currently has four different sized models internally. One is a small model of about 2b, which provides a single service in a narrow domain. The team envisions that it can even be used on mobile phones in the future; The second is a medium-sized model with 14b and 22B as the core, which completes some medical consulting and service support work; Finally, there is a large model centered around 80s that specializes in serving complex medical decision-making and reasoning abilities.
The above model supports private deployment, even integrated deployment, which is related to industry characteristics. "It is difficult for the medical industry to accept a completely cloud based model, and few hospitals can accept this breakthrough," said the person in charge.
According to its introduction, in actual hospital implementation scenarios, Beijing Medical Qianxun will pay more attention to independently completing patient services in compliance, including triage, pre consultation, registration, appointment, accompanying consultations during consultations, and post consultation health management.
On the first day of GPT's release, everyone thought about the natural conversational ability and so-called anthropomorphic ability of this generation. From this perspective, whether it can better become a doctor's assistant is more valuable than becoming a diagnostic tool for doctors, "the person in charge emphasized.
In the beauty scene, unlike pure live streaming in the past, JD.com is currently attempting to combine digital person makeup testing with digital person anchors internally; In terms of footwear and clothing scenes, there will be a scene where digital people live stream in the front and hosts change their outfits in the back. The live streaming style based on specific category attributes will be transferred to digital people.
When it comes to the development trend of large models, several technical leaders from JD.com have stated that large models will become smaller and smaller. Vertical large models are a relatively certain direction, and can even be further refined to scene large models. The inherent logic is that large models need to adapt to scenarios and industries, so they cannot be too large.
He Xiaodong, Dean of JD Exploration Research Institute and Head of JD Technology's Artificial Intelligence Business, believes that due to limitations in data and computing power, simply increasing the scale of the model may quickly reach the development ceiling, resulting in the economic benefits generated by the large model being insufficient to support its own costs, making it difficult to sustain.
The large-scale models are growing at a rate of 10 times per year, with parameters ranging from billions to trillions. However, commercialization is currently lagging behind and will eventually become a problem in the medium to long term. He also pointed out that the illusion rate of many models is still high, which cannot provide solid guarantees for future industrial applications.
According to He Xiaodong, JD.com starts from the initial strategy model in terms of model self evolution. Firstly, it constructs an initial preference dataset, and then uses a pre trained reward model to score each answer. Based on the high or low score, it constructs new preference data, which will greatly promote model iteration and updates.
In terms of model inference, the cost of big language model inference is currently skyrocketing. Therefore, JD.com has improved model construction efficiency through end-to-end, low bit, high-precision quantization technology, reducing model size and enhancing inference performance without affecting model output accuracy and parameter quantity. He Xiaodong said that his current technical solution has saved 70% of the model's video memory.
When it comes to the large-scale model of enterprise implementation, Cao Peng believes that there are three key points. Firstly, simplicity is crucial. The diversity and fragmentation of scenarios cannot sustain high development costs, and it is necessary to minimize the threshold for using large models in order to cover more applications. Next is openness, based on an open Agent ecosystem, large model ecosystem, and cloud native ecosystem, giving customers the right to choose. The third is security, providing data security and privacy protection, AIGC content compliance, corpus data security management, making enterprise big model services trustworthy and reliable.
CandyLake.com 系信息发布平台,仅提供信息存储空间服务。
声明:该文观点仅代表作者本人,本文不代表CandyLake.com立场,且不构成建议,请谨慎对待。
声明:该文观点仅代表作者本人,本文不代表CandyLake.com立场,且不构成建议,请谨慎对待。
猜你喜欢
- JD Logistics fully integrates with Taobao and Tmall
- JD Logistics starts serving Taobao and Tmall merchants
- JD Finance is rumored to be involved in a "run on the bank". The company responds that inciting a "run on the bank" is suspected of being illegal and disrupting the order of the financial market
- Latest JD Financial Responds to 'Run on Bank' Rumors: Rumors Are Purely Lying on Gun and Not Having Time to Watch Talk Shows! JD.com also responded that there are no plans to collaborate with related talk show actors in the future
- JD apologizes! There are no plans to collaborate with related talk show actors in the future!
- JD Express International launches delivery service to 7 Southeast Asian countries
- On the evening of October 31st at 8pm, JD's "11 · 11" officially kicked off with doubled subsidies
- JD.com officially doubles subsidy for 11.11 opening ceremony
- JD Seven Fresh responds to price war rumors: no one targeted, just offering low prices
- JD Seven Fresh reduces prices, Meituan Xiaoxiang follows the trend of instant retail and the smoke of gunpowder rises again
-
【英偉達の需要が高すぎる?SKハイニックス:黄仁勲がHBM 4チップの6カ月前納入を要求!】SKハイニックスの崔泰源(チェ・テウォン)会長は月曜日、インビダーの黄仁勲(ファン・インフン)CEOが同社の次世代高帯域 ...
- 琳271
- 前天 17:54
- 支持
- 反对
- 回复
- 收藏
-
ファイザーが前立腺がんを治療する革新薬テゼナ& ;reg;(TALZENNA®,一般名:トルエンスルホン酸タラゾールパーリカプセル)は2024年10月29日に国家薬品監督管理局(NMPA)の承認を得て、HRR遺伝子突然変異 ...
- 什么大师特
- 昨天 17:41
- 支持
- 反对
- 回复
- 收藏
-
南方財経は11月5日、中央テレビのニュースによると、現地時間11月5日、米ボーイング社のストライキ労働者が59%の投票結果で新たな賃金協定を受け入れ、7週間にわたるストライキを終えた。ストライキ労働者は11月12 ...
- Dubssgshbsbdhd
- 昨天 16:27
- 支持
- 反对
- 回复
- 收藏
-
【マスクはテスラが携帯電話を作ることに応えた:作れるが作らないアップルとグーグルが悪さをしない限り】現地時間11月5日、有名ポッドキャストのジョローガン氏のインタビューに応じ、「携帯電話を作るのは私たち ...
- 波大老师
- 昨天 14:41
- 支持
- 反对
- 回复
- 收藏