Combining AI with iPhone? Apple's latest paper provides breakthrough solutions
米哈伊尔叔叔
发表于 2023-12-22 15:25:39
1301
0
0
Is the Apple GPT in your pocket? This may become a reality faster.
Apple artificial intelligence (AI) researchers recently published a paper on the preprint website arXiv, which mentioned an innovative "flash utilization" technology that can deploy large language models (LLMs) on iPhones and other memory limited Apple devices, which is almost a major breakthrough.
Memory constraints
LLM based chatbots (such as ChatGPT, Claude, etc.) rely heavily on data and memory, requiring a large amount of data to be processed simultaneously, often requiring a large amount of memory to run.
Therefore, running LLM is a challenge for devices such as iPhones with limited DRAM (generally referring to memory) capacity.
Usually, the standard method for computing data is to load the data from flash memory into DRAM, and then perform data inference in DRAM.
DRAM with high performance can increase data processing speed by millions of times, but the downside is its capacity. Running on DRAM severely limits the maximum model size that can be run.
To address this issue, Apple researchers have developed a new technology that uses larger capacity flash memory to store data from artificial intelligence models, which can then be transferred to DRAM memory for processing when needed.
Storing AI on flash memory
In a new research paper titled "LLM in Flash: Efficient Large Language Model Reasoning in Limited Memory", the author points out that flash memory in mobile devices is more abundant than traditional memory used to run LLM.
This method cleverly bypasses capacity limitations. The paper proposes two key technologies to minimize data transmission and maximize flash processing capabilities:
One of them is called "windowing" technology, which is equivalent to a recycling method. AI models do not need to load new data every time, but instead reuse some already processed data. This reduces the need for continuous memory acquisition, making the process faster and smoother.
The second is called "Row Column Bundling" technology. This technology is achieved by grouping data more effectively, that is, by setting the order of accessing data blocks based on the data characteristics of flash memory, which can read data from flash memory faster and accelerate the ability of artificial intelligence to understand and generate language.
According to this paper, the combination of these methods enables the running capacity of artificial intelligence models to reach twice the available memory of iPhones. This means that under this method, the inference speed in the CPU has increased by 4-5 times compared to traditional loading methods, and the inference speed in the GPU has increased by an astonishing 20-25 times.
The author of the paper wrote, "This breakthrough is particularly important for deploying advanced LLMs in resource limited environments, thereby expanding their applicability and accessibility."
Apple's AI Strategy
The breakthrough in artificial intelligence efficiency has opened up new possibilities for future iPhones, such as more advanced Siri features, real-time language translation, complex AI driven photography, and augmented reality capabilities.
The new technology in the paper also lays the foundation for iPhone to run complex artificial intelligence assistants and chatbots on devices, and it is said that Apple is already developing this technology.
Apple's work in generative artificial intelligence may eventually be integrated into its voice assistant Siri. Apple introduced its large-scale language model work to employees at the February Artificial Intelligence Summit this year. According to previous media reports, Apple's goal is to launch an intelligent version of Siri that is deeply integrated with artificial intelligence.
There are also rumors that Apple plans to add artificial intelligence to as many Apple applications as possible.
In addition, according to reports, Apple is also developing its own generative artificial intelligence model, "Ajax," which runs on 200 billion parameters to compete with OpenAI's GPT-4 model.
Internally known as "Apple GPT," Ajax aims to unify the entire Apple machine learning development, highlighting Apple's broader strategy of integrating artificial intelligence deeper into the Apple ecosystem.
According to the latest report, Ajax is considered more powerful than the early ChatGPT 3.5. However, the new model GPT-4 launched by OpenAI in September 2023 may have surpassed the capabilities of Ajax.
Fruit Chain analyst Jeff Pu has pointed out that Apple will launch some kind of generative artificial intelligence feature on iPhones and iPads around the end of 2024, which will be included in iOS 18. Pu also stated that Apple will build hundreds of artificial intelligence servers in 2023, and there will be more in 2024.
CandyLake.com 系信息发布平台,仅提供信息存储空间服务。
声明:该文观点仅代表作者本人,本文不代表CandyLake.com立场,且不构成建议,请谨慎对待。
声明:该文观点仅代表作者本人,本文不代表CandyLake.com立场,且不构成建议,请谨慎对待。
猜你喜欢
- Apple and Real Madrid discuss' Immersive Vision Pro Live Streaming 'in an attempt to save the content scarce VR industry
- News reports that former Apple CFO will be appointed as new CEO of Stellantis, denied by Stellantis
- IDC: Apple misses the big rebound in the smartphone market in 2024
- Apple CEO Cook attends Chain Expo in China: Without Chinese partners, Apple cannot achieve today's success
- Apple announces expansion of Apple retail business in Saudi Arabia
- Indian regulators reject Apple's request to shelve antitrust report
- Apple is going to debut! TSMC announces that 2nm is ready for use
- The 'car race' in the XR market among Apple, Google, and Samsung
- It is reported that Apple plans to launch a foldable iPad in 2028
- Sources: Apple plans to launch thinner iPhones and foldable phones
-
生成式人工知能(AI)が巻き起こす技術の波の中で、電力会社は意外にも資本市場の寵児になった。 今年のスタンダード500割株の上昇幅ランキングでは、Vistraなどの従来の電力会社が注目を集め、株価が2倍になってリ ...
- xifangczy
- 3 天前
- 支持
- 反对
- 回复
- 收藏
-
隔夜株式市場 世界の主要指数は金曜日に多くが下落し、最新のインフレデータが減速の兆しを示したおかげで、米株3大指数は大幅に回復し、いずれも1%超上昇した。 金曜日に発表されたデータによると、米国の11月のPC ...
- SNT
- 前天 12:48
- 支持
- 反对
- 回复
- 收藏
-
長年にわたって、昔の消金大手の捷信消金の再編がようやく地に着いた。 天津銀行の発表によると、同行は京東傘下の2社、対外貿易信託などと捷信消金再編に参加する。再編が完了すると、京東の持ち株比率は65%に達し ...
- SNT
- 前天 12:09
- 支持
- 反对
- 回复
- 收藏
-
グーグルは現地時間12月19日、新しい「推理」モデルとしてGemini 2.0 Flash Thinkingを発売すると発表した。紹介によると、このモデルはまだ実験段階であり、訓練を経た後、モデルが反応を起こした時に経験した「思 ...
- 地下水
- 3 天前
- 支持
- 反对
- 回复
- 收藏