The first hundred billion parameter model from Tongyi Qianwen has arrived

Katlyn30590 发表于 2024-4-29 16:07:37

1225 0 0

According to the news on April 28 on the "Alibaba Tongyi Qianwen" WeChat official account, Tongyi Qianwen launched the first 100 billion level parameter model Qwen1.5-110B. It is reported that the basic capabilities of Qwen1.5-110B are comparable to Meta-Llama-3-70B, making it the largest model in the Qwen1.5 series and the first model in the series to have over 100 billion parameters.
According to the evaluation of the research team, the results showed that the Qwen1.5-110B model performed the best among the three benchmark tests such as MMLU, GSM8K, MATH, and HumanEval. In evaluations such as TheoremQA, ARC-C, and MBPP, the Qwen1.5-110B model performed better than Llama-3-70B.

"Alibaba Tongyi Qianwen" WeChat official account

In addition, according to the evaluation of the Chat model by the research team, the performance of the Qwen1.5-110B Chat model on MT Bench and AlpacaEval 2.0 was compared. The results show that compared with the previously released Qwem1.5-72B-Chat model, the Qwen1.5-110B-Chat model performs significantly better.

"Alibaba Tongyi Qianwen" WeChat official account

Since the beginning of this year, the team from Tongyi Qianwen has launched the latest open-source model series Qwen1.5, and subsequently launched eight large language models in less than three months. The previous model parameter sizes covered 500 million, 1.8 billion, 4 billion, 7 billion, 14 billion, 32 billion, and 72 billion, while the parameter sizes of Qwen1.5-110B reached 110 billion. It is reported that the current download volume of the Tongyi Qianwen open-source model exceeds 7 million.

The first hundred billion parameter model from Tongyi Qianwen has arrived

蔚来が第2四半期の財政報告を発表した売上高は174億5000万元、米株の当日の売上高は14%超上昇した

小鵬MONA M 03は肇慶高新区で量産され、上場48時間で大定破3万大旺智造爆金が頻出した

FRBは引きずるな！小摩経済学者も態度を転換：9月には大きな動きをしなければならない

クアルコムCEO：サムスンとグーグルと協力してハイブリッド現実眼鏡を開発中