Alibaba Releases Qwen2, Outperforms Llama 3 on Several Benchmarks

In a significant leap for open source AI, Alibaba’s Qwen team has announced the release of Qwen2, an advanced version of its mother of LLMs, Qwen1.5.

Qwen2 introduces five new models—Qwen2-0.5B, Qwen2-1.5B, Qwen2-7B, Qwen2-57B-A14B, and Qwen2-72B—each optimized for state-of-the-art performance across a variety of benchmarks.

Click here to check out the model on Hugging Face.

These models offer substantial improvements, including training on data from 27 additional languages beyond English and Chinese, including Hindi, Bengali, and Urdu. This multilingual training enhances Qwen2’s capabilities in diverse linguistic contexts, addressing common issues like code-switching with greater proficiency.

Qwen2 also excels in coding and mathematics, with significantly improved performance in these areas.

A standout feature of Qwen2 is its extended context length support, with Qwen2-7B-Instruct and Qwen2-72B-Instruct models capable of handling up to 128K tokens. This makes them particularly adept at processing and understanding long text sequences.

Qwen2’s release includes various technical enhancements such as Group Query Attention (GQA) for faster speed and reduced memory usage, and optimized embeddings for smaller models.

Performance evaluations show that Qwen2-72B, the largest model in the series, outperforms leading competitors like Llama-3-70B in natural language understanding, coding proficiency, mathematical skills, and multilingual abilities.

Despite having fewer parameters, Qwen2-72B surpasses its predecessor, Qwen1.5-110B, demonstrating the effectiveness of the new training methodologies.

Safety and responsibility remain a priority, with Qwen2-72B-Instruct performing comparably to GPT-4 in terms of safety across various categories of harmful queries. The model exhibits significantly lower proportions of harmful responses compared to other large models.

The Qwen2 models, licensed under Apache 2.0 and Qianwen License for different versions, are set to accelerate the application and commercial use of AI technologies worldwide. Future plans include training larger models and extending Qwen2 to multimodal capabilities, integrating vision and audio understanding.

The post Alibaba Releases Qwen2, Outperforms Llama 3 on Several Benchmarks appeared first on AIM.

Alibaba Releases Qwen2, Outperforms Llama 3 on Several Benchmarks

Trending Articles

Practice Sheet of Right form of verbs for HSC Students

Download: FK ft Shenky – Nakuyewa ”Prod by: Shenky”

How to win at Markstrat (Markstrat Tips and Tricks) – Vodites

Ominde Commission Report and Recommendations – Ominde Report of 1964

Bureau of Internal Revenue: Regional Offices (Directory)

GO 53 on Enhancement of Ex-gratia upto 5 Lakhs Toddy Tappers in Telangana

Cakewalk CA-2A Leveling Amplifier v2.0.1.97 WiN, v2.0.1.96 OSX Incl Keygen

Mp3 Download: Mdu - Kunjenjenjena

How the kill the job , when DTP request running for long hours.

Microsoft Intune から展開しているアプリのアップデートについて

18-year-old girl was beaten for half an hour by two Northampton men in 'an...

Car crash in Dunton Bassett leaves driver in critical condition

Macky 2, Two Others In Road Accident

Application log 00000000000000089514: Could not convert queue DLVST90CLNT

Detroit mafia: D’Anna Brothers agree to plea deal

Delivery block field greyed out using VA02

Muloraki Au

【個人撮影】スマホのプライベート映像♪「中に出さないで///」カラオケ屋での生ハメ撮りが流出ｗ【リベンジポルノ】＠PornHub

BREAKING NEWS: Diamond Platnumz Is Reported Dead After Ghastly Car Accident

FIAT 500 B0111 B0112