Daily Mail PH

Wednesday, November 15, 2023

[New post] New Open Source AI Model from China Boasts Twice the Capacity of ChatGPT

Site logo image Crypto Breaking News posted: "An artificial intelligence (AI) model developed in China is making waves on a number of fronts, including its open-source nature and for its ability to handle up to 200,000 tokens of context—vastly exceeding other popular models like Anthropic's Claude (1" Crypto Breaking News

New Open Source AI Model from China Boasts Twice the Capacity of ChatGPT

Crypto Breaking News

Nov 15

An artificial intelligence (AI) model developed in China is making waves on a number of fronts, including its open-source nature and for its ability to handle up to 200,000 tokens of context—vastly exceeding other popular models like Anthropic's Claude (100,000 tokens) or OpenAI's GPT-4 Turbo (128,000 tokens).

Dubbed the Yi series, Beijing Lingyi Wanwu Information Technology Company created this progressive generative chatbot in its AI lab, 01.AI. The large language model (LLM) comes in two versions: the lightweight Yi-6B-200K and the more robust Yi-34B-200K, both capable of retaining immense conversational context and able to understand English and Mandarin.

Just hours after its release, the Yi model rocketed up the charts to become the second most popular open-source model on Hugging Face, a key repository for AI models.

Hugging Face Ai Models Ranking
Image: Hugging Face

Even though the Yi models handle huge context prompts, they are also very efficient and accurate, beating other LLMs in several synthetic benchmarks.

"Yi-34B outperforms much larger models like LLaMA2-70B and Falcon-180B; also Yi-34B's size can support applications cost-effectively, thereby enabling developers to build fantastic projects," explains 01.AI on its website. According to a scoreboard shared by the developers, the most powerful Yi model showed strong performance in reading comprehension, common-sense reasoning, and common AI tests like Gaokao and C-eval.

Large Language Models (LLMs) like the Yi Series operate by analyzing and generating language-based outputs. They work by processing "tokens," or units of text, which can be as small as a word or a part of a word.

To say "200K tokens of context" effectively means the model can understand and respond to significantly longer prompts, which previously would have overwhelmed even the most advanced LLMs. The Yi Series can handle extensive prompts that include more complex and detailed information without crashing.

A recent third-party analysis, however, points out a limitation in this area. When a prompt occupies more than 65% of the Yi model's capacity, it can struggle to retrieve accurate information. Despite this, if the size of the prompt is kept well below this threshold, the Yi Series Model performs admirably, even in scenarios that cause degradation in models like Claude and ChatGPT.

Pressure Testing GPT-4-128K With Long Context Recall

128K tokens of context is awesome - but what's performance like?

I wanted to find out so I did a "needle in a haystack" analysis

Some expected (and unexpected) results

Here's what I found:

Findings:
* GPT-4's recall… pic.twitter.com/nHMokmfhW5

— Greg Kamradt (@GregKamradt) November 8, 2023

A key differentiator for Yi is that it is fully open source, allowing users to run Yi locally on their own systems. This grants them greater control, the ability to modify the model architecture, and avoids reliance on external servers.

"We predict that AI 2.0 will create a platform opportunity ten times larger than the mobile internet, rewriting all software and user interfaces," 01.AI states. "This trend will give rise to the next wave of AI-first applications and AI-empowered business models, fostering AI 2.0 innovations over time."

By open-sourcing such a capable model, 01.AI empowers developers worldwide to build the next generation of AI. With immense context handling in a customizable package, we can expect a torrent of innovative applications utilizing Yi.

The potential is sky-high for open-source models like Yi-6B-200K and Yi-34B-200K. As AI permeates our lives, locally run systems promise greater transparency, security, and customizability compared to closed alternatives dependent on the cloud.

While Claude and GPT-4 Turbo grab headlines, this new open-source alternative may soon build AI's next stage right on users' devices. Just when it seemed like there were no remaining ways to upgrade our hardware, it might be time to shop for a more capable device before you find your local AI outclassed by a more "context-aware" competitor.

Stay on top of crypto news, get daily updates in your inbox.

Source: Decrypt.co


Manage your email settings or unsubscribe.

Trouble clicking? Copy and paste this URL into your browser:
https://www.cryptobreaking.com/new-open-source-ai-model-from-china-boasts-twice-the-capacity-of-chatgpt/

WordPress.com and Jetpack Logos

Get the Jetpack app to use Reader anywhere, anytime

Follow your favorite sites, save posts to read later, and get real-time notifications for likes and comments.

Download Jetpack on Google Play Download Jetpack from the App Store
WordPress.com on Twitter WordPress.com on Facebook WordPress.com on Instagram WordPress.com on YouTube
WordPress.com Logo and Wordmark title=

Automattic, Inc. - 60 29th St. #343, San Francisco, CA 94110  

at November 15, 2023
Email ThisBlogThis!Share to XShare to FacebookShare to Pinterest

No comments:

Post a Comment

Newer Post Older Post Home
Subscribe to: Post Comments (Atom)

Capping off 2025 with new Gen Z report, big team announcement – The Nerve

We have a couple of big announcements to cap the year   17 December 2025 View in Browser     Dear reader,    We have a couple of big ann...

  • [New post] Tuesday’s politics thread is trying to stay positive.
    SheleetaHam posted: " Even though I just finished the latest Opening Arguments podcast about how Roe v. Wade is toast, and ...
  • [New post] Achieve Data Sovereignty through Omnisphere
    Crypto Breaking News posted: "Web 3.0 is one of the biggest buzzwords flying around the world of social media this year. An...
  • [New post] Is XRP going to take the Crypto market by storm
    admin posted: "Is XRP going to take the Crypto market by storm While the SEC has been going after Ripple in court the XRP b...

Search This Blog

  • Home

About Me

Daily Newsletters PH
View my complete profile

Report Abuse

Labels

  • Last Minute Online News

Blog Archive

  • December 2025 (7)
  • November 2025 (4)
  • October 2025 (2)
  • September 2025 (1)
  • August 2025 (2)
  • July 2025 (5)
  • June 2025 (3)
  • May 2025 (2)
  • April 2025 (2)
  • February 2025 (2)
  • December 2024 (1)
  • October 2024 (2)
  • September 2024 (1459)
  • August 2024 (1360)
  • July 2024 (1614)
  • June 2024 (1394)
  • May 2024 (1376)
  • April 2024 (1440)
  • March 2024 (1688)
  • February 2024 (2833)
  • January 2024 (3130)
  • December 2023 (3057)
  • November 2023 (2826)
  • October 2023 (2228)
  • September 2023 (2118)
  • August 2023 (2611)
  • July 2023 (2736)
  • June 2023 (2844)
  • May 2023 (2749)
  • April 2023 (2407)
  • March 2023 (2810)
  • February 2023 (2508)
  • January 2023 (3052)
  • December 2022 (2844)
  • November 2022 (2673)
  • October 2022 (2196)
  • September 2022 (1973)
  • August 2022 (2306)
  • July 2022 (2294)
  • June 2022 (2363)
  • May 2022 (2299)
  • April 2022 (2233)
  • March 2022 (1993)
  • February 2022 (1358)
  • January 2022 (1323)
  • December 2021 (2064)
  • November 2021 (3141)
  • October 2021 (3240)
  • September 2021 (3135)
  • August 2021 (1782)
  • May 2021 (136)
  • April 2021 (294)
Simple theme. Powered by Blogger.