this post was submitted on 20 Sep 2024
63 points (92.0% liked)

Technology

35117 readers
132 users here now

This is the official technology community of Lemmy.ml for all news related to creation and use of technology, and to facilitate civil, meaningful discussion around it.


Ask in DM before posting product reviews or ads. All such posts otherwise are subject to removal.


Rules:

1: All Lemmy rules apply

2: Do not post low effort posts

3: NEVER post naziped*gore stuff

4: Always post article URLs or their archived version URLs as sources, NOT screenshots. Help the blind users.

5: personal rants of Big Tech CEOs like Elon Musk are unwelcome (does not include posts about their companies affecting wide range of people)

6: no advertisement posts unless verified as legitimate and non-exploitative/non-consumerist

7: crypto related posts, unless essential, are disallowed

founded 5 years ago
MODERATORS
 

Alibaba Cloud, the cloud computing arm of China’s Alibaba Group Ltd., today announced the release of more than 100 new artificial intelligence large language models open source as part of the Qwen 2.5 family of models.

Revealed at the company’s Apsara Conference, the new model series follows the release of the company’s foundation model Tongyi Qianwen, or Qwen, last year. Since then, the Qwen models have been downloaded more than 40 million times across platforms such as Hugging Face and Modelscope.

The new models range from sizes as small as a half-billion parameters to as large as 72 billion parameters. In an LLM, parameters define the behavior of an AI model and what it uses to make predictions about its skills such as mathematics, coding or expert knowledge.

Smaller, more lightweight models can be trained quickly using far less processing power on more focused training sets and excel at simpler tasks. In contrast, larger models need heavy processing power and longer training times and generally perform better on complex tasks requiring deep language understanding.

all 8 comments
sorted by: hot top controversial new old
[–] [email protected] 23 points 3 months ago (1 children)

A freely available and unencumbered binary (e.g., the model weights) isn't the same thing as open-source. The source is the data. You can't rebuild the model without the data, nor can you verify that it wasn't intentionally biased or crippled.

[–] [email protected] -3 points 3 months ago (1 children)

That's not how the license works.

[–] [email protected] 8 points 3 months ago (1 children)

"Open source" is not a license, it's a description. Things can be free with no license restrictions and still not be "open source".

[–] [email protected] -3 points 3 months ago* (last edited 3 months ago) (1 children)

The OSS movement was founded on a license. You can't separate open source from its licenses. They are intrinsically linked.

[–] [email protected] 3 points 3 months ago* (last edited 3 months ago)

A license that requires source. And since then there have been many different licenses, all with the same requirement. Giving someone a binary for free and saying they're allowed to edit the hex codes and redistribute it doesn't mean it's open source. A license to use and modify is necessary but not sufficient for something to be open source. You need to provide the source.