Artificial Intelligence thread

tphuang · Aug 6, 2024

ChatGLM's CogVideoX is now open-sourced and can run on your home computer if you have a 4090.

https://twitter.com/i/web/status/1820872390802354396

9dashline · Aug 6, 2024

tphuang said:
Bytedance unveils Jimeng AI app for video generation.

https://twitter.com/i/web/status/1820751135130464562

here are some videos already


Does this require a Chinese account number, or is it open to all?

tphuang · Aug 7, 2024

Beijing energy group signing contract with Huawei to utilize their Ascend resources for its needs


tokenanalyst · Aug 7, 2024

A first in China! Zhipu's GLM on-device large model received the highest rating in CAICT's trusted AI on-device large model evaluation

Zhipu's GLM on-device large model took part in the first round of evaluation under the "On-device Large Model Technology and Application Evaluation Method" standard organized by the China Academy of Information and Communications Technology (CAICT), and obtained a level 4 rating. Zhipu became the first large model company in China to pass this evaluation and obtain the current highest rating.

GLM is a series of large models launched by Zhipu for on-device scenarios such as PCs, mobile phones, and in-vehicle computers. It comes in 1.5B, 6B, 9B and other sizes, and adapts flexibly to various device application scenarios and computing power conditions, enabling AI applications on smart devices and improving the user experience. Built on advanced pre-training technology and a large amount of high-quality data, GLM supports text generation, dialogue, web browsing, code execution, function calling, and multi-modal interaction.

At present, the GLM on-device large model has been adapted to and performance-optimized for many mainstream device chips, and supports GPU and NPU inference. It has also been adapted to AI PCs, AI mobile phones, smart cockpits and other devices from multiple brands, covering scenarios such as complex device control, device usage Q&A, chat companionship, writing assistance, document processing, local knowledge bases, life services, and multi-modal interaction. Through device-cloud integration, the GLM on-device large model can also collaborate seamlessly with cloud-side large models to perform multi-scenario, high-complexity tasks and provide a consistent experience.


sunnymaxi · Aug 10, 2024

Alibaba launches maths-specific AI models said to outperform LLMs from OpenAI, Google

[Alibaba] is aiming to raise the bar in [artificial intelligence] (AI) development by launching a group of maths-specific large language models (LLMs) called Qwen2-Math, which the […] giant claims can outperform the capabilities of […] in that field.

“Over the past year, we have dedicated significant efforts to researching and enhancing the reasoning capabilities of large language models, with a particular focus on their ability to solve arithmetic and mathematical problems,” the Qwen team, part of […], said in a post published on developer platform GitHub on Thursday. Alibaba owns the South China Morning Post.

The latest LLMs – the technology underpinning […] services like […] – were built on the Qwen2 LLMs released by Alibaba in June and cover three models based on their scale of parameters – a machine-learning term for variables present in an AI system during training, which help establish how data prompts yield the desired output.

The model with the largest parameter count, Qwen2-Math-72B-Instruct, outperformed proprietary US-developed LLMs in maths benchmarks, according to the Qwen team’s post. Those included GPT-4o, [Anthropic]’s Claude 3.5 Sonnet, [Gemini] 1.5 Pro and [Meta]’s [Llama]-3.1-405B.

“We hope that Qwen2-Math can contribute to the community for solving complex mathematical problems,” the post said.


GulfLander · Aug 11, 2024

https://twitter.com/i/web/status/1822080423654257006

9dashline · Aug 11, 2024

GulfLander said:
https://twitter.com/i/web/status/1822080423654257006

Yup, every new OS needs to be natively multimodal AI... Microsoft is doing the same with Windows 12

siegecrossbow · Aug 13, 2024

tphuang · Aug 16, 2024

https://twitter.com/i/web/status/1824382819411562999

It will be very interesting to see how WeChat and Alibaba deploy AI functions in their apps.

siegecrossbow · Aug 16, 2024

