Artificial Intelligence thread

tphuang

Lieutenant General
Staff member
Super Moderator
VIP Professional
Registered Member
Baidu's latest flagship model Wenxin 4.0 Turbo fine-tuning service is launched

View attachment 134544


On August 21, Baidu Smart Cloud announced the launch of the Wenxin flagship large model ERNIE 4.0 Turbo fine-tuning service to help companies use their own business data to train large models that are more suitable for corporate application scenarios, greatly improving the effectiveness of the model in business use. From now on, corporate users can log in to the Baidu Smart Cloud official website to apply for the experience.

It is understood that Baidu Smart Cloud Qianfan Platform has previously supported ERNIE 3.5, ERNIE Speed, ERNIE Lite, ERNIE Tiny, and ERNIE Character for model fine-tuning. As of now, a total of 6 Wenxin large models can be fine-tuned and used on the Qianfan platform. A total of 21,000 models have been fine-tuned, serving the core business scenarios of more than a thousand companies, and have many successful cases.

Although the general big model has powerful understanding, generation, logic and memory capabilities, as an "undergraduate" with strong general knowledge, it often cannot fully meet the needs of enterprises in actual applications, such as the particularity of the industry and scenarios of the enterprise, the customized needs of content generation, data privacy and security, etc.

SFT (Supervised Fine-Tuning) is the main method for fine-tuning large models. It builds input and output for specific tasks to align the model's performance on the task with the capabilities of professionals. For example, if a model already has knowledge in the financial field, SFT can teach it the basic logic and steps of research report analysis, so that the model has better capabilities in completing the specific task of "research report analysis".

In addition, opening up large model fine-tuning services is also an important manifestation of large model manufacturers in ensuring customer data security, privacy security, and model controllability.

From the perspective of the global market, OpenAI recently officially launched the fine-tuning service for its flagship large model GPT-4o. Industry commentators said that this is an important strategic move for OpenAI to actively respond to the needs of B-side users, enhance its differentiated advantages over competitors such as Google, Meta, and Anthropic, and increase its investment in the toB track.

Among the mainstream large model manufacturers in China, except Baidu Smart Cloud, other manufacturers have hardly opened fine-tuning services for any flagship models. At present, Baidu Smart Cloud has provided 6 Wenxin large models including ERNIE 4.0 Turbo and ERNIE 3.5, and has fine-tuned 21,000 models in total, serving the core business scenarios of more than 1,000 enterprises.
please post a link to your articles and do a small summary. Doesn't have to be long, just 1 sentence is good.
 

tphuang

Lieutenant General
Staff member
Super Moderator
VIP Professional
Registered Member
for the upcoming china international trade conference, they are about to display all sort of AI, robotics and computation stuff in this Beijing conference

Please, Log in or Register to view URLs content!

China has over 100 1B class parameter models.

Zhipu AI will showcase GLM-4

China Unicom's model used in 10 industries.

China mobile to showcase it's AI, big data and security products as well as various robotics.
 

phrozenflame

Junior Member
Registered Member
Not sure if this is the right place to ask: Any good offline Chinese LLM with a amazing RAG? [English supported]

Tried Llama and Mistral. They start to get wonky really fast.
 

tphuang

Lieutenant General
Staff member
Super Moderator
VIP Professional
Registered Member
Not sure if this is the right place to ask: Any good offline Chinese LLM with a amazing RAG? [English supported]

Tried Llama and Mistral. They start to get wonky really fast.
offline Chinese LLM? you can try Qwen on huggingface. I don't know what you mean by offline LLM though.

As for RAG, isn't that something you have to develop?
 
Top