{"id":590032,"date":"2025-04-07T08:00:40","date_gmt":"2025-04-07T06:00:40","guid":{"rendered":"https:\/\/mybroadband.co.za\/news\/?p=590032"},"modified":"2025-04-07T08:03:08","modified_gmt":"2025-04-07T06:03:08","slug":"chinese-chatgpt-competitor-developing-self-improving-ai-models","status":"publish","type":"post","link":"https:\/\/mybroadband.co.za\/news\/ai\/590032-chinese-chatgpt-competitor-developing-self-improving-ai-models.html","title":{"rendered":"Chinese ChatGPT competitor developing self-improving AI models"},"content":{"rendered":"\n<p>DeepSeek is working with Tsinghua University on reducing the training its AI models need in an effort to lower operational costs.<\/p>\n\n\n\n<p>The Chinese startup, which roiled markets with its low-cost reasoning model that emerged in January, collaborated with researchers from the Beijing institution on&nbsp;<a href=\"https:\/\/arxiv.org\/pdf\/2504.02495\" target=\"_blank\" rel=\"noreferrer noopener\">a paper<\/a>&nbsp;detailing a novel approach to reinforcement learning to make models more efficient.<\/p>\n\n\n\n<p>The new method aims to help artificial intelligence models better adhere to human preferences by offering rewards for more accurate and understandable responses, the researchers wrote. <\/p>\n\n\n\n<p>Reinforcement learning has proven effective in speeding up AI tasks in narrow applications and spheres. <\/p>\n\n\n\n<p>However, expanding it to more general applications has proven challenging \u2014 and that\u2019s the problem that DeepSeek\u2019s team is trying to solve with something it calls self-principled critique tuning. <\/p>\n\n\n\n<p>The strategy outperformed existing methods and models on various benchmarks and the result showed better performance with fewer computing resources, according to the paper.<\/p>\n\n\n\n<p>DeepSeek is calling these new models DeepSeek-GRM \u2014 short for \u201cgeneralist reward modeling\u201d \u2014 and will release them on an open source basis, the company said. <\/p>\n\n\n\n<p>Other AI developers, including Chinese tech giant Alibaba Group Holding and San Francisco-based OpenAI, are also pushing into a new frontier of improving reasoning and self-refining capabilities while an AI model is performing tasks in real-time.<\/p>\n\n\n\n<p>Menlo Park, California-based Meta Platforms released its latest family of AI models,\u00a0<a href=\"https:\/\/www.bloomberg.com\/news\/articles\/2025-04-06\/meta-announces-release-of-new-llama-4-ai-models\" target=\"_blank\" rel=\"noreferrer noopener\">Llama 4<\/a>, over the weekend and marked them as its first to use the Mixture of Experts (MoE) architecture. <\/p>\n\n\n\n<p>DeepSeek\u2019s models rely significantly on MoE to make more efficient use of resources, and Meta benchmarked its new release against the Hangzhou-based startup. <\/p>\n\n\n\n<p>DeepSeek hasn\u2019t specified when it might release its next flagship model.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>DeepSeek is calling these new models DeepSeek-GRM \u2014 short for \u201cgeneralist reward modeling\u201d \u2014 and will release them on an open source basis, the company said.<\/p>\n","protected":false},"author":341034,"featured_media":590035,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[92837],"tags":[35793,98600,99459,99460],"class_list":["post-590032","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-ai","tag-artificial-intelligence-ai","tag-deepseek","tag-generalist-reward-modeling","tag-tsinghua-university"],"_links":{"self":[{"href":"https:\/\/mybroadband.co.za\/news\/wp-json\/wp\/v2\/posts\/590032"}],"collection":[{"href":"https:\/\/mybroadband.co.za\/news\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/mybroadband.co.za\/news\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/mybroadband.co.za\/news\/wp-json\/wp\/v2\/users\/341034"}],"replies":[{"embeddable":true,"href":"https:\/\/mybroadband.co.za\/news\/wp-json\/wp\/v2\/comments?post=590032"}],"version-history":[{"count":3,"href":"https:\/\/mybroadband.co.za\/news\/wp-json\/wp\/v2\/posts\/590032\/revisions"}],"predecessor-version":[{"id":590040,"href":"https:\/\/mybroadband.co.za\/news\/wp-json\/wp\/v2\/posts\/590032\/revisions\/590040"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/mybroadband.co.za\/news\/wp-json\/wp\/v2\/media\/590035"}],"wp:attachment":[{"href":"https:\/\/mybroadband.co.za\/news\/wp-json\/wp\/v2\/media?parent=590032"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/mybroadband.co.za\/news\/wp-json\/wp\/v2\/categories?post=590032"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/mybroadband.co.za\/news\/wp-json\/wp\/v2\/tags?post=590032"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}