Close Menu
  • Home
  • AI
  • Business
  • Crypto
  • Entertainment
  • Finance
  • LIfe
  • Market
  • Sports
  • US
  • Tech

Subscribe to Updates

Subscribe to our newsletter and never miss our latest news

Subscribe my Newsletter for New Posts & tips Let's stay updated!

What's Hot

New details appear on the scale of Meta’s $14.3 billion contract

June 14, 2025

Founder Experience at TechCrunch All Stage: Building for those who build the following

June 14, 2025

Boston Dynamics Robots dance to “Don’t Stop Me Now” for “American Got Talent” audition

June 14, 2025
Facebook X (Twitter) Instagram
XMcnx
  • Home
  • About Us
  • Advertise with Us
  • Contact Us
  • DMCA
  • Privacy Policy
  • Terms & Conditions
  • Home
  • AI
  • Business
  • Crypto
  • Entertainment
  • Finance
  • LIfe
  • Market
  • Sports
  • US
  • Tech
XMcnx
Home » Deepseek may have trained the latest models using Google’s Gemini
AI

Deepseek may have trained the latest models using Google’s Gemini

By supportJune 3, 2025No Comments3 Mins Read
Facebook Twitter Pinterest LinkedIn Tumblr Email
Gettyimages 2196333417 75e106.jpg
Share
Facebook Twitter LinkedIn Pinterest Email Copy Link


Last week, Chinese lab Deepseek released an updated version of the R1 Reasoning AI model that works well with many mathematics and coding benchmarks. The company did not reveal the source of the data it used to train the models, but some AI researchers speculate that at least partially came from AI in Google’s Gemini family.

Sam Paech, a Melbourne-based developer who creates AI’s “emotional intelligence” assessment, has published what he claims is evidence that Deepseek’s latest model has been trained for output from Gemini. The Deepseek model, called the R1-0528, prefers words and expressions similar to Google’s Gemini 2.5 Pro favours, Paech said in the X-Post.

If you’re wondering why the new Deepseek R1 sounds a little different, I think they’ve probably switched from training with synthetic Openai to synthetic Gemini output. pic.twitter.com/oex9roapnv

– Sam Paech (@sam_paech) May 29, 2025

It’s not a smoking gun. However, he pointed out that another developer, the trace of the Deepseek model, the pseudonym creator of AI’s “free speech assessment,” called SpeechMap, the “thinking” that the model generates when it works towards conclusions, “read like traces of Gemini.”

Deepseek has previously been accused of training on data from rival AI models. In December, developers observed that Deepseek’s V3 model often identifies as ChatGpt, Openai’s AI-powered Chatbot platform, suggesting that it may be trained in the ChatGPT chat log.

Earlier this year, Openai told the Financial Times that it found evidence linking Deepseek to the use of distillation. According to Bloomberg, Microsoft, a collaborator and investor at Openai, detected a large amount of data was being excluded through its Openai developer account in late 2024. Openai believes it is affiliated with Deepseek.

Distillation is not an uncommon practice, but Openai’s terms of service prohibit customers from using company model output to build competing AI.

To be clear, many models misidentify themselves and converge to the same word and phrases of turn. This is because Open Web, a place where AI companies source most of their training data, is scattered with AI slops. Content Farms are using AI to create ClickBait, and bots are flooding Reddit and X.

This “contamination” made it extremely difficult to thoroughly filter the AI ​​output from the training dataset if so.

Still, AI experts like Nathan Lambert, a researcher at the non-profit AI Institute AI2, don’t think Deepseek trained data from Google’s Gemini out of trouble.

“If I were Deepseek, I would definitely create a ton of synthetic data from the best API models out there,” Lambert wrote in X’s post.

If I were deepseek, I would definitely create a ton of synthetic data from the best API models out there. They are short on the GPU and flush with cash. It’s literally more efficient for them more calculations. Yes, about Gemini Distill’s questions.

– Nathan Lambert (@Natolambert) June 3, 2025

In some cases, AI companies are increasing their security measures to prevent distillation.

In April, OpenAI began requesting organizations to complete the identity verification process to access certain advanced models. This process requires a government-issued ID from one of the countries supported by Openai’s API. China is not on the list.

Elsewhere, Google recently launched a “summary” of traces generated by models available through the AI ​​Studio Developer Platform. In May, humanity said it would begin summarizing traces of its own model, citing the need to protect “competitive benefits.”

I will contact Google for comment and update this article if I receive a reply.





Source link

Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
Previous ArticleMeta buys nuclear power plants (more or less)
Next Article Tiktok deploys AI-powered smart keyword filters to limit content you don’t want to see
support

Related Posts

AI

Clay will secure a new round at a $300 million valuation, sources say

By supportJune 13, 2025
AI

New York passes bill to prevent AI fuel disasters

By supportJune 13, 2025
AI

Google Tests the Audio Summary for Search Queries

By supportJune 13, 2025
AI

Meta’s Big AI Bet and Our Not So Hot Takes at Fintech IPOS

By supportJune 13, 2025
AI

Scale AI confirms “major” investments from Meta, CEO Alexandre Wang says he’s gone

By supportJune 13, 2025
AI

Scale AI confirms “major” investments from Meta, says CEO Alexanr Wang is leaving

By supportJune 13, 2025
Add A Comment
Leave A Reply Cancel Reply

Don't Miss

New details appear on the scale of Meta’s $14.3 billion contract

By supportJune 14, 2025

It is certainly unusual for meta transactions to partially acquire AI startup scales to grant…

Founder Experience at TechCrunch All Stage: Building for those who build the following

June 14, 2025

Boston Dynamics Robots dance to “Don’t Stop Me Now” for “American Got Talent” audition

June 14, 2025

Zevo’s EV-only Car Share Fleet helps Tesla owners make money

June 14, 2025
Top Posts

Cancelling the Joy Reed Show is “mistakes”

February 26, 2025

Black melodrama has a possibility

February 26, 2025

The “Facts of Life” star died in 83

February 25, 2025

Cara Sophia Gascon joins Oscar despite social media controversy

February 25, 2025

Subscribe to Updates

Subscribe to our newsletter and never miss our latest news

Subscribe my Newsletter for New Posts & tips Let's stay updated!

About Us
About Us

Welcome to XMcnx – your trusted source for insightful information about the world of Crypto, Market trends, the latest developments in the US, cutting-edge AI technologies, Tech innovations, and Finance.

At XMcnx, our mission is to provide you with timely, accurate, and relevant news and analyses that empower you to stay ahead in an ever-evolving digital world. We understand the challenges of navigating through the complexities of modern markets, technology, and financial systems. That’s why we’re dedicated to delivering high-quality content that helps you make informed decisions.

Facebook X (Twitter) Pinterest YouTube WhatsApp
Our Picks

New details appear on the scale of Meta’s $14.3 billion contract

June 14, 2025

Founder Experience at TechCrunch All Stage: Building for those who build the following

June 14, 2025

Boston Dynamics Robots dance to “Don’t Stop Me Now” for “American Got Talent” audition

June 14, 2025
Most Popular

TikTok announces it will go dark on Sunday without ‘definitive’ guarantees

January 18, 2025

President Trump mints $31 billion in new official $TRUMP crypto meme coin

January 18, 2025

El Salvador’s secret weapon? Stacey Herbert talks about the company’s extensive Bitcoin education program

January 18, 2025
  • Home
  • About Us
  • Advertise with Us
  • Contact Us
  • DMCA
  • Privacy Policy
  • Terms & Conditions
© 2025 xmcnx. Designed by xmcnx.

Type above and press Enter to search. Press Esc to cancel.