web analytics

Learn AI With Kesse | Best Place For AI News

We make artificial intelligence easy and fun to read. Get Updated AI News.

DeepSeek’s New AI Outperforms Gemini 3 DeepThink Using Unforgiving Logic

DeepSeek’s New AI Just Surpassed Gemini 3 DeepThink With Brutal Logic

The Rise of Advanced AI Models: Deepseek and Tencent Lead the Charge

The landscape of artificial intelligence is evolving at breakneck speed, with remarkable advancements rising from unexpected places. Recently, two projects have made significant waves in the AI community: Deepseek’s Math V2 model, which operates at International Math Olympiad gold medal levels, and Tencent’s Huan OCR, a streamlined optical character recognition (OCR) model boasting a mere 1 billion parameters while outperforming larger competitors.

Deepseek Math V2: A Breakthrough in Mathematical Reasoning

Deepseek’s Math V2 model surprisingly dropped on Hugging Face without much fanfare, but it has quickly become one of the most impressive math reasoning models available to the public. Building on the success of its predecessor, a 7 billion parameter model that previously matched GPT-4 and Gemini Ultra on math tasks, Math V2 has set its sights even higher. Deepseek claims it surpasses Google’s Gemini Deepthink, a model designed explicitly for structured reasoning.

Self-Verification: The Key to Success

What sets Deepseek Math V2 apart is its focus on self-verifiable reasoning. Most existing AI math systems prioritize the final answer, often missing the crucial process behind achieving the solution. However, real mathematics demands rigor, logic, and thorough derivations. Deepseek recognized that models heavily dependent on accuracy often hit a ceiling, excelling in benchmarks but faltering when required to produce rigorous proofs.

Math V2 employs a teaching framework consisting of a student, an examiner, and a supervisor. The student generates proofs, the examiner verifies them, and the supervisor ensures that feedback makes sense. This model is unique because it not only checks for correct answers but also assesses the quality of reasoning. The examiner uses a three-point grading system, encouraging thorough proof development and offering constructive feedback like a human grader.

Revolutionary Self-Evaluation System

In a bold move, Deepseek also includes a self-evaluation component within the student model, where it grades its output and reflects on its reasoning. If the model admits a mistake, it is rewarded, fostering a culture of honesty and self-improvement. This approach enhances learning, as the student model not only receives feedback but also learns to recognize its own limitations.

This self-contained system creates a closed loop where the teacher, examiner, and student evolve together. For example, the performance of Math V2 on the IMO proof bench reached nearly 99% on basic problems and scored impressively on the 2024 Putnam test, achieving 118 out of a potential 120 points.

Tencent’s Huan OCR: Pioneering Compact Solutions

In a different domain, Tencent has unveiled Huan OCR, a cutting-edge optical character recognition model that defies expectations. With only 1 billion parameters, it surpasses several major multi-modal giants, showcasing the power of compact specialization.

Simplified Model Architecture

Huan OCR is designed differently than traditional OCR systems. Instead of using a series of complex steps—text detection, recognition, layout rebuilding, etc.—Huan OCR operates as a single end-to-end model. This simplicity reduces the risk of errors that may arise from managing multiple components. By processing images directly in their original resolution and aspect ratio, Huan OCR excels at handling diverse document formats, including long receipts, multi-column layouts, and poorly scanned materials.

Advanced Training Techniques

Tencent employed a multi-stage training approach, utilizing a combination of pure text, synthetic data, and multilingual samples. The model’s context window was gradually expanded to accommodate 32K tokens, enabling it to handle long documents seamlessly. Unlike many models that offer rewards based solely on the final output, Huan OCR uses a reinforcement learning mechanism that aligns rewards with ground truth structure.

This innovative training ensures that the model maintains high accuracy while producing structured outputs. Tests on internal benchmarks demonstrated that Huan OCR achieved an impressive overall score against 900 OCR images, outperforming well-known systems like Paddle OCR and general-purpose visual-language models.

Conclusion: The Future of AI Models

The emergence of specialized models like Deepseek Math V2 and Tencent’s Huan OCR signals a pivotal moment in AI development. These advancements illustrate that smaller, focused models can outperform larger, more generalized systems in specific tasks. As the race for AI excellence continues, the conversation surrounding the future shifts towards whether highly specialized models or giant all-in-one systems will dominate.

The real takeaway here is that the framework used by these models is as important as their functionality. By prioritizing reasoning quality and incorporating self-verification techniques, these new models are setting new standards for what AI can achieve. As we look ahead, one thing remains clear: the field of artificial intelligence is not just progressing; it is evolving in unexpected and thrilling ways.

Feel free to share your thoughts in the comments: Do you believe specialized models will triumph, or will all-in-one systems remain the standard? Your insights are valuable as we navigate this exciting landscape of AI.



#DeepSeeks #Surpassed #Gemini #DeepThink #Brutal #Logic
Thanks for reaching. Please let us know your thoughts and ideas in the comment section.

Source link

About The Author

18 thoughts on “DeepSeek’s New AI Outperforms Gemini 3 DeepThink Using Unforgiving Logic

  1. Maybe dumb question, but couldn't solutions for IMO 2025 and Putnam 2024 be already present in AI training data, I've seen once for IMO? Likely it is accounted for(otherwise every general model could get 100% by searching), but it would be exciting to check for 2026 before solutions are published

  2. Small agile models will win out over big generalized ones. Businesses have specific use cases for AI. And businesses are the ones spending on AI. Even when it comes to humanoids in people's homes, they will need updating and you'll be able to customize them like Neo in The Matrix.

  3. Very nice. Tiny models with strong functionality is great. The better this category gets, the sooner these functions will be "always on" functions for AI. Always aware of the world, its environment, always able to speak and listen, etc. This is the path to always-aware AI, and is necessary going forward for AI agents in our lives.

  4. Why do you people pump a Chinese company and promote them as if they have any aspect of truth with regards to anything they say. I am so tired of Americans or anyone promoting CRAP from a communist country run by dictators since 1949. Why do you people continue to platform any company that comes out of China. There is no democracy there. Nothing they say or do is verified or legitimate to the degree that American and other companies from democratic countries are. Nothing that China or their companies do is wholly transparent and everything that any Chinese company does is mandated by the government to contain built in censorship. Everything they produce can be hacked and or taken advantage of by outside sources such as the Chinese government or those working for them. All legitimate democratic governments have banned the usage of Chinese Ai for official networks.

    STOP PLATFORMING FOR COMMUNIST CHINA and the companies that are wholly controlled by the lying Ching Ping!!!! You tarnish your entire channel by propagating for a lying dictatorship. Stop or I and many others will soon abandon your channel and all those who promote China and the companies that front for their dictator government. Sadly, for all we know, you are an agent of and for the Chinese government.

    Here is the real Brutal Logic. Anyone that uses deepseek is an IDIOT. And anyone that promotes deepseek or any Chinese company is a traitor to democracy and probably a chinese agent.

  5. My ai app it now works with ollama cloud models and toolbridge proxy that allows non tool able agentic models to use tools via the proxy bridge that uses ollama cloud model locally (free models on ollama available for minimax-m2:cloud AI Agent CLI Builder Out aHR0cHM6Ly9naXRodWIuY29tL2phbWllZHVrL093bi1DTEktQWdlbnQ=

  6. omnibench you only used gemini 2 so interesting but not informative. Interesting idea is that these open source models are advanced but because China can't afford infrastructure they're open which will get customers but it will give insight to all of the closed models

  7. Remember back in the day when Gemini 3 literally wrecked the status quo overnight? Heralding in a seismic shift and bringing forth a new AI utopian era promising to reshape everything while leading us unto the next ginormous evolutionary leap for quite some time to come! And so, it did!!! Everyone on EVERY channel across the A.I. front united in this unanimous chorus! And they were all right!!! And we will NEVER forget that amazing 48 hours.

    Such is the times that we live. ❤

Leave a Reply

Your email address will not be published. Required fields are marked *

We use cookies to personalize content and ads and to primarily analyze our geo traffic sources. We also may share information about your use of our site with our social media, advertising, and analytics partners to improve your user experience. We respect your privacy and will never abuse your information. [ Privacy Policy ] View more
Cookies settings
Accept
Decline
Privacy & Cookie Policy
Privacy & Cookies policy
Cookie name Active

The content on this page governs our Privacy Policy. It describes how your personal information is collected, used, and shared when you visit or make a purchase from learnaiwithkesse.com (the "Site").

Kesseswebsites and Advertising owns Learn AI With Kesse and the website learnaiwithkesse.wiki. For the purpose of this Terms and Agreements [ we, us, I, our ] represents the owner of Learning AI With Kesse which is Kesseswebsites and Advertising. [ You, your, student and buyer ] represents you as the user and visitor of this site. Terms of Conditions, Terms of Service, Terms and Agreement and Terms of use shall be considered the same here. This website or site refers to https://learnaiwithkesse.com. You agree that the content of this Terms and Agreement may include Privacy Policy and Refund Policy. Products refer to physical or digital products. This includes eBooks, PDFs, and text or video courses. If there is anything on this page you do not understand you agree to reach out to us via email [ emmanuel@learnaiwithkesse.com ] for explanation before using any part of this site.

1. Personal Information We Collect

When you visit this Site, we automatically collect certain information about your device, including information about your web browser, IP address, time zone, and some of the cookies that are installed on your device. The primary purpose of this activity is to provide you a better user experience the next time you visit our again and also the data collection is for analytics study. Additionally, as you browse the Site, we collect information about the individual web pages or products that you view, what websites or search terms referred you to the Site, and information about how you interact with the Site. We refer to this automatically-collected information as "Device Information."

We collect Device Information using the following technologies:

"Cookies" are data files that are placed on your device or computer and often include an anonymous unique identifier. For more information about cookies, and how to disable cookies, visit http://www.allaboutcookies.org. To comply with European Union's GDPR (General Data Protection Regulation), we do display a disclaimer a consent text at the bottom of this website. This disclaimer alerts you the visitor or user of this website about why we use cookies, and we also give you the option to accept or decline. If you accept for us to use cookies on your site, the agreement between you and us will expire after 180 has passed.

"Log files" track actions occurring on the Site, and collect data including your IP address, browser type, Internet service provider, referring/exit pages, and date/time stamps.

"Web beacons," "tags," and "pixels" are electronic files used to record information about how you browse the Site.

Additionally, when you make a purchase or attempt to make a purchase through the Site, we collect certain information from you, including your name, billing address, shipping address, payment information (including credit card numbers), email address, and phone number. We refer to this information as "Order Information."

When we talk about "Personal Information" in this Privacy Policy, we are talking both about Device Information and Order Information.

Payment Information

Please note that we use 3rd party payment processing companies like https://stripe.com and https://paypal.com to process your payment information. PayPal and Stripe protects your data according to their terms and agreement and may store your data to help make your subsequent transactions on this website easier. We never and [ DO NOT ] store your card information or payment login information on our website or server. By making payment on our site, you agree to abide by the Terms and Agreement of the 3rd Party payment processing companies we use. You can visit their websites to read their Terms of Use and learn more about them.

2. How Do We Use Your Personal Information?

We use the Order Information that we collect generally to fulfill any orders placed through the Site (including processing your payment information, arranging for shipping, and providing you with invoices and/or order confirmations). Additionally, we use this [a] Order Information to:

[b] Communicate with you;

[c] Screen our orders for potential risk or fraud; and

When in line with the preferences you have shared with us, provide you with information or advertising relating to our products or services. We use the Device Information that we collect to help us screen for potential risk and fraud (in particular, your IP address), and more generally to improve and optimize our Site (for example, by generating analytics about how our customers browse and interact with the Site, and to assess the success of our marketing and advertising campaigns).

3. Sharing Your Personal Information

We share your Personal Information with third parties to help us use your Personal Information, as described above. For example, we use System.io to power our online store--you can read more about how Systeme.io uses your Personal Information here: https://systeme.io/privacy-policy/ . We may also use Google Analytics to help us understand how our customers use the Site--you can read more about how Google uses your Personal Information here: https://www.google.com/intl/en/policies/privacy/. You can also opt-out of Google Analytics here: https://tools.google.com/dlpage/gaoptout.

Finally, we may also share your Personal Information to comply with applicable laws and regulations, to respond to a subpoena, search warrant or other lawful request for information we receive, or to otherwise protect our rights.

4. Behavioral Advertising

As described above, we use your Personal Information to provide you with targeted advertisements or marketing communications we believe may be of interest to you. For more information about how targeted advertising works, you can visit the Network Advertising Initiative’s (“NAI”) educational page at http://www.networkadvertising.org/understanding-online-advertising/how-does-it-work.

You can opt-out of targeted advertising by:

COMMON LINKS INCLUDE:

FACEBOOK - https://www.facebook.com/settings/?tab=ads

GOOGLE - https://www.google.com/settings/ads/anonymous

BING - https://advertise.bingads.microsoft.com/en-us/resources/policies/personalized-ads]

Additionally, you can opt-out of some of these services by visiting the Digital Advertising Alliance’s opt-out portal at: http://optout.aboutads.info/.

5. Data Retention

Besides your card payment and payment login information, when you place an order through the Site, we will maintain your Order Information for our records unless and until you ask us to delete this information. Example of such information include your first name, last name, email and phone number.

6. Changes

We may update this privacy policy from time to time in order to reflect, for example, changes to our practices or for other operational, legal or regulatory reasons.

7. Contact Us

For more information about our privacy practices, if you have questions, or if you would like to make a complaint, please contact us by e-mail at emmanuel@learnaiwithkesse.com or by mail using the details provided below:

8. Your acceptance of these terms

By using this Site, you signify your acceptance of this policy. If you do not agree to this policy, please do not use our Site. Your continued use of the Site following the posting of changes to this policy will be deemed your acceptance of those changes.

Last Update | 18th August 2024

Save settings
Cookies settings