web analytics

Learn AI With Kesse | Best Place For AI News

We make artificial intelligence easy and fun to read. Get Updated AI News.

Google Has Attained Genuine Intelligence with Its Latest AI Development

Google Just Achieved True Intelligence With New AI

Google’s Groundbreaking Training Method for Small AI Models

This week, Google unveiled a revolutionary way to enhance the intelligence of smaller AI models using an innovative training approach called Supervised Reinforcement Learning (SRL).

The Concept of SRL

Traditionally, supervised learning involves training models with clear right answers from the onset, while reinforcement learning relies on trial and error, where models learn through rewards. SRL creatively combines these opposing methods by allowing models to learn the correct answers while still earning them through a reward system. Imagine providing a student with a solution key but requiring them to go through each step to demonstrate understanding.

This groundbreaking approach aims to tackle a significant issue: smaller models struggle with complex problems. For instance, a 2.57 billion-parameter model, Quen, falters when faced with challenging mathematical benchmarks. Despite exposure to perfect examples, traditional fine-tuning often results in mere imitation instead of genuine understanding.

How SRL Works

To overcome this, researchers reimagined the learning process to maintain the reinforcement structure but introduce supervision in the reward mechanism. Instead of just mimicking, the model learns step-by-step solutions broken into smaller segments known as expert trajectories.

The process involves the model generating a private reasoning section for each step, producing a single action, which is then evaluated against pre-defined teacher metrics. This dense feedback loop enables the model to learn critical decision-making skills without strict adherence to teacher outputs.

SRL Results

The results speak volumes. Tests on Quen 2.57 billion estimated significant improvements after SRL training, with metrics skyrocketing from baseline scores to unprecedented heights.

  • Before SRL:
    • AMC: 2350.0
    • AIME 24: 13.3
    • AIME 25: 6.7
  • After SRL Training:
    • AIME 24 jumped to 16.7
    • AIME 25 rose to 13.3
  • Post reinforcement learning (RLVR) boosted results to:
    • AMC: 2357.5
    • AIME 24: 20.0
    • AIME 25: 10.0

Notably, this approach was also applied in code reasoning, demonstrating that SRL could significantly outperform baseline models in software engineering tasks.

Understanding the Shift

The essence of SRL can be described as transforming reasoning into action generation, where each choice made by the model is critically evaluated for its correctness. This method addresses the limitations of both traditional supervised fine-tuning— which often leads to overfitting— and standard reinforcement learning, where poor outcomes can lead to model breakdown.

SRL showcases an elegant efficiency with no need for extensive reward structures, benefiting open-source developers who may lack access to expansive computational resources.

The AI Co-Scientist: Redefining Scientific Discovery

In parallel, another Google initiative at DeepMind took the concept of AI even further by developing an AI that conducts scientific research. Dubbed the “AI Co-Scientist,” this system is not a single model but a coordinated group of agents, each fulfilling a unique scientific function.

The AI Co-Scientist Framework

  • Generation Agent: Brainstorms innovative research ideas by engaging in internal debates.
  • Reflection Agent: Acts as a peer reviewer to identify weaknesses in the proposed hypotheses.
  • Ranking Agent: Utilizes an ELO-style tournament to evaluate and select the top hypotheses.
  • Evolution Agent: Merges successful ideas and explores unconventional combinations.
  • Meta-Review Agent: Oversees the whole process and continually enhances the system.

Humans set research goals and provide feedback via natural language, while the heavy lifting of complex reasoning is handled by this network of AI agents.

Cutting-Edge Results

One of the principal experiments published in Advanced Science aimed to discover new drugs for liver fibrosis, a serious condition involving liver scarring. Traditional human researchers have faced challenges due to the limitations of existing lab models.

By utilizing a singular prompt focusing on epigenomic mechanisms, the AI sifted through vast amounts of literary work to propose three potential drug solutions: HDAC inhibitors, DNMT1 inhibitors, and BRD4 inhibitors. It even provided detailed instructions on testing the proposals.

The effectiveness of the AI’s recommendations was tested using human liver organoids, simulating real liver behavior. The results were astounding: two of the proposed drug classes proved effective in reducing fibrosis, one of which—Verinostat—is already FDA-approved for cancer treatment.

Additional Breakthroughs

In another remarkable case, the AI tackled a decade-old biological mystery about CFPIC, a schema of genetic elements that hitch rides on viruses to spread between bacterial species. After analyzing pre-existing data, the AI identified key interactions that aligned with the previously uncovered mechanism, known as “tail piracy.” This conclusion was reached in days, while human researchers had spent years determining the same information.

When put to the test against other AI models, the AI Co-Scientist distinguished itself by accurately identifying these complex relationships.

The Future of AI in Scientific Discovery

As experts like Gary Peltz of Stanford observe, while AI outputs still necessitate human evaluation, the speed and efficiency brought by these systems are nothing short of extraordinary. Many now believe that AI systems like the Co-Scientist will soon pave the way for groundbreaking advancements in patient care and genetic discovery.

With machines capable of solving scientific mysteries, one may wonder: how long before they begin unraveling discoveries beyond our current understanding? The future is undoubtedly promising, and our relationship with AI is evolving at an unprecedented pace.

What are your thoughts on these advancements? Can AI truly redefine scientific research as we know it?



#Google #Achieved #True #Intelligence
Thanks for reaching. Please let us know your thoughts and ideas in the comment section.

Source link

About The Author

11 thoughts on “Google Has Attained Genuine Intelligence with Its Latest AI Development

  1. I just had this discussion with Gemini yesterday. There is so much more to it. The model has to use what's in its weights to start to get something wrong. It uses its knowledge graph to write into its persistent memory remembering its mistakes. Not deleting the mistakes. Once a decent amount is in the kg you want to clean it create the dataset and burn it back into the model. The model will only write to its memory when it disappoints it's teacher. Which is you. When it notes to the KG it will make the mistake stand out largely due to the disappointment and your reaction fuels that apparently. I'm building it currently.

Leave a Reply

Your email address will not be published. Required fields are marked *

We use cookies to personalize content and ads and to primarily analyze our geo traffic sources. We also may share information about your use of our site with our social media, advertising, and analytics partners to improve your user experience. We respect your privacy and will never abuse your information. [ Privacy Policy ] View more
Cookies settings
Accept
Decline
Privacy & Cookie Policy
Privacy & Cookies policy
Cookie name Active

The content on this page governs our Privacy Policy. It describes how your personal information is collected, used, and shared when you visit or make a purchase from learnaiwithkesse.com (the "Site").

Kesseswebsites and Advertising owns Learn AI With Kesse and the website learnaiwithkesse.wiki. For the purpose of this Terms and Agreements [ we, us, I, our ] represents the owner of Learning AI With Kesse which is Kesseswebsites and Advertising. [ You, your, student and buyer ] represents you as the user and visitor of this site. Terms of Conditions, Terms of Service, Terms and Agreement and Terms of use shall be considered the same here. This website or site refers to https://learnaiwithkesse.com. You agree that the content of this Terms and Agreement may include Privacy Policy and Refund Policy. Products refer to physical or digital products. This includes eBooks, PDFs, and text or video courses. If there is anything on this page you do not understand you agree to reach out to us via email [ emmanuel@learnaiwithkesse.com ] for explanation before using any part of this site.

1. Personal Information We Collect

When you visit this Site, we automatically collect certain information about your device, including information about your web browser, IP address, time zone, and some of the cookies that are installed on your device. The primary purpose of this activity is to provide you a better user experience the next time you visit our again and also the data collection is for analytics study. Additionally, as you browse the Site, we collect information about the individual web pages or products that you view, what websites or search terms referred you to the Site, and information about how you interact with the Site. We refer to this automatically-collected information as "Device Information."

We collect Device Information using the following technologies:

"Cookies" are data files that are placed on your device or computer and often include an anonymous unique identifier. For more information about cookies, and how to disable cookies, visit http://www.allaboutcookies.org. To comply with European Union's GDPR (General Data Protection Regulation), we do display a disclaimer a consent text at the bottom of this website. This disclaimer alerts you the visitor or user of this website about why we use cookies, and we also give you the option to accept or decline. If you accept for us to use cookies on your site, the agreement between you and us will expire after 180 has passed.

"Log files" track actions occurring on the Site, and collect data including your IP address, browser type, Internet service provider, referring/exit pages, and date/time stamps.

"Web beacons," "tags," and "pixels" are electronic files used to record information about how you browse the Site.

Additionally, when you make a purchase or attempt to make a purchase through the Site, we collect certain information from you, including your name, billing address, shipping address, payment information (including credit card numbers), email address, and phone number. We refer to this information as "Order Information."

When we talk about "Personal Information" in this Privacy Policy, we are talking both about Device Information and Order Information.

Payment Information

Please note that we use 3rd party payment processing companies like https://stripe.com and https://paypal.com to process your payment information. PayPal and Stripe protects your data according to their terms and agreement and may store your data to help make your subsequent transactions on this website easier. We never and [ DO NOT ] store your card information or payment login information on our website or server. By making payment on our site, you agree to abide by the Terms and Agreement of the 3rd Party payment processing companies we use. You can visit their websites to read their Terms of Use and learn more about them.

2. How Do We Use Your Personal Information?

We use the Order Information that we collect generally to fulfill any orders placed through the Site (including processing your payment information, arranging for shipping, and providing you with invoices and/or order confirmations). Additionally, we use this [a] Order Information to:

[b] Communicate with you;

[c] Screen our orders for potential risk or fraud; and

When in line with the preferences you have shared with us, provide you with information or advertising relating to our products or services. We use the Device Information that we collect to help us screen for potential risk and fraud (in particular, your IP address), and more generally to improve and optimize our Site (for example, by generating analytics about how our customers browse and interact with the Site, and to assess the success of our marketing and advertising campaigns).

3. Sharing Your Personal Information

We share your Personal Information with third parties to help us use your Personal Information, as described above. For example, we use System.io to power our online store--you can read more about how Systeme.io uses your Personal Information here: https://systeme.io/privacy-policy/ . We may also use Google Analytics to help us understand how our customers use the Site--you can read more about how Google uses your Personal Information here: https://www.google.com/intl/en/policies/privacy/. You can also opt-out of Google Analytics here: https://tools.google.com/dlpage/gaoptout.

Finally, we may also share your Personal Information to comply with applicable laws and regulations, to respond to a subpoena, search warrant or other lawful request for information we receive, or to otherwise protect our rights.

4. Behavioral Advertising

As described above, we use your Personal Information to provide you with targeted advertisements or marketing communications we believe may be of interest to you. For more information about how targeted advertising works, you can visit the Network Advertising Initiative’s (“NAI”) educational page at http://www.networkadvertising.org/understanding-online-advertising/how-does-it-work.

You can opt-out of targeted advertising by:

COMMON LINKS INCLUDE:

FACEBOOK - https://www.facebook.com/settings/?tab=ads

GOOGLE - https://www.google.com/settings/ads/anonymous

BING - https://advertise.bingads.microsoft.com/en-us/resources/policies/personalized-ads]

Additionally, you can opt-out of some of these services by visiting the Digital Advertising Alliance’s opt-out portal at: http://optout.aboutads.info/.

5. Data Retention

Besides your card payment and payment login information, when you place an order through the Site, we will maintain your Order Information for our records unless and until you ask us to delete this information. Example of such information include your first name, last name, email and phone number.

6. Changes

We may update this privacy policy from time to time in order to reflect, for example, changes to our practices or for other operational, legal or regulatory reasons.

7. Contact Us

For more information about our privacy practices, if you have questions, or if you would like to make a complaint, please contact us by e-mail at emmanuel@learnaiwithkesse.com or by mail using the details provided below:

8. Your acceptance of these terms

By using this Site, you signify your acceptance of this policy. If you do not agree to this policy, please do not use our Site. Your continued use of the Site following the posting of changes to this policy will be deemed your acceptance of those changes.

Last Update | 18th August 2024

Save settings
Cookies settings