web analytics

Learn AI With Kesse | Best Place For AI News

We make artificial intelligence easy and fun to read. Get Updated AI News.

Why Cohere’s former AI research lead opposes the scaling race.

Why Cohere's ex-AI research lead is betting against the scaling race

Image Credits:Chris Behroozian

The Race for Advanced AI: A Shift from Scaling Large Language Models

AI labs are engaged in a frenzied competition to establish data centers as expansive as Manhattan, each requiring billions of dollars and consuming energy equivalent to that of small cities. This race is motivated by the belief in “scaling”—the notion that increasing computing power applied to existing AI training methods will eventually produce superintelligent systems capable of executing a wide array of tasks.

The Limits of Large Language Models

However, an increasing number of AI researchers are cautioning that the scaling of large language models (LLMs) may be nearing its limits. They argue that new breakthroughs may be essential for enhancing AI performance significantly. Among those leading this charge is Sara Hooker, the co-founder of the new startup Adaption Labs, which she founded alongside Sudip Roy, both veterans from Cohere and Google.

Hooker has expressed a belief that relying solely on scaling LLMs has become an inefficient strategy for maximizing AI model performance. Shortly after leaving her position as the VP of AI Research at Cohere in August, she announced her new initiative aimed at addressing what she sees as one of the most pressing issues: creating AI systems that can adapt and continuously learn from real-world experiences.

A Vision for Adaptive Learning

In a recent interview with TechCrunch, Hooker elaborated on Adaption Labs’ mission to build AI systems capable of efficiently learning and adapting to their environments. While she refrained from divulging specific methodologies or whether the company will utilize LLM architectures, the idea is clear: in her view, merely scaling up existing models has not led to true intelligence that can interact meaningfully with the world.

Hooker emphasizes that adaptability lies at the “heart of learning.” She likens the process of learning to a human experience—stub your toe on a dining room table, and you learn to navigate more carefully the next time. Although AI researchers have tried to embed this adaptive learning into models through reinforcement learning (RL), Hooker points out that current RL methods fall short in real-world applications, as they do not allow AI systems to learn from mistakes while in production.

The Challenges of Fine-Tuning AI Models

Many AI labs currently provide consulting services aimed at customizing AI models for enterprise needs, but these services often come at a steep cost. For example, OpenAI reportedly requires clients to invest over $10 million to receive consulting on fine-tuning. Hooker argues that there are only a handful of dominant labs dictating a limited range of AI models that are expensive to adapt. She believes this paradigm is outdated, suggesting that AI systems can and must learn efficiently from their environments to redefine the landscape of AI control and utility.

The Growing Skepticism Around Scaling

Adaption Labs symbolizes a critical shift in the industry’s confidence regarding scaling LLMs. Recent research from MIT indicates that the largest AI models may soon experience diminishing returns, a sentiment echoed in skeptical discussions among leading AI researchers. For instance, Richard Sutton, a Turing Award-winning figure known as the “father of RL,” commented in a recent conversation that LLMs cannot truly scale since they lack the ability to learn from real-world experiences. This sentiment was further reinforced by Andrej Karpathy, an early OpenAI employee, who expressed reservations about the long-term viability of RL for improving AI models.

Concerns regarding scaling through pretraining—a strategy where AI models learn patterns from extensive datasets—have surfaced as well. Pre-training had previously been the cornerstone for the success of firms like OpenAI and Google. However, evidence suggests that the benefits from this method may also be approaching saturation.

Alternate Approaches: AI Reasoning Models

While concerns about the limitations of scaling have gained traction, the AI industry continues to identify alternative ways to enhance model performance. Breakthroughs in AI reasoning models—systems that utilize additional time and computational resources to methodically work through problems before generating responses—have extended the capabilities of AI in exciting directions.

AI labs are becoming increasingly convinced that scaling up RL and AI reasoning models represents the new frontier for AI advancements. OpenAI previously communicated to TechCrunch that they developed their initial AI reasoning model, o1, with the expectation that it would scale effectively. Additionally, research released by Meta and Periodic Labs recently explored the potential for RL to further enhance performance, highlighting the expensive nature of current approaches.

Adaption Labs: Finding Cost-Effective Solutions

In contrast to prevailing methods, Adaption Labs strives to unearth innovations that demonstrate learning from experience can be pursued in a far more cost-efficient manner. The startup was reportedly in discussions to raise a seed round between $20 million and $40 million this past fall, and while the final amount is undisclosed, it’s understood that the round has since concluded successfully. Hooker expressed their ambition to achieve significant advancements in AI and hinted at the high-caliber nature of their founding team.

Previously at Cohere Labs, Hooker specialized in training compact AI models for enterprises, and she now aims to push this trend of outperforming larger models in diverse benchmarks, including coding and reasoning tasks.

A Global Approach to AI Research

Beyond technological advancements, Hooker has made a name for herself in broadening access to AI research on a global scale. Hiring talent from underrepresented regions such as Africa has been part of her mission. While plans for an office in San Francisco are underway, Hooker is committed to a diverse approach to hiring.

The Broader Implications of Adaptive Learning

Should Hooker and Adaption Labs succeed in proving the limitations of scaling methods, the implications could be monumental. With billions already invested in the belief that larger models translate to greater general intelligence, a shift towards true adaptive learning may reveal a more powerful and efficient route to AI advancement.

In conclusion, as AI researchers and startups like Adaption Labs explore new methods of building intelligent systems, the industry might soon witness a transformation away from mere scaling, toward a more nuanced approach based on adaptability and continuous learning. This pivotal moment could reshape not only the technology but also the very relationships that users have with AI systems moving forward.

Thanks for reading. Please let us know your thoughts and ideas in the comment section down below.

Source link
#Coheres #exAI #research #lead #betting #scaling #race

About The Author

Leave a Reply

Your email address will not be published. Required fields are marked *

We use cookies to personalize content and ads and to primarily analyze our geo traffic sources. We also may share information about your use of our site with our social media, advertising, and analytics partners to improve your user experience. We respect your privacy and will never abuse your information. [ Privacy Policy ] View more
Cookies settings
Accept
Decline
Privacy & Cookie Policy
Privacy & Cookies policy
Cookie name Active

The content on this page governs our Privacy Policy. It describes how your personal information is collected, used, and shared when you visit or make a purchase from learnaiwithkesse.com (the "Site").

Kesseswebsites and Advertising owns Learn AI With Kesse and the website learnaiwithkesse.wiki. For the purpose of this Terms and Agreements [ we, us, I, our ] represents the owner of Learning AI With Kesse which is Kesseswebsites and Advertising. [ You, your, student and buyer ] represents you as the user and visitor of this site. Terms of Conditions, Terms of Service, Terms and Agreement and Terms of use shall be considered the same here. This website or site refers to https://learnaiwithkesse.com. You agree that the content of this Terms and Agreement may include Privacy Policy and Refund Policy. Products refer to physical or digital products. This includes eBooks, PDFs, and text or video courses. If there is anything on this page you do not understand you agree to reach out to us via email [ emmanuel@learnaiwithkesse.com ] for explanation before using any part of this site.

1. Personal Information We Collect

When you visit this Site, we automatically collect certain information about your device, including information about your web browser, IP address, time zone, and some of the cookies that are installed on your device. The primary purpose of this activity is to provide you a better user experience the next time you visit our again and also the data collection is for analytics study. Additionally, as you browse the Site, we collect information about the individual web pages or products that you view, what websites or search terms referred you to the Site, and information about how you interact with the Site. We refer to this automatically-collected information as "Device Information."

We collect Device Information using the following technologies:

"Cookies" are data files that are placed on your device or computer and often include an anonymous unique identifier. For more information about cookies, and how to disable cookies, visit http://www.allaboutcookies.org. To comply with European Union's GDPR (General Data Protection Regulation), we do display a disclaimer a consent text at the bottom of this website. This disclaimer alerts you the visitor or user of this website about why we use cookies, and we also give you the option to accept or decline. If you accept for us to use cookies on your site, the agreement between you and us will expire after 180 has passed.

"Log files" track actions occurring on the Site, and collect data including your IP address, browser type, Internet service provider, referring/exit pages, and date/time stamps.

"Web beacons," "tags," and "pixels" are electronic files used to record information about how you browse the Site.

Additionally, when you make a purchase or attempt to make a purchase through the Site, we collect certain information from you, including your name, billing address, shipping address, payment information (including credit card numbers), email address, and phone number. We refer to this information as "Order Information."

When we talk about "Personal Information" in this Privacy Policy, we are talking both about Device Information and Order Information.

Payment Information

Please note that we use 3rd party payment processing companies like https://stripe.com and https://paypal.com to process your payment information. PayPal and Stripe protects your data according to their terms and agreement and may store your data to help make your subsequent transactions on this website easier. We never and [ DO NOT ] store your card information or payment login information on our website or server. By making payment on our site, you agree to abide by the Terms and Agreement of the 3rd Party payment processing companies we use. You can visit their websites to read their Terms of Use and learn more about them.

2. How Do We Use Your Personal Information?

We use the Order Information that we collect generally to fulfill any orders placed through the Site (including processing your payment information, arranging for shipping, and providing you with invoices and/or order confirmations). Additionally, we use this [a] Order Information to:

[b] Communicate with you;

[c] Screen our orders for potential risk or fraud; and

When in line with the preferences you have shared with us, provide you with information or advertising relating to our products or services. We use the Device Information that we collect to help us screen for potential risk and fraud (in particular, your IP address), and more generally to improve and optimize our Site (for example, by generating analytics about how our customers browse and interact with the Site, and to assess the success of our marketing and advertising campaigns).

3. Sharing Your Personal Information

We share your Personal Information with third parties to help us use your Personal Information, as described above. For example, we use System.io to power our online store--you can read more about how Systeme.io uses your Personal Information here: https://systeme.io/privacy-policy/ . We may also use Google Analytics to help us understand how our customers use the Site--you can read more about how Google uses your Personal Information here: https://www.google.com/intl/en/policies/privacy/. You can also opt-out of Google Analytics here: https://tools.google.com/dlpage/gaoptout.

Finally, we may also share your Personal Information to comply with applicable laws and regulations, to respond to a subpoena, search warrant or other lawful request for information we receive, or to otherwise protect our rights.

4. Behavioral Advertising

As described above, we use your Personal Information to provide you with targeted advertisements or marketing communications we believe may be of interest to you. For more information about how targeted advertising works, you can visit the Network Advertising Initiative’s (“NAI”) educational page at http://www.networkadvertising.org/understanding-online-advertising/how-does-it-work.

You can opt-out of targeted advertising by:

COMMON LINKS INCLUDE:

FACEBOOK - https://www.facebook.com/settings/?tab=ads

GOOGLE - https://www.google.com/settings/ads/anonymous

BING - https://advertise.bingads.microsoft.com/en-us/resources/policies/personalized-ads]

Additionally, you can opt-out of some of these services by visiting the Digital Advertising Alliance’s opt-out portal at: http://optout.aboutads.info/.

5. Data Retention

Besides your card payment and payment login information, when you place an order through the Site, we will maintain your Order Information for our records unless and until you ask us to delete this information. Example of such information include your first name, last name, email and phone number.

6. Changes

We may update this privacy policy from time to time in order to reflect, for example, changes to our practices or for other operational, legal or regulatory reasons.

7. Contact Us

For more information about our privacy practices, if you have questions, or if you would like to make a complaint, please contact us by e-mail at emmanuel@learnaiwithkesse.com or by mail using the details provided below:

8. Your acceptance of these terms

By using this Site, you signify your acceptance of this policy. If you do not agree to this policy, please do not use our Site. Your continued use of the Site following the posting of changes to this policy will be deemed your acceptance of those changes.

Last Update | 18th August 2024

Save settings
Cookies settings