web analytics

Learn AI With Kesse | Best Place For AI News

We make artificial intelligence easy and fun to read. Get Updated AI News.

Microsoft Unveils KOSMOS: AI Achieving 80% of Human Performance Levels

Microsoft Just Dropped KOSMOS: AI With 80% Human-Level Performance

Major Breakthroughs in AI Research: A New Era of Autonomous Intelligence

In a whirlwind of advancements, major tech giants have unveiled groundbreaking AI developments that promise to reshape the landscape of research and data science. Microsoft has introduced an AI scientist capable of autonomous research, Google has rolled out an AI data scientist, and China’s Moonshot AI has released an open-source reasoning model. Let’s dive into these innovations and explore what they mean for the future of artificial intelligence.

The Rise of Cosmos: Microsoft’s AI Scientist

Cosmos, developed by Microsoft, stands out as the first genuine AI scientist that can conduct scientific research from start to finish without human intervention. When tasked with a specific scientific goal—be it analyzing brain scans, genetics data, or complex material science challenges—Cosmos dedicates 12 uninterrupted hours to its work.

During this time, Cosmos processes over 1,500 research papers, generates approximately 40,000 lines of Python code, runs analyses, tests various hypotheses, and ultimately produces a comprehensive research report complete with citations and executable code. Its early trials yielded significant discoveries across multiple fields, including biology, neuroscience, and clean energy materials.

One notable finding revealed how cooling protects the brain: as temperature drops, brain cells shift into energy-saving modes, opting to recycle existing molecules instead of producing new ones. Additionally, Cosmos identified that excessive humidity can compromise the production of perovskite solar cells—an essential factor later corroborated by human researchers.

How Cosmos Works

What truly differentiates Cosmos is its architecture, which comprises hundreds of smaller AI agents, each responsible for distinct aspects of the research process. Some agents read and summarize papers, others focus on data analysis, while others write code. They operate within a shared internal structure known as a “world model,” helping track progress, what has worked well, and what needs further investigation.

Independent reviews indicated an astounding 80% accuracy rate for Cosmos’s scientific statements. In one 12-hour session, Cosmos generated work equivalent to six months of human research, producing reports similar to early-stage academic papers complete with statistical analyses and graphs.

However, while Cosmos excels in many areas, it still requires human oversight to define research goals and validate results. The system struggles with messy or unlabeled datasets and cannot process raw images or large files over 5 GB. Its limitations stem primarily from its ability to discern which ideas have significant scientific merit rather than merely statistical validity.

Microsoft’s Vision for Humanist Super Intelligence

As Cosmos continues to forge new paths in scientific research, Microsoft is also contemplating the broader implications of AI technology. Mustafa Suleyman, a key figure at Microsoft, has introduced the concept of “humanist super intelligence.” This approach focuses on creating artificial intelligence not to surpass humans but to serve them.

Suleyman envisions a bounded, controllable AI system that embodies human values and remains subordinate to humanity. This intention represents a deliberate departure from the race for artificial general intelligence (AGI). Microsoft seeks to cultivate AI systems that are not autonomous in the unrestricted sense but rather act as companions that enhance human learning and productivity.

This vision emphasizes that, at Microsoft, human welfare takes precedence over AI capabilities. Their approach strives for an AI that is contextual, manageable, and aimed at assisting in areas such as healthcare and scientific discovery.

Moonshot AI: A New Player in Open Source Reasoning

China’s Moonshot AI has championed open-source reasoning with its latest model, K2 thinking, which strives to match or surpass existing reasoning models from OpenAI and Anthropic. What sets K2 thinking apart is its unique capability to reason across hundreds of sequential steps rather than merely generate text.

K2 thinking achieved a remarkable 40.9% on a benchmark exam for expert-level questions, doubling the human average on continuous research tasks. More impressively, it can execute up to 300 sequential tool calls independently, enabling it to perform multi-step reasoning through complex tasks.

One demonstration featured K2 thinking solving a PhD-level mathematics problem in hyperbolic geometry, going through multiple layers of reasoning and tool calls to arrive at a correct conclusion. This type of complex, long-horizon thinking is key for moving toward more advanced AI capabilities.

Moonshot AI’s commitment to open source may offer competitive advantages as U.S. labs continue to guard their reasoning models closely. They are also exploring innovative methods to enhance reasoning time and capacity, demonstrating forward-thinking strategies that keep pace with advancements in AI.

Google’s DSTAR: The AI Data Scientist

While Cosmos and K2 thinking tackle research and reasoning, Google has unveiled DSTAR, an AI tool tailored specifically for data science. Unlike traditional AI tools that operate best with well-structured SQL databases, DSTAR is designed to work seamlessly with disorganized data types, including CSVs, JSON logs, and more.

DSTAR can respond to queries phrased in plain English, such as identifying the highest-performing products based on sales data. It autonomously navigates where data resides, crafts Python code to synthesize information, tests results, corrects errors, and delivers answers—eliminating the need for human data analysts.

The system operates through a collaborative network of specialized agents: one that scans and summarizes each file, another that plans the steps needed, and one that writes the necessary code. DSTAR maintains a self-correcting and debugging loop, efficiently adapting to chaotic data landscapes.

Upgrading the capabilities of Google’s Gemini 2.5 Pro, DSTAR achieves impressive scores on several benchmarks. For instance, it recorded a significant leap in performance for complex data analysis tasks, reflecting its robustness in a world where perfect data is often a rarity.

The Future of Autonomous Intelligence

These recent advancements illustrate a transformative moment in artificial intelligence, where AI systems are not merely tools for human analysts but increasingly are becoming essential components of the research and analytical process. As Cosmos, DSTAR, and K2 thinking demonstrate, we are entering an era where AI conducts serious research and analysis, challenging our understanding of intelligence, autonomy, and purpose.

The debate over how to best harness these technologies while prioritizing human values continues to evolve. As these systems become more integrated into our daily lives, ensuring they serve humanity’s best interests will be paramount.

What are your thoughts on these cutting-edge AI developments? Let us know in the comments below!



#Microsoft #Dropped #KOSMOS #HumanLevel #Performance
Thanks for reaching. Please let us know your thoughts and ideas in the comment section.

Source link

About The Author

13 thoughts on “Microsoft Unveils KOSMOS: AI Achieving 80% of Human Performance Levels

Leave a Reply

Your email address will not be published. Required fields are marked *

We use cookies to personalize content and ads and to primarily analyze our geo traffic sources. We also may share information about your use of our site with our social media, advertising, and analytics partners to improve your user experience. We respect your privacy and will never abuse your information. [ Privacy Policy ] View more
Cookies settings
Accept
Decline
Privacy & Cookie Policy
Privacy & Cookies policy
Cookie name Active

The content on this page governs our Privacy Policy. It describes how your personal information is collected, used, and shared when you visit or make a purchase from learnaiwithkesse.com (the "Site").

Kesseswebsites and Advertising owns Learn AI With Kesse and the website learnaiwithkesse.wiki. For the purpose of this Terms and Agreements [ we, us, I, our ] represents the owner of Learning AI With Kesse which is Kesseswebsites and Advertising. [ You, your, student and buyer ] represents you as the user and visitor of this site. Terms of Conditions, Terms of Service, Terms and Agreement and Terms of use shall be considered the same here. This website or site refers to https://learnaiwithkesse.com. You agree that the content of this Terms and Agreement may include Privacy Policy and Refund Policy. Products refer to physical or digital products. This includes eBooks, PDFs, and text or video courses. If there is anything on this page you do not understand you agree to reach out to us via email [ emmanuel@learnaiwithkesse.com ] for explanation before using any part of this site.

1. Personal Information We Collect

When you visit this Site, we automatically collect certain information about your device, including information about your web browser, IP address, time zone, and some of the cookies that are installed on your device. The primary purpose of this activity is to provide you a better user experience the next time you visit our again and also the data collection is for analytics study. Additionally, as you browse the Site, we collect information about the individual web pages or products that you view, what websites or search terms referred you to the Site, and information about how you interact with the Site. We refer to this automatically-collected information as "Device Information."

We collect Device Information using the following technologies:

"Cookies" are data files that are placed on your device or computer and often include an anonymous unique identifier. For more information about cookies, and how to disable cookies, visit http://www.allaboutcookies.org. To comply with European Union's GDPR (General Data Protection Regulation), we do display a disclaimer a consent text at the bottom of this website. This disclaimer alerts you the visitor or user of this website about why we use cookies, and we also give you the option to accept or decline. If you accept for us to use cookies on your site, the agreement between you and us will expire after 180 has passed.

"Log files" track actions occurring on the Site, and collect data including your IP address, browser type, Internet service provider, referring/exit pages, and date/time stamps.

"Web beacons," "tags," and "pixels" are electronic files used to record information about how you browse the Site.

Additionally, when you make a purchase or attempt to make a purchase through the Site, we collect certain information from you, including your name, billing address, shipping address, payment information (including credit card numbers), email address, and phone number. We refer to this information as "Order Information."

When we talk about "Personal Information" in this Privacy Policy, we are talking both about Device Information and Order Information.

Payment Information

Please note that we use 3rd party payment processing companies like https://stripe.com and https://paypal.com to process your payment information. PayPal and Stripe protects your data according to their terms and agreement and may store your data to help make your subsequent transactions on this website easier. We never and [ DO NOT ] store your card information or payment login information on our website or server. By making payment on our site, you agree to abide by the Terms and Agreement of the 3rd Party payment processing companies we use. You can visit their websites to read their Terms of Use and learn more about them.

2. How Do We Use Your Personal Information?

We use the Order Information that we collect generally to fulfill any orders placed through the Site (including processing your payment information, arranging for shipping, and providing you with invoices and/or order confirmations). Additionally, we use this [a] Order Information to:

[b] Communicate with you;

[c] Screen our orders for potential risk or fraud; and

When in line with the preferences you have shared with us, provide you with information or advertising relating to our products or services. We use the Device Information that we collect to help us screen for potential risk and fraud (in particular, your IP address), and more generally to improve and optimize our Site (for example, by generating analytics about how our customers browse and interact with the Site, and to assess the success of our marketing and advertising campaigns).

3. Sharing Your Personal Information

We share your Personal Information with third parties to help us use your Personal Information, as described above. For example, we use System.io to power our online store--you can read more about how Systeme.io uses your Personal Information here: https://systeme.io/privacy-policy/ . We may also use Google Analytics to help us understand how our customers use the Site--you can read more about how Google uses your Personal Information here: https://www.google.com/intl/en/policies/privacy/. You can also opt-out of Google Analytics here: https://tools.google.com/dlpage/gaoptout.

Finally, we may also share your Personal Information to comply with applicable laws and regulations, to respond to a subpoena, search warrant or other lawful request for information we receive, or to otherwise protect our rights.

4. Behavioral Advertising

As described above, we use your Personal Information to provide you with targeted advertisements or marketing communications we believe may be of interest to you. For more information about how targeted advertising works, you can visit the Network Advertising Initiative’s (“NAI”) educational page at http://www.networkadvertising.org/understanding-online-advertising/how-does-it-work.

You can opt-out of targeted advertising by:

COMMON LINKS INCLUDE:

FACEBOOK - https://www.facebook.com/settings/?tab=ads

GOOGLE - https://www.google.com/settings/ads/anonymous

BING - https://advertise.bingads.microsoft.com/en-us/resources/policies/personalized-ads]

Additionally, you can opt-out of some of these services by visiting the Digital Advertising Alliance’s opt-out portal at: http://optout.aboutads.info/.

5. Data Retention

Besides your card payment and payment login information, when you place an order through the Site, we will maintain your Order Information for our records unless and until you ask us to delete this information. Example of such information include your first name, last name, email and phone number.

6. Changes

We may update this privacy policy from time to time in order to reflect, for example, changes to our practices or for other operational, legal or regulatory reasons.

7. Contact Us

For more information about our privacy practices, if you have questions, or if you would like to make a complaint, please contact us by e-mail at emmanuel@learnaiwithkesse.com or by mail using the details provided below:

8. Your acceptance of these terms

By using this Site, you signify your acceptance of this policy. If you do not agree to this policy, please do not use our Site. Your continued use of the Site following the posting of changes to this policy will be deemed your acceptance of those changes.

Last Update | 18th August 2024

Save settings
Cookies settings