• Latest
  • Trending
OpenAI's Strategy to Empower Humans in AI Training

OpenAI’s Strategy to Empower Humans in AI Training

June 29, 2024
$TRUMP Crypto coin's meteoric rise: 300% gain in hours electrifies investors

How to learn various types of cryptocurrencies

October 4, 2025
MTN South Sudan boosts connectivity with Starlink

MTN South Sudan boosts connectivity with Starlink

October 4, 2025
How cryptocurrency price increases: 7 factors

How cryptocurrency price increases: 7 factors

October 4, 2025
I&M Bank Tanzania Unveils Mastercard World Elite Cards for Digital Payments

I&M Bank Tanzania Unveils Mastercard World Elite Cards for Digital Payments

October 4, 2025
Kenya moves closer to formal crypto law with passage of draft bill

Kenya moves closer to formal crypto law with passage of draft bill

October 3, 2025
GritinAI Connect 1.0: Benin City takes centre stage, shaping AI conversations

GritinAI Connect 1.0: Benin City takes centre stage, shaping AI conversations

October 3, 2025
Cellulant joins forces with Pesalink to simplify digital payments in Kenya

Cellulant joins forces with Pesalink to simplify digital payments in Kenya

October 3, 2025
Sterling Bank abolishes maintenance fees six months after scrapping transfer charges

Sterling Bank abolishes maintenance fees six months after scrapping transfer charges

October 3, 2025
Netflix loses $15 billion as Elon Musk intensifies action to boycott over transgender shows

Netflix loses $15 billion as Elon Musk intensifies action to boycott over transgender shows

October 3, 2025
Threads challenges X by introducing communities

Threads challenges X by introducing communities

October 3, 2025
Your one-stop tech hub! Get the latest updates on AI, cybersecurity, fintech, and emerging technologies.
  • Tech News
    • Africa Tech
    • Global Tech
    • Tech with Pelumy
    • Tech Careers
    • Tech TV
    • General News
    • How To
    • Reviews
  • Cryptocurrency
  • Fintech
  • Startups
  • Ai
No Result
View All Result
  • Tech News
    • Africa Tech
    • Global Tech
    • Tech with Pelumy
    • Tech Careers
    • Tech TV
    • General News
    • How To
    • Reviews
  • Cryptocurrency
  • Fintech
  • Startups
  • Ai
No Result
View All Result
Techpression
No Result
View All Result
Home Technology AI

OpenAI’s Strategy to Empower Humans in AI Training

Modupeoluwa Olalere by Modupeoluwa Olalere
June 29, 2024
151 2
0
OpenAI's Strategy to Empower Humans in AI Training
474
SHARES
Share on FacebookShare on TwitterWhatsAppTelegram

ChatGPT was successful because human instructors told the AI model that controlled the bot what was good and wrong. Additional AI may make AI aids more innovative and more reliable for human teachers, according to OpenAI.

With ChatGPT, OpenAI pioneered reinforcement learning with human feedback (RLHF). This method refines an AI model with human testers to make its output more coherent, less unpleasant, and more accurate. An algorithm controls the model based on trainer ratings. Chatbots are more trustworthy and valuable and behave better thanks to technology.

Read also: How Ziki Uses AI to Tailor Education for every Student

OpenAI researcher Nat McAleese said, “RLHF does work very well, but it has some key limitations.” Unreliable human feedback is one example. Furthermore, even expert humans may struggle to rate complicated outputs like software code. It can also optimise a model to generate convincing but inaccurate results.

RelatedPosts

Morocco and OpenAI explore Artificial Intelligence partnership to advance digital Morocco 2030

OpenAI challenges Google and Amazon with new AI shopping tool

OpenAI expands $500 Stargate AI initiative in partnership with Oracle and SoftBank

How does OpenAI’s new model assist human trainers in assessing code?

OpenAI refined its most powerful model, GPT-4, to help human trainers evaluate code. The company found that CriticGPT could find bugs humans overlooked and that human judges liked its code critiques 63% of the time. OpenAI will consider applying the method beyond code.

MacAleese says, “We’re starting work to integrate this technique into our RLHF chat stack. He admits that CriticGPT can hallucinate but believes it could improve OpenAI’s models and ChatGPT by minimising human training errors. The ability of humans to instruct an AI beyond their capacities may potentially help AI models get more brilliant, he says. McAleese says that people will require more excellent aid as models improve.

The new technique is now being developed to improve large language models and squeeze more abilities. It is also part of an effort to ensure that AI behaves in acceptable ways even as it becomes more capable.

Anthropic, a rival to OpenAI formed by ex-OpenAI workers, released Claude, its chatbot, last month with improved training and data. Anthropic and OpenAI have also announced new tools to analyse AI models to understand how they produce output to prevent deceit.

Read also: Sonia’s AI chatbot redefines therapy

OpenAI’s Breakthrough in AI Alignment

OpenAI may be able to train more innovative, trustworthy AI models that align with human values if they can employ the new method in more than just code. OpenAI is training the next large AI model. The brand wants to show it’s serious about model behaviour. This happened after a well-known AI risk group was disbanded. 

He managed the team with co-founder and former board member Ilya Sutskever, who briefly ousted CEO Sam Altman before helping him return. Some on that team have since complained that the corporation is taking too many risks to swiftly produce and sell sophisticated AI systems.

Dylan Hadfield-Menell, a professor at MIT who researches ways to align AI, says the idea of having AI models help train more powerful ones has been kicking around for a while. “This is a pretty natural development,” he says.

RLHF researchers considered related ideas several years ago, according to Hadfield-Menell. How broadly applicable and powerful it is is unknown. He explains, “It might lead to big jumps in individual capabilities and a stepping stone towards more effective feedback in the long run.

Tags: ChatGPTOpenAIRLHF
Modupeoluwa Olalere

Modupeoluwa Olalere

Modupe is a tech content writer with 3+ years of experience turning complex ideas into clear, engaging stories. She covers innovation, digital trends, and emerging technologies. When she’s not writing, she’s exploring new tools or tracking trends shaping Africa’s tech ecosystem.

Quick Links

  • Tech News
  • Cryptocurrency
  • Fintech
  • Startups
  • Business

Follow Us:

  • facebook
  • instagram
  • Twitter(X)
  • Linkedin
  • YouTube
  • About
  • Advertise
  • Privacy & Policy
  • Contact

© 2025 Techpression.com -Techpression Media Limited

Welcome Back!

Login to your account below

Forgotten Password?

Retrieve your password

Please enter your username or email address to reset your password.

Log In

We are using cookies to give you the best experience on our website.

You can find out more about which cookies we are using or switch them off in .

No Result
View All Result
  • Home
  • Tech News
    • Africa Tech
    • Global Tech
    • Tech with Pelumy
    • Tech Careers
    • Reviews
    • How To
    • General News
  • Cryptocurrency
  • Business
  • Fintech
  • Startups
  • Featured
  • Ai
  • Tech TV

© 2025 Techpression.com -Techpression Media Limited

ADVERTISEMENT
techpression.com
Powered by  GDPR Cookie Compliance
Privacy Overview

This website uses cookies so that we can provide you with the best user experience possible. Cookie information is stored in your browser and performs functions such as recognising you when you return to our website and helping our team to understand which sections of the website you find most interesting and useful.

Strictly Necessary Cookies

Strictly Necessary Cookie should be enabled at all times so that we can save your preferences for cookie settings.

3rd Party Cookies

This website uses Google Analytics to collect anonymous information such as the number of visitors to the site, and the most popular pages.

Keeping this cookie enabled helps us to improve our website.