OpenAI’s Strategy to Empower Humans in AI Training

by Modupeoluwa Olalere
June 29, 2024

ChatGPT succeeded in part because human trainers told the AI model behind the bot which outputs were good and which were bad. OpenAI now says that adding more AI to assist those human trainers could make AI assistants smarter and more reliable.

In developing ChatGPT, OpenAI pioneered the use of reinforcement learning from human feedback (RLHF). The method uses input from human testers to fine-tune an AI model so that its output is judged more coherent, less objectionable, and more accurate. The trainers' ratings feed an algorithm that steers the model's behaviour. The technique has been key to making chatbots more reliable, more useful, and better behaved.
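The idea of ratings feeding an algorithm that steers a model can be illustrated with a toy sketch. It is not OpenAI's implementation: production RLHF trains a neural reward model and updates a large language model, whereas this example assumes only three fixed candidate responses and a handful of made-up trainer preferences. The function names (`fit_reward_scores`, `policy_step`) and all numbers are hypothetical.

```python
import math


def fit_reward_scores(num_responses, preferences, lr=0.1, epochs=500):
    """Fit one scalar reward per response from pairwise trainer preferences.

    preferences is a list of (winner_index, loser_index) pairs.
    Assumes a Bradley-Terry model: P(winner beats loser) = sigmoid(r_w - r_l).
    """
    rewards = [0.0] * num_responses
    for _ in range(epochs):
        for winner, loser in preferences:
            p_win = 1.0 / (1.0 + math.exp(-(rewards[winner] - rewards[loser])))
            step = lr * (1.0 - p_win)  # gradient of the log-likelihood
            rewards[winner] += step
            rewards[loser] -= step
    return rewards


def softmax(logits):
    """Turn logits into a probability distribution over responses."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def policy_step(logits, rewards, lr=0.5):
    """One gradient-ascent step on expected reward for a softmax policy."""
    probs = softmax(logits)
    baseline = sum(p * r for p, r in zip(probs, rewards))
    # d(expected reward)/d(logit_i) = p_i * (r_i - expected reward)
    return [l + lr * p * (r - baseline) for l, p, r in zip(logits, probs, rewards)]


# Hypothetical trainer judgements over three candidate responses (0, 1, 2).
# One trainer disagrees with the others about responses 0 and 1.
trainer_preferences = [(0, 1), (0, 1), (1, 0), (0, 2), (1, 2), (1, 2)]

rewards = fit_reward_scores(3, trainer_preferences)
logits = policy_step([0.0, 0.0, 0.0], rewards)  # start from a uniform policy
print("learned rewards:", [round(r, 2) for r in rewards])
print("updated policy: ", [round(p, 3) for p in softmax(logits)])
```

The sketch mirrors the two stages described above: trainer comparisons are first condensed into reward scores, and those scores then nudge the model towards the responses trainers preferred. The single dissenting judgement lowers the gap between responses 0 and 1 without reversing their order.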

OpenAI researcher Nat McAleese said, “RLHF does work very well, but it has some key limitations.” For one thing, human feedback can be inconsistent. For another, even skilled humans can struggle to rate complicated outputs such as software code. RLHF can also optimise a model to produce output that seems convincing rather than output that is actually accurate.

How does OpenAI’s new model assist human trainers in assessing code?

OpenAI fine-tuned its most powerful model, GPT-4, to help human trainers evaluate code. The company found that the resulting model, called CriticGPT, could catch bugs that humans missed, and that human judges preferred its code critiques 63% of the time. OpenAI says it will look at extending the approach to areas beyond code.

McAleese says, “We’re starting work to integrate this technique into our RLHF chat stack.” He admits that CriticGPT can hallucinate, but believes the technique could improve OpenAI’s models and ChatGPT by reducing errors in human training. It may also help AI models become smarter, he says, because it could allow humans to train an AI that exceeds their own abilities. As models improve, McAleese says, people will need more help.

The new technique is being developed to improve large language models and squeeze more abilities out of them. It is also part of an effort to ensure that AI behaves in acceptable ways even as it becomes more capable.

Anthropic, a rival to OpenAI founded by former OpenAI employees, released a more capable version of its chatbot, Claude, earlier this month, crediting improvements in training and data. Anthropic and OpenAI have also both announced new tools for inspecting AI models to understand how they arrive at their output, with the aim of preventing unwanted behaviour such as deception.

OpenAI’s Breakthrough in AI Alignment

The new method may help OpenAI train more powerful AI models whose output is more trustworthy and better aligned with human values, especially if the company can apply it to more than just code. OpenAI has said it is training its next major AI model, and the company wants to show that it is serious about how that model behaves. This follows the disbanding of a prominent team dedicated to assessing long-term AI risks.

The team was co-led by Ilya Sutskever, a co-founder and former board member who briefly ousted CEO Sam Altman before helping him return. Several members of that team have since complained that the company is taking too many risks as it rushes to develop and sell sophisticated AI systems.

Dylan Hadfield-Menell, a professor at MIT who researches ways to align AI, says the idea of having AI models help train more powerful ones has been kicking around for a while. “This is a pretty natural development,” he says.

According to Hadfield-Menell, the researchers who developed RLHF discussed related ideas several years ago. How broadly applicable and powerful the approach is remains to be seen. He explains, “It might lead to big jumps in individual capabilities, and it might be a stepping stone towards more effective feedback in the long run.”
