OpenAI, the company behind ChatGPT, has recently taken to social media to address concerns about the "lazy" performance of GPT-4, receiving feedback on platforms like social media and Google Reviews.
This move comes after growing user feedback, including a one-star review on the company’s Google Reviews.
OpenAI Provides Insight Into Training Chat Models, Performance Evaluations, and A/B Testing
OpenAI, through its @ChatGPTapp Twitter account, explained the complexities involved in training chat models. The organization highlighted that the process is not a “clean industrial process” and that variations in training runs can lead to noticeable differences in the AI’s personality, creative style, and political bias.
Thorough AI model testing includes offline evaluation metrics and online A/B tests. The final decision to release a new model is based on a data-driven approach to improve the "real" user experience.
OpenAI’s Google Review Score Affected By GPT-4 Performance Issues
This explanation comes after weeks of user feedback about GPT-4 becoming worse on social media networks like X.
"Idk if anyone else has noticed this, but GPT-4 Turbo performance is significantly worse than GPT-4 standard. I know it’s in preview right now but it’s significantly worse." — Max Weinbach (@MaxWinebach), November 8, 2023
"There has been discussion if GPT-4 has become ‘lazy’ recently. My anecdotal testing suggests it may be true. I repeated a sequence of old analyses I did with Code Interpreter. GPT-4 still knows what to do, but keeps telling me to do the work. One step is now many & some are odd." — Ethan Mollick (@emollick), November 28, 2023
Complaints also appeared in OpenAI’s community forums. One user even left a one-star rating for OpenAI via Google Reviews. Other complaints regarded accounts, billing, and the artificial nature of AI.
A recent user on Product Hunt also gave OpenAI a rating seemingly related to GPT-4 worsening.
"OpenAI is now only 3.8 stars on Google Maps and a dismal 1 star on Yelp! GPT-4’s degradation has really hurt their rating. Hope the business survives." — Nate Chan (@nathanwchan), December 9, 2023
Feedback About ChatGPT 3.5 Performance
GPT-4 isn’t the only issue reviewers complain about. On Yelp, OpenAI has a one-star rating related to ChatGPT 3.5 performance.
The most liked review aligns with recent rumors about a volatile workplace, alleging that OpenAI is a “Cutthroat environment. Not friendly. Toxic workers.”
The top reviews on Glassdoor suggest that employee frustration and product development issues stem from the company’s shift in focus towards profits.
Positive Aspects Highlighted By Google SGE
In addition to occasional complaints, Google reviewers acknowledged the revolutionary impact of OpenAI’s technology on various fields. The most positive reviews about the company appear in Google SGE (Search Generative Experience).
Conclusion
OpenAI’s recent insights into training chat models and its response to public feedback about GPT-4 performance illustrate the dynamic and evolving nature of AI technology and its impact on users. This is particularly important for those just invited to join ChatGPT Plus after being waitlisted and those developing GPTs for the upcoming GPT Store launch.
As AI advances, professionals must remain agile, informed, and responsive to technological developments and the public’s reception of these advancements.
Featured image: Tada Images/Shutterstock