Thanks @Rajiv this is fantastic

Rajiv Shah

Feb 9, 2023

My concern with the current generation of models is whether they have been overfit on these benchmark datasets. Effectively learning to respond to the patterns in the benchmark, but not generalising.

Whenever I craft novel reasoning tasks for ChatGPT it fails miserably. I should try them on these other models.

Sign up to discover human stories that deepen your understanding of the world.

Free

Distraction-free reading. No ads.

Organize your knowledge with lists and highlights.

Tell your story. Find your audience.

Membership

Read member-only stories

Support writers you read most

Earn money for your writing

Listen to audio narrations

Read offline with the Medium app

Written by John Hawkins

184 Followers

257 Following

Chief Scientist at Gum Gum (Formerly PlaygroundXYZ.com) | Computer Scientist | Open-Source Developer | Author of Getting-Data-Science-Done.com

Responses (1)

Write a response

What are your thoughts?

Also publish to my profile

Help
Status
About
Careers
Press
Blog
Privacy
Terms
Text to speech
Teams