John Hawkins
Feb 9, 2023

--

Thanks @Rajiv, this is fantastic.

My concern with the current generation of models is whether they have been overfit on these benchmark datasets, effectively learning to respond to the patterns in the benchmark without generalising.

Whenever I craft novel reasoning tasks for ChatGPT, it fails miserably. I should try them on these other models.


Written by John Hawkins

Chief Scientist at Gum Gum (Formerly PlaygroundXYZ.com) | Computer Scientist | Open-Source Developer | Author of Getting-Data-Science-Done.com
