6

preprints

Favorite:

16

The GPT Revolution: Benchmarking, Boundaries, and Breakthroughs

An set of preprints benchmarks GPT‑4 against GPT‑3.5 across professional exams, critiques GPT‑4’s logical reasoning limits via diverse puzzles, reviews ChatGPT’s capabilities and constraints, assesses GPT‑3’s feasibility as a public health collaborator, and introduces ROSGPT, a ROS2 package enabling natural language–driven human‑robot interaction.

Konstantine Arkoudas

Anis Koubaa,

Wadii Boulila,

Lahouari Ghouti,

Ayyub Alzahem,

Shahid Latif

Peer-reviewed Version

Jeremy Howard,

Austin Huang,

Zhiyuan Li,

Zeynep Tufekci,

Vladimir Zdimal,

Helene-Mari van der Westhuizen,

Arne von Delft,

Amy Price,

Lex Fridman,

Lei-Han Tang

+9 authors

of 1

Prerpints.org logo

Preprints.org is a free preprint server supported by MDPI in Basel, Switzerland.

Subscribe

© 2025 MDPI (Basel, Switzerland) unless otherwise stated