The aggressive fringe of the o1 fashions is their capacity to excel at reasoning. That is as a result of coaching strategy that taught them to assume earlier than responding and, because of this, allowed them to attempt completely different methods earlier than touchdown on a solution.
Due to this fact, the o1 mannequin could be very sturdy in math, science, and coding. For instance, it scored 83% on the Worldwide Arithmetic Olympiad (IMO) qualifying examination. For comparability, GPT-4o accurately solved solely 13% of issues. In consequence, when fixing a tough or advanced STEM downside, this mannequin is your best choice.
Additionally: 10 features Apple Intelligence needs to actually compete with OpenAI and Google
Although this efficiency would possibly lead you to imagine it’s a higher different in all case situations, that’s not the case. As a result of o1 is an early mannequin, it lacks key ChatGPT options, resembling internet browsing and accepting media uploads, and because of this, for now, GPT-4o stays the higher choice for many requests.
ChatGPT Plus and Groups customers can toggle between the completely different fashions relying on the immediate, so they do not should decide to utilizing one or the opposite. To make it simpler to determine which to make use of, OpenAI is engaged on a characteristic that permits ChatGPT to decide on the very best mannequin mechanically primarily based on consumer prompts.