{"id":127,"date":"2025-08-22T09:41:00","date_gmt":"2025-08-22T09:41:00","guid":{"rendered":"https:\/\/www.darkbluemonkey.com\/?p=127"},"modified":"2026-04-23T09:01:18","modified_gmt":"2026-04-23T09:01:18","slug":"around-around-we-go","status":"publish","type":"post","link":"https:\/\/www.darkbluemonkey.com\/?p=127","title":{"rendered":"Around around we go&#8230;"},"content":{"rendered":"\n<p>For my job, I&#8217;ve been asked to look at all the different AI vendors. I&#8217;ve grabbed myself logins to most of the &#8216;big ones&#8217;, and a few of the newer upstarts. I&#8217;ve been running queries through them to see how they do.  I&#8217;m positive that the &#8216;plan&#8217; is to incorporate more AI into our daily workstreams&#8230;<\/p>\n\n\n\n<p>Well, all I can say is &#8220;meh&#8221;. The sheer levels of hallucination are crazy to me. They all just seem to make stuff up as they go. Copilot is the absolute worst. Perhaps it&#8217;s the version we&#8217;re using at work, but my god does it hallucinate. GPT comes second.   The very idea that a business could run effectively with tools that are so prone to mistakes is hilarious.   I know they&#8217;ll get better over time, but anyone trying to be an &#8216;early adopter&#8217; needs their head seeing to in my honest opinion.<\/p>\n\n\n\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"1024\" height=\"396\" src=\"https:\/\/www.darkbluemonkey.com\/wp-content\/uploads\/2026\/04\/metacopying-1024x396.png\" alt=\"\" class=\"wp-image-128\" srcset=\"https:\/\/www.darkbluemonkey.com\/wp-content\/uploads\/2026\/04\/metacopying-1024x396.png 1024w, https:\/\/www.darkbluemonkey.com\/wp-content\/uploads\/2026\/04\/metacopying-300x116.png 300w, https:\/\/www.darkbluemonkey.com\/wp-content\/uploads\/2026\/04\/metacopying-768x297.png 768w, https:\/\/www.darkbluemonkey.com\/wp-content\/uploads\/2026\/04\/metacopying.png 1311w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\" \/><figcaption class=\"wp-element-caption\">At the very bottom of the chain, creatives are generating the actual content.  Meanwhile the plagiarism machines are just copying off each other ad infinitum&#8230;<\/figcaption><\/figure>\n\n\n\n<p>For starters, it feels like they&#8217;re all the bloody same at the moment. I&#8217;m hoping that they start to diverge into different directions. Copilot seems slightly better at writing &#8216;business bullshit&#8217;. GPT seems better at understanding human-level stuff, while Claude seems slighly better at writing code. Gemini seems pretty good at most things, but seems to understand the world better (positional relationships between objects etc). I can see them going in those directions.<\/p>\n\n\n\n<p>The US techbros are aiming for &#8220;AGI&#8221; Artificial General Intelligence&#8230;. i.e. a machine that&#8217;s generally just smart, and can be independent in its training and reasoning.   It feels like the LLM engines are a good effort to generate a reasoning system, but there&#8217;s still something missing&#8230; It just feels like, the larger they make the engine, the dumber it gets.  It&#8217;s kind of like when you mix all the different colours of the rainbow together, you end up with muddy brown.<\/p>\n\n\n\n<p>   &#8220;Agentic&#8221; (I still don&#8217;t like that word) solutions feel like they&#8217;ll be the best way out.  That keeps all the different colours separate, and prevents them running together into a muddy brown.  Let the agents be the best at one particular subject, and then chat together to be &#8216;smart&#8217; as a whole.   <\/p>\n\n\n\n<p>Until all the different LLM vendors really decide which direction they want to go, and plough their own furrow, I can&#8217;t really decide which to use, they&#8217;re all much of a muchness, so we&#8217;ll stick with GPT for our experimentation.<\/p>\n\n\n\n<p><\/p>\n\n\n\n<p><\/p>\n","protected":false},"excerpt":{"rendered":"<p>For my job, I&#8217;ve been asked to look at all the different AI vendors. I&#8217;ve grabbed myself logins to most of the &#8216;big ones&#8217;, and a few of the newer upstarts. I&#8217;ve been running queries through them to see how they do. I&#8217;m positive that the &#8216;plan&#8217; is to incorporate more AI into our daily [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":128,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"ngg_post_thumbnail":0,"footnotes":""},"categories":[9,5],"tags":[],"class_list":["post-127","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-random","category-tech"],"blocksy_meta":[],"_links":{"self":[{"href":"https:\/\/www.darkbluemonkey.com\/index.php?rest_route=\/wp\/v2\/posts\/127","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.darkbluemonkey.com\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.darkbluemonkey.com\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.darkbluemonkey.com\/index.php?rest_route=\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.darkbluemonkey.com\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=127"}],"version-history":[{"count":3,"href":"https:\/\/www.darkbluemonkey.com\/index.php?rest_route=\/wp\/v2\/posts\/127\/revisions"}],"predecessor-version":[{"id":139,"href":"https:\/\/www.darkbluemonkey.com\/index.php?rest_route=\/wp\/v2\/posts\/127\/revisions\/139"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.darkbluemonkey.com\/index.php?rest_route=\/wp\/v2\/media\/128"}],"wp:attachment":[{"href":"https:\/\/www.darkbluemonkey.com\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=127"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.darkbluemonkey.com\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=127"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.darkbluemonkey.com\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=127"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}