Categories
Artificial intelligence

I found a simple task where Generative AI fails!

I made ChatGPT-4o go into a loop while trying to come up with an answer to my question yesterday. Gemini Advanced failed too. It was surprising to me, to think that these very capable Generative AI applications can fail in a task that is relatively simple to us humans.

But today, it looks like ChatGPT learned from its mistakes ๐Ÿ‘

If you also want to try this, here are the details ๐Ÿ‘‡

Background (Baby names! ๐Ÿ‘ถ)

A friend wanted help with finding a good baby name. My search started simply in Google, as usual. Then, I came across an article about numerology and how each character in the alphabet is assigned a specific number, allowing us to calculate a total for a baby name that is compatible (the details are irrelevant for this blog post).

These were the numbers for characters:

Values for each character of the alphabet in Chaldean numerology

So, for example, if the name is “John”, it would be:

J = 1, O = 7, H = 5, N = 5.

So, 1 + 7 + 5 + 5 = 18

My idea ๐Ÿ’ก

I was curious to use AI, to come up with baby names whose total (according to the above chart) is a specific number, just to see how easy it is. After all, these Generative AI applications can code fully functional games before you can even finish your coffee.

If a human was given this task, we are going to:

  • Collect a few names we like
  • Calculate each name to see if they match the required total
  • If matches, add it to a ‘selected names’ list
  • If not, we would try to modify the name a bit to confirm to the total
  • If that fails, we will move to other names
  • Repeat until we have the required number of names

It’s as simple as that.

First step โœ…

As a first step, I provided both ChatGPT and Gemini the characters & values in the chart above and explained the situation.

Then, to make sure they got it, I gave a name and asked for the total. Here’s the Gemini screenshot:

Gemini Advanced AI chat

As expected, that was easy-peasy. ChatGPT responded similarly too.

Step Two ๐Ÿ‘Ž

Then I pulled the UNO Reverse card on them and asked to come up with names to satisfy a particular total (Asked for 5 names from ChatGPT and 1 name from Gemini). They both started lying!

Here’s the Gemini screenshot:

Generative AI chat

When asked, they both responded with a “sorry”:

Gemini Advanced AI chat

I tried to make it “really” easy, expecting the word “Dog”, but it guessed “Kitty” (I love both, so that’s forgivable) but how can you forgive that addition?:

Gemini Advanced AI chat

They kept saying sorry…

Gemini Advanced AI chat

The scary ChatGPT loop โžฟ

Since I asked for only 1 name from Gemini, it just failed, said sorry and stopped – no hard feelings.

With ChatGPT, since I asked for 5 names, it was listing each name along with its characters and values. So I asked it to stop doing that and only list just the name and its total. Also, I said “strongly” that it should respond only when it has names with the correct totals, because it was getting annoying.

I guess I hurt ChatGPT’s feelings! It started going crazy ๐Ÿ˜•

ChatGPT started to list a name, its characters, values and the total – then it would realize that the total is not what I wanted, say sorry and then try again… and again… and again…

Here’s how the loop went:

ChatGPT-4o AI chat
ChatGPT-4o AI chat

When it was the 14th attempt or something, I got scared and stopped the generation!

Then, I tried one more time to clearly state what I want. The result? More lies:

ChatGPT-4o AI chat

I gave up and went to bed happily. Because, they are not going to take our developer jobs this way ๐Ÿ˜œ

The comeback ๐Ÿ‘

I wanted to recreate the situation in a new chat today, to capture a video of ChatGPT failing and going into a loop, for this blog post. But to my surprise, it looks like it has learned from its mistakes. This was the response today:

ChatGPT-4o AI chat

It says “out of this list” and “from the list” in this answer (I have not mentioned anything about a list). I guess it selected names into a list and did the calculation in-memory before responding today (like how we would do), as opposed to responding immediately before calculating the total properly yesterday.

Gemini still doesn’t care.

Have you experienced anything interesting like this with your Generative AI tools?

If you have any interesting experience with AI like this (or even a different result for the above experiment), feel free to share in the comments below – I’d love to hear about it.

Cheers ๐Ÿป

Categories
Featured Articles Git

Setting up Git and TortoiseGit with Bitbucket, step by step (2024)

[Edit: I finally updated the post with the latest changes. Thank you for everyone who kept returning to this post (and left comments too) from 2012!]

I wanted a set of instructions for my colleagues and friends when they want to setup Git and TortoiseGit on Windows (connected to Bitbucket). There are some steps that I have to Google myself too every time, because I donโ€™t install Git & TortoiseGit everyday.

Okay, so this post is going to be a bare-bones, step by step guide for me or anyone (No backgrounds, theories and other noise)

Basics:

  • Git – is a free and open source distributed version control system.
  • TortoiseGit โ€“ is a Git client (GUI interface).
  • Bitbucket – is a web-based project hosting service (supports Git and Mercurial)

Assumptions:

  • You already have a Bitbucket account. If not, you can create one here.
  • Iโ€™m installing this on Windows 11 โ€“ 64bit. I assume you can adapt to minor differences in your flavor of Windows.
Categories
Programming

StackOverflow logo using C# and GDI+

StackOverflow logoDiscovered the โ€œCode Golfโ€ in StackExchange from the people I follow in twitter (for the #codechallenge hashtag).

The question to generate the StackOverflow logo in code was particularly interesting to me and naturally I wanted to try generating the logo using C# (GDI+ code). Because nobody had a C# solution listed there, I also put the code there as an answer (Look for the answer titled “C#/GDI+”)

I initially recreated the logo in GDI+ without a for loop (every rectangle had a hardcoded value for its locations and size). Using that approach, its possible to recreate the logo almost identical to the original. But I was not happy with doing it that way.

Categories
Nature Personal Science and Technology

Wish you all a Happy New Year – 2012 Doomsday style!

The year 2012 is here. The year that some groups claim will be the last year for the planet and its life as we know it. There were many such occasions in the past too (for example, the year 2000). But, here we are… still living and breathing.

Categories
Photography

Couple of days with a Canon EOS 500D

Canon EOS 500DDespite the craze over great photos and photography in general, I managed to live with only a 3.2 megapixel mobile phone camera (I still do).

A DSLR camera has always been in my โ€œwish listโ€, though nobody cared gifting me one Winking smile – just kidding.

I was helping a friend of a friend with a .NET software, that required photos to be directly transferred to a PC as soon as they are captured. As far as I know, digital cameras go into a โ€œUSB modeโ€ that makes them mass storage devices, just like ordinary USB thumb drives.

Categories
Personal Story

The little spike of motivation from Reddit

Anybody who visits my blog can scan through the publish dates of the posts and know that I am not a frequent poster.

According to SEO experts, updating blog content regularly will effect the web traffic positively. So for a โ€œrareโ€ poster like me, seeing โ€œViews per day: 10โ€ in the WordPress analytics everyday is not new, and I am not serious about web traffic volume as I am (still) only a hobbyist blogger.

Categories
Personal Story

My IE6 dream

IE6 DreamGot startled and woke up to the wild vibration and tone of my mobile phone’s alarm at 5am in the morning, stumbling in the darkness and quickly stopping the alarm, I notice 3 new text messages. Relaxing again, resting my head on the pillow, I assume they would be just some facebook alerts.

The good news comes!

Then I muster up some energy and sit up. Deciding to go about my usual daily duties, I open the 1st message to read, which is a CNN news alert: “Google hinted today that they are making moves to get rid of Internet Explorer 6 for good, CNN”! What? wow, I thought. And I open the 2nd message…

Categories
Featured Articles Tips Tools

Gmail with Push e-mail in a Sony Ericsson K800i – using IMAP4

Ok, so you don’t have to own a BlackBerry to enjoy “Push” email: Email that is actively transferred (pushed) to the device as it arrives. It doesn’t even have to be a Smartphone.

Categories
Featured Articles Nature

Tree leaves becoming “pinhole cameras” during the Annular Solar Eclipse.

annular solar eclipse The longest, annular (like a ring) Solar Eclipse of the millennium happened today (15th January 2010).

When the annular eclipse was at its peak in the region from where I was watching it (Nugegoda, Sri Lanka), I saw some peculiar shadows on the ground. It occurred to me only after several seconds that they are caused by the Solar Eclipse!

Categories
Personal

Merry Christmas!

It’s that time of the year again… when the whole world (or at least the countries that use the standard Gregorian Calendar) commemorates the birth ofย  Jesus Christ.