blog of Guganeshan.T

I found a simple task where Generative AI fails!

Post author By Guganeshan.T
Post date May 31, 2024
No Comments on I found a simple task where Generative AI fails!

I made ChatGPT-4o go into a loop while trying to come up with an answer to my question yesterday. Gemini Advanced failed too. It was surprising to me, to think that these very capable Generative AI applications can fail in a task that is relatively simple to us humans.

But today, it looks like ChatGPT learned from its mistakes 👏

If you also want to try this, here are the details 👇

Background (Baby names! 👶)

A friend wanted help with finding a good baby name. My search started simply in Google, as usual. Then, I came across an article about numerology and how each character in the alphabet is assigned a specific number, allowing us to calculate a total for a baby name that is compatible (the details are irrelevant for this blog post).

These were the numbers for characters:

Values for each character of the alphabet in Chaldean numerology

So, for example, if the name is “John”, it would be:

J = 1, O = 7, H = 5, N = 5.

So, 1 + 7 + 5 + 5 = 18

My idea 💡

I was curious to use AI, to come up with baby names whose total (according to the above chart) is a specific number, just to see how easy it is. After all, these Generative AI applications can code fully functional games before you can even finish your coffee.

If a human was given this task, we are going to:

Collect a few names we like
Calculate each name to see if they match the required total
If matches, add it to a ‘selected names’ list
If not, we would try to modify the name a bit to confirm to the total
If that fails, we will move to other names
Repeat until we have the required number of names

It’s as simple as that.

First step ✅

As a first step, I provided both ChatGPT and Gemini the characters & values in the chart above and explained the situation.

Then, to make sure they got it, I gave a name and asked for the total. Here’s the Gemini screenshot:

As expected, that was easy-peasy. ChatGPT responded similarly too.

Step Two 👎

Then I pulled the UNO Reverse card on them and asked to come up with names to satisfy a particular total (Asked for 5 names from ChatGPT and 1 name from Gemini). They both started lying!

Here’s the Gemini screenshot:

When asked, they both responded with a “sorry”:

I tried to make it “really” easy, expecting the word “Dog”, but it guessed “Kitty” (I love both, so that’s forgivable) but how can you forgive that addition?:

They kept saying sorry…

The scary ChatGPT loop ➿

Since I asked for only 1 name from Gemini, it just failed, said sorry and stopped – no hard feelings.

With ChatGPT, since I asked for 5 names, it was listing each name along with its characters and values. So I asked it to stop doing that and only list just the name and its total. Also, I said “strongly” that it should respond only when it has names with the correct totals, because it was getting annoying.

I guess I hurt ChatGPT’s feelings! It started going crazy 😕

ChatGPT started to list a name, its characters, values and the total – then it would realize that the total is not what I wanted, say sorry and then try again… and again… and again…

Here’s how the loop went:

When it was the 14th attempt or something, I got scared and stopped the generation!

Then, I tried one more time to clearly state what I want. The result? More lies:

I gave up and went to bed happily. Because, they are not going to take our developer jobs this way 😜

The comeback 👏

I wanted to recreate the situation in a new chat today, to capture a video of ChatGPT failing and going into a loop, for this blog post. But to my surprise, it looks like it has learned from its mistakes. This was the response today:

It says “out of this list” and “from the list” in this answer (I have not mentioned anything about a list). I guess it selected names into a list and did the calculation in-memory before responding today (like how we would do), as opposed to responding immediately before calculating the total properly yesterday.

Gemini still doesn’t care.

Have you experienced anything interesting like this with your Generative AI tools?

If you have any interesting experience with AI like this (or even a different result for the above experiment), feel free to share in the comments below – I’d love to hear about it.

Cheers 🍻

Tags AI, ChatGPT, ChatGPT-4o, Gemini, GeminiAdvanced, GenerativeAI

Setting up Git and TortoiseGit with Bitbucket, step by step (2024)

Post author By Guganeshan.T
Post date September 29, 2013
108 Comments on Setting up Git and TortoiseGit with Bitbucket, step by step (2024)

[Edit: I finally updated the post with the latest changes. Thank you for everyone who kept returning to this post (and left comments too) from 2012!]

I wanted a set of instructions for my colleagues and friends when they want to setup Git and TortoiseGit on Windows (connected to Bitbucket). There are some steps that I have to Google myself too every time, because I don’t install Git & TortoiseGit everyday.

Okay, so this post is going to be a bare-bones, step by step guide for me or anyone (No backgrounds, theories and other noise)

Basics:

Git – is a free and open source distributed version control system.
TortoiseGit – is a Git client (GUI interface).
Bitbucket – is a web-based project hosting service (supports Git and Mercurial)

Assumptions:

You already have a Bitbucket account. If not, you can create one here.
I’m installing this on Windows 11 – 64bit. I assume you can adapt to minor differences in your flavor of Windows.

Tags Bitbucket, Git, Git Bash, SSH, TortoiseGit, Tutorial

Programming

StackOverflow logo using C# and GDI+

Post author By Guganeshan.T
Post date January 30, 2012
3 Comments on StackOverflow logo using C# and GDI+

Discovered the “Code Golf” in StackExchange from the people I follow in twitter (for the #codechallenge hashtag).

The question to generate the StackOverflow logo in code was particularly interesting to me and naturally I wanted to try generating the logo using C# (GDI+ code). Because nobody had a C# solution listed there, I also put the code there as an answer (Look for the answer titled “C#/GDI+”)

I initially recreated the logo in GDI+ without a for loop (every rectangle had a hardcoded value for its locations and size). Using that approach, its possible to recreate the logo almost identical to the original. But I was not happy with doing it that way.

Tags C#, GDI+, Programming, Source Code, StackExchange, StackOverflow

Nature Personal Science and Technology

Wish you all a Happy New Year – 2012 Doomsday style!

Post author By Guganeshan.T
Post date January 1, 2012
No Comments on Wish you all a Happy New Year – 2012 Doomsday style!

The year 2012 is here. The year that some groups claim will be the last year for the planet and its life as we know it. There were many such occasions in the past too (for example, the year 2000). But, here we are… still living and breathing.

Tags 2012, Adobe After Effects, Cosmophobia, New Year, Wish

Photography

Couple of days with a Canon EOS 500D

Post author By Guganeshan.T
Post date December 4, 2011
6 Comments on Couple of days with a Canon EOS 500D

Despite the craze over great photos and photography in general, I managed to live with only a 3.2 megapixel mobile phone camera (I still do).

A DSLR camera has always been in my “wish list”, though nobody cared gifting me one Winking smile – just kidding.

I was helping a friend of a friend with a .NET software, that required photos to be directly transferred to a PC as soon as they are captured. As far as I know, digital cameras go into a “USB mode” that makes them mass storage devices, just like ordinary USB thumb drives.

Tags Camera, Canon EOS 500D, DSLR, Photography

Personal Story

The little spike of motivation from Reddit

Post author By Guganeshan.T
Post date August 31, 2011
No Comments on The little spike of motivation from Reddit

Anybody who visits my blog can scan through the publish dates of the posts and know that I am not a frequent poster.

According to SEO experts, updating blog content regularly will effect the web traffic positively. So for a “rare” poster like me, seeing “Views per day: 10” in the WordPress analytics everyday is not new, and I am not serious about web traffic volume as I am (still) only a hobbyist blogger.

Personal Story

My IE6 dream

IE6 Dream Got startled and woke up to the wild vibration and tone of my mobile phone’s alarm at 5am in the morning, stumbling in the darkness and quickly stopping the alarm, I notice 3 new text messages. Relaxing again, resting my head on the pillow, I assume they would be just some facebook alerts.

The good news comes!

Then I muster up some energy and sit up. Deciding to go about my usual daily duties, I open the 1st message to read, which is a CNN news alert: “Google hinted today that they are making moves to get rid of Internet Explorer 6 for good, CNN”! What? wow, I thought. And I open the 2nd message…

Tags Dream, Fiction, IE6, Story

Gmail with Push e-mail in a Sony Ericsson K800i – using IMAP4

Post author By Guganeshan.T
Post date February 26, 2010
32 Comments on Gmail with Push e-mail in a Sony Ericsson K800i – using IMAP4

Ok, so you don’t have to own a BlackBerry to enjoy “Push” email: Email that is actively transferred (pushed) to the device as it arrives. It doesn’t even have to be a Smartphone.

Tags Email, Gmail, IMAP, Mobile, Sony Ericsson

Tree leaves becoming “pinhole cameras” during the Annular Solar Eclipse.

Post author By Guganeshan.T
Post date January 15, 2010
7 Comments on Tree leaves becoming “pinhole cameras” during the Annular Solar Eclipse.

annular solar eclipse The longest, annular (like a ring) Solar Eclipse of the millennium happened today (15th January 2010).

When the annular eclipse was at its peak in the region from where I was watching it (Nugegoda, Sri Lanka), I saw some peculiar shadows on the ground. It occurred to me only after several seconds that they are caused by the Solar Eclipse!

Tags Astronomy, Interesting, Miscellaneous, Nature

Personal

Merry Christmas!

It’s that time of the year again… when the whole world (or at least the countries that use the standard Gregorian Calendar) commemorates the birth of Jesus Christ.