LLMs fail without context

Context is key to human language

Feb 22, 2025

yellow road signage at daytime — Shouldn’t AI be able to navigate? First, it should establish where it is and what else is there - i.e. context. Photo by Robert Ruggiero on Unsplash

When we are children, we quickly master directions. Directions are flexible in most languages — and important. That flexibility allows for descriptions of spatial relationships in a myriad of ways - near, left, above, North of, and so on.

To read more on spatial languages, a great resource for such cognitive semantics was written by Leonard Talmy 25 years ago in his book, Toward a Cognitive Semantics. These relationships seem to have fixed meanings once learned, but rely closely on the context to remove ambiguity, if possible and recognize it otherwise.

C.S. Peirce had a great model more than 100 years ago

What’s interesting about navigation is how it relates to the C.S. Peirce signs. In his model, signs can be icons (things that resemble other things, like how maps resemble the world in spatial terms), indices (connection to their object) and symbols (connection to objects by convention).

For navigation, they are icons whose semantics relate objects in context with other objects in context or that need to be. This is least probable to work with today’s transformer architecture because each transformer token is mapped to a word vector - a little like catering to symbols, a little for indices to relate proforms, but not icons.

Let’s test Microsoft Copilot

Saying, “I’m next to my friend” suggests they are either to the left or right of you. If you’re facing North, they are either East or West of you. There are many permutations, and ‘next to’ is simpler than many.

Let’s test one of those candidates for AGI or artificial general intelligence that should be at human-level emulation now according to many tech CEOs.

Testing simple relations with Microsoft Copilot. The LLM system doesn’t seem to recognize the other possibility - a curse of statistical systems that focus on the most probable choice (or with temperature settings, lower probabilities).

Notice that copilot fails to detect the other possible direction. It selects one but doesn’t recognize the other valid possibility. Humans find this grating like running fingernails over a chalkboard.

Let’s give it another chance with a follow up.

Giving Microsoft Copilot a second chance to answer about directions. Again, it doesn’t understand the question, and therefore make an error in its response. It looks like it fails to understand the sentence fragment I supplied.

Human children easily deal with sentence fragments. In the case above, my fragment ‘what if she is on my west side’ should modify the original sentence: “if i'm facing north and my friend is next to me facing south, what direction is she from me” as:

if i'm facing north and my friend is on my west side facing south, what direction is she from me

The correct answer: “From me, she is on my West side.” I’m betting you got it.

Also how could it interpret this as my friend ‘facing directly towards me!’

Up next

In my upcoming video post, I’ll look at the biggest technical limitations of LLMs that will stop them ever becoming AGI without fundamental redesigns that deal with meaning, knowledge and the lack of scale ever possible in the foreseeable future. The human-like use of context, tracking current conversational elements while retaining reference to long-term memory elements is crucial for simple conversational tasks.

Discussion about this post

Ready for more?