Where will LLM coding take us?

I have been writing small posts on how LLM-based coding is evolving and how it will impact the software industry. The posts are aging so well that by Feb 2026 the markets and the public are already convinced that LLM coding will make hand-crafted code a thing of the past, thanks to Anthropic's Claude releases. Hand-crafted code might even become an artistic pursuit, or just a teaching tool.


But where is it taking us?


The industry: Would benefit a lot, now that we can ship software faster than before. This also takes the pivot of success back to where it always belonged: having superior ideas and executing them well, all the way to production.


The code: Perfecting the creation of structural code is something LLM-based coding approaches have already demonstrated they can master. By now people have also demonstrated that even the intentional part of code can be crafted with LLM-based coding approaches.
That leaves us with only the bare-bones part of the software, which is the code for business rules. The hard part of the business that only we know. The best example of such code I could think of was how Fortran programs were written and used: just the exact business rules, sans the I/O, hardware or framework considerations.

(Sample Fortran code, generated via, well 😉, AI.)
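To make the idea concrete, here is a hypothetical stand-in (in Python, with made-up rule names like `loan_emi` and `is_eligible`) of what such pure business-rule code looks like: just the domain math and decisions, with no I/O, framework or hardware concerns.

```python
# Pure business rules: no I/O, no framework, no hardware considerations.
# The rule names and thresholds below are illustrative, not from any real system.

def loan_emi(principal: float, annual_rate_pct: float, months: int) -> float:
    """Equated monthly instalment for a reducing-balance loan."""
    r = annual_rate_pct / (12 * 100)      # monthly interest rate
    factor = (1 + r) ** months
    return principal * r * factor / (factor - 1)

def is_eligible(monthly_income: float, emi: float, existing_emi: float = 0.0) -> bool:
    """Rule: total EMIs must not exceed 40% of monthly income."""
    return (emi + existing_emi) <= 0.4 * monthly_income
```

Everything an LLM needs to generate around this (endpoints, persistence, UI) is structural; the two functions above are the part only the business knows.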


Individuals: The spot where our jobs were standing has shifted. Instead of being the person who wrote code after understanding the business from someone, we have now become the person who understands the business so well that we can use LLM coding approaches to deliver the effect the business needs. This was always why the IT department existed in the first place (hence the Fortran analogy). Does this make all of us business analysts? Not really, but it does make us software product owners. A title that needs to be coined now.


Job losses: Come on, ask any CEO; there is so much the business always wanted to do and achieve but could not, due to lack of software delivery speed. That constraint is removed now.


However: Transition periods are confusing, chaotic and not time bound. That we need to bear. But total and complete job losses for us? No way.

What I wrote at the Jan 2026 new year:

By the time 2025 is closing and the question "will Coding LLMs/Agents/IDEs replace human programmers?" gets asked, I am at an "It might actually" moment.

A few quarters before, I was at "maybe, let's see".

That's the amount of progress the state of the art has made.
Depending on the report you read and its date, some 40 percent of code, or 256 billion lines, has been output by models. Forget the impact; such numbers mean a large amount of training and human verification has already gone back into the models/to the labs.

With more focused optimizations from the AI labs in model architecture and deployments in 2026, especially by plugging in some knowledge representation and knowledge priority/ranking, we might close the year 2026 on "Ah, it happened".

ps: there have been many shots taken at eliminating the programmer over the last 4 decades (really, it's that old of a dream).
These attempts tried visuals, formatted config, drag and drop and even document-based approaches. Additionally, design-based approaches like annotation and injection also kept happening.

But the "so called" ability to reason offered by LLMs is breaking past the totality barrier of code generation from specs. My best bet is that models becoming runtime aware will be the last finishing touch to the masterpiece.

Feb 2026:

Most of the buzz around LLM coding is about how fast and easy it was for a non-tech person to write an app or piece of software. At times we hear personal experiences of using LLM coding from old techies like me.

But we hardly hear anything from people who sell software itself as the commodity!
The likes of Android or Spring or Angular or Chrome have a huge buildup of code complexity in them. They also have competing and conflicting features, and a huge backlog of features they would have loved to implement yesterday.

The real and more believable metric of the impact of LLM coding will come from these places. If they are able to ship bigger feature sets faster, that would be the gold standard for productivity measurement around LLM coding.

This also raises another question: when LLM coding is so powerful, why would we need their frameworks at all? Would it not be easy to get your own curated framework based on your needs and use it? The picture is much darker for maintainers of smaller specialized libraries.

Vibe coding, or LLM coding as I call it, is going to have its first evolutionary prey-victim soon. Cambrian explosions are beautiful and brutal at the same time.

Will Code generating LLMs replace programmers soon?

By the time 2025 is closing and the question of whether Coding LLMs/Agents/IDEs will replace human programmers gets asked, I am at an "It might actually" moment.

A few quarters before, I was at "maybe, let's see".

That's the amount of progress the state of the art has made.
Depending on the report you read and its date, some 40 percent of code, or 256 billion lines, has been output by models. Forget the impact; such numbers mean a large amount of training and human verification has already gone back into the models/to the labs.

With more focused optimizations from the AI labs in model architecture and deployments in 2026, especially by plugging in some knowledge representation and knowledge priority/ranking, we might close the year 2026 on "Ah, it happened".

ps: There have been many shots taken at eliminating the programmer over the last 4 decades (really, it's that old of a dream).
These attempts tried visuals, formatted config, drag and drop and even document-based approaches. Additionally, design-based approaches like annotation and injection also kept happening.

But the "so called" ability to reason offered by LLMs is breaking past the totality barrier of code generation from specs. My best bet is that models becoming runtime aware will be the last finishing touch to the masterpiece.

I created my LLM from the ground up, you should too

Edit, based on the feedback: building a car and building a Ferrari that can be sold in a showroom are 2 different things. Here are links to learning resources: the Stanford class, Karpathy and Raschka.

LLMs have moved on since ChatGPT

As an AI Architect with production-level experience in AI/neural networks and regular reading of research papers in the field, the arrival of ChatGPT didn't stun my circle of technologists as much as it did the rest of the sector.

The initial use case in generative AI was focused on using it to literally "generate" insights via prompts. It quickly morphed into a general-purpose toolkit for most routine (or shallow) inferences. In a way it was sort of a Java or Windows moment for the technology field. This was the era that felt very comfortable to someone with prior hands-on experience with tensors.

The Cambrian explosion of LLMs

By the time 2024 ended, the default choice for GenAI projects wasn't always OpenAI. Many LLM companies shipped many versions of LLMs with different architectural designs, sometimes with an internal mixture of experts. In the larger field of LLMs, a scientist would still dismiss such differences as variations of the same thing. But if one is crafting solutions around LLMs, one has to account for differences in modality, reasoning, pure chatbots and so on. Not to forget their parameters, the tasks they were specialized in (and beat some benchmark for), and evaluating them and their output. There is also enough business traction in moving to domain-specific models. All of this demanded a personal, experiential look at LLMs.

Hands-on learning of LLMs

As such, there are multiple universities and experts with amazing courses and materials on LLMs. My search zeroed in on the course by Stanford and the material by Karpathy and Raschka. In fact, one can ask ChatGPT to generate code to create one's own LLM, and it would give you 20-something lines of code to do it. But what insight can that provide?

Learning path and building knowledge pyramid

This boils down to first mapping the knowledge pyramid for LLMs as a field. I got this image created with the help of ChatGPT as a reference, but one should define one's own based on where one is starting and how far one has to reach (there has to be an upper end).

Another aspect of my learning approach is to read 2-3 books on the same topic, and then read/watch/code further along the knowledge pyramid/path. That ends up being 10-12 books, or the equivalent in watching/coding, and a few months for each new technology wave.

As such, each book has its own objective, so when books are titled "from scratch", "hands on", "head first", "in production" or "deep dive" etc., it makes sense to read the table of contents or preface. That can give you a good idea of the journey the author will take you on (and what remains for you to do on your own). It is also useful to read the book cover to cover, including the references.

For my learning style, I wanted to be guided by someone who could take the learning from a blank slate and build on it, like how the LLM state-of-the-art was built. My approach is also to type along, run and experiment. This is where Sebastian Raschka’s material on LLMs worked best for me.

Building the Actual, not a Large Language Model

The building-from-scratch approach meant that I had to literally start with understanding transformers, choose text and convert it into embeddings, code attention and then move on to multi-head attention while experiencing the why of it, normalize the layers and code my own GPT model. This version just allowed me to chat around the text I trained it on. But the real intention was to see how everything I did leads up to and affects the output, rather than saying "look, here is my LLM".
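To give a flavor of the "code attention" step, here is a toy sketch of scaled dot-product attention in plain Python (this is an illustration, not Raschka's actual code; real implementations use PyTorch tensors, masking and batching):

```python
import math

def softmax(xs):
    # Subtract the max for numerical stability before exponentiating.
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]

def attention(Q, K, V):
    """Scaled dot-product attention: softmax(Q K^T / sqrt(d)) V."""
    d = len(K[0])
    out = []
    for q in Q:
        # Similarity of this query with every key, scaled by sqrt(d).
        scores = [sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(d) for k in K]
        weights = softmax(scores)
        # Output is a weighted average of the value vectors.
        out.append([sum(w * v[j] for w, v in zip(weights, V))
                    for j in range(len(V[0]))])
    return out
```

Multi-head attention then just runs several of these in parallel over learned projections and concatenates the results.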

I then learnt to add depth by evaluating the generative output and instruction fine-tuning it (and LoRA). This was done by downloading GPT-2 and loading/using it for evaluation. I also took a detour with Ollama, since I wanted to use a few more models for evaluation and see the minute differences.
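The intuition behind the LoRA detour can be shown in a few lines: instead of updating a full weight matrix, you train two small low-rank factors whose product is added to the frozen weights. This is a toy numeric sketch of that idea, not any library's actual API:

```python
def matmul(A, B):
    # Naive matrix multiply; enough for a toy demonstration.
    return [[sum(a * b for a, b in zip(row, col)) for col in zip(*B)] for row in A]

# Frozen pretrained weight: a 4x4 identity here, i.e. 16 parameters.
W = [[1.0 if i == j else 0.0 for j in range(4)] for i in range(4)]

# LoRA factors with rank r = 1: only 4 + 4 = 8 trainable parameters.
B = [[0.1], [0.0], [0.0], [0.0]]   # shape (d_out, r)
A = [[0.0, 0.2, 0.0, 0.0]]         # shape (r, d_in)

delta = matmul(B, A)               # rank-1 update to the weights
W_adapted = [[w + d for w, d in zip(w_row, d_row)] for w_row, d_row in zip(W, delta)]
```

The parameter saving is the whole point: at real model scale, two thin factors replace updates to a matrix with millions of entries.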

There were 2-3 variations of my model that I fine-tuned for classification and for generating my riddle version of output (remember 4th-standard kids inventing their own riddle/cryptic languages for speaking, that thing).

My most fun moments were experiencing the epochs and watching the output of the print command on the model object (this is real fun, do it). The code-along approach also gave me first-hand intuition around tuning and parameters.

The most frustrating aspect, which is also a reality check, is that debugging the mistakes is very tough. Not the syntax ones, but the ones related to PyTorch and the attention overlap area.

Raschka has been very generous in mentioning additional material for curious minds. For someone like me who is looking for a wide and deep understanding of the craft, taking detours to DPO and Bahdanau attention added to the joy.

What next

Personally, I went on to spend some money on Google Colab and try it all out at a slightly larger scale/on TPUs. But that's as far as faithful learning goes.

My friend Kamlesh has given me a target to fine-tune my post-trained LLM and beat the incumbents on one of the benchmarks :). Depends on my weekend time and budget, though. Moreover, Kamlesh and I have a history of big, ambitious aims. Last year, we wanted to build a vision model to read documents in Modi script (later, IIT Kharagpur took up such a project with the institution we were to approach).

As such, there are not many corporations that will be building LLMs of their own and beating the AI labs.

The amount of data needed to give an LLM commercial meaning is a bigger problem than server and talent cost. There is, however, a case for creating domain LLMs as the optimization cycles in the field accelerate and things become cheaper.

While I wait for the river to flow in its natural course, the most certain things in 2026 are:

1. We shall be building a lot of agentic stuff for production.

2. We shall build more solutions around LLMs.

3. New LLM architectures, their specializations and runtime costs in 2026 will look far different from what we saw in the last 5 years (better, I mean).

4. You and I will be crafting commercial solutions around LLMs and AI. Something that AI labs don't have the interest, domain expertise or commercial standing for!

So the learning will continue ..

Chatbots before LLMs

Designing Bot frameworks

In the era before LLMs, building a conversational agent meant one had to literally come up with sample phrases so that the engine could do Named Entity Recognition. Doing this ourselves was limiting by definition, as each person has a limited range of expression. In that era, there were also tools that generated sentences if you gave them the base activity as input. (Now that we have experienced LLMs, this sounds funny on multiple counts, but it used to work.) (The era before production-grade NLP was even weirder; I have documented that in an earlier post.)

This is also the era when a formal framework for chatbot interaction did not exist, so we ended up building our own. We had to debate the fitness of different NLP libraries (Stanford vs OpenNLP etc.), their accuracy, and the framework capability around interactions/invocation. In one case I was so frustrated with NLP that I designed a framework that let users issue typed commands instead of chat, with autocomplete/type-ahead added. Like typing elaborate commands on Unix, but with the typing experience of Google search. We were able to do this with a good keyword-parser-functional paradigm. And not to forget a brilliant developer with me, Nehal. But the momentum for a formal chat-style bot was huge, and frameworks arrived soon.
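The command-plus-type-ahead design above can be sketched roughly like this (the command names here are hypothetical examples; the real system mapped parsed keywords onto banking service calls):

```python
# Map command keywords to handler functions (illustrative commands only).
COMMANDS = {
    "balance":  lambda args: f"balance requested for account {args[0]}",
    "transfer": lambda args: f"transfer of {args[1]} to account {args[0]} queued",
    "block":    lambda args: f"card {args[0]} blocked",
}

def autocomplete(prefix):
    """Type-ahead: suggest commands matching what the user has typed so far."""
    return sorted(c for c in COMMANDS if c.startswith(prefix))

def execute(line):
    """Parse a typed command line: first token is the verb, the rest are arguments."""
    verb, *args = line.split()
    handler = COMMANDS.get(verb)
    return handler(args) if handler else "unknown command"
```

The keyword-parser approach sidesteps NLP entirely: the suggestion list constrains what the user can type, so there is nothing ambiguous left to "understand".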

Designing with chatbot Frameworks

Again, a debate would unfold about chat agent framework selection. Most of the frameworks had similar capability around the core NLP and service invocation parts, but they had marked differences in the "flow" aspects, voice vs text capabilities and so on. This made a huge difference when the conversations we were supporting had multiple end states or conditionalities. When we first did our chatbot in 2014/15, the Alexa one, the field of conversation design was not acknowledged (Alexa wasn't available in India then; we got it from the US). It was a few years later, especially when the commercial use cases came along, that the user-experience part of the chat interaction became mainstream/part of project work (interaction design). Bot discovery and interaction design are useful and important even in the agentic era.

Getting the chatbots working

Once the engine of the framework we had selected and trained determined the action, I would write and wire handlers to do the processing. It quickly evolved into a chain-of-command pattern cum workflow of some sort, giving rise to all sorts of integration issues: message transformation, errors, retries, auth and so on. Some of the frameworks had built-in capability to chain conversational flows (and pass variables/values around). Most of them had some take on retries and how long a conversation could be, but it had to be discovered rather than being documented.
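That chain-of-command shape can be sketched as follows (the handler names are illustrative; the real handlers wrapped service calls, retries and message transformations):

```python
class Handler:
    """Base link in the chain: pass any unrecognized action to the next handler."""
    def __init__(self, nxt=None):
        self.nxt = nxt

    def handle(self, action, payload):
        if self.nxt:
            return self.nxt.handle(action, payload)
        return f"unhandled action: {action}"

class GreetingHandler(Handler):
    def handle(self, action, payload):
        if action == "greet":
            return f"hello, {payload['name']}"
        return super().handle(action, payload)

class BalanceHandler(Handler):
    def handle(self, action, payload):
        if action == "balance":
            # In the real system this is where the service call and
            # message transformation would happen.
            return f"fetching balance for {payload['account']}"
        return super().handle(action, payload)

# Wire the chain: each engine-detected action flows down until claimed.
chain = GreetingHandler(BalanceHandler())
```

The appeal is that retries, auth and transformation concerns can each be added as another link without touching the existing handlers.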

And then there was the user. At times, he would be technically disconnected from the chatbot, so we had to maintain the whole conversation state in the database. Sometimes the follow-up step in the processing needed more inputs from the user, so I had to create a local and a global state for the whole interaction to be recorded.
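The local/global state split can be sketched like this (a simplified in-memory stand-in; the real version persisted this to a database keyed by user, so a reconnecting user could resume):

```python
class ConversationState:
    """Global slots survive the whole conversation; local slots belong
    to the current step and are cleared when that step completes."""
    def __init__(self):
        self.global_slots = {}
        self.local_slots = {}

    def set(self, key, value, scope="local"):
        target = self.global_slots if scope == "global" else self.local_slots
        target[key] = value

    def get(self, key):
        # Local values shadow global ones within a step.
        return self.local_slots.get(key, self.global_slots.get(key))

    def complete_step(self):
        self.local_slots.clear()
```

Splitting the scopes keeps step-specific inputs from leaking into later steps while identity and session facts stay available throughout.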

In some use cases we had to ask the user to upload receipts, which were processed by a vision model. And guess what, the image upload and processing could take more, and more varied, time than my chatbot could remain active for. So we kept some keepalive and sweet-nothing "status… updating…" messages going to the user, to fool the whole system. Moreover, the framework chosen didn't have native support for this sort of outside call, so everything had to be bundled together.
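The keepalive trick can be sketched with a background thread (a simplified stand-in; in our case it had to be bundled into the framework's own request handling):

```python
import threading

def with_keepalive(send, task, interval=0.05):
    """Run a slow task while periodically sending status messages,
    so the chat session is not timed out by the platform."""
    done = threading.Event()

    def heartbeat():
        # Fire a status message every `interval` seconds until the task finishes.
        while not done.wait(interval):
            send("status... updating...")

    t = threading.Thread(target=heartbeat)
    t.start()
    try:
        return task()          # e.g. the slow vision-model call
    finally:
        done.set()
        t.join()
```

`Event.wait(interval)` doubles as both the sleep and the stop signal, so the heartbeat exits promptly once the slow call returns.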

In another use case, we served the user content based on a help document. This had to be done with a combination of a search index in case the document repo was too large. In the case of a structured FAQ, one of the engines, NIA, had built-in capability for mapping queries to document keywords (density).

As a side note, many engines had some capability to detect obscenity, which sufficed. PII interestingly panned out in black and white in many cases (due to the domain and use case mix at that time).

Voice, Video and Human agent handovers

The voice-based chatbots had a different set of additional issues. The engines from AWS and Google had built-in ability to prompt for the question again if the pronunciation wasn't clear. At times, this ended up in multiple retries/passes at the same handler, so it had to be taken care of (since not all services were idempotent). At times, the user would totally rephrase the ask, which would throw our design off guard.

Another fun aspect is that as Alexa evolved into a voice device with screen, we suddenly had to take care of the visual aspect of the interaction. Multilingual support was out of the box, so life was cool .

Video bots were a different thing to handle. Our stuff didn't make it to production, but the idea was to emulate a human face with expressions (confidentiality etc. etc.). It was pretty impressive for that time.

In one of the cases, we had to hand over the interaction to a human agent based on a predefined scenario. It was a straightforward integration to another system, with some adjustments to timeouts. But when the requirement evolved into passing the whole conversation to a human agent, we realized that we didn't have a handle to the chat interaction provided by the framework! So I logged the conversations and passed them on. That eventually led us to design another product around chat interaction analysis/insights and design, and it went on to (then) compete with the Chatbase product.

Some notes

When we select a new technology or framework, it's best to adapt to its way of doing things. However, when the field is new and evolving, the capability mismatch can be huge. In many cases, when it came to calling/orchestrating service calls, my experience with traditional banking development helped me handle the issues of ambiguity, state and performance better than the chatbot-native generation of freshers who looked towards the state of the art for solutions. It also mattered because most of the recommended remedies around these problems were to use some sort of ESB or wire in RPA somehow. I had found them out of sync with the spirit of chat interaction. (Now that we have LLMs to reason, plan and orchestrate, I feel validated, with some sort of emotional closure.)

Later, when ChatGPT happened and we moved on to RAG and agentic tools, it felt like I was remaking the Spider-Man movie franchise for the third time. The story from there on in the next post.

Embracing the Weird Stuff: becoming an AI Architect

Weird Stuff that added flavor

As with any good technologist, new "weird stuff" keeps knocking on the door while you are focusing on some other technology. This is also the theme of how my resume got built.

Python was the first programming language I started my career with. At that time I was working on code generators, compiler switches and other "weird stuff" while my friends were working on Struts and J2EE. So with much effort I moved to those technologies. Working in the banking domain in the mid 2000s, we would create jobs to facilitate what was then called business intelligence. These were early ancestors of data cleaning, aggregation and summarization, done via code and served via UI. It didn't feel amazing, but it was work, so we did it.

Enter ChatBots

Cut to 2009: I was obsessed with JavaScript, Spring and the whole SMAC buzz, and then more "weird stuff" came my way. There we were, building a chat bot for relationship managers to support internet banking users. I used the IBM Sametime stack while my colleagues used the MS stack. Our bot could do basic chat and then allow screen sharing and video calls.

Post 2010, while I was chasing Angular, microservices and my Ethical Hacker certification, more "weird stuff" called Hadoop came along. So there I was, working on MapReduce, Storm and Spark. We had to do some data science work using Apache's MLlib. My background with BI allowed me to work on it, but converting beautiful data structures into some numbers and flags didn't seem like a calling to the programmer in me 🙂, so I was very happy when a Data Scientist joined our team. By this time it was clear that DS was the hottest job of the decade, but I had moved on to the Node and Docker world, and then more "weird stuff" called R came along. And again I was supposed to build a chat application. I was so angry at having my "When Harry Met Sally" moment that I wrote a blog post against chatbots 🙂 (here).

ChatBots strike again

It was by this time that the role of AI Architect was coined, bringing me peace. So I moved on to projects working full time on AI and automation. Our projects had work going on with neural networks of all kinds, and all of it was hands-on. But the industry had some other plan. A piece of "weird stuff" called a chatbot came knocking on the door. So there I was, building chatbots on multiple frameworks. We had Alexa, Google's Dialogflow, Amazon's and IBM's stacks, and an open source framework called RASA. Not to forget our in-house AI studio/chatbot solution called NIA that my friend GuruPrasad was developing.

This was probably the Jurassic age of bots .

Have LLM use cases plateaued?

This is the spread of ChatGPT use cases. What is not evident from the chart is that people didn't really ask for, or need, an LLM to do these things.
Most users would be totally cool having one app for images, another for summaries, another for coding assistance and just another for music.

(The chart was created by asking ChatGPT.)


So long as the features and cost are reasonably good, most users won't even care what's powering them.

The everything-everywhere-all-at-once nature of LLMs signifies that their builders have been successful in discovering use cases around the powers that GPT gave them.
This is also the game where feature parity tends to plateau very soon. Ask anyone who has developed a successful consumer-focused mobile utility app, or look at your own mobile app installation history over the last decade.
While there is always some buzz around the benchmark the latest model from some company has beaten, 5 years into LLMs the real question we need to ask is: does the end consumer notice or even care? Especially the average retail internet consumer? (where the market capitalization lies 🙂)

Which is also to imply that for end internet users, the craze or utility of GenAI might have peaked as novelty and is now taken for granted, on par with your mobile phone.
When was the last time you really felt the difference between version 10 and 11 of your mobile phone?

Ah LLMs ….

PS: 60% of paying ChatGPT users are enterprise.


Podcast : AI threats and Future outlook

In this podcast in the Marathi language, hosted by Omkar Dabhadkar, I spoke about #AI. Marathi is spoken by 80 to 100 million people, but when it comes to Artificial Intelligence we did not have non-dumbed-down content.
In this video I spoke of the impact of AI on society in general and IT jobs in particular. The discussion also touched upon Indian efforts in the field of AI, the nature of the Indian IT industry, Indian startups, the different skilling that AI will enforce upon freshers, and the remedies colleges can take. We also ended up covering Gen Z and my optimism about them.
https://www.youtube.com/watch?v=A-3CWRoQQnY

Wait but why High Agency?

West’s rediscovery of Asian flexibility.

Sharing this very good blog by George Mack. It says that having agency might be the most important theme of this century. There are people who will get stuff done, done right, no matter what. That is the basic theme of the blog. It is very well written, so please read it for the pleasure of reading.

From his notes, it looks like the blog was inspired by Tim Urban of Wait But Why fame. Another very articulate author worth investing your time in.

However, reading this kind of author sounds very funny to my Asian mind. Most of what they write is very basic human nature. It's just that they write it out in every detail, laying out each and every factor that goes into shaping individual or group behavior. Some people call it first-principles thinking. But most of the time it is detailing human behavior in a bottom-up conceptual pyramid fashion. That is great articulation, but hardly first-principles thinking.

I get a similar feeling when I source my books on leadership and team development after reading HBR. The books on influence, candor or gravitas are fine books; they do offer nuanced details. But the central theme of what they are attempting has the same characteristic: telling the dynamics of human behavior as if they have been discovered for the first time.

For someone who thinks that the ideas or ideals they have in their mind, or the ones they read in a book or were taught by someone, actually shape people's behavior, this might sound very enlightening.

But for someone from a country or social background with exposure to poverty, deprivation and the struggle for upward mobility, most of this is very obvious.

Since I don't live in the USA, I wonder if people are actually living in a social monoculture, away from the struggles of the hunter-gatherer level of life, which gives rise to the need for such blogs.

Also, when I look back at the picture I got of the USA when I read books like Gone with the Wind on popular culture, or the 80s hit book on leadership called What They Don't Teach You at Harvard Business School, it didn't feel like a society that needed basic human nature to be detailed out in such an intellectually laborious manner. (Just to be fair, sometimes Gen Z in India also gives me this feeling.) Maybe we are dealing with people grown in the flowerpot equivalent of a social environment.

https://www.highagency.com

Work Hours and a GenZ workplace.

The 8 hour norm

It was in the 1920s that Ford realized that for optimum productivity, labour well-being is important. That's the beginning of the 8-hour work day.
Between the 1920s and 2020s, a lot has changed in terms of our expectations from life, our social relationships and, most importantly, our ability to focus. It doesn't need a social scientist to tell us that in an environment full of notifications/distractions and a huge amount of emotional overload caused by media and news consumption, the 8-hour work day itself is a mirage.

The missing link


The second point a lot of us miss about the 8-hour workday movement is that it was optimal for the assembly-line setup. Software, creative work and sports are not really the kind of professions where repetitive, mentally less taxing measures of work, output or productivity can apply. It's for these reasons that many good companies offer recreational facilities, meditation, greenery and suchlike at the workplace.
Many studies on the actual productive work done by software professionals record 3 to 4 hours of meaningful work. That too happens in bursts of 40-minute blocks.
The rest of the time is spent on animated, less brain-consuming stuff. But this is also the beauty of the human brain. While we can't do brain-intensive work for hours on end, the slack not only recharges the brain but also fuels some creative, lateral reflection on the task at hand. So in reality the slack is feeding into the performance we deliver in the 3.5-hour productive zone.

Realities of a Gen Z workplace

However, a lot of the above studies have become a little outdated with Gen Z entering the workforce. Recent articles in HBR-type publications are saying that Gen Z's need for direction, purpose and connection might have shifted the needle from 3.5-4 hours to a lesser number. In the post-COVID world, I believe even earlier generations might have slightly lower numbers when it comes to median productive hours.

But most of the software industry is still stuck at 8 hours of continuous office activity as a measure of productivity (read that again). The outsourcing shops add +1 hour to this. And worse still, some old executives call for 70-80-90-hour work weeks as a regular feature of the Indian industry. Not only is this demand out of touch with research on productivity, but it is also heavily out of touch with how emotionally the average Indian family man and woman operates.

ps: too lazy to locate and cite the source studies.

Give me something Quick and Dirty

Give me something Quick and Dirty!?

That's one of the most common statements that gets good engineers agitated.

It used to disturb me the most when I was a young architect. But with experience I am now able to understand the situation this statement originates from. Once you see it, it will also help you stay calm at the workplace, by not feeling an us-vs-them conflict or feeling that your craftsmanship has been questioned.

Here is where it originates

Clarity and certainty are the two most important things that managers/directors get asked about. The world would look like heaven if software engineers could give their managers certainty that the job will be done with quality and by what time. And guess what, this almost never happens.

So the second thing the manager will seek is clarity. Are you past the roadblock? What could he do to help you? And so on. And finally it again comes down to: the next certain date of delivery.

The world of the software engineers

This is in stark contrast with the world of the software engineer. To begin with, there is a lot of ambiguity in the requirements. It has always been there! There is also a lot of interdependency with everyone else's work. And on top of that, there are all the patterns, design ideals and cross-cutting concerns one needs to take care of. This is one of the main reasons we could all write solutions to our LeetCode algorithm problems in a near-predictable timeframe, but we can't be sure of when our teams can finish the project.

The issue oftentimes is that we as software engineers are poor at communicating all these dependencies that we feel and know. Hell, we even expect "them" to know it beforehand!

In great teams there are individuals or mechanisms that help developers verbalize and communicate this. But it is not so common.

What Quick and Dirty actually implies

This is where the Quick and Dirty comes in! The manager, pushed to the wall by the deadline, is asking you to:

1. Write the most direct code to the requirements; the design ideals can be applied later.

2. Make working assumptions on the missing pieces.

3. Eliminate the dependencies on others to the maximum extent possible.

4. Look for something already available, or even an alternative approach.

5. Brute force is allowed instead of elegance (for now).

6. Do not produce buggy or insecure code.

So the call for Quick and Dirty is really a call for the software engineer to move from the craftsman's role to the role of savior, communicator and negotiator (bargainer, if there is a word like that).

What has been your experience with the call for Quick and Dirty? Did you avert the crisis? Did you bargain for extra time to make everything proper later?

ps: What happened to the work of tech managers, you may ask? That's in the next blog.