The AI Revolution
Large language models (LLMs) are designed to generate human-like text based on a given prompt. They are revolutionizing the way we interact with machines and automating many tasks that previously required human intervention. The program that interfaces the LLM to the human or machine is called an agent.
An agent uses an LLM as a brain to make decisions about how to use a tool. In the case of a chat bot, the tool is the text input and display devices (and potentially audio microphone and speakers) that the bot (the agent) uses to interact with a user while the "brain" to which it interfaces to get the content is the LLM. In the case of a coding support agent the tool is the IDE (integrated development environment) or editor the human user is using to edit software and with which the agent (the plugin such as copilot) is assisting. In the case of a robot the voice encoder/decoder and actuators, vison system and touch sensors that comprise the robot are the tools. In some cases and application can be both a tool and an agent depending on the use case.
Over the last five years, LLMs (and commensurately agents) have evolved significantly. One of the most significant advancements in this field is the development of large-scale pre-trained language models. These models are trained on vast amounts of text data and can generate coherent and fluent text that is often indistinguishable from human writing. LLMs can be used for a wide range of applications, including chatbots, search engines, summarization tools, and even code generation.
One of the most popular proprietary LLMs (at the time of (re)writing) is Chat GPT-4 developed by OpenAI. GPT3 had 175 billion parameters while GPT4 has approximately 1.75 trillion parameters and can generate high-quality text for a wide range of applications. Parameters are the numeric weights and biases that comprise the variables at the neuronal level that the LLM adjusts in order to learn. Like Gemini (Google), Claude (Anthropic), LLaMA (Meta), Mistral (Mistral) and Grok (xAI), GPT is a general purpose LLM trained on a large corpus of narrative text and the fine tuned with conversation patterns. LaMDA also developed by Google, (Language model for dialogue applications) is an LLM trained on general dialogue from the outset and designed to understand natural language conversations and provide relevant responses. Which approach is better for conversational AI is an open question today as LaMDA released in 2021 and is thus "old" by LLM standards and the other models mentioned have been released after that date and substantially enhanced. Their larger training sets and fine tuning may have given them conversational capabilities indistinguishable from those of the earlier LaMDA.
So far we have concentrated on the large (generally proprietary) LLMs, but many now have mini versions released which are small enough to use on a mobile device, and some like LLaMA (Meta) and Grok (xAI) are released as open source models (usually through huggingface.co. In the case of LLaMA the models are released as fine tuned & optimised end products ready for deployment onto desktop systems with advance graphic cards to aid in inference processing while in the case of Grok the raw model has been released which requires fine tuning training to be effectively used. There are many other models available on huggingface enabling developers and experimenters to download the model and directly apply it to their own use case on local hardware. This is a legitimate use of the models for both commercial and non commercial purposes although some skill in coding and AI technologies in Python or C++ will be required. While the large models are available, there are also many mini-versions also which in some use cases actually outperform their larger cousins in specific problem domains and in many cases are otherwise sufficient for general use. See Hugging Face – The AI community building the future.. for access to a very large library of LLMs and associated agents as well as detailed instructions on how to use them. Both LM Studio (LM Studio - Discover, download, and run local LLMs) and Leo in the Brave Browser (Brave Browser Download | Brave) support connecting and loading a variety of open source LLMs. The first advantage of a local LLM is that all the data shared with it in the prompt stays local while the second advantage is that it is a lot cheaper than paying hosted usage fees.
The LLM is essentially just a model of how semantic elements (called tokens or words) relate to each other, but equipped with agents (tools that wrap the LLM feeding it data in the form of prompts and receiving, responding to, and acting on responses provided) are what give the LLM its application and usefulness. Chat bots are a kind of agent with which most users are now familiar. In some cases the agents are designed to deal with multiple LLM's allowing the user to target the response to different purposes.
Some LLMs are naturally better at some tasks than others because of their training sets, architecture or fine tuning. For example Claude is said to be particularly strong in program code generation, while LaMDA is stronger in conversational dialogues and OpenAI o1 is specifically trained to perform complex reasoning tasks by generating a long internal chain of thought before responding and thus also excels in mathematics, science and coding problem solving. Among the classic uses cases are:
If you want to learn more about LLMs and their applications, we recommend checking out this article on Beebom that provides a list of many of the best large language models in 2023 (as of the time of original writing) with detailed descriptions and use cases. At the time of the update to this page (2025) this article is probably a little dated.
LLMs and their agents represent an exciting area of research in AI technology. With the development of large-scale pre-trained language models and new solutions appearing on the internet that use prompts to create an output, we are seeing new possibilities for automating tasks and creating engaging experiences for users. The applications of LLMs are diverse and include chatbots, search engines, summarization tools, code generation, content creation, translation, and more. However, we must also be mindful of the challenges associated with these technologies.