What is OpenAI?
- An AI research organization founded in 2015.
- Mission: Ensure AGI (Artificial General Intelligence) benefits all of humanity.
- Key products: GPT models, ChatGPT, Codex, DALL·E.
- What are GPT, o3, o4-mini?
- GPT stands for Generative Pre-trained Transformer.
- o3 is OpenAI’s most powerful system as of 2025.
- o4-mini is a lighter, faster model.
- Why is this globally relevant?
- AI systems like GPT are widely used in:
- Customer service
- Healthcare (AI diagnosis)
- Legal tech (contract review, legal research)
- Education (AI tutors)
What is an LLM (Large Language Model)?
- A Large Language Model (LLM) is an AI system trained to understand and generate human language using statistical patterns learned from large text datasets.
- It's powerful, but not perfect—prone to hallucinations and lacks real-world understanding.
Key Features of LLMs:
Feature
|
Description
|
Scale
|
Trained on billions to trillions of words from books, websites, forums, etc.
|
Learning Method
|
Uses unsupervised learning to detect patterns, grammar, and context in text.
|
Function
|
Can generate, translate, summarise, or complete text, and even answer questions.
|
Architecture
|
Most LLMs today are based on Transformer architecture (introduced by Google in 2017).
|
Examples of LLMs:
Model
|
Company
|
Notes
|
GPT-4 / o3 / o4-mini
|
OpenAI
|
Powers ChatGPT
|
Gemini (formerly Bard)
|
Google DeepMind
|
Multi-modal model
|
Claude 3
|
Anthropic
|
Focuses on safety and alignment
|
LLaMA
|
Meta
|
Open-source foundation model
|
Mistral
|
Mistral AI
|
Lightweight, open-source LLM
|
How Do LLMs Work?
- LLMs are trained to predict the next word in a sentence based on the context of previous words.
- They use probabilistic methods to choose the most likely word.
- They don’t understand meaning like humans do; they recognize statistical patterns.
|