AI related definitions and terminology - annotated

AI Definitions and Terminology

{wikipedia}

Artificial intelligence (AI) is the capability of computational systems to perform tasks typically associated with human intelligence, such as learning, reasoning, problem-solving, perception, and decision-making. It is a field of research in computer science that develops and studies methods and software that enable machines to perceive their environment and use learning and intelligence to take actions that maximize their chances of achieving defined goals.

The Glossary listed below was generated by ChatGPT

Glossary of AI Terms and Definitions

Latest Technology Awareness Date: June 2024
(Incorporates concepts from GPT-4, GPT-4-turbo, GPT-4o, LLaMA 3, Gemini, Claude, Mistral, and more as of mid-2024.)


A. Core AI Concepts

Artificial Intelligence (AI)

The field of computer science focused on creating systems capable of tasks that typically require human intelligence (e.g. perception, reasoning, learning, planning, natural language understanding).

Machine Learning (ML)

A subset of AI where computers learn from data without explicit programming; algorithms improve performance on a task over time as they are exposed to more data.

Deep Learning

A subfield of ML that uses neural networks with many layers (“deep” architectures) to learn hierarchical representations of data, powering applications like image recognition and language modeling.

Neural Network

A computational model inspired by biological neurons; consists of interconnected nodes (neurons) that transform input data through weighted connections to produce output.

Supervised Learning

An ML approach where the model learns from labeled data (input-output pairs) to predict outputs for new, unseen inputs.

Unsupervised Learning

Learning patterns or structures from data without explicit labels, such as clustering or dimensionality reduction.

Reinforcement Learning (RL)

An ML paradigm where an agent learns to take actions in an environment to maximize cumulative reward through trial and error.

Transfer Learning

Applying knowledge gained from one task or dataset to improve learning on a related but different task.

Generative AI

Systems that can produce new content—text, images, audio, or code—by learning the underlying distribution of the data.

Foundation Model

A very large, general-purpose model trained on broad data (e.g. GPT, PaLM, LLaMA) that can be adapted to many downstream tasks.


B. NLP & Language Models

Natural Language Processing (NLP)

The branch of AI focused on enabling computers to understand, interpret, and generate human language.

Large Language Model (LLM)

A neural network trained on vast text data to predict and generate human-like text, often with billions or trillions of parameters.

Transformer

A neural network architecture introduced in 2017 (Vaswani et al.) that uses self-attention mechanisms to model dependencies in sequential data, enabling scalable training of LLMs.

Prompt

An input given to an AI model (especially an LLM) to steer its response toward a desired output.

Prompt Engineering

Crafting prompts strategically to guide AI model outputs toward specific tasks or styles.

Chain-of-Thought (CoT)

A technique where the model generates intermediate reasoning steps to improve complex problem-solving.

Instruction Tuning

Fine-tuning a model on datasets where instructions and expected outputs are paired, improving its ability to follow human directions.

Few-Shot Learning

The ability of a model to perform new tasks with very few examples provided at inference time.

Zero-Shot Learning

The ability of a model to perform tasks it was not explicitly trained for, relying on general knowledge encoded during training.

Context Window

The amount of input text (tokens) a model can “see” at once; recent models have context windows of tens or hundreds of thousands of tokens.


C. Technical Terms

Parameter

A learned weight in a neural network; LLMs often have billions or trillions of parameters.

Embedding

A dense vector representation of data (words, sentences, images) capturing semantic or contextual meaning.

Attention Mechanism

A method in neural networks that dynamically focuses on different parts of the input when producing output, core to the transformer architecture.

Loss Function

A metric optimized during training to measure the difference between predicted and actual outputs.

Backpropagation

The algorithm for computing gradients of the loss with respect to model parameters, enabling learning through gradient descent.

Gradient Descent

An optimization technique that iteratively adjusts parameters to minimize the loss function.

Overfitting

When a model learns the training data too well, including noise, and fails to generalize to new data.

Regularization

Techniques to prevent overfitting by penalizing model complexity.

Fine-Tuning

Continuing the training of a pretrained model on a new, often smaller, dataset to specialize it for a particular task.


D. Deployment & Safety

Inference

The process of using a trained model to make predictions on new data.

Serving

Deploying a model so it can respond to user requests in real-time.

Latency

The delay between a user request and the model’s response.

Alignment

Efforts to ensure AI systems behave in ways consistent with human values and intentions.

Hallucination

When an AI model generates text that is syntactically correct but factually false or misleading.

Red Teaming

Systematic testing of AI systems to discover vulnerabilities or harmful outputs.

Guardrails

Techniques and policies to constrain AI behavior to safe and appropriate outputs.

RLHF (Reinforcement Learning from Human Feedback)

Training technique where human preferences are used to shape a model’s outputs via reward signals.


E. Emerging & Industry Terms

MoE (Mixture of Experts)

A neural network design where only a subset of the model’s parameters (experts) are active per input, increasing efficiency and scalability.

Multimodal Model

An AI model that can process and generate multiple types of data, such as text, images, audio, and video.

Vision-Language Model (VLM)

A model that jointly understands images and text, such as CLIP or GPT-4 with vision.

Retrieval-Augmented Generation (RAG)

Combining retrieval systems with generative models to improve factual accuracy by grounding responses in external documents.

Synthetic Data

Artificially generated data used to augment training datasets.

Open-Weight Model

A model whose weights are publicly released (e.g. Meta’s LLaMA 2/3).

Closed-Weight Model

A model whose weights remain proprietary (e.g. OpenAI’s GPT-4).


📅 Technology Awareness Date

Definitions current as of June 2024.
Includes trends and terminology reflecting the state of the field up through mid-2024.

My initial goal in creating a Glossary page is just to actually learn all of the basic terminology.  I feel comfortable asking ChatGPT for its own assessment of AI terminology.

I wanted to also see what the Google Search Internet thinks are the salient glossaries out there.  Here are a few from SERP 1:

“CNET” grabbed my eye as I remember CNET from a long time ago, therefore my brain gives it greater weight than other possible results.  I’m surprised at how incomplete such a list is; the page is well monitized.

Click here https://www.cnet.com/tech/services-and-software/chatgpt-glossary-52-ai-terms-everyone-should-know/ to see a list of CNET Tech Writer Imad Khan’s 52 AI Terms Everyone Should Know.  I read the list and it has some interesting inclusions and exclusions, such as a big entry for Perplexity.ai chatbot but no mention, at all, of Anthropic / Claude or MicroSoft Copilot, which is advertised in the header of the page.

I personally like coursera and I have audited a few courses they provide.   Here’s their list: https://www.coursera.org/resources/ai-terms

And here’s a link from a source I know nothing about – Moveworks.  They put together a nice Glossary:

https://www.moveworks.com/us/en/resources/ai-terms-glossary

The Glossary below was created by the Divi AI page creation tool.  I really like the way it looks and has nice graphics to go with the terms.  I might find it difficult to augment the list while maintaining the style, until, of course, I ask ChatGPT to change it for me.

AI Policy Glossary

Artificial Intelligence (AI)

Machine Learning

Data Privacy

Algorithm

Ethical AI

Bias Mitigation

Compliance

Transparency

Automation

Neural Networks

Deep Learning

AI Governance

Risk Management

AI Ethics

Data Security

Predictive Analytics

Natural Language Processing (NLP)

Regulatory Frameworks