Abstract Algorithms

ai

44 articles across 26 sub-topics

Types of LLM Quantization: By Timing, Scope, and Mapping

TLDR: There is no single "best" LLM quantization. You classify and choose quantization along three axes: when you quantize (timing), what you quantize (scope), and how values are encoded (mapping). In practice, most teams start with weight quantizati...

Mar 14, 2026•14 min read

LoRA Explained: How to Fine-Tune LLMs on a Budget

TLDR: Fine-tuning a 7B-parameter LLM updates billions of weights and requires expensive GPUs. LoRA (Low-Rank Adaptation) freezes the original weights and trains only tiny adapter matrices that are added on top. 90%+ memory reduction; zero inference l...

Mar 9, 2026•13 min read

LLM Model Quantization: Why, When, and How to Deploy Smaller, Faster Models

TLDR: Quantization converts high-precision model weights and activations (FP16/FP32) into lower-precision formats (INT8 or INT4) so LLMs run with less memory, lower latency, and lower cost. The key is choosing the right quantization method for your a...

Mar 8, 2026•13 min read

Variational Autoencoders (VAE): The Art of Compression and Creation

TLDR: A VAE learns to compress data into a smooth probabilistic latent space, then generate new samples by decoding random points from that space. The reparameterization trick is what makes it trainable end-to-end. Reconstruction + KL divergence loss...

Feb 11, 2026•13 min read

Deep Learning Architectures: CNNs, RNNs, and Transformers

TLDR: CNNs, RNNs, and Transformers solve different kinds of pattern problems. CNNs are great for spatial data like images, RNNs handle ordered sequences, and Transformers shine when long-range context matters. Choosing the right architecture often ma...

Feb 8, 2026•12 min read

Neural Networks Explained: From Neurons to Deep Learning

TLDR: A neural network is a stack of simple "neurons" that turn raw inputs into predictions by learning the right weights and biases. Training means repeatedly nudging those numbers via back-propagation until the error shrinks. Master the basics and ...

Feb 8, 2026•12 min read

Ai Agents(4)

Skills vs LangChain, LangGraph, MCP, and Tools: A Practical Architecture Guide

TLDR: These are not competing ideas. They are layers. Tools do one action. MCP standardizes access to actions and resources. LangChain and LangGraph orchestrate calls. Skills package business outcomes with contracts, guardrails, and evaluation. Most ...

Mar 12, 2026•13 min read

Multistep AI Agents: The Power of Planning

TLDR: A simple ReAct agent reacts one tool call at a time. A multistep agent plans a complete task decomposition upfront, then executes each step sequentially — handling complex goals that require 5-10 interdependent actions without re-prompting the ...

Mar 9, 2026•14 min read

'The Developer''s Guide: When to Use Code, ML, LLMs, or Agents'

TLDR: AI is a tool, not a religion. Use Code for deterministic logic (banking, math). Use Traditional ML for structured predictions (fraud, recommendations). Use LLMs for unstructured text (summarization, chat). Use Agents only when a task genuinely ...

Mar 9, 2026•14 min read

AI Agents Explained: When LLMs Start Using Tools

TLDR: A standard LLM is a brain in a jar — it can reason but cannot act. An AI Agent connects that brain to tools (web search, code execution, APIs). Instead of just answering a question, an agent executes a loop of Thought → Action → Observation unt...

Mar 9, 2026•14 min read

Generative Ai(4)

Mastering Prompt Templates: System, User, and Assistant Roles with LangChain

TLDR: A production prompt is not a string — it is a structured message list with system, user, and optional assistant roles. LangChain's ChatPromptTemplate turns this structure into a reusable, testable, injection-safe blueprint. TLDR: LangChain p...

Mar 9, 2026•12 min read

How to Develop Apps Using LangChain and LLMs

TLDR: LangChain is a framework that simplifies building LLM applications. It provides abstractions for Chains (linking steps), Memory (remembering chat history), and Agents (using tools). It turns raw API calls into composable building blocks. TLD...

Mar 9, 2026•14 min read

LLM Hyperparameters Guide: Temperature, Top-P, and Top-K Explained

TLDR: Temperature, Top-p, and Top-k are three sampling controls that determine how "creative" or "deterministic" an LLM's output is. Temperature rescales the probability distribution; Top-k limits the candidate pool by count; Top-p limits it by cumul...

Feb 15, 2026•14 min read

RAG Explained: How to Give Your LLM a Brain Upgrade

TLDR: LLMs have a training cut-off and no access to private data. RAG (Retrieval-Augmented Generation) solves both problems by retrieving relevant documents from an external store and injecting them into the prompt before generation. No retraining re...

Feb 11, 2026•12 min read

Machine Learning(4)

Mathematics for Machine Learning: The Engine Under the Hood

TLDR: 🚀 Three branches of math power every ML model: linear algebra shapes and transforms your data, calculus tells the model which direction to improve, and probability gives it a way to express confidence. You don't need to memorize formulas — you...

Feb 8, 2026•13 min read

Unsupervised Learning: Clustering and Dimensionality Reduction Explained

TLDR: Unsupervised learning helps you find patterns when you do not have labels. Clustering groups similar data points into segments, and dimensionality reduction compresses large feature spaces into smaller, useful representations for visualization,...

Feb 8, 2026•11 min read

Supervised Learning Algorithms: A Deep Dive into Regression and Classification

TLDR: Supervised learning maps labeled inputs to outputs. In production, success depends less on algorithm choice and more on objective alignment, calibration, threshold tuning, and drift monitoring. This post walks through the full pipeline from dat...

Feb 8, 2026•13 min read

Machine Learning Fundamentals: A Beginner-Friendly Guide to AI Concepts

TLDR: 🤖 AI is the big umbrella, ML is the practical engine inside it, and Deep Learning is the turbo-charged rocket inside that. This guide explains -- in plain English -- how machines learn from data, the difference between supervised and unsupervi...

Feb 7, 2026•14 min read

Llm(3)

Tokenization Explained: How LLMs Understand Text

TLDR: LLMs don't read words — they read tokens. A token is roughly 4 characters. Byte Pair Encoding (BPE) builds an efficient subword vocabulary by iteratively merging frequent character pairs. Tokenization choices directly affect cost, context limit...

Feb 11, 2026•12 min read

LLM Terms You Should Know: A Helpful Glossary

TLDR: The world of LLMs has its own dense vocabulary. This post is your decoder ring — covering foundation terms (tokens, context window), generation settings (temperature, top-p), safety concepts (hallucination, grounding), and architecture terms (a...

Feb 11, 2026•13 min read

Large Language Models (LLMs): The Generative AI Revolution

TLDR: Large Language Models predict the next token, one at a time, using a Transformer architecture trained on billions of words. At scale, this simple objective produces emergent reasoning, coding, and world-model capabilities. Understanding the tra...

Feb 8, 2026•14 min read

Agent-architecture(2)

LLM Skills vs Tools: The Missing Layer in Agent Design

TLDR: A tool is a single callable capability (search, SQL, calculator). A skill is a reusable mini-workflow that coordinates multiple tool calls with policy, guardrails, retries, and output structure. If you model everything as "just tools," your age...

Mar 12, 2026•14 min read

LLM Skill Registries, Routing Policies, and Evaluation for Production Agents

TLDR: If tools are primitives and skills are reusable routines, then the skill registry + router + evaluator is your production control plane. This layer decides which skill runs, under what constraints, and how you detect regressions before users do...

Mar 12, 2026•13 min read

Fine Tuning(2)

SFT for LLMs: A Practical Guide to Supervised Fine-Tuning

TLDR: Supervised fine-tuning (SFT) is the stage where a pretrained model learns task-specific response behavior from curated input-output examples. It is usually the first alignment step after pretraining and often the foundation for later RLHF. Good...

Mar 9, 2026•11 min read

PEFT, LoRA, and QLoRA: A Practical Guide to Efficient LLM Fine-Tuning

TLDR: Full fine-tuning updates every model weight, which is expensive in memory, compute, and storage. PEFT methods update only a small trainable slice. LoRA learns low-rank adapters on top of frozen base weights. QLoRA pushes efficiency further by q...

Mar 9, 2026•12 min read

#Bitsandbytes(1)

Practical LLM Quantization in Colab: A Hugging Face Walkthrough

TLDR: This is a practical, notebook-style quantization guide for Google Colab and Hugging Face. You will quantize real models, run inference, compare memory/latency, and learn when to use 4-bit NF4 vs safer INT8 paths. 📖 What You Will Build in Thi...

Mar 12, 2026•13 min read

Awq(1)

GPTQ vs AWQ vs NF4: Choosing the Right LLM Quantization Pipeline

TLDR: GPTQ, AWQ, and NF4 all shrink LLMs, but they optimize different constraints. GPTQ focuses on post-training reconstruction error, AWQ protects salient weights for better quality at low bits, and NF4 offers practical 4-bit compression through bit...

Mar 12, 2026•13 min read

Ai Safety(1)

RLHF in Practice: From Human Preferences to Better LLM Policies

TLDR: Reinforcement Learning from Human Feedback (RLHF) helps align language models with human preferences after pretraining and SFT. The typical pipeline is: collect preference comparisons, train a reward model, then optimize a policy (often with KL...

Mar 9, 2026•10 min read

Ai Infrastructure(1)

LLM Model Naming Conventions: How to Read Names and Why They Matter

TLDR: LLM names encode practical decisions: model family, size, training stage, context window, format, and quantization level. If you can decode naming conventions, you can avoid costly deployment mistakes and choose the right checkpoint faster. �...

Mar 9, 2026•11 min read

Data Science(1)

Why Embeddings Matter: Solving Key Issues in Data Representation

TLDR: Embeddings convert words (and images, users, products) into dense numerical vectors in a geometric space where semantic similarity = geometric proximity. "King - Man + Woman ≈ Queen" is not magic — it is the arithmetic property of well-trained ...

Mar 9, 2026•14 min read

Basics(1)

What are Logits in Machine Learning and Why They Matter

TLDR: Logits are the raw, unnormalized scores produced by the final layer of a neural network — before any probability transformation. Softmax converts them to probabilities. Temperature scales them before Softmax to control output randomness. 📖 T...

Mar 9, 2026•11 min read

Artificial Intelligence(1)

Unlocking the Power of ML, DL, and LLM Through Real-World Use Cases

TLDR: ML, Deep Learning, and LLMs are not competing technologies — they are a nested hierarchy. LLMs are a type of Deep Learning. Deep Learning is a subset of ML. Choosing the right layer depends on your data type, problem complexity, and available t...

Mar 9, 2026•14 min read

Beam-search(1)

Text Decoding Strategies: Greedy, Beam Search, and Sampling

TLDR: An LLM doesn't "write" text — it generates a probability distribution over all possible next tokens and then uses a decoding strategy to pick one. Greedy, Beam Search, and Sampling are different rules for that choice. Temperature controls the c...

Mar 9, 2026•14 min read

Alignment(1)

RLHF Explained: How We Teach AI to Be Nice

TLDR: A raw LLM is a super-smart parrot that read the entire internet — including its worst parts. RLHF (Reinforcement Learning from Human Feedback) is the training pipeline that transforms it from a pattern-matching engine into an assistant that is ...

Mar 9, 2026•13 min read

Chatgpt(1)

Prompt Engineering Guide: From Zero-Shot to Chain-of-Thought

TLDR: Prompt Engineering is the art of writing instructions that guide an LLM toward the answer you want. Zero-Shot, Few-Shot, and Chain-of-Thought are systematic techniques — not guesswork — that can dramatically improve accuracy without changing a ...

Mar 9, 2026•12 min read

Chromadb(1)

Guide to Using RAG with LangChain and ChromaDB/FAISS

TLDR: RAG (Retrieval-Augmented Generation) gives an LLM access to your private documents at query time. You chunk and embed documents into a vector store (ChromaDB or FAISS), retrieve the relevant chunks at query time, and inject them into the LLM's ...

Mar 9, 2026•12 min read

Computer Vision(1)

Diffusion Models: How AI Creates Art from Noise

TLDR: Diffusion models work by first learning to add noise to an image, then learning to undo that noise. At inference time you start from pure static and iteratively denoise into a meaningful image. They power DALL-E, Midjourney, and Stable Diffusio...

Mar 9, 2026•12 min read

Architecture(1)

A Guide to Pre-training Large Language Models

TLDR: Pre-training is the phase where an LLM learns "Language" and "World Knowledge" by reading petabytes of text. It uses Self-Supervised Learning to predict the next word in a sentence. This creates the "Base Model" which is later fine-tuned. 📖 ...

Mar 9, 2026•14 min read

#Embeddings(1)

A Beginner's Guide to Vector Database Principles

TLDR: A vector database stores meaning as numbers so you can search by intent, not exact keywords. That is why "reset my password" can find "account recovery steps" even if the words are different. 📖 Searching by Meaning, Not by Words A standard d...

Mar 9, 2026•14 min read

Networking(1)

API Gateway vs. Load Balancer vs. Reverse Proxy: What's the Difference?

TLDR: A Reverse Proxy hides your servers and handles caching/SSL. A Load Balancer spreads traffic across server instances. An API Gateway manages API concerns — auth, rate limiting, routing, and protocol translation. Modern tools (Nginx, AWS ALB, Kon...

Feb 16, 2026•14 min read

Langchain(1)

Mastering Prompt Templates: System, User, and Assistant Roles with LangChain

TLDR: Prompt templates are the contract between your application and the LLM. Role-based messages (System / User / Assistant) provide structure. LangChain's ChatPromptTemplate and MessagesPlaceholder turn ad-hoc strings into versioned, testable pipel...

Feb 15, 2026•14 min read

Advanced Ai(1)

Advanced AI: Agents, RAG, and the Future of Intelligence

TLDR: Large Language Models are brilliant "brains in a jar." Retrieval-Augmented Generation (RAG) hands them a constantly refreshed memory, while AI Agents give them tools to act in the world. Combined, they turn static knowledge into dynamic, goal-d...

Feb 8, 2026•14 min read

Ethics(1)

Ethics in AI: Bias, Safety, and the Future of Work

TLDR: 🤖 AI inherits the biases of its creators and data, can act unsafely if misaligned with human values, and is already reshaping the labor market. Understanding these issues — and the tools to address them — is essential for anyone building or us...

Feb 8, 2026•13 min read

Nlp(1)

Natural Language Processing (NLP): Teaching Computers to Read

TLDR: 🌟 NLP turns raw text into numbers so machines can read, understand, and generate language. The field evolved from counting words (Bag-of-Words) to contextual Transformers — each leap brings richer meaning, new capabilities, and different engin...

Feb 8, 2026•13 min read