Python | November 19, 2025

Python in AI 

This chapter explains how Python is used across the entire AI workflow: data preparation, classical machine learning, deep learning, natural language processing, computer vision, model evaluation, deployment, scaling, MLOps, reproducibility, and responsible AI. For every subtopic you get a conceptual explanation first (theory) followed by a concrete practical example you can run and extend.

Introduction: Why Python for AI 

Python is the dominant language in AI for reasons that are practical and historical:

  • Rich ecosystem: mature libraries for numerical computing (NumPy), data manipulation (Pandas), ML (scikit-learn), deep learning (TensorFlow, PyTorch), NLP (Transformers), CV (OpenCV, torchvision).
  • Expressiveness: concise syntax, rapid prototyping, readability.
  • Interoperability: easy bindings to C/C++ libraries, GPU-enabled runtimes, and tooling for production.
  • Community & tooling: notebooks, visualization, experiment tracking, and cloud integration.
  • Research-to-production path: many research prototypes in Python have direct upgrade paths to production with frameworks and deployment tools.

Python is used through the entire lifecycle: exploration in notebooks, training on workstations/GPUs, distributed training in clusters, packaging models for inference, and orchestrating ML pipelines.

Data Handling & Preprocessing 

Data quality drives model quality. Preprocessing tasks include: loading heterogeneous data, cleaning (missing values, outliers), normalization/standardization, encoding categorical variables, feature engineering, time-series transformations, tokenization for text, image augmentation for CV, and splitting data correctly (train/val/test, stratification). Important principles: avoid data leakage, preserve reproducibility, and log transformations so inference receives identical preprocessing.

Key libraries: pandas, NumPy, scikit-learn preprocessing, albumentations/torchvision for augmentation, Hugging Face tokenizers for text.

Practical — Loading, cleaning, encoding, and scaling

# requires: pip install pandas scikit-learn
import pandas as pd
from sklearn.model_selection import train_test_split
from sklearn.impute import SimpleImputer
from sklearn.preprocessing import StandardScaler, OneHotEncoder
from sklearn.compose import ColumnTransformer
from sklearn.pipeline import Pipeline

# sample dataframe
df = pd.DataFrame({
    "age": [25, 30, None, 40, 22],
    "income": [50000, 60000, 55000, None, 48000],
    "city": ["NY","SF","NY","LA","SF"],
    "target": [0,1,0,1,0]
})

X = df.drop(columns="target")
y = df["target"]

numeric_features = ["age","income"]
categorical_features = ["city"]

numeric_transformer = Pipeline(steps=[
    ("impute", SimpleImputer(strategy="median")),
    ("scale", StandardScaler())
])

cat_transformer = Pipeline(steps=[
    ("impute", SimpleImputer(strategy="most_frequent")),
    ("onehot", OneHotEncoder(handle_unknown="ignore"))
])

preprocessor = ColumnTransformer(transformers=[
    ("num", numeric_transformer, numeric_features),
    ("cat", cat_transformer, categorical_features),
])

# test_size=0.4 so each class appears in both splits of this tiny sample
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.4, random_state=42, stratify=y)
X_train_proc = preprocessor.fit_transform(X_train)
X_test_proc = preprocessor.transform(X_test)

print("Processed train shape:", X_train_proc.shape)

This pipeline ensures the same fitted transformers are applied to train and test sets.

Classical Machine Learning with scikit-learn 

scikit-learn provides algorithms and helper tools for supervised and unsupervised learning, pipelines, model selection, cross-validation, and evaluation metrics. It encourages small, composable estimators and consistent APIs: fit(), predict(), transform(), score(). Typical flow: preprocess → pipeline → cross-validate → tune hyperparameters → evaluate.

Important concepts: bias–variance tradeoff, cross-validation, overfitting/underfitting, regularization, class imbalance handling, feature importance.

Practical — Simple classification pipeline + cross-validation

# requires: pip install scikit-learn
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import cross_val_score
from sklearn.pipeline import Pipeline
import numpy as np

pipe = Pipeline(steps=[
    ("pre", preprocessor),
    ("clf", RandomForestClassifier(n_estimators=100, random_state=42))
])

# cross-validate on the small sample (cv=2 because the minority class has only two rows)
scores = cross_val_score(pipe, X, y, cv=2, scoring="accuracy")
print("CV accuracy:", np.round(scores, 3), "mean:", scores.mean())

For realistic datasets, add GridSearchCV or RandomizedSearchCV to tune hyperparameters.
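As a minimal sketch of that tuning step, the following uses GridSearchCV on a synthetic dataset (the dataset and parameter grid here are illustrative, not from the chapter's example):

```python
# requires: pip install scikit-learn
# Exhaustive search over a small hyperparameter grid with 3-fold CV.
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import GridSearchCV

Xs, ys = make_classification(n_samples=200, n_features=8, random_state=0)

grid = GridSearchCV(
    RandomForestClassifier(random_state=0),
    param_grid={"n_estimators": [50, 100], "max_depth": [3, None]},
    cv=3,
    scoring="accuracy",
)
grid.fit(Xs, ys)
print("Best params:", grid.best_params_)
print("Best CV accuracy:", round(grid.best_score_, 3))
```

RandomizedSearchCV has the same interface but samples from distributions instead of enumerating the grid, which scales better to large search spaces.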

Deep Learning Fundamentals 

Deep learning focuses on neural networks: layers of parameterized functions trained via gradient-based optimization (backpropagation). Frameworks (PyTorch, TensorFlow) provide tensor abstractions, automatic differentiation (autograd), optimization algorithms (SGD, Adam), data loaders, and model building blocks.
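The core loop — compute a loss, differentiate, step the parameters — can be sketched without any framework. This toy example (all names and constants illustrative) fits a single linear weight using a hand-derived gradient; autograd automates exactly this differentiation step:

```python
import numpy as np

# Toy gradient descent: fit y = 3x with one weight w, using the
# hand-derived gradient of mean squared error d/dw mean((wx - y)^2).
rng = np.random.default_rng(0)
x = rng.normal(size=100)
y = 3.0 * x

w = 0.0
lr = 0.1
for step in range(100):
    pred = w * x
    grad = np.mean(2 * (pred - y) * x)  # gradient of the loss w.r.t. w
    w -= lr * grad

print("learned w:", round(w, 3))  # converges close to 3.0
```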

Key considerations:

  • Architecture selection (MLP, CNN, RNN/Transformer)
  • Loss functions & metrics
  • Batch size, learning rate scheduling
  • Regularization (dropout, weight decay)
  • Data augmentation
  • GPU acceleration
  • Mixed precision training & gradient accumulation for large batches
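One item from the list above, learning-rate scheduling, can be illustrated independently of any framework. This step-decay sketch (constants are illustrative) halves the rate every `step_size` epochs; PyTorch's `torch.optim.lr_scheduler.StepLR` implements the same rule:

```python
# Step decay: multiply the base learning rate by gamma every step_size epochs.
def step_decay(base_lr, epoch, step_size=10, gamma=0.5):
    return base_lr * (gamma ** (epoch // step_size))

for epoch in (0, 9, 10, 25):
    print(epoch, step_decay(1e-1, epoch))
```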

Practical — Small feedforward network in PyTorch

# requires: pip install torch (see pytorch.org for the right build for your platform/GPU)
import torch
import torch.nn as nn
from torch.utils.data import TensorDataset, DataLoader

# toy dataset
X = torch.randn(500, 10)
y = (X[:,0] + X[:,1]*0.5 > 0).long()

dataset = TensorDataset(X, y)
loader = DataLoader(dataset, batch_size=32, shuffle=True)

class Net(nn.Module):
    def __init__(self):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(10, 64),
            nn.ReLU(),
            nn.Linear(64, 2)
        )
    def forward(self, x):
        return self.net(x)

device = torch.device("cuda" if torch.cuda.is_available() else "cpu")
model = Net().to(device)
opt = torch.optim.Adam(model.parameters(), lr=1e-3)
loss_fn = nn.CrossEntropyLoss()

for epoch in range(5):
    total_loss = 0.0
    for xb, yb in loader:  # reuse the loader defined above
        xb, yb = xb.to(device), yb.to(device)
        opt.zero_grad()
        logits = model(xb)
        loss = loss_fn(logits, yb)
        loss.backward()
        opt.step()
        total_loss += loss.item()
    print(f"Epoch {epoch+1} loss {total_loss/len(loader):.4f}")

This pattern (dataset → dataloader → training loop) is core to deep learning in Python.

Natural Language Processing (NLP) 

NLP tasks include classification, sequence labeling, language generation, translation. Modern state-of-the-art uses Transformer architectures (BERT, GPT). Python libraries (NLTK, spaCy) cover preprocessing; Hugging Face Transformers enables easy use of pretrained models and fine-tuning. Important themes: tokenization, subword vocabularies, contextual embeddings, transfer learning, fine-tuning vs feature extraction.

Practical — Text classification with Hugging Face Transformers

# requires: pip install transformers datasets torch
from transformers import AutoTokenizer, AutoModelForSequenceClassification, Trainer, TrainingArguments
from datasets import load_dataset

model_name = "distilbert-base-uncased"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForSequenceClassification.from_pretrained(model_name, num_labels=2)

# small dataset example (use glue or imdb in real tasks)
ds = load_dataset("imdb", split="train[:1%]")
ds = ds.train_test_split(test_size=0.1)

def tokenize(batch):
    return tokenizer(batch["text"], truncation=True, padding="max_length", max_length=128)

ds = ds.map(tokenize, batched=True)
ds = ds.rename_column("label", "labels")
ds.set_format(type="torch", columns=["input_ids","attention_mask","labels"])

training_args = TrainingArguments(output_dir="./hf_tmp", per_device_train_batch_size=8, num_train_epochs=1, logging_steps=10)
trainer = Trainer(model=model, args=training_args, train_dataset=ds["train"], eval_dataset=ds["test"])
trainer.train()

Hugging Face greatly reduces boilerplate for fine-tuning transformer models.

Computer Vision (CV) 

CV models range from classical image processing (filters, edge detection) to deep models (CNNs, ResNets, Vision Transformers). Data pipelines include resizing, normalization, augmentation (random flips, crops, color jitter), and handling large image datasets via efficient transforms and caching. Pretrained models and transfer learning are common.

Practical — Transfer learning with torchvision

# requires: pip install torch torchvision
import torch
from torchvision import models, transforms, datasets
from torch.utils.data import DataLoader

# use a small dataset like CIFAR10 for example
transform = transforms.Compose([
    transforms.Resize(224),
    transforms.ToTensor(),
    # ImageNet statistics, matching what the pretrained weights expect
    transforms.Normalize(mean=(0.485, 0.456, 0.406), std=(0.229, 0.224, 0.225))
])

train_ds = datasets.CIFAR10(root="./data", train=True, download=True, transform=transform)
train_loader = DataLoader(train_ds, batch_size=64, shuffle=True)

model = models.resnet18(weights=models.ResNet18_Weights.DEFAULT)  # pretrained=True is deprecated
# replace final layer
model.fc = torch.nn.Linear(model.fc.in_features, 10)
device = torch.device("cuda" if torch.cuda.is_available() else "cpu")
model = model.to(device)

opt = torch.optim.SGD(model.parameters(), lr=0.01, momentum=0.9)
loss_fn = torch.nn.CrossEntropyLoss()

# one epoch
model.train()
for xb, yb in train_loader:
    xb, yb = xb.to(device), yb.to(device)
    opt.zero_grad()
    out = model(xb)
    loss = loss_fn(out, yb)
    loss.backward()
    opt.step()

This shows how to adapt a pretrained network for a new classification task.

Evaluation, Metrics & Model Selection 

Choose metrics that align with business goals: accuracy, precision/recall/F1, ROC-AUC, mean squared error, mean absolute error, BLEU/ROUGE for text, IoU for segmentation. Use confusion matrices for classification and proper cross-validation schemes (time-series cross-validation for temporal data). Evaluate calibration and class imbalance. Use holdout test set and careful experiment logging.

Practical — Classification evaluation

from sklearn.metrics import classification_report, confusion_matrix, roc_auc_score
import numpy as np

# fake predictions
y_true = np.array([0,1,1,0,1,0])
y_pred = np.array([0,1,0,0,1,1])
y_score = np.array([0.1,0.9,0.3,0.2,0.8,0.6])

print(classification_report(y_true, y_pred))
print("Confusion:\n", confusion_matrix(y_true, y_pred))
print("ROC-AUC:", roc_auc_score(y_true, y_score))

Always inspect per-class performance and error patterns.
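For the temporal data mentioned above, scikit-learn's TimeSeriesSplit produces folds in which training indices always precede test indices, so the model never sees the future. A minimal sketch on synthetic indices:

```python
# requires: pip install scikit-learn
# Each successive fold trains on a longer prefix and tests on the next block.
import numpy as np
from sklearn.model_selection import TimeSeriesSplit

X_ts = np.arange(20).reshape(-1, 1)  # stand-in for time-ordered features
tscv = TimeSeriesSplit(n_splits=4)
for train_idx, test_idx in tscv.split(X_ts):
    # training window ends strictly before the test window begins
    print("train up to", train_idx.max(), "-> test", test_idx.min(), "to", test_idx.max())
```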

GPUs, Mixed Precision & Distributed Training 

GPUs accelerate tensor computations massively. PyTorch/TensorFlow offer CUDA support. Mixed precision (float16) reduces memory and increases throughput. For very large models/data, distribute training across multiple GPUs or nodes (DataParallel, DistributedDataParallel, Horovod, DeepSpeed). Key concerns: reproducibility with random seeds, deterministic ops, handling gradient synchronization, and checkpointing.

Practical — Using GPU and mixed precision in PyTorch (basic)

# Example showing device placement and automatic mixed precision (AMP)
import torch
from torch.amp import GradScaler, autocast  # torch.cuda.amp is deprecated in newer releases

device = torch.device("cuda" if torch.cuda.is_available() else "cpu")
model = Net().to(device)
opt = torch.optim.Adam(model.parameters(), lr=1e-3)
scaler = GradScaler("cuda")

for xb, yb in DataLoader(dataset, batch_size=64):
    xb, yb = xb.to(device), yb.to(device)
    opt.zero_grad()
    with autocast("cuda"):        # automatic mixed precision context
        logits = model(xb)
        loss = torch.nn.functional.cross_entropy(logits, yb)
    scaler.scale(loss).backward()
    scaler.step(opt)
    scaler.update()

For multi-GPU, use DistributedDataParallel and launch with torchrun (the older torch.distributed.launch is deprecated).

Model Persistence & Inference 

Persist model weights and preprocessing artifacts. Best practice: save model state, tokenizer/encoder, and full inference pipeline. For PyTorch use state_dict, for TF use SavedModel or checkpoints. For reproducible inference, store model hyperparameters and library versions.
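For scikit-learn, the "save the full inference pipeline" advice can be sketched with joblib (file name and model choice here are illustrative): persisting a fitted Pipeline keeps preprocessing and model together in one artifact.

```python
# requires: pip install scikit-learn joblib
# Persist preprocessing + model as a single artifact so inference applies
# exactly the transforms fitted at training time.
import joblib
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import StandardScaler

Xs, ys = make_classification(n_samples=100, n_features=4, random_state=0)
pipe = Pipeline([("scale", StandardScaler()), ("clf", LogisticRegression())])
pipe.fit(Xs, ys)

joblib.dump(pipe, "pipeline.joblib")
restored = joblib.load("pipeline.joblib")
print("round-trip OK:", (restored.predict(Xs) == pipe.predict(Xs)).all())
```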

Deployment patterns:

  • Batch inference scripts
  • REST/gRPC microservice (FastAPI + Uvicorn)
  • Serverless functions for small models
  • On-device (mobile) using ONNX, TorchScript, TensorFlow Lite
  • Model serving platforms: TorchServe, TensorFlow Serving, Triton Inference Server

Practical — Save/load PyTorch model and simple FastAPI inference

# Save
torch.save(model.state_dict(), "model.pth")

# Load
model = Net()
model.load_state_dict(torch.load("model.pth", map_location="cpu"))
model.eval()

# Simple FastAPI inference skeleton:
# pip install fastapi uvicorn
from fastapi import FastAPI
import uvicorn
import torch

app = FastAPI()
# `model` is the network loaded above; in production, load weights at startup

@app.post("/predict")
def predict(payload: dict):
    # payload: features list
    x = torch.tensor([payload["features"]], dtype=torch.float32)
    with torch.no_grad():
        logits = model(x)
        probs = torch.nn.functional.softmax(logits, dim=-1).tolist()
    return {"probs": probs}

# run: uvicorn myapp:app --host 0.0.0.0 --port 8000

Wrap preprocessing in the service so inputs match training transforms.

Explainability & Fairness 

Explainability (SHAP, LIME, feature importance) helps understand model decisions. Fairness auditing includes evaluating disparate impact across groups, bias metrics, and mitigation strategies (reweighting, adversarial debiasing, fairness-aware training). For regulated domains, explainability and audit trails are essential.

Practical — Feature importance with a tree model

# requires: pip install scikit-learn
from sklearn.ensemble import RandomForestClassifier
import numpy as np

rf = RandomForestClassifier(random_state=42).fit(X_train_proc, y_train)
importances = rf.feature_importances_
print("Top features:", np.argsort(importances)[::-1][:5])

For deeper explanations, use SHAP:

# pip install shap
import shap
explainer = shap.TreeExplainer(rf)
shap_values = explainer.shap_values(X_test_proc)
shap.summary_plot(shap_values, feature_names=["age","income","city_SF","city_NY",...])
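On the fairness side, disparate impact can be checked with a few lines of NumPy. The data below is illustrative, and the 0.8 threshold is the common "four-fifths rule" heuristic, not a legal standard:

```python
import numpy as np

# Selection rate per group and the disparate-impact ratio
# (four-fifths rule: ratios below 0.8 warrant investigation).
group = np.array(["A", "A", "A", "B", "B", "B", "B", "B"])
pred  = np.array([ 1,   1,   0,   1,   0,   0,   1,   0 ])

rate_a = pred[group == "A"].mean()
rate_b = pred[group == "B"].mean()
ratio = min(rate_a, rate_b) / max(rate_a, rate_b)
print("selection rates:", rate_a, rate_b, "ratio:", round(ratio, 2))
```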

Hyperparameter Tuning & Experimentation 

Use systematic search (grid/random) or adaptive search (Bayesian optimization — Optuna, Hyperopt) to find hyperparameters. Track experiments (parameters, metrics, artifacts) using MLflow, Weights & Biases, or TensorBoard. Use sensible defaults and budgets, tune on validation set, avoid peeking at test set.

Practical — Simple Optuna tuning for a tree hyperparameter

# pip install optuna scikit-learn
import optuna
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import cross_val_score

# synthetic data; the chapter's five-row frame is too small for tuning
X_opt, y_opt = make_classification(n_samples=200, n_features=8, random_state=42)

def objective(trial):
    # for deep learning, suggest a learning rate instead and report/prune per epoch
    max_depth = trial.suggest_int("max_depth", 2, 16)
    clf = RandomForestClassifier(n_estimators=100, max_depth=max_depth, random_state=42)
    return cross_val_score(clf, X_opt, y_opt, cv=3, scoring="accuracy").mean()

study = optuna.create_study(direction="maximize")
study.optimize(objective, n_trials=20)
print("Best params:", study.best_params)

For deep learning, integrate Optuna with training loop (report intermediate metrics and prune).

Data & Model Versioning, Reproducibility 

Version data and code. Tools: DVC for data versioning, Git for code, MLflow/W&B for experiment runs. Reproducibility requires: deterministic seeds, documenting library versions (pip freeze or lockfile), containerization (Docker), and storing preprocessing pipelines.

Practical — simple reproducibility checklist and random seeds

import random, numpy as np, torch, os

def set_seed(seed=42):
    random.seed(seed)
    np.random.seed(seed)
    torch.manual_seed(seed)
    if torch.cuda.is_available():
        torch.cuda.manual_seed_all(seed)
    os.environ["PYTHONHASHSEED"] = str(seed)

set_seed(42)

Use Dockerfiles + requirements.txt to freeze the environment.
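Alongside freezing the environment, the exact dataset used for a run can be recorded with a content hash — a lightweight complement to DVC. A sketch using only the standard library (file names illustrative):

```python
import hashlib

# Fingerprint a data file so the exact dataset version used for a run
# can be logged alongside the code commit and library versions.
def file_sha256(path, chunk_size=8192):
    h = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            h.update(chunk)
    return h.hexdigest()

# example: write a tiny file and hash it
with open("data.csv", "w") as f:
    f.write("age,income\n25,50000\n")
print("dataset sha256:", file_sha256("data.csv"))
```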

MLOps & Production 

MLOps combines DevOps and ML engineering. Topics: CI/CD for models, automated testing (unit + integration + data tests), model validation gates, drift detection, retraining pipelines, monitoring (latency, throughput, accuracy), data lineage, and governance. Orchestration: Airflow, Kubeflow, Prefect. Serving & scaling: containers, Kubernetes, model servers, autoscaling policies.

Practical — Minimal ML pipeline outline with Prefect/Airflow (pseudo)

A Prefect flow could orchestrate: extract → transform → train → validate → push model artifact → deploy. Full examples are extensive; start with small automated pipeline that runs tests and triggers training on new data.
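The extract → transform → train → validate chain above can be sketched as plain functions before introducing an orchestrator; Prefect or Airflow wrap steps like these with scheduling, retries, and logging. All names and the trivial "model" here are illustrative:

```python
# A minimal extract -> transform -> train -> validate chain in plain Python.
def extract():
    # stand-in for loading rows of (feature, label) from a source
    return [(1.0, 0), (2.0, 0), (3.0, 1), (4.0, 1)]

def transform(rows):
    # min-max scale the single feature
    xs = [x for x, _ in rows]
    lo, hi = min(xs), max(xs)
    return [((x - lo) / (hi - lo), y) for x, y in rows]

def train(rows):
    # trivial "model": threshold at the mean of the scaled feature
    threshold = sum(x for x, _ in rows) / len(rows)
    return lambda x: int(x > threshold)

def validate(model, rows):
    correct = sum(model(x) == y for x, y in rows)
    return correct / len(rows)

rows = transform(extract())
model = train(rows)
print("validation accuracy:", validate(model, rows))
```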

Responsible AI & Ethics 

AI systems affect people. Consider privacy, bias, explainability, consent, and safety. Techniques: differential privacy, federated learning, model auditing, anonymization, and minimizing sensitive data collection. Document intended model use, limitations, and failure modes.

Practical — short checklist to include with any model

  • Who is the intended user and impact?
  • What data was used? Was it consented?
  • Which groups may be disadvantaged?
  • How to monitor drift or degraded fairness?
  • How to roll back a model in production?

Include these answers in model cards or documentation.

Next Steps & Roadmap 

  • Learn core libraries: NumPy, pandas, scikit-learn, PyTorch/TensorFlow.
  • Hands-on: train small projects (tabular ML, simple CNN, text classifier).
  • Experiment tracking: set up MLflow or W&B for a project.
  • Deployment: containerize a model behind a FastAPI endpoint and test locally.
  • MLOps basics: automate a pipeline with GitHub Actions + a scheduled retrain.
  • Follow ethics & governance best practices and keep reproducibility front and center.

Practical short plan:

  1. Start a project: choose a dataset (Kaggle, UCI), build baseline with scikit-learn.
  2. Improve with deep learning (PyTorch) if task benefits.
  3. Track experiments & hyperparams.
  4. Save model + pipeline, create API with FastAPI.
  5. Add CI to run tests and training smoke checks.
  6. Monitor model in a small deployment.

 

Sanjiv