O'Reilly Media

Learning AutoML

Automating ML Pipelines with AutoGluon, Leading Frameworks, and Real-World Integration

Complete code companion with production-ready Jupyter notebooks covering feature engineering, neural architecture search, tabular data, NLP, time series, computer vision, and MLOps deployment.

10 Chapters
3 Case Studies
100+ Code Examples
Learning AutoML - O'Reilly Book Cover
AutoGluon PyTorch TensorFlow MLflow Optuna Ray Tune FastAPI Airflow

Chapter Notebooks

Production-ready code implementations for every major topic

04

Automated Feature Engineering

Master data preprocessing with intelligent profiling, outlier detection, encoding strategies, and multimodal feature fusion using a simulated e-commerce dataset.

  • Data Quality Assessment
  • Advanced Categorical Encoding
  • Feature Selection Methods
  • Multimodal Fusion
View Notebook
06

Neural Architecture Search

Implement NAS from scratch with search space design, evolutionary algorithms, DARTS, performance estimation strategies, and one-shot supernet training.

  • Search Space Design
  • Evolutionary & DARTS
  • Successive Halving
  • AutoKeras & Optuna
View Notebook
07

Working with Tabular Data

Complete guide to AutoGluon's TabularPredictor for classification and regression, from basics to advanced customization with SHAP interpretability.

  • TabularPredictor Basics
  • Automatic Data Processing
  • SHAP Analysis
  • Production Monitoring
View Notebook
08

Processing Text & NLP

Deep dive into text classification, NER, semantic matching, and multimodal text+tabular processing with AutoGluon's MultiModalPredictor.

  • Text Classification
  • Named Entity Recognition
  • Semantic Matching
  • Custom Preprocessing
View Notebook
09

Time Series Forecasting

Build forecasting systems with AutoGluon including Chronos-Bolt foundation models, probabilistic calibration, and business impact optimization.

  • Multi-Series Forecasting
  • Zero-Shot with Chronos
  • Probabilistic Calibration
  • Inventory Optimization
View Notebook
11

MLOps Integration

Integrate AutoML into production with MLflow experiment tracking, model registry, Kubeflow pipelines, drift detection, and automated retraining.

  • MLflow Tracking
  • Model Registry
  • Drift Detection
  • Auto Retraining
View Notebook
12

Data Pipelines with Airflow

Build robust data pipelines with validation gates, point-in-time feature engineering, incremental processing, and data contracts.

  • Data Validation
  • Feature Engineering
  • Incremental Processing
  • Drift Detection
View Notebook
13

Deployment & CI/CD

Production deployment patterns with FastAPI, shadow deployments, Prometheus metrics, Evidently drift detection, and continuous learning.

  • FastAPI Serving
  • Shadow Deployment
  • Prometheus Metrics
  • Continuous Learning
View Notebook

Industry Case Studies

End-to-end production implementations with real business impact

🏦
Financial Services

Fraud Detection System

Production-ready implementation processing 50M daily transactions with 62% reduction in fraud losses and <100ms latency.

$223.5M Annual Value
4,470x ROI
94ms p99 Latency
Explore Implementation →
🛒
Retail

Demand Forecasting

Omnichannel forecasting across 450 stores × 50K SKUs with stockout detection, weather effects, and hierarchical predictions.

11.8% MAPE
$43M Capital Freed
6.8% Stockout Rate
Explore Implementation →
🏥
Healthcare

Readmission Prediction

Fairness-aware AutoML for 30-day readmission risk with HIPAA compliance, multimodal EHR learning, and clinical interpretability.

0.73 AUC
<3pp Fairness Gap
$79.1M Annual Value
Explore Implementation →

Getting Started

Set up your environment and start learning in minutes

1

Clone the Repository

git clone https://github.com/tomakk/OReillyAutoMLBook.git
2

Create Environment

conda create -n automl python=3.10 && conda activate automl
3

Install Dependencies

pip install autogluon torch tensorflow mlflow optuna
4

Launch Jupyter

jupyter lab

📋 Requirements

  • Python 3.10+ (3.11 recommended)
  • AutoGluon 1.5.0+ for latest features
  • CUDA-capable GPU recommended for deep learning chapters
  • 16GB+ RAM for large-scale examples