A comprehensive guide to image augmentation from the creator of Albumentations, covering two distinct regimes: in-distribution augmentation (simulating realistic variations) and out-of-distribution augmentation (unrealistic perturbations for regularization). Key topics include the label preservation rule, building starter pipelines with Albumentations, preventing silent label corruption through target synchronization, task-specific strategies for detection/segmentation/keypoints/medical imaging, matching augmentation strength to model capacity, and a repeatable evaluation protocol. Also covers advanced theory (invariance vs equivariance, manifold perspective) and augmentation in contrastive/self-supervised learning.
Table of contents
ContentsThe Intuition: Transforms That Preserve MeaningWhy Augmentation Helps: Two LevelsThe One Rule: Label PreservationBuild Your First Policy: A Starter PipelinePrevent Silent Label Corruption: Target SynchronizationExpand the Policy Deliberately: Transform FamiliesKnow the Failure Modes Before They Hit ProductionTask-Specific and Targeted AugmentationEvaluate With a Repeatable ProtocolAdvanced: Why These Heuristics WorkBeyond Standard Training: Augmentation in Other ContextsProduction Reality: Operational ConcernsConclusionWhere to Go NextSort: