image_to_pixle_params_yoloSAM/ultralytics-main/docs/en/macros/augmentation-args.md

6.2 KiB

Argument Type Default Range Description
hsv_h float {{ hsv_h }} 0.0 - 1.0 Adjusts the hue of the image by a fraction of the color wheel, introducing color variability. Helps the model generalize across different lighting conditions.
hsv_s float {{ hsv_s }} 0.0 - 1.0 Alters the saturation of the image by a fraction, affecting the intensity of colors. Useful for simulating different environmental conditions.
hsv_v float {{ hsv_v }} 0.0 - 1.0 Modifies the value (brightness) of the image by a fraction, helping the model to perform well under various lighting conditions.
degrees float {{ degrees }} 0.0 - 180 Rotates the image randomly within the specified degree range, improving the model's ability to recognize objects at various orientations.
translate float {{ translate }} 0.0 - 1.0 Translates the image horizontally and vertically by a fraction of the image size, aiding in learning to detect partially visible objects.
scale float {{ scale }} >=0.0 Scales the image by a gain factor, simulating objects at different distances from the camera.
shear float {{ shear }} -180 - +180 Shears the image by a specified degree, mimicking the effect of objects being viewed from different angles.
perspective float {{ perspective }} 0.0 - 0.001 Applies a random perspective transformation to the image, enhancing the model's ability to understand objects in 3D space.
flipud float {{ flipud }} 0.0 - 1.0 Flips the image upside down with the specified probability, increasing the data variability without affecting the object's characteristics.
fliplr float {{ fliplr }} 0.0 - 1.0 Flips the image left to right with the specified probability, useful for learning symmetrical objects and increasing dataset diversity.
bgr float {{ bgr }} 0.0 - 1.0 Flips the image channels from RGB to BGR with the specified probability, useful for increasing robustness to incorrect channel ordering.
mosaic float {{ mosaic }} 0.0 - 1.0 Combines four training images into one, simulating different scene compositions and object interactions. Highly effective for complex scene understanding.
mixup float {{ mixup }} 0.0 - 1.0 Blends two images and their labels, creating a composite image. Enhances the model's ability to generalize by introducing label noise and visual variability.
cutmix float {{ cutmix }} 0.0 - 1.0 Combines portions of two images, creating a partial blend while maintaining distinct regions. Enhances model robustness by creating occlusion scenarios.
copy_paste float {{ copy_paste }} 0.0 - 1.0 Segmentation only. Copies and pastes objects across images to increase object instances.
copy_paste_mode str {{ copy_paste_mode }} - Segmentation only. Specifies the copy-paste strategy to use. Options include 'flip' and 'mixup'.
auto_augment str {{ auto_augment }} - Classification only. Applies a predefined augmentation policy ('randaugment', 'autoaugment', or 'augmix') to enhance model performance through visual diversity.
erasing float {{ erasing }} 0.0 - 0.9 Classification only. Randomly erases regions of the image during training to encourage the model to focus on less obvious features.