6.2 KiB
6.2 KiB
Argument | Type | Default | Range | Description |
---|---|---|---|---|
hsv_h |
float |
{{ hsv_h }} |
0.0 - 1.0 |
Adjusts the hue of the image by a fraction of the color wheel, introducing color variability. Helps the model generalize across different lighting conditions. |
hsv_s |
float |
{{ hsv_s }} |
0.0 - 1.0 |
Alters the saturation of the image by a fraction, affecting the intensity of colors. Useful for simulating different environmental conditions. |
hsv_v |
float |
{{ hsv_v }} |
0.0 - 1.0 |
Modifies the value (brightness) of the image by a fraction, helping the model to perform well under various lighting conditions. |
degrees |
float |
{{ degrees }} |
0.0 - 180 |
Rotates the image randomly within the specified degree range, improving the model's ability to recognize objects at various orientations. |
translate |
float |
{{ translate }} |
0.0 - 1.0 |
Translates the image horizontally and vertically by a fraction of the image size, aiding in learning to detect partially visible objects. |
scale |
float |
{{ scale }} |
>=0.0 |
Scales the image by a gain factor, simulating objects at different distances from the camera. |
shear |
float |
{{ shear }} |
-180 - +180 |
Shears the image by a specified degree, mimicking the effect of objects being viewed from different angles. |
perspective |
float |
{{ perspective }} |
0.0 - 0.001 |
Applies a random perspective transformation to the image, enhancing the model's ability to understand objects in 3D space. |
flipud |
float |
{{ flipud }} |
0.0 - 1.0 |
Flips the image upside down with the specified probability, increasing the data variability without affecting the object's characteristics. |
fliplr |
float |
{{ fliplr }} |
0.0 - 1.0 |
Flips the image left to right with the specified probability, useful for learning symmetrical objects and increasing dataset diversity. |
bgr |
float |
{{ bgr }} |
0.0 - 1.0 |
Flips the image channels from RGB to BGR with the specified probability, useful for increasing robustness to incorrect channel ordering. |
mosaic |
float |
{{ mosaic }} |
0.0 - 1.0 |
Combines four training images into one, simulating different scene compositions and object interactions. Highly effective for complex scene understanding. |
mixup |
float |
{{ mixup }} |
0.0 - 1.0 |
Blends two images and their labels, creating a composite image. Enhances the model's ability to generalize by introducing label noise and visual variability. |
cutmix |
float |
{{ cutmix }} |
0.0 - 1.0 |
Combines portions of two images, creating a partial blend while maintaining distinct regions. Enhances model robustness by creating occlusion scenarios. |
copy_paste |
float |
{{ copy_paste }} |
0.0 - 1.0 |
Segmentation only. Copies and pastes objects across images to increase object instances. |
copy_paste_mode |
str |
{{ copy_paste_mode }} |
- | Segmentation only. Specifies the copy-paste strategy to use. Options include 'flip' and 'mixup' . |
auto_augment |
str |
{{ auto_augment }} |
- | Classification only. Applies a predefined augmentation policy ('randaugment' , 'autoaugment' , or 'augmix' ) to enhance model performance through visual diversity. |
erasing |
float |
{{ erasing }} |
0.0 - 0.9 |
Classification only. Randomly erases regions of the image during training to encourage the model to focus on less obvious features. |