Deduplicating Images

The Image Deduplication functionality identifies and manages duplicate images to ensure that only one version of a particular image is maintained in the system. This provides a single source of truth which ensures consistent and accurate image data, regardless of the number of objects using the image.

Image Deduplication compares and evaluates all images within a classification, regardless of encoding differences (such as file type or color model) or being referenced to products. Essentially, if images look the same, they are considered duplicates. Images that use CMYK and RGB color models and have the extensions in the table below are considered by the process.

Image Deduplication File Types

  • .BMP
  • .GIF
  • .JPEG
  • .JPG
  • .MSL
  • .MVG
  • .P7
  • .PBM
  • .PNG
  • .PNM
  • .PPM
  • .PSD
  • .TIF
  • .TIFF
  • .XWD

For example, selecting a single parent classification node would recursively compare all images of the identified types within the node to determine potential duplicates, but would not consider images in other nodes.

Running the image deduplication process includes:

To access the Image Deduplication functionality, the 'asset-deduplication' component must be activated on your system. Contact your Stibo Systems representative for details.

Limitations

The following limitations should be considered when evaluating the Image Deduplication functionality:

Additional Information

Image Deduplication can be configured and run as defined in the following topics:

The following topics provide an explanation of how image deduplication works and an example of image deduplication: