Data deduplication

Posted over 2 years ago by Mark Callahan

Deduplication allows you to remove any repeated data to make managing it easier and avoid having to repeat work. Smartbox automatically identifies duplicate files (even if they do not share the same name), labels them, and generates a Duplicates Report.

Duplicates can be deleted manually or deleted in bulk for all duplicates within a Box or for all instances of a particular file. 

If you wish to delete them manually, navigate your box and delete any duplicate files that are marked with a purple "D" icon.


1. After data upload, open you box and click on "Cull" tab at the top of your page and click on "Duplicates" tab.

2. Select the checkbox next to Select all and click Remove Duplicates to initiate the bulk deduplication process 

3. The most recent version of each duplicate file is kept in Smartbox and the remainder are deleted

  • This process usually takes only a few seconds but is dependent on the size of the dataset and number of duplicates.

Was this article helpful?

That’s Great!

Thank you for your feedback

Sorry! We couldn't be helpful

Thank you for your feedback

Let us know how can we improve this article!

Select atleast one of the reasons
CAPTCHA verification is required.

Feedback sent

We appreciate your effort and will try to fix the article