What is de-threading?

Modified on Tue, 20 Dec 2022 at 10:46 AM

What is de-threading?


Simply put; de-threading is the process of removing thread data from emails such that you don't need to review the same emails over, and over, and over.


Why is that not deduplication? 


Deduplication is the process of removing exact, or near-duplicate documents. De-threading is a somewhat more complex concept, where emails are removed if they also exist as an appendix on the bottom of another email. 


How it works


Smartbox performs advanced content-analysis on emails to determine whether the appended thread information at the bottom also appears in other emails. Based on this, extraneous thread emails are removed from the system


Many systems use embedded thread ID headers to perform such operations, however by performing content analysis, Smartbox is able to improve on this approach in several ways:  

  • Smartbox will identify any tampering with the thread below and include all different versions for review
  • No reliance on different header types by different clients
  • Supports scanned emails, such as PDFs, which would not include a thread ID header
  • Supports complex thread structures by keeping every 'leaf node' as opposed to having to assume that the latest email is the only relevant one.




Was this article helpful?

That’s Great!

Thank you for your feedback

Sorry! We couldn't be helpful

Thank you for your feedback

Let us know how can we improve this article!

Select atleast one of the reasons

Feedback sent

We appreciate your effort and will try to fix the article