What is de-threading?
Simply put, de-threading is the process of removing emails whose content is already repeated in the thread of a later email, so that you don't need to review the same messages over and over.
Why is that not deduplication?
Deduplication is the process of removing exact or near-duplicate documents. De-threading is a somewhat more complex concept: an email is removed if its content also appears as quoted thread text at the bottom of another email.
How it works
Smartbox performs content analysis on emails to determine whether the thread text appended at the bottom of one email also appears in other emails. Based on this, extraneous thread emails are removed from the system.
Many systems use embedded thread ID headers to perform such operations. By performing content analysis instead, Smartbox is able to improve on this approach in several ways:
- Identifies any tampering with the quoted thread text and includes every differing version for review
- Does not rely on thread headers, which differ between email clients
- Supports scanned emails, such as PDFs, which would not include a thread ID header
- Supports complex thread structures by keeping every 'leaf node', rather than assuming that the latest email is the only relevant one
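The idea of keeping every 'leaf node' can be illustrated with a minimal Python sketch. This is not Smartbox's actual implementation, which is not described here. It assumes each email is available as plain text with the quoted thread at the bottom, and the names `normalize` and `dethread` are hypothetical. An email is dropped only when its full text appears at the end of a longer email.

```python
import re


def normalize(text: str) -> str:
    """Strip leading quote markers ('>') and collapse whitespace and case,
    so that a quoted copy of an email compares equal to the original."""
    lines = [re.sub(r"^[>\s]+", "", line) for line in text.splitlines()]
    return " ".join(" ".join(lines).split()).lower()


def dethread(emails: dict[str, str]) -> set[str]:
    """Return the IDs of emails to keep (hypothetical helper).

    An email is treated as redundant when its normalized text appears at
    the end of another, longer email. Everything else is a leaf node.
    """
    norm = {eid: normalize(body) for eid, body in emails.items()}
    keep = set()
    for eid, text in norm.items():
        contained = any(
            other_id != eid
            and len(other_text) > len(text)
            and other_text.endswith(text)
            for other_id, other_text in norm.items()
        )
        if not contained:
            keep.add(eid)
    return keep


# A thread with two branches: "c" and "d" both reply to earlier messages.
thread = {
    "a": "Can we meet Monday?",
    "b": "Tuesday works better.\n> Can we meet Monday?",
    "c": "Tuesday is fine.\n> Tuesday works better.\n>> Can we meet Monday?",
    "d": "Monday is fine.\n> Can we meet Monday?",
}
kept = dethread(thread)
```

Here `kept` contains `c` and `d`: both branches survive, while `a` and `b` are dropped because their text is fully repeated at the bottom of `c`. If the quoted text in a reply had been altered, the original would no longer match and both versions would be kept for review. A real system would need far more robust matching than this suffix comparison, for example to handle signatures, reply headers and OCR noise.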