Text Duplicate Pivot Documents to Reduce Review Volume

IN THIS ARTICLE:

This workflow provides details on how to use MA's text duplicate identification output to focus review efforts on pivot documents within near-duplicate document groups to reduce review volumes to only one document per text duplicate group.

WHO CAN PERFORM:

You must have the following MA permissions:

Create and manage sets—can create and edit analytics sets. When disabled, it hides the Overlay and Edit buttons.

Overlay results—for overlaying results into Relativity. When disabled, it disables the auto-overlay toggle and the manual overlay buttons.

In addition to the above permissions, Relativity Object and other Relativity permissions must be enabled for full functionality.

What is Text Duplicate identification?

" Text duplicates" are documents that have been identified and grouped based on the matching of text information, even if the native documents themselves would not hash identically due to native file formatting differences or white space.  This makes text duplicates more reliable for use with coding propagation and isolating pivots to review the full amount of unique information a text duplicate group has to offer. Each document belongs to only one text duplicate set and each text duplicate set contains at least one document. Unlike the settings for near-duplicate identification, Text Duplicate settings do not require any decisions regarding similarity or file comparison.

There are two primary features in text duplicate identification that are important when reducing review volumes to focus only on text duplicate pivot documents:

  • Text Duplicate Pivot: the pivot document is based on the document within a text duplicate group that represents the first document encountered within a text duplicate group. Once assigned, pivots do not adjust, for example, when there is an incremental run.
  • Text Duplicate Group: a Text Duplicate group contains the full set of documents that have been identified containing matching text.

Saved Search rules for returning pivot Text Duplicate documents

These are our recommended settings for a source saved search for a text duplicate document set intended to only return a single pivot document per each text duplicate document group:

Saved Search Criteria:

  • TDGroup is set
  • ::Is Pivot = Y

Relativity Search Criteria for Text Duplicate Pivot Documents

All other settings can be configured according to the needs of the project.

Back to top