View grammars found

From the Data Analysis tab (data analysis tab icon) of the content detail panel, you can view grammar classes and rules found in data items that have been analyzed. If grammar value extraction has been performed you can also view the identified grammar values. If necessary, you can also mark an identified grammar value as a false positive or a table column as not sensitive. You can also remap a grammar value, or an entire grammar rule, to a new grammar rule.

For unstructured data items, you can view the grammar classes, rules, and values identified by the rules. For structured data items, you can view the grammar rules and values identified by the rules and the name of the columns in which a grammar values have been identified.

The number of values next to the grammar rule name is the total number of values identified in the document and any attachments. The percentage next to the grammar value represents the calculation of the confidence level that the identified value is sensitive content.

To view the identified grammar rules and values, select a document or table from the content list in either the List view (list view icon) or Detailed view (detailed view icon). In the content detail panel, click the Data Analysis tab (data analysis tab icon).

NOTE: If content has not been stored, you can see the grammar rules and values but you cannot select a rule or value to highlight it in the content panel.

  • For unstructured content, expand grammar classes and rules to view the grammar values found in the document.

    For structured content, expand the column names in the Columns with Sensitive Data section and then expand the grammar rules to view the grammar values found in the table. Columns in which no grammars were found are listed in the Other Columns section.

  • Click one or more rows for grammar rules or values. The background color for the clicked row changes to blue to indicate that the rule or value is "selected". If the content has been stored, the associated grammar values are highlighted in the content view panel.

    • If you click a grammar rule, all values for the grammar are selected in the grammar rules list and are highlighted in the content view panel.

      TIP: If your content list view is a result of a search that included a keyword search, the highlight color for grammar rule identification is a different color than the highlight color used to identify found keywords in collected documents.

    • If you click a grammar value, the value is highlighted in both in the grammar rules list and is highlighted in the content view panel. If more than one instance of the value is present in the content, go to the next value using the navigation arrows at the bottom of the panel (icon to navigate to the previous value 1 of 2 icon to navigate to the next value).

    • A red box around a highlighted value (word or phrase) in the content view indicates the grammar value in focus.

  • If you have multiple grammar values or rules and want to change the focus of a selected value, hover over a selected grammar value and then click the view icon (view icon). The focus in the content view panel changes to the selected value. If more than one instance of the value is present in the content, go to the next value using the navigation arrows at the bottom of the panel (icon to navigate to the previous value 1 of 2 icon to navigate to the next value).

  • To deselect a grammar rule or value, click the desired rule or value. The blue background is removed to indicate that the rule or value is no longer selected.

    TIP: If a value remains highlighted in the content view panel after you deselected it in the grammar rules list, the value is still selected for another grammar rule.

Mark identified grammar values as false positives

When reviewing data items, you may find that grammar values identified for a particular rule may not be accurate in context. For example, the color "white" may be identified as a value matching the grammar rule "Racial Ethnic Origin" within the "Other Personal Data" grammar class. If this occurs, you can mark the grammar value as a false positive. You can mark an entire grammar rule as a false positive if you find that all identified grammar values for the rule are not correct in context. You can also clear a false positive if needed.

NOTE: Content must be stored before you can mark a grammar value as a false positive.

You can also clear a false positive that has been set at the individual grammar value level or at the grammar rule level.

IMPORTANT: If you clear false positives at the grammar rule level, this also clears any remapped values under that rule. This will also remove any remapped values if they were mapped from another rule. For more information about remapping grammar rules, see Remap identified grammar values

Mark columns with identified grammar values as not sensitive

Similar to identifying and marking identified grammar values as false positives, you can identify columns in structured data tables that should not be considered sensitive. For example, you have a database table that includes customer account numbers. The grammar analysis of this table identifies all values in the "Customer #" column as bank account numbers because your internal customer reference numbers happen to have the same pattern as Australian bank account numbers. When reviewing this table, you can set this entire column as "not sensitive".

NOTE: When you set a column as not sensitive, all identified grammar rules and values in that column are marked as false positive.

Setting a table column as not sensitive or resetting a table column to sensitive will clear any previously remapped grammar values.

Remap identified grammar values

When reviewing data items, you may find that grammar values identified for a particular grammar rule may, in context, apply to a different grammar rule. For example, the last name "White" may be identified as a value matching the grammar rule "Racial Ethnic Origin" within the "Other Personal Data" grammar class. If this occurs, you can remap the value "White" to the correct grammar rule (in context) "Names" within the "Identification Data" grammar class. You can also remap all values within a grammar rule or table column to a different grammar rule.

NOTE: When you remap a table column, all identified grammar rules and values in that column are marked as false positive.

When you remap a grammar rule, all identified grammar values for that rule are remapped to the selected grammar rule.

Once grammar values are remapped, the original values are considered false positives. You can clear those false positives as you would clear any other false positive. Clearing false positives at the grammar rule level also clears any remapped values under that grammar rule. It will also 'remove' any remapped values if they were mapped there from another rule.