Introduction
The IDOL Eduction Grammars Package contains tools that allow you to find different kinds of information in text. The grammars package is part of OpenText IDOL Eduction.
Eduction identifies and extracts entities from text. An entity is a word, phrase, or block of information, such as a person's name, an address, a date, or a telephone number.
The grammars are files that contain the definitions for a set of entities. In some cases, the grammar might be a list of fixed values, such as names. In other cases, the grammar might be pattern matching tools that find data of a particular type, such as a set of digits that make up a phone number.
The IDOL Eduction Grammars Package includes the following sets of grammars:
-
Personal Identifiable Information (PII) grammars allow you to find various kinds of personal data, to help you comply with regulations such as the General Data Protection Regulation (GDPR).
-
Protected Healthcare Information (PHI) grammars allow you to find healthcare information, to ensure compliance with regulations such as the Standards of Privacy of Individually Identifiable Health Information implemented as part of the Health Insurance Portability and Accountability Act (HIPAA).
-
Payment Card Industry (PCI) grammars allow you to locate financial information, to ensure compliance with financial regulations.
-
Government grammars allow you to find governmental document markings and other information in your data, to help you comply with data management restrictions.
IDOL Eduction also includes a set of legacy grammars, which provide a set of standard entities. These grammars are not maintained like the IDOL Eduction Grammars Package grammars, but they are user-extensible. For more information about the available legacy grammars, refer to the IDOL Eduction User and Programming Guide.