For the most part organisations now have good information governance and visibility of their sensitive data. We certainly played a part in helping this happen by providing sound advice around information governance in relation to GDPR, but also in providing a solution for automatically identifying sensitive content in a document. This solution consisted of a search tool from Netwrix called Data Classifier (formally ConceptSearching) which uses NLP and various other technologies to identify sensitive content. A tag is automatically assigned a document which can then be used in data loss prevention configuration to prevent inadvertent sharing etc.
The capability provided by Netwrix Data Classifier is broader than just identifying sensitive content and can be used to apply any business classification to a corpus of documents.
SharePoint EDRMS, case management and auto-classification of sensitive content