Automatic detection of protected health information from clinical narratives