We are seeking a Data Scientist proficient in Python and Jupyter Notebook to support a Natural Language Processing (NLP) project to accurately and automatically tokenize language data with spoken or written origins. You will develop automated solutions for the annotation of language data with parts of speech information and improve existing models by scoring performance against human-generated annotations for speech and text.
The Level 3 Data Scientist shall possess the following capabilities:
Qualifications:
TS/SCI with polygraph is required.