Data Scientist 3

Gormat

Annapolis Junction, MD

JOB DETAILS

SKILLS

Algorithms, Analysis Skills, Artificial Intelligence (AI), Astronomy, Biology, Chemistry, Communication Skills, Computer Science, Customer/Client Research, Data Analysis, Data Cleaning, Data Management, Data Mining, Data Modeling, Data Processing, Data Quality, Data Recovery, Data Science, Data Sets, Data Structures, Data Visualization, Government, Higher Education, Linear Algebra, Linear Models, Machine Learning, Mathematics, Model Validation, Natural Language Processing (NLP), Ontology, Operations Research, Performance Modeling, Physical Science, Physics, Python Programming/Scripting Language, Scientific Method, Software Engineering, Statistical Analysis System (SAS), Statistical Modeling, Statistical Programming Languages, Statistics, Systems Engineering, Technical Presentation, Training Data Sets, Training/Teaching, Unstructured Data

POSTED

29 days ago

We are seeking a Data Scientist proficient in Python and Jupyter Notebook to support a Natural Language Processing (NLP) project that accurately and automatically tokenizes language data of spoken or written origin. You will develop automated solutions for annotating language data with part-of-speech information and improve existing models by scoring their performance against human-generated annotations for speech and text.
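As a rough illustration of the kind of scoring described above, the sketch below compares a model's part-of-speech tags against human-generated annotations and reports token-level accuracy. It is a minimal example under stated assumptions, not the project's actual tooling: the function names `tokenize` and `score_tags`, the tag labels, and the sample sentence are all hypothetical.

```python
import re
from collections import Counter


def tokenize(text):
    """Naive tokenizer (hypothetical): words and single punctuation marks."""
    return re.findall(r"\w+|[^\w\s]", text)


def score_tags(predicted, gold):
    """Compare predicted (token, tag) pairs against human (gold) annotations.

    Returns token-level accuracy and a Counter of (gold_tag, predicted_tag)
    confusions. Both lists must be aligned token-for-token.
    """
    if len(predicted) != len(gold):
        raise ValueError("predicted and gold must have the same length")
    correct = 0
    confusions = Counter()
    for (p_tok, p_tag), (g_tok, g_tag) in zip(predicted, gold):
        if p_tok != g_tok:
            raise ValueError(f"token mismatch: {p_tok!r} vs {g_tok!r}")
        if p_tag == g_tag:
            correct += 1
        else:
            confusions[(g_tag, p_tag)] += 1
    accuracy = correct / len(gold) if gold else 0.0
    return accuracy, confusions


# Hypothetical sample: human annotation vs. model output for one sentence.
tokens = tokenize("The dog barks.")
gold = list(zip(tokens, ["DET", "NOUN", "VERB", "PUNCT"]))
predicted = list(zip(tokens, ["DET", "NOUN", "NOUN", "PUNCT"]))

accuracy, confusions = score_tags(predicted, gold)
print(accuracy)    # 0.75
print(confusions)  # Counter({('VERB', 'NOUN'): 1})
```

In practice the tokenizer and tagger would be replaced by the models under evaluation, and accuracy would typically be supplemented with per-tag precision and recall.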

The Level 3 Data Scientist shall possess the following capabilities:

Foundations: (Mathematical, Computational, Statistical).
Data Processing: (Data management and curation, data description and visualization, workflow and reproducibility).
Modeling, Inference, and Prediction: (Data modeling and assessment, domain-specific considerations).
Ability to make and communicate principal conclusions from data using elements of mathematics, statistics, computer science, and applications-specific knowledge.
Ability to use analytic modeling, statistical analysis, programming, and/or another appropriate scientific method to develop and implement qualitative and quantitative methods for characterizing, exploring, and assessing large datasets in various states of organization, cleanliness, and structure, accounting for the unique features and limitations inherent in Government data holdings.
Translate practical mission needs and analytic questions related to large datasets into technical requirements and, conversely, assist others with drawing appropriate conclusions from the analysis of such data.
Effectively communicate complex technical information to non-technical audiences.
Ability to train and develop NLP/NER for LLM solutions within an agentic AI framework (LangGraph). Must be able to perform supervised and unsupervised model training and validation for automated knowledge extraction from unstructured natural language data in multiple languages without a predefined ontology. Familiarity with customer data sources and data retrieval techniques is necessary for producing preprocessed training data, which will require an understanding of techniques to ensure data quality and readiness for integration into the system. An understanding of enterprise data compliance and policy concerns is necessary to ensure solutions are built for end-user access.

Qualifications:

Bachelor's degree with 10 years of relevant experience. An associate's degree with 12 years of experience may be considered for individuals with in-depth experience that is clearly related to the position.
Bachelor's degree must be in Mathematics, Applied Mathematics, Statistics, Applied Statistics, Machine Learning, Data Science, Operations Research, or Computer Science. A degree in a related field (Computer Information Systems, Engineering), a degree in the physical/hard sciences (e.g. physics, chemistry, biology, astronomy), or a degree in another science discipline with a substantial computational component (i.e. behavioral, social, or life) may be considered if it included a concentration of coursework (5 or more courses) in advanced mathematics (typically 300 level or higher, such as linear algebra, probability and statistics, machine learning) and/or computer science (e.g. algorithms, programming, data structures, data mining, artificial intelligence). Courses taken to meet a basic college-level requirement, and upper-level math courses designated as elementary or basic, do not count.
A broader range of degrees will be considered if accompanied by a Certificate in Data Science from an accredited college/university.
Relevant experience must be in designing/implementing machine learning, data science, advanced analytical algorithms, programming (skill in at least one high-level language, e.g. Python), statistical analysis (e.g. variability, sampling error, inference, hypothesis testing, EDA, application of linear models), data management (e.g. data cleaning and transformation), data mining, data modeling and assessment, artificial intelligence, and/or software engineering.
Proficiency with Jupyter Notebooks using Python is required.
NLP experience is required.

TS/SCI with polygraph is required.

Job Posted by ApplicantPro
