Lead Software Engineer - Machine Learning

SAMPRASOFT

Durham, NC

JOB DETAILS
SKILLS
Agile Programming Methodologies, Algorithms, Amazon Web Services (AWS), Analysis Skills, Apache Kafka, Apache Spark, Application Programming Interface (API), Automation, Big Data, Cloud Computing, Communication Skills, Computer Science, Continuous Deployment/Delivery, Continuous Integration, Data Analysis, Data Management, Data Processing, Data Science, Database Extract Transform and Load (ETL), Distributed Computing, Ecosystems, Git, Identify Issues, Java, Jenkins, Kanban, Machine Learning, Memory Hardware, Messaging Technology, NoSQL, Object Oriented Programming (OOP), Open Source, Predictive Modeling, Python Programming/Scripting Language, Quality Monitoring, Relational Databases (RDBMS), Risk Management, SQL (Structured Query Language), Scala Programming Language, Scalable System Development, Scripting (Scripting Languages), Scrum Project Management and Software Development, Simple Queue Service (SQS), Software Engineering, Source Code/Configuration Management (SCM), Streaming Technology, Systems Administration/Management, Team Player, Technical/Engineering Design, Unix Shell Programming
LOCATION
Durham, NC
POSTED
6 days ago

Location: Hybrid model (5 days per month); Boston, MA; Durham, NC; New Jersey

Duration: Long term

The Expertise You Have

  • Bachelor's or Master's degree in a technology-related field (e.g., Engineering, Computer Science).
  • 8+ years of proven experience implementing big data solutions in the data analytics space.
  • 2+ years of experience developing ML infrastructure and MLOps in the cloud using AWS SageMaker.
  • Extensive experience deploying, serving (inference), tuning, and measuring machine learning models is required.
  • Experience in object-oriented programming (Java, Scala, Python), SQL, Unix scripting, or related programming languages, and exposure to Python's ML ecosystem (NumPy, pandas, scikit-learn, TensorFlow, etc.).
  • Experience building data pipelines that supply the data needed to build and evaluate ML models, using Apache Spark or other distributed data processing frameworks.
  • Experience with data movement technologies (ETL/ELT), messaging/streaming technologies (AWS SQS, Kinesis/Kafka), relational and NoSQL databases (DynamoDB, EKS, graph databases), APIs, and in-memory technologies.
  • Strong knowledge of developing highly scalable distributed systems using open-source technologies.
  • Experience with CI/CD tools (e.g., Jenkins or equivalent), version control (Git), and orchestration/DAG tools (AWS Step Functions, Airflow, Luigi, Kubeflow, or equivalent).
  • Solid experience with Agile methodologies (Kanban and Scrum).

The Skills You Bring

  • You have strong technical design and analysis skills.
  • You have the ability to deal with ambiguity and work in a fast-paced environment.
  • You have experience supporting critical applications.
  • You are familiar with applied data science methods, feature engineering and machine learning algorithms.
  • You have data wrangling experience with structured, semi-structured, and unstructured data.
  • You have experience building ML infrastructure, with an eye towards software engineering.
  • You have excellent written and verbal communication skills.
  • You have excellent collaboration skills to work with multiple teams in the organization.
  • You are able to understand and adapt to changing business priorities and technology advancements in the big data and data science ecosystem.

The Value You Deliver

  • Designing & developing a feature generation & store framework that promotes sharing of data/features among different ML models.
  • Partnering with Data Scientists to help them use the foundational platform upon which models can be built and trained.
  • Operationalizing ML models at scale (e.g., serving predictions for tens of millions of customers).
  • Building tools that detect shifts in the data/features used by ML models so issues are identified before prediction quality deteriorates, that monitor the uncertainty of model outputs, and that automate prediction explanations for model diagnostics.
  • Exploring new technology trends and using them to simplify our data and ML ecosystem.
  • Driving innovation and implementing forward-looking solutions.
  • Guiding teams to improve development agility and productivity.
  • Resolving technical roadblocks and mitigating potential risks.
  • Delivering system automation by setting up continuous integration/continuous delivery pipelines.

About the Company

SAMPRASOFT