Senior Data Scientist

Neptune Technology Group Inc

Duluth, GA

JOB DETAILS
SKILLS
AWS Lambda, Agile Programming Methodologies, Amazon Simple Storage Service (S3), Amazon Web Services (AWS), Analysis Skills, Application Programming Interface (API), Architectural Analysis, Artificial Intelligence (AI), Artificial Intelligence (AI) Programming Languages, Best Practices, Big Data, Cloud Computing, Code Reviews, Communication Skills, Computer Science, Conservation, Continuous Deployment/Delivery, Continuous Integration, Cross-Functional, Customer Support/Service, Data Analysis, Data Formats, Data Modeling, Data Processing, Data Science, Data Visualization, Data Visualization Tools, Distributed Computing, Electronic Medical Records, Experiment Design, Forecasting, Git, Internet of Things, JSON, Machine Learning, Mathematics, Mentoring, Model Review, Modeling Languages, MySQL, Performance Analysis, Performance Modeling, Performance Tuning/Optimization, PostgreSQL, Predictive Modeling, Product Engineering, Product Management, Production Control, Production Machining, Production Systems, Python Programming/Scripting Language, REST (Representational State Transfer), Reporting Dashboards, Requirements Management, SQL (Structured Query Language), Sales Prospecting, Science Library, Scientific Publications, Software Engineering, Source Code/Configuration Management (SCM), Sprint Planning, Statistics, Technical Leadership, Time Series Analysis, Trend Analysis, Water Resource Management, Water Utility
LOCATION
Duluth, GA
POSTED
30+ days ago

Position Summary

As a Senior Data Scientist, you will be responsible for designing and implementing machine learning models and data-driven solutions that enhance our water utility intelligence platform and create value for our customers. This position involves working with large-scale IoT data from millions of water meters, developing predictive analytics capabilities, and deploying AI solutions into production environments. You will collaborate with Product Management and Engineering teams to translate business requirements into data science solutions, mentor junior data scientists, and drive Neptune's AI transformation initiatives. This role has a direct impact on utility operations, water conservation efforts, and customer service improvements.

Responsibilities

• Effectively communicate and articulate decisions, designs, and outcomes to stakeholders at all levels of the organization.
• Work with cross-functional teams to deliver high-quality machine learning models and data science solutions.
• Understand and enhance requirements defined by Product Management for AI-powered features.
• Design and implement machine learning models for water consumption forecasting, anomaly detection, leak detection, and predictive maintenance.
• Develop and deploy production-ready machine learning pipelines on cloud infrastructure (AWS).
• Analyze large-scale time-series data from IoT devices and water utility operations.
• Build and optimize data processing workflows using PySpark and distributed computing frameworks.
• Create data visualizations and analytics dashboards to communicate insights to stakeholders.
• Conduct exploratory data analysis to identify patterns, trends, and opportunities in metering data.
• Perform feature engineering and model selection to optimize predictive performance.
• Evaluate model performance and implement monitoring solutions for production ML systems.
• Collaborate with software engineers to integrate ML models into the Neptune 360 platform.
• Provide technical guidance to Product Management on data science capabilities and feasibility.
• Document data science methodologies, model architectures, and analytical findings.
• Stay current with the latest developments in machine learning, AI, and data science best practices.
• Mentor junior data scientists and disseminate technical knowledge within the organization.
• Review code and model implementations of other team members.
• Participate in sprint planning and demonstrate completed work at the end of every iteration.
• Work with Python, SQL, PySpark, AWS services (SageMaker, Bedrock, Lambda, Redshift), and ML frameworks.
• Contribute to Neptune's AI strategy and identify new opportunities for data-driven innovation.

Experience

• 5+ years of experience in data science, machine learning, or related analytical roles.
• 5+ years of experience with Python and data science libraries (pandas, NumPy, scikit-learn, TensorFlow/PyTorch).
• Strong experience with SQL and working with large-scale databases (Redshift, PostgreSQL, MySQL).
• Experience with PySpark and distributed computing frameworks for large-scale data processing, including working with common data formats such as JSON and Parquet.
• Proven track record of deploying machine learning models to production environments.
• Experience with cloud platforms, preferably AWS (SageMaker, Bedrock, Lambda, S3, Redshift).
• Experience with time-series analysis and forecasting methods.
• Understanding of MLOps practices and model lifecycle management.
• Experience building RESTful APIs for model serving.
• Strong statistical analysis and experimental design skills.
• Experience with data visualization tools and techniques.
• Experience working in Agile/iterative development environments.
• Ability to communicate complex technical concepts to non-technical stakeholders.
• Experience with version control systems (Git) and CI/CD pipelines.
• Continued professional self-improvement through courses, certifications, or research.
• Preferred: Experience with AWS big data services (Glue, EMR, Athena).
• Preferred: Experience with IoT data, utility operations, or water management systems.
• Preferred: Experience with generative AI and large language models.

Education

Master's or Ph.D. degree in Data Science, Computer Science, Statistics, Mathematics, or a related quantitative field, or a combination of a Bachelor's degree with equivalent experience.

Location: Duluth, GA

About the Company

Neptune Technology Group Inc