Data Engineer

Seven Seven Softwares

Newark, NJ

JOB DETAILS
SKILLS
Agile Programming Methodologies, Analysis Skills, Apache, Apache Cassandra, Apache HBase, Apache Hadoop, Apache Hive, Apache Spark, Big Data, Communication Skills, Computer Science, Data Lake, Data Modeling, Data Visualization, Database Extract Transform and Load (ETL), Git, Informatica, Java, JavaScript, Jenkins, MapReduce, Maven, MongoDB, NoSQL, Object Oriented Analysis (OOA), Object Oriented Design (OOD), Organizational Skills, PostgreSQL, Presentation/Verbal Skills, Programming Languages, Python Programming/Scripting Language, Ruby, Scala Programming Language, Software Development, Subversion, Tableau, Test Automation, Writing Skills
LOCATION
Newark, NJ
POSTED
23 days ago
Data Engineer for enhancements to PII Data Lake environment for Grow
Flair for data, schema, data model, how to bring efficiency in big data related life cycle.  Understanding of automated QA needs related to Big Data and visualization platforms.

Requirements:
 
  • Java is must and also should have UI experience – Profile
  • BS in Computer Science or related area
  • 5-8 years software development experience
  • Minimum 2 Year Experience on Big Data Platform
  • Proficiency with Java, Python, Scala, HBase, Hive, MapReduce, ETL, Kafka, Mongo, Postgres, Redshift.  Visualization technologies etc.
  • Flair for data, schema, data model, how to bring efficiency in big data related life cycle
  • Understanding of automated QA needs related to Big data
  • Understanding of various Visualization platform (Tableau, D3JS, others)
  • Proficiency with agile or lean development practices
  • Strong object-oriented design and analysis skills
  • Excellent technical and organizational skills
  • Excellent written and verbal communication skills. Skill sets / technologies
  • Programming language -- Java (must), Python, Scala, Ruby
  • Batch processing -- Hadoop MapReduce, Cascading/Scalding, Apache Spark
  • Stream processing -- Apache Storm, AKKA, Samza, Spark streaming
  • NoSQL -- HBase, MongoDB, Cassandra, Riak,
  • ETL Tools Data Stage, Informatica
  • Code/Build/Deployment -git, hg, svn, maven, sbt, jenkins, bamboo


About the Company

S

Seven Seven Softwares