Sr. Data Engineer
Role: Sr. Data EngineerLocation: Marina Del Rey, CADuration: Long term contractNo of Hours: 160 Hours/ Month?ROLE SUMMARY:We?re seeking a Senior Software Engineer who specializes in data pipelines, large-scale data processing and distributed systems. This DATA ENGINEER will be a member of the Data Science & Engineering Team, which is a Center of Excellence for Data Science, Data Transport and Cluster Computing at DSC. They will operate in a cross-functional squad with business stakeholders, product managers and engineers from other teams to create data-enabled features in both internal and customer-facing applications. Key to success in this role will be strong curiosity and drive to learn technologies and architectures that create scalable, reliable and secure data services.RESPONSIBILITIES:Architect and maintain our data Infrastructure on AWS, including:Data Lake (Redshift)Cluster computing resources (Apache Spark)Distributed messaging and streaming platforms (Apache Kafka)Distributed storage (S3)Design and implement data applications on cluster computing and streaming enginesDesign and implement data processing microservices and pipelinesIntegrate with our vendors to collect and ingest second-party data into our data lakeWork with data scientists to enhance our software with the latest in machine learning algorithms and predictive modelsCollaborate to ensure the integrity and cleanliness of data sources, and to support the requirements of downstream data consumersMentor and guide team membersEstablish and communicate team standards and best practicesCollaborate in a cross-functional squad with business stakeholders, product managers and engineers from other teams to create data-enabled features in both internal and customer-facing applicationsQUALIFICATIONS:Enthusiasm to work in a fast-paced engineering teamBS or MS in Computer Science or a related technical field; or equivalent work experience4+ years of prior experience with general software development3+ years experience working with data infrastructure in several of the categories listed aboveFluency in SQL or equivalent query languageExpertise in Python, Java, Scala or similar programming languagesExperience working in the following domains is preferred: data mining, machine learning, statistical modeling, distributed systems,?stream processing, data warehousingExpertise with the following?is a big plus: Spark, MapReduce, Kafka, Kinesis, Storm, Redshift, PostgreSQL, Apache Airflow, Python, NumPy, SciPyThanks and Regards,Bharath kumar - Technical Recruiter - Pantar Solutions IncContact: 704-233-7955 -?Email:?bharath@pantarsolutions.com? - provided by Dice
Data Lake (Redshift),Spark,Kafka,data infrastructure
Data Lake (Redshift),Spark,Kafka,data infrastructure
|