live chat
Data Engineer Job in Columbus, Ohio US

Data Engineer

American Chemical Society - Columbus, OH

Posted: 10/19/2020 - Expires: 1/17/2021

Job ID: 221255703



Job Description

Data Engineer                           

Date: Oct 19, 2020

Location:Columbus, OH, US, 43202

                Company:                 American Chemical Society
                                   
CAS uses unparalleled scientific content, specialized technology and unmatched human expertise to help R&D organizations across Commercial, Government and Academic sectors create groundbreaking innovations that benefit the world. As the Scientific Information Solutions Division of the American Chemical Society, CAS manages the largest curated reservoir of scientific knowledge, and for 112 years, has helped innovators mine, assess and apply that information to keep businesses thriving. The CAS team is global, diverse, endlessly curious and strives to make actionable scientific insights accessible to innovators worldwide.

CAS is currently seeking a Data Engineer. This position will be located in our headquarters in Columbus, Ohio.

Data Engineers are responsible for supporting ingestion and transformation pipelines that handle data for analytical or operational uses across broad business areas and enterprise data domains. The data engineer often works as a dedicated member of support teams, focused on providing production stability for data processing workflows that will be used by analytics groups and data scientists who are interrogating information for predictive analytics, machine learning and data mining purposes.

Duties

Delivers data engineering expertise for ingestion and transformation pipelines that handle data for analytical or operational uses across wide variety of business needs and enterprise data domains.
Ensures production stability for data processing workflows used by analytics groups and data scientists who are interrogating information for predictive analytics, machine learning and data mining purposes.
Defines structure, integrates, governs, stores, describes, models, and maintains data in the enterprise for accuracy and usage while maintaining current state
Safeguards best practices of data architecture including accountability, governance, and requirements.
Perform other duties as assigned

Qualifications

Bachelor’s degree in Computer Science or similar discipline.  Experience across a broad range of modern data science and analytics tools (e.g., SQL, Hadoop, Spark, Python, R)
3+ years’ experience in Large Big Data Development and Deployment Automation in Private/Public cloud preferably on AWS
Hands on experience in big data environments such as (Cloudera or Hortonworks)
Experience with DevOps, Continuous Integration and Continuous Delivery (Maven, Jenkins, Stash, Ansible, Docker)
Experience with programming in Scala, Spark, Python, JavaScript and Java, as well as Unix shell skills
Defines structure, integrates, governs, stores, describes, models, and maintains data in the enterprise for accuracy and usage while maintaining current state
Support policies and procedures enforced by the data governance committee to ensure best practices of data architecture including accountability, governance, and requirements
Experience working with XML
Experience building Data Ingestion on the cloud (using tools like Glue)
Understanding of principles, best practices and trade-offs of schema design for both Relational and NoSQL database systems
Solid understanding of Big Data NoSQL databases/technologies (MarkLogic, Hbase, Hive, Spark, MongoDB)
Strong written and verbal communication skills.
Ability to travel as required

Desired, but not required:

Knowledge and experience in chemistry, drug discovery/development, or medical related industry

CAS offers a competitive salary and comprehensive benefits package, including a generous vacation plan, medical, dental, vision insurance plans, and employee savings and retirement plans.  Candidates for this position must be authorized to work in the United States and not require work authorization sponsorship by our company for this position now or in the future.
Division


Position Summary


       
               
Nearest Major Market: Columbus                                   
Job Segment:                     Database, Developer, Computer Science, SQL, XML, Technology                                       
                           
                                                       
                       
                   

EEO/Minority/Female/Disabled/Veteran

Job Summary


Employment Type:
Full Time Employee
Job type:
Federal Contractor
Skill Based Partner:
No
Education Level:
Bachelor's degree
Work Days:
Mon, Tue, Wed, Thu, Fri
Job Reference Code
43686680
Salary
N/A
Licenses / Certifications:
N/A
Display Recommended WorkKeys®Recommended WorkKeys®:
N/A