Data Engineer Job at Thermo Fisher Scientific, Pittsburgh, PA

TnpGRkhNK3BadTJEMFFKbEJsdVBGYS82U3c9PQ==
  • Thermo Fisher Scientific
  • Pittsburgh, PA

Job Description

Work Schedule

Standard (Mon-Fri)

Environmental Conditions

Office

When you are part of the team at Thermo Fisher Scientific, you’ll do important work, like helping customers in finding cures for cancer, protecting the environment or making sure our food is safe. Your work will have real-world impact, and you’ll be supported in achieving your career goals.

How will you make an impact?

Thermo Fishers Scientific is seeking a Data Engineer located at Pittsburgh, PA to work with Data Science Center of Excellence and Data Architecture team to build Databricks-based Data Pipeline and bring data onto our enterprise level data platform for Data Science, Analytics and Digital Marketing needs. The data platform is primarily based on Oracle Exadata database, AWS Redshift and Databricks-based Delta and Unity Catalog technologies toward  Lakehouse transition to enable Data Science, Data Analytics, Customer Analytics and Data Services for critical Application and Business enablement

What will you do?

  • Design, develop, test, deploy, support, enhance data integration solutions seamlessly to connect and integrate Thermo Fisher enterprise systems in our Data Science and Enterprise Data Platform.
  • Innovate for data integration in Apache Spark-based Platform to ensure the technology solutions leverage cutting edge integration capabilities.
  • Facilitate requirements gathering and process mapping workshops, review business/functional requirement documents, author technical design documents, testing plans and scripts.
  • Assist with implementing standard operating procedures, facilitate review sessions with functional owners and end-user representatives, and leverage technical knowledge and expertise to drive improvements.
  • Defining, designing and documenting reference architecture and leading the implementation of BI and analytical solutions.
  • Follow agile development methodologies to deliver solutions and product features by following DevOps practices.

How will you get here?

  • 4-year degree with major in computer science engineering (or equivalent).

Experience , Knowledge, Skills, Abilities

  • 6+ Years Experience in Databricks, Data/Delta lake/Unity Catalog, Oracle, SQL Server or AWS Redshift type relational databases.
  • Experience in ETL (Data extraction, data transformation and data load processes)
  • 6+ years working experience in data integration and pipeline development.
  • Excellent experience in Databricks and Apache  Spark .
  • Data lake  and Delta lake and Unity Catalog experience with  AWS Glue and Athena .
  • 6+ years of Experience with AWS Cloud on data integration with Apache Spark, Glue, Kafka, Elastic Search, Lambda, S3, Redshift, RDS, MongoDB/DynamoDB ecosystems.
  • Strong real-life experience in python development especially in  pySpark in AWS Cloud environment
  • Design, develop test, deploy, maintain and improve data integration  pipeline .
  • Experience in Python and common python libraries.
  • Knowledge of Generative AI models (e.g., GPT, DALL-E) and their potential applications in data engineering and analytics
  • Strong analytical experience with database in writing complex queries, query optimization, debugging, user defined functions, views, indexes etc.
  • Strong experience with source control systems such as  Git and Jenkins build and continuous integration tools.
  • Highly self-driven, execution-focused, with a willingness to do "what it takes” to deliver results as you will be expected to rapidly cover a considerable amount of demands on data integration
  • Understanding of development methodology and actual experience writing functional and technical design specifications.
  • Must be willing to learn Generative AI.
  • 20% of supporting Activities along with Dev work.
  • Excellent verbal and written communication skills, in person, by telephone, and with large teams.
  • Strong prior technical, development background in either data Services or Engineering
  • Demonstrated experience resolving complex data integration problems;
  • Must be able to work cross-functionally. Above all else, must be equal parts data-driven and results-driven

Job Tags

Remote job, Full time, Work experience placement, Work at office

Similar Jobs

BJC Healthcare

ICU Clinical Nurse PRN Job at BJC Healthcare

Additional Information About the Role Unit - Memorial Shiloh - ICU PRN Nights Three shifts per 6 week schedule Competitive Pay (See Career Ladder Information Below) BSN Differential Shift Differential Eligible for up to 40 hours of paid time off each...

Chick-fil-A

Miembro del equipo de cocina Job at Chick-fil-A

 ...ofrecidos para este puesto Este puesto requiere un 100% de trabajo in situ en 3181 Harbor Blvd, Costa Mesa, CA 92626 Debe...  ...califiquen. ~ Los servicios de asesoramiento gratuitos estn disponibles para todos los empleados. ~ Se ofrecen planes de salud de telemedicina... 

Carle Health

RN - Surgery (Proctor) Job at Carle Health

 ...school, and Methodist College. Carle BroMenn Medical Center, Carle Foundation Hospital, Carle Health Methodist Hospital, Carle Health Proctor Hospital, Carle Health Pekin Hospital, and Carle Hoopeston Regional Health Center hold Magnet designations, the nations highest... 

New York State Civil Service

Office Assistant 1 (NY HELPS) - 91141 Job at New York State Civil Service

 ...Health, Department of - Veterans Home at Montrose Title Office Assistant 1 (NY HELPS) - 91141 Occupational Category Clerical,...  ...Competitive HELPS qualifications: There are no minimum education or experience requirements for this title.Preferred Qualifications: The... 

Saint Louis University

Flight Instructor Job at Saint Louis University

 ...service. Job Summary Under general direction, conducts quality flight instruction required in the professional pilot programs at...  ..., Abilities, and Personal Characteristics Knowledge and instructor certification appropriate to the position sought Knowledge of...