1+ months

Data Engineer

Cambridge, MA 02139
Site Name: USA - Massachusetts - Cambridge
Posted Date: Jan 19 2021

We are looking for a Software Engineer/Data Engineer who will join our team to develop solutions for collecting, storing, processing, and analyzing huge sets of small molecule-protein interaction data. The primary focus of this job will be on choosing methods to use for these purposes, then implementing, maintaining, and monitoring these solutions over their full life cycle. You will also be responsible for integrating these methods with the information architecture used across the company.

The role provides an exciting opportunity to transform our data collection and storage infrastructure. You will be embedded in the Encoded Library Technology (ELT)* Scientific Computing team, working with biologists, chemists and data scientists who are passionate about advancing the ELT technology. You will also have opportunity to interface and collaborate with analysts and engineers in the GSK AI/ML groups, R&D infrastructure, and computational chemistry/biology teams.

Key Responsibilities

  • Selecting and integrating in-house software packages with Big Data tools and frameworks.

  • Creating and maintaining custom software solutions for the analysis of affinity selection results.

  • Implementing Extract Transform Load (ETL) process for Encoded Library Technology assay data.

  • Monitoring performance and advising any necessary infrastructure changes.

Why you?

Basic Qualifications:

  • BS/MS in Computer Science, Analytics or similar discipline.

  • 8+ years professional software engineering experience.

  • Experience with Linux and Python programming.

  • Experience analyzing biological, chemical or omics (i.e. RNA-Seq, DNA-Seq, etc.) data.

Preferred Qualifications:

  • Experience implementing and maintaining data or analytic pipelines.

  • Experience delivering software and data solutions using Agile and DevOps methodologies and concepts. (i.e. Scrum, Kanban, etc.)

  • Experience with enterprise-level data solutions including big-data technologies. (i.e. Hadoop, MapReduce, Impala, Hive and Spark)

  • Experience with analyzing biological, chemical or omics (i.e. RNA-Seq, DNA-Seq, etc.) data.

  • Demonstrated ability to translate data requirements between bench scientists and data/IT professionals.

  • Strong interpersonal skills and ability to communicate complex concepts to stake holders with wide range of expertise.

Why GSK?

Our values and expectationsare at the heart of everything we do and form an important part of our culture.

These include Patient focus, Transparency, Respect, Integrity along with Courage, Accountability, Development, and Teamwork. As GSK focuses on our values and expectations and a culture of innovation, performance, and trust, the successful candidate will demonstrate the following capabilities:

  • Operating at pace and agile decision-making using evidence and applying judgement to balance pace, rigour and risk.
  • Committed to delivering high quality results, overcoming challenges, focusing on what matters, execution.
  • Continuously looking for opportunities to learn, build skills and share learning.
  • Sustaining energy and well-being.
  • Building strong relationships and collaboration, honest and open conversations.
  • Budgeting and cost-consciousness.

*For a recent review of ELT at GSK see: Arico-Muendel, C. C. From haystack to needle: finding value with DNA encoded library technology at GSK. MedChemComm 2016, DOI: 10. 1039/c6md00341a


If you require an accommodation or other assistance to apply for a job at GSK, please contact the GSK Service Centre at 1-877-694-7547 (US Toll Free) or +1 801 567 5155 (outside US).

GSK is an Equal Opportunity Employer and, in the US, we adhere to Affirmative Action principles. This ensures that all qualified applicants will receive equal consideration for employment without regard to race, color, national origin, religion, sex, pregnancy, marital status, sexual orientation, gender identity/expression, age, disability, genetic information, military service, covered/protected veteran status or any other federal, state or local protected class.

Important notice to Employment businesses/ Agencies

GSK does not accept referrals from employment businesses and/or employment agencies in respect of the vacancies posted on this site. All employment businesses/agencies are required to contact GSK's commercial and general procurement/human resources department to obtain prior written authorization before referring any candidates to GSK. The obtaining of prior written authorization is a condition precedent to any agreement (verbal or written) between the employment business/ agency and GSK. In the absence of such written authorization being obtained any actions undertaken by the employment business/agency shall be deemed to have been performed without the consent or contractual agreement of GSK. GSK shall therefore not be liable for any fees arising from such actions or any fees arising from any referrals by employment businesses/agencies in respect of the vacancies posted on this site.

Please note that if you are a US Licensed Healthcare Professional or Healthcare Professional as defined by the laws of the state issuing your license, GSK may be required to capture and report expenses GSK incurs, on your behalf, in the event you are afforded an interview for employment. This capture of applicable transfers of value is necessary to ensure GSKs compliance to all federal and state US Transparency requirements. For more information, please visit GSKs Transparency Reporting For the Record site.

","street_address":"200 Cambridge Park Drive",


Posted: 2020-11-27 Expires: 2021-02-19

Before you go...

Our free job seeker tools include alerts for new jobs, saving your favorites, optimized job matching, and more! Just enter your email below.

Share this job:

Data Engineer

Cambridge, MA 02139

Join us to start saving your Favorite Jobs!

Sign In Create Account
Powered ByCareerCast