Are you passionate about contributing meaningfully to battling cancer? Then join us here at MSK, where we can provide you with the opportunity to make a difference with your career. We believe this is a very exciting opportunity for someone who has the right skillset and drive to make an impact to support our mission.
The Computational Oncology Program in the Department of Epidemiology and Biostatistics is seeking a talented, highly skilled Data Engineer to join their team. We are motivated by contributing meaningfully to contemporary progress in cancer research driven by advances in computing and data. The right person will work in close collaboration with researchers and software engineers, and be responsible for managing data from leading edge, large scale research efforts in computational biology including genomics, imaging and clinical data analysis and interpretation. The Data Engineer will have experience managing data utilizing robust, enterprise level contemporary software systems.
You Will :
Manage data from high-throughput next-generation sequencing and imaging
Contribute to the design of databases as part of bioinformatics data processing and analysis systems
Maintain and monitor streaming and batch ETLs operating on structured and unstructured sources
Maintain a data lake with hundreds of terabytes of data
Develop workflows and integrate systems with REST APIs
Compile datasets and verify data consistency
Communicate with stakeholders of the data and upon request, conduct data query tracking and resolution
Identify inefficiencies and work with software engineers to simplify processes, debug systems and automate routine tasks
Bachelors Degree in Computer Science, Information Systems, or Database Management (or equivalent experience), Masters degree is preferred
3+ years of experience, preferably with bioinformatics lab information management systems
Experience designing databases and defining system requirements for data collection
Strong software engineering skills in Python, and working with SQL and NoSQL data
Solid experience in Linux systems, and shell scripting
Internal Number: 2019-37064
About Memorial Sloan-Kettering Cancer Center
As one of the world's premier cancer centers, Memorial Sloan-Kettering Cancer Center is committed to exceptional patient care, leading-edge research, and superb educational programs. The close collaboration between our physicians and scientists is one of our unique strengths, enabling us to provide patients with the best care available today as we work to discover more effective strategies to prevent, control, and ultimately cure cancer in the future. Our education programs train future physicians and scientists, and the knowledge and experience they gain at Memorial Sloan-Kettering has an impact on cancer treatment and the biomedical research agenda around the world.