The Data Scientist will function in a multidisciplinary team environment, providing comprehensive machine learning data analysis with a focus on deep learning methods for multiple clinical and epidemiological studies of cardiovascular outcomes research conducted by investigators at the Center for Outcomes Research and Evaluation (CORE). This role involves planning, development, and implementation of methodology for specific research projects.
Under the direction of the Principal Investigator and Program Co-Director, the ideal candidate will perform a variety of duties involving the application of machine learning skills to the analysis of research studies, and will work as a member of a research team to provide input in the design of study, perform data analysis, and lead or assist drafting analytical sections for peer-review publication for various projects. The candidate is expected to lead several efforts for which recent deep learning methods are appropriate, and for which large amounts of complex data are available. All research and programming code must be regularly documented and archived per best practices. Responsibilities will also include participating in the design, implementation, and maintenance of high performance computing (HPC) environments with graphical processing units (GPU) for accelerated deep learning computation. General responsibilities of the Data Scientist will be to coordinate and lead regular meetings that include agenda and materials preparation as well as meeting minute documentation, and to develop and manage timelines and research activities for research projects to ensure project goals and deliverables are met. The Data Scientist will communicate information and data in written and verbal form to colleagues as well as project and senior management teams. The individual will also be responsible for contributing to the development of training and presentation materials and assisting with the planning and coordination of training and seminars.Develop and execute new and/or highly complex algorithms and statistical predictive models and determine analytical approaches and modeling techniques to evaluate potential future outcomes. Establish analytical rigor and statistical methods to analyze large amount of data, using advanced statistical techniques and mathematical analyses. Manage analytical projects from data exploration, model building, performance evaluation, through implementation. Develop work plans and monitor progress and project timelines. Document coding and changes to work plans using established work group methods in GitHub. Interact with a multidisciplinary team of internal and external peers to regularly, effectively, and openly communicate progress and outcomes of planned work. Attend weekly team meetings to discuss team and project-related activities, issues, change, communications, and updates.
Preferred Education: Experience in leading conference publications, particularly in the deep learning field. Strong background in data analysis with a diverse set of platforms (e.g. R, Python). Knowledge of advanced analytic approaches, including signal processing, image analysis, and supervised and unsupervised machine learning. Experience with version control, unit testing, and continuous integration environments.
Preferred Education, Experience and Skills: Experience in leading conference publications, particularly in the deep learning field. Strong background in data analysis with a diverse set of platforms (e.g. R, Python). Knowledge of advanced analytic approaches, including signal processing, image analysis, and supervised and unsupervised machine learning. Experience with version control, unit testing, and continuous integration environments.
Required Skill/ability 5: Strong organizational, time management, and leadership skills. Ability and willingness to work in a highly collaborative team environment and matrixed organization.
Posting Position Title: Data Scientist 1
Required Skill/ability 3: Sound background in theoretical and applied machine learning.
Work Week: Standard (M-F equal number of hours per day)
University Job Title: Data Scientist
Required Skill/ability 1: Expertise in computer vision and/or NLP.
Required Skill/ability 4: Demonstrated strong ability to communicate technical ideas and results to non-technical customers in written and verbal formats.
Required Skill/ability 2: Ability to work with massive datasets and GPUs.
Master's Degree in computer science, applied/computational mathematics, engineering, biostatistics, statistics, or a quantitative field such as astronomy or geology, and 2 years of hands-on experience in deep learning or an equivalent combination of education and experience.
Internal Number: 52656BR
About Yale University
Yale University is an American private Ivy League research university located in New Haven, Connecticut. Founded in 1701 in the Colony of Connecticut, the university is the third-oldest institution of higher education in the United States.