Senior Data Science Engineer, nanoHUB

Details: Published on Tuesday, 09 January 2024 14:14

Senior Data Science Engineer

Apply at: https://careers.purdue.edu/job/West-Lafayette-Senior-Data-Science-Engineer-IN-47906/1102334400/

Job Summary

The Network for Computational Nanotechnology (NCN) operates the nanoHUB platform, which is broadly used across the global science and engineering community for dissemination, education, and research. NCN and nanoHUB provide cyberinfrastructure support to several efforts, including the ME Commons Silicon Crossroads, Scalable Asymmetric Lifecycle Engagement (SCALE) and Micro-Electronics Security Training (MEST) Centers. The NCN team provides content deployment through nanoHUB, including simulation tools and data, as well as infrastructure development and analytics specifically to support the needs of these projects.

The Senior Data Science Engineer will work on the nanoHUB platform to develop infrastructure and data analytics on user behavior, content, and scientific data in nanoHUB. The position is expected to have a 3-year duration and be renewable.

Required:

Bachelor’s degree in engineering, computer science, physical science, or related field
Four (4) years of experience in programming, database, and software design, including customer-driven software design and development experience and including at least three (3) years of experience in data modeling, data transformation, or business intelligence/data warehousing technologies
Equivalent combinations of education and experience may be considered
Experience with data science, machine learning, and scientific computing
Demonstrated experience with one or more relational database management system such as Oracle, MySQL, or MS SQL Server
Programming experience using C/C++/Java/other languages and/or using scripting languages such as Python or Perl, with databases or web applications
Domain knowledge in data science and machine learning, and cloud computing
Experience with data curation, data preprocessing and refinement, data analysis, data warehousing, and database design
Ability to identify and facilitate production implementation for recurring data sets and analytical patterns and to create tools for developing insights from moderately complex data sets
Ability to generate visualizations to communicate important features of data sets and identify new data sources and data flows

Preferred:

Knowledge of FAIR (Findable, Accessible, Interoperable, and Reusable) data principles and application of LLMs (large language models)
Advanced degree in engineering or physical sciences discipline
Experience developing data analysis and/or scientific applications, graphical user interface design, or developing software on Linux or Windows platforms
Web development experience including JavaScript, PHP, CSS, HTML5, and XML
Experience working with large volumes of data
Domain knowledge in nanotechnology or materials science
Familiarity with Hubzero and/or nanoHUB infrastructure and software development practices
Specialized skills such as: big data technologies, dynamic web programming, or speculative/exploratory data-driven analysis
Extensive knowledge of Python, C++, or Java

Additional Information

This position has a limited duration expected to be three years. Continuation is dependent upon additional grant funding
Purdue’s benefits summary
Purdue will not sponsor employment authorization for this position
A background check will be required for employment in this position
FLSA: Exempt (Not Eligible For Overtime)
Retirement Eligibility: Defined Contribution Waiting Period
Purdue University is an EOE/AA employer. All individuals, including minorities, women, individuals with disabilities, and veterans are encouraged to apply