Senior Data Science Engineer, nanoHUB

Apply at: https://careers.purdue.edu/job/West-Lafayette-Senior-Data-Science-Engineer-IN-47906/1102334400/

Job Summary

The Network for Computational Nanotechnology (NCN) operates the nanoHUB platform, which is broadly used across the global science and engineering community for dissemination, education, and research. NCN and nanoHUB provide cyberinfrastructure support to several efforts, including the ME Commons Silicon Crossroads, Scalable Asymmetric Lifecycle Engagement (SCALE) and Micro-Electronics Security Training (MEST) Centers. The NCN team provides content deployment through nanoHUB, including simulation tools and data, as well as infrastructure development and analytics specifically to support the needs of these projects.  

The Senior Data Science Engineer will work on the nanoHUB platform to develop infrastructure and data analytics on user behavior, content, and scientific data in nanoHUB. The position is expected to have a 3-year duration and be renewable.  

Required:

  • Bachelor’s degree in engineering, computer science, physical science, or related field 
  • Four (4) years of experience in programming, database, and software design, including customer-driven software design and development and at least three (3) years of experience in data modeling, data transformation, or business intelligence/data warehousing technologies
  • Equivalent combinations of education and experience may be considered
  • Experience with data science, machine learning, and scientific computing
  • Demonstrated experience with one or more relational database management systems such as Oracle, MySQL, or MS SQL Server
  • Programming experience using C/C++/Java/other languages and/or using scripting languages such as Python or Perl, with databases or web applications
  • Domain knowledge in data science, machine learning, and cloud computing
  • Experience with data curation, data preprocessing and refinement, data analysis, data warehousing, and database design
  • Ability to identify and facilitate production implementation for recurring data sets and analytical patterns and to create tools for developing insights from moderately complex data sets
  • Ability to generate visualizations to communicate important features of data sets and identify new data sources and data flows

Preferred:

  • Knowledge of FAIR (Findable, Accessible, Interoperable, and Reusable) data principles and of applications of large language models (LLMs)
  • Advanced degree in an engineering or physical sciences discipline
  • Experience developing data analysis and/or scientific applications, graphical user interface design, or developing software on Linux or Windows platforms
  • Web development experience including JavaScript, PHP, CSS, HTML5, and XML
  • Experience working with large volumes of data
  • Domain knowledge in nanotechnology or materials science 
  • Familiarity with HUBzero and/or nanoHUB infrastructure and software development practices
  • Specialized skills such as: big data technologies, dynamic web programming, or speculative/exploratory data-driven analysis
  • Extensive knowledge of Python, C++, or Java

Additional Information

  • This position has a limited duration, expected to be three years. Continuation depends on additional grant funding
  • Purdue’s benefits summary 
  • Purdue will not sponsor employment authorization for this position  
  • A background check will be required for employment in this position
  • FLSA: Exempt (Not Eligible For Overtime)
  • Retirement Eligibility:  Defined Contribution Waiting Period
  • Purdue University is an EOE/AA employer. All individuals, including minorities, women, individuals with disabilities, and veterans, are encouraged to apply