different time-series interpolation techniques on real process datasets

Where: Wetsus – Leeuwarden, Netherlands
Starting date: Flexible
Duration: 6 months
Allowance: 200- 400 euros/month – not applicable for Erasmus student


In the last 10 years, new molecular biology tools like Next Generation Sequencing (NGS) became available to unravel the microbial composition and activity of microbial communities. This offers the potential of providing information on both the presence and the response of the microorganisms towards their environment, allowing a comprehensive evaluation of the water quality in wastewater treatment plants.

The meta’omics datasets which are generated by NGS technology are big data in nature and provide an opportunity for tapping into new knowledge potential by using data-driven models. However, fitting reliable models using meta’omics dataset can be a challenging task due to high dimensionality, limited amount of samples, high amount of unknowns in the dataset, and great disparity in the sampling frequencies.

To circumvent problems caused by missing data points, different time-series interpolation techniques exist. For this internship project, a comprehensive performance evaluation of different time-series interpolation techniques will be conducted on a real-life data set to model water microbiome diversity as a measure of different process parameters in a saline wastewater treatment plant in the Netherlands.

Research Objective:

  • Appling different time-series interpolation techniques on real process datasets
  • Performance assessment of the varying time-series interpolation techniques
  • Comparative analysis of the results
  • GitHub repository maintenance for project management
  • Scientific reporting of the outcome



• Experience working in an international environment
• Experience working on a multidisciplinary project
• Contribution to the advancement of water technology


  • EU citizen or international student registered at a Dutch university
  • Specialized in Data Science, Machine Learning, Engineering, Artificial Intelligence, Bio-informatics or bio-statistics
  • Experience with R or Python is a requirement
  • Experience with GitHub is a requirement
  • Fluency in English
  • Interest in water systems and in water microbiomes is an advantage

How to apply:
Please attach your CV (max 2 pages) and motivation letter (max 1 page) in the application form below. You can contact the phd researcher Asala Mahajna through the message option below.

Application form: Applying different time-series interpolation techniques on real process datasets

  • Max. bestandsgrootte: 1 GB.
  • Max. bestandsgrootte: 1 GB.