Validation of Synthetic Data generated by Artificial Intelligence

PhD Offer: Validation of Synthetic Data generated by Artificial Intelligence

The GRBIO research group on biostatistics and bioinformatics is currently inviting applications for a PhD position. The main focus of this research project will be the validation of synthetic data using several statistical tools. The position is funded for three years and will be based at the Department of Statistics and Operations Research of the Universitat Politècnica de Catalunya.

PhD Project:

The PhD project will focus on the validation of synthetic data generated using algorithms based on Artificial Intelligence. Synthetic data generation has become a popular method in data science to overcome data privacy concerns and enable large-scale data sharing. However, synthetic data must be validated to ensure that they preserve the key characteristics of the original data and at the same time, we want it to be different enough to preserve the privacy of the individuals involved. The project will aim to develop new statistical tests and procedures to validate synthetic data generated from different sources.

Tasks:

The successful candidate will be expected to carry out the following tasks:

  • Conduct an extensive literature review on synthetic data generation and validation methods
  • Develop statistical methods to validate synthetic data generated from different sources
  • Implement and apply the proposed methods on simulated and real-world data sets
  • Evaluate the performance of the proposed methods and compare them with existing ones
  • Interpret the results and communicate findings in academic papers and presentations

Requirements:

The ideal candidate should have a master's degree in biostatistics, statistics, mathematics, computer science or a related field. Strong programming skills in R or Python are required, and experience in data analysis, statistical modeling, and machine learning would be advantageous. The candidate should have excellent written and verbal communication skills in English and be able to work independently and as part of a team.

Funding:

This PhD position will be funded by the project ENIA (Estrategia Nacional de Inteligencia Artificial), which is part of the NextGenerationEU initiative. NextGenerationEU is a European Union program that aims to support the economic recovery of the European Union in the aftermath of the COVID-19 pandemic. The ENIA project focuses on developing new methods and technologies in artificial intelligence that can be applied to different sectors of the economy, including healthcare. The PhD candidate will have the opportunity to work closely with other researchers and stakeholders involved in the project, and contribute to the development of new knowledge in this exciting and rapidly growing field.

Application:

Interested candidates should send their CV, academic transcripts, a letter of motivation, and contact details of two referees to the project supervisors, Dr. Daniel Fernández () and Dr. Jordi Cortés () by the 30th of June 2023. Shortlisted candidates will be invited for an interview, either in person or via video conference, and the successful candidate will be notified by the end of July 2023.

We welcome applicants from all backgrounds, genders, and nationalities. The university is committed to promoting a diverse and inclusive community, and we encourage candidates to apply who will contribute to this goal.