Data and AI cluster

Research group All Automated Machine Learning Data Mining Database Generative AI Uncertainty in AI Show items up to All 5 years 10 years 15 years

Master projects

Here you can find all our available master projects.

Open Projects (24)

Reinforcement Learning for Efficient Causal Discovery

Understanding causal relationships within data is essential across fields such as healthcare, economics, and social sciences, where knowing "what causes what" guides decision-making and policy. Causal discovery, the process of identifying these relationships and structuring them in causal graphs, remains challenging, especially in complex, …

Devendra Dhami

Maryam Tavakol
More info
Causal Discovery for Offline Model-Based Reinforcement Learning

Reinforcement Learning (RL) has proven effective in a variety of complex decision-making tasks. However, traditional RL requires extensive online interactions, making it costly and, in some domains, impractical due to constraints on safety, time, or resource availability. Offline RL, which relies solely on pre-collected …

Maryam Tavakol

Devendra Dhami
More info
Context-Aware Model-Based Offline Reinforcement Learning

Offline Reinforcement Learning (RL) deals with the problems where simulation or online interaction is impractical, costly, and/or dangerous, allowing to automate a wide range of applications from healthcare and education to finance and robotics. However, learning new policies from offline data suffers from distributional …

Maryam Tavakol
More info
Pattern discovery to improve overlay control loop by using Bayesian inference tooling

This assignment aims to detect and quantify persistent overlay improvements by investigating a larger data set systematically.It will provide you with insights into the overlay performance of ASML lithography machines. You will also learn how ASML maintains machine performance via drift control strategy. As …

Erik Quaeghebeur

SR
Sejong Park, Hamideh Rostami
More info
Examples of missing not at random (MNAR) data that are difficult for classifiers

It often occurs in datasets that there is missing data. A good introduction can be found here: https://stefvanbuuren.name/fimd/.This missingness might be "completely at random" (MCAR). This occurs when the probability of being missing is the same for all cases. An example of MCAR data …

Arthur van Camp
More info
Implementation of inference algorithms for the imprecise Plackett–Luce model

The Plackett–Luce model is a popular parametric probabilistic model to define distributions between rankings of objects, modelling for instance observed preferences of users or ranked performances of algorithms. Since such observations may be scarce (users may provide partial preferences, or not all algorithms are …

Arthur van Camp
More info
Breeding Program Optimization via Offline Reinforcement Learning

Crop breeding programs aim to develop new cultivars with desirable traits through controlled mating within a population, enhancing agricultural productivity while reducing land use, greenhouse gas emissions, and water consumption. However, these programs face challenges like long turnover times, complex decision-making, long-term goals, and climate …

Maryam Tavakol

IA
Ioannis Athanasiadis
More info
Uncertainty Estimation in Model-Based Offline Reinforcement Learning using Random Networks

Offline Reinforcement Learning (RL) deals with the problems where simulation or online interaction is impractical, costly, and/or dangerous, allowing to automate a wide range of applications from healthcare and education to finance and robotics. However, learning new policies from offline data suffers from distributional …

Maryam Tavakol
More info
Long-term Fairness with Offline Reinforcement Learning

One of the main concerns in the recent AI research is that most data-driven approaches preserve the bias or unfairness available in the collected (offline) data in the resulting models, which could lead to harmful social and ethical effects in the society. Fairness-aware machine learning has …

Maryam Tavakol
More info
Implementing inference algorithms for choice functions

In recent years, imprecise-probabilistic choice functions have gained growing interest, primarily from a theoretical point of view. These versatile and expressive uncertainty models have demonstrated their capacity to represent decision-making scenarios that extend beyond simple pairwise comparisons of options, accommodating situations of indecision as …

Arthur van Camp

Cassio de Campos
More info
Generative Random Forests: The Next Level

The work on generative random forests has started, but there is a long way to make them practical. This project aims at studying the drawbacks of such models and improving them with better ensemble ideas, gradient boosting, and/or other techniques already employed with decision …

Cassio de Campos
More info
Probabilistic Circuits versus Bayesian networks

This project aims to compare two different types of generative models: tractable probabilistic circuits and Bayesian networks of bounded tree-width, and potentially have tools to translate between them (when possible). Probabilistic circuits have been recently applied to a number of tasks, but there is …

Cassio de Campos
More info
Hybrid Bayesian networks

This internal project aims at developing and testing (for example in classification tasks) a generative model based on probabilistic graphical models for domains with continuous and categorical variables. We want to learn both the graph structure and parameters of such models while constraining their …

Cassio de Campos
More info
Synthetic data generation for causal learning

An arguably major difficulty for improving causal inferences is the lack of availability of data. While observational data are abundant, interventional data are not. This internal project aims at creating software tools to generate data that can be useful for testing causal learning approaches. …

Cassio de Campos
More info
Scalable Implementation of Probabilistic Circuits

This internal project aims at designing and development a usable software package for learning and reasoning with probabilistic circuits. Probabilistic circuits are models which can represent complicated mixture models and their computation circuit can be wide and deep. Because they have a structure which …

Cassio de Campos
More info
Algorithms for forward irrelevance with choice functions

In recent years, imprecise-probabilistic choice functions have gained growing interest, primarily from a theoretical point of view. These versatile and expressive uncertainty models have demonstrated their capacity to represent decision-making scenarios that extend beyond simple pairwise comparisons of options, accommodating situations of indecision as …

Arthur van Camp

Cassio de Campos
More info
Concepts of independence for choice functions

In recent years, imprecise-probabilistic choice functions have gained growing interest, primarily from a theoretical point of view. These versatile and expressive uncertainty models have demonstrated their capacity to represent decision-making scenarios that extend beyond simple pairwise comparisons of options, accommodating situations of indecision as …

Arthur van Camp

Cassio de Campos
More info
Local inference algorithms for choice functions

In recent years, imprecise-probabilistic choice functions have gained growing interest, primarily from a theoretical point of view. These versatile and expressive uncertainty models have demonstrated their capacity to represent decision-making scenarios that extend beyond simple pairwise comparisons of options, accommodating situations of indecision as …

Arthur van Camp

Cassio de Campos
More info
Interventional Whittle Sum-Product Networks

Whittle sum-product networks [1] model the joint distribution of multivariate time series by leveraging the Whittle approximation, casting the likelihood in the frequency domain, and place a complex-valued sum-product network over the frequencies. The conditional independence relations among the time series can then be …

Devendra Dhami
More info
Dynamic Knowledge Graph Embeddings

Knowledge graph embeddings are an important area of research inside machine learning and has become a necessity due to the importance of reasoning about objects, their attributes and relations in large graphs. There have been several approaches that have been explored and can be …

Devendra Dhami
More info
Efficient Unbiased Training of Large-scale Distributed RL

It is widely known that training deep neural networks on huge datasets improves learning. However, huge datasets and deep neural networks can no longer be trained on a single machine. One common solution is to train using distributed systems. In addition to traditional data-centers, …

Maryam Tavakol

AR
Ali Ramezani-Kebrya
More info
Computational Complexity of Probabilistic Circuits

This internal project aims at studying and devising new bounds for the computational complexity of inferences in probabilistic circuits and their robust/credal counterpart, including approximation results and fixed-parameter tractability. It requires mathematical interest and good knowledge of theory of computation. This is a theoretical …

Cassio de Campos
More info
Learning Bayesian Networks in a Single Step

This internal project aims at implementing a new approach to learning the structure and parameters of Bayesian networks. It is mostly an implementation project, as the novel ideas are already established (but never published, so the approach is novel). It requires high expertise in …

Cassio de Campos
More info
Battle of the credal networks: strong independence or forward irrelevance?

Bayesian networks are a popular model in AI. Credal networks are a robust version of Bayesian networks created by replacing the conditional probability mass functions describing the nodes by conditional credal sets (sets of probability mass functions). Next to their nodes, Bayesian networks are …

Erik Quaeghebeur
More info

Assigned Projects

No currently assigned Projects.

Finished Projects (21)

Model Transfer for Offline Reinforcement LearningNov 2024

Offline Reinforcement Learning (RL) deals with the problems where simulation or online interaction is impractical, costly, and/or dangerous, allowing to automate a wide range of applications from healthcare and education to finance and robotics. However, learning new policies from offline data suffers from distributional shifts …

Maryam Tavakol
More info
On Neural Cellular AutomataJul 2024

The design of collective intelligence, i.e. the ability of a group of simple agents to collectively cooperate towards a unifying goal, is a growing area of machine learning research aimed at solving complex tasks through emergent computation [1, 2]. The interest in these techniques …

Erik Quaeghebeur

Gennaro Gala
More info
Efficient surrogates for the Ainslie wind turbine wake modelAug 2023

In wind farms, one source of reduction in power generation by the turbines is the reduction of wind speed in the wake downstream of each turbine's rotor. Namely, a turbine downstream in the wind direction of another will effectively experience wind with a reduced …

Erik Quaeghebeur

LB
Laurens Bliek (Information Systems, IE&IS)
More info
Identifying robust instances in classificationJul 2023

In a classification task, some instances are classified more robustly than others. Namely, even with a large modification of the training set, these instances (in the test set) will be assigned to the same class. Other instances are non-robust in the sense that a …

Erik Quaeghebeur
More info

Filter Research group All Automated Machine Learning Data Mining Database Generative AI Uncertainty in AI Show items up to All 5 years 10 years 15 years

Master projects

Open Projects (24)

Reinforcement Learning for Efficient Causal Discovery

Causal Discovery for Offline Model-Based Reinforcement Learning

Context-Aware Model-Based Offline Reinforcement Learning

Pattern discovery to improve overlay control loop by using Bayesian inference tooling

Examples of missing not at random (MNAR) data that are difficult for classifiers

Implementation of inference algorithms for the imprecise Plackett–Luce model

Breeding Program Optimization via Offline Reinforcement Learning

Uncertainty Estimation in Model-Based Offline Reinforcement Learning using Random Networks

Long-term Fairness with Offline Reinforcement Learning

Implementing inference algorithms for choice functions

Generative Random Forests: The Next Level

Probabilistic Circuits versus Bayesian networks

Hybrid Bayesian networks

Synthetic data generation for causal learning

Scalable Implementation of Probabilistic Circuits

Algorithms for forward irrelevance with choice functions

Concepts of independence for choice functions

Local inference algorithms for choice functions

Interventional Whittle Sum-Product Networks

Dynamic Knowledge Graph Embeddings

Efficient Unbiased Training of Large-scale Distributed RL

Computational Complexity of Probabilistic Circuits

Learning Bayesian Networks in a Single Step

Battle of the credal networks: strong independence or forward irrelevance?

Assigned Projects

Finished Projects (21)

Model Transfer for Offline Reinforcement LearningNov 2024

On Neural Cellular AutomataJul 2024

Efficient surrogates for the Ainslie wind turbine wake modelAug 2023

Identifying robust instances in classificationJul 2023

Research group All Automated Machine Learning Data Mining Database Generative AI Uncertainty in AI Show items up to All 5 years 10 years 15 years