Data and AI cluster

Research group All Automated Machine Learning Data Mining Database Generative AI Uncertainty in AI Show items up to All 5 years 10 years 15 years

Master projects

Here you can find all our available master projects.

Open Projects (75)

Enhancing Real-World Imitation Learning with Reinforcement Learning

This TU/e master project is setup in collaboration with a robotics start-up in Eindhoven.Company OverviewTeleOperation Services is an innovative company based in Woensel-Noord, Eindhoven. Our cutting-edge AI-driven system empowers robotic arms to imitate tasks and perform them independently with human-like finesse and speed. Through …

Bram Grooten

Thiago Simão
More info
Federated Learning for detecting Lung Disease in Developing Countries

BackgroundDelft Imaging develops mobile X-Ray machines that allow screening in low resource settings, such as developing countries or remote locations. Due to the lack of qualified professionals in these settings, they use computer vision models to automate screening of patients for diseases such as …

Tim d'Hondt

Mykola Pechenizkiy
More info
Safe Contrastive Imitation Learning

MotivationIn safety-critical domains such as autonomous driving, healthcare robotics, and industrial automation, it is imperative for autonomous agents to not only perform tasks efficiently but also safely. Traditional imitation learning enables agents to learn behaviors by mimicking expert demonstrations. However, these methods often overlook …

Tristan Tomilin

Thiago Simão
More info
Part replacement identification using Knowledge Graphs

(This project is also available as an internship)Company: Marel Location: Boxmeer BackgroundIt is important for industrial equipment developers to provide accurate part replacements to their customers. Parts can wear over time or break and having suitable replacements is a dynamic process based on availability, …

Mykola Pechenizkiy

Zeno van Cauter
More info
(PwC) Enhancing Anomaly Detection: Integrating User Explainability

PwC developed an unsupervised Transformer-based anomaly detection tool to enhance insights into machine functionality in factories by analyzing machinery timeseries sensor data. However, the current solution lacks explainability for why certain time windows are flagged as anomalous. Root cause algorithms, such as Bayesian inference, …

Bart Engelen
More info
(PwC) Your thesis at PwC Advisory Data Analytics

Do you want to write your master's thesis about a Data & AI related topic on real-world client cases? PwC offers you the opportunity to write your thesis within PwC's Data Analytics Advisory team. This is a multidisciplinary team that uses its analytical skills …

Bart Engelen
More info
Procedural 3D Environment Generation for Image-Based Reinforcement Learning

As autonomous systems evolve, static simulation environments for training reinforcement learning agents increasingly fail to prepare algorithms for real-world variability. Procedural content generation (PCG) [5] in 3D environments offers a low-cost solution to automatically creating a near-infinite variety of dynamic training scenarios. This has the …

Tristan Tomilin

Meng Fang
More info
A Matrix Factorization approach to Exceptional Model Mining

Exceptional Model Mining aims to identify subgroups in the dataset that behave somehow exceptionally. It differs from a clustering approach since subgroups may overlap; not all data points are assigned to a cluster. However, consequently, the list of subgroups often contains many similar, redundant …

Rianne Schouten

Sibylle Hess
More info
Exceptional Model Mining with Missing Data

In this project, we develop an instance of Exceptional Model Mining using the HBSC dataset (together with UU and Trimbos Institute). The HBSC study is repeated every four years among Dutch adolescents and among others, collects information about their drug and alcohol use. We …

Rianne Schouten

Wouter Duivesteijn
More info
Generating synthetic data with generative modeling

In this project, we aim to generate a synthetic dataset that has similar properties as an existing, longitudinal, medical dataset. In particular, we work together with the Dutch south west Psoriatic Arthritis Registry (DEPAR) study (https://ciceroreumatologie.nl/depar), situated at Erasmus MC. Generating a synthetic version …

Rianne Schouten

Jakub Tomczak
More info
Analyzing mathematical learning abilities in children using re-description mining for hierarchical data

In this project, we analyze learning behavior in young children. We work with data collected by the Turku Research Institute for Learning Analytics, where children perform a variety of computer assisted tasks such as comparing numbers and simple calculation tasks. Re-description mining is a …

Rianne Schouten

Mykola Pechenizkiy
More info
Alternative latent space models for vessel re-identification

Coastal surveillance cameras are often used to detect (distinguish from the background) and recognize (as belonging to a class) non-cooperative vessels, i.e. vessels not reporting their position and identity using an AIS [1] transponder through a TDMA network such that nearby AIS base stations …

Mykola Pechenizkiy

Stiven Schwanz Dias
More info
Vision-centric image tokenization in the generative transformer era

Generative autoregressive next token prediction has shown impressive success in LLMs. Several works have attempted to extend the success of LLMs to vision-language tasks with VLMs. While a VLM can be designed specifically for image-to-text tasks like visual question answering, many works also attempt …

Bahram Zonooz

Elahe Arani
More info
Maritime traffic anomaly detection

Coastal surveillance systems rely on multiple sensors to perform object assessment [1], i.e., to detect and track the sequence of vessels' states including their position and velocity (where are the vessels at a given timestamp?). In general, surface radars are employed as a primary …

Mykola Pechenizkiy

Stiven Schwanz Dias
More info
Leveraging Language Semantics for Enhanced Understanding and Generalization

The field of artificial intelligence has seen unprecedented growth in recent years, particularly with the advent of foundation models and large language models (LLMs). These models have showcased remarkable capabilities across a broad spectrum of applications, including natural language processing and multimodal tasks. Traditionally, …

Bahram Zonooz

Elahe Arani
More info
Architectural Analysis of Vision Transformers in Continual Learning

Deep neural networks (DNN) deployed in the real world are frequently exposed to non-stationary data distributions and required to sequentially learn multiple tasks. This requires that DNNs acquire new knowledge while retaining previously obtained knowledge and this is imperative in applications like autonomous driving …

Bahram Zonooz

Elahe Arani
More info
True continual learners in the wild: CL beyond artificial constraints / datasets

Continual Learning (CL) is a learning paradigm in which computational systems progressively acquire multiple tasks as new data becomes available over time. An effective CL system must find a balance between being adaptable to integrate new information and maintaining stability to prevent disruption of …

Bahram Zonooz

Elahe Arani
More info
Exploring sparsity in lifelong learning

In the dynamic world, deep neural networks (DNNs) must continually adapt to new data and environments. Unlike humans, who can learn continually without forgetting past knowledge, DNNs often suffer from catastrophic forgetting when exposed to new data, causing them to lose previously acquired information. …

Bahram Zonooz

Elahe Arani
More info
Image representation learning in autoregressive transformers

With the recent success of LLMs, and the strong potential of multi-modal learning from both text and vision, several works have framed images as sequences to conform with generative sequence-to-sequence encoder-decoder or decoder based transformers [1]. Such formulations present advantages such as unified architectures …

Bahram Zonooz

Elahe Arani
More info
Understanding deep learning – exploring the development of complexity in neural networks over depth and time

Introduction: When we train deep, nonlinear neural networks, we often assume that the applied transformations at every layer are effectively nonlinear. Earlier work (Kalimeris et al., 2019)has shown that in the beginning of training, the complete function that deep, nonlinear networks implement is close …

Hannah Pinson

Aurélien Boland
More info
AI for 3D Concrete Printing

Designing 3D printable materials has been, so far, a trial-and-error process dependent on human knowledge and effort; hence time-consuming and wasteful. To predict certain properties of 3DCP, material scientists have used modelling and simulations for decades. While helpful in many ways, models mostly require …

Mykola Pechenizkiy

SB
Sandra Lucas and Önder Babur
More info
Continual Reinforcement Learning with Language Instructions

Continual reinforcement learning (CRL) stands as a pivotal paradigm in the AI landscape, fostering the development of adaptive and lifelong learning agents. This project delves into the intersection of CRL and natural language processing within the immersive realm of 3D simulation environments. The integration …

Tristan Tomilin

Meng Fang
More info
Large language Model Based Chatbots and their Applications

In recent years, large language models have revolutionized how machines understand and generate human-like text, offering profound implications for chatbot technology. This thesis proposes a deep exploration into the capabilities of these models within chatbot applications, aiming to enhance how they mimic human conversational …

Meng Fang

Jiaxu Zhao
More info
Transfer Learning for Robot-to-Robot Adaptation

A popular paradigm in robotic learning is to train a policy from scratch for every new robot. This is not only inefficient but also often impractical for complex robots. The project revolves around the exploration and advancement of techniques for transferring policies between different …

Meng Fang

Tristan Tomilin
More info
Playing Text-based Games with Large Language Models

The project aims to explore the utilization of sophisticated language models in the domain of text-based games. This endeavor seeks to harness the capabilities of large language models, such as GPT (Generative Pre-trained Transformer), in the context of interactive narratives, text adventures, and other …

Meng Fang

Yudi Zhang
More info
Language Agents for Playing Card Games

The project is a pioneering initiative that combines Natural Language Processing (NLP) and Reinforcement Learning (RL) methodologies to create intelligent agents capable of understanding natural language instructions and participating in playing card games. This project aims to develop AI-driven agents that not only comprehend …

Meng Fang

Yudi Zhang
More info
AI for histopathology of melanoma

BackgroundMelanoma is a form of skin cancer that originates in melanin-producing cells known as melanocytes. While other skin cancer types occur more frequently, melanoma is most dangerous due to the high likelihood of metastasis if not treated early. The incidence rate of melanoma has …

Sibylle Hess

MV
Mitko Veta
More info
Counterfactual explanations

I plan to offer a few assignments on counterfactual explanationsCounterfactual explanations on evolving dataFeasibility, actionability and personalization of counterfactual explanationsCounterfactual explanations for spotting unwanted biased in predictive model behaviourValue alignment for counterfactual explanations (in collaboration with Emily Sullivan)Counterfactual explanations for behaviour change

Mykola Pechenizkiy
More info
Enhancing Reconstructive Surgery Decision-Making

The goal of this project would be to come up with a transformer or any other smart solution to (in a one sentence oversimplified description) find mappings between an image of the current patient condition, possible surgery actions and preferred outcome image. A more detailed …

Mykola Pechenizkiy
More info
Safe reinforcement learning with decision transformers

Safety is a core challenge for the deployment of reinforcement learning (RL) in real-world applications [1]. In applications such as recommender systems, this means the agent should respect budget constraints [2]. In this case, the RL agent must compute a policy condition of the …

Thiago Simão
More info
Learning Decision Trees to Reduce the Sample Complexity of Offline Reinforcement Learning

Reinforcement Learning (RL) deals with problems that can be modeled as a Markov decision process (MDP) where the transition function is unknown. When an arbitrary policy was already in execution, and the experiences with the environment were recorded in a dataset, an offline RL …

Thiago Simão
More info
Reinforcement Learning for Configurable Systems

Nowadays, most software systems are configurable, meaning that we can tailor the settings to the specific needs of each user. Furthermore, we may already have some data available indicating each user's preferences and the software's performance under each configuration. This way, we can compute …

Thiago Simão
More info
Deriving Valuation Bases to Expand GP-Growth to More EMM Model Classes

See PDF

Wouter Duivesteijn
More info
Deriving Upper Confidence Bounds to Expand Monte-Carlo Tree Search to More EMM Model Classes

See PDF

Wouter Duivesteijn
More info
Surveying to Bring Order in the Jungle of Supervised Local Pattern Mining Implementations

See PDF

Wouter Duivesteijn
More info
Exceptional Gestalt Mining (EGM)

See PDF

Wouter Duivesteijn

TD
Thomas van Dijk
More info
Expanding Exceptional Model Mining on Unstructured Data

See PDF. As attachment, see also https://wwwis.win.tue.nl/~wouter/MSc/Niels.pdf

Wouter Duivesteijn
More info
Finding the Curse of Dimensionality Sweet Spot Between Traditional Clustering and Deep Clustering

See PDF

Wouter Duivesteijn

Sibylle Hess
More info
Equivariant Neural Simulators for Probabilistic Fluid Dynamics Simulation

TL;DR: In this project, you will focus on developing a model architecture that can efficiently simulate fluid dynamics, while taking into account the vast amount of domain knowledge in the field in the form of symmetries, as well as the modeling of stochastic effects …

Vlado Menkovski

Koen Minartz
More info
Understanding Out-of-distribution Detection

There are numerous methods for out-of-distribution (OOD) detection and related problems in deep learning, see e.g. [1] for an overview. Many of these however only work well in highly fine-tuned settings and are not well understood in broader context. In this project, you would …

Sibylle Hess

Jan Moraal
More info
Explaining learned features by generative models

In order to get some insight into the inner workings of deep neural network classifiers, a method that enables the interpretation of learned features would be very helpful. This master project is loosely based on the approach presented in [1], where a GAN is …

Sibylle Hess

WM
Wil Michiels
More info
Topics in Deep Clustering

Deep clustering is a well-researched field with promising approaches. Traditional nonconvex clustering methods require the definition of a kernel matrix, whose parameters vastly influence the result, and are hence difficult to specify. In turn, the promise of deep clustering is that a feature transformation …

Sibylle Hess
More info
Physics-Informed Neural Simulation of Cellular Dynamics

TL;DR: In this project, you will develop a framework for integrating domain knowledge into generative models for cellular dynamics simulations, and apply the method to (synthetic) data of e.g. cancer cell migration. Project description: Studying the variety of mechanisms through which cells migrate and interact …

Vlado Menkovski

Koen Minartz
More info
ML Simulation of Nuclear Fusion reactors

Thermonuclear fusion holds the promise of generating clean energy on a large scale. One promising approach for controlled fusion power generation is the tokamak, a torus-shaped device that magnetically confines the fusion plasma in its vessel. Currently, not all physical processes in these plasmas …

Vlado Menkovski

Yoeri Poels
More info
Conditional Generation of Materials for Carbon Capture

n recent years, the urgency of addressing the climate crisis, resulting from escalating greenhouse gas emissions, has increased. A potential solution for the increasing amount of CO2 in the air is carbon capture. Zeolites are potential candidate materials for carbon capture, as they are …

Vlado Menkovski

MP
Marko Petkovic
More info
Evaluating Explanations of Model Predictions

The black-box nature of neural networks prohibits their application in impactful areas, such as health care or generally anything that would have consequences in the real world. In response to this, the field of Explainable AI (XAI) emerged. State-of-the-art methods in XAI define a …

Sibylle Hess

WM
Wil Michiels
More info
Physics Informed AI for improved cancer prognosis

In order to metastasize, cancer cells need to move. Estimating the ability for cells to move, i.e. their dynamics, or so-called migration potential, is a promising new indicator for cancer patient prognosis (overall survival) and response to therapy. However, predicting the migration potential from …

Sibylle Hess

SV
Secondary supervisors could be Liesbeth Janssen or Mitko Vetka.
More info
Comparison of E(n)-equivariant GNNs for metamaterial simulation

Soft, porous metamaterials are materials that consist of a flexible base material (e.g., rubber-like material) with pores of a carefully designed shape in it. Under external loading (a pressure applied on the outside surface, mechanical constraints, or other interactions), they deform which in turn …

Vlado Menkovski

OH
Ondrej Rokos, Fleur Hendriks
More info
Peer-to-peer Federated Learning

--update--: This project is now taken by Davis EisaksThe goal of this project is to study how to train a machine learning model in a gossip-based approach, where if two devices (e.g smartwatches) pass each other in the physical space, they could exchange part of …

Mykola Pechenizkiy

TD
Tim d'Hondt
More info
Bayesian Federated Learning using node-based BNNs.

Node-based BNNs assign latent noise variables to hidden nodes of a neural network. By restricting inference to the node-based latent variables, node stochasticity greatly reduces the dimension of the posterior. This allows for inference of BNNs that are cheap to compute and to communicate, …

Mykola Pechenizkiy

TD
Tim d'Hondt
More info
ML projects at ASML

ASML has recently re-confirmed there two projects; a couple more will likely be confirmed in the coming weeksXAI in Exceptional Model Mining (--- update --- this project is taken by Yasemin Yasarol)In the semiconductor industry there are different, diverse and unique failure modes that impact …

Mykola Pechenizkiy

TT
tbc
More info
ML projects at Floryn

--- update --- These projects are no longer available. Theonymfi Anogeianaki will work on FairML.1. Bayesian inferenceWe have been doing ‘traditional’ machine learning for years now at Floryn but never investigated Bayesian modeling. We currently make use of probability measures that come from our (frequentist) machine learning …

Mykola Pechenizkiy
More info
Continual Structure from Motion

Autonomous vehicles and robots need 3D information such as depth and pose to traverse paths safely and correctly. Classical methods utilize hand-crafted features that can potentially fail in challenging scenarios, such as those with low texture [1]. Although neural networks can be trained on …

Bahram Zonooz

Elahe Arani
More info
Feature selection, sparse neural networks, truly sparse implementations, and societal challenges

Context of the work: Deep Learning (DL) is a very important machine learning area nowadays and it has proven to be a successful tool for all machine learning paradigms, i.e., supervised learning, unsupervised learning, and reinforcement learning. Still, the scalability of DL models is …

Mykola Pechenizkiy

Ghada Sokar
More info
Diversifying attention through randomization in sparse neural networks training

Context of the work: Deep Learning (DL) is a very important machine learning area nowadays and it has proven to be a successful tool for all machine learning paradigms, i.e., supervised learning, unsupervised learning, and reinforcement learning. Still, the scalability of DL models is …

Mykola Pechenizkiy

Ghada Sokar
More info
Topics in Continual Lifelong Learning

Nowadays, data changes very rapidly. Every day new trends appear on social media with millions of images. New topics rapidly emerge from the huge number of videos uploaded on Youtube. Attention to continual lifelong learning has recently increased to cope with this rapid data …

Mykola Pechenizkiy

Ghada Sokar
More info
Multi-modal Representation Learning and Applications

With the rapid development of multi-media social network platforms, e.g., Instagram, Tiktok, etc., more and more content is generated in the multi-modal format rather than pure text. This brings new challenges for researchers to analyze the user generated content and solve some concrete problems …

Yulong Pei

Tianjin Huang
More info
Architectural Analysis of Vision Transformers in Continual Learning

Deep neural networks (DNN) deployed in the real world are frequently exposed to non-stationary data distributions and required to sequentially learn multiple tasks. This requires that DNNs acquire new knowledge while retaining previously obtained knowledge. However, continual learning in DNNs, in which networks are …

Elahe Arani

Bahram Zonooz
More info
Human Visual System Inspired Mechanisms for Data Curation

Every second, around 107 to 108 bits of information reach the human visual system (HVS) [IK01]. Because biological hardware has limited computational capacity, complete processing of massive sensory information would be impossible. The HVS has therefore developed two mechanisms, foveation and fixation, that preserve perceptual performance …

Bahram Zonooz

Elahe Arani
More info
Human Visual System Inspired Mechanisms for Interpretability

Every second, around 107 to 108 bits of information reach the human visual system (HVS) [IK01]. Because biological hardware has limited computational capacity, complete processing of massive sensory information would be impossible. The HVS has therefore developed two mechanisms, foveation and fixation, that preserve perceptual performance …

Elahe Arani

Bahram Zonooz
More info
Human Visual System Inspired Mechanisms for Video Action Recognition/Prediction

Every second, around 107 to 108 bits of information reach the human visual system (HVS) [IK01]. Because biological hardware has limited computational capacity, complete processing of massive sensory information would be impossible. The HVS has therefore developed two mechanisms, foveation and fixation, that preserve perceptual …

Bahram Zonooz

Elahe Arani
More info
Online knowledge distillation for self-supervised learning

Self-supervised learning [1, 2] solves pretext prediction tasks that do not require annotations in order to learn feature representations. Recent empirical research has demonstrated that deeper and wider models benefit more from task-agnostic use of unlabeled data than their smaller counterparts; i.e., smaller models …

Bahram Zonooz

Elahe Arani
More info
Deep Clustering: Simultaneous Optimization of Representations and Clustering

Deep clustering is a well-researched field with promising approaches. Traditional nonconvex clustering methods require the definition of a kernel matrix, whose parameters vastly influence the result, and are hence difficult to specify. In turn, the promise of deep clustering is that a feature transformation …

Sibylle Hess
More info
Overcoming data scarcity in visual object detection and recognition tasks with frugal learning

IntroductionThe Observe, Orient, Decide and Act (OODA) loop [1] shapes most modern military warfare doctrines. Typically, after gathering sensor and intelligence data in the Observe step, a common tactical operating picture of the monitored aerial, maritime and/or ground scenario is built and shared among …

Mykola Pechenizkiy

Stiven Schwanz Dias
More info
Fairness-aware Influence Minimization for Combating Fake News

Influence blocking and fake news mitigation have been the main research direction for the network science and data mining research communities in the past few years. Several methods have been proposed in this direction [1]. However, none of the proposed solutions has proposed feature-blind …

Mykola Pechenizkiy

Akrati Saxena
More info
Fairness in Network Anonymization

In the past 10-15 years, a massive amount of social networking data has been released publicly and analyzed to better understand complex networks and their different applications. However, ensuring the privacy of the released data has been a primary concern. Most of the graph …

Mykola Pechenizkiy

Akrati Saxena
More info
Towards Cognitive-inspired Adversarial Training Approach

Deep neural networks (DNN) are achieving superior performance in perception tasks; however, they are still riddled with fundamental shortcomings. There are still core questions about what the network is truly learning. DNNs have been shown to rely on local texture information to make decisions, …

Elahe Arani

Bahram Zonooz
More info
Generalizable, fair and explainable default predictors

Context:Financial sector is a tightly regulated environment. All models used in the financial sector, are studied under the microscope of developers, validators, regulators, and eventually the end users – the clients, before these models can be deployed and used.To assess whether a customer should be …

Mykola Pechenizkiy

DD
DLL
More info
Data-Efficient Reinforcement Learning under Constraints

Reinforcement learning (RL) is a general learning, predicting, and decision-making paradigm and applies broadly in many disciplines, including science, engineering, and humanities. Conventionally, classical RL approaches have seen prominent successes in many closed world problems, such as Atari games, AlphaGo, and robotics. However, dealing …

Mykola Pechenizkiy

Danil Provodin
More info
Input Adaptive Inference for Semantic Segmentation

Neural networks typically consist of a sequence of well-defined computational blocks that are executed one after the other to obtain an inference for an input image. After the neural network has been trained, a static inference graph comprising these computational blocks is executed for …

Bahram Zonooz

Elahe Arani
More info
Robust symbol detection and recognition in piping and instrumentation diagrams

Project description This project is concerned with the recognition of symbols of piping and process equipment together with the instrumentation and control devices that appear on piping and instrumentation diagrams (P&ID). Each item on the P&ID is associated with a pipeline. Piping engineers often receive drawings …

Mykola Pechenizkiy

Stiven Schwanz Dias
More info
Fairness Analysis in Anomaly Detection

In anomaly detection, we aim to identify unusual instances in different applications, including malicious users detection in OSNs, fraud detection, and suspicious bank transaction detection. Most of the proposed anomaly detection methods are dependent on network structure as some specific structural pattern can convey …

Mykola Pechenizkiy

Akrati Saxena
More info
Curiosity driven fairness in Reinforcement learning

Reinforcement learning (RL) is a computational approach to automating goal-directed decision making. In this project, we will use the framework of Markov decision processes. Fairness in reinforcement learning [1] deals with removing bias from the decisions made by the algorithms. Bias or discrimination in …

Mykola Pechenizkiy

Pratik Gajane
More info
Generate fair (pseudo) samples for reinforcement learning

Reinforcement learning (RL) is a computational approach to automating goal-directed decision making. Reinforcement learning problems use either the framework of multi-armed bandits or Markov decision processes (or their variants). In some cases, RL solutions are sample inefficient and costly. To address this issue, some …

Mykola Pechenizkiy

Pratik Gajane
More info
Causal perspective of fairness in reinforcement learning

Reinforcement learning (RL) is a computational approach to automating goal-directed decision making using the feedback observed by the learning agent. In this project, we will be using the framework of multi-armed bandits and Markov decision processes. Observational data collected from real-world systems can mostly …

Mykola Pechenizkiy

Pratik Gajane
More info

Assigned Projects (17)

[Internship at TNO] Dynamic road space allocation with shared mobility hubsNov 2024

The XCARCITY project investigates how to facilitate and support implementation of car-free areas in Amsterdam, Almere Pampus and Metropoolregio Rotterdam Den Haag.Car-free and car-low areas offer many benefits by freeing up road space, reducing congestion and parking requirements, and generally contributing to increased livability …

DV
Dido Verstegen

Thiago Simão

CP
Canmanie T. Ponnambalam
More info
Reinforcement Learning for Contact-rich and Impact-aware Robotic TasksNov 2024

Reinforcement Learning (RL) [6] has achieved successful outcomes in multiple applications, including robotics [1]. A key challenge to deploying RL in such a scenario is to ensure the agent is robust so it does not lose performance even if the environment's geometry and dynamics …

Bram Grooten

Thiago Simão
More info
Dual arm manipulation of heavy objects with humanoid robots via reinforcement learningNov 2024

Motivation. Reinforcement Learning(RL; Sutton and Barto 2018) has achieved successful outcomes in multiple applications, including robotics(Kober, Bagnell, and Peters 2013). A key challenge to deploying RL in such a scenario is to ensure the agent is robust so it does not lose performance even …

Bram Grooten

Thiago Simão
More info
Multi-agent reinforcement learning for sustainable touristic recommender systemNov 2024

A touristic recommender system (TRS; Dalla Vecchia et al., 2024; Gaonkar et al., 2018; de Nijs et al., 2018) often provides to its users a sequence of recommendations instead of a single suggestion to optimize the user experience in the available time interval. Due …

PD
Paul Dewez

Thiago Simão

EQ
Elisa Quintarelli
More info
High-performance Safe RL BenchmarkingNov 2024

As AI systems become increasingly integral to critical sectors, ensuring their safety and reliability is essential. Reinforcement Learning (RL) is a prominent method that learns optimal behaviors through trial-and-error interactions with a dynamic environment. Yet, the stakes are high: in physical settings, a wrong …

MB
Mourad Boustani

Tristan Tomilin

Thiago Simão
More info
Adversarial attacks for safe transfer in reinforcement learningNov 2024

Safety is a paramount challenge for the deployment of autonomous agents. In particular, ensuring safety while an agent is still learning may require considerable prior knowledge (Carr et al., 2023; Simão et al., 2021). A workaround is to pre-train the agent in a similar …

CM
Cheuk Lam Mo

Thiago Simão
More info
Understanding deep learning: efficient retraining of networksJul 2024

Recent work has shown that neural networks, such as fully connected networks and CNNs, learn to distinguish between classes from broader to finer distinctions between those classes [1,2] (see Fig. 1). Figure 1: Illustration of the evolution of learning from broader to finer distinctions between …

Hannah Pinson
More info
Understanding deep learning: the initial learning rateJul 2024

This project is finished/closed. While deep learning has become extremely important in industry and society, neural networks are often considered ‘black boxes’, i.e., it is often believed that it is impossible to understand how neural networks really work. However, there are a lot of …

Hannah Pinson
More info
Multi-Agent Reinforcement Learning for Cooperative TasksFeb 2024

Multi-Agent Reinforcement Learning (MARL) is a field in artificial intelligence where multiple agents learn to make decisions in an environment through reinforcement learning. In the context of cooperative tasks, it involves agents working together to achieve common goals, sharing information and coordinating their actions …

LB
Luka van den Boogaard

Meng Fang

Tristan Tomilin
More info
Pimp my BUS: Improving a Minicluster-based Deterministic Pattern Sampling Algorithm for Exceptional Model MiningNov 2023

See PDF. As attachment, see also https://wwwis.win.tue.nl/~wouter/MSc/Bart.pdf

LK
Lars Kuijten

Wouter Duivesteijn
More info
Preventing Beam Pollution: Defining an Empirical Protocol to Improve Beam Search Lattice TraversalOct 2023

See PDF

BS
Bart Slenders

Wouter Duivesteijn
More info
Diversity of recommendations in collaboration with Bol.comFeb 2023

Recommender Systems (RSs) have emerged as a way to help users find relevant information as online item catalogs increased in size. There is an increasing interest in systems that produce recommendations that are not only relevant, but also diverse [1]. In addition to users, increased …

Mykola Pechenizkiy

Hilde Weerts
More info
Simulation of Nanopore sequencing (with generative models)Jan 2023

---UPDATE---: This project is now taken by Jonas NiederleNanopore sequencing is a third-generation sequencing method that directly measures long DNA or RNA (Figure 1). The method works by translocating a single DNA strand through a Nanopore in which an electric current signal is measured. The …

Vlado Menkovski
More info
Bubble simulation with latent variable modelsNov 2022

--update--: This project is now taken byTijs TeulingsThe topic of the project is simulation of bubbles with deep generative models. Bubbles are a fascinating phenomenon in multiphase flow, and they play an important role in chemical, industrial processes. Bubbles can be simulated well with …

Vlado Menkovski
More info
Aspect-based Few-shot classification (Meta-learning)Nov 2022

--- UPDATE ---: This project is now taken by Tim van EngelandMeta-learning (also referred to as learning to learn) is a set of Machine Learning techniques that aim to learn quickly from a few given examples in changing environments [1]. One instantiation of the meta-learning …

Vlado Menkovski
More info
Exceptional Gestalt Mining (EGM)Nov 2022

See PDF

PR
Pim Rietjens

Wouter Duivesteijn

TB
Thomas C. van Dijk, Ruhr-Universität Bochum
More info
Generating Missing At Random (MAR) data in Images; a Convolutional ApproachNov 2022

See PDF

SH
Sam Al Habash

Wouter Duivesteijn

Vlado Menkovski
More info

Finished Projects (21)