Data and AI cluster

Research group All Advanced Models through Open Research and Engineering Data Mining Database Machine Learning for Physical Sciences Scalable Online Data Management Uncertainty in AI Show items up to All 5 years 10 years 15 years

Master projects

Here you can find all our available master projects.

Open Projects (8)

Synopses for continual learning

In this project you will consider the use of synopses (course 2AMD15) for continual learning. You will (a) explore how existing synopses can be used to support continual learning tasks, e.g., to mitigate forgetting (b) develop novel sketches, if needed, (c) prove their properties …

Odysseas Papapetrou

Mykola Pechenizkiy
More info
Is SQL sufficient to support visual analytics

Databases often act as the backend for visualization -- to safely store the data, and to aggregate/serve it to the visualization layer efficiently, such that it is shown to the user in a way that helps decision making. This connection between the two layers …

Odysseas Papapetrou
More info
Applications of minwise sampling

Minwise sampling (or MinHash) is a collection of methods that estimate similarity between sets. Most methods assume static data. A new method, designed last year in our group, also works with non-static (i.e., streaming) data, and it can support deletion. This thesis will focus …

Odysseas Papapetrou

WP
Wieger R. Punter (PhD student)
More info
Extending Omnisketch

In [1] we proposed OmniSketch, the first sketch that supports OLAP-like analytics. In this thesis you will consider either of the two options: (a) distributing OmniSketch such that it works efficiently over large clusters, (b) making it able to handle sliding windows queries, by using …

Odysseas Papapetrou

Wieger Punter
More info
Spatial sketches -- topic 1

The recent work "Synopses for summarizing spatial data streams" describes a framework that allows any existing synopsis to summarize spatial data. This thesis focuses on further extending this work by replacing the simple regular grid structure that is used now with other, more space …

Odysseas Papapetrou

Wieger Punter
More info
Spatial sketches -- topic 2

The recent work "Synopses for summarizing spatial data streams" describes a framework that allows any existing synopsis to summarize spatial data. This thesis focuses on further extending this work by rethinking the allocation of space in the spatial sketch. For example, areas in the …

Odysseas Papapetrou

Wieger Punter
More info
Synopses meet machine learning

Training ML models over big data is a time-consuming and energy-hungry process. Furthermore it requires full access over the data, which is challenging in many use cases, due to the size of the data. The problem is particularly challenging when the data is read …

Odysseas Papapetrou

Mykola Pechenizkiy
More info
Time-series search: to index or not to index

Time series data is widely generated and used across various fields, including healthcare, finance, and surveillance. For example, in the stock market, the changes in stock prices throughout the day form a time series. In such contexts, it is often important to perform searches—either …

Odysseas Papapetrou
More info

Assigned Projects

No currently assigned Projects.

Research group All Advanced Models through Open Research and Engineering Data Mining Database Machine Learning for Physical Sciences Scalable Online Data Management Uncertainty in AI Show items up to All 5 years 10 years 15 years

Master projects

Open Projects (8)

Synopses for continual learning

Is SQL sufficient to support visual analytics

Applications of minwise sampling

Extending Omnisketch

Spatial sketches -- topic 1

Spatial sketches -- topic 2

Synopses meet machine learning

Time-series search: to index or not to index

Assigned Projects

Finished Projects (43)

Survey and evaluation of IoT hubs for data ingestionFeb 2026

Detection of high-order correlations in health dataNov 2025

Data access for ambulance care personnelOct 2025

Improved monitoring for home-care patientsOct 2025

Lagged multivariate correlationsMay 2025

Detection of similarities and correlations in multidimensional time seriesMay 2025

Discovery and maintenance of heavy hitters over sliding windows, in a distributed environment.May 2025

Multivariate correlations for data cleaningMay 2025

Correlation Detective on streaming dataJul 2023

Filter Research group All Advanced Models through Open Research and Engineering Data Mining Database Machine Learning for Physical Sciences Scalable Online Data Management Uncertainty in AI Show items up to All 5 years 10 years 15 years

Master projects

Open Projects (8)

Synopses for continual learning

Is SQL sufficient to support visual analytics

Applications of minwise sampling

Extending Omnisketch

Spatial sketches -- topic 1

Spatial sketches -- topic 2

Synopses meet machine learning

Time-series search: to index or not to index

Assigned Projects

Finished Projects (43)

Survey and evaluation of IoT hubs for data ingestionFeb 2026

Detection of high-order correlations in health dataNov 2025

Data access for ambulance care personnelOct 2025

Improved monitoring for home-care patientsOct 2025

Lagged multivariate correlationsMay 2025

Detection of similarities and correlations in multidimensional time seriesMay 2025

Discovery and maintenance of heavy hitters over sliding windows, in a distributed environment.May 2025

Multivariate correlations for data cleaningMay 2025

Correlation Detective on streaming dataJul 2023

Research group All Advanced Models through Open Research and Engineering Data Mining Database Machine Learning for Physical Sciences Scalable Online Data Management Uncertainty in AI Show items up to All 5 years 10 years 15 years