Researcher profile

George Papadakis

George Papadakis contributes to research discovery and scholarly infrastructure.

Researcher | Affiliation not imported | Open to collaborate

Trust snapshot

Quick read

Trust 21 - Emerging
Verification L1
Unclaimed author
12 works
0 followers
2 topics
4 close collaborators

Published work

12 published items

preprint, 2026, arXiv

Computational study of airfoil stall flutter Limit Cycle Oscillations

This paper presents a comprehensive numerical investigation of a NACA0012 airfoil undergoing Stall Flutter Limit Cycle Oscillations (LCO) across distinct fluid dynamics regimes. It accurately models Small Amplitude Oscillations (SAO) in the transitional Reynolds regime and Large Amplitude Oscillations (LAO) in the moderate regime, observed in different experimental campaigns. The SAO analysis serves as a verification of the computational framework against established numerical benchmarks. Crucially, the LAO simulations represent the first documented prediction across the full experimental velocity range correlated against available measured data, addressing a significant literature gap. The fidelity of the predictions relies on rigorous computational criteria defined through a detailed sensitivity analysis. This analysis demonstrated numerical requirements significantly more demanding than those typically employed for computing static polars or simulating dynamic pitching motion of rigid airfoils, underscoring the severity of the aeroelastic problem. Quantitatively, the simulation systematically over-predicts the critical onset velocity and under-predicts the LCO amplitudes. However, the results show strong qualitative agreement with experimental observations, successfully reproducing key dynamic stall mechanics and bifurcation phenomena.

preprint, 2026, arXiv

SPER: Accelerating Progressive Entity Resolution via Stochastic Bipartite Maximization

Entity Resolution (ER) is a critical data cleaning task for identifying records that refer to the same real-world entity. In the era of Big Data, traditional batch ER is often infeasible due to volume and velocity constraints, necessitating Progressive ER methods that maximize recall within a limited computational budget. However, existing progressive approaches fail to scale to high-velocity streams because they rely on deterministic sorting to prioritize candidate pairs, a process that incurs prohibitive super-linear complexity and heavy initialization costs. To address this scalability wall, we introduce SPER (Stochastic Progressive ER), a novel framework that redefines prioritization as a sampling problem rather than a ranking problem. By replacing global sorting with a continuous stochastic bipartite maximization strategy, SPER acts as a probabilistic high-pass filter that selects high-utility pairs in strictly linear time. Extensive experiments on eight real-world datasets demonstrate that SPER achieves significant speedups (3x to >6x) over state-of-the-art baselines while maintaining comparable recall and precision.

preprint, 2022, arXiv

A hybrid Lagrangian-Eulerian flow solver applied to elastically mounted cylinders in tandem arrangement

The fluid structure interaction of cylinders in tandem arrangement is used as the validation basis of a multi-domain Lagrangian-Eulerian hybrid flow solver. In this hybrid combination, separate grids of limited width are defined around every solid body, on which the Eulerian flow equations are solved using finite volume approximations. In order to interconnect the domains defined by the grids, the entire flow is described in Lagrangian coordinates and the corresponding equations are solved via particle approximations in fully coupled mode with the solutions within the Eulerian grids. The flow solver is also strongly (implicitly) coupled with the structural dynamic equations when the cylinders are elastically supported. In the present work, the Eulerian part solves the compressible flow equations in density-velocity-pressure formulation and uses pre-conditioning at low Ma, while the Lagrangian part is based on the density-dilatation-vorticity-pressure formulation. The hybrid solver is first validated in the case of an isolated rigid cylinder at $Re=100$. Then the case of a single elastically mounted cylinder at $Re=200$ is considered, followed by the case of two cylinders in tandem arrangement that are either rigid or elastically mounted. Good agreement with results produced with spectral and immersed boundary methods is found, indicating the capabilities of the hybrid predictions. The flexibility of the method in handling complex multi-body fluid structure interaction problems is also demonstrated by allowing grid-overlapping.

preprint, 2022, arXiv

Bipartite Graph Matching Algorithms for Clean-Clean Entity Resolution: An Empirical Evaluation

Entity Resolution (ER) is the task of finding records that refer to the same real-world entities. A common scenario is when entities across two clean sources need to be resolved, which we refer to as Clean-Clean ER. In this paper, we perform an extensive empirical evaluation of 8 bipartite graph matching algorithms that take in as input a bipartite similarity graph and provide as output a set of matched entities. We consider a wide range of matching algorithms, including algorithms that have not previously been applied to ER, or have been evaluated only in other ER settings. We assess the relative performance of the algorithms with respect to accuracy and time efficiency over 10 established, real datasets, from which we extract >700 different similarity graphs. Our results provide insights into the relative performance of these algorithms and guidelines for choosing the best one, depending on the data at hand.
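To make the input and output described in this abstract concrete, here is a minimal sketch of one simple greedy matcher over a bipartite similarity graph. It is an illustration only and is not claimed to be any of the eight algorithms evaluated in the paper; the function name `greedy_bipartite_matching`, the `threshold` parameter, and the toy edge list are all assumptions made for this example.

```python
from typing import List, Tuple

Edge = Tuple[str, str, float]  # (left entity, right entity, similarity)


def greedy_bipartite_matching(edges: List[Edge], threshold: float = 0.5) -> List[Tuple[str, str]]:
    """Greedy one-to-one matching (hypothetical helper, not from the paper).

    Edges are visited in descending similarity order; a pair is accepted
    only if neither endpoint has been matched yet. This reflects the
    Clean-Clean assumption that each entity has at most one match in the
    other source.
    """
    matched_left, matched_right = set(), set()
    matches = []
    for left, right, sim in sorted(edges, key=lambda e: e[2], reverse=True):
        if sim < threshold:
            break  # all remaining edges are weaker than the threshold
        if left in matched_left or right in matched_right:
            continue  # one endpoint is already taken
        matched_left.add(left)
        matched_right.add(right)
        matches.append((left, right))
    return matches


# Toy similarity graph between source A (a*) and source B (b*).
edges = [
    ("a1", "b1", 0.9),
    ("a1", "b2", 0.8),
    ("a2", "b2", 0.7),
    ("a3", "b3", 0.3),
]
matches = greedy_bipartite_matching(edges, threshold=0.5)
print(matches)
```

A greedy pass like this is cheap (dominated by the sort) but may miss the globally optimal assignment, which is the kind of accuracy versus time trade-off such an evaluation compares.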

preprint, 2022, arXiv

Generalized Supervised Meta-blocking (technical report)

Entity Resolution constitutes a core data integration task that relies on Blocking in order to tame its quadratic time complexity. Schema-agnostic blocking achieves very high recall, requires no domain knowledge and applies to data of any structuredness and schema heterogeneity. This comes at the cost of many irrelevant candidate pairs (i.e., comparisons), which can be significantly reduced through Meta-blocking techniques, i.e., techniques that leverage the co-occurrence patterns of entities inside the blocks: first, a weighting scheme assigns a score to every pair of candidate entities in proportion to the likelihood that they are matching and then, a pruning algorithm discards the pairs with the lowest scores. Supervised Meta-blocking goes beyond this approach by combining multiple scores per comparison into a feature vector that is fed to a binary classifier. By using probabilistic classifiers, Generalized Supervised Meta-blocking associates every pair of candidates with a score that can be used by any pruning algorithm. For higher effectiveness, new weighting schemes are examined as features. Through an extensive experimental analysis, we identify the best pruning algorithms, their optimal sets of features as well as the minimum possible size of the training set. The resulting approaches achieve excellent performance across several established benchmark datasets.
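The two-step unsupervised pipeline described above (weight every candidate pair, then prune the lowest-scoring ones) can be sketched as follows. This is a simplified illustration under stated assumptions: the weighting scheme counts the blocks two entities share, and the pruning step keeps pairs whose weight is at least the mean weight. The function names and the toy blocks are hypothetical, and the sketch does not reproduce the supervised, classifier-based approach the report proposes.

```python
from collections import defaultdict
from itertools import combinations
from typing import Dict, List, Tuple

Pair = Tuple[str, str]


def common_blocks_weights(blocks: Dict[str, List[str]]) -> Dict[Pair, int]:
    """Weight each candidate pair by the number of blocks it co-occurs in."""
    weights: Dict[Pair, int] = defaultdict(int)
    for entities in blocks.values():
        # sorted() gives every pair a canonical (smaller id, larger id) order
        for a, b in combinations(sorted(set(entities)), 2):
            weights[(a, b)] += 1
    return dict(weights)


def prune_below_average(weights: Dict[Pair, int]) -> Dict[Pair, int]:
    """Discard pairs whose weight falls below the mean weight of all pairs."""
    if not weights:
        return {}
    mean = sum(weights.values()) / len(weights)
    return {pair: w for pair, w in weights.items() if w >= mean}


# Toy schema-agnostic blocks: one block per token, listing the entities
# whose descriptions contain that token (assumed data for illustration).
blocks = {
    "token_a": ["e1", "e2", "e3"],
    "token_b": ["e1", "e2"],
    "token_c": ["e1", "e2", "e4"],
    "token_d": ["e3", "e4"],
}
weights = common_blocks_weights(blocks)
retained = prune_below_average(weights)
print(retained)
```

In this toy input, six candidate pairs are generated but only the pair sharing three blocks survives pruning, which illustrates how co-occurrence patterns reduce the number of comparisons.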

preprint, 2022, arXiv

Investigation of a submerged fully passive energy-extracting flapping foil operating in sheared inflow

In this work a fully passive energy harvesting foil is studied computationally. An in-house 2nd order finite volume CFD solver, MaPFlow, is strongly coupled with a rigid body dynamics solver to investigate the foil operation under uniform and sheared inflow conditions with/without free surface. The mesh follows the airfoil motion using a radial basis function (RBF) mesh deformation approach. Initially, MaPFlow predictions are compared to experimental and numerical results available in the literature, where reasonable agreement is found. Next, one-phase simulations are considered for linearly sheared inflow for various shear rates. Results suggested that foil performance can be enhanced under sheared inflow conditions. Finally, two-phase simulations taking into account the free surface, for both uniform and sheared inflow, are considered. Predictions indicate a significant deterioration in performance of the system when the foil operates under the free surface due to the interaction of the shed vorticity with the free surface.

preprint, 2022, arXiv

On the structure of Vorticity and Turbulence Fields in a separated flow around a finite wing; analysis using Direct Numerical Simulation

We investigate the spatial distributions and production mechanisms of vorticity and turbulent kinetic energy around a finite NACA 0018 wing with square wingtip profile at $Re_c=10^4$ and $10^{\circ}$ angle of attack with the aid of Direct Numerical Simulation (DNS). The analysis focuses on the highly inhomogeneous region around the tip and the near wake; this region is highly convoluted, strongly three-dimensional, and far from being self-similar. The flow separates close to the leading edge, creating a large, open recirculation zone around the central part of the wing. In the proximity of the tip, the flow remains attached but another smaller recirculation zone forms closer to the trailing edge; this zone strongly affects the development of the main wing tip vortex. The early formation mechanisms of three vortices close to the leading edge are elucidated and discussed. More specifically, we analyse the role of vortex stretching/compression and tilting, and how it affects the strength of each vortex as it approaches the trailing edge. We find that the three-dimensional flow separation at the sharp tip close to the leading edge plays an important role in the subsequent vortical flow development on the suction side. The production of turbulent kinetic energy and Reynolds stresses is also investigated and discussed in conjunction with the identified vortex patterns. The detailed analysis of the mechanisms that sustain vorticity and turbulent kinetic energy improves our understanding of these highly three-dimensional, non-equilibrium flows and can lead to better actuation methods to manipulate these flows.

preprint, 2022, arXiv

Reconstruction of irregular flow dynamics around two square cylinders from sparse measurements using a data-driven algorithm

We propose a data-driven algorithm for reconstructing the irregular, chaotic flow dynamics around two side-by-side square cylinders from sparse, time-resolved velocity measurements in the wake. We use Proper Orthogonal Decomposition (POD) to reduce the dimensionality of the problem and then explore two different reconstruction approaches. In the first approach, we use the subspace system identification algorithm n4sid to extract a linear dynamical model directly from the data (including the modelling and measurement error covariance matrices) and then employ Kalman filter theory to synthesize a linearly optimal estimator. In the second approach, the estimator matrices are directly identified using n4sid. A systematic study reveals that the first strategy outperforms the second in terms of reconstruction accuracy, robustness and computational efficiency. We also consider the problem of sensor placement. A greedy approach based on the QR pivoting algorithm is compared against sensors placed at the POD mode peaks; we show that the former approach is more accurate in recovering the flow characteristics away from the cylinders. We demonstrate that a linear dynamic model with a sufficiently large number of states and relatively few measurements can accurately recover complex flow features, such as the interaction of the irregular flapping motion of the jet emanating from the gap with the vortices shed from the cylinders, as well as the convoluted patterns downstream arising from the amalgamation of the individual wakes. The proposed methodology is entirely data-driven, does not have tunable parameters, and the resulting matrices are unique (to within a linear coordinate transformation of the state vector). The method can be applied directly to either experimental or computational data.

preprint, 2022, arXiv

Three-dimensional Geospatial Interlinking with JedAI-spatial

Geospatial data constitutes a considerable part of (Semantic) Web data, but so far, its sources are inadequately interlinked in the Linked Open Data cloud. Geospatial Interlinking aims to cover this gap by associating geometries with topological relations like those of the Dimensionally Extended 9-Intersection Model. Due to its quadratic time complexity, various algorithms aim to carry out Geospatial Interlinking efficiently. We present JedAI-spatial, a novel, open-source system that organizes these algorithms according to three dimensions: (i) Space Tiling, which determines the approach that reduces the search space, (ii) Budget-awareness, which distinguishes interlinking algorithms into batch and progressive ones, and (iii) Execution mode, which discerns between serial algorithms, running on a single CPU-core, and parallel ones, running on top of Apache Spark. We analytically describe JedAI-spatial's architecture and capabilities and perform thorough experiments to provide interesting insights about the relative performance of its algorithms.

preprint, 2020, arXiv

A Survey of Blocking and Filtering Techniques for Entity Resolution

Efficiency techniques have been an integral part of Entity Resolution since its infancy. In this survey, we organized the bulk of works in the field into Blocking, Filtering and hybrid techniques, facilitating their understanding and use. We also provided in-depth coverage of each category, further classifying the corresponding works into novel sub-categories. Lately, the efficiency techniques have received more attention, due to the rise of Big Data. This includes large volumes of semi-structured data, which pose challenges not only to the scalability of efficiency techniques, but also to their core assumptions: the requirement of Blocking for schema knowledge and of Filtering for high similarity thresholds. The former led to the introduction of schema-agnostic Blocking in conjunction with Block Processing techniques, while the latter led to more relaxed criteria of similarity. Our survey covers these new fields in detail, putting in context all relevant works.

preprint, 2020, arXiv

End-to-End Entity Resolution for Big Data: A Survey

One of the most important tasks for improving data quality and the reliability of data analytics results is Entity Resolution (ER). ER aims to identify different descriptions that refer to the same real-world entity, and remains a challenging problem. While previous works have studied specific aspects of ER (and mostly in traditional settings), in this survey, we provide for the first time an end-to-end view of modern ER workflows, and of the novel aspects of entity indexing and matching methods in order to cope with more than one of the Big Data characteristics simultaneously. We present the basic concepts, processing steps and execution strategies that have been proposed by different communities, i.e., database, semantic Web and machine learning, in order to cope with the loose structuredness, extreme diversity, high speed and large scale of entity descriptions used by real-world applications. Finally, we provide a synthetic discussion of the existing approaches, and conclude with a detailed presentation of open research directions.

preprint, 2020, arXiv

OBDA for the Web: Creating Virtual RDF Graphs On Top of Web Data Sources

Due to Variety, Web data come in many different structures and formats, with HTML tables and REST APIs (e.g., social media APIs) being among the most popular ones. A big subset of Web data is also characterised by Velocity, as data gets frequently updated so that consumers can obtain the most up-to-date version of the respective datasets. At the moment, though, these data sources are not effectively supported by Semantic Web tools. To address variety and velocity, we propose Ontop4theWeb, a system that maps Web data of various formats into virtual RDF triples, thus allowing for querying them on-the-fly without materializing them as RDF. We demonstrate how Ontop4theWeb can use SPARQL to uniformly query popular, but heterogeneous Web data sources, like HTML tables and Web APIs. We showcase our approach in a number of use cases, such as Twitter, Foursquare, Yelp and HTML tables. We carried out a thorough experimental evaluation which verifies the high efficiency of our framework, which goes beyond the current state-of-the-art in this area, in terms of both functionality and performance.