Researcher profile

Miguel Couceiro

Miguel Couceiro contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
28works
0followers
11topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

28 published item(s)

preprint2026arXiv

Shapley Regression for Rare Disease Diagnosis Support: a case study on APDS

Activated PI3K8 Syndrome (APDS) is a rare genetic immune disorder caused by variants in PIK3CD or PIK3R1, with highly heterogeneous symptoms that often delay diagnosis. Early recognition is hampered by overlapping clinical presentations and limited clinician awareness, motivating systematic, data-driven approaches to detect APDS-associated phenotypic patterns in routine electronic health records. Traditional linear scoring systems cannot capture complex symptom interactions, while deep learning models, though expressive, often lack interpretability. To bridge this gap, we propose Shapley regression, a novel game-theoretic model replacing the linear predictor with a k-additive cooperative game, explicitly modeling co-occurrence of symptoms while maintaining the transparency and convexity of logistic regression. We carry out an empirical study of our lightweight method on eight public biomedical datasets, showing that a 2-additive model with $l_{2}$ regularization achieves an optimal trade-off between predictive power and noise robustness. We also apply it to a real-world cohort of 222 patients, on which Shapley regression accurately distinguished APDS cases from matched controls, confirming and validating phenotypes known to be associated with APDS, and facilitating the exploration of pairwise interactions between symptoms, validated by clinical experts.

preprint2026arXiv

Which Are the Low-Resource Languages of the Semantic Web?

Emerging digital technologies are exacerbating the existing divide in Open Access Data (OAD) between high-and low-resource languages, excluding many communities from the global digital transformation. Multilingual Linked Open Data Knowledge Graphs (LOD KGs) could contribute to mitigating this divide through cross-lingual transfer; however, no clear quantitative definition of low-resource languages has yet been established in the context of LOD KGs. In this poster, we present a methodology to analyze the distribution of languages across LOD KGs and propose a preliminary multi-level categorization based on DBpedia, BabelNet, and Wikidata. This categorization is leveraged to bring a formal definition of low-, high-, and medium-resource languages that could be later leveraged to select cross-lingual transfer candidates.

preprint2022arXiv

Component twin-width as a parameter for BINARY-CSP and its semiring generalisations

We investigate the fine-grained and the parameterized complexity of several generalizations of binary constraint satisfaction problems (BINARY-CSPs), that subsume variants of graph colouring problems. Our starting point is the observation that several algorithmic approaches that resulted in complexity upper bounds for these problems, share a common structure. We thus explore an algebraic approach relying on semirings that unifies different generalizations of BINARY-CSPs (such as the counting, the list, and the weighted versions), and that facilitates a general algorithmic approach to efficiently solving them. The latter is inspired by the (component) twin-width parameter introduced by Bonnet et al., which we generalize via edge-labelled graphs in order to formulate it to arbitrary binary constraints. We consider input instances with bounded component twin-width, as well as constraint templates of bounded component twin-width, and obtain an FPT algorithm as well as an improved, exponential-time algorithm, for broad classes of binary constraints. We illustrate the advantages of this framework by instantiating our general algorithmic approach on several classes of problems (e.g., the $H$-coloring problem and its variants), and showing that it improves the best complexity upper bounds in the literature for several well-known problems.

preprint2022arXiv

Galois theory for analogical classifiers

Analogical proportions are 4-ary relations that read "A is to B as C is to D". Recent works have highlighted the fact that such relations can support a specific form of inference, called analogical inference. This inference mechanism was empirically proved to be efficient in several reasoning and classification tasks. In the latter case, it relies on the notion of analogy preservation. In this paper, we explore this relation between formal models of analogy and the corresponding classes of analogy preserving functions, and we establish a Galois theory of analogical classifiers. We illustrate the usefulness of this Galois framework over Boolean domains, and we explicitly determine the closed sets of analogical classifiers, i.e., classifiers that are compatible with the analogical inference, for each pair of Boolean analogies.

preprint2022arXiv

Reducibility of $n$-ary semigroups: from quasitriviality towards idempotency

Let $X$ be a nonempty set. Denote by $\mathcal{F}^n_k$ the class of associative operations $F\colon X^n\to X$ satisfying the condition $F(x_1,\ldots,x_n)\in\{x_1,\ldots,x_n\}$ whenever at least $k$ of the elements $x_1,\ldots,x_n$ are equal to each other. The elements of $\mathcal{F}^n_1$ are said to be quasitrivial and those of $\mathcal{F}^n_n$ are said to be idempotent. We show that $\mathcal{F}^n_1=\cdots =\mathcal{F}^n_{n-2}\subseteq\mathcal{F}^n_{n-1}\subseteq\mathcal{F}^n_n$ and we give conditions on the set $X$ for the last inclusions to be strict. The class $\mathcal{F}^n_1$ was recently characterized by Couceiro and Devillet, who showed that its elements are reducible to binary associative operations. However, some elements of $\mathcal{F}^n_n$ are not reducible. In this paper, we characterize the class $\mathcal{F}^n_{n-1}\setminus\mathcal{F}^n_1$ and show that its elements are reducible. We give a full description of the corresponding reductions and show how each of them is built from a quasitrivial semigroup and an Abelian group whose exponent divides $n-1$.

preprint2021arXiv

A Bayesian Neural Network based on Dropout Regulation

Bayesian Neural Networks (BNN) have recently emerged in the Deep Learning world for dealing with uncertainty estimation in classification tasks, and are used in many application domains such as astrophysics, autonomous driving...BNN assume a prior over the weights of a neural network instead of point estimates, enabling in this way the estimation of both aleatoric and epistemic uncertainty of the model prediction.Moreover, a particular type of BNN, namely MC Dropout, assumes a Bernoulli distribution on the weights by using Dropout.Several attempts to optimize the dropout rate exist, e.g. using a variational approach.In this paper, we present a new method called "Dropout Regulation" (DR), which consists of automatically adjusting the dropout rate during training using a controller as used in automation.DR allows for a precise estimation of the uncertainty which is comparable to the state-of-the-art while remaining simple to implement.

preprint2020arXiv

LimeOut: An Ensemble Approach To Improve Process Fairness

Artificial Intelligence and Machine Learning are becoming increasingly present in several aspects of human life, especially, those dealing with decision making. Many of these algorithmic decisions are taken without human supervision and through decision making processes that are not transparent. This raises concerns regarding the potential bias of these processes towards certain groups of society, which may entail unfair results and, possibly, violations of human rights. Dealing with such biased models is one of the major concerns to maintain the public trust. In this paper, we address the question of process or procedural fairness. More precisely, we consider the problem of making classifiers fairer by reducing their dependence on sensitive features while increasing (or, at least, maintaining) their accuracy. To achieve both, we draw inspiration from "dropout" techniques in neural based approaches, and propose a framework that relies on "feature drop-out" to tackle process fairness. We make use of "LIME Explanations" to assess a classifier's fairness and to determine the sensitive features to remove. This produces a pool of classifiers (through feature dropout) whose ensemble is shown empirically to be less dependent on sensitive features, and with improved or no impact on accuracy.

preprint2020arXiv

Tackling scalability issues in mining path patterns from knowledge graphs: a preliminary study

Features mined from knowledge graphs are widely used within multiple knowledge discovery tasks such as classification or fact-checking. Here, we consider a given set of vertices, called seed vertices, and focus on mining their associated neighboring vertices, paths, and, more generally, path patterns that involve classes of ontologies linked with knowledge graphs. Due to the combinatorial nature and the increasing size of real-world knowledge graphs, the task of mining these patterns immediately entails scalability issues. In this paper, we address these issues by proposing a pattern mining approach that relies on a set of constraints (e.g., support or degree thresholds) and the monotonicity property. As our motivation comes from the mining of real-world knowledge graphs, we illustrate our approach with PGxLOD, a biomedical knowledge graph.

preprint2015arXiv

Hereditarily rigid relations

An $h$-ary relation $\r$ on a finite set $A$ is said to be \emph{hereditarily rigid} if the unary partial functions on $A$ that preserve $\r$ are the subfunctions of the identity map or of constant maps. A family of relations ${\mathcal F}$ is said to be \emph{hereditarily strongly rigid} if the partial functions on $A$ that preserve every $\r \in {\mathcal F}$ are the subfunctions of projections or constant functions. In this paper we show that hereditarily rigid relations exist and we give a lower bound on their arities. We also prove that no finite hereditarily strongly rigid families of relations exist and we also construct an infinite hereditarily strongly rigid family of relations.

preprint2013arXiv

Discrete integrals based on comonotonic modularity

It is known that several discrete integrals, including the Choquet and Sugeno integrals as well as some of their generalizations, are comonotonically modular functions. Based on a recent description of the class of comonotonically modular functions, we axiomatically identify more general families of discrete integrals that are comonotonically modular, including signed Choquet integrals and symmetric signed Choquet integrals as well as natural extensions of Sugeno integrals.

preprint2013arXiv

On the arity gap of polynomial functions

The authors' previous results on the arity gap of functions of several variables are refined by considering polynomial functions over arbitrary fields. We explicitly describe the polynomial functions with arity gap at least 3, as well as the polynomial functions with arity gap equal to 2 for fields of characteristic 0 or 2. These descriptions are given in the form of decomposition schemes of polynomial functions. Similar descriptions are given for arbitrary finite fields. However, we show that these descriptions do not extend to infinite fields of odd characteristic.

preprint2012arXiv

Locally monotone Boolean and pseudo-Boolean functions

We propose local versions of monotonicity for Boolean and pseudo-Boolean functions: say that a pseudo-Boolean (Boolean) function is p-locally monotone if none of its partial derivatives changes in sign on tuples which differ in less than p positions. As it turns out, this parameterized notion provides a hierarchy of monotonicities for pseudo-Boolean (Boolean) functions. Local monotonicities are shown to be tightly related to lattice counterparts of classical partial derivatives via the notion of permutable derivatives. More precisely, p-locally monotone functions are shown to have p-permutable lattice derivatives and, in the case of symmetric functions, these two notions coincide. We provide further results relating these two notions, and present a classification of p-locally monotone functions, as well as of functions having p-permutable derivatives, in terms of certain forbidden "sections", i.e., functions which can be obtained by substituting constants for variables. This description is made explicit in the special case when p=2.

preprint2011arXiv

A generalization of Goodstein's theorem: interpolation by polynomial functions of distributive lattices

We consider the problem of interpolating functions partially defined over a distributive lattice, by means of lattice polynomial functions. Goodstein&#39;s theorem solves a particular instance of this interpolation problem on a distributive lattice L with least and greatest elements 0 and 1, resp.: Given an n-ary partial function f over L, defined on all 0-1 tuples, f can be extended to a lattice polynomial function p over L if and only if f is monotone; in this case, the interpolating polynomial p is unique. We extend Goodstein&#39;s theorem to a wider class of n-ary partial functions f over a distributive lattice L, not necessarily bounded, where the domain of f is a cuboid of the form D={a1,b1}x...x{an,bn} with ai<bi, and determine the class of such partial functions which can be interpolated by lattice polynomial functions. In this wider setting, interpolating polynomials are not necessarily unique; we provide explicit descriptions of all possible lattice polynomial functions which interpolate these partial functions, when such an interpolation is available.

preprint2011arXiv

Axiomatizations and factorizations of Sugeno utility functions

In this paper we consider a multicriteria aggregation model where local utility functions of different sorts are aggregated using Sugeno integrals, and which we refer to as Sugeno utility functions. We propose a general approach to study such functions via the notion of pseudo-Sugeno integral (or, equivalently, pseudo-polynomial function), which naturally generalizes that of Sugeno integral, and provide several axiomatizations for this class of functions. Moreover, we address and solve the problem of factorizing a Sugeno utility function as a composition of a Sugeno integral with local utility functions, if such a factorization exists.

preprint2011arXiv

Axiomatizations of Lovász extensions of pseudo-Boolean functions

Three important properties in aggregation theory are investigated, namely horizontal min-additivity, horizontal max-additivity, and comonotonic additivity, which are defined by certain relaxations of the Cauchy functional equation in several variables. We show that these properties are equivalent and we completely describe the functions characterized by them. By adding some regularity conditions, these functions coincide with the Lovász extensions vanishing at the origin, which subsume the discrete Choquet integrals. We also propose a simultaneous generalization of horizontal min-additivity and horizontal max-additivity, called horizontal median-additivity, and we describe the corresponding function class. Additional conditions then reduce this class to that of symmetric Lovász extensions, which includes the discrete symmetric Choquet integrals.

preprint2011arXiv

Axiomatizations of quasi-Lovász extensions of pseudo-Boolean functions

We introduce the concept of quasi-Lovász extension as being a mapping $f\colon I^n\to\R$ defined on a nonempty real interval $I$ containing the origin and which can be factorized as $f(x_1,...,x_n)=L(ϕ(x_1),...,ϕ(x_n))$, where $L$ is the Lovász extension of a pseudo-Boolean function $ψ\colon\{0,1\}^n\to\R$ (i.e., the function $L\colon\R^n\to\R$ whose restriction to each simplex of the standard triangulation of $[0,1]^n$ is the unique affine function which agrees with $ψ$ at the vertices of this simplex) and $ϕ\colon I\to\R$ is a nondecreasing function vanishing at the origin. These functions appear naturally within the scope of decision making under uncertainty since they subsume overall preference functionals associated with discrete Choquet integrals whose variables are transformed by a given utility function. To axiomatize the class of quasi-Lovász extensions, we propose generalizations of properties used to characterize the Lovász extensions, including a comonotonic version of modularity and a natural relaxation of homogeneity. A variant of the latter property enables us to axiomatize also the class of symmetric quasi-Lovász extensions, which are compositions of symmetric Lovász extensions with 1-place nondecreasing odd functions.

preprint2011arXiv

On the lattice of equational classes of Boolean functions and its closed intervals

Let A be a finite set with at least two elements. The composition of two classes I and J of operations on A, is defined as the set of all compositions of functions in I with functions in J. This binary operation gives a monoid structure to the set E_A of all equational classes of operations on A. The set E_A of equational classes of operations on A also constitutes a complete distributive lattice under intersection and union. Clones of operations, i.e. classes containing all projections and idempotent under class composition, also form a lattice which is strictly contained in E_A. In the Boolean case |A|=2, the lattice E_A contains uncountably many equational classes, but only countably many of them are clones. The aim of this paper is to provide a better understanding of this uncountable lattice of equational classes of Boolean functions, by analyzing its &#34;closed&#34; intervals&#34; [C_1,C_2], for clones C_1 and C_2. For |A|=2, we give a complete classification of all such closed intervals in terms of their size, and provide a simple, necessary and sufficient condition characterizing the uncountable closed intervals of E_A.

preprint2011arXiv

On the poset of computation rules for nonassociative calculus

The symmetric maximum, denoted by v, is an extension of the usual max operation so that 0 is the neutral element, and -x is the symmetric (or inverse) of x, i.e., x v(-x)=0. However, such an extension does not preserve the associativity of max. This fact asks for systematic ways of parenthesing (or bracketing) terms of a sequence (with more than two arguments) when using such an extended maximum. We refer to such systematic (predefined) ways of parenthesing as computation rules. As it turns out there are infinitely many computation rules each of which corresponding to a systematic way of bracketing arguments of sequences. Essentially, computation rules reduce to deleting terms of sequences based on the condition x v(-x)=0. This observation gives raise to a quasi-order on the set of such computation rules: say that rule 1 is below rule 2 if for all sequences of numbers, rule 1 deletes more terms in the sequence than rule 2. In this paper we present a study of this quasi-ordering of computation rules. In particular, we show that the induced poset of all equivalence classes of computation rules is uncountably infinite, has infinitely many maximal elements, has infinitely many atoms, and it embeds the powerset of natural numbers ordered by inclusion.

preprint2011arXiv

Pseudo-polynomial functions over finite distributive lattices

In this paper we consider an aggregation model f: X1 x ... x Xn --> Y for arbitrary sets X1, ..., Xn and a finite distributive lattice Y, factorizable as f(x1, ..., xn) = p(u1(x1), ..., un(xn)), where p is an n-variable lattice polynomial function over Y, and each uk is a map from Xk to Y. The resulting functions are referred to as pseudo-polynomial functions. We present an axiomatization for this class of pseudo-polynomial functions which differs from the previous ones both in flavour and nature, and develop general tools which are then used to obtain all possible such factorizations of a given pseudo-polynomial function.

preprint2010arXiv

Associative polynomial functions over bounded distributive lattices

The associativity property, usually defined for binary functions, can be generalized to functions of a given fixed arity n>=1 as well as to functions of multiple arities. In this paper, we investigate these two generalizations in the case of polynomial functions over bounded distributive lattices and present explicit descriptions of the corresponding associative functions. We also show that, in this case, both generalizations of associativity are essentially the same.

preprint2010arXiv

Axiomatizations of signed discrete Choquet integrals

We study the so-called signed discrete Choquet integral (also called non-monotonic discrete Choquet integral) regarded as the Lovász extension of a pseudo-Boolean function which vanishes at the origin. We present axiomatizations of this generalized Choquet integral, given in terms of certain functional equations, as well as by necessary and sufficient conditions which reveal desirable properties in aggregation theory.

preprint2010arXiv

Invariant functionals on completely distributive lattices

In this paper we are interested in functionals defined on completely distributive lattices and which are invariant under mappings preserving {arbitrary} joins and meets. We prove that the class of nondecreasing invariant functionals coincides with the class of Sugeno integrals associated with $\{0,1\}$-valued capacities, the so-called term functionals, thus extending previous results both to the infinitary case as well as to the realm of completely distributive lattices. Furthermore, we show that, in the case of functionals over complete chains, the nondecreasing condition is redundant. Characterizations of the class of Sugeno integrals, as well as its superclass comprising all polynomial functionals, are provided by showing that the axiomatizations (given in terms of homogeneity) of their restriction to finitary functionals still hold over completely distributive lattices. We also present canonical normal form representations of polynomial functionals on completely distributive lattices, which appear as the natural extensions to their finitary counterparts, and as a by-product we obtain an axiomatization of complete distributivity in the case of bounded lattices.

preprint2010arXiv

Quasi-polynomial functions over bounded distributive lattices

In [arXiv 0811.3913] the authors introduced the notion of quasi-polynomial function as being a mapping f: X^n -> X defined and valued on a bounded chain X and which can be factorized as f(x_1,...,x_n)=p(phi(x_1),...,phi(x_n)), where p is a polynomial function (i.e., a combination of variables and constants using the chain operations / and) and phi is an order-preserving map. In the current paper we study this notion in the more general setting where the underlying domain and codomain sets are, possibly different, bounded distributive lattices, and where the inner function is not necessarily order-preserving. These functions appear naturally within the scope of decision making under uncertainty since, as shown in this paper, they subsume overall preference functionals associated with Sugeno integrals whose variables are transformed by a given utility function. To axiomatize the class of quasi-polynomial functions, we propose several generalizations of well-established properties in aggregation theory, as well as show that some of the characterizations given in [arXiv 0811.3913] still hold in this general setting. Moreover, we investigate the so-called transformed polynomial functions (essentially, compositions of unary mappings with polynomial functions) and show that, under certain conditions, they reduce to quasi-polynomial functions.

preprint2009arXiv

Characterizations of discrete Sugeno integrals as polynomial functions over distributive lattices

We give several characterizations of discrete Sugeno integrals over bounded distributive lattices, as particular cases of lattice polynomial functions, that is, functions which can be represented in the language of bounded lattices using variables and constants. We also consider the subclass of term functions as well as the classes of symmetric polynomial functions and weighted minimum and maximum functions, and present their characterizations, accordingly. Moreover, we discuss normal form representations of these functions.

preprint2009arXiv

Representations and characterizations of polynomial functions on chains

We are interested in representations and characterizations of lattice polynomial functions f:L^n -> L, where L is a given bounded distributive lattice. In companion papers [arXiv 0901.4888, arXiv 0808.2619], we investigated certain representations and provided various characterizations of these functions both as solutions of certain functional equations and in terms of necessary and sufficient conditions. In the present paper, we investigate these representations and characterizations in the special case when L is a chain, i.e., a totally ordered lattice. More precisely, we discuss representations of lattice polynomial functions given in terms of standard simplices and we present new axiomatizations of these functions by relaxing some of the conditions given in [arXiv 0901.4888, arXiv 0808.2619] and by considering further conditions, namely comonotonic minitivity and maxitivity.