Source author record

Claudio Ferrari

Claudio Ferrari appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Computer Vision Machine Learning Artificial Intelligence astro-ph.IM eess.IV Graphics math.OC nlin.CD physics.flu-dyn

Catalog footprint

What is connected

8works

9topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

Beyond Fixed Topologies: Unregistered Training and Comprehensive Evaluation Metrics for 3D Talking Heads

Generating speech-driven 3D talking heads presents numerous challenges; among those is dealing with varying mesh topologies where no point-wise correspondence exists across the meshes the model can animate. While previous literature works assume fixed mesh structures, in this work we present the first framework capable of animating 3D faces in arbitrary topologies, including real scanned data. Our approach leverages heat diffusion to predict features that are robust to the mesh topology. We explore two training settings: a registered one, in which meshes in a training sequences share a fixed topology but any mesh can be animated at test time, and an fully unregistered one, which allows effective training with varying mesh structures. Additionally, we highlight the limitations of current evaluation metrics and propose new metrics for better lip-syncing evaluation. An extensive evaluation shows our approach performs favorably compared to fixed topology techniques, setting a new benchmark by offering a versatile and high-fidelity solution for 3D talking heads where the topology constraint is dropped. The code along with the pre-trained model are available.

preprint2022arXiv

Instance-wise algorithm configuration with graph neural networks

We present our submission for the configuration task of the Machine Learning for Combinatorial Optimization (ML4CO) NeurIPS 2021 competition. The configuration task is to predict a good configuration of the open-source solver SCIP to solve a mixed integer linear program (MILP) efficiently. We pose this task as a supervised learning problem: First, we compile a large dataset of the solver performance for various configurations and all provided MILP instances. Second, we use this data to train a graph neural network that learns to predict a good configuration for a specific instance. The submission was tested on the three problem benchmarks of the competition and improved solver performance over the default by 12% and 35% and 8% across the hidden test instances. We ranked 3rd out of 15 on the global leaderboard and won the student leaderboard. We make our code publicly available at \url{https://github.com/RomeoV/ml4co-competition} .

preprint2022arXiv

Sparse to Dense Dynamic 3D Facial Expression Generation

In this paper, we propose a solution to the task of generating dynamic 3D facial expressions from a neutral 3D face and an expression label. This involves solving two sub-problems: (i)modeling the temporal dynamics of expressions, and (ii) deforming the neutral mesh to obtain the expressive counterpart. We represent the temporal evolution of expressions using the motion of a sparse set of 3D landmarks that we learn to generate by training a manifold-valued GAN (Motion3DGAN). To better encode the expression-induced deformation and disentangle it from the identity information, the generated motion is represented as per-frame displacement from a neutral configuration. To generate the expressive meshes, we train a Sparse2Dense mesh Decoder (S2D-Dec) that maps the landmark displacements to a dense, per-vertex displacement. This allows us to learn how the motion of a sparse set of landmarks influences the deformation of the overall face surface, independently from the identity. Experimental results on the CoMA and D3DFACS datasets show that our solution brings significant improvements with respect to previous solutions in terms of both dynamic expression generation and mesh reconstruction, while retaining good generalization to unseen data. The code and the pretrained model will be made publicly available.

preprint2022arXiv

What makes you, you? Analyzing Recognition by Swapping Face Parts

Deep learning advanced face recognition to an unprecedented accuracy. However, understanding how local parts of the face affect the overall recognition performance is still mostly unclear. Among others, face swap has been experimented to this end, but just for the entire face. In this paper, we propose to swap facial parts as a way to disentangle the recognition relevance of different face parts, like eyes, nose and mouth. In our method, swapping parts from a source face to a target one is performed by fitting a 3D prior, which establishes dense pixels correspondence between parts, while also handling pose differences. Seamless cloning is then used to obtain smooth transitions between the mapped source regions and the shape and skin tone of the target face. We devised an experimental protocol that allowed us to draw some preliminary conclusions when the swapped images are classified by deep networks, indicating a prominence of the eyes and eyebrows region. Code available at https://github.com/clferrari/FacePartsSwap

preprint2020arXiv

Inner Eye Canthus Localization for Human Body Temperature Screening

In this paper, we propose an automatic approach for localizing the inner eye canthus in thermal face images. We first coarsely detect 5 facial keypoints corresponding to the center of the eyes, the nosetip and the ears. Then we compute a sparse 2D-3D points correspondence using a 3D Morphable Face Model (3DMM). This correspondence is used to project the entire 3D face onto the image, and subsequently locate the inner eye canthus. Detecting this location allows to obtain the most precise body temperature measurement for a person using a thermal camera. We evaluated the approach on a thermal face dataset provided with manually annotated landmarks. However, such manual annotations are normally conceived to identify facial parts such as eyes, nose and mouth, and are not specifically tailored for localizing the eye canthus region. As additional contribution, we enrich the original dataset by using the annotated landmarks to deform and project the 3DMM onto the images. Then, by manually selecting a small region corresponding to the eye canthus, we enrich the dataset with additional annotations. By using the manual landmarks, we ensure the correctness of the 3DMM projection, which can be used as ground-truth for future evaluations. Moreover, we supply the dataset with the 3D head poses and per-point visibility masks for detecting self-occlusions. The data will be publicly released.

preprint2020arXiv

Semantic Segmentation of Histopathological Slides for the Classification of Cutaneous Lymphoma and Eczema

Mycosis fungoides (MF) is a rare, potentially life threatening skin disease, which in early stages clinically and histologically strongly resembles Eczema, a very common and benign skin condition. In order to increase the survival rate, one needs to provide the appropriate treatment early on. To this end, one crucial step for specialists is the evaluation of histopathological slides (glass slides), or Whole Slide Images (WSI), of the patients' skin tissue. We introduce a deep learning aided diagnostics tool that brings a two-fold value to the decision process of pathologists. First, our algorithm accurately segments WSI into regions that are relevant for an accurate diagnosis, achieving a Mean-IoU of 69% and a Matthews Correlation score of 83% on a novel dataset. Additionally, we also show that our model is competitive with the state of the art on a reference dataset. Second, using the segmentation map and the original image, we are able to predict if a patient has MF or Eczema. We created two models that can be applied in different stages of the diagnostic pipeline, potentially eliminating life-threatening mistakes. The classification outcome is considerably more interpretable than using only the WSI as the input, since it is also based on the segmentation map. Our segmentation model, which we call EU-Net, extends a classical U-Net with an EfficientNet-B7 encoder which was pre-trained on the Imagenet dataset.

preprint2016arXiv

BEaTriX, expanded X-ray beam facility for testing modular elements of telescope optics: an update

We present in this paper an update on the design of BEaTriX (Beam Expander Testing X-ray facility), an X-ray apparatus to be realized at INAF/OAB and that will generate an expanded, uniform and parallel beam of soft X-rays. BEaTriX will be used to perform the functional tests of X-ray focusing modules of large X-ray optics such as those for the ATHENA X-ray observatory, using the Silicon Pore Optics (SPO) as a baseline technology, and Slumped Glass Optics (SGO) as a possible alternative. Performing the tests in X-rays provides the advantage of an in-situ, at-wavelength quality control of the optical modules produced in series by the industry, performing a selection of the modules with the best angular resolution, and, in the case of SPOs, there is also the interesting possibility to align the parabolic and the hyperbolic stacks directly under X-rays, to minimize the aberrations. However, a parallel beam with divergence below 2 arcsec is necessary in order to measure mirror elements that are expected to reach an angular resolution of about 4 arcsec, since the ATHENA requirement for the entire telescope is 5 arcsec. Such a low divergence over the typical aperture of modular optics would require an X-ray source to be located in a several kilometers long vacuum tube. In contrast, BEaTriX will be compact enough (5 m x 14 m) to be housed in a small laboratory, will produce an expanded X-ray beam 60 mm x 200 mm broad, characterized by a very low divergence (1.5 arcsec HEW), strong polarization, high uniformity, and X-ray energy selectable between 1.5 keV and 4.5 keV. In this work we describe the BEaTriX layout and show a performance simulation for the X-ray energy of 4.5 keV.

preprint2012arXiv

Analytical modeling for the heat transfer in sheared flows of nanofluids

We developed a model for the enhancement of the heat flux by spherical and elongated nano- particles in sheared laminar flows of nano-fluids. Besides the heat flux carried by the nanoparticles the model accounts for the contribution of their rotation to the heat flux inside and outside the particles. The rotation of the nanoparticles has a twofold effect, it induces a fluid advection around the particle and it strongly influences the statistical distribution of particle orientations. These dynamical effects, which were not included in existing thermal models, are responsible for changing the thermal properties of flowing fluids as compared to quiescent fluids. The proposed model is strongly supported by extensive numerical simulations, demonstrating a potential increase of the heat flux far beyond the Maxwell-Garnet limit for the spherical nanoparticles. The road ahead which should lead towards robust predictive models of heat flux enhancement is discussed.

Claudio Ferrari

What is connected

Connect this record

See the researcher in context

Building this map preview

8 published item(s)

Beyond Fixed Topologies: Unregistered Training and Comprehensive Evaluation Metrics for 3D Talking Heads

Instance-wise algorithm configuration with graph neural networks

Sparse to Dense Dynamic 3D Facial Expression Generation

What makes you, you? Analyzing Recognition by Swapping Face Parts

Inner Eye Canthus Localization for Human Body Temperature Screening

Semantic Segmentation of Histopathological Slides for the Classification of Cutaneous Lymphoma and Eczema

BEaTriX, expanded X-ray beam facility for testing modular elements of telescope optics: an update

Analytical modeling for the heat transfer in sheared flows of nanofluids