Researcher profile

Yujiang Wang

Yujiang Wang contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
16works
0followers
6topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

16 published item(s)

preprint2024arXiv

Medical records condensation: a roadmap towards healthcare data democratisation

The prevalence of artificial intelligence (AI) has envisioned an era of healthcare democratisation that promises every stakeholder a new and better way of life. However, the advancement of clinical AI research is significantly hurdled by the dearth of data democratisation in healthcare. To truly democratise data for AI studies, challenges are two-fold: 1. the sensitive information in clinical data should be anonymised appropriately, and 2. AI-oriented clinical knowledge should flow freely across organisations. This paper considers a recent deep-learning advent, dataset condensation (DC), as a stone that kills two birds in democratising healthcare data. The condensed data after DC, which can be viewed as statistical metadata, abstracts original clinical records and irreversibly conceals sensitive information at individual levels; nevertheless, it still preserves adequate knowledge for learning deep neural networks (DNNs). More favourably, the compressed volumes and the accelerated model learnings of condensed data portray a more efficient clinical knowledge sharing and flowing system, as necessitated by data democratisation. We underline DC's prospects for democratising clinical data, specifically electrical healthcare records (EHRs), for AI research through experimental results and analysis across three healthcare datasets of varying data types.

preprint2022arXiv

A real-time and unsupervised face Re-Identification system for Human-Robot Interaction

In the context of Human-Robot Interaction (HRI), face Re-Identification (face Re-ID) aims to verify if certain detected faces have already been observed by robots. The ability of distinguishing between different users is crucial in social robots as it will enable the robot to tailor the interaction strategy toward the users' individual preferences. So far face recognition research has achieved great success, however little attention has been paid to the realistic applications of Face Re-ID in social robots. In this paper, we present an effective and unsupervised face Re-ID system which simultaneously re-identifies multiple faces for HRI. This Re-ID system employs Deep Convolutional Neural Networks to extract features, and an online clustering algorithm to determine the face's ID. Its performance is evaluated on two datasets: the TERESA video dataset collected by the TERESA robot, and the YouTube Face Dataset (YTF Dataset). We demonstrate that the optimised combination of techniques achieves an overall 93.55% accuracy on TERESA dataset and an overall 90.41% accuracy on YTF dataset. We have implemented the proposed method into a software module in the HCI^2 Framework for it to be further integrated into the TERESA robot, and has achieved real-time performance at 10~26 Frames per second.

preprint2022arXiv

Dilated Convolutions with Lateral Inhibitions for Semantic Image Segmentation

Dilated convolutions are widely used in deep semantic segmentation models as they can enlarge the filters' receptive field without adding additional weights nor sacrificing spatial resolution. However, as dilated convolutional filters do not possess positional knowledge about the pixels on semantically meaningful contours, they could lead to ambiguous predictions on object boundaries. In addition, although dilating the filter can expand its receptive field, the total number of sampled pixels remains unchanged, which usually comprises a small fraction of the receptive field's total area. Inspired by the Lateral Inhibition (LI) mechanisms in human visual systems, we propose the dilated convolution with lateral inhibitions (LI-Convs) to overcome these limitations. Introducing LI mechanisms improves the convolutional filter's sensitivity to semantic object boundaries. Moreover, since LI-Convs also implicitly take the pixels from the laterally inhibited zones into consideration, they can also extract features at a denser scale. By integrating LI-Convs into the Deeplabv3+ architecture, we propose the Lateral Inhibited Atrous Spatial Pyramid Pooling (LI-ASPP), the Lateral Inhibited MobileNet-V2 (LI-MNV2) and the Lateral Inhibited ResNet (LI-ResNet). Experimental results on three benchmark datasets (PASCAL VOC 2012, CelebAMask-HQ and ADE20K) show that our LI-based segmentation models outperform the baseline on all of them, thus verify the effectiveness and generality of the proposed LI-Convs.

preprint2022arXiv

Do Smart Glasses Dream of Sentimental Visions? Deep Emotionship Analysis for Eyewear Devices

Emotion recognition in smart eyewear devices is highly valuable but challenging. One key limitation of previous works is that the expression-related information like facial or eye images is considered as the only emotional evidence. However, emotional status is not isolated; it is tightly associated with people's visual perceptions, especially those sentimental ones. However, little work has examined such associations to better illustrate the cause of different emotions. In this paper, we study the emotionship analysis problem in eyewear systems, an ambitious task that requires not only classifying the user's emotions but also semantically understanding the potential cause of such emotions. To this end, we devise EMOShip, a deep-learning-based eyewear system that can automatically detect the wearer's emotional status and simultaneously analyze its associations with semantic-level visual perceptions. Experimental studies with 20 participants demonstrate that, thanks to the emotionship awareness, EMOShip not only achieves superior emotion recognition accuracy over existing methods (80.2% vs. 69.4%), but also provides a valuable understanding of the cause of emotions. Pilot studies with 20 participants further motivate the potential use of EMOShip to empower emotion-aware applications, such as emotionship self-reflection and emotionship life-logging.

preprint2022arXiv

FP-Age: Leveraging Face Parsing Attention for Facial Age Estimation in the Wild

Image-based age estimation aims to predict a person's age from facial images. It is used in a variety of real-world applications. Although end-to-end deep models have achieved impressive results for age estimation on benchmark datasets, their performance in-the-wild still leaves much room for improvement due to the challenges caused by large variations in head pose, facial expressions, and occlusions. To address this issue, we propose a simple yet effective method to explicitly incorporate facial semantics into age estimation, so that the model would learn to correctly focus on the most informative facial components from unaligned facial images regardless of head pose and non-rigid deformation. To this end, we design a face parsing-based network to learn semantic information at different scales and a novel face parsing attention module to leverage these semantic features for age estimation. To evaluate our method on in-the-wild data, we also introduce a new challenging large-scale benchmark called IMDB-Clean. This dataset is created by semi-automatically cleaning the noisy IMDB-WIKI dataset using a constrained clustering method. Through comprehensive experiment on IMDB-Clean and other benchmark datasets, under both intra-dataset and cross-dataset evaluation protocols, we show that our method consistently outperforms all existing age estimation methods and achieves a new state-of-the-art performance. To the best of our knowledge, our work presents the first attempt of leveraging face parsing attention to achieve semantic-aware age estimation, which may be inspiring to other high level facial analysis tasks. Code and data are available on \url{https://github.com/ibug-group/fpage}.

preprint2022arXiv

Intracranial EEG structure-function coupling predicts surgical outcomes in focal epilepsy

Alterations to structural and functional brain networks have been reported across many neurological conditions. However, the relationship between structure and function -- their coupling -- is relatively unexplored, particularly in the context of an intervention. Epilepsy surgery alters the brain structure and networks to control the functional abnormality of seizures. Given that surgery is a structural modification aiming to alter the function, we hypothesized that stronger structure-function coupling preoperatively is associated with a greater chance of post-operative seizure control. We constructed structural and functional brain networks in 39 subjects with medication-resistant focal epilepsy using data from intracranial EEG (pre-surgery), structural MRI (pre-and post-surgery), and diffusion MRI (pre-surgery). We investigated pre-operative structure-function coupling at two spatial scales a) at the global iEEG network level and b) at the resolution of individual iEEG electrode contacts using virtual surgeries. At global network level, seizure-free individuals had stronger structure-function coupling pre-operatively than those that were not seizure-free regardless of the choice of interictal segment or frequency band. At the resolution of individual iEEG contacts, the virtual surgery approach provided complementary information to localize epileptogenic tissues. In predicting seizure outcomes, structure-function coupling measures were more important than clinical attributes, and together they predicted seizure outcomes with an accuracy of 85% and sensitivity of 87%. The underlying assumption that the structural changes induced by surgery translate to the functional level to control seizures is valid when the structure-functional coupling is strong. Mapping the regions that contribute to structure-functional coupling using virtual surgeries may help aid surgical planning.

preprint2022arXiv

Normative brain mapping of interictal intracranial EEG to localise epileptogenic tissue

The identification of abnormal electrographic activity is important in a wide range of neurological disorders, including epilepsy for localising epileptogenic tissue. However, this identification may be challenging during non-seizure (interictal) periods, especially if abnormalities are subtle compared to the repertoire of possible healthy brain dynamics. Here, we investigate if such interictal abnormalities become more salient by quantitatively accounting for the range of healthy brain dynamics in a location-specific manner. To this end, we constructed a normative map of brain dynamics, in terms of relative band power, from interictal intracranial recordings from 234 subjects (21,598 electrode contacts). We then compared interictal recordings from 62 patients with epilepsy to the normative map to identify abnormal regions. We hypothesised that if the most abnormal regions were spared by surgery, then patients would be more likely to experience continued seizures post-operatively. We first confirmed that the spatial variations of band power in the normative map across brain regions were consistent with healthy variations reported in the literature. Second, when accounting for the normative variations, regions which were spared by surgery were more abnormal than those resected only in patients with persistent post-operative seizures (t=-3.6, p=0.0003), confirming our hypothesis. Third, we found that this effect discriminated patient outcomes (AUC=0.75 p=0.0003). Normative mapping is a well-established practice in neuroscientific research. Our study suggests that this approach is feasible to detect interictal abnormalities in intracranial EEG, and of potential clinical value to identify pathological tissue in epilepsy. Finally, we make our normative intracranial map publicly available to facilitate future investigations in epilepsy and beyond.

preprint2022arXiv

Volumetric and structural connectivity abnormalities co-localise in TLE

Patients with temporal lobe epilepsy (TLE) exhibit both volumetric and structural connectivity abnormalities relative to healthy controls. How these abnormalities inter-relate and their mechanisms are unclear. We computed grey matter volumetric changes and white matter structural connectivity abnormalities in 144 patients with unilateral TLE and 96 healthy controls. Regional volumes were calculated using T1-weighted MRI, while structural connectivity was derived using white matter fibre tractography from diffusion-weighted MRI. For each regional volume and each connection strength, we calculated the effect size between patient and control groups in a group-level analysis. We then applied hierarchical regression to investigate the relationship between volumetric and structural connectivity abnormalities in individuals. Additionally, we quantified whether abnormalities co-localised within individual patients by computing Dice similarity scores. In TLE, white matter connectivity abnormalities were greater when joining two grey matter regions with abnormal volumes. Similarly, grey matter volumetric abnormalities were greater when joined by abnormal white matter connections. The extent of volumetric and connectivity abnormalities related to epilepsy duration, but co-localisation did not. Co-localisation was primarily driven by neighbouring abnormalities in the ipsilateral hemisphere. Overall, volumetric and structural connectivity abnormalities were related in TLE. Our results suggest that shared mechanisms may underlie changes in both volume and connectivity alterations in patients with TLE.

preprint2021arXiv

Dynamic Face Video Segmentation via Reinforcement Learning

For real-time semantic video segmentation, most recent works utilised a dynamic framework with a key scheduler to make online key/non-key decisions. Some works used a fixed key scheduling policy, while others proposed adaptive key scheduling methods based on heuristic strategies, both of which may lead to suboptimal global performance. To overcome this limitation, we model the online key decision process in dynamic video segmentation as a deep reinforcement learning problem and learn an efficient and effective scheduling policy from expert information about decision history and from the process of maximising global return. Moreover, we study the application of dynamic video segmentation on face videos, a field that has not been investigated before. By evaluating on the 300VW dataset, we show that the performance of our reinforcement key scheduler outperforms that of various baselines in terms of both effective key selections and running speed. Further results on the Cityscapes dataset demonstrate that our proposed method can also generalise to other scenarios. To the best of our knowledge, this is the first work to use reinforcement learning for online key-frame decision in dynamic video segmentation, and also the first work on its application on face videos.

preprint2021arXiv

Face Mask Extraction in Video Sequence

Inspired by the recent development of deep network-based methods in semantic image segmentation, we introduce an end-to-end trainable model for face mask extraction in video sequence. Comparing to landmark-based sparse face shape representation, our method can produce the segmentation masks of individual facial components, which can better reflect their detailed shape variations. By integrating Convolutional LSTM (ConvLSTM) algorithm with Fully Convolutional Networks (FCN), our new ConvLSTM-FCN model works on a per-sequence basis and takes advantage of the temporal correlation in video clips. In addition, we also propose a novel loss function, called Segmentation Loss, to directly optimise the Intersection over Union (IoU) performances. In practice, to further increase segmentation accuracy, one primary model and two additional models were trained to focus on the face, eyes, and mouth regions, respectively. Our experiment shows the proposed method has achieved a 16.99% relative improvement (from 54.50% to 63.76% mean IoU) over the baseline FCN model on the 300 Videos in the Wild (300VW) dataset.

preprint2020arXiv

Focal to bilateral tonic-clonic seizures are associated with widespread network abnormality in temporal lobe epilepsy

Objective: To identify if whole-brain structural network alterations in patients with temporal lobe epilepsy (TLE) and focal to bilateral tonic-clonic seizures (FBTCS) differ from alterations in patients without FBTCS. Methods: We dichotomized a cohort of 83 drug-resistant patients with TLE into those with and without FBTCS and compared each group to 29 healthy controls. For each subject, we used diffusion MRI to construct whole-brain structural networks. First, we measured the extent of alterations by performing FBTCS-negative (FBTCS-) versus control and FBTCS-positive (FBTCS+) versus control comparisons, thereby delineating altered sub-networks of the whole-brain structural network. Second, by standardising networks of each patient using control networks, we measured the subject-specific abnormality at every brain region in the network, thereby quantifying the spatial localisation and the amount of abnormality in every patient. Results: Both FBTCS+ and FBTCS- patient groups had altered sub-networks with reduced fractional anisotropy (FA) and increased mean diffusivity (MD) compared to controls. The altered subnetwork in FBTCS+ patients was more widespread than in FBTCS- patients (441 connections altered at t>3, p<0.001 in FBTCS+ compared to 21 connections altered at t>3, p=0.01 in FBTCS-). Significantly greater abnormalities-aggregated over the entire brain network as well as assessed at the resolution of individual brain areas-were present in FBTCS+ patients (p<0.001, d=0.82). In contrast, the fewer abnormalities present in FBTCS- patients were mainly localised to the temporal and frontal areas. Significance: The whole-brain structural network is altered to a greater and more widespread extent in patients with TLE and FBTCS. We suggest that these abnormal networks may serve as an underlying structural basis or consequence of the greater seizure spread observed in FBTCS.

preprint2020arXiv

Independent components of human brain morphology

Quantification of brain morphology has become an important cornerstone in understanding brain structure. Measures of cortical morphology such as thickness and surface area are frequently used to compare groups of subjects or characterise longitudinal changes. However, such measures are often treated as independent from each other. A recently described scaling law, derived from a statistical physics model of cortical folding, demonstrates that there is a tight covariance between three commonly used cortical morphology measures: cortical thickness, total surface area, and exposed surface area. We show that assuming the independence of cortical morphology measures can hide features and potentially lead to misinterpretations. Using the scaling law, we account for the covariance between cortical morphology measures and derive novel independent measures of cortical morphology. By applying these new measures, we show that new information can be gained; in our example we show that distinct morphological alterations underlie healthy ageing compared to temporal lobe epilepsy, even on the coarse level of a whole hemisphere. We thus provide a conceptual framework for characterising cortical morphology in a statistically valid and interpretable manner, based on theoretical reasoning about the shape of the cortex.

preprint2020arXiv

Multivariate white matter alterations are associated with epilepsy duration

Previous studies investigating associations between white matter alterations and duration of temporal lobe epilepsy (TLE) have shown differing results, and were typically limited to univariate analyses of tracts in isolation. In this study we apply a multivariate measure (the Mahalanobis distance), to capture the distinct ways white matter may differ in individual patients, and relate this to epilepsy duration. Diffusion MRI, from a cohort of 94 subjects (28 healthy controls, 33 left-TLE and 33 right-TLE), was used to assess associations between tract fractional anisotropy (FA) and epilepsy duration. Using ten white matter tracts, we analysed associations using traditional univariate analyses (z-scores) and a complementary multivariate approach (Mahalanobis distance), incorporating multiple white matter tracts into a single unified analysis. In patients with right-TLE, FA was not significantly associated with epilepsy duration for any tract studied in isolation. In patients with left-TLE, the FA of two limbic tracts (ipsilateral fornix, contralateral cingulum gyrus) was significantly negatively associated with epilepsy duration (Bonferonni corrected p<0.05). Using a multivariate approach we found significant ipsilateral positive associations with duration in both left, and right-TLE cohorts (left-TLE: Spearman&#39;s rho=0.487, right-TLE: Spearman&#39;s rho=0.422). Extrapolating our multivariate results to duration equals zero (i.e. at onset) we found no significant difference between patients and controls. Associations using the multivariate approach were more robust than univariate methods. The multivariate distance measure provides non-overlapping and more robust results than traditional univariate analyses. Future studies should consider adopting both frameworks into their analysis in order to ascertain a more complete understanding of epilepsy progression, regardless of laterality.

preprint2020arXiv

Predicting the Impact of Electric Field Stimulation in a Detailed Computational Model of Cortical Tissue

Neurostimulation using weak electric fields has generated excitement in recent years due to its potential as a medical intervention. However, study of this stimulation modality has been hampered by inconsistent results and large variability within and between studies. In order to begin addressing this variability we need to properly characterise the impact of the current on the underlying neuron populations. To develop and test a computational model capable of capturing the impact of electric field stimulation on networks of neurons. We construct a cortical tissue model with distinct layers and explicit neuron morphologies. We then apply a model of electrical stimulation and carry out multiple test case simulations. The cortical slice model is compared to experimental literature and shown to capture the main features of the electrophysiological response to stimulation. Namely, the model showed 1) a similar level of depolarisation in individual pyramidal neurons, 2) acceleration of intrinsic oscillations, and 3) retention of the spatial profile of oscillations in different layers. We then apply alternative electric fields to demonstrate how the model can capture differences in neuronal responses to the electric field. We demonstrate that the tissue response is dependent on layer depth, the angle of the apical dendrite relative to the field, and stimulation strength. We present publicly available computational modelling software that predicts the neuron network population response to electric field stimulation.

preprint2020arXiv

Reliability and comparability of human brain structural covariance networks

Structural covariance analysis is a widely used structural MRI analysis method which characterises the co-relations of morphology between brain regions over a group of subjects. To our knowledge, little has been investigated in terms of the comparability of results between different data sets or the reliability of results over the same subjects in different rescan sessions, image resolutions, or FreeSurfer versions. In terms of comparability, our results show substantial differences in the structural covariance matrix between data sets of age- and sex-matched healthy human adults. These differences persist after site correction, they are exacerbated by low sample sizes, and they are most pronounced when using average cortical thickness as a morphological measure. Down-stream graph theoretic analyses further show statistically significant differences. In terms of reliability, substantial differences were also found when comparing repeated scan sessions of the same subjects, and image resolutions and FreeSurfer versions of the same image. We could further estimate the relative measurement error and showed that it is largest when using thickness. With simulated data, we argue that cortical thickness is least reliable because of larger relative measurement errors. Practically, we make the following recommendations (1) pooling subjects across sites into one group should be avoided, particularly if sites differ in image resolutions, demographics, or preprocessing; (2) surface area and volume should be preferred as morphological measures over cortical thickness; (3) a large number of subjects should be used to estimate structural covariance; (4) measurement error should be assessed where repeated measurements are available; (5) if combining sites is critical, univariate site-correction is insufficient, but error covariance should be explicitly measured and modelled.

preprint2018arXiv

Universality in human cortical folding across lobes of individual brains

Background: We have previously demonstrated that cortical folding across mammalian species follows a universal scaling law that can be derived from a simple theoretical model. The same scaling law has also been shown to hold across brains of our own species, irrespective of age or sex. These results, however, only relate measures of complete cortical hemispheres. There are known systematic variations in morphology between different brain regions, and region-specific changes with age. It is therefore of interest to extend our analyses to different cortical regions, and analyze the scaling law within an individual brain. Methods: To directly compare the morphology of sub-divisions of the cortical surface in a size-independent manner, we base our method on a topological invariant of closed surfaces. We reconstruct variables of a complete hemisphere from each lobe of the brain so that it has the same gyrification index, average thickness and average Gaussian curvature. Results: We show that different lobes are morphologically diverse but obey the same scaling law that was observed across human subjects and across mammalian species. This is also the case for subjects with Alzheimer&#39;s disease. The age-dependent offset changes at similar rates for all lobes in healthy subjects, but differs most dramatically in the temporal lobe in Alzheimer&#39;s disease. Significance: Our results further support the idea that while morphological parameters can vary locally across the cortical surface/across subjects of the same species/across species, the processes that drive cortical gyrification are universal.