Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
26works
0followers
22topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

26 published item(s)

preprint2026arXiv

Anisotropic Modality Align

Training multimodal large language models has long been limited by the scarcity of high-quality paired multimodal data. Recent studies show that the shared representation space of pretrained multimodal contrastive models can serve as a bridge, enabling models to perform multimodal training with unimodal data. However, the key premise of this paradigm remains insufficiently understood: can representations from different modalities be reliably interchanged? The core obstacle lies in the persistent Modality Gap in the shared space. In this work, we revisit the geometric nature of the modality gap. We find that modality representations already share compatible dominant semantic geometry. What truly hinders modality interchangeability is not a simple global shift, but an anisotropic residual structure concentrated along a small number of dominant directions. Based on this finding, we further propose the principle of anisotropic modality gap alignment: effective modality alignment should align with the target-modality distribution while preserving the semantic structure of the source modality. Guided by this principle, we propose an anisotropic geometric correction framework, AnisoAlign, for unpaired modality alignment. This framework leverages the internal geometric prior of the target modality and performs bounded correction on source-modality representations, thereby constructing substitute representations in the target modality. Experiments confirm its benefits in both geometric diagnostics and text-only MLLM training. Overall, this work recasts the modality gap from an empirical observation into a correctable, structured geometric phenomenon and provides a new representation alignment perspective for training multimodal models with unimodal data.

preprint2026arXiv

Milestones over Outcome: Unlocking Geometric Reasoning with Sub-Goal Verifiable Reward

Multimodal Large Language Models (MLLMs) struggle with complex geometric reasoning, largely because "black box" outcome-based supervision fails to distinguish between lucky guesses and rigorous deduction. To address this, we introduce a paradigm shift towards subgoal-level evaluation and learning. We first construct GeoGoal, a benchmark synthesized via a rigorous formal verification data engine, which converts abstract proofs into verifiable numeric subgoals. This structure reveals a critical divergence between reasoning quality and outcome accuracy. Leveraging this, we propose the Sub-Goal Verifiable Reward (SGVR) framework, which replaces sparse signals with dense rewards based on the Skeleton Rate. Experiments demonstrate that SGVR not only enhances geometric performance (+9.7%) but also exhibits strong generalization, transferring gains to general math (+8.0%) and other general reasoning tasks (+2.8%), demonstrating broad applicability across diverse domains.

preprint2026arXiv

Multimodal Feedback for Handheld Tool Guidance: Combining Wrist-Based Haptics with Augmented Reality

We investigate how vibrotactile wrist feedback can enhance spatial guidance for handheld tool movement in optical see-through augmented reality (AR). While AR overlays are widely used to support surgical tasks, visual occlusion, lighting conditions, and interface ambiguity can compromise precision and confidence. To address these challenges, we designed a multimodal system combining AR visuals with a custom wrist-worn haptic device delivering directional and state-based cues. A formative study with experienced surgeons and residents identified key tool maneuvers and preferences for reference mappings, guiding our cue design. In a cue identification experiment (N=21), participants accurately recognized five vibration patterns under visual load, with higher recognition for full-actuator states than spatial direction cues. In a guidance task (N=27), participants using both AR and haptics achieved significantly higher spatial precision (5.8 mm) and usability (SUS = 88.1) than those using either modality alone, despite having modest increases in task time. Participants reported that haptic cues provided reassuring confirmation and reduced cognitive effort during alignment. Our results highlight the promise of integrating wrist-based haptics into AR systems for high-precision, visually complex tasks such as surgical guidance. We discuss design implications for multimodal interfaces supporting confident, efficient tool manipulation.

preprint2025arXiv

GeoBench: Rethinking Multimodal Geometric Problem-Solving via Hierarchical Evaluation

Geometric problem solving constitutes a critical branch of mathematical reasoning, requiring precise analysis of shapes and spatial relationships. Current evaluations of geometric reasoning in vision-language models (VLMs) face limitations, including the risk of test data contamination from textbook-based benchmarks, overemphasis on final answers over reasoning processes, and insufficient diagnostic granularity. To address these issues, we present GeoBench, a hierarchical benchmark featuring four reasoning levels in geometric problem-solving: Visual Perception, Goal-Oriented Planning, Rigorous Theorem Application, and Self-Reflective Backtracking. Through six formally verified tasks generated via TrustGeoGen, we systematically assess capabilities ranging from attribute extraction to logical error correction. Experiments reveal that while reasoning models like OpenAI-o3 outperform general MLLMs, performance declines significantly with increasing task complexity. Key findings demonstrate that sub-goal decomposition and irrelevant premise filtering critically influence final problem-solving accuracy, whereas Chain-of-Thought prompting unexpectedly degrades performance in some tasks. These findings establish GeoBench as a comprehensive benchmark while offering actionable guidelines for developing geometric problem-solving systems.

preprint2025arXiv

Non-Euclidean interfaces decode the continuous landscape of graphene-induced surface reconstructions

Interfacial reconstruction between two-dimensional (2D) materials and metal substrates fundamentally governs heterostructure properties, yet conventional flat substrates fail to capture the continuous crystallographic landscape. Here, we overcome this topological limitation using non-Euclidean interfaces-curved 2D graphene-copper surfaces as a model system-to traverse the infinite spectrum of lattice orientations. By integrating multimodal microscopy with a deep-learning-enhanced dimensional upscaling framework, we translate 2D scanning electron microscopy (SEM) contrast into quantitative three-dimensional (3D) morphologies with accurate facet identification. Coupling these observations with machine-learning-assisted density functional theory, we demonstrate that reconstruction is governed by a unified thermodynamic mechanism where high-index facets correspond to specific local minima in the surface energy landscape. This work resolves the long-standing complexity of graphene-copper faceting and establishes non-Euclidean surface topologies as a generalizable paradigm for decoding and controlling interfacial reconstruction in diverse metal-2D material systems.

preprint2023arXiv

Quantum computing of fluid dynamics using the hydrodynamic Schrödinger equation

Simulating fluid dynamics on a quantum computer is intrinsically difficult due to the nonlinear and non-Hamiltonian nature of the Navier-Stokes equation (NSE). We propose a framework for quantum computing of fluid dynamics based on the hydrodynamic Schrödinger equation (HSE), which can be promising in simulating three-dimensional turbulent flows in various engineering applications. The HSE is derived by generalizing the Madelung transform to compressible/incompressible flows with finite vorticity and dissipation. Since the HSE is expressed as a unitary operator on a two-component wave function, it is more suitable than the NSE for quantum computing. The flow governed by the HSE can resemble a turbulent flow consisting of tangled vortex tubes with the five-thirds scaling of energy spectrum. We develop a prediction-correction quantum algorithm to solve the HSE. This algorithm is implemented for simple flows on the quantum simulator Qiskit with exponential speedup.

preprint2022arXiv

Do Deep Neural Networks Always Perform Better When Eating More Data?

Data has now become a shortcoming of deep learning. Researchers in their own fields share the thinking that "deep neural networks might not always perform better when they eat more data," which still lacks experimental validation and a convincing guiding theory. Here to fill this lack, we design experiments from Identically Independent Distribution(IID) and Out of Distribution(OOD), which give powerful answers. For the purpose of guidance, based on the discussion of results, two theories are proposed: under IID condition, the amount of information determines the effectivity of each sample, the contribution of samples and difference between classes determine the amount of sample information and the amount of class information; under OOD condition, the cross-domain degree of samples determine the contributions, and the bias-fitting caused by irrelevant elements is a significant factor of cross-domain. The above theories provide guidance from the perspective of data, which can promote a wide range of practical applications of artificial intelligence.

preprint2022arXiv

Foreground Object Structure Transfer for Unsupervised Domain Adaptation

Unsupervised domain adaptation aims to train a classification model from the labeled source domain for the unlabeled target domain. Since the data distributions of the two domains are different, the model often performs poorly on the target domain. Existing methods align the feature distributions of the source and target domains and learn domain-invariant features to improve the performance of the model. However, the features are usually aligned as a whole, and the domain adaptation task fails to serve the classification, which will ignore the class information and lead to misalignment.In this paper, we investigate those features that should be used for domain alignment, introduce prior knowledge to extract foreground features to guide the domain adaptation task for classification tasks, and perform alignment in the local structure of objects. We propose a method called Foreground Object Structure Transfer(FOST). The key to FOST is the new clustering based condition, which combines the relative position relationship of foreground objects. Based on this conditions, FOST makes the data distribution of the same class more compact in geometry. In practice, since the label of the target domain is not available, we use the clustering information of the source domain to assign pseudo labels to the target domain samples, and then according to the source domain data prior knowledge guides those positive features to maximum the inter-class distance between different classes and mimimum the intra-class distance. Extensive experimental results on various benchmarks ($i.e.$ ImageCLEF-DA, Office-31, Office-Home, Visda-2017) under different domain adaptation settings prove that our FOST compares favorably against the existing state-of-the-art domain adaptation methods.

preprint2022arXiv

Show Me More Details: Discovering Hierarchies of Procedures from Semi-structured Web Data

Procedures are inherently hierarchical. To "make videos", one may need to "purchase a camera", which in turn may require one to "set a budget". While such hierarchical knowledge is critical for reasoning about complex procedures, most existing work has treated procedures as shallow structures without modeling the parent-child relation. In this work, we attempt to construct an open-domain hierarchical knowledge-base (KB) of procedures based on wikiHow, a website containing more than 110k instructional articles, each documenting the steps to carry out a complex procedure. To this end, we develop a simple and efficient method that links steps (e.g., "purchase a camera") in an article to other articles with similar goals (e.g., "how to choose a camera"), recursively constructing the KB. Our method significantly outperforms several strong baselines according to automatic evaluation, human judgment, and application to downstream tasks such as instructional video retrieval. A demo with partial data can be found at https://wikihow-hierarchy.github.io. The code and the data are at https://github.com/shuyanzhou/wikihow_hierarchy.

preprint2021arXiv

Deceptive Reinforcement Learning for Privacy-Preserving Planning

In this paper, we study the problem of deceptive reinforcement learning to preserve the privacy of a reward function. Reinforcement learning is the problem of finding a behaviour policy based on rewards received from exploratory behaviour. A key ingredient in reinforcement learning is a reward function, which determines how much reward (negative or positive) is given and when. However, in some situations, we may want to keep a reward function private; that is, to make it difficult for an observer to determine the reward function used. We define the problem of privacy-preserving reinforcement learning, and present two models for solving it. These models are based on dissimulation -- a form of deception that `hides the truth'. We evaluate our models both computationally and via human behavioural experiments. Results show that the resulting policies are indeed deceptive, and that participants can determine the true reward function less reliably than that of an honest agent.

preprint2020arXiv

An improved FastEuler-DLKF small-UAV AHRS algorithm

The accurate Attitude Heading Reference System(AHRS) is an important apart of the UAV reliable flight system. Aiming at the application scenarios of near ground navigation of small-UAV, this paper establishes a loose couple error model of the gyroscope/accelerometer/magnetometer, and presents an improved FastEuler Double-Layer Kalman Filter algorithm. Using low-cost devices which include MEMS Inertial Measurement Units(IMU) and magnetometers, this paper constructs the AHRS hardware and software systems of UAV, and designs the offline and real-time verification platforms. Moreover, the attitude changes of UAV is analyzed by the simulation and flight test, respectively. In addition, an adaptive factor is used to adjust the measurement noise covariance in order to eliminate the harmful effects of linear acceleration in the accelerometer, which is solved the roll and ptich angle. The experimental comparison with the Complementary Filter shows that the proposed algorithm can provide accurate attitude information when UAV is flying, which improves the accuracy and reliability of attitude solution, and removes the influence the gyro bias for the attitude estimation.

preprint2020arXiv

An improved nonlinear FastEuler AHRS estimation based on the SVDCKF algorithm

In this paper, we present a Singular Value Decomposition Cubature Kalman Filter(SVDCKF) fusion algorithm based on the improved nonlinear FastEuler Attitude and Heading Reference and System(AHRS) estimation model for small-UAV attitude. The contributions of this work are the derivation of the low-cost IMU/MAG integrated AHRS model combined with the quaternion attitude determination, and use the FastEuler to correct the gyroscope attitude update, which can increase the real-time solution. In addition, the SVDCKF algorithm is fused the various raw sensors data in order to improve the filter accuracy compared with the CKF. The simulation and experiment results demonstrate the proposed algorithm has the more excellent attitude solution accuracy compared with the CKF in the low and high dynamic flight conditions.

preprint2020arXiv

An Intelligent Quaternion SVDCKF AHRS Estimation with Variable Adaptive Methods in Complex Conditions

Aimed at solving the problem of Attitude and Heading Reference System(AHRS) in the complex and dynamic conditions for small-UAV, An intelligent Singular Value Decomposition Cubature Kalman Filter(SVDCKF) combined with the Variable Adaptive Methods(VAM) is proposed in this paper. Considering the nonlinearity of quaternion AHRS model and non-positive definite of the state covariance matrix, the SVDCKF algorithm is presented with both the SVD and CKF in order to better obtain the filter accuracy and reliability. Additionally, there are the different changes of the values in the accelerometer measurement resulting from the complex flying conditions. Thus, the VAM is designed to deal with three-axis values of the acceleration and tune intelligently the measurement noise matrix Ra. Moreover, the heading measurement from the three-axis values of the magnetometer is calculated according to the whether to use the three-axis values of the acceleration in the special situations. The simulation and experiment results demonstrate that the proposed filter algorithm has the more excellent attitude solution accuracy and robustness than both the Complementary Filter(CF) and the Error State Kalman Filter(ESKF).

preprint2020arXiv

How to Eliminate Detour Behaviors in E-hailing? Real-time Detecting and Time-dependent Pricing

With the rapid development of information and communication technology (ICT), taxi business becomes a typical electronic commerce mode. However, one traditional problem still exists in taxi service, that greedy taxi drivers may deliberately take unnecessary detours to overcharge passengers. The detection of these fraudulent behaviors is essential to ensure high-quality taxi service. In this paper, we propose a novel framework for detecting and analyzing the detour behaviors both in off-line database and among on-line trips. Applying our framework to real-world taxi data-set, a remarkable performance (AUC surpasses 0.98) has been achieved in off-line classification. Meanwhile, we further extend the off-line methods to on-line detection, a warning mechanism is introduced to remind drivers and an excellent precision (AUC surpasses 0.90) also has arrived in this phases. After conducting extensive experiments to verify the relationships between pricing regulations and detour behaviors, some quantitative pricing suggestions, including rising base fare and reducing distance-based fare rate, are provided to eliminate detour behaviors from the long term.

preprint2020arXiv

MedDialog: Two Large-scale Medical Dialogue Datasets

Medical dialogue systems are promising in assisting in telemedicine to increase access to healthcare services, improve the quality of patient care, and reduce medical costs. To facilitate the research and development of medical dialogue systems, we build two large-scale medical dialogue datasets: MedDialog-EN and MedDialog-CN. MedDialog-EN is an English dataset containing 0.3 million conversations between patients and doctors and 0.5 million utterances. MedDialog-CN is an Chinese dataset containing 1.1 million conversations and 4 million utterances. To our best knowledge, MedDialog-(EN,CN) are the largest medical dialogue datasets to date. The dataset is available at https://github.com/UCSD-AI4H/Medical-Dialogue-System

preprint2020arXiv

Sex Differences in Severity and Mortality Among Patients With COVID-19: Evidence from Pooled Literature Analysis and Insights from Integrated Bioinformatic Analysis

Objective: To conduct a meta-analysis of current studies that examined sex differences in severity and mortality in patients with COVID-19, and identify potential mechanisms underpinning these differences. Methods: We performed a systematic review to collate data from observational studies examining associations of sex differences with clinical outcomes of COVID-19. PubMed, Web of Science and four preprint servers were searched for relevant studies. Data were extracted and analyzed using meta-analysis where possible, with summary data presented otherwise. Publicly available bulk RNA sequencing (RNA-seq), single-cell RNA sequencing (scRNA-seq), and chromatin immunoprecipitation sequencing (ChIP-seq) data were analyzed to explore the potential mechanisms underlying the observed association. Results: 39 studies met inclusion criteria, representing 77932 patients, of which 41510 (53.3%) were males. Men were at a markedly increased risk of developing severe cases compared with women. Furthermore, the pooled odds ratio (OR) of mortality for male group compared with the female group indicated significant higher mortality rate for male. Data from scRNA-seq suggest that men have a higher amount of ACE2-expressing pulmonary alveolar type II cells than women. Sex-based immunological differences exist. The expression of androgen receptor (AR) is positively correlated with ACE2, and there is evidence that AR may directly regulate the expression of ACE2. Conclusions: This meta-analysis detected an increased severity and mortality rate in the male populations with COVID-19, which might be attributable to the sex-based differences in cellular compositions and immunological microenvironments of the lung. The host cell receptor ACE2 is likely regulated by AR signaling pathway, which is identified as a potential target for prevention and treatment of SARS-Cov-2 infections in men.

preprint2020arXiv

Transfer Learning or Self-supervised Learning? A Tale of Two Pretraining Paradigms

Pretraining has become a standard technique in computer vision and natural language processing, which usually helps to improve performance substantially. Previously, the most dominant pretraining method is transfer learning (TL), which uses labeled data to learn a good representation network. Recently, a new pretraining approach -- self-supervised learning (SSL) -- has demonstrated promising results on a wide range of applications. SSL does not require annotated labels. It is purely conducted on input data by solving auxiliary tasks defined on the input data examples. The current reported results show that in certain applications, SSL outperforms TL and the other way around in other applications. There has not been a clear understanding on what properties of data and tasks render one approach outperforms the other. Without an informed guideline, ML researchers have to try both methods to find out which one is better empirically. It is usually time-consuming to do so. In this work, we aim to address this problem. We perform a comprehensive comparative study between SSL and TL regarding which one works better under different properties of data and tasks, including domain difference between source and target tasks, the amount of pretraining data, class imbalance in source data, and usage of target data for additional pretraining, etc. The insights distilled from our comparative studies can help ML researchers decide which method to use based on the properties of their applications.

preprint2020arXiv

Variational approximations of empirical Bayes posteriors in high-dimensional linear models

In high-dimensions, the prior tails can have a significant effect on both posterior computation and asymptotic concentration rates. To achieve optimal rates while keeping the posterior computations relatively simple, an empirical Bayes approach has recently been proposed, featuring thin-tailed conjugate priors with data-driven centers. While conjugate priors ease some of the computational burden, Markov chain Monte Carlo methods are still needed, which can be expensive when dimension is high. In this paper, we develop a variational approximation to the empirical Bayes posterior that is fast to compute and retains the optimal concentration rate properties of the original. In simulations, our method is shown to have superior performance compared to existing variational approximations in the literature across a wide range of high-dimensional settings.

preprint2019arXiv

Modelling of the turbulent burning velocity based on Lagrangian statistics of propagating surfaces

We propose a predictive model of the turbulent burning velocity $S_T$ in homogeneous isotropic turbulence (HIT) based on Lagrangian statistics of propagating surfaces. The propagating surfaces with a constant displacement speed are initially arranged on a plane, and they evolve in non-reacting HIT, behaving like the propagation of a planar premixed flame front. The universal constants in the model of $S_T$ characterize the enhancement of area growth of premixed flames by turbulence, and they are determined by Lagrangian statistics of propagating surfaces. The flame area is then modelled by the area of propagating surfaces at a truncation time. This truncation time signals the statistical stationary state of the evolutionary geometry of propagating surfaces, and it is modelled by an explicit expression using limiting conditions of very weak and strong turbulence. Another parameter in the model of $S_T$ characterizes the effect of fuel chemistry on $S_T$, and it is pre-determined by very few available data points of $S_T$ from experiments or direct numerical simulation (DNS) in weak turbulence. The proposed model is validated using three DNS series of turbulent premixed flames with various fuels. The model prediction of $S_T$ generally agrees well with DNS in a wide range of premixed combustion regimes, and it captures the basic trends of $S_T$ in terms of the turbulence intensity, including the linear growth in weak turbulence and the `bending effect' in strong turbulence.

preprint2018arXiv

Bayesian inference in high-dimensional linear models using an empirical correlation-adaptive prior

In the context of a high-dimensional linear regression model, we propose the use of an empirical correlation-adaptive prior that makes use of information in the observed predictor variable matrix to adaptively address high collinearity, determining if parameters associated with correlated predictors should be shrunk together or kept apart. Under suitable conditions, we prove that this empirical Bayes posterior concentrates around the true sparse parameter at the optimal rate asymptotically. A simplified version of a shotgun stochastic search algorithm is employed to implement the variable selection procedure, and we show, via simulation experiments across different settings and a real-data application, the favorable performance of the proposed method compared to existing methods.

preprint2018arXiv

Modeling Three-dimensional Invasive Solid Tumor Growth in Heterogeneous Microenvironment under Chemotherapy

A systematic understanding of the evolution and growth dynamics of invasive solid tumors in response to different chemotherapy strategies is crucial for the development of individually optimized oncotherapy. Here, we develop a hybrid three-dimensional (3D) computational model that integrates pharmacokinetic model, continuum diffusion-reaction model and discrete cell automaton model to investigate 3D invasive solid tumor growth in heterogeneous microenvironment under chemotherapy. Specifically, we consider the effects of heterogeneous environment on drug diffusion, tumor growth, invasion and the drug-tumor interaction on individual cell level. We employ the hybrid model to investigate the evolution and growth dynamics of avascular invasive solid tumors under different chemotherapy strategies. Our simulations reproduce the well-established observation that constant dosing is generally more effective in suppressing primary tumor growth than periodic dosing, due to the resulting continuous high drug concentration. In highly heterogeneous microenvironment, the malignancy of the tumor is significantly enhanced, leading to inefficiency of chemotherapies. The effects of geometrically-confined microenvironment and non-uniform drug dosing are also investigated. Our computational model, when supplemented with sufficient clinical data, could eventually lead to the development of efficient in silico tools for prognosis and treatment strategy optimization.

preprint2017arXiv

Exploiting ITO colloidal nanocrystals for ultrafast pulse generation

Dynamical materials that capable of responding to optical stimuli have always been pursued for designing novel photonic devices and functionalities, of which the response speed and amplitude as well as integration adaptability and energy effectiveness are especially critical. Here we show ultrafast pulse generation by exploiting the ultrafast and sensitive nonlinear dynamical processes in tunably solution-processed colloidal epsilon-near-zero (ENZ) transparent conducting oxide (TCO) nanocrystals (NCs), of which the potential respond response speed is >2 THz and modulation depth is ~23% pumped at ~0.7 mJ/cm2, benefiting from the highly confined geometry in addition to the ENZ enhancement effect. These ENZ NCs may offer a scalable and printable material solution for dynamic photonic and optoelectronic devices.

preprint2015arXiv

Electrically-gated near-field radiative thermal transistor

In this work, we propose a near-field radiative thermal transistor made of two graphene-covered silicon carbide (SiC) plates separated by a nanometer vacuum gap. Thick SiC plates serve as the thermal "source" and "drain", while graphene sheets function as the "gate" to modulate the near-field photon tunneling by tuning chemical potential with applied voltage biases symmetrically or asymmetrically. The radiative heat flux calculated from fluctuational electrodynamics significantly varies with graphene chemical potentials, which can tune the coupling between graphene plasmon across the vacuum gap. Thermal modulation, switching, and amplification, which are the key features required for a thermal transistor, are theoretically realized and analyzed. This work will pave the way to active thermal management, thermal circuits, and thermal computing.

preprint2015arXiv

Performance Analysis of a Near-Field Thermophotovoltaic Device with a Metallodielectric Selective Emitter and Electrical Contacts for the Photovoltaic Cell

A near-field thermophotovoltaic (TPV) system with a multilayer emitter of alternate tungsten and alumina layer is proposed in this paper. The fluctuational electrodynamics along with the dyadic Green function for a multilayered structure is applied to calculate the spectral heat flux, and the charge transport equations are solved to get the photocurrent generation and electrical power output. The spectral heat flux is much enhanced when plain tungsten emitter is replaced with multilayer emitter. The mechanism of surface plasmon polariton coupling in the tungsten thin film, which is responsible for the heat flux enhancement, is analyzed. In addition, the invalidity of effective medium theory to predict the optical properties of multilayer structure in near-field radiation is discussed. The tungsten and alumina layer thicknesses are optimized to match the spectral heat flux with the bandgap of TPV cell. Practically, with a gold reflector placed on the back of TPV cell, which also acts as the back electrode, and a 5-nm-thick indium tin oxide (ITO) layer as the front contact, when the emitter and receiver temperature are respectively set as 2000 K and 300 K, the conversion efficiency and electrical power output can be achieved to 23.7% and 0.31 MW/m2 at a vacuum gap distance of 100 nm.

preprint2015arXiv

Spectrally enhancing near-field radiative heat transfer by exciting magnetic polariton in SiC gratings

In the present work, we theoretically demonstrate, for the first time, that near field radiative transport between 1D periodic grating microstructures separated by subwavelength vacuum gaps can be significantly enhanced by exciting magnetic resonance or polariton. Fluctuational electrodynamics that incorporates scattering matrix theory with rigorous coupled wave analysis is employed to exactly calculate the near field radiative heat flux between two SiC gratings. Besides the well known coupled surface phonon polaritons (SPhP), an additional spectral radiative heat flux peak, which is due to magnetic polariton, is found within the phonon absorption band of SiC. The mechanisms, behaviors and interplays between magnetic polariton, coupled SPhP, single interface SPhP, and Wood's anomaly in the near field radiative transport are elucidated in detail. The findings will open up a new way to control near field radiative heat transfer by magnetic resonance with micro or nanostructured metamaterials.

preprint2014arXiv

Tungsten Nanowire Based Hyperbolic Metamaterial Emitters for Near-field Thermophotovoltaic Applications

Recently, near-field radiative heat transfer enhancement across nanometer vacuum gaps has been intensively studied between two hyperbolic metamaterials (HMMs) due to unlimited wavevectors and high photonic density of state. In this work, we theoretically analyze the energy conversion performance of a thermophotovoltaic (TPV) cell made of In0.2Ga0.8Sb when paired with a HMM emitter composed of tungsten nanowire arrays embedded in Al2O3 host at nanometer vacuum gaps. Fluctuational electrodynamics integrated with effective medium theory and anisotropic thin-film optics is used to calculate the near-field radiative heat transfer. It is found that the spectral radiative energy is enhanced by the epsilon-near-zero and hyperbolic modes at different polarizations. As a result, the power output from a semi-infinite TPV cell is improved by 1.85 times with the nanowire HMM emitter over that with a plain tungsten emitter at a vacuum gap of 10 nm. Moreover, by using a thin TPV cell with 10 um thickness, the conversion efficiency can be greatly improved from 19.5% to 31.5% without affecting the power generation, due to the total internal reflection occurring at the bottom cell interface that minimizes the sub-bandgap spectral radiative energy. Furthermore, the effects of a TPV cell and a nanowire emitter with finite thicknesses are also studied. The result shows that the maximum efficiency of 32.2% is achieved with an optimal cell thickness of 3 um while the nanowire HMM emitter should be thick enough to be opaque. The fundamental understanding and insights obtained here will facilitate the design and application of novel materials in enhancing near-field TPV energy conversion.