Source author record

Han Huang

Han Huang appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Computer Vision cond-mat.mtrl-sci cond-mat.mes-hall Computation and Language eess.IV Machine Learning math.FA math.PR Artificial Intelligence cond-mat.supr-con Emerging Technologies Human-Computer Interaction Information Theory Logic in Computer Science math.IT math.MG

Catalog footprint

What is connected

18works

16topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2025arXiv

Biochemical Computing Mode for Sequential Logic

Recent years have witnessed the growing scholarly interest in the next-generation general-purpose computers. Various innovative computing modes have been proposed, such as optical, quantum phenomena, and DNA-based modes. Sequential logic circuits are a critical factor that enables these modes to function as general-purpose computers, given their essential role in facilitating continuous computation and memory storage through their ability to store states. However, compared to computability, it is often overlooked due to the difficulty of its implementation. In this paper, we first demonstrate sequential mapping, a crucial necessary condition for electronic computers to realize sequential logic circuits, and highlight this distinctive property of general-purpose computers in the context of logic gate circuits. To achieve computational functionalities comparable to those of electronic computers, we utilize the control effect of enzymes on enzymatic reactions to design a logic gate model that is composed of small molecules and driven by enzymes, subsequently propose a biochemical computing mode. Furthermore, we mathematically analyze the static and dynamic input-output properties of biochemical logic gate components and prove that the biochemical computing mode satisfies sequential mapping similar to electronic computers. When combined with the storage characteristics of NOT-AND gates, it can realize sequential logic circuits. The findings can serve as a theoretical foundation for developing general-purpose biochemical computers.

preprint2022arXiv

CrossHuman: Learning Cross-Guidance from Multi-Frame Images for Human Reconstruction

We propose CrossHuman, a novel method that learns cross-guidance from parametric human model and multi-frame RGB images to achieve high-quality 3D human reconstruction. To recover geometry details and texture even in invisible regions, we design a reconstruction pipeline combined with tracking-based methods and tracking-free methods. Given a monocular RGB sequence, we track the parametric human model in the whole sequence, the points (voxels) corresponding to the target frame are warped to reference frames by the parametric body motion. Guided by the geometry priors of the parametric body and spatially aligned features from RGB sequence, the robust implicit surface is fused. Moreover, a multi-frame transformer (MFT) and a self-supervised warp refinement module are integrated to the framework to relax the requirements of parametric body and help to deal with very loose cloth. Compared with previous works, our CrossHuman enables high-fidelity geometry details and texture in both visible and invisible regions and improves the accuracy of the human reconstruction even under estimated inaccurate parametric human models. The experiments demonstrate that our method achieves state-of-the-art (SOTA) performance.

preprint2022arXiv

NTIRE 2022 Challenge on Efficient Super-Resolution: Methods and Results

This paper reviews the NTIRE 2022 challenge on efficient single image super-resolution with focus on the proposed solutions and results. The task of the challenge was to super-resolve an input image with a magnification factor of $\times$4 based on pairs of low and corresponding high resolution images. The aim was to design a network for single image super-resolution that achieved improvement of efficiency measured according to several metrics including runtime, parameters, FLOPs, activations, and memory consumption while at least maintaining the PSNR of 29.00dB on DIV2K validation set. IMDN is set as the baseline for efficiency measurement. The challenge had 3 tracks including the main track (runtime), sub-track one (model complexity), and sub-track two (overall performance). In the main track, the practical runtime performance of the submissions was evaluated. The rank of the teams were determined directly by the absolute value of the average runtime on the validation set and test set. In sub-track one, the number of parameters and FLOPs were considered. And the individual rankings of the two metrics were summed up to determine a final ranking in this track. In sub-track two, all of the five metrics mentioned in the description of the challenge including runtime, parameter count, FLOPs, activations, and memory consumption were considered. Similar to sub-track one, the rankings of five metrics were summed up to determine a final ranking. The challenge had 303 registered participants, and 43 teams made valid submissions. They gauge the state-of-the-art in efficient single image super-resolution.

preprint2022arXiv

Personalized Diagnostic Tool for Thyroid Cancer Classification using Multi-view Ultrasound

Over the past decades, the incidence of thyroid cancer has been increasing globally. Accurate and early diagnosis allows timely treatment and helps to avoid over-diagnosis. Clinically, a nodule is commonly evaluated from both transverse and longitudinal views using thyroid ultrasound. However, the appearance of the thyroid gland and lesions can vary dramatically across individuals. Identifying key diagnostic information from both views requires specialized expertise. Furthermore, finding an optimal way to integrate multi-view information also relies on the experience of clinicians and adds further difficulty to accurate diagnosis. To address these, we propose a personalized diagnostic tool that can customize its decision-making process for different patients. It consists of a multi-view classification module for feature extraction and a personalized weighting allocation network that generates optimal weighting for different views. It is also equipped with a self-supervised view-aware contrastive loss to further improve the model robustness towards different patient groups. Experimental results show that the proposed framework can better utilize multi-view information and outperform the competing methods.

preprint2022arXiv

Semantic Distillation Guided Salient Object Detection

Most existing CNN-based salient object detection methods can identify local segmentation details like hair and animal fur, but often misinterpret the real saliency due to the lack of global contextual information caused by the subjectiveness of the SOD task and the locality of convolution layers. Moreover, due to the unrealistically expensive labeling costs, the current existing SOD datasets are insufficient to cover the real data distribution. The limitation and bias of the training data add additional difficulty to fully exploring the semantic association between object-to-object and object-to-environment in a given image. In this paper, we propose a semantic distillation guided SOD (SDG-SOD) method that produces accurate results by fusing semantically distilled knowledge from generated image captioning into the Vision-Transformer-based SOD framework. SDG-SOD can better uncover inter-objects and object-to-environment saliency and cover the gap between the subjective nature of SOD and its expensive labeling. Comprehensive experiments on five benchmark datasets demonstrate that the SDG-SOD outperforms the state-of-the-art approaches on four evaluation metrics, and largely improves the model performance on DUTS, ECSSD, DUT, HKU-IS, and PASCAL-S datasets.

preprint2022arXiv

Situational Perception Guided Image Matting

Most automatic matting methods try to separate the salient foreground from the background. However, the insufficient quantity and subjective bias of the current existing matting datasets make it difficult to fully explore the semantic association between object-to-object and object-to-environment in a given image. In this paper, we propose a Situational Perception Guided Image Matting (SPG-IM) method that mitigates subjective bias of matting annotations and captures sufficient situational perception information for better global saliency distilled from the visual-to-textual task. SPG-IM can better associate inter-objects and object-to-environment saliency, and compensate the subjective nature of image matting and its expensive annotation. We also introduce a textual Semantic Transformation (TST) module that can effectively transform and integrate the semantic feature stream to guide the visual representations. In addition, an Adaptive Focal Transformation (AFT) Refinement Network is proposed to adaptively switch multi-scale receptive fields and focal points to enhance both global and local details. Extensive experiments demonstrate the effectiveness of situational perception guidance from the visual-to-textual tasks on image matting, and our model outperforms the state-of-the-art methods. We also analyze the significance of different components in our model. The code will be released soon.

preprint2022arXiv

Structured Local Radiance Fields for Human Avatar Modeling

It is extremely challenging to create an animatable clothed human avatar from RGB videos, especially for loose clothes due to the difficulties in motion modeling. To address this problem, we introduce a novel representation on the basis of recent neural scene rendering techniques. The core of our representation is a set of structured local radiance fields, which are anchored to the pre-defined nodes sampled on a statistical human body template. These local radiance fields not only leverage the flexibility of implicit representation in shape and appearance modeling, but also factorize cloth deformations into skeleton motions, node residual translations and the dynamic detail variations inside each individual radiance field. To learn our representation from RGB data and facilitate pose generalization, we propose to learn the node translations and the detail variations in a conditional generative latent space. Overall, our method enables automatic construction of animatable human avatars for various types of clothes without the need for scanning subject-specific templates, and can generate realistic images with dynamic details for novel poses. Experiment show that our method outperforms state-of-the-art methods both qualitatively and quantitatively.

preprint2022arXiv

Ultrafast Optical Spectroscopy Evidence of Pseudogap and Electron-Phonon Coupling in an Iron-Based Superconductor KCa$_2$Fe$_4$As$_4$F$_2$

We use ultrafast optical spectroscopy to study the nonequilibrium quasiparticle relaxation dynamics of the iron-based superconductor KCa$_2$Fe$_4$As$_4$F$_2$ with $T_c=33.5$ K. Our results reveal a possible pseudogap ($Δ_{PG}$ = 2.4 $\pm$ 0.1 meV) below $T^*\approx 50$ K but prior to the opening of a superconducting gap ($Δ_{SC}$(0) $\approx$ 4.3 $\pm$ 0.1 meV). Measurements under high pump fluence real two distinct, coherent phonon oscillations with 1.95 and 5.51 THz frequencies, respectively. The high-frequency $A_{1g}$(2) mode corresponds to the $c-$axis polarized vibrations of FeAs planes with a nominal electron-phonon coupling constant $λ_{A_{1g}(2)}$ = 0.194 $\pm$ 0.02. Our findings suggest that the pseudogap is likely a precursor of superconductivity, and the electron-phonon coupling may play an essential role in the superconducting pairing in KCa$_2$Fe$_4$As$_4$F$_2$.

preprint2021arXiv

Temperature evolution of quasiparticle dispersion dynamics in semimetallic 1T-TiTe2 via high-resolution angle-resolved photoemission spectroscopy and ultrafast optical pump-probe spectroscopy

High-resolution angle-resolved photoemission spectroscopy and ultrafast optical pump-probe spectroscopy were used to study semimetallic 1T - TiTe2 quasiparticle dispersion and dynamics. A kink and a flat band, having the same energy scale and temperature-dependent behaviors along the G-M direction, were detected. Both manifested at low temperatures but blurred as temperature increased. The kink was formed by an electron-phonon coupling. And the localized flat band might be closely related to an electron-phonon coupling. Ultrafast optical spectroscopy identified multiple distinct time scales in the 10-300 K range. Quantitative analysis of the fastest decay process evidenced a significant lifetime temperature dependence at high temperatures, while this starts to change slowly below ~ 100 K where an anomalous Hall coefficient occurred. At low temperature, a coherent A1g phonon mode with a frequency of ~ 4.36 THz was extracted. Frequency temperature dependence suggests that phonon hardening occurs as temperature falls and anharmonic effects can explain it. Frequency fluence dependence indicates that the phonons soften as fluence increases.

preprint2020arXiv

A remark on the smallest singular value of powers of Gaussian matrices

Let $n,k\geq 1$ and let $G$ be the $n\times n$ random matrix with i.i.d. standard real Gaussian entries. We show that there are constants $c_k,C_k>0$ depending only on $k$ such that the smallest singular value of $G^k$ satisfies $$ c_k\,t\leq {\mathbb P}\big\{s_{\min}(G^k)\leq t^k\,n^{-1/2}\big\}\leq C_k\,t,\quad t\in(0,1], $$ and, furthermore, $$ c_k/t\leq {\mathbb P}\big\{\|G^{-k}\|_{HS}\geq t^k\,n^{1/2}\big\}\leq C_k/t,\quad t\in[1,\infty), $$ where $\|\cdot\|_{HS}$ denotes the Hilbert-Schmidt norm.

preprint2020arXiv

Distribution of the minimal distance of random linear codes

In this paper, we study the distribution of the minimal distance (in the Hamming metric) of a random linear code of dimension $k$ in $\mathbb{F}_q^n$. We provide quantitative estimates showing that the distribution function of the minimal distance is close ({\it{}superpolynomially} in $n$)to the cumulative distribution function of the minimum of $(q^k-1)/(q-1)$ independent binomial random variables with parameters $\frac{1}{q}$ and $n$. The latter, in turn, converges to a Gumbel distribution at integer points when $\frac{k}{n}$ converges to a fixed number in $(0,1)$. Our result confirms in a strong sense that apart from identification of the weights of proportional codewords, the probabilistic dependencies introduced by the linear structure of the random code, produce a negligible effect on the minimal code weight. As a corollary of the main result, we obtain an improvement of the Gilbert--Varshamov bound for $2<q<49$.

preprint2020arXiv

Improved bounds for Hadwiger's covering problem via thin-shell estimates

A central problem in discrete geometry, known as Hadwiger's covering problem, asks what the smallest natural number $N\left(n\right)$ is such that every convex body in ${\mathbb R}^{n}$ can be covered by a union of the interiors of at most $N\left(n\right)$ of its translates. Despite continuous efforts, the best general upper bound known for this number remains as it was more than sixty years ago, of the order of ${2n \choose n}n\ln n$. In this note, we improve this bound by a sub-exponential factor. That is, we prove a bound of the order of ${2n \choose n}e^{-c\sqrt{n}}$ for some universal constant $c>0$. Our approach combines ideas from previous work by Artstein-Avidan and the second named author with tools from Asymptotic Geometric Analysis. One of the key steps is proving a new lower bound for the maximum volume of the intersection of a convex body $K$ with a translate of $-K$; in fact, we get the same lower bound for the volume of the intersection of $K$ and $-K$ when they both have barycenter at the origin. To do so, we make use of measure concentration, and in particular of thin-shell estimates for isotropic log-concave measures. Using the same ideas, we establish an exponentially better bound for $N\left(n\right)$ when restricting our attention to convex bodies that are $ψ_{2}$. By a slightly different approach, an exponential improvement is established also for classes of convex bodies with positive modulus of convexity.

preprint2020arXiv

Learning to Detect Head Movement in Unconstrained Remote Gaze Estimation in the Wild

Unconstrained remote gaze estimation remains challenging mostly due to its vulnerability to the large variability in head-pose. Prior solutions struggle to maintain reliable accuracy in unconstrained remote gaze tracking. Among them, appearance-based solutions demonstrate tremendous potential in improving gaze accuracy. However, existing works still suffer from head movement and are not robust enough to handle real-world scenarios. Especially most of them study gaze estimation under controlled scenarios where the collected datasets often cover limited ranges of both head-pose and gaze which introduces further bias. In this paper, we propose novel end-to-end appearance-based gaze estimation methods that could more robustly incorporate different levels of head-pose representations into gaze estimation. Our method could generalize to real-world scenarios with low image quality, different lightings and scenarios where direct head-pose information is not available. To better demonstrate the advantage of our methods, we further propose a new benchmark dataset with the most rich distribution of head-gaze combination reflecting real-world scenarios. Extensive evaluations on several public datasets and our own dataset demonstrate that our method consistently outperforms the state-of-the-art by a significant margin.

preprint2020arXiv

Size of nodal domains of the eigenvectors of a G(n,p) graph

Consider an eigenvector of the adjacency matrix of a G(n, p) graph. A nodal domain is a connected component of the set of vertices where this eigenvector has a constant sign. It is known that with high probability, there are exactly two nodal domains for each eigenvector corresponding to a non-leading eigenvalue. We prove that with high probability, the sizes of these nodal domains are approximately equal to each other.

preprint2016arXiv

Upper bound for the Dvoretzky dimension in Milman-Schechtman theorem

For a symmetric convex body $K\subset\mathbb{R}^n$, the Dvoretzky dimension $k(K)$ is the largest dimension for which a random central section of $K$ is almost spherical. A Dvoretzky-type theorem proved by V.~D.~Milman in 1971 provides a lower bound for $k(K)$ in terms of the average $M(K)$ and the maximum $b(K)$ of the norm generated by $K$ over the Euclidean unit sphere. Later, V.~D.~Milman and G. Schechtman obtained a matching upper bound for $k(K)$ in the case when $\frac{M(K)}{b(K)}>c(\frac{\log(n)}{n})^{\frac{1}{2}}$. In this paper, we will give an elementary proof of the upper bound in Milman-Schechtman theorem which does not require any restriction on $M(K)$ and $b(K)$.

preprint2012arXiv

Frictional characteristics of exfoliated and epitaxial graphene

To determine the friction coefficient of graphene, micro-scale scratch tests are conducted on exfoliated and epitaxial graphene at ambient conditions. The experimental results show that the monolayer, bilayer, and trilayer graphene all yield friction coefficients of approximately 0.03. The friction coefficient of pristine graphene is less than that of disordered graphene, which is treated by oxygen plasma. Ramping force scratch tests are performed on graphene with various numbers of layers to determine the normal load required for the probe to penetrate graphene. A very low friction coefficient and also its high pressure resistance make graphene a promising material for antiwear coatings.

preprint2012arXiv

Study of electromagnetic enhancement for surface enhanced Raman spectroscopy of SiC graphene

The electromagnetic enhancement for surface enhanced Raman spectroscopy (SERS) of graphene is studied by inserting a layer of Al2O3 between epitaxial graphene and Au nanoparticles. Different excitation lasers are utilized to study the relationship between laser wavelength and SERS. The theoretical calculation shows that the extinction spectrum of Au nanoparticles is modulated by the presence of graphene. The experimental results of the relationship between the excitation laser wavelength and the enhancement factor fit well with the calculated results. An exponential relationship is observed between the enhancement factor and the thickness of the spacer layer.

preprint2011arXiv

Surface energy engineering of graphene

Contact angle goniometry is conducted for epitaxial graphene on SiC. Although only a single layer of epitaxial graphene exists on SiC, the contact angle drastically changes from 69° on SiC substrates to 92° with graphene. It is found that there is no thickness dependence of the contact angle from the measurements of single, bi, and multi layer graphene and highly ordered pyrolytic graphite (HOPG). After graphene is treated with oxygen plasma, the level of damage is investigated by Raman spectroscopy and correlation between the level of disorder and wettability is reported. By using low power oxygen plasma treatment, the wettability of graphene is improved without additional damage, which can solve the adhesion issues involved in the fabrication of graphene devices.

Han Huang

What is connected

Connect this record

See the researcher in context

Building this map preview

18 published item(s)

Biochemical Computing Mode for Sequential Logic

CrossHuman: Learning Cross-Guidance from Multi-Frame Images for Human Reconstruction

NTIRE 2022 Challenge on Efficient Super-Resolution: Methods and Results

Personalized Diagnostic Tool for Thyroid Cancer Classification using Multi-view Ultrasound

Semantic Distillation Guided Salient Object Detection

Situational Perception Guided Image Matting

Structured Local Radiance Fields for Human Avatar Modeling

Ultrafast Optical Spectroscopy Evidence of Pseudogap and Electron-Phonon Coupling in an Iron-Based Superconductor KCa$_2$Fe$_4$As$_4$F$_2$

Temperature evolution of quasiparticle dispersion dynamics in semimetallic 1T-TiTe2 via high-resolution angle-resolved photoemission spectroscopy and ultrafast optical pump-probe spectroscopy

A remark on the smallest singular value of powers of Gaussian matrices

Distribution of the minimal distance of random linear codes

Improved bounds for Hadwiger's covering problem via thin-shell estimates

Learning to Detect Head Movement in Unconstrained Remote Gaze Estimation in the Wild

Size of nodal domains of the eigenvectors of a G(n,p) graph

Upper bound for the Dvoretzky dimension in Milman-Schechtman theorem

Frictional characteristics of exfoliated and epitaxial graphene

Study of electromagnetic enhancement for surface enhanced Raman spectroscopy of SiC graphene

Surface energy engineering of graphene