Source author record

Kun Huang

Kun Huang appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Catalog footprint

What is connected

24works

22topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

How Mobile World Model Guides GUI Agents?

Recent advances in vision-language models have enabled mobile GUI agents to perceive visual interfaces and execute user instructions, but reliable prediction of action consequences remains critical for long-horizon and high-risk interactions. Existing mobile world models provide either text-based or image-based future states, yet it remains unclear which representation is useful, whether generated rollouts can replace real environments, and how test-time guidance helps agents of different strengths. To answer the above questions, we filter and annotate mobile world-model data, then train world models across four modalities: delta text, full text, diffusion-based images, and renderable code. These models achieve SoTA performance on both MobileWorldBench and Code2WorldBench. Furthermore, by evaluating their downstream utility on AITZ, AndroidControl, and AndroidWorld, we obtain three findings. First, renderable code reconstruction achieves high in-distribution fidelity and provides effective multimodal supervision for data construction, while text-based feedback is more robust for online out-of-distribution (OOD) execution. Second, world-model-generated trajectories can provide transferable interaction experience in the training process and improve agents' end-to-end task performance, although these data do not preserve the original distribution. Last, for overconfident mobile agents with low action entropy, posterior self-reflection provides limited gains, suggesting that world models are more effective as prior perception or training supervision than as universal post-hoc verifiers.

preprint2022arXiv

A Compressed Gradient Tracking Method for Decentralized Optimization with Linear Convergence

Communication compression techniques are of growing interests for solving the decentralized optimization problem under limited communication, where the global objective is to minimize the average of local cost functions over a multi-agent network using only local computation and peer-to-peer communication. In this paper, we propose a novel compressed gradient tracking algorithm (C-GT) that combines gradient tracking technique with communication compression. In particular, C-GT is compatible with a general class of compression operators that unifies both unbiased and biased compressors. We show that C-GT inherits the advantages of gradient tracking-based algorithms and achieves linear convergence rate for strongly convex and smooth objective functions. Numerical examples complement the theoretical findings and demonstrate the efficiency and flexibility of the proposed algorithm.

preprint2022arXiv

Accurate calibration of multi-perspective cameras from a generalization of the hand-eye constraint

Multi-perspective cameras are quickly gaining importance in many applications such as smart vehicles and virtual or augmented reality. However, a large system size or absence of overlap in neighbouring fields-of-view often complicate their calibration. We present a novel solution which relies on the availability of an external motion capture system. Our core contribution consists of an extension to the hand-eye calibration problem which jointly solves multi-eye-to-base problems in closed form. We furthermore demonstrate its equivalence to the multi-eye-in-hand problem. The practical validity of our approach is supported by our experiments, indicating that the method is highly efficient and accurate, and outperforms existing closed-form alternatives.

preprint2022arXiv

Deep 360$^\circ$ Optical Flow Estimation Based on Multi-Projection Fusion

Optical flow computation is essential in the early stages of the video processing pipeline. This paper focuses on a less explored problem in this area, the 360$^\circ$ optical flow estimation using deep neural networks to support increasingly popular VR applications. To address the distortions of panoramic representations when applying convolutional neural networks, we propose a novel multi-projection fusion framework that fuses the optical flow predicted by the models trained using different projection methods. It learns to combine the complementary information in the optical flow results under different projections. We also build the first large-scale panoramic optical flow dataset to support the training of neural networks and the evaluation of panoramic optical flow estimation methods. The experimental results on our dataset demonstrate that our method outperforms the existing methods and other alternative deep networks that were developed for processing 360° content.

preprint2022arXiv

Label Adversarial Learning for Skeleton-level to Pixel-level Adjustable Vessel Segmentation

You can have your cake and eat it too. Microvessel segmentation in optical coherence tomography angiography (OCTA) images remains challenging. Skeleton-level segmentation shows clear topology but without diameter information, while pixel-level segmentation shows a clear caliber but low topology. To close this gap, we propose a novel label adversarial learning (LAL) for skeleton-level to pixel-level adjustable vessel segmentation. LAL mainly consists of two designs: a label adversarial loss and an embeddable adjustment layer. The label adversarial loss establishes an adversarial relationship between the two label supervisions, while the adjustment layer adjusts the network parameters to match the different adversarial weights. Such a design can efficiently capture the variation between the two supervisions, making the segmentation continuous and tunable. This continuous process allows us to recommend high-quality vessel segmentation with clear caliber and topology. Experimental results show that our results outperform manual annotations of current public datasets and conventional filtering effects. Furthermore, such a continuous process can also be used to generate an uncertainty map representing weak vessel boundaries and noise.

preprint2021arXiv

Quantum key distribution over scattering channel

Scattering of light by cloud, haze, and fog decreases the transmission efficiency of communication channels in quantum key distribution (QKD), reduces the system's practical security, and thus constrains the deployment of free-space QKD. Here, we employ the wavefront shaping technology to compensate distorted optical signals in high-loss scattering quantum channels and fulfill a polarization-encoded BB84 QKD experiment. With this quantum channel compensation technology, we achieve a typical enhancement of about 250 in transmission efficiency and improve the secure key rate from 0 to $1.85\times10^{-6}$ per sifted key. The method and its first time validation show the great potential to expand the territory of QKD systems from lossless channels to highly scattered ones and therefore enhances the deployment ability of global quantum communication network.

preprint2020arXiv

Generalized Perfect Optical Vortex along Arbitrary Trajectories

Perfect optical vortex (POV) is a type of vortex beam with an infinite thin ring and a fixed radius independent of its topological charge. Here we propose the concept of generalized perfect optical vortex along arbitrary curves beyond the regular shapes of circle and ellipse. Generalized perfect optical vortices also share the similar properties as POVs, such as defined only along infinite thin curves and owning topological charges independent of scales. Notably, they naturally degenerate to the POVs and elliptic POVs along circles and ellipses, respectively. We also experimentally generated the generalized perfect optical vortices through a digital micromirror device (DMD) and measured the phase distributions by interferometry, exhibiting good agreements with the simulations. Moreover, we derive a proper modified formula to yield the generalized perfect optical vortices with uniform intensity distribution along predesigned curves. The generalized perfect optical vortices might find the potential applications in optical tweezers and communication.

preprint2020arXiv

Learning to Parse Wireframes in Images of Man-Made Environments

In this paper, we propose a learning-based approach to the task of automatically extracting a "wireframe" representation for images of cluttered man-made environments. The wireframe (see Fig. 1) contains all salient straight lines and their junctions of the scene that encode efficiently and accurately large-scale geometry and object shapes. To this end, we have built a very large new dataset of over 5,000 images with wireframes thoroughly labelled by humans. We have proposed two convolutional neural networks that are suitable for extracting junctions and lines with large spatial support, respectively. The networks trained on our dataset have achieved significantly better performance than state-of-the-art methods for junction detection and line segment detection, respectively. We have conducted extensive experiments to evaluate quantitatively and qualitatively the wireframes obtained by our method, and have convincingly shown that effectively and efficiently parsing wireframes for images of man-made environments is a feasible goal within reach. Such wireframes could benefit many important visual tasks such as feature correspondence, 3D reconstruction, vision-based mapping, localization, and navigation. The data and source code are available at https://github.com/huangkuns/wireframe.

preprint2020arXiv

Low-Rank Reorganization via Proportional Hazards Non-negative Matrix Factorization Unveils Survival Associated Gene Clusters

One of the central goals in precision health is the understanding and interpretation of high-dimensional biological data to identify genes and markers associated with disease initiation, development, and outcomes. Though significant effort has been committed to harness gene expression data for multiple analyses while accounting for time-to-event modeling by including survival times, many traditional analyses have focused separately on non-negative matrix factorization (NMF) of the gene expression data matrix and survival regression with Cox proportional hazards model. In this work, Cox proportional hazards regression is integrated with NMF by imposing survival constraints. This is accomplished by jointly optimizing the Frobenius norm and partial log likelihood for events such as death or relapse. Simulation results on synthetic data demonstrated the superiority of the proposed method, when compared to other algorithms, in finding survival associated gene clusters. In addition, using human cancer gene expression data, the proposed technique can unravel critical clusters of cancer genes. The discovered gene clusters reflect rich biological implications and can help identify survival-related biomarkers. Towards the goal of precision health and cancer treatments, the proposed algorithm can help understand and interpret high-dimensional heterogeneous genomics data with accurate identification of survival-associated gene clusters.

preprint2016arXiv

Flat Helical Nanosieves

Compact and miniaturized devices with flexible functionalities are always highly demanded in optical integrated systems. Plasmonic nanosieve has been successfully harnessed as an ultrathin flat platform for complex manipulation of light, including holography, vortex generation and non-linear processes. Compared with most of reported single-functional devices, multi-functional nanosieves might find more complex and novel applications across nano-photonics, optics and nanotechnology. Here, we experimentally demonstrate a promising roadmap for nanosieve-based helical devices, which achieves full manipulations of optical vortices, including its generation, hybridization, spatial multiplexing, focusing and non-diffraction propagation etc., by controlling the geometric phase of spin light via over 121 thousands of spatially-rotated nano-sieves. Thanks to such spin-conversion nanosieve helical elements, it is no longer necessary to employ the conventional two-beam interferometric measurement to characterize optical vortices, while the interference can be realized natively without changing any parts of the current setup. The proposed strategy makes the far-field manipulations of optical orbital angular momentum within an ultrathin interface viable and bridges singular optics and integrated optics. In addition, it enables more unique extensibility and flexibility in versatile optical elements than traditional phase-accumulated helical optical devices.

preprint2016arXiv

On Secrecy Capacity of Minimum Storage Regenerating Codes

In this paper, we revisit the problem of characterizing the secrecy capacity of minimum storage regenerating (MSR) codes under the passive $(l_1,l_2)$-eavesdropper model, where the eavesdropper has access to data stored on $l_1$ nodes and the repair data for an additional $l_2$ nodes. We study it from the information-theoretic perspective. First, some general properties of MSR codes as well as a simple and generally applicable upper bound on secrecy capacity are given. Second, a new concept of \emph{stable} MSR codes is introduced, where the stable property is shown to be closely linked with secrecy capacity. Finally, a comprehensive and explicit result on secrecy capacity in the linear MSR scenario is present, which generalizes all related works in the literature and also predicts certain results for some unexplored linear MSR codes.

preprint2016arXiv

Security Concerns in Minimum Storage Cooperative Regenerating Codes

Here, we revisit the problem of exploring the secrecy capacity of minimum storage cooperative regenerating (MSCR) codes under the $\{l_1,l_2\}$-eavesdropper model, where the eavesdropper can observe the data stored on $l_1$ nodes and the repair downloads of an additional $l_2$ nodes. Compared to minimum storage regenerating (MSR) codes which support only single node repairs, MSCR codes allow efficient simultaneous repairs of multiple failed nodes, referred to as a \emph{repair group}. However, the repair data sent from a helper node to another failed node may vary with different repair groups or the sets of helper nodes, which would inevitably leak more data information to the eavesdropper and even render the storage system unable to maintain any data secrecy. In this paper, we introduce and study a special category of MSCR codes, termed "\emph{stable}" MSCR codes, where the repair data from any one helper node to any one failed node is required to be independent of the repair group or the set of helper nodes. Our main contributions include: 1. Demonstrating that two existing MSCR codes inherently are not stable and thus have poor secrecy capacity, 2. Converting one existing MSCR code to a stable one, which offers better secrecy capacity when compared to the original one, 3. Employing information theoretic analysis to characterize the secrecy capacity of stable MSCR codes in certain situations.

preprint2015arXiv

Continuously Shaping Orbital Angular Momentum with an Analog Optical Vortex Transmitter

Dynamic generation of obitial angular momentum (OAM) of light has enabled complex manipulation of micro-particles, high-dimension quantum entanglement and optical communication. We report an analog vortex transmitter made of one bilaterally symmetric grating and an aperture, emitting optical vortices with the average OAM value continuously variant in the entire rational range. Benefiting from linearly-varying transverse dislocation along its axis of symmetry, this diffractive transmitter possesses extra degree of freedom in engineering broadband optical vortices meanwhile preserving a novel spiniform phase with equally spaced singularities. It unlimitedly increases the average OAM of light by embracing more singularities, which is significantly different from that for Laguerre-Gaussian (LG) and Bessel vortex beams. Realizing analog generation of OAM in a single device, this technique can be potentially extended to other frequencies and applied to a wide spectrum of developments on quantum physics, aperiodic photonics and optical manipulation.

preprint2014arXiv

Capacity analysis of a multi-cell multi-antenna cooperative cellular network with co-channel interference

Characterization and modeling of co-channel interference is critical for the design and performance evaluation of realistic multi-cell cellular networks. In this paper, based on alpha stable processes, an analytical co-channel interference model is proposed for multi-cell multiple-input multi-output (MIMO) cellular networks. The impact of different channel parameters on the new interference model is analyzed numerically. Furthermore, the exact normalized downlink average capacity is derived for a multi-cell MIMO cellular network with co-channel interference. Moreover, the closed-form normalized downlink average capacity is derived for cell-edge users in the multi-cell multiple-input single-output (MISO) cooperative cellular network with co-channel interference. From the new co-channel interference model and capacity, the impact of cooperative antennas and base stations on cell-edge user performance in the multi-cell multi-antenna cellular network is investigated by numerical methods. Numerical results show that cooperative transmission can improve the capacity performance of multi-cell multi-antenna cooperative cellular networks, especially in a scenario with a high density of interfering base stations. The capacity performance gain is degraded with the increased number of cooperative antennas or base stations.

preprint2014arXiv

iGPSe: A Visual Analytic System for Integrative Genomic Based Cancer Patient Stratification

Background: Cancers are highly heterogeneous with different subtypes. These subtypes often possess different genetic variants, present different pathological phenotypes, and most importantly, show various clinical outcomes such as varied prognosis and response to treatment and likelihood for recurrence and metastasis. Recently, integrative genomics (or panomics) approaches are often adopted with the goal of combining multiple types of omics data to identify integrative biomarkers for stratification of patients into groups with different clinical outcomes. Results: In this paper we present a visual analytic system called Interactive Genomics Patient Stratification explorer (iGPSe) which significantly reduces the computing burden for biomedical researchers in the process of exploring complicated integrative genomics data. Our system integrates unsupervised clustering with graph and parallel sets visualization and allows direct comparison of clinical outcomes via survival analysis. Using a breast cancer dataset obtained from the The Cancer Genome Atlas (TCGA) project, we are able to quickly explore different combinations of gene expression (mRNA) and microRNA features and identify potential combined markers for survival prediction. Conclusions: Visualization plays an important role in the process of stratifying given population patients. Visual tools allowed for the selection of possibly features across various datasets for the given patient population. We essentially made a case for visualization for a very important problem in translational informatics.

preprint2014arXiv

Quantum state engineering of light with continuous-wave optical parametric oscillators

The ability to engineer the quantum state of traveling optical fields is a central requirement for quantum information science and technology, including quantum communication, computing and metrology. In this video article, we describe the reliable generation of non-Gaussian states, including single-photon states and coherent state superpositions, using a conditional preparation method operated on the non-classical light emitted by optical parametric oscillators. Type-I and type-II phase-matched OPOs operated below threshold, i.e. single-mode or two-mode squeezed vacuum sources, are considered and common procedures, such as the required frequency filtering or the high-efficiency quantum state characterization by homodyning, are detailed. The reported method enables a high fidelity with the targeted state and the generation of the state in a well-controlled spatiotemporal mode, a crucial feature for their use in subsequent protocols.

preprint2014arXiv

Remote creation of hybrid entanglement between particle-like and wave-like optical qubits

The wave-particle duality of light has led to two different encodings for optical quantum information processing. Several approaches have emerged based either on particle-like discrete-variable states, e.g. finite-dimensional quantum systems, or on wave-like continuous-variable states, e.g. infinite-dimensional systems. Here, we demonstrate the first measurement-induced generation of entanglement between optical qubits of these different types, located at distant places and connected by a lossy channel. Such hybrid entanglement, which is a key resource for a variety of recently proposed schemes, including quantum cryptography and computing, enables to convert information from one Hilbert space to the other via teleportation and therefore connect remote quantum processors based upon different encodings. Beyond its fundamental significance for the exploration of entanglement and its possible instantiations, our optical circuit opens the promises for heterogeneous network implementations, where discrete and continuous-variable operations and techniques can be efficiently combined.

preprint2014arXiv

Subwavelength focusing of azimuthally polarized beams with vortical phase in dielectrics by using an ultra-thin lens

We demonstrate that a planar and ultrathin binary lens can focus an azimuthally polarized beam with vortical phase (APV) to a subwavelength spot of transverse polarization. The results elaborates that, in the multi-layer medium, this focused spot, which is beyond the Rayleigh diffraction limitation, can be well maintained for several wavelengths after travelling through the dielectric interfaces, which is not attainable by using other vector beams (i.e., radially, linearly and circularly polarized beams) as the illuminating light. This compact optical system can be valuable in data writing and defect identification of wafer or silicon chips, owing to the enhanced polarized focusing through interfaces. It also enables to be highly integrated with traditional microscopy for the far-field super-resolution imaging, surface scanning and detection, and subwavelength focusing, owing to the enhanced focusing performance (reduced width and extended length) as well as the planarized configuration of the ultrathin lens.

preprint2012arXiv

Convexity of the smallest principal curvature of the convex level sets of some quasi-linear elliptic equations with respect to the height

For the $p$-harmonic function with strictly convex level sets, we find a test function which comes from the combination of the norm of gradient of the $p$-harmonic function and the smallest principal curvature of the level sets of $p$-harmonic function. We prove that this curvature function is convex with respect to the height of the $p$-harmonic function. This test function is an affine function of the height when the $p$-harmonic function is the $p$-Green function on the ball. For the minimal graph, we obtain a similar results.

preprint2012arXiv

Enhanced Josephson tunneling between high temperature superconductors through a normal pseudogap underdoped cuprate with a finite energy cooperon

The Josephson coupling between optimally cuprate superconductors separated by a spacer with a finite energy cooperon excitation which contributes to the Josephson coupling strength, is examined. For an underdoped cuprate barrier in its normal state, the YRZ model gives a good description of the temperature dependent enhanced Josephson coupling. A detailed examination of origin of the enhancement shows a significant contribution from the cooperon excitation which is comparable to that from nodal quasiparticles.

preprint2012arXiv

Network Backbone Discovery Using Edge Clustering

In this paper, we investigate the problem of network backbone discovery. In complex systems, a "backbone" takes a central role in carrying out the system functionality and carries the bulk of system traffic. It also both simplifies and highlight underlying networking structure. Here, we propose an integrated graph theoretical and information theoretical network backbone model. We develop an efficient mining algorithm based on Kullback-Leibler divergence optimization procedure and maximal weight connected subgraph discovery procedure. A detailed experimental evaluation demonstrates both the effectiveness and efficiency of our approach. The case studies in the real world domain further illustrates the usefulness of the discovered network backbones.

preprint2010arXiv

Andreev and Single Particle Tunneling Spectroscopies in Underdoped Cuprates

We study tunneling spectroscopy between a normal metal and underdoped cuprate superconductor modeled by a phenomenological theory in which the pseudogap is a precursor to the undoped Mott insulator. In the transparent tunneling limit, the spectra show a small energy gap associated with Andreev reflection. In the Giaever limit, the spectra show a large energy gap associated with single particle tunneling. Our theory semi-quantitatively describes the two gap behavior observed in tunneling experiments.

preprint2009arXiv

Ginzburg-Landau theory of a trapped Fermi gas with a BEC-BCS crossover

The Ginzburg-Landau theory of a trapped Fermi gas with a BEC-BCS crossover is derived by the path-integral method. In addition to the standard Ginzburg-Landau equation, a second equation describing the total atom density is obtained. These two coupled equations are necessary to describe both homogeneous and inhomogeneous systems. The Ginzburg-Landau theory is valid near the transition temperature $T_c$ on both sides of the crossover. In the weakly-interacting BEC region, it is also accurate at zero temperature where the Ginzburg-Landau equation can be mapped onto the Gross-Pitaevskii (GP) equation. The applicability of GP equation at finite temperature is discussed. On the BEC side, the fluctuation of the order parameter is studied and the renormalization to the molecule coupling constant is obtained.

preprint2009arXiv

The induced interaction in a Fermi gas with a BEC-BCS crossover

We study the effect of the induced interaction on the superfluid transition temperature of a Fermi gas with a BEC-BCS crossover. The Gorkov-Melik-Barkhudarov theory about the induced interaction is extended from the BCS side to the entire crossover, and the pairing fluctuation is treated in the approach by Nozières and Schmitt-Rink. At unitarity, the induced interaction reduces the transition temperature by about twenty percent. In the BCS limit, the transition temperature is reduced by a factor about 2.22, as found by Gorkov and Melik-Barkhudarov. Our result shows that the effect of the induced interaction is important both on the BCS side and in the unitary region.

Kun Huang

What is connected

Connect this record

See the researcher in context

Building this map preview

24 published item(s)

How Mobile World Model Guides GUI Agents?

A Compressed Gradient Tracking Method for Decentralized Optimization with Linear Convergence

Accurate calibration of multi-perspective cameras from a generalization of the hand-eye constraint

Deep 360$^\circ$ Optical Flow Estimation Based on Multi-Projection Fusion

Label Adversarial Learning for Skeleton-level to Pixel-level Adjustable Vessel Segmentation

Quantum key distribution over scattering channel

Generalized Perfect Optical Vortex along Arbitrary Trajectories

Learning to Parse Wireframes in Images of Man-Made Environments

Low-Rank Reorganization via Proportional Hazards Non-negative Matrix Factorization Unveils Survival Associated Gene Clusters

Flat Helical Nanosieves

On Secrecy Capacity of Minimum Storage Regenerating Codes

Security Concerns in Minimum Storage Cooperative Regenerating Codes

Continuously Shaping Orbital Angular Momentum with an Analog Optical Vortex Transmitter

Capacity analysis of a multi-cell multi-antenna cooperative cellular network with co-channel interference

iGPSe: A Visual Analytic System for Integrative Genomic Based Cancer Patient Stratification

Quantum state engineering of light with continuous-wave optical parametric oscillators

Remote creation of hybrid entanglement between particle-like and wave-like optical qubits

Subwavelength focusing of azimuthally polarized beams with vortical phase in dielectrics by using an ultra-thin lens

Convexity of the smallest principal curvature of the convex level sets of some quasi-linear elliptic equations with respect to the height

Enhanced Josephson tunneling between high temperature superconductors through a normal pseudogap underdoped cuprate with a finite energy cooperon

Network Backbone Discovery Using Edge Clustering

Andreev and Single Particle Tunneling Spectroscopies in Underdoped Cuprates

Ginzburg-Landau theory of a trapped Fermi gas with a BEC-BCS crossover

The induced interaction in a Fermi gas with a BEC-BCS crossover