Source author record

Ying Huang

Ying Huang appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Researcher (unclaimed source record)

Computer Vision; cond-mat.mtrl-sci; Machine Learning; Computation and Language; Applications; Computational Engineering, Finance, and Science; Computer Science and Game Theory; cond-mat.quant-gas; cs.CY; eess.IV; Information Retrieval; Multiagent Systems; physics.ao-ph; physics.atom-ph; Populations and Evolution; Social and Information Networks

Catalog footprint

What is connected

15 works

16 topics

4 close collaborators

preprint, 2022, arXiv

Online Knowledge Distillation for Efficient Pose Estimation

Existing state-of-the-art human pose estimation methods require heavy computational resources for accurate predictions. One promising technique to obtain an accurate yet lightweight pose estimator is knowledge distillation, which distills the pose knowledge from a powerful teacher model to a less-parameterized student model. However, existing pose distillation works rely on a heavy pre-trained estimator to perform knowledge transfer and require a complex two-stage learning procedure. In this work, we investigate a novel Online Knowledge Distillation framework, termed OKDHP, which distills Human Pose structure knowledge in a one-stage manner to guarantee distillation efficiency. Specifically, OKDHP trains a single multi-branch network and acquires the predicted heatmaps from each branch, which are then assembled by a Feature Aggregation Unit (FAU) into the target heatmaps used to teach each branch in return. Instead of simply averaging the heatmaps, the FAU, which consists of multiple parallel transformations with different receptive fields, leverages multi-scale information and thus obtains higher-quality target heatmaps. The pixel-wise Kullback-Leibler (KL) divergence is used to minimize the discrepancy between the target heatmaps and the predicted ones, which enables the student network to learn the implicit keypoint relationship. In addition, an unbalanced OKDHP scheme is introduced to customize the student networks with different compression rates. The effectiveness of our approach is demonstrated by extensive experiments on two common benchmark datasets, MPII and COCO.
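The pixel-wise KL term described in this abstract can be illustrated with a small sketch. It is not the OKDHP implementation: the function names are hypothetical, each heatmap is normalised with a softmax over its pixels (an assumption), and a plain average of the branch heatmaps stands in for the FAU, which the abstract says does something more elaborate than averaging.

```python
import math

def softmax(values):
    """Turn raw heatmap scores into a probability distribution over pixels."""
    peak = max(values)
    exps = [math.exp(v - peak) for v in values]
    total = sum(exps)
    return [e / total for e in exps]

def heatmap_kl(target, predicted):
    """KL(target || predicted), summed over the pixels of one keypoint heatmap."""
    p = softmax([v for row in target for v in row])
    q = softmax([v for row in predicted for v in row])
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0.0)

def mean_target(branch_heatmaps):
    """Stand-in for the FAU (assumption): element-wise mean of the branch heatmaps."""
    n = len(branch_heatmaps)
    rows = len(branch_heatmaps[0])
    cols = len(branch_heatmaps[0][0])
    return [[sum(h[r][c] for h in branch_heatmaps) / n for c in range(cols)]
            for r in range(rows)]

def distillation_loss(branch_heatmaps):
    """Average KL between the aggregated target and each branch prediction."""
    target = mean_target(branch_heatmaps)
    return sum(heatmap_kl(target, h) for h in branch_heatmaps) / len(branch_heatmaps)

# Two hypothetical 2x2 branch predictions for a single keypoint.
branch_a = [[0.0, 1.0], [2.0, 0.5]]
branch_b = [[1.0, 0.0], [0.5, 2.0]]
loss = distillation_loss([branch_a, branch_b])
```

The loss is zero only when every branch already agrees with the aggregated target, so minimising it pulls the branches toward the ensemble prediction.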

preprint, 2022, arXiv

StyleBERT: Chinese pretraining by font style information

Following the success of English pre-trained language models on downstream tasks, a pre-trained Chinese language model is also needed to improve performance on Chinese NLP tasks. Unlike English, Chinese has distinctive character-level features such as glyph information. In this article, we propose StyleBERT, a Chinese pre-trained language model that incorporates word, pinyin, five-stroke and chaizi embedding information to enhance the model's language understanding. Experiments show that the model performs well on a wide range of Chinese NLP tasks.

preprint, 2020, arXiv

Deep Frequent Spatial Temporal Learning for Face Anti-Spoofing

Face anti-spoofing is crucial for the security of face recognition systems, as it protects them from presentation attacks. Previous works have shown the effectiveness of using depth and temporal supervision for this task. However, depth supervision is often considered only in a single frame, and temporal supervision is explored by utilizing certain signals that are not robust to scene changes. In this work, motivated by two-stream ConvNets, we propose a novel two-stream FreqSaptialTemporalNet for face anti-spoofing which simultaneously takes advantage of frequency, spatial and temporal information. Whereas existing methods mine spoofing cues in multi-frame RGB images, we use multi-frame spectrum images as one input stream for the discriminative deep neural network, encouraging the primary differences between live and fake video to be discovered automatically. Extensive experiments show promising improvements using the proposed architecture. We also propose a concise method to obtain a large amount of spoofing training data by utilizing a frequency augmentation pipeline, which provides detailed visualization of the differences between live and fake images and addresses the data insufficiency issue when training large networks.

preprint, 2020, arXiv

Estimates of daily ground-level NO2 concentrations in China based on big data and machine learning approaches

Nitrogen dioxide (NO2) is one of the most important atmospheric pollutants. However, current ground-level NO2 concentration data lack either high-resolution coverage or full nationwide coverage, due to the poor quality of source data and the limited computing power of the models. To our knowledge, this study is the first to estimate the ground-level NO2 concentration in China with national coverage as well as relatively high spatiotemporal resolution (0.25 degree; daily intervals) over a recent six-year period (2013-2018). We developed a Random Forest model integrated with K-means (RF-K) for the estimates, using multi-source parameters. Besides meteorological parameters and satellite retrieval parameters, we also, for the first time, introduce socio-economic parameters to assess the impact of human activities. The results show that: (1) the RF-K model we developed shows better prediction performance than other models, with cross-validation R2 = 0.64 (MAPE = 34.78%). (2) The annual average concentration of NO2 in China showed a weak increasing trend, while in economic zones such as the Beijing-Tianjin-Hebei region, Yangtze River Delta, and Pearl River Delta, the NO2 concentration decreased or remained unchanged, especially in spring. Our dataset verifies that pollutant control targets have been achieved in these areas. By mapping daily nationwide ground-level NO2 concentrations, this study provides timely, high-quality data for air quality management in China. We provide a universal model framework to quickly generate timely national maps of atmospheric pollutant concentrations with high spatiotemporal resolution, based on improved machine learning methods.

preprint, 2020, arXiv

Joint Deep Learning of Facial Expression Synthesis and Recognition

Recently, deep learning based facial expression recognition (FER) methods have attracted considerable attention and they usually require large-scale labelled training data. Nonetheless, the publicly available facial expression databases typically contain a small amount of labelled data. In this paper, to overcome the above issue, we propose a novel joint deep learning of facial expression synthesis and recognition method for effective FER. More specifically, the proposed method involves a two-stage learning procedure. Firstly, a facial expression synthesis generative adversarial network (FESGAN) is pre-trained to generate facial images with different facial expressions. To increase the diversity of the training images, FESGAN is elaborately designed to generate images with new identities from a prior distribution. Secondly, an expression recognition network is jointly learned with the pre-trained FESGAN in a unified framework. In particular, the classification loss computed from the recognition network is used to simultaneously optimize the performance of both the recognition network and the generator of FESGAN. Moreover, in order to alleviate the problem of data bias between the real images and the synthetic images, we propose an intra-class loss with a novel real data-guided back-propagation (RDBP) algorithm to reduce the intra-class variations of images from the same class, which can significantly improve the final performance. Extensive experimental results on public facial expression databases demonstrate the superiority of the proposed method compared with several state-of-the-art FER methods.

preprint, 2020, arXiv

More Information Supervised Probabilistic Deep Face Embedding Learning

Research using margin-based comparison losses has demonstrated the effectiveness of penalizing the distance between face features and their corresponding class centers. Despite their popularity and excellent performance, these losses do not explicitly encourage generic embedding learning for the open-set recognition problem. In this paper, we analyse margin-based softmax loss from a probabilistic view. From this perspective, we propose two general principles for designing new margin loss functions: 1) monotonic decreasing and 2) margin probability penalty. Unlike methods optimized with a single comparison metric, we provide a new perspective that treats open-set face recognition as a problem of information transmission, in which the generalization capability of the face embedding is gained with more clean information. An auto-encoder architecture called Linear-Auto-TS-Encoder (LATSE) is proposed to corroborate this finding. Extensive experiments on several benchmarks demonstrate that LATSE helps face embeddings gain more generalization capability, and it boosts single-model performance with an open training dataset to more than $99\%$ on the MegaFace test.

preprint, 2016, arXiv

A Comparative Study of Object Trackers for Infrared Flying Bird Tracking

Bird strikes present a huge risk for aircraft, especially since traditional airport bird surveillance is mainly dependent on inefficient human observation. Computer vision based technology has been proposed to automatically detect birds, determine bird flying trajectories, and predict aircraft takeoff delays. However, the characteristics of bird flight in imagery and the performance of existing methods on the flying bird tracking task are not well known. Therefore, we perform infrared flying bird tracking experiments using 12 state-of-the-art algorithms on a real BIRDSITE-IR dataset to obtain useful clues and recommend feature analysis. We also develop a Struck-scale method to demonstrate the effectiveness of multiple-scale sampling adaptation in handling flying birds, whose shape and scale vary. The general analysis can be used to develop specialized bird tracking methods for airport safety, wildness and urban bird population studies.

preprint, 2016, arXiv

Multi-Agent System for Groundwater Depletion Using Game Theory

Groundwater is one of the most vital of all common pool resources throughout the world. More than half of groundwater is used to grow crops. This research models groundwater depletion patterns within a multi-agent system framework. Irrigators are modeled as agents in the multi-agent system. The irrigation strategies adopted by the agents are investigated using game theory. A set of five irrigators, growing three crops (corn, sorghum and wheat), has been considered in this study. To allow groundwater flow, these agents are assumed to be located in adjoining farm lands. Irrigators are modeled as selfish agents that strategize their irrigation patterns in order to maximize their own utilities, i.e. the difference between the total revenue obtained from crop sales and the costs incurred, including groundwater extraction costs. Due to groundwater flow, the irrigators have no incentive to conserve groundwater. This leads to unsustainable depletion of the resource under Nash equilibrium, when no irrigator can increase its utility by unilaterally changing its strategy. All parameters in this research are representative of Kansas. Recorded environmental and economic data of the region, along with the DSSAT software, have been used to obtain these futuristic projections. One of the emergent phenomena of the simulations is the adoption of crop rotation patterns by the irrigators to conserve groundwater. The irrigators grow corn, which is a more profitable yet water-intensive crop, in one year, and in the next conserve water by growing sorghum instead. Another emergent outcome of this research is the viability of LEMAs. When the irrigators are subject to LEMA-level limits on groundwater use, there is a slight increase in the aggregate utility of the LEMA.
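The Nash equilibrium condition quoted in this abstract (no irrigator can gain by unilaterally changing strategy) can be checked mechanically on a toy game. Everything in the sketch below is hypothetical: two irrigators rather than five, two pumping levels rather than crop choices, and made-up revenue and cost numbers chosen so that a shared aquifer raises everyone's pumping cost. None of it comes from the paper or from DSSAT.

```python
from itertools import product

STRATEGIES = ("low", "high")           # hypothetical pumping levels
REVENUE = {"low": 6.0, "high": 9.0}    # hypothetical crop revenue
WATER = {"low": 1.0, "high": 3.0}      # hypothetical water use
COST_FACTOR = 0.2                      # hypothetical: unit cost grows with total pumping

def utility(player, profile):
    """Revenue minus extraction cost; cost per unit depends on everyone's pumping."""
    total_water = sum(WATER[s] for s in profile)
    cost = WATER[profile[player]] * COST_FACTOR * total_water
    return REVENUE[profile[player]] - cost

def is_nash(profile):
    """True if no single player can raise its utility by deviating alone."""
    for player in range(len(profile)):
        current = utility(player, profile)
        for alternative in STRATEGIES:
            deviated = profile[:player] + (alternative,) + profile[player + 1:]
            if utility(player, deviated) > current:
                return False
    return True

# Enumerate every strategy profile of the two-player game.
equilibria = [p for p in product(STRATEGIES, repeat=2) if is_nash(p)]
```

With these numbers the only equilibrium is both irrigators pumping at the high level, even though both would earn more if both pumped at the low level, which mirrors the depletion outcome the abstract describes.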

preprint, 2016, arXiv

Text Network Exploration via Heterogeneous Web of Topics

A text network is a data type in which each vertex is associated with a text document and the relationships between documents are represented by edges. The proliferation of text networks such as hyperlinked webpages and academic citation networks has led to an increasing demand for quickly developing a general sense of a new text network, namely text network exploration. In this paper, we address the problem of text network exploration by constructing a heterogeneous web of topics, which allows people to investigate a text network by linking the word level with the document level. To achieve this, a probabilistic generative model for text and links is proposed, in which three different relationships in the heterogeneous topic web are quantified. We also develop a prototype demo system named TopicAtlas to exhibit such a heterogeneous topic web, and demonstrate how this system can facilitate the task of text network exploration. Extensive qualitative analyses are included to verify the effectiveness of this heterogeneous topic web. In addition, we validate our model on real-life text networks, showing that it maintains good performance on objective evaluation metrics.

preprint, 2015, arXiv

Genetic structure of Sino-Tibetan populations revealed by forensic STR loci

The origin and diversification of Sino-Tibetan populations have long been hotly debated. However, the limited genetic information on Tibetan populations keeps this topic far from clear. In the present study, we genotyped 15 forensic autosomal STRs from 803 unrelated Tibetan individuals from Gansu Province (635 from Gannan and 168 from Tianzhu). We combined these data with published datasets to infer detailed population affinities and admixture of Sino-Tibetan populations. Our results revealed that the genetic structure of Sino-Tibetan populations was strongly correlated with linguistic affiliations. Although the among-population variances are relatively small, the genetic components for Tibetan, Lolo-Burmese, and Han Chinese were quite distinctive, especially for the Deng, Nu, and Derung of Lolo-Burmese. Southern indigenous populations, such as Tai-Kadai and Hmong-Mien populations, might have made substantial genetic contributions to Han Chinese and Altaic populations, but not to Tibetans. Likewise, Han Chinese, but not Tibetans, shared a very similar genetic makeup with Altaic populations, which does not support a North Asian origin of Tibetan populations. The dataset generated here is also valuable for forensic identification.

preprint, 2013, arXiv

Logistic regression analysis with standardized markers

Two different approaches to analysis of data from diagnostic biomarker studies are commonly employed. Logistic regression is used to fit models for probability of disease given marker values, while ROC curves and risk distributions are used to evaluate classification performance. In this paper we present a method that simultaneously accomplishes both tasks. The key step is to standardize markers relative to the nondiseased population before including them in the logistic regression model. Among the advantages of this method are the following: (i) ensuring that results from regression and performance assessments are consistent with each other; (ii) allowing covariate adjustment and covariate effects on ROC curves to be handled in a familiar way, and (iii) providing a mechanism to incorporate important assumptions about structure in the ROC curve into the fitted risk model. We develop the method in detail for the problem of combining biomarker data sets derived from multiple studies, populations or biomarker measurement platforms, when ROC curves are similar across data sources. The methods are applicable to both cohort and case-control sampling designs. The data set motivating this application concerns Prostate Cancer Antigen 3 (PCA3) for diagnosis of prostate cancer in patients with or without previous negative biopsy where the ROC curves for PCA3 are found to be the same in the two populations. The estimated constrained maximum likelihood and empirical likelihood estimators are derived. The estimators are compared in simulation studies and the methods are illustrated with the PCA3 data set.
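The key step named in this abstract, standardizing a marker relative to the nondiseased population, can be sketched as follows. This is an illustration only: it assumes the standardized value is the empirical percentile of the marker within the nondiseased sample, which is one common form of such standardization and may differ from the paper's exact definition. The function name and the numbers are hypothetical.

```python
from bisect import bisect_right

def standardize(marker_values, nondiseased_values):
    """Map each marker value to the fraction of nondiseased values at or below it.

    Assumption: the empirical percentile in the nondiseased sample is used as
    the standardized marker; the result always lies in [0, 1].
    """
    reference = sorted(nondiseased_values)
    n = len(reference)
    return [bisect_right(reference, value) / n for value in marker_values]

# Hypothetical marker measurements.
nondiseased = [1.0, 2.0, 3.0, 4.0]
diseased = [2.5, 3.5, 5.0]

standardized_cases = standardize(diseased, nondiseased)
standardized_controls = standardize(nondiseased, nondiseased)
```

The standardized values, rather than the raw measurements, would then enter the logistic regression as the covariate. Because they are expressed on the scale of the reference population, markers measured in different studies or on different platforms become directly comparable.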

preprint, 2013, arXiv

Novel Bismaleimide Resin/Silsesquioxane and Titania Nanocomposites by the Sol-Gel Process: the Preparation, Morphology, Thermal and Thermo-mechanical Properties

Bismaleimide (BMI) resin/silsesquioxane or titania nanocomposites were synthesized from bismaleimide resin and SiO3/2 or TiO2 via the sol-gel process of N-γ-triethoxylsilylpropyl-maleamic acid (TESPMA) or tetrabutyltitanate (Ti(OnBu)4, TBT), respectively, in the presence of the AP-BMI prepolymers. These nanocomposite materials were characterized by FT-IR, FE-SEM, TGA and DMA. It was found that nano-scale SiO3/2 or TiO2 particles were formed in the AP-BMI resin matrix and that the average original particle size of the dispersed phase in the nanocomposites was less than 100 nm, although larger particle aggregates were also present. Clear improvements in the Tg and heat resistance of the AP-BMI resin were achieved by introducing the nano-sized SiO3/2 inorganic phase, and the modulus at high temperatures was also improved. The incorporation of nano-scale TiO2 particles into the AP-BMI resin improved the Tg of the polymer but lowered the thermal resistance of the material, and improved the modulus of the material at lower temperatures but lowered it at higher temperatures.

preprint, 2013, arXiv

Preparation and characterization of Bismaleimide resin/titania nanocomposites via sol-gel process

Bismaleimide (BMI) resin/titania nanocomposites were synthesized from allylated-phenolic modified bismaleimide resin and TiO2 via the sol-gel process of tetrabutyltitanate (Ti(OnBu)4, TBT). These nanocomposite materials were characterized by FT-IR, XRD, FE-SEM, TGA and DMA. It was found that nano-scale TiO2 particles were formed in the AP-BMI resin matrix and that the average primary particle size of the dispersed phase in the nanocomposites was less than 100 nm, although larger particle aggregates were also present. Clear improvements in the glass transition temperature and heat resistance of the AP-BMI resins were achieved by introducing the nano-sized TiO2 inorganic phase, and the modulus of the material was also improved.

preprint, 2013, arXiv

Rabi Oscillations between Atomic and Molecular Condensates Driven with Coherent One-Color Photoassociation

We demonstrate coherent one-color photoassociation of a Bose-Einstein condensate, which results in Rabi oscillations between atomic and molecular condensates. We attain atom-molecule Rabi frequencies that are comparable to decoherence rates by driving photoassociation of atoms in an $^{88}$Sr condensate to a weakly-bound level of the metastable $^{1}S_{0}+^{3}P_{1}$ molecular potential, which has a long lifetime and large Franck-Condon overlap integral with the ground scattering state. Transient shifts and broadenings of the excitation spectrum are clearly seen at short times, and they create an asymmetric excitation profile that only displays Rabi oscillations for blue detuning from resonance.

preprint, 2010, arXiv

First Principles NMR Signatures of Graphene Oxide

Nuclear magnetic resonance (NMR) has been widely used to study the structure of graphene oxide (GO). However, the detailed relationship between its spectroscopic features and the GO structural configuration has remained elusive. Based on first principles $^{13}$C chemical shift calculations using the gauge-including projector augmented wave (GIPAW) method, we provide a spectrum-structure connection. The chemical shift of carbon is found to be very sensitive to the atomic environment, even for an identical oxidation group. Factors determining the chemical shifts for epoxy and hydroxy groups are discussed. GO structures previously reported in the literature are checked from the NMR point of view. According to our NMR simulations, the energetically favorable hydroxy chain structure is not expected to exist widely in real GO samples. The epoxy pair we proposed previously is also supported by the chemical shift calculations.
