Source author record

Xue Bai

Xue Bai appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

cond-mat.mtrl-sci Computation and Language eess.AS physics.optics Sound Machine Learning Methodology physics.chem-ph Quantitative Methods Social and Information Networks

Catalog footprint

What is connected

11works

10topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

MiLe Loss: a New Entropy-Weighed Loss for Mitigating the Bias of Learning Difficulties in Large Language Models

Generative language models are usually pretrained on large text corpus via predicting the next token (i.e., sub-word/word/phrase) given the previous ones. Recent works have demonstrated the impressive performance of large generative language models on downstream tasks. However, existing generative language models generally neglect an inherent challenge in text corpus during training, i.e., the imbalance between frequent tokens and infrequent ones. It can lead a language model to be dominated by common and easy-to-learn tokens, thereby overlooking the infrequent and difficult-to-learn ones. To alleviate that, we propose a MiLe Loss function for mitigating the bias of learning difficulties with tokens. During training, it can dynamically assess the learning difficulty of a to-be-learned token, according to the information entropy of the corresponding predicted probability distribution over the vocabulary. Then it scales the training loss adaptively, trying to lead the model to focus more on the difficult-to-learn tokens. On the Pile dataset, we train generative language models at different scales of 468M, 1.2B, and 6.7B parameters. Experiments reveal that models incorporating the proposed MiLe Loss can gain consistent performance improvement on downstream benchmarks.

preprint2023arXiv

Danlu Tongdu tablets treat lumbar spinal stenosis through reducing reactive oxygen species and apoptosis by regulating CDK2/CDK4/CDKN1A expression

Lumbar spinal stenosis (LSS) is caused by the compression of the nerve root or cauda equina nerve by stenosis of the lumbar spinal canal or intervertebral foramen, and is manifested as chronic low back and leg pain. Danlu Tongdu (DLTD) tablets can relieve chronic pain caused by LSS, but the molecular mechanism remains largely unknown. In this study, the potential molecular mechanism of DLTD tablets in the treatment of LSS was firstly predicted by network pharmacology method. Results showed that DLTD functions in regulating anti-oxidative, apoptosis, and inflammation signaling pathways. Furthermore, the flow cytometry results showed that DLTD tablets efficiently reduced ROS content and inhibited rat neural stem cell apoptosis induced by hydrogen peroxide. DLTD also inhibited the mitochondrial membrane potential damage induced by hydrogen peroxide. Elisa analysis showed that DLTD induced cell cycle related protein, CDK2 and CDK4 and reduced CDKN1A protein expression level. Taken together, our study provided new insights of DLTD in treating LSS through reducing ROS content, decreasing apoptosis by inhibiting CDKN1A and promoting CDK2 and CDK4 expression levels.

preprint2020arXiv

An Acoustic Segment Model Based Segment Unit Selection Approach to Acoustic Scene Classification with Partial Utterances

In this paper, we propose a sub-utterance unit selection framework to remove acoustic segments in audio recordings that carry little information for acoustic scene classification (ASC). Our approach is built upon a universal set of acoustic segment units covering the overall acoustic scene space. First, those units are modeled with acoustic segment models (ASMs) used to tokenize acoustic scene utterances into sequences of acoustic segment units. Next, paralleling the idea of stop words in information retrieval, stop ASMs are automatically detected. Finally, acoustic segments associated with the stop ASMs are blocked, because of their low indexing power in retrieval of most acoustic scenes. In contrast to building scene models with whole utterances, the ASM-removed sub-utterances, i.e., acoustic utterances without stop acoustic segments, are then used as inputs to the AlexNet-L back-end for final classification. On the DCASE 2018 dataset, scene classification accuracy increases from 68%, with whole utterances, to 72.1%, with segment selection. This represents a competitive accuracy without any data augmentation, and/or ensemble strategy. Moreover, our approach compares favourably to AlexNet-L with attention.

preprint2020arXiv

Device-Robust Acoustic Scene Classification Based on Two-Stage Categorization and Data Augmentation

In this technical report, we present a joint effort of four groups, namely GT, USTC, Tencent, and UKE, to tackle Task 1 - Acoustic Scene Classification (ASC) in the DCASE 2020 Challenge. Task 1 comprises two different sub-tasks: (i) Task 1a focuses on ASC of audio signals recorded with multiple (real and simulated) devices into ten different fine-grained classes, and (ii) Task 1b concerns with classification of data into three higher-level classes using low-complexity solutions. For Task 1a, we propose a novel two-stage ASC system leveraging upon ad-hoc score combination of two convolutional neural networks (CNNs), classifying the acoustic input according to three classes, and then ten classes, respectively. Four different CNN-based architectures are explored to implement the two-stage classifiers, and several data augmentation techniques are also investigated. For Task 1b, we leverage upon a quantization method to reduce the complexity of two of our top-accuracy three-classes CNN-based architectures. On Task 1a development data set, an ASC accuracy of 76.9\% is attained using our best single classifier and data augmentation. An accuracy of 81.9\% is then attained by a final model fusion of our two-stage ASC classifiers. On Task 1b development data set, we achieve an accuracy of 96.7\% with a model size smaller than 500KB. Code is available: https://github.com/MihawkHu/DCASE2020_task1.

preprint2020arXiv

Wettability and surface energy of parylene F

Parylenes are barrier materials employed as protective layers. However, many parylenes are unsuitable for applications under harsh conditions. A new material, parylene F, demonstrates considerable potential for a wide range of applications due to its high temperature and UV resistance. For the first time, the wettability and surface energy of parylene F were investigated to determine the feasibility of parylene F as an alternative to the commonly employed parylene C. The results show that parylene F has a hydrophobic surface with a water contact angle of 109.63 degrees. We found that 3.5 ul probe liquid is an optimal value for the contact angle measurement of parylene F. Moreover, we found that the Owens-Wendt-Kaelble and the Lifshitz-van der Waals/acid-base approaches are unsuitable for determining the surface energy of parylene F, whereas an approach based on the limitless liquid-solid interface wetting system is compatible. Furthermore, the results show that parylene F has a surface energy of 39.05 mJ/m2. Considering the improved resistance, relatively low cost, and the desirable properties, parylene F can replace parylene C for applications under harsh conditions.

preprint2016arXiv

Utility-based Link Recommendation for Online Social Networks

Link recommendation, which suggests links to connect currently unlinked users, is a key functionality offered by major online social networks. Salient examples of link recommendation include "People You May Know" on Facebook and LinkedIn as well as "You May Know" on Google+. The main stakeholders of an online social network include users (e.g., Facebook users) who use the network to socialize with other users and an operator (e.g., Facebook Inc.) that establishes and operates the network for its own benefit (e.g., revenue). Existing link recommendation methods recommend links that are likely to be established by users but overlook the benefit a recommended link could bring to an operator. To address this gap, we define the utility of recommending a link and formulate a new research problem - the utility-based link recommendation problem. We then propose a novel utility-based link recommendation method that recommends links based on the value, cost, and linkage likelihood of a link, in contrast to existing link recommendation methods which focus solely on linkage likelihood. Specifically, our method models the dependency relationship between value, cost, linkage likelihood and utility-based link recommendation decision using a Bayesian network, predicts the probability of recommending a link with the Bayesian network, and recommends links with the highest probabilities. Using data obtained from a major U.S. online social network, we demonstrate significant performance improvement achieved by our method compared to prevalent link recommendation methods from representative prior research.

preprint2015arXiv

Invisible sensor: Simultaneous sensing and camouflaging in multiphysical fields

To manipulate various types of physical signals in one single device has long captivated the attention of scientists and engineers. This however is very challenging, if not impossible, even for emerging metamaterials. Up to date, many artificial materials have been proposed, theoretically and (or) experimentally, for manipulating various waves/signals on a one-function-one-device basis. In this work, for the very first time, we employ undecorated natural materials to experimentally demonstrate a simultaneous camouflage for thermal current and electric dc current on the same device. It demonstrates how ingenuity can overcome the limitations of natural material systems without the need for complex decoration to impart inhomogeneous and (or) anisotropic properties, which was previously considered impossible to accomplish except by using metamaterials.

preprint2014arXiv

Invisibility's Flicker: Detecting Thermal Cloaks via Transient Effects

Recent research on the development of a thermal cloak has concentrated on engineering an inhomogeneous thermal conductivity and homogeneous volumetric heat capacity. While the perfect cloak of inhomogeneous $κ$ and $ρc_p$ is known to be exact (no signals scattering or penetrating to the cloak's interior), no such analysis has been considered for this case. Using analytic, computational, and experimental techniques, we demonstrate that these approximate cloaks are detectable. Although they work as perfect cloaks in the steady-state, their transient (time-dependent) response is imperfect and a detectable amount of heat is scattered. This is sufficient to determine the presence of a cloak and any heat source it contains, but the material composition hidden within the cloak is not detectable in practice.

preprint2014arXiv

Manipulating Steady Heat Conduction by Sensu-shaped Thermal Metamaterials

The ability to design the control of heat flow has innumerable benefits in the design of electronic systems such as thermoelectric energy harvesters, solid-state lighting, and thermal imagers, where the thermal design plays a key role in performance and device reliability. However, to realize one advanced control function of thermal flux, one needs to design one sophisticated, multilayered and inhomogeneous thermal structure with different composition/shape at different regions of one device. In this work, we employ one identical sensu-unit with facile natural composition to experimentally realize a new class of thermal metamaterials for controlling thermal conduction (e.g., thermal concentrator, focusing/resolving, uniform heating), only resorting to positioning and locating the same unit element of sensu-shape structure. The thermal metamaterial unit and the proper arrangement of multiple identical units are capable of transferring, redistributing and managing thermal energy in a versatile fashion. It is also shown that our sensu-shape unit elements can be used in manipulating dc currents without any change in the layout for the thermal counterpart. The proposed scheme can also be applied to control dc electric currents and dc magnetic fields that governed by Laplace equation. These could markedly enhance the capabilities in thermal sensing, thermal imaging, thermal-energy storage, thermal packaging, thermal therapy, and more domains beyond.

preprint2014arXiv

Robust Linear Regression: A Review and Comparison

Ordinary least-squares (OLS) estimators for a linear model are very sensitive to unusual values in the design space or outliers among y values. Even one single atypical value may have a large effect on the parameter estimates. This article aims to review and describe some available and popular robust techniques, including some recent developed ones, and compare them in terms of breakdown point and efficiency. In addition, we also use a simulation study and a real data application to compare the performance of existing robust methods under different scenarios.

preprint2013arXiv

Bilayer Isotropic Thermal Cloak

Invisibility has attracted intensive research in various communities, e.g., optics, electromagnetics, acoustics, thermodynamics, etc. However, the most of them have only been experimentally achieved by virtue of simplified approaches due to their inhomogeneous and extreme parameters imposed by transformation-optic method, and usually require challenging realization with metamaterials. In this paper, we demonstrate an advanced bilayer thermal cloak with naturally available materials first time. This scheme, directly from thermal conduction equation, has been validated as an exact cloak rather than a reduced one, and we experimentally confirmed its perfect performance (heat-front maintenance and heat protection) in an actual setup. The proposed scheme may open a new avenue to control the diffusive heat flow in ways inconceivable with phonons.

Xue Bai

What is connected

Connect this record

See the researcher in context

Building this map preview

11 published item(s)

MiLe Loss: a New Entropy-Weighed Loss for Mitigating the Bias of Learning Difficulties in Large Language Models

Danlu Tongdu tablets treat lumbar spinal stenosis through reducing reactive oxygen species and apoptosis by regulating CDK2/CDK4/CDKN1A expression

An Acoustic Segment Model Based Segment Unit Selection Approach to Acoustic Scene Classification with Partial Utterances

Device-Robust Acoustic Scene Classification Based on Two-Stage Categorization and Data Augmentation

Wettability and surface energy of parylene F

Utility-based Link Recommendation for Online Social Networks

Invisible sensor: Simultaneous sensing and camouflaging in multiphysical fields

Invisibility's Flicker: Detecting Thermal Cloaks via Transient Effects

Manipulating Steady Heat Conduction by Sensu-shaped Thermal Metamaterials

Robust Linear Regression: A Review and Comparison

Bilayer Isotropic Thermal Cloak