Source author record

Wei Bai

Wei Bai appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Artificial Intelligence physics.soc-ph Distributed, Parallel, and Cluster Computing Machine Learning hep-ex hep-ph Logic in Computer Science math.RT Networking and Internet Architecture Populations and Evolution Systems and Control

Catalog footprint

What is connected

10works

11topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2022arXiv

Jumping to male-dominated occupations: A novel way to reduce gender wage gap for Chinese women

Occupational segregation is widely considered as one major reason leading to the gender discrimination in labor market. Using large-scale Chinese resume data of online job seekers, we uncover an interesting phenomenon that occupations with higher proportion of men have smaller gender wage gap measured by the female-male ratio on wage. We further show that the severity of occupational segregation in China is low both overall and regionally, and the inter-occupational discrimination is much smaller than the intra-occupational discrimination. That is to say, Chinese women do not face large barriers when changing their occupations. Accordingly, we suggest Chineses women a new way to narrow the gender wage gap: to join male-dominated occupations. Meanwhile, it is worth noticing that although the gender wage gap is smaller in male-dominated occupations, it does not mean that the gender discrimination is smaller there.

preprint2022arXiv

The Gender Pay Gap in China: Insights from a Discrimination Perspective

Equal pay is an essential component of gender equality, one of the Sustainable Development Goals of the United Nations. Using resume data of over ten million Chinese online job seekers in 2015, we study the current gender pay gap in China. The results show that on average women only earned 71.57\% of what men earned in China. The gender pay gap exists across all age groups and educational levels. Contrary to the commonly held view that developments in education, economy, and a more open culture would reduce the gender pay gap, the fusion analysis of resume data and socio-economic data presents that they have not helped reach the gender pay equality in China. China seems to be stuck in a place where traditional methods cannot make further progress. Our analysis further shows that 81.47\% of the variance in the gender pay gap can be potentially attributed to discrimination. In particular, compared with the unmarried, both the gender pay gap itself and proportion potentially attributed to discrimination of the married are larger, indicating that married women suffer greater inequality and more discrimination than unmarried ones. Taken together, we suggest that more research attention should be paid to the effect of discrimination in understanding gender pay gap based on the family constraint theory. We also suggest the Chinese government to increase investment in family-supportive policies and grants in addition to female education.

preprint2020arXiv

Domain-specific Communication Optimization for Distributed DNN Training

Communication overhead poses an important obstacle to distributed DNN training and draws increasing attention in recent years. Despite continuous efforts, prior solutions such as gradient compression/reduction, compute/communication overlapping and layer-wise flow scheduling, etc., are still coarse-grained and insufficient for an efficient distributed training especially when the network is under pressure. We present DLCP, a novel solution exploiting the domain-specific properties of deep learning to optimize communication overhead of DNN training in a fine-grained manner. At its heart, DLCP comprises of several key innovations beyond prior work: e.g., it exploits {\em bounded loss tolerance} of SGD-based training to improve tail communication latency which cannot be avoided purely through gradient compression. It then performs fine-grained packet-level prioritization and dropping, as opposed to flow-level scheduling, based on layers and magnitudes of gradients to further speedup model convergence without affecting accuracy. In addition, it leverages inter-packet order-independency to perform per-packet load balancing without causing classical re-ordering issues. DLCP works with both Parameter Server and collective communication routines. We have implemented DLCP with commodity switches, integrated it with various training frameworks including TensorFlow, MXNet and PyTorch, and deployed it in our small-scale testbed with 10 Nvidia V100 GPUs. Our testbed experiments and large-scale simulations show that DLCP delivers up to $84.3\%$ additional training acceleration over the best existing solutions.

preprint2020arXiv

Preliminary prediction of the basic reproduction number of the Wuhan novel coronavirus 2019-nCoV

Objectives.--To estimate the basic reproduction number of the Wuhan novel coronavirus (2019-nCoV). Methods.--Based on the susceptible-exposed-infected-removed (SEIR) compartment model and the assumption that the infectious cases with symptoms occurred before January 25, 2020 are resulted from free propagation without intervention, we estimate the basic reproduction number of 2019-nCoV according to the reported confirmed cases and suspected cases, as well as the theoretical estimated number of infected cases by other research teams, together with some epidemiological determinants learned from the severe acute respiratory syndrome. Results The basic reproduction number falls between 2.8 to 3.3 by using the real-time reports on the number of 2019-nCoV infected cases from People's Daily in China, and falls between 3.2 and 3.9 on the basis of the predicted number of infected cases from colleagues. Conclusions.--The early transmission ability of 2019-nCoV is closed to or slightly higher than SARS. It is a controllable disease with moderate-high transmissibility. Timely and effective control measures are needed to suppress the further transmissions. Notes Added.--Using a newly reported epidemiological determinants for early 2019-nCoV, the estimated basic reproduction number is in the range [2.2,3.0].

preprint2020arXiv

Quantitative Evaluations on Saliency Methods: An Experimental Study

It has been long debated that eXplainable AI (XAI) is an important topic, but it lacks rigorous definition and fair metrics. In this paper, we briefly summarize the status quo of the metrics, along with an exhaustive experimental study based on them, including faithfulness, localization, false-positives, sensitivity check, and stability. With the experimental results, we conclude that among all the methods we compare, no single explanation method dominates others in all metrics. Nonetheless, Gradient-weighted Class Activation Mapping (Grad-CAM) and Randomly Input Sampling for Explanation (RISE) perform fairly well in most of the metrics. Utilizing a set of filtered metrics, we further present a case study to diagnose the classification bases for models. While providing a comprehensive experimental study of metrics, we also examine measuring factors that are missed in current metrics and hope this valuable work could serve as a guide for future research.

preprint2020arXiv

Weakness Analysis of Cyberspace Configuration Based on Reinforcement Learning

In this work, we present a learning-based approach to analysis cyberspace configuration. Unlike prior methods, our approach has the ability to learn from past experience and improve over time. In particular, as we train over a greater number of agents as attackers, our method becomes better at rapidly finding attack paths for previously hidden paths, especially in multiple domain cyberspace. To achieve these results, we pose finding attack paths as a Reinforcement Learning (RL) problem and train an agent to find multiple domain attack paths. To enable our RL policy to find more hidden attack paths, we ground representation introduction an multiple domain action select module in RL. By designing a simulated cyberspace experimental environment to verify our method. Our objective is to find more hidden attack paths, to analysis the weakness of cyberspace configuration. The experimental results show that our method can find more hidden multiple domain attack paths than existing baselines methods.

preprint2015arXiv

RepNet: Cutting Tail Latency in Data Center Networks with Flow Replication

Data center networks need to provide low latency, especially at the tail, as demanded by many interactive applications. To improve tail latency, existing approaches require modifications to switch hardware and/or end-host operating systems, making them difficult to be deployed. We present the design, implementation, and evaluation of RepNet, an application layer transport that can be deployed today. RepNet exploits the fact that only a few paths among many are congested at any moment in the network, and applies simple flow replication to mice flows to opportunistically use the less congested path. RepNet has two designs for flow replication: (1) RepSYN, which only replicates SYN packets and uses the first connection that finishes TCP handshaking for data transmission, and (2) RepFlow which replicates the entire mice flow. We implement RepNet on {\tt node.js}, one of the most commonly used platforms for networked interactive applications. {\tt node}'s single threaded event-loop and non-blocking I/O make flow replication highly efficient. Performance evaluation on a real network testbed and in Mininet reveals that RepNet is able to reduce the tail latency of mice flows, as well as application completion times, by more than 50\%.

preprint2014arXiv

Enabling Automatic Certification of Online Auctions

We consider the problem of building up trust in a network of online auctions by software agents. This requires agents to have a deeper understanding of auction mechanisms and be able to verify desirable properties of a given mechanism. We have shown how these mechanisms can be formalised as semantic web services in OWL-S, a good enough expressive machine-readable formalism enabling software agents, to discover, invoke, and execute a web service. We have also used abstract interpretation to translate the auction's specifications from OWL-S, based on description logic, to COQ, based on typed lambda calculus, in order to enable automatic verification of desirable properties of the auction by the software agents. For this language translation, we have discussed the syntactic transformation as well as the semantics connections between both concrete and abstract domains. This work contributes to the implementation of the vision of agent-mediated e-commerce systems.

preprint2013arXiv

Revisiting $Kπ$ puzzle in the pQCD factorization approach

In this paper, we calculated the branching ratios and direct CP violation of the four $B\to Kπ$ decays with the inclusion of all currently known next-to-leading order (NLO) contributions by employing the perturbative QCD (pQCD) factorization approach. We found that (a) Besides the 10% enhancement from the NLO vertex corrections, the quark-loops and magnetic penguins, the NLO contributions to the form factors can provide an additional $\sim 15%$ enhancement to the branching ratios, and lead to a very good agreement with the data; (b) The NLO pQCD predictions are $\acp^{dir}(B^0\to K^+π^-)=(-6.5\pm 3.1)%$ and $\acp^{dir}(B^+\to K^+ π^0)=(2.2\pm 2.0)%$, become well consistent with the data due to the inclusion of the NLO contributions.

preprint2010arXiv

Derivations of the finite-dimensional special odd Hamiltonian superalgebras

The aim is to determine the derivations of the three series of finite-dimensional Z-graded Lie superalgebras of Cartan-type over a field of characteristic p > 3, called the special odd Hamiltonian superalgebras. To that end we first determine the derivations of negative Z-degree for the restricted and simple special odd Hamiltonian superalgebras by means of weight space decompositions. Then the results are used to determine the derivations of negative Z-degree for the nonrestricted and non-simple special odd Hamiltonian superalgebras. Finally the derivation algebras and the outer derivation algebras of those Lie superalgebras are completely determined.

Wei Bai

What is connected

Connect this record

See the researcher in context

Building this map preview

10 published item(s)

Jumping to male-dominated occupations: A novel way to reduce gender wage gap for Chinese women

The Gender Pay Gap in China: Insights from a Discrimination Perspective

Domain-specific Communication Optimization for Distributed DNN Training

Preliminary prediction of the basic reproduction number of the Wuhan novel coronavirus 2019-nCoV

Quantitative Evaluations on Saliency Methods: An Experimental Study

Weakness Analysis of Cyberspace Configuration Based on Reinforcement Learning

RepNet: Cutting Tail Latency in Data Center Networks with Flow Replication

Enabling Automatic Certification of Online Auctions

Revisiting $Kπ$ puzzle in the pQCD factorization approach

Derivations of the finite-dimensional special odd Hamiltonian superalgebras