Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
23works
0followers
23topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

23 published item(s)

preprint2026arXiv

Aligning LLM Uncertainty with Human Disagreement in Subjectivity Analysis

Large language models for subjectivity analysis are typically trained with aggregated labels, which compress variations in human judgment into a single supervision signal. This paradigm overlooks the intrinsic uncertainty of low-agreement samples and often induces overconfident predictions, undermining reliability and generalization in complex subjective settings. In this work, we advocate uncertainty-aware subjectivity analysis, where models are expected to make predictions while expressing uncertainty that reflects human disagreement. To operationalize this perspective, we propose a two-phase Disagreement Perception and Uncertainty Alignment (DPUA) framework. Specifically, DPUA jointly models label prediction, rationale generation, and uncertainty expression under an uncertainty-aware setting. In the disagreement perception phase, adaptive decoupled learning enhances the model's sensitivity to disagreement-related cues while preserving task performance. In the uncertainty alignment phase, GRPO-based reward optimization further improves uncertainty-aware reasoning and aligns the model's confidence expression with the human disagreement distribution. Experiments on three subjectivity analysis tasks show that DPUA preserves task performance while better aligning model uncertainty with human disagreement, mitigating overconfidence on boundary samples, and improving out-of-distribution generalization.

preprint2026arXiv

The Straight and Narrow: Do LLMs Possess an Internal Moral Path?

Enhancing the moral alignment of Large Language Models (LLMs) is a critical challenge in AI safety. Current alignment techniques often act as superficial guardrails, leaving the intrinsic moral representations of LLMs largely untouched. In this paper, we bridge this gap by leveraging Moral Foundations Theory (MFT) to map and manipulate the fine-grained moral landscape of LLMs. Through cross-lingual linear probing, we validate the shared nature of moral representations in middle layers and uncover a shared yet different moral subspace between English and Chinese. Building upon this, we extract steerable Moral Vectors and successfully validate their efficacy at both internal and behavioral levels. Leveraging the high generalizability of morality, we propose Adaptive Moral Fusion (AMF), a dynamic inference-time intervention that synergizes probe detection with vector injection to tackle the safety-helpfulness trade-off. Empirical results confirm that our approach acts as a targeted intrinsic defense, effectively reducing incorrect refusals on benign queries while minimizing jailbreak success rates compared to standard baselines.

preprint2026arXiv

VisualQuest: A Benchmark for Abstract Visual Reasoning in MLLMs

We introduce VisualQuest, a novel dataset designed to rigorously evaluate multimodal large language models (MLLMs) on abstract visual reasoning tasks that require the integration of symbolic, cultural, and linguistic knowledge. Unlike existing benchmarks that focus on direct image captioning or classification of realistic images, VisualQuest comprises 3,551 non-photographic, stylized images spanning four categories: Public Figures, Popular Culture, Linguistic Expressions, and Literary Works. Each image is paired with targeted questions to probe complex reasoning. We benchmark ten state-of-the-art MLLMs and find that only Gemini-2.5-flash and GPT-4o achieve strong overall performance, while 3.7 percent of the images remain unrecognized by any model, underscoring persistent challenges in multimodal understanding. Fine-grained analysis shows that Gemini excels at recognizing stylized public figures, whereas GPT-4o leads in linguistic reasoning tasks such as visual puns and emoji combinations. VisualQuest provides a comprehensive and challenging resource for advancing research in abstract visual reasoning and highlights key areas for future model improvement. The dataset is available at https://github.com/xkt88/VISUALQUEST.

preprint2023arXiv

Dynamic online prediction model and its application to automobile claim frequency data

Prediction modelling of claim frequency is an important task for pricing and risk management in non-life insurance and needed to be updated frequently with the changes in the insured population, regulatory legislation and technology. Existing methods are either done in an ad hoc fashion, such as parametric model calibration, or less so for the purpose of prediction. In this paper, we develop a Dynamic Poisson state space (DPSS) model which can continuously update the parameters whenever new claim information becomes available. DPSS model allows for both time-varying and time-invariant coefficients. To account for smoothness trends of time-varying coefficients over time, smoothing splines are used to model time-varying coefficients. The smoothing parameters are objectively chosen by maximum likelihood. The model is updated using batch data accumulated at pre-specified time intervals, which allows for a better approximation of the underlying Poisson density function. The proposed method can be also extended to the distributional assumption of zero-inflated Poisson and negative binomial. In the simulation, we show that the new model has significantly higher prediction accuracy compared to existing methods. We apply this methodology to a real-world automobile insurance claim data set in China over a period of six years and demonstrate its superiority by comparing it with the results of competing models from the literature.

preprint2022arXiv

Price DOES Matter! Modeling Price and Interest Preferences in Session-based Recommendation

Session-based recommendation aims to predict items that an anonymous user would like to purchase based on her short behavior sequence. The current approaches towards session-based recommendation only focus on modeling users' interest preferences, while they all ignore a key attribute of an item, i.e., the price. Many marketing studies have shown that the price factor significantly influences users' behaviors and the purchase decisions of users are determined by both price and interest preferences simultaneously. However, it is nontrivial to incorporate price preferences for session-based recommendation. Firstly, it is hard to handle heterogeneous information from various features of items to capture users' price preferences. Secondly, it is difficult to model the complex relations between price and interest preferences in determining user choices. To address the above challenges, we propose a novel method Co-guided Heterogeneous Hypergraph Network (CoHHN) for session-based recommendation. Towards the first challenge, we devise a heterogeneous hypergraph to represent heterogeneous information and rich relations among them. A dual-channel aggregating mechanism is then designed to aggregate various information in the heterogeneous hypergraph. After that, we extract users' price preferences and interest preferences via attention layers. As to the second challenge, a co-guided learning scheme is designed to model the relations between price and interest preferences and enhance the learning of each other. Finally, we predict user actions based on item features and users' price and interest preferences. Extensive experiments on three real-world datasets demonstrate the effectiveness of the proposed CoHHN. Further analysis reveals the significance of price for session-based recommendation.

preprint2022arXiv

Risk Loadings in Classification Ratemaking

The risk premium of a policy is the sum of the pure premium and the risk loading. In the classification ratemaking process, generalized linear models are usually used to calculate pure premiums, and various premium principles are applied to derive the risk loadings. No matter which premium principle is used, some risk loading parameters should be given in advance subjectively. To overcome this subjective problem and calculate the risk premium more reasonably and objectively, we propose a top-down method to calculate these risk loading parameters. First, we implement the bootstrap method to calculate the total risk premium of the portfolio. Then, under the constraint that the portfolio's total risk premium should equal the sum of the risk premiums of each policy, the risk loading parameters are determined. During this process, besides using generalized linear models, three kinds of quantile regression models are also applied, namely, traditional quantile regression model, fully parametric quantile regression model, and quantile regression model with coefficient functions. The empirical result shows that the risk premiums calculated by the method proposed in this study can reasonably differentiate the heterogeneity of different risk classes.

preprint2022arXiv

Robotic Inspection of Underground Utilities for Construction Survey Using a Ground Penetrating Radar

Ground Penetrating Radar (GPR) is a very useful non-destructive evaluation (NDE) device for locating and mapping underground assets prior to digging and trenching efforts in construction. This paper presents a novel robotic system to automate the GPR data collection process, localize the underground utilities, interpret and reconstruct the underground objects for better visualization allowing regular non-professional users to understand the survey results. This system is composed of three modules: 1) an Omni-directional robotic data collection platform, that carries an RGB-D camera with an Inertial Measurement Unit (IMU) and a GPR antenna to perform automatic GPR data collection, and tag each GPR measurement with visual positioning information at every sampling step; 2) a learning-based migration module to interpret the raw GPR B-scan image into a 2D cross-section model of objects; 3) a 3D reconstruction module, i.e., GPRNet, to generate underground utility model represented as fine 3D point cloud. Comparative studies are performed on synthetic data and field GPR raw data with various incompleteness and noise. Experimental results demonstrate that our proposed method achieves a $30.0\%$ higher GPR imaging accuracy in mean Intersection Over Union (IoU) than the conventional back projection (BP) migration approach and $6.9\%$-$7.2\%$ less loss in Chamfer Distance (CD) than baseline methods regarding point cloud model reconstruction. The GPR-based robotic inspection provides an effective tool for civil engineers to detect and survey underground utilities before construction.

preprint2022arXiv

Snowmass2021 Cosmic Frontier White Paper: Puzzling Excesses in Dark Matter Searches and How to Resolve Them

Intriguing signals with excesses over expected backgrounds have been observed in many astrophysical and terrestrial settings, which could potentially have a dark matter origin. Astrophysical excesses include the Galactic Center GeV gamma-ray excess detected by the Fermi Gamma-Ray Space Telescope, the AMS antiproton and positron excesses, and the 511 and 3.5 keV X-ray lines. Direct detection excesses include the DAMA/LIBRA annual modulation signal, the XENON1T excess, and low-threshold excesses in solid state detectors. We discuss avenues to resolve these excesses, with actions the field can take over the next several years.

preprint2022arXiv

The Chinese Hα Solar Explorer (CHASE) mission: An overview

The Chinese Hα Solar Explorer (CHASE), dubbed "Xihe" - Goddess of the Sun, was launched on October 14, 2021 as the first solar space mission of China National Space Administration (CNSA). The CHASE mission is designed to test a newly developed satellite platform and to acquire the spectroscopic observations in the Hα waveband. The Hα Imaging Spectrograph (HIS) is the scientific payload of the CHASE satellite. It consists of two observational modes: raster scanning mode and continuum imaging mode. The raster scanning mode obtains full-Sun or region-of-interest spectral images from 6559.7 to 6565.9 Å and from 6567.8 to 6570.6 Å with 0.024 Å pixel spectral resolution and 1 minute temporal resolution. The continuum imaging mode obtains photospheric images in continuum around 6689 Å with the full width at half maximum of 13.4 Å. The CHASE mission will advance our understanding of the dynamics of solar activity in the photosphere and chromosphere. In this paper, we present an overview of the CHASE mission including the scientific objectives, HIS instrument overview, data calibration flow, and first results of on-orbit observations.

preprint2022arXiv

Wave induced thrust on a submerged hydrofoil: pitch stiffness effects

Submerged flapping foils can convert wave energy directly into thrust, which could be potentially utilised for green marine propulsion. This study analyses the wave-induced flapping hydrofoil propulsion using an in-house developed, new computational fluid dynamics (CFD) framework. The numerical model was initially validated against a few benchmarked problems and then used for the numerical investigation of wave-induced flapping hydrofoil propulsion. The transition between drag and thrust can be observed from the vortex flow pattern. The pitch stiffness and other physical parameter were non-dimensionalised for the first time. The optimal wave conditions and the optimal pitch stiffness are given for the future green marine system design.

preprint2021arXiv

Bi-GCN: Binary Graph Convolutional Network

Graph Neural Networks (GNNs) have achieved tremendous success in graph representation learning. Unfortunately, current GNNs usually rely on loading the entire attributed graph into network for processing. This implicit assumption may not be satisfied with limited memory resources, especially when the attributed graph is large. In this paper, we pioneer to propose a Binary Graph Convolutional Network (Bi-GCN), which binarizes both the network parameters and input node features. Besides, the original matrix multiplications are revised to binary operations for accelerations. According to the theoretical analysis, our Bi-GCN can reduce the memory consumption by an average of ~30x for both the network parameters and input data, and accelerate the inference speed by an average of ~47x, on the citation networks. Meanwhile, we also design a new gradient approximation based back-propagation method to train our Bi-GCN well. Extensive experiments have demonstrated that our Bi-GCN can give a comparable performance compared to the full-precision baselines. Besides, our binarization approach can be easily applied to other GNNs, which has been verified in the experiments.

preprint2021arXiv

Interference Impact on Decode-and-Forward Relay Networks with RIS-Assisted Source and Relays

In this letter, we consider the scenario of decode-and-forward relay network with reconfigurable intelligent surface (RIS)-assisted source and relays in the presence of interference. We derive approximate closed-form expression for the system outage probability assuming Rayleigh fading channels and opportunistic relaying scheme. In addition, we study the system behavior at the high signal-to-noise ratio (SNR) regime, where the diversity order and coding gain are obtained and analyzed. The results show that the system can achieve a diversity order of Gd = min(N1,N2)K, where N1 and N2 are the numbers of reflecting elements at the source and relays, respectively, and K is the number of relays. In addition, findings illustrate that for the same diversity order, utilizing one relay with multiple reflecting elements gives better performance than utilizing multiple relays with a single reflecting element. Furthermore, findings illustrate that the interference at the destination is more severe on the system performance than the interference at the relays. Therefore, under the same interference powers and for a fixed number of relays K, results show that the case where the first hop is dominating the performance N1 < N2 gives better results in terms of coding gain than the case where N2 < N1.

preprint2020arXiv

Accurate Closed-Form Approximations to Channel Distributions of RIS-Aided Wireless Systems

This paper proposes highly accurate closed-form approximations to channel distributions of two different reconfigurable intelligent surface (RIS)-based wireless system setups, namely, dual-hop RIS-aided (RIS-DH) scheme and RIS-aided transmit (RIS-T) scheme. Differently from previous works, the proposed approximations reveal to be very tight for arbitrary number $N$ of reflecting metasurface&#39;s elements. Our findings are then applied to the performance analysis of the considered systems, in which the outage probability, bit error rate, and average channel capacity are derived. Results show that the achievable diversity orders $G_d$ for RIS-DH and RIS-T schemes are $N-1<G_d<N$ and $N$, respectively. Furthermore, it is revealed that both schemes can not provide the multiplexing gain and only diversity gains are achieved. For the RIS-DH scheme, the channels are similar to the keyhole multiple-input multiple-output (MIMO) channels with only one degree of freedom, while the RIS-T scheme is like the transmit diversity structure.

preprint2020arXiv

Channel Estimation via Direct Calculation and Deep Learning for RIS-Aided mmWave Systems

This paper proposes a novel reconfigurable intelligent surface (RIS) architecture which enables channel estimation of RIS-assisted millimeter wave (mmWave) systems. More specifically, two channel estimation methods, namely, direct calculation (DC) and deep learning (DL) methods, are proposed to skillfully convert the overall channel estimation into two tasks: the channel estimation and the angle parameter estimation of a small number of active elements. In particular, the direct calculation method calculates the angle parameters directly through the channel estimates of adjacent active elements and, based on it, the DL method reduces the angle offset rate and further improves the accuracy of angle parameter estimation. Compared with the traditional methods, the proposed schemes reduce the complexity of the RIS channel estimation while outperforming the beam training method in terms of minimum square error, achievable rate, and outage probability.

preprint2020arXiv

Dirac series for some real exceptional Lie groups

Up to equivalence, this paper classifies all the irreducible unitary representations with non-zero Dirac cohomology for the following simple real exceptional Lie groups: ${\rm EI}=E_{6(6)}, {\rm EIV}=E_{6(-26)}, {\rm FI}=F_{4(4)}, {\rm FII}=F_{4(-20)}$. Along the way, we find an irreducible unitary representation of $F_{4(4)}$ whose Dirac index vanishes, while its Dirac cohomology is non-zero. This disproves a conjecture raised in 2015 asserting that there should be no cancellation between the even part and the odd part of the Dirac cohomology.

preprint2020arXiv

GPR-based Subsurface Object Detection and Reconstruction Using Random Motion and DepthNet

Ground Penetrating Radar (GPR) is one of the most important non-destructive evaluation (NDE) devices to detect the subsurface objects (i.e. rebars, utility pipes) and reveal the underground scene. One of the biggest challenges in GPR based inspection is the subsurface targets reconstruction. In order to address this issue, this paper presents a 3D GPR migration and dielectric prediction system to detect and reconstruct underground targets. This system is composed of three modules: 1) visual inertial fusion (VIF) module to generate the pose information of GPR device, 2) deep neural network module (i.e., DepthNet) which detects B-scan of GPR image, extracts hyperbola features to remove the noise in B-scan data and predicts dielectric to determine the depth of the objects, 3) 3D GPR migration module which synchronizes the pose information with GPR scan data processed by DepthNet to reconstruct and visualize the 3D underground targets. Our proposed DepthNet processes the GPR data by removing the noise in B-scan image as well as predicting depth of subsurface objects. For DepthNet model training and testing, we collect the real GPR data in the concrete test pit at Geophysical Survey System Inc. (GSSI) and create the synthetic GPR data by using gprMax3.0 simulator. The dataset we create includes 350 labeled GPR images. The DepthNet achieves an average accuracy of 92.64% for B-scan feature detection and an 0.112 average error for underground target depth prediction. In addition, the experimental results verify that our proposed method improve the migration accuracy and performance in generating 3D GPR image compared with the traditional migration methods.

preprint2020arXiv

Judging a Book by Its Cover: The Effect of Facial Perception on Centrality in Social Networks

Facial appearance matters in social networks. Individuals frequently make trait judgments from facial clues. Although these face-based impressions lack the evidence to determine validity, they are of vital importance, because they may relate to human network-based social behavior, such as seeking certain individuals for help, advice, dating, and cooperation, and thus they may relate to centrality in social networks. However, little to no work has investigated the apparent facial traits that influence network centrality, despite the large amount of research on attributions of the central position including personality and behavior. In this paper, we examine whether perceived traits based on facial appearance affect network centrality by exploring the initial stage of social network formation in a first-year college residential area. We took face photos of participants who are freshmen living in the same residential area, and we asked them to nominate community members linking to different networks. We then collected facial perception data by requiring other participants to rate facial images for three main attributions: dominance, trustworthiness, and attractiveness. Meanwhile, we proposed a framework to discover how facial appearance affects social networks. Our results revealed that perceived facial traits were correlated with the network centrality and that they were indicative to predict the centrality of people in different networks. Our findings provide psychological evidence regarding the interaction between faces and network centrality. Our findings also offer insights in to a combination of psychological and social network techniques, and they highlight the function of facial bias in cuing and signaling social traits. To the best of our knowledge, we are the first to explore the influence of facial perception on centrality in social networks.

preprint2020arXiv

On the Performance of Dual-Hop Systems over Mixed FSO/mmWave Fading Channels

Free-space optical (FSO) links are considered as a cost-efficient way to fill the backhaul/fronthaul connectivity gap between millimeter wave (mmWave) access networks and optical fiber based central networks. In this paper, we investigate the end-to-end performance of dual-hop mixed FSO/mmWave systems to address this combined use. The FSO link is modeled as a Gamma-Gamma fading channel using both heterodyne detection and indirect modulation/direct detection with pointing error impairments, while the mmWave link experiences the fluctuating two-ray fading. Under the assumption of both amplify-and-forward and decode-and-forward relaying, we derive novel closed-form expressions for the outage probability, average bit error probability (BER), ergodic capacity, effective capacity in terms of bivariate Fox&#39;s $H$-functions. Additionally, we discuss the diversity gain and provide other important engineering insights based on the high signal-to-noise-ratio analysis of the outage probability and the average BER. Finally, all our analytical results are verified using Monte Carlo simulations.

preprint2020arXiv

Outage Probability and Capacity Scaling Law of Multiple RIS-Aided Cooperative Networks

In this letter, we consider a dual-hop cooperative network assisted by multiple reconfigurable intelligent surfaces (RISs). Assuming that the RIS with the highest instantaneous end-to-end signal-to-noise ratio (SNR) is selected to aid the communication, the outage probability (OP) and average sum-rate are investigated. Specifically, an exact analysis for the OP is developed. In addition, relying on the extreme value theory, closed-form expressions for the asymptotic OP and asymptotic sum-rate are derived, based on which the capacity scaling law is established. Our results are corroborated through simulations and insightful discussions are provided. In particular, our analysis shows that the number of RISs as well as the number of reflecting elements play a crucial role in the capacity scaling law of multiple RIS-aided cooperative networks. Also, comparisons with relay-aided systems are carried out to demonstrate that the proposed system setup outperforms relaying schemes both in terms of the OP and average sum-rate.

preprint2020arXiv

Secrecy Outage Probability Analysis for RIS-Assisted NOMA Systems

In this paper, the physical layer security (PLS) for a novel reconfigurable intelligent surface (RIS)-assisted non-orthogonal multiple access (NOMA) system in a multi-user scenario is investigated, where we consider the worst case that the eavesdropper also utilizes the advantage of the RISs. More specifically, we derive analytical results for the secrecy outage probability (SOP). From the numerical results, we observe that the use of RISs can improve the secrecy performance compared to traditional NOMA systems. However, for the worst case that the received signals at the eavesdropper comes from the RISs and source, increasing the number of intelligent elements on the RIS has a negative impact on the secrecy performance. At high SNRs, the system&#39;s SOP tends to a constant. Finally, the secrecy performance can be improved through the group selection.

preprint2020arXiv

The First Round Result from the TianQin-1 Satellite

The TianQin-1 satellite (TQ-1), which is the first technology demonstration satellite for the TianQin project, was launched on 20 December 2019. The first round of experiment had been carried out from 21 December 2019 until 1 April 2020. The residual acceleration of the satellite is found to be about $1\times10^{-10}~{\rm m}/{\rm s}^{2}/{\rm Hz}^{1/2}$ at $0.1~{\rm Hz}\,$ and about $5\times10^{-11}~{\rm m}/{\rm s}^{2}/{\rm Hz}^{1/2}$ at $0.05~{\rm Hz}\,$, measured by an inertial sensor with a sensitivity of $5\times10^{-12}~{\rm m}/{\rm s}^{2}/{\rm Hz}^{1/2}$ at $0.1~{\rm Hz}\,$. The micro-Newton thrusters has demonstrated a thrust resolution of $0.1~μ{\rm N}$ and a thrust noise of $0.3~μ{\rm N}/{\rm Hz}^{1/2}$ at $0.1~{\rm Hz}$. The residual noise of the satellite with drag-free control is $3\times10^{-9}~{\rm m}/{\rm s}^{2}/{\rm Hz}^{1/2}$ at $0.1~{\rm Hz}\,$. The noise level of the optical readout system is about $30~{\rm pm}/{\rm Hz}^{1/2}$ at $0.1~{\rm Hz}\,$. The temperature stability at temperature monitoring position is controlled to be about $\pm3~{\rm mK}$ per orbit, and the mismatch between the center-of-mass of the satellite and that of the test mass is measured with a precision of better than $0.1~{\rm mm}$.

preprint2020arXiv

The TianQin project: current progress on science and technology

TianQin is a planned space-based gravitational wave (GW) observatory consisting of three earth orbiting satellites with an orbital radius of about $10^5~{\rm km}$. The satellites will form a equilateral triangle constellation the plane of which is nearly perpendicular to the ecliptic plane. TianQin aims to detect GWs between $10^{-4}~{\rm Hz}$ and $1~{\rm Hz}$ that can be generated by a wide variety of important astrophysical and cosmological sources, including the inspiral of Galactic ultra-compact binaries, the inspiral of stellar-mass black hole binaries, extreme mass ratio inspirals, the merger of massive black hole binaries, and possibly the energetic processes in the very early universe or exotic sources such as cosmic strings. In order to start science operations around 2035, a roadmap called the 0123 plan is being used to bring the key technologies of TianQin to maturity, supported by the construction of a series of research facilities on the ground. Two major projects of the 0123 plan are being carried out. In this process, the team has created a new generation $17~{\rm cm}$ single-body hollow corner-cube retro-reflector which has been launched with the QueQiao satellite on 21 May 2018; a new laser ranging station equipped with a $1.2~{\rm m}$ telescope has been constructed and the station has successfully ranged to all the five retro-reflectors on the Moon; and the TianQin-1 experimental satellite has been launched on 20 December 2019 and the first round result shows that the satellite has exceeded all of its mission requirements.

preprint2020arXiv

Weakly Supervised Semantic Segmentation in 3D Graph-Structured Point Clouds of Wild Scenes

The deficiency of 3D segmentation labels is one of the main obstacles to effective point cloud segmentation, especially for scenes in the wild with varieties of different objects. To alleviate this issue, we propose a novel deep graph convolutional network-based framework for large-scale semantic scene segmentation in point clouds with sole 2D supervision. Different with numerous preceding multi-view supervised approaches focusing on single object point clouds, we argue that 2D supervision is capable of providing sufficient guidance information for training 3D semantic segmentation models of natural scene point clouds while not explicitly capturing their inherent structures, even with only single view per training sample. Specifically, a Graph-based Pyramid Feature Network (GPFN) is designed to implicitly infer both global and local features of point sets and an Observability Network (OBSNet) is introduced to further solve object occlusion problem caused by complicated spatial relations of objects in 3D scenes. During the projection process, perspective rendering and semantic fusion modules are proposed to provide refined 2D supervision signals for training along with a 2D-3D joint optimization strategy. Extensive experimental results demonstrate the effectiveness of our 2D supervised framework, which achieves comparable results with the state-of-the-art approaches trained with full 3D labels, for semantic point cloud segmentation on the popular SUNCG synthetic dataset and S3DIS real-world dataset.