Source author record

Leilei Shi

Leilei Shi appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

q-fin.TR Computer Vision Cryptography and Security Machine Learning Multimedia physics.optics q-fin.GN q-fin.ST

Catalog footprint

What is connected

5works

8topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

HiDream-O1-Image: A Natively Unified Image Generative Foundation Model with Pixel-level Unified Transformer

The evolution of visual generative models has long been constrained by fragmented architectures relying on disjoint text encoders and external VAEs. In this report, we present HiDream-O1-Image, a natively unified generative foundation model via pixel-space Diffusion Transformer, that pioneers a paradigm shift from modular architectures to an end-to-end in-context visual generation engine. By mapping raw image pixels, text tokens, and task-specific conditions into a single shared token space, HiDream-O1-Image achieves a structural unification of multimodal inputs within an Unified Transformer (UiT) architecture. This native encoding paradigm eliminates the need for separate VAEs or disjoint pre-trained text encoders, allowing the model to treat diverse generation and editing tasks as a consistent in-context reasoning process. Extensive experiments show that HiDream-O1-Image excels across various generation tasks, including text-to-image generation, instruction-based editing, and subject-driven personalization. Notably, with only 8B parameters, HiDream-O1-Image (8B) achieves performance parity with or even surpasses established state-of-the-art models with significantly larger parameters (e.g., 27B Qwen-Image). Crucially, to validate the immense scalability of this paradigm, we successfully scale the architecture up to over 200B parameters. Experimental results demonstrate that this massive-scale version HiDream-O1-Image-Pro (200B+) unlocks unprecedented generative capabilities and superior performance, establishing new state-of-the-art benchmarks. Ultimately, HiDream-O1-Image highlights the immense potential of natively unified architectures and charts a highly scalable path toward next-generation multimodal AI.

preprint2022arXiv

sqSGD: Locally Private and Communication Efficient Federated Learning

Federated learning (FL) is a technique that trains machine learning models from decentralized data sources. We study FL under local notions of privacy constraints, which provides strong protection against sensitive data disclosures via obfuscating the data before leaving the client. We identify two major concerns in designing practical privacy-preserving FL algorithms: communication efficiency and high-dimensional compatibility. We then develop a gradient-based learning algorithm called \emph{sqSGD} (selective quantized stochastic gradient descent) that addresses both concerns. The proposed algorithm is based on a novel privacy-preserving quantization scheme that uses a constant number of bits per dimension per client. Then we improve the base algorithm in three ways: first, we apply a gradient subsampling strategy that simultaneously offers better training performance and smaller communication costs under a fixed privacy budget. Secondly, we utilize randomized rotation as a preprocessing step to reduce quantization error. Thirdly, an adaptive gradient norm upper bound shrinkage strategy is adopted to improve accuracy and stabilize training. Finally, the practicality of the proposed framework is demonstrated on benchmark datasets. Experiment results show that sqSGD successfully learns large models like LeNet and ResNet with local privacy constraints. In addition, with fixed privacy and communication level, the performance of sqSGD significantly dominates that of various baseline algorithms.

preprint2014arXiv

Ultra-narrow Linewidth Fiber Laser with Self-injection Feedback Based on Rayleigh Backscattering

A single longitudinal mode fiber laser with ultra-narrow linewidth based on self-injection feedback by using the linewidth compress mechanism of Rayleigh backscattering (RBS) are proposed and demonstrated. Since the linewidth of RBS is narrower than that of the incident light in optical fibers and they have the same centre wavelength, the RBS can act as a mechanism to compress the linewidth of the incident light in fiber ring laser. In addition, more RBS signal could be collected to help further compress the laser linewidth besides the free spectral range is expanded when the self-injection feedback method is used. Our experimental results show that the side-mode suppression ratio of our laser is up to 75dB and the laser linewidth could be low to ~130Hz.

preprint2010arXiv

A Security Price Volatile Trading Conditioning Model

We develop a theoretical trading conditioning model subject to price volatility and return information in terms of market psychological behavior, based on analytical transaction volume-price probability wave distributions in which we use transaction volume probability to describe price volatility uncertainty and intensity. Applying the model to high frequent data test in China stock market, we have main findings as follows: 1) there is, in general, significant positive correlation between the rate of mean return and that of change in trading conditioning intensity; 2) it lacks significance in spite of positive correlation in two time intervals right before and just after bubble crashes; and 3) it shows, particularly, significant negative correlation in a time interval when SSE Composite Index is rising during bull market. Our model and findings can test both disposition effect and herd behavior simultaneously, and explain excessive trading (volume) and other anomalies in stock market.

preprint2010arXiv

Does Security Transaction Volume-Price Behavior Resemble a Probability Wave?

Motivated by how transaction amount constrain trading volume and price volatility in stock market, we, in this paper, study the relation between volume and price if amount of transaction is given. We find that accumulative trading volume gradually emerges a kurtosis near the price mean value over a trading price range when it takes a longer trading time, regardless of actual price fluctuation path, time series, or total transaction volume in the time interval. To explain the volume-price behavior, we, in terms of physics, propose a transaction energy hypothesis, derive a time-independent transaction volume-price probability wave equation, and get two sets of analytical volume distribution eigenfunctions over a trading price range. By empiric test, we show the existence of coherence in stock market and demonstrate the model validation at this early stage. The volume-price behaves like a probability wave.

Leilei Shi

What is connected

Connect this record

See the researcher in context

Building this map preview

5 published item(s)

HiDream-O1-Image: A Natively Unified Image Generative Foundation Model with Pixel-level Unified Transformer

sqSGD: Locally Private and Communication Efficient Federated Learning

Ultra-narrow Linewidth Fiber Laser with Self-injection Feedback Based on Rayleigh Backscattering

A Security Price Volatile Trading Conditioning Model

Does Security Transaction Volume-Price Behavior Resemble a Probability Wave?