Researcher profile

Ray T. Chen

Ray T. Chen contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
10works
0followers
7topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

10 published item(s)

preprint2023arXiv

A Point-of-Care Biosensor for Rapid Detection and Differentiation of COVID-19 Virus (SARS-CoV-2) and Influenza Virus Using Subwavelength Grating Micro-ring Resonator

In the context of continued spread of coronavirus disease 2019 (COVID-19) caused by SARS-CoV-2 and the emergence of new variants, the demand for rapid, accurate, and frequent detection is increasing. Besides, the new predominant strain, Omicron variant, manifests more similar clinical features to those of other common respiratory infections. The concurrent detection of multiple potential pathogens helps distinguish SARS-CoV-2 infection from other diseases with overlapping symptoms, which is significant for patients to receive tailored treatment and containing the outbreak. Here, we report a lab-on-a-chip biosensing platform for SARS-CoV-2 detection based on subwavelength grating micro-ring resonator. The sensing surface is functionalized by specific antibody against SARS-CoV-2 spike protein, which could produce redshifts of resonant peaks by antigen-antibody combination, thus achieving quantitative detection. Additionally, the sensor chip is integrated with a microfluidic chip with an anti-backflow Y-shaped structure that enables the concurrent detection of two analytes. In this study, we realized the detection and differentiation of COVID-19 and influenza A H1N1. Experimental results show that the limit of detection of our device reaches 100 fg/mL (1.31 fM) within 15 min detecting time, and cross-reactivity tests manifest the specificity of the optical diagnostic assay. Further, the integrated packaging and streamlined workflow facilitate its use for clinical applications. Thus, the biosensing platform offers a promising solution to achieve ultrasensitive, selective, multiplexed, and quantitative point-of-care detection of COVID-19.

preprint2023arXiv

Lightening-Transformer: A Dynamically-operated Optically-interconnected Photonic Transformer Accelerator

The wide adoption and significant computing resource of attention-based transformers, e.g., Vision Transformers and large language models (LLM), have driven the demand for efficient hardware accelerators. There is a growing interest in exploring photonics as an alternative technology to digital electronics due to its high energy efficiency and ultra-fast processing speed. Photonic accelerators have shown promising results for CNNs, which mainly rely on weight-static linear operations. However, they encounter issues when efficiently supporting Transformer architectures, questioning the applicability of photonics to advanced ML tasks. The primary hurdle lies in their inefficiency in handling unique workloads in Transformers, i.e., dynamic and full-range tensor multiplication. In this work, we propose Lightening-Transformer, the first light-empowered, high-performance, and energy-efficient photonic Transformer accelerator. To overcome prior designs' fundamental limitations, we introduce a novel dynamically-operated photonic tensor core, DPTC, a crossbar array of interference-based optical vector dot-product engines supporting highly parallel, dynamic, and full-range matrix multiplication. Furthermore, we design a dedicated accelerator that integrates our novel photonic computing cores with photonic interconnects for inter-core data broadcast, fully unleashing the power of optics. Comprehensive evaluations show that ours achieves >2.6x energy and >12x latency reductions compared to prior photonic accelerators and delivers the lowest energy cost and 2 to 3 orders of magnitude lower energy-delay product compared to electronic Transformer accelerators, all while maintaining digital-comparable accuracy. Our work highlights the immense potential of photonics for advanced ML workloads, such as Transformer-backboned LLM. Our work is available at https://github.com/zhuhanqing/Lightening-Transformer.

preprint2023arXiv

M3ICRO: Machine Learning-Enabled Compact Photonic Tensor Core based on PRogrammable Multi-Operand Multimode Interference

Photonic computing shows promise for transformative advancements in machine learning (ML) acceleration, offering ultra-fast speed, massive parallelism, and high energy efficiency. However, current photonic tensor core (PTC) designs based on standard optical components hinder scalability and compute density due to their large spatial footprint. To address this, we propose an ultra-compact PTC using customized programmable multi-operand multimode interference (MOMMI) devices, named M3ICRO. The programmable MOMMI leverages the intrinsic light propagation principle, providing a single-device programmable matrix unit beyond the conventional computing paradigm of one multiply-accumulate (MAC) operation per device. To overcome the optimization difficulty of customized devices that often requires time-consuming simulation, we apply ML for optics to predict the device behavior and enable a differentiable optimization flow. We thoroughly investigate the reconfigurability and matrix expressivity of our customized PTC, and introduce a novel block unfolding method to fully exploit the computing capabilities of a complex-valued PTC for near-universal real-valued linear transformations. Extensive evaluations demonstrate that M3ICRO achieves a 3.4-9.6x smaller footprint, 1.6-4.4x higher speed, 10.6-42x higher compute density, 3.7-12x higher system throughput, and superior noise robustness compared to state-of-the-art coherent PTC designs, while maintaining close-to-digital task accuracy across various ML benchmarks. Our code is open-sourced at https://github.com/JeremieMelo/M3ICRO-MOMMI.

preprint2022arXiv

A compact butterfly-style silicon photonic-electronic neural chip for hardware-efficient deep learning

The optical neural network (ONN) is a promising hardware platform for next-generation neurocomputing due to its high parallelism, low latency, and low energy consumption. Previous ONN architectures are mainly designed for general matrix multiplication (GEMM), leading to unnecessarily large area cost and high control complexity. Here, we move beyond classical GEMM-based ONNs and propose an optical subspace neural network (OSNN) architecture, which trades the universality of weight representation for lower optical component usage, area cost, and energy consumption. We devise a butterfly-style photonic-electronic neural chip to implement our OSNN with up to 7x fewer trainable optical components compared to GEMM-based ONNs. Additionally, a hardware-aware training framework is provided to minimize the required device programming precision, lessen the chip area, and boost the noise robustness. We experimentally demonstrate the utility of our neural chip in practical image recognition tasks, showing that a measured accuracy of 94.16% can be achieved in hand-written digit recognition tasks with 3-bit weight programming precision.

preprint2022arXiv

ADEPT: Automatic Differentiable DEsign of Photonic Tensor Cores

Photonic tensor cores (PTCs) are essential building blocks for optical artificial intelligence (AI) accelerators based on programmable photonic integrated circuits. PTCs can achieve ultra-fast and efficient tensor operations for neural network (NN) acceleration. Current PTC designs are either manually constructed or based on matrix decomposition theory, which lacks the adaptability to meet various hardware constraints and device specifications. To our best knowledge, automatic PTC design methodology is still unexplored. It will be promising to move beyond the manual design paradigm and "nurture" photonic neurocomputing with AI and design automation. Therefore, in this work, for the first time, we propose a fully differentiable framework, dubbed ADEPT, that can efficiently search PTC designs adaptive to various circuit footprint constraints and foundry PDKs. Extensive experiments show superior flexibility and effectiveness of the proposed ADEPT framework to explore a large PTC design space. On various NN models and benchmarks, our searched PTC topology outperforms prior manually-designed structures with competitive matrix representability, 2-30x higher footprint compactness, and better noise robustness, demonstrating a new paradigm in photonic neural chip design. The code of ADEPT is available at https://github.com/JeremieMelo/ADEPT using the https://github.com/JeremieMelo/pytorch-onn (TorchONN) library.

preprint2022arXiv

Lab-on-a-Chip Optical Biosensor Platform: Micro Ring Resonator Integrated with Near-Infrared Fourier Transform Spectrometer

A micro-ring-resonator (MRR) optical biosensor based on the evanescent field sensing mechanism has been extensively studied due to its high sensitivity and compact device size. However, a suitable on-chip integrated spectrometer device has to be demonstrated for the lab-on-a-chip applications, which can read the resonance wavelength shift from MRR biosensors based on minuscule changes in refractive index. In this paper, we demonstrated the design and experimental results of the near-infrared lab-on-a-chip optical biosensor platform that monolithically integrates the MRR and the on-chip spectrometer on the silicon-on-insulator (SOI) wafer, which can eliminate the external optical spectrum analyzer for scanning the wavelength spectrum. The symmetric add-drop MRR biosensor is designed to have a free spectral range (FSR) of ~19 nm, and a bulk sensitivity of ~73 nm/RIU; then the drop-port output resonance peaks are reconstructed from the integrated spatial-heterodyne Fourier transform spectrometer (SHFTS) with the spectral resolution of ~3.1 nm and bandwidth of ~50 nm, which results in the limit of detection of 0.042 RIU. The MRR output spectrum with air- and water-claddings are measured and reconstructed from the MRR-SHFTS integrated device experimentally to validate the wavelength shifting measurement.

preprint2022arXiv

Packaging-enhanced optical fiber-chip interconnect with enlarged grating coupler and multimode fiber

Optical I/O plays a crucial role in the lifespan of lab-on-a-chip systems, from preliminary testing to operation in the target environment. However, due to the precise alignments required, efficient and reliable fiber-to-chip connections remain challenging, yielding inconsistent test results and unstable packaged performance. To overcome this issue, for use in single mode on-chip systems, we propose the incorporation of area-enlarged grating couplers working in conjunction with multimode fibers. This combination enables simpler, faster, and more reliable connections than the traditional small area grating coupler with single-mode fiber. In this work, we experimentally demonstrate a 3dB in-plane (X, Y) spatial tolerance of (10.2 μm, 17.3 μm) for the large area configuration, being at least (2.49, 3.33) times that of the small area one, and agreeing well with theoretical calculations. The simple concept is readily applicable to a range of photonic systems where cheaper more robust optical I/O is desired.

preprint2020arXiv

Extra Loss-free Non-Hermitian Engineered Single Mode Laser Systems

In a laser system non-Hermitian methods such as Parity-Time (PT) Symmetry and Supersymmetry (SUSY) have shown and demonstrated the ability to suppress unwanted lasing modes and, thus, achieved single mode lasing operation through the addition of lossy passive elements. While these approaches enable laser engineering versatility, they rely on the drawback of adding optical losses to a system tasked to produce single mode gain. Unlike PT and SUSY lasers, here we show an extra loss-free non-Hermitian laser engineering approach to realize single mode lasing operation for the first time. By selectively enhancing the fundamental modes quality factor, we obtain single mode operation with higher output power per cavity since all cavities in this system contribute to the laser output, in contrast to other non-Hermitian approaches. Furthermore, we show that this approach interestingly allows reducing the number of to-be-designed cavities in super-partner array as compared with, for example, the SUSY approach, thus leading to reduced design complexity upon coupled cavity scale up of laser arrays. In summary, the ability to engineer coupled laser systems where each laser cavity contributes to coherent light amplification opens up a new degree of laser-design freedom leading to increased device performance and simultaneous reduced design and fabrication complexity.

preprint2020arXiv

Hexagonal Transverse Coupled Cavity VCSEL Redefining the High-Speed Lasers

The vertical-cavity surface-emitting lasers (VCSELs) have emerged as a vital approach for realizing energy efficient, high speed optical interconnects in the data center and supercomputers. As of today, VCSEL is the most suitable for mass production in terms of cost-effectiveness and reliability. However, there are still key challenges for higher speed modulation above 40 GHz. Here, a hexagonal transverse coupled cavity VCSEL adiabatically coupled through the center cavity is proposed. A 3-dB roll-off modulation bandwidth of 45 GHz is demonstrated, which is five times greater than a conventional VCSEL fabricated on the same epi-wafer structure. While a parity time (PT) symmetry approaches add loss to engineer the topological state of the laser system, here, a radical paradigm shift with gain introduces symmetry breaking. This idea, then enables a single mode operation with a side-mode suppression-ratio (SMSR) of > 30 decibels and signal-to-noise ratio (SNR) of > 45 decibels. The energy distribution inside the coupled cavity system is also redistributed to provide a coherent gain in a spatially separated system. Consequently, throughput power is three times higher than that of the conventional VCSEL.