1
|
Liu W, Huang Y, Sun R, Fu T, Yang S, Chen H. Ultra-compact multi-task processor based on in-memory optical computing. LIGHT, SCIENCE & APPLICATIONS 2025; 14:134. [PMID: 40122842 PMCID: PMC11930997 DOI: 10.1038/s41377-025-01814-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/28/2024] [Revised: 02/13/2025] [Accepted: 03/05/2025] [Indexed: 03/25/2025]
Abstract
To enhance the computational density and energy efficiency of on-chip neuromorphic hardware, this study introduces a novel network architecture for multi-task processing with in-memory optical computing. On-chip optical neural networks are celebrated for their capability to transduce a substantial volume of parameters into optical form while conducting passive computing, yet they encounter challenges in scalability and multitasking. Leveraging the principles of transfer learning, this approach involves embedding the majority of parameters into fixed optical components and a minority into adjustable electrical components. Furthermore, with deep regression algorithm in modeling physical propagation process, a compact optical neural network achieve to handle diverse tasks. In this work, two ultra-compact in-memory diffraction-based chips with integration of more than 60,000 parameters/mm2 were fabricated, employing deep neural network model and the hard parameter sharing algorithm, to perform multifaceted classification and regression tasks, respectively. The experimental results demonstrate that these chips achieve accuracies comparable to those of electrical networks while significantly reducing the power-intensive digital computation by 90%. Our work heralds strong potential for advancing in-memory optical computing frameworks and next generation of artificial intelligence platforms.
Collapse
Affiliation(s)
- Wencan Liu
- Department of Electronic Engineering, Tsinghua University, Beijing, China
- Beijing National Research Center for Information Science and Technology (BNRist), Beijing, China
| | - Yuyao Huang
- Department of Electronic Engineering, Tsinghua University, Beijing, China
- Beijing National Research Center for Information Science and Technology (BNRist), Beijing, China
| | - Run Sun
- Department of Electronic Engineering, Tsinghua University, Beijing, China
- Beijing National Research Center for Information Science and Technology (BNRist), Beijing, China
| | - Tingzhao Fu
- Hunan Provincial Key Laboratory of Novel Nano Optoelectronic Information Materials and Devices, College of Advanced Interdisciplinary Studies, National University of Defense Technology, Changsha, China
| | - Sigang Yang
- Department of Electronic Engineering, Tsinghua University, Beijing, China
- Beijing National Research Center for Information Science and Technology (BNRist), Beijing, China
| | - Hongwei Chen
- Department of Electronic Engineering, Tsinghua University, Beijing, China.
- Beijing National Research Center for Information Science and Technology (BNRist), Beijing, China.
| |
Collapse
|
2
|
Ren H, Zhou S, Feng Y, Wang D, Yang X, Chen S. Flippable multitask diffractive neural networks based on double-sided metasurfaces. OPTICS LETTERS 2025; 50:1997-2000. [PMID: 40085616 DOI: 10.1364/ol.555533] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/14/2025] [Accepted: 02/26/2025] [Indexed: 03/16/2025]
Abstract
Diffractive neural networks (DNNs) have garnered significant attention in recent years as a physical computing framework, combining high computational speed, parallelism, and low-power consumption. However, the non-reconfigurability of cascaded diffraction layers limits the ability of DNNs to perform multitasking, and methods such as replacing diffraction layers or light sources, while theoretically feasible, are difficult to implement in practice. This Letter introduces a flippable diffractive neural network (F-DNN) in which the diffraction layer is an integrated structure processed on both sides of the substrate. This design allows rapid task switching by flipping diffraction layers and overcomes alignment challenges that arise when replacing layers. Classification-based simulation results demonstrate that F-DNN addresses the limitations of traditional multitask DNN architectures, offering both superior performance and scalability, which provides a new approach for realizing high-speed, low-power, and multitask artificial intelligence systems.
Collapse
|
3
|
Shao H, Wang W, Zhang Y, Gao B, Jiang C, Li Y, Xie P, Yan Y, Shen Y, Wu Z, Wang R, Ji Y, Ling H, Huang W, Ho JC. Adaptive In-Sensor Computing for Enhanced Feature Perception and Broadband Image Restoration. ADVANCED MATERIALS (DEERFIELD BEACH, FLA.) 2025; 37:e2414261. [PMID: 39659128 DOI: 10.1002/adma.202414261] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/20/2024] [Revised: 11/22/2024] [Indexed: 12/12/2024]
Abstract
Traditional imaging systems struggle in weak or complex lighting environments due to their fixed spectral responses, resulting in spectral mismatches and degraded image quality. To address these challenges, a bioinspired adaptive broadband image sensor is developed. This innovative sensor leverages a meticulously designed type-I heterojunction alignment of 0D perovskite quantum dots (PQDs) and 2D black phosphorus (BP). This configuration enables efficient carrier injection control and advanced computing capabilities within an integrated phototransistor array. The sensor's unique responses to both visible and infrared (IR) light facilitate selective enhancement and precise feature extraction under varying lighting conditions. Furthermore, it supports real-time convolution and image restoration within a convolutional autoencoder (CAE) network, effectively countering image degradation by capturing spectral features. Remarkably, the hardware responsivity weights perform comparably to software-trained weights, achieving an image restoration accuracy of over 85%. This approach offers a robust and versatile solution for machine vision applications that demand precise and adaptive imaging in dynamic lighting environments.
Collapse
Affiliation(s)
- He Shao
- Department of Materials Science and Engineering, City University of Hong Kong, Hong Kong SAR, 999077, China
| | - Weijun Wang
- Department of Materials Science and Engineering, City University of Hong Kong, Hong Kong SAR, 999077, China
| | - Yuxuan Zhang
- Department of Materials Science and Engineering, City University of Hong Kong, Hong Kong SAR, 999077, China
| | - Boxiang Gao
- Department of Materials Science and Engineering, City University of Hong Kong, Hong Kong SAR, 999077, China
| | - Chunsheng Jiang
- Department of Materials Science and Engineering, City University of Hong Kong, Hong Kong SAR, 999077, China
| | - Yezhan Li
- Department of Materials Science and Engineering, City University of Hong Kong, Hong Kong SAR, 999077, China
| | - Pengshan Xie
- Department of Materials Science and Engineering, City University of Hong Kong, Hong Kong SAR, 999077, China
| | - Yan Yan
- Department of Materials Science and Engineering, City University of Hong Kong, Hong Kong SAR, 999077, China
| | - Yi Shen
- Department of Materials Science and Engineering, City University of Hong Kong, Hong Kong SAR, 999077, China
| | - Zenghui Wu
- Department of Materials Science and Engineering, City University of Hong Kong, Hong Kong SAR, 999077, China
| | - Ruiheng Wang
- State Key Laboratory of Organic Electronics and Information Displays & Institute of Advanced Materials (IAM), Nanjing University of Posts & Telecommunications (NJUPT), Nanjing, 210023, China
| | - Yu Ji
- State Key Laboratory of Organic Electronics and Information Displays & Institute of Advanced Materials (IAM), Nanjing University of Posts & Telecommunications (NJUPT), Nanjing, 210023, China
| | - Haifeng Ling
- State Key Laboratory of Organic Electronics and Information Displays & Institute of Advanced Materials (IAM), Nanjing University of Posts & Telecommunications (NJUPT), Nanjing, 210023, China
| | - Wei Huang
- State Key Laboratory of Organic Electronics and Information Displays & Institute of Advanced Materials (IAM), Nanjing University of Posts & Telecommunications (NJUPT), Nanjing, 210023, China
| | - Johnny C Ho
- Department of Materials Science and Engineering, City University of Hong Kong, Hong Kong SAR, 999077, China
- State Key Laboratory of Terahertz and Millimeter Waves, City University of Hong Kong, Hong Kong SAR, 999077, China
- Institute for Materials Chemistry and Engineering, Kyushu University, Fukuoka, 816-8580, Japan
| |
Collapse
|
4
|
Wang Y, Wang Y, Yu A, Hu M, Wang Q, Pang C, Xiong H, Cheng Y, Qi J. Non-Interleaved Shared-Aperture Full-Stokes Metalens via Prior-Knowledge-Driven Inverse Design. ADVANCED MATERIALS (DEERFIELD BEACH, FLA.) 2025; 37:e2408978. [PMID: 39586985 DOI: 10.1002/adma.202408978] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/24/2024] [Revised: 11/18/2024] [Indexed: 11/27/2024]
Abstract
Characterization of electromagnetic wave polarization states is critical in various applications of materials, biomedical, and imaging. The emergence of metasurfaces opens up the possibility of implementing highly integrated full-Stokes imagers. Despite rapid development, prevailing schemes on metasurface-based full-Stokes imagers require interleaved or cascaded designs, inevitably resulting in performance deterioration, bulky size, and complicating the imaging procedure due to misalignment. To overcome these challenges, a non-interleaved shared-aperture full-Stokes metalens enabled by the prior-knowledge-driven inverse design methodology is proposed. The metalens can be directly deployed into imagers, performing high-accuracy diffraction-limited full-Stokes imaging without other ancillary components, breaking intrinsic constraints of efficiency and resolution in interleaved design. To demonstrate this, the metalens is integrated into the W-band passive imaging system, as an alternative solution, to perform polarimetric imaging, which reduces the reliance on the high cost and high complexity of the ortho-mode transducer and broadband correlator in traditional polarimetric radiometers. Furthermore, with the assistance of a 3D reconstruction method, the feasibility of multi-polarization information for contactless surface slope measurements is explored. This work may open new paradigms for the full-Stokes imager designing and broaden the applications of metasurface polarimetric imaging.
Collapse
Affiliation(s)
- Yuzhong Wang
- Department of Microwave Engineering, School of Electronics and Information Engineering, Harbin Institute of Technology, Harbin, 150001, China
| | - Yifei Wang
- Department of Microwave Engineering, School of Electronics and Information Engineering, Harbin Institute of Technology, Harbin, 150001, China
| | - Axiang Yu
- Department of Microwave Engineering, School of Electronics and Information Engineering, Harbin Institute of Technology, Harbin, 150001, China
| | - Mingshuang Hu
- Department of Microwave Engineering, School of Electronics and Information Engineering, Harbin Institute of Technology, Harbin, 150001, China
| | - Qiming Wang
- Department of Microwave Engineering, School of Electronics and Information Engineering, Harbin Institute of Technology, Harbin, 150001, China
| | - Cheng Pang
- Department of Microwave Engineering, School of Electronics and Information Engineering, Harbin Institute of Technology, Harbin, 150001, China
| | - Huimin Xiong
- Department of Microwave Engineering, School of Electronics and Information Engineering, Harbin Institute of Technology, Harbin, 150001, China
| | - Yayun Cheng
- Department of Microwave Engineering, School of Electronics and Information Engineering, Harbin Institute of Technology, Harbin, 150001, China
| | - Jiaran Qi
- Department of Microwave Engineering, School of Electronics and Information Engineering, Harbin Institute of Technology, Harbin, 150001, China
| |
Collapse
|
5
|
Cheng K, Deng C, Ye F, Li H, Shen F, Fan Y, Gong Y. Metasurface-Based Image Classification Using Diffractive Deep Neural Network. NANOMATERIALS (BASEL, SWITZERLAND) 2024; 14:1812. [PMID: 39591053 PMCID: PMC11597900 DOI: 10.3390/nano14221812] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/09/2024] [Revised: 11/08/2024] [Accepted: 11/09/2024] [Indexed: 11/28/2024]
Abstract
The computer-assisted inverse design of photonic computing, especially by leveraging artificial intelligence algorithms, offers great convenience to accelerate the speed of development and improve calculation accuracy. However, traditional thickness-based modulation methods are hindered by large volume and difficult fabrication process, making it hard to meet the data-driven requirements of flexible light modulation. Here, we propose a diffractive deep neural network (D2NN) framework based on a three-layer all-dielectric phased transmitarray as hidden layers, which can perform the classification of handwritten digits. By tailoring the radius of a silicon nanodisk of a meta-atom, the metasurface can realize the phase profile calculated by D2NN and maintain a relative high transmittance of 0.9 at a wavelength of 600 nm. The designed image classifier consists of three layers of phase-only metasurfaces, each of which contains 1024 units, mimicking a fully connected neural network through the diffraction of light fields. The classification task of handwriting digits from the '0' to '5' dataset is verified, with an accuracy of over 90% on the blind test dataset, as well as demonstrated by the full-wave simulation. Furthermore, the performance of the more complex animal image classification task is also validated by increasing the number of neurons to enhance the connectivity of the neural network. This study may provide a possible solution for practical applications such as biomedical detection, image processing, and machine vision based on all-optical computing.
Collapse
Affiliation(s)
- Kaiyang Cheng
- International School of Microelectronics, Dongguan University of Technology, Dongguan 523808, China; (K.C.); (C.D.); (F.Y.); (Y.G.)
| | - Cong Deng
- International School of Microelectronics, Dongguan University of Technology, Dongguan 523808, China; (K.C.); (C.D.); (F.Y.); (Y.G.)
| | - Fengyu Ye
- International School of Microelectronics, Dongguan University of Technology, Dongguan 523808, China; (K.C.); (C.D.); (F.Y.); (Y.G.)
| | - Hongqiang Li
- College of Electronic and Information Engineering, Tongji University, Shanghai 200092, China;
- The Institute of Dongguan Tongji University, Dongguan 523808, China
| | - Fei Shen
- International School of Microelectronics, Dongguan University of Technology, Dongguan 523808, China; (K.C.); (C.D.); (F.Y.); (Y.G.)
| | - Yuancheng Fan
- Key Laboratory of Light Field Manipulation and Information Acquisition, Ministry of Industry and Information Technology and School of Physical Science and Technology, Northwestern Polytechnical University, Xi’an 710129, China
| | - Yubin Gong
- International School of Microelectronics, Dongguan University of Technology, Dongguan 523808, China; (K.C.); (C.D.); (F.Y.); (Y.G.)
- National Key Laboratory on Vacuum Electronics, University of Electronic Science and Technology of China (UESTC), Chengdu 610054, China
| |
Collapse
|
6
|
Zhu W, Sun J, Cheng Y, Bai H, Han L, Wang Y, Song C, Pan F. Photoresponsive Two-Dimensional Magnetic Junctions for Reconfigurable In-Memory Sensing. ACS NANO 2024; 18:27009-27015. [PMID: 39288273 DOI: 10.1021/acsnano.4c09735] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/19/2024]
Abstract
Magnetic tunneling junctions (MTJs) lie in the core of magnetic random access memory, holding promise in integrating memory and computing to reduce hardware complexity, transition latency, and power consumption. However, traditional MTJs are insensitive to light, limiting their functionality in in-memory sensing─a crucial component for machine vision systems in artificial intelligence applications. Herein, the convergence of magnetic memory with optical sensing capabilities is achieved in the all-two-dimensional (2D) magnetic junction Fe3GaTe2/WSe2/Fe3GaTe2, which combines 2D magnetism and optoelectronic properties. The clean intrinsic band gap and prominent photoresponse of interlayer WSe2 endow the tunneling barrier with optical tunability. The on-off states of junctions and the magnetoresistance can be flexibly controlled by the intensity of the optical signal at room temperature. Based on the optical-tunable magnetoresistance in all-2D magnetic junctions, a machine vision system with the architecture of in-memory sensing and computing is constructed, which possesses high performance in image recognition. Our work exhibits the advantages of 2D magneto-electronic devices and extends the application scenarios of magnetic memory devices in artificial intelligence.
Collapse
Affiliation(s)
- Wenxuan Zhu
- Key Laboratory of Advanced Materials (MOE), School of Materials Science and Engineering, Tsinghua University, Beijing 100084,China
| | - Jiacheng Sun
- Beijing National Research Center for Information Science and Technology, Tsinghua University, Beijing 100084,China
| | - Yuan Cheng
- Beijing National Research Center for Information Science and Technology, Tsinghua University, Beijing 100084,China
- Department of Electronic Engineering, Tsinghua University, Beijing 100084,China
| | - Hua Bai
- Key Laboratory of Advanced Materials (MOE), School of Materials Science and Engineering, Tsinghua University, Beijing 100084,China
| | - Lei Han
- Key Laboratory of Advanced Materials (MOE), School of Materials Science and Engineering, Tsinghua University, Beijing 100084,China
| | - Yuyan Wang
- Beijing National Research Center for Information Science and Technology, Tsinghua University, Beijing 100084,China
| | - Cheng Song
- Key Laboratory of Advanced Materials (MOE), School of Materials Science and Engineering, Tsinghua University, Beijing 100084,China
| | - Feng Pan
- Key Laboratory of Advanced Materials (MOE), School of Materials Science and Engineering, Tsinghua University, Beijing 100084,China
| |
Collapse
|
7
|
Zhan Z, Wang H, Liu Q, Fu X. Optoelectronic nonlinear Softmax operator based on diffractive neural networks. OPTICS EXPRESS 2024; 32:26458-26469. [PMID: 39538511 DOI: 10.1364/oe.527843] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/19/2024] [Accepted: 06/26/2024] [Indexed: 11/16/2024]
Abstract
Softmax, a pervasive nonlinear operation, plays a pivotal role in numerous statistics and deep learning (DL) models such as ChatGPT. To compute it is expensive especially for at-scale models. Several software and hardware speed-up strategies are proposed but still suffer from low efficiency, poor scalability. Here we propose a photonic-computing solution including massive programmable neurons that is capable to execute such operation in an accurate, computation-efficient, robust and scalable manner. Experimental results show our diffraction-based computing system exhibits salient generalization ability in diverse artificial and real-world tasks (mean square error <10-5). We further analyze its performances against several realistic restricted factors. Such flexible system not only contributes to optimizing Softmax operation mechanism but may provide an inspiration of manufacturing a plug-and-play module for general optoelectronic accelerators.
Collapse
|
8
|
Xu Z, Zhou T, Ma M, Deng C, Dai Q, Fang L. Large-scale photonic chiplet Taichi empowers 160-TOPS/W artificial general intelligence. Science 2024; 384:202-209. [PMID: 38603505 DOI: 10.1126/science.adl1203] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2023] [Accepted: 03/11/2024] [Indexed: 04/13/2024]
Abstract
The pursuit of artificial general intelligence (AGI) continuously demands higher computing performance. Despite the superior processing speed and efficiency of integrated photonic circuits, their capacity and scalability are restricted by unavoidable errors, such that only simple tasks and shallow models are realized. To support modern AGIs, we designed Taichi-large-scale photonic chiplets based on an integrated diffractive-interference hybrid design and a general distributed computing architecture that has millions-of-neurons capability with 160-tera-operations per second per watt (TOPS/W) energy efficiency. Taichi experimentally achieved on-chip 1000-category-level classification (testing at 91.89% accuracy in the 1623-category Omniglot dataset) and high-fidelity artificial intelligence-generated content with up to two orders of magnitude of improvement in efficiency. Taichi paves the way for large-scale photonic computing and advanced tasks, further exploiting the flexibility and potential of photonics for modern AGI.
Collapse
Affiliation(s)
- Zhihao Xu
- Sigma Laboratory, Department of Electronic Engineering, Tsinghua University, Beijing, China
- Beijing National Research Center for Information Science and Technology (BNRist), Beijing, China
- Tsinghua Shenzhen International Graduate School, Shenzhen, China
| | - Tiankuang Zhou
- Sigma Laboratory, Department of Electronic Engineering, Tsinghua University, Beijing, China
- Beijing National Research Center for Information Science and Technology (BNRist), Beijing, China
- Institute for Brain and Cognitive Science, Tsinghua University (THUIBCS), Beijing, China
| | - Muzhou Ma
- Sigma Laboratory, Department of Electronic Engineering, Tsinghua University, Beijing, China
| | - ChenChen Deng
- Beijing National Research Center for Information Science and Technology (BNRist), Beijing, China
| | - Qionghai Dai
- Beijing National Research Center for Information Science and Technology (BNRist), Beijing, China
- Institute for Brain and Cognitive Science, Tsinghua University (THUIBCS), Beijing, China
- Department of Automation, Tsinghua University, Beijing, China
| | - Lu Fang
- Sigma Laboratory, Department of Electronic Engineering, Tsinghua University, Beijing, China
- Beijing National Research Center for Information Science and Technology (BNRist), Beijing, China
- Institute for Brain and Cognitive Science, Tsinghua University (THUIBCS), Beijing, China
| |
Collapse
|
9
|
Cheng Y, Zhang J, Zhou T, Wang Y, Xu Z, Yuan X, Fang L. Photonic neuromorphic architecture for tens-of-task lifelong learning. LIGHT, SCIENCE & APPLICATIONS 2024; 13:56. [PMID: 38403652 PMCID: PMC10894876 DOI: 10.1038/s41377-024-01395-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/05/2023] [Revised: 01/08/2024] [Accepted: 01/24/2024] [Indexed: 02/27/2024]
Abstract
Scalable, high-capacity, and low-power computing architecture is the primary assurance for increasingly manifold and large-scale machine learning tasks. Traditional electronic artificial agents by conventional power-hungry processors have faced the issues of energy and scaling walls, hindering them from the sustainable performance improvement and iterative multi-task learning. Referring to another modality of light, photonic computing has been progressively applied in high-efficient neuromorphic systems. Here, we innovate a reconfigurable lifelong-learning optical neural network (L2ONN), for highly-integrated tens-of-task machine intelligence with elaborated algorithm-hardware co-design. Benefiting from the inherent sparsity and parallelism in massive photonic connections, L2ONN learns each single task by adaptively activating sparse photonic neuron connections in the coherent light field, while incrementally acquiring expertise on various tasks by gradually enlarging the activation. The multi-task optical features are parallelly processed by multi-spectrum representations allocated with different wavelengths. Extensive evaluations on free-space and on-chip architectures confirm that for the first time, L2ONN avoided the catastrophic forgetting issue of photonic computing, owning versatile skills on challenging tens-of-tasks (vision classification, voice recognition, medical diagnosis, etc.) with a single model. Particularly, L2ONN achieves more than an order of magnitude higher efficiency than the representative electronic artificial neural networks, and 14× larger capacity than existing optical neural networks while maintaining competitive performance on each individual task. The proposed photonic neuromorphic architecture points out a new form of lifelong learning scheme, permitting terminal/edge AI systems with light-speed efficiency and unprecedented scalability.
Collapse
Affiliation(s)
- Yuan Cheng
- Sigma Laboratory, Department of Electronic Engineering, Tsinghua University, Beijing, 100084, China
- Beijing National Research Center for Information Science and Technology (BNRist), Beijing, 100084, China
| | - Jianing Zhang
- Sigma Laboratory, Department of Electronic Engineering, Tsinghua University, Beijing, 100084, China
- Beijing National Research Center for Information Science and Technology (BNRist), Beijing, 100084, China
| | - Tiankuang Zhou
- Sigma Laboratory, Department of Electronic Engineering, Tsinghua University, Beijing, 100084, China
- Beijing National Research Center for Information Science and Technology (BNRist), Beijing, 100084, China
| | - Yuyan Wang
- Beijing National Research Center for Information Science and Technology (BNRist), Beijing, 100084, China
| | - Zhihao Xu
- Sigma Laboratory, Department of Electronic Engineering, Tsinghua University, Beijing, 100084, China
| | - Xiaoyun Yuan
- Sigma Laboratory, Department of Electronic Engineering, Tsinghua University, Beijing, 100084, China
- Institute for Brain and Cognitive Science, Tsinghua University (THUIBCS), Beijing, 100084, China
| | - Lu Fang
- Sigma Laboratory, Department of Electronic Engineering, Tsinghua University, Beijing, 100084, China.
- Beijing National Research Center for Information Science and Technology (BNRist), Beijing, 100084, China.
- Institute for Brain and Cognitive Science, Tsinghua University (THUIBCS), Beijing, 100084, China.
| |
Collapse
|
10
|
Ouyang H, Zhao Z, Tao Z, You J, Cheng X, Jiang T. Parallel edge extraction operators on chip speed up photonic convolutional neural networks. OPTICS LETTERS 2024; 49:838-841. [PMID: 38359195 DOI: 10.1364/ol.517583] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/03/2024] [Accepted: 01/17/2024] [Indexed: 02/17/2024]
Abstract
We experimentally establish a 3 × 3 cross-shaped micro-ring resonator (MRR) array-based photonic multiplexing architecture relying on silicon photonics to achieve parallel edge extraction operations in images for photonic convolution neural networks. The main mathematical operations involved are convolution. Precisely, a faster convolutional calculation speed of up to four times is achieved by extracting four feature maps simultaneously with the same photonic hardware's structure and power consumption, where a maximum computility of 0.742 TOPS at an energy cost of 48.6 mW and a convolution accuracy of 95.1% is achieved in an MRR array chip. In particular, our experimental results reveal that this system using parallel edge extraction operators instead of universal operators can improve the imaging recognition accuracy for CIFAR-10 dataset by 6.2% within the same computing time, reaching a maximum of 78.7%. This work presents high scalability and efficiency of parallel edge extraction chips, furnishing a novel, to the best of our knowledge, approach to boost photonic computing speed.
Collapse
|
11
|
Zhou H, Zhao C, He C, Huang L, Man T, Wan Y. Optical computing metasurfaces: applications and advances. NANOPHOTONICS (BERLIN, GERMANY) 2024; 13:419-441. [PMID: 39635656 PMCID: PMC11501951 DOI: 10.1515/nanoph-2023-0871] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 12/01/2023] [Accepted: 01/15/2024] [Indexed: 12/07/2024]
Abstract
Integrated photonic devices and artificial intelligence have presented a significant opportunity for the advancement of optical computing in practical applications. Optical computing technology is a unique computing system based on optical devices and computing functions, which significantly differs from the traditional electronic computing technology. On the other hand, optical computing technology offers the advantages such as fast speed, low energy consumption, and high parallelism. Yet there are still challenges such as device integration and portability. In the burgeoning development of micro-nano optics technology, especially the deeply ingrained concept of metasurface technique, it provides an advanced platform for optical computing applications, including edge detection, image or motion recognition, logic computation, and on-chip optical computing. With the aim of providing a comprehensive introduction and perspective for optical computing metasurface applications, we review the recent research advances of optical computing, from nanostructure and computing methods to practical applications. In this work, we review the challenges and analysis of optical computing metasurfaces in engineering field and look forward to the future development trends of optical computing.
Collapse
Affiliation(s)
- Hongqiang Zhou
- School of Physics and Optoelectronic Engineering, Beijing University of Technology, Beijing100124, China
| | - Chongli Zhao
- School of Physics and Optoelectronic Engineering, Beijing University of Technology, Beijing100124, China
| | - Cong He
- Beijing Engineering Research Center of Mixed Reality and Advanced Display, School of Optics and Photonics, Beijing Institute of Technology, Beijing100081, China
| | - Lingling Huang
- Beijing Engineering Research Center of Mixed Reality and Advanced Display, School of Optics and Photonics, Beijing Institute of Technology, Beijing100081, China
| | - Tianlong Man
- School of Physics and Optoelectronic Engineering, Beijing University of Technology, Beijing100124, China
| | - Yuhong Wan
- School of Physics and Optoelectronic Engineering, Beijing University of Technology, Beijing100124, China
| |
Collapse
|
12
|
Jia Q, Zhang Y, Shi B, Li H, Li X, Feng R, Sun F, Cao Y, Wang J, Qiu CW, Ding W. Vector vortex beams sorting of 120 modes in visible spectrum. NANOPHOTONICS (BERLIN, GERMANY) 2023; 12:3955-3962. [PMID: 39635195 PMCID: PMC11501642 DOI: 10.1515/nanoph-2023-0482] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 07/29/2023] [Accepted: 09/25/2023] [Indexed: 12/07/2024]
Abstract
Polarization (P), angular index (l), and radius index (p) are three independent degrees of freedom (DoFs) of vector vortex beams, which have found extensive applications in various domains. While efficient sorting of a single DoF has been achieved successfully, simultaneous sorting of all these DoFs in a compact and efficient manner remains a challenge. In this study, we propose a beam sorter that simultaneously handles all the three DoFs using a diffractive deep neural network (D2NN), and demonstrate the robust sorting of 120 Laguerre-Gaussian (LG) modes experimentally in the visible spectrum. Our proposed beam sorter underscores the considerable potential of D2NN in optical field manipulation and promises to enhance the diverse applications of vector vortex beams.
Collapse
Affiliation(s)
- Qi Jia
- Institute of Advanced Photonics, School of Physics, Harbin Institute of Technology, Harbin150001, China
| | - Yanxia Zhang
- Institute of Advanced Photonics, School of Physics, Harbin Institute of Technology, Harbin150001, China
| | - Bojian Shi
- Institute of Advanced Photonics, School of Physics, Harbin Institute of Technology, Harbin150001, China
| | - Hang Li
- Institute of Advanced Photonics, School of Physics, Harbin Institute of Technology, Harbin150001, China
| | - Xiaoxin Li
- Institute of Advanced Photonics, School of Physics, Harbin Institute of Technology, Harbin150001, China
| | - Rui Feng
- Institute of Advanced Photonics, School of Physics, Harbin Institute of Technology, Harbin150001, China
| | - Fangkui Sun
- Institute of Advanced Photonics, School of Physics, Harbin Institute of Technology, Harbin150001, China
| | - Yongyin Cao
- Institute of Advanced Photonics, School of Physics, Harbin Institute of Technology, Harbin150001, China
| | - Jian Wang
- School of Physics, Harbin Institute of Technology, Harbin150001, China
| | - Cheng-Wei Qiu
- Department of Electrical and Computer Engineering, National University of Singapore, Singapore117583, Singapore
| | - Weiqiang Ding
- Institute of Advanced Photonics, School of Physics, Harbin Institute of Technology, Harbin150001, China
- Collaborative Innovation Center of Extreme Optics, Shanxi University, Taiyuan030006, Shanxi, China
| |
Collapse
|
13
|
Feng J, Chen H, Yang D, Hao J, Lin J, Jin P. Multi-wavelength diffractive neural network with the weighting method. OPTICS EXPRESS 2023; 31:33113-33122. [PMID: 37859098 DOI: 10.1364/oe.499840] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/05/2023] [Accepted: 09/04/2023] [Indexed: 10/21/2023]
Abstract
Recently, the diffractive deep neural network (D2NN) has demonstrated the advantages to achieve large-scale computational tasks in terms of high speed, low power consumption, parallelism, and scalability. A typical D2NN with cascaded diffractive elements is designed for monochromatic illumination. Here, we propose a framework to achieve the multi-wavelength D2NN (MW-D2NN) based on the method of weight coefficients. In training, each wavelength is assigned a specific weighting and their output planes construct the wavelength weighting loss function. The trained MW-D2NN can implement the classification of images of handwritten digits at multi-wavelength incident beams. The designed 3-layers MW-D2NN achieves a simulation classification accuracy of 83.3%. We designed a 1-layer MW-D2NN. The simulation and experiment classification accuracy are 71.4% and 67.5% at RGB wavelengths. Furthermore, the proposed MW-D2NN can be extended to intelligent machine vision systems for multi-wavelength and incoherent illumination.
Collapse
|
14
|
Liu W, Fu T, Huang Y, Sun R, Yang S, Chen H. C-DONN: compact diffractive optical neural network with deep learning regression. OPTICS EXPRESS 2023; 31:22127-22143. [PMID: 37381294 DOI: 10.1364/oe.490072] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/21/2023] [Accepted: 06/08/2023] [Indexed: 06/30/2023]
Abstract
A new method to improve the integration level of an on-chip diffractive optical neural network (DONN) is proposed based on a standard silicon-on-insulator (SOI) platform. The metaline, which represents a hidden layer in the integrated on-chip DONN, is composed of subwavelength silica slots, providing a large computation capacity. However, the physical propagation process of light in the subwavelength metalinses generally requires an approximate characterization using slot groups and extra length between adjacent layers, which limits further improvements of the integration of on-chip DONN. In this work, a deep mapping regression model (DMRM) is proposed to characterize the process of light propagation in the metalines. This method improves the integration level of on-chip DONN to over 60,000 and elimnates the need for approximate conditions. Based on this theory, a compact-DONN (C-DONN) is exploited and benchmarked on the Iris plants dataset to verify the performance, yielding a testing accuracy of 93.3%. This method provides a potential solution for future large-scale on-chip integration.
Collapse
|
15
|
Wetherfield B, Wilkinson TD. Planar Fourier optics for slab waveguides, surface plasmon polaritons, and 2D materials. OPTICS LETTERS 2023; 48:2945-2948. [PMID: 37262250 DOI: 10.1364/ol.491576] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/24/2023] [Accepted: 04/27/2023] [Indexed: 06/03/2023]
Abstract
Recent experimental work has demonstrated the potential of combining the merits of diffractive and on-chip photonic information processing devices in a single chip by making use of planar (or slab) waveguides. Here, arguments are developed to show that diffraction formulas familiar from 3D Fourier optics can be adapted to 2D under certain mild conditions on the operating speeds of the devices in question. In addition to serving those working in on-chip photonics, this Letter provides analytical tools for the study of surface plasmon polaritons, surface waves, and the optical, acoustic, and crystallographic properties of 2D materials.
Collapse
|
16
|
Bai Y, Xu X, Tan M, Sun Y, Li Y, Wu J, Morandotti R, Mitchell A, Xu K, Moss DJ. Photonic multiplexing techniques for neuromorphic computing. NANOPHOTONICS (BERLIN, GERMANY) 2023; 12:795-817. [PMID: 39634356 PMCID: PMC11501529 DOI: 10.1515/nanoph-2022-0485] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 08/21/2022] [Revised: 11/01/2022] [Accepted: 12/03/2022] [Indexed: 12/07/2024]
Abstract
The simultaneous advances in artificial neural networks and photonic integration technologies have spurred extensive research in optical computing and optical neural networks (ONNs). The potential to simultaneously exploit multiple physical dimensions of time, wavelength and space give ONNs the ability to achieve computing operations with high parallelism and large-data throughput. Different photonic multiplexing techniques based on these multiple degrees of freedom have enabled ONNs with large-scale interconnectivity and linear computing functions. Here, we review the recent advances of ONNs based on different approaches to photonic multiplexing, and present our outlook on key technologies needed to further advance these photonic multiplexing/hybrid-multiplexing techniques of ONNs.
Collapse
Affiliation(s)
- Yunping Bai
- State Key Laboratory of Information Photonics and Optical Communications, Beijing University of Posts and Telecommunications, Beijing100876, China
| | - Xingyuan Xu
- State Key Laboratory of Information Photonics and Optical Communications, Beijing University of Posts and Telecommunications, Beijing100876, China
| | - Mengxi Tan
- Faculty of Engineering, RMIT University, Melbourne, VIC3001, Australia
| | - Yang Sun
- Optical Sciences Centre, Swinburne University of Technology, Hawthorn, VIC3122, Australia
| | - Yang Li
- Optical Sciences Centre, Swinburne University of Technology, Hawthorn, VIC3122, Australia
| | - Jiayang Wu
- Optical Sciences Centre, Swinburne University of Technology, Hawthorn, VIC3122, Australia
| | - Roberto Morandotti
- INRS-Énergie, Matériaux et Télécommunications, 1650 Boulevard Lionel-Boulet, Varennes, QCJ3X 1S2, Canada
| | - Arnan Mitchell
- Faculty of Engineering, RMIT University, Melbourne, VIC3001, Australia
| | - Kun Xu
- State Key Laboratory of Information Photonics and Optical Communications, Beijing University of Posts and Telecommunications, Beijing100876, China
| | - David J. Moss
- Optical Sciences Centre, Swinburne University of Technology, Hawthorn, VIC3122, Australia
| |
Collapse
|
17
|
Sadeghzadeh H, Koohi S. Translation-invariant optical neural network for image classification. Sci Rep 2022; 12:17232. [PMID: 36241863 PMCID: PMC9568607 DOI: 10.1038/s41598-022-22291-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2022] [Accepted: 10/12/2022] [Indexed: 01/06/2023] Open
Abstract
The classification performance of all-optical Convolutional Neural Networks (CNNs) is greatly influenced by components' misalignment and translation of input images in the practical applications. In this paper, we propose a free-space all-optical CNN (named Trans-ONN) which accurately classifies translated images in the horizontal, vertical, or diagonal directions. Trans-ONN takes advantages of an optical motion pooling layer which provides the translation invariance property by implementing different optical masks in the Fourier plane for classifying translated test images. Moreover, to enhance the translation invariance property, global average pooling (GAP) is utilized in the Trans-ONN structure, rather than fully connected layers. The comparative studies confirm that taking advantage of vertical and horizontal masks along GAP operation provide the best translation invariance property, compared to the alternative network models, for classifying horizontally and vertically shifted test images up to 50 pixel shifts of Kaggle Cats and Dogs, CIFAR-10, and MNIST datasets, respectively. Also, adopting the diagonal mask along GAP operation achieves the best classification accuracy for classifying translated test images in the diagonal direction for large number of pixel shifts (i.e. more than 30 pixel shifts). It is worth mentioning that the proposed translation invariant networks are capable of classifying the translated test images not included in the training procedure.
Collapse
Affiliation(s)
- Hoda Sadeghzadeh
- Department of Computer Engineering, Sharif University of Technology, Tehran, Iran
| | - Somayyeh Koohi
- Department of Computer Engineering, Sharif University of Technology, Tehran, Iran.
| |
Collapse
|