Universal Chiplet Interconnect Express Notes

This article is based on the references and idea from Dr. Zhen Zhou and Prof. Yang Yi, thank you for your guidance and help.

  • Wave physics as an analog recurrent neural network1
  • A Radio Frequency Analog Computer for Computational Electromagnetics2
  • Universal Chiplet Interconnect Express (UCIe): An Open Industry Standard for Innovations With Chiplets at Package Level3
  • Challenges and recent prospectives of 3D heterogeneous integration4
  • Chiplet Technology and Heterogenous Integration5
  • Chiplet Technology & Heterogeneous Integration6

Overview

UCIe is short for Universial Chiplet Interconnect Express7. UCIe Cononsortium provides whitepapers for V1.08 as well as for V1.19. Intel’s Dr. Debendra Das Sharma10 writes the paper that defines UCIe. In addition, it provides webinars11 by embedding YouTube videos in the webpage. UCIe Consortium also has a YouTube channel12 that contains lots of useful tutorials. Spepcifications13 could not be accessed to the general public as only employees of partners are able to see.

The following table provides a high level overview of UCIe architecture from both hardware perspective on the left side to the software protocol stack on the right side of the table.

PHYRDID2D AdapterFDIProtocol
namePhysical LayerRaw Die to Die InterfaceDie to Die AdapterFlit-aware Die to Die InterfaceProtocol Layer
Concepts81️⃣Link Traning
2️⃣Lane Repair
3️⃣Lane Reversal
4️⃣Scrambling
5️⃣Sidband
6️⃣Analog Front End
7️⃣Clock Forwarding
1️⃣ARB/MUX
2️⃣CRC/Retry
3️⃣Link State Mgmt
4️⃣Parameter Negotiation
1️⃣PCIe14
2️⃣CXL1516
🔹CXL.io
🔹CXL.cache
🔹CXL.mem
3️⃣Streaming17
4️⃣Raw
ExtensionPackagingFlit
UCIe layering approach, Page 13 of slides18 has a flowchart as a goal to achieve.
PHYD2DProtocol
CXLARB/MUX
CRClightweight 16 bit triple bit detection, converge at high frequency, pipeline.
RETRYsimplify from PCIe by selective Nak19
LSM20Link state management: Reset/Active/PowerMgmt/Error flows
PN21PN with remote link partner22
PCI
CXL
Steamingsupport Raw formats
UCIe Die-to-Die Adapter Concepts

Hardware

This section discusses the hardware aspect of UCIe™ as well as advanced packaging for chiplets.

Packaging

Packaging for integrated circuit contains knowledge and technology from various aspects, the following table is aiming to summarize the different categories of technology on chiplets packaging.

CategoriesDetails
Die/Package Stacking2.5D, 3D
Die Thinning/Stacking
Package on Package
Package in Package
Wafer Level PackageWafer Bumping
Cu Pillar
Fan-out WLP
Wafer Level IPD
Wafer Level MEMS
InterconnectionCap Wire Bond
Wire Bond
Flip Chip
Embedded SubstrateSESUB
a-EASI
Double Side MoldingDouble Side Molding
Selective Molding
Flexible Encapsulation
Molded Underfill
Irregular Packaging
AntennaAntenna on Package
Antenna in Package
Board AssemblyHigh Density SMT
ACF Bonding
Wire Bond on Flex
Laser Welding
Flex Bending
ShieldingConformal Shielding
Compartment Shielding
Selective Shielding
Magnetic shielding
System in Package Technology23

UCIe V1.08 supports 2 types of packaging, namely standard package(UCIe-S) and advanced package(UCIe-A).

Standard PackageAdvanced Package
Characteristics
Data Rate per Lane(GT/s)244, 8, 12, 16, 24, 324, 8, 12, 16, 24, 32
Width/Configuration(each cluster)16 Full Duplex64 Full Duplex
Bump pitch(\mu m)100-13025-55
Channel Reach(mm)10\leqslant L \leqslant 25\leqslant 2
Raw Bit Error Rate(BER)◾️10^{27} 16GT/s short reach
◾️10^{15} 16GT/s long reach
◾️10^{27} 12GT/s
◾️10^{15} 16GT/s
Targets for Key Metrics
B/W Shoreline(GB/s/mm)28-224165-1317
B/W Density(GB/s/mm^{2})22-125188-1350
Bump
bump maps!!◾️\times16
◾️\times32
◾️8-columns,
◾️10-columns
◾️16-columns
❗️UCIe-A area/column type efficiency plots25
UCIe™ Packaging Specs Comparison26, Table I10, Table I27

UCIe™ at the moment(V1.1) only supports 2D and 2.5D packaging, 2 figures are placed in this article for illustration purpose. The following figure Figure 4: UCIe : Layering Approach and different packaging choices is extracted from V1.0 of whitepaper8.

The standard/laminate packaging (2D) is used for cost-effective performance. The advanced packaging(2.5D) is used for power-efficient performance. There are multiple commercially available options, some of which are shown in the diagram. UCIe specification embraces all types of packaging choices in these categories.

Universal Chiplet Interconnect Express (UCIe)®: Building an open chiplet ecosystem8

The following figure is from Figure 4 of V1.1 whitepaper9.

Heterogeneous Integrations

There are several books that give a comprehensive description on heterogeneous integration which enable readers to quickly learn about the basics of problem-solving methods and understand the trade-offs inherent in making system-level decisions.

“Heterogeneous Integrations”28

“Chiplet Design and Heterogeneous Integration Packaging”29

_https://semiengineering.com/is-ucie-really-universal/

https://www.computer.org/csdl/magazine/mi/2023/02/10013705/1JP1z96RPP2

_https://docs.pingcode.com/info/34526.html

UCIe-S Bump map

The video30 explains the routing of UCIe-S bump maps with details.

checkboard arrangement is useful for interrupt concerns, mechanical compatibility.

UCIe-A area/column type efficiency plots31

UCIe-A interoperability across bump pitch range32: fixed beachfront, signal ordering rules.

UCIe considers not only the protocol software side, the hardware side regarding electrical and packaging are highly emphasized for chiplets design.

Flit mean flow control unit.

CXL: compute express link33

Wafer Level Packaging

Fan-In

Fan-Out: any package with connections fanned-out of the chip surface, enabling more external I/Os.34 A comparision of Fan-In and Fan-Out illustration for a RF module.35

FOWLP: Fan-Out Wafer Level Packaging, this image36 shows how the FOWLP based chip is made.

Interposer37

Interposer contains the following characteristics:

  • Purpose: electronic substrate to interconnect between the fine-pitch I/Os at the die level on the top side to the coarser dimensional features on the package on the bottom side of the interposer.37
  • TSV: Through-silicon Via, illustration of TSV on High Bandwidth Mememory38, in reference39 page 10 figure 17, illustrations of how RDL is manufactured is presented.
  • TPV: Through-package Via
  • RDL: redistribution layers on both sides of the interposer, in reference39 page 10 figure 16, illustrations of how RDL is manufactured is presented.

Through-silicon Via40

  • Via First
  • Via Middle
  • Via Last

Advanced Packaging Technology

There is an incredible number of advanced packaging types and brand names from Intel (EMIB, Foveros, Foveros Omni, Foveros Direct), TSMC (InFO-OS, InFO-LSI, InFO-SOW, InFO-SoIS, CoWoS-S, CoWoS­-R, CoWoS-L, SoIC), Samsung (FOSiP, X-Cube, I-Cube, HBM, DDR/LPDDR DRAM, CIS), ASE (FoCoS, FOEB), Sony (CIS), Micron (HBM), SKHynix (HBM), and YMTC (XStacking).

Advanced Packaging Part 2 – Review Of Options/Use From Intel, TSMC, Samsung, AMD, ASE, Sony, Micron, SKHynix, YMTC, Tesla, and Nvidia41
TechnologyCorporationCategory
EMIB4243Intel2.5D🔵thin pieces of silicon with multi-layer BEOL44 interconnects, embedded in organic substrates. Fig 4(A)43
❗️limitations of interposer SI and enable similar inter-die bandwidth at lower cost45
Foveros4647Intel2.5D/3D48🔬49, with EMIB named Co-EMIB50
🔵F2F \mubumps for Foveros
🔵Cu Column for Foveros Omni
Foveros DirectIntelHybrid Bonding48🔵Cu to Cu Bonding for Foveros Direct51
ODI5245Intel3D🔵Type I: sharing
🔵Type II: single die
🔵Cupper Pillar or Package
CoWoS53TSMC2.5D
Package
Chip Last
🔬54, CoWoS®-S structure55, NEC SX-Aurora56, high-K MiM57, iCAP58
AMD EPYC MI30059🔬60
CoWoS-R61TSMC2.5D
Package
Chip Last
🔬62,
🔴InFO technology to utilize RDL interposer which comprised of polymer and copper traces.
🔴Build bigger chips: adding high-bandwidth memory (HBM) stacks to one or more processors.63
CoWoS-L64TSMC2.5D
Package
Chip Last
🔬65,
🔴interposer with LSI66 chip
🔴RDL layers for power and signal delivery
InFO67TSMCPackage
Chip First
🔴InFO_oS68: LSI66+RDL
🔴InFO_PoP🔬69: FOWLP70+PoP71, DRAM TIV72 connect to Fan-Out processor. InFO_B73
❗️InFO_PoP vs CoWoS vs SoIC🔬74
🔴InFO_LSI: InFO_oS+LSI on Substrate66
🔴InFO_SoIS: System-on-Integrate Substrate55
🔴InFO_SoW: System-on-Wafer
SoIC75TSMC3D
Chip Stack76
🔴CoW77:
🔴WoW78: bond face to face with same size79.
❗️CoW and WoW comparison: 🔬80
❗️SoIC: Fronend, chiplet; InFO&CoWoS: Backend, integration.
AMD EPYC MI30059🔬60
I-Cube81Samsung822.5D
H-Cube83Samsung2.5D
X-Cube84Samsung3D⚪️compete with SoIC
ASE
Sony
Micron
SKHynix85
YMTC86
Packaging technologies39

Concepts for packaging

2.5D vs 3D: The major difference between 2.5D and 3D are 2.5D uses passive silicon where 3D uses active silicon which not only has electrical connections but has integrated circuits as well.87 In 2.5D structure, two or more active semiconductor chips are placed side-by-side on a silicon interposer for achieving extremely high die-to-die interconnect density. In 3D structure, active chips are integrated by die stacking for shortest interconnect and smallest package footprint.88 2.5D strcture needs interposer while 3D strcture enables direct vertical connection between dies at page 11 of 39.

chip first vs chip last: InFO is a chip first process where the chip is placed first, then build the RDLs are built around it. With CoWoS, the RDLs are built up then the chip is placed.

Hybrid Packaging

Optics and UCIe: the article89 figure 6 contains the evolution of integration between optical data connection and IC to have larger bandwitdh for Co-packaged datacenter optics.

RawStandard 256 bit flit mode
Stream✅ only90
UCIe 1.0

PHY Layer

Module

A pair of physical interface between dies is defined as module, with main band for data transfer and side band for connection overhead.

The following figure91 shows the connection of modules for UCIe-S as well as UCIe-A.

  • UCIe-S: 16TX, 16 RX, Side Band DATA, CLK

Die 2 Die Adapter

Flit formats

Contents

flit header carries: protocol identifier, stack identifier92, sequence number, Ack/Nak completion, pause of data stream indication.

pause of data stream indication(PDS): as for 68 byte size is note a multiple of lane numbers, an pause indication is needed, which is a dedicated header followed by all 0 for receiver to react.

Types

Raw format: bypass D2D which can be used directly on PHY layer.

68 Flit format: this stems form CXL flit format, 64 bytes from FDI, 4 bytes inserted in D2D, 2 bytes of flit header, 2 bytes of CRC, data shifting to for RDI as it allows 64 bytes flit size.

Standard 256B End Header Flit Format: this design is related to how data buffer works thus header is placed at the end.

Standard 256B Start Header Flit Format: CXL 256 Flit Model protocol

Latency Optimized 256B without Optional bytes:

Latency Optimized 256B with Optional bytes:

FormatNamePCIe Non-Flit ModePCIe Flit ModeCXL 68B Flit ModeCXL 256B Flit ModeStreaming17
1RawOptionalOptionalOptionalOptionalMandatory
268BMandatory🚫Mandatory🚫🚫8, ✅9
3Std 256B End Header🚫Mandatory🚫🚫🚫8, ✅9
4Std 256B Start Header🚫Optional🚫Mandatory🚫8, ✅9
5Latency Optmized 256B without optional bytes🚫🚫🚫Optional🚫8, ✅9
6Latency Optimized 256B with optional bytes🚫Strongly Recommended🚫Strongly Recommended🚫8, ✅9
Protocol and Flit Format Matrix93

Unraveling PCIe 6.0 FLIT Mode Challenges

Link Initialization Flow

Link Training State Machine

Expand from UCIe

This section contains some papers that apply concepts form chiplet with machine learning, aiming to achieve better performance, scalability, energy efficiency.

Computing In-Memory

“SIAM: Chiplet-based Scalable In-Memory Acceleration with Mesh for Deep Neural Networks”94

  • 📌benchmarking simulator named SIAM to evaluate the performance of chiplet-based IMC architectures.
  • 📎SIAM integrates device, circuit, architecture, network-on-chip (NoC), network-on-package (NoP), and DRAM access models to realize an end-to-end system.
  • ✅The chiplet-based IMC architecture obtained through SIAM shows 130\times and 72\times improvement in energy-efficiency for ResNet-50 on the ImageNet dataset compared to Nvidia V10095 and T496 GPUs.

“Neuromorphic Computing Based on CMOS-Integrated Memristive Arrays: Current State and Perspectives”97

“SWAP: A Server-Scale Communication-Aware Chiplet-Based Manycore PIM Accelerator”98

  • 📌a novel server scale 2.5-D manycore architecture called SWAP that accounts for the traffic characteristics of DL applications.
  • ✅SWAP achieves significant performance and energy consumption improvements with much lower fabrication cost than state-of-the-art network-on-package NoP topologies.

“In-Memory Computing based Acceleration: Large-Scale to Edge Computing”99

  • 💡heterogeneous big-little chiplet-based IMC architecture that utilizes big and little IMC-based chiplets coupled with an optimal network-on-package or NoP configuration.
  • 💡on-Chip Training: a ReRAM-based in-memory computing accelerator for on-chip training and inference of millimeter Wave (mmWave) CNN and inference of RGB CNN models for personalized home-based rehabilitation systems.
  • ✅on-chip training and inference using IMC architectures can enable energy-efficient edge computing.

Neuromorphic Computing

“Chiplet-based Architecture Design for Multi-Core Neuromorphic Processor”100

  • 📌chiplet-based architecture for a multi-core neuromorphic processor.
  • 💡multiple neural processing chiplets are connected together through the interposer.
  • 📎neuromorphic chip101 contains programmable neuron and routers for spike handling and topology mapping.

Spiking Neural Network

“Abisko: Deep codesign of an architecture for spiking neural networks using novel neuromorphic materials”102

  • 📌designed as a chiplet that can be deployed in contemporary computer architectures and investigating novel neuromorphic materials to improve its design.
  • 📌developing a productive software stack for the neuromorphic accelerator that will also be portable.

Understand UCIe from other stacks

If we consider UCIe to be the data transmission on the chip package scale, PCIe is on the data transmission on the device scale, then Ethernet/Internet is the data transmission on the network scale. Through they are targeting on different problems with different addresses, there are some similiarities between those protocol stacks.

OSI(Ethernet Protocol)

Perhaps OSI 7 layer netwrok protocol model is one of the most widely recognized protocols in the world, it is designed to separate different functionalities based on a hierarchy structure, which is definitely similar to UCIe.

  1. Hughes T W, Williamson I A D, Minkov M, et al. Wave physics as an analog recurrent neural network[J]. Science advances, 2019, 5(12): eaay6946. 📁[]
  2. Udayanga, Nilan, Arjuna Madanayake, S. I. Hariharan, Jifu Liang, Soumyajit Mandal, Leonid Belostotski, and Len T. Bruton. “A radio frequency analog computer for computational electromagnetics.” IEEE Journal of Solid-State Circuits 56, no. 2 (2020): 440-454.[]
  3. Sharma, Debendra Das, Gerald Pasdast, Zhiguo Qian, and Kemal Aygun. “Universal chiplet interconnect express (UCIe): An open industry standard for innovations with chiplets at package level.” IEEE Transactions on Components, Packaging and Manufacturing Technology 12, no. 9 (2022): 1423-1431.[]
  4. Zhang, S., Z. Li, H. Zhou, R. Li, S. Wang, K. W. Paik, and P. He. “Challenges and recent prospectives of 3D heterogeneous integration.” E-Prime-Advances in Electrical Engineering, Electronics and Energy 2 (2022): 100052. 📁[]
  5. Beyene, Wendemagegnehu T. “Chiplet Technology and Heterogeneous Integration.” 📁[]
  6. Ramamurthy, Anu. “Chiplet Technology & Heterogeneous Integration”, Microchip, (2021). 📁[]
  7. Universial Chiplet Interconnect Express Offical Website[]
  8. Sharma, Debendra D. “Universal chiplet interconnect express (UCIe): Building an open chiplet ecosystem.” UCIE whitepaper (2022).[][][][][][][][][][]
  9. UCIeTM 1.1 Specification: Backward Compatible Evolution of UCIe for Driving an Open Chiplet Ecosystem with New Usage Models[][][][][][][]
  10. Sharma, Debendra Das, et al. “Universal chiplet interconnect express (UCIe): An open industry standard for innovations with chiplets at package level.” IEEE Transactions on Components, Packaging and Manufacturing Technology 12.9 (2022): 1423-1431. 📁[][]
  11. UCIe Webinar recordings[]
  12. UCIe Consortium[]
  13. UCIe™ Specifications[]
  14. PCI-SIG: Peripheral Component Interconnect Special Interest Group[]
  15. Compute Express Link™ (CXL™) is an industry-supported Cache-Coherent Interconnect for Processors, Memory Expansion and Accelerators. []
  16. CXL Consortium[]
  17. e.g., AXI, CHI, SMP coherency protocols, SFI, and CPI[][]
  18. Open Domain Specific Architecture(ODSA)[]
  19. Unraveling PCIe 6.0 FLIT Mode Challenges[]
  20. Link State Management[]
  21. Parameter Negotiation[]
  22. for every protocol, UCIe defines 1 mandatory flit mode and optional ones[]
  23. ASE, System in Package (SiP)[]
  24. Transfer per second is also known as sample rate, i.e. the number of data samples captured per second, each sample normally occurring at the clock edge.[]
  25. UCIe Consortium, UCIe™ Packaging Technologies Webinar📺[]
  26. UCIe Consortium, UCIe™ Packaging Technologies Webinar📺[]
  27. Universal Chiplet Interconnect Express (UCIe)®: An open standard for developing a successful chiplet ecosystem. 📁[]
  28. Lau, John H. Heterogeneous integrations. Springer Singapore, 2019.[]
  29. Lau, John H. Chiplet design and heterogeneous integration packaging. Springer Nature, 2023.[]
  30. UCIe Consortium, UCIe™ Packaging Technologies Webinar📺[]
  31. UCIe Consortium, UCIe™ Packaging Technologies Webinar📺[]
  32. UCIe Consortium, UCIe™ Packaging Technologies Webinar📺[]
  33. Compute Express Link™: The Breakthrough CPU-to-Device Interconnect
    CXL™
    []
  34. What is Fan-Out Packaging?[]
  35. Keysight, RF Module Fan-Out Wafer Level (FOWL) Packaging[]
  36. Flip-chip on FO-WLP: assembly process flow.[]
  37. Sundaram, V. and Tummala, R.R. (2014). Interposer Technology. In Handbook of 3D Integration (eds P. Garrou, M. Koyanagi and P. Ramm).[][]
  38. High Bandwidth Memory Schematic[]
  39. 国金证券, “先进封装价值量提升叠加需求回暖,封测产业链机遇将至”[][][][]
  40. Through-silicon via[]
  41. Advanced Packaging Part 2 – Review Of Options/Use From Intel, TSMC, Samsung, AMD, ASE, Sony, Micron, SKHynix, YMTC, Tesla, and Nvidia[]
  42. Embedded Multi-Die Interconnect Bridge[]
  43. Mahajan, Ravi, Robert Sankman, Neha Patel, Dae-Woo Kim, Kemal Aygun, Zhiguo Qian, Yidnekachew Mekonnen et al. “Embedded multi-die interconnect bridge (EMIB)–a high density, high bandwidth packaging interconnect.” In 2016 IEEE 66th Electronic Components and Technology Conference (ECTC), pp. 557-565. IEEE, 2016. 📁[][]
  44. Back end of line[]
  45. Elsherbini, A. A., S. M. Liff, and J. M. Swan. “Heterogeneous integration using omni-directional interconnect packaging.” In 2019 IEEE International Electron Devices Meeting (IEDM), pp. 19-4. IEEE, 2019. 📁[][]
  46. Greek: Unique and Special[]
  47. Intel Foveros Technology Explained[]
  48. UCIe Consortium, UCIe™ Packaging Technologies Webinar📺[][]
  49. Foveros Illustration[]
  50. Foveros with EMIB Illustration[]
  51. Foveros Direct: Advanced Packaging Technology to Continue Moore’s Law[]
  52. Omni-Directional Interconnect[]
  53. Chip-on-Wafer-on-Substrate[]
  54. TSMC CoWoS Illustration[]
  55. Douglas Yu, TSMC Packaging Technologies for
    Chiplets and 3D
    [][]
  56. NEC SX-Aurora Cross Section Illustration[]
  57. high-K MiM Illustration[]
  58. iCAP Illustration[]
  59. Samuel K. Moore, AMD’s Next GPU Is a 3D-Integrated Superchip, IEEE Spectrum[][]
  60. AMD EPYC MI300[][]
  61. CoWoS®-R[]
  62. CoWoS-R Illustration[]
  63. TSMC OIP: 3DFabric Alliance and 3Dblox[]
  64. CoWoS®-L[]
  65. CoWoS-L Illustration[]
  66. Local Silicon Interconnect[][][]
  67. Integrated Fan-Out[]
  68. Chiang, Y. P., S. P. Tai, W. C. Wu, John Yeh, C. T. Wang, and C. H. Douglas. “InFO_oS (integrated fan-out on substrate) technology for advanced chiplet integration.” In 2021 IEEE 71st Electronic Components and Technology Conference (ECTC), pp. 130-135. IEEE, 2021. 📁[]
  69. InFO_PoP Illustration[]
  70. Fan-Out Wafer Level Packaging[]
  71. Package on Package[]
  72. Through InFO Via[]
  73. Advanced Packaging 1-2 #TSMC[]
  74. InFO_PoP vs CoWoS with SoC and SoIC[]
  75. System of Integrated Chips[]
  76. Advanced Packaging 1-2 #TSMC📺[]
  77. Chip on Wafer[]
  78. Wafer on Wafer[]
  79. TSMC Announces Wafer-on-Wafer 3D Stacking Technology[]
  80. CoW and WoW comparison[]
  81. Interposer Cube[]
  82. UCIe Consortium, UCIe™ Packaging Technologies Webinar📺[]
  83. Hybrid Substrate Cube[]
  84. eXtended Cube[]
  85. Tom’s Hardware, SK Hynix and Nvidia reportedly working on a radical GPU redesign that 3D-stacks HBM memory directly on top of the processing cores[]
  86. YANGTZE Memory[]
  87. Semicon Talk, Intel Foveros📺[]
  88. ASE, 2.5D and 3D IC Packaging[]
  89. Minkenberg, Cyriel, Rajesh Krishnaswamy, Aaron Zilkie, et al. “Co‐packaged Datacenter Optics: Opportunities and Challenges.” IET Optoelectronics 15, no. 2 (2021): 77-91. 📁[]
  90. data path transfer define arbitrarily[]
  91. Cadence, UCIe PHY and Controller—To Die For[]
  92. version of stacks for D2D adepter to work with[]
  93. Introduction to UCIe Tutorial: UCIe Protocols📺[]
  94. Krishnan, Gokul, Sumit K. Mandal, Manvitha Pannala, Chaitali Chakrabarti, Jae-Sun Seo, Umit Y. Ogras, and Yu Cao. “SIAM: Chiplet-based scalable in-memory acceleration with mesh for deep neural networks.” ACM Transactions on Embedded Computing Systems (TECS) 20, no. 5s (2021): 1-24. 📁[]
  95. NVIDIA V100 TENSOR CORE GPU[]
  96. NVIDIA T4 Tensor Core GPU for AI Inference[]
  97. Mikhaylov, A N, E G Gryaznov, M N Koryazhkina, et al. 2023. “Neuromorphic Computing Based on CMOS-Integrated Memristive Arrays: Current State and Perspectives.” Supercomputing Frontiers and Innovations 10, no. 2: 77–103. 📁[]
  98. Sharma, Harsh, Sumit K. Mandal, Janardhan Rao Doppa, Umit Y. Ogras, and Partha Pratim Pande. “SWAP: A Server-Scale Communication-Aware Chiplet-Based Manycore PIM Accelerator.” IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems 41, no. 11 (2022): 4145-4156. 📁[]
  99. Goksoy, Ahmet Alper. “In-Memory Computing based Acceleration: Large-Scale to Edge Computing.” PhD diss., Arizona State University, 2023. 📁[]
  100. Lan, Jingjing, Vishnu P. Nambiar, Rheeshaalaen Sabapathy, Mihai Dragos Rotaru, and Anh Tuan Do. “Chiplet-based Architecture Design for Multi-Core Neuromorphic Processor.” In 2021 IEEE 23rd Electronics Packaging Technology Conference (EPTC), pp. 410-412. IEEE, 2021. 📁[]
  101. Nambiar, Vishnu P., Junran Pu, Yun Kwan Lee, Aarthy Mani, T. Luo, L. Yang, Eng-Kiat Koh et al. “0.5 V 4.8 pJ/SOP 0.93\mu\mathrm {W} Leakage/core Neuromorphic Processor with Asynchronous NoC and Reconfigurable LIF Neuron.” In 2020 IEEE Asian Solid-State Circuits Conference (A-SSCC), pp. 1-4. IEEE, 2020. 📁[]
  102. Vetter, Jeffrey S., Date, Prasanna, Fahim, Farah, Kulkarni, Shruti R., Maksymovych, Petro, Talin, A. Alec, Tallada, Marc Gonzalez, Vanna-iampikul, Pruek, Young, Aaron R., Brooks, David, Cao, Yu, Gu-Yeon, Wei, Lim, Sung Kyu, Liu, Frank, Marinella, Matthew, Sumpter, Bobby, and Miniskar, Narasinga Rao. 2023. “Abisko: Deep codesign of an architecture for spiking neural networks using novel neuromorphic materials”. United States. https://doi.org/10.1177/10943420231178537.[]

Posted

in

by

Tags:

Comments

Leave a Reply

Your email address will not be published. Required fields are marked *

🧭