PC1 - Dataflows and Mapping |
- Heterogeneous Dataflow Accelerators for Multi-DNN Workloads
- Wire-Aware Architecture and Dataflow for CNN Accelerators
- Shortcut Mining: Exploiting Cross-layer Shortcut Reuse in DCNN Accelerators
- MAERI: Enabling Flexible Dataflow Mapping over DNN Accelerators via Reconfigurable Interconnects
- X-Cache : A Modular Architecture for Domain-Specific Caches
- FLAT: An Optimized Dataflow for Mitigating Attention Bottlenecks
- Sigma: Compiling Einstein Summations to Locality-Aware Dataflow
- FLAT: An Optimized Dataflow forMitigating Attention Bottlenecks
- ZigZag: Enlarging Joint Architecture-Mapping Design Space Exploration for DNN Accelerators
- Understanding Reuse, Performance, and Hardware Cost of DNN Dataflows: A Data-Centric Approach
- CoSA: Scheduling by Constrained Optimization for Spatial Accelerators
- A Full-Stack Search Technique for Domain Optimized Deep Learning Accelerators
- Leveraging Domain Information for the Efficient Automated Design of Deep Learning Accelerators
- DeFiNES: Enabling Fast Exploration of the Depth-first Scheduling Space for DNN Accelerators through Analytical Modeling
|