Matrix Matrix Multiplication

A High Shared Memory Utilization Algorithm for Parallel Sparse General Matrix-Matrix Multiplication on GPUs

Sparse general matrix-matrix multiplication (SpGEMM) is fundamental to numerous scientific applications. Traditional hash-based approaches fail to strike a trade-off between reducing hash collisions ...

GitHub

CUDA Kernel for Matrix-Matrix Multiplication on Nvidia GPUs

This code accompanies the blog post Matrix Multiplication Faster Than Nvidia, Sometimes. It provides a CUDA kernel for single-precision matrix-matrix multiplication, with two notable features: use of ...

insideHPC

Intel MKL Speeds Up Small Matrix-Matrix Multiplication for Automatic Driving

Nearly all big science, machine learning, neural network, and machine vision applications employ algorithms that involve large matrix-matrix multiplication. But multiplying large matrices pushes the ...

IEEE

Matrix-Matrix Multiplication Through Hyperspectral Compute-in-Memory

Abstract: We propose a hyperspectral compute-in-memory architecture using optical frequency combs and programmable optical memories. By fully utilizing frequency, space, and time dimensions, this ...

IEEE

Application Level Synthesis: Creating Matrix-Matrix Multiplication Library: A Case Study

Abstract: Efficiently synthesizing an entire application that consists of multiple algorithms for hardware implementation is a very difficult and unsolved problem. One of the main challenges is the ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results