OLCF's CUDA series
01. CUDA C Basics
slide
Host: The CPU and its memory
Device: The GPU and its memory
Simple Processing Flow
COPY memory (from CPU to GPU)
Load GPU program a...
One thing we haven't done yet is any benchmarking. For this I thought I'd start with a simple perceptron and scale up the size of the tensors.
import { Tensor } from "../src/js/tensor.js";
import...
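A benchmark of that shape might look like the sketch below. It is not the code from this post: the `Tensor` API isn't shown here, so the sketch assumes plain `Float32Array` inputs, and `perceptronForward` and `benchmark` are hypothetical helper names chosen for illustration. It times a single-neuron forward pass while the input size grows by powers of ten.

```javascript
// Hypothetical stand-in for a perceptron forward pass. It does not use the
// Tensor class imported above, whose API is not shown in this excerpt.
function perceptronForward(inputs, weights, bias) {
  let sum = bias;
  for (let i = 0; i < inputs.length; i++) {
    sum += inputs[i] * weights[i];
  }
  // Step activation: 1 if the weighted sum is positive, otherwise 0.
  return sum > 0 ? 1 : 0;
}

// Time `runs` forward passes at one input size and return the mean in ms.
function benchmark(size, runs = 100) {
  const inputs = new Float32Array(size).fill(0.5);
  const weights = new Float32Array(size).fill(0.25);
  let checksum = 0; // accumulate outputs so the loop has an observable result
  const start = performance.now();
  for (let r = 0; r < runs; r++) {
    checksum += perceptronForward(inputs, weights, -1);
  }
  const elapsed = performance.now() - start;
  return { size, meanMs: elapsed / runs, checksum };
}

// Scale the input size by powers of ten and report the timings.
const results = [1e2, 1e3, 1e4, 1e5].map((n) => benchmark(n));
console.table(results);
```

Timings from a loop like this are noisy, especially on the first few runs while the JIT warms up, so it is worth treating the numbers as a rough trend across sizes instead of exact per-call costs.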
Hi everyone! I'm Reenmayee, a 2nd-year BTech student, and today I’m starting my 3-month learning sprint toward mastering CUDA, GenAI, and DSA.
What I Did Today:
Set up Google Colab with GPU...