Fast memcpy, A System Design
ACM Sigarch
DECEMBER 19, 2022
We look here at a Gedankenexperiment: move 16 bytes per cycle , addressing not just the CPU movement, but also the surrounding system design. A lesser design cannot possibly move 16 bytes per cycle. This base design can map easily onto many current chips. Thought Experiment. bytes remaining to move. Cache Pollution.
Let's personalize your content