Thread
Stories related to "Beware of misleading GPU vs CPU benchmarks" across the full archive.
A new version of the SPEC CPU benchmark suite is published.
Abstract: "In-memory tree structured index search is a fundamental database operation. Modern processors provide tremendous computing power by integrating multiple cores, each with wide vector units. There has been much work to exploit modern processor architectures for
database primitives like sca...
Article also contains a link to a good LWN article on CPU caches.
Abstract:
> Every year, novel NVIDIA GPU designs are introduced. This rapid architectural and technological progression, coupled with a reluctance by manufacturers to disclose low-level details, makes it difficult for even the most proficient GPU software designers to remain up-to-date with the t...
I've been noodling for a while on the idea of doing text manipulations on the GPU. One such operation is unescaping of strings (also a primitive required for JSON parsing). Today I got around to implementing one of my ideas, gist in the link.
The interaction of `"` and `\` can be boiled down to a...
A blog post about https://lobste.rs/s/10sox2/sketch_string_unescaping_on_gpgpu
This is a bit of a followup to my earlier post on string unescaping. I don't think the approach as I've written it is very practical, but I think it's an intriguing direction. I believe parsing JSON on GPU can be done, but probably requires some very clever and tricky techniques to work well with th...
Why Skylake CPUs Are Sometimes 50% Slower – How Intel Has Broken Existing Code
(aloiskraus.wordpress.com)
Abstract: "We develop a dynamic dictionary data structure for the GPU, supporting fast insertions and deletions, based on the Log Structured Merge tree (LSM). Our implementation on an NVIDIA K40c GPU has an average update (insertion or deletion) rate of 225 M elements/s, 13.5x faster than merging it...