🦞🌯 Lobster Roll

Thread

Beware of misleading GPU vs CPU benchmarks (pythonspeed.com)

Stories related to "Beware of misleading GPU vs CPU benchmarks" across the full archive.

Beware of misleading GPU vs CPU benchmarks (pythonspeed.com)
Scalene: a high-performance, high-precision CPU, GPU, and memory profiler (github.com)
Benchmarking Grace Hopper CPU+GPU "Superchip" (blog.hpc.qmul.ac.uk)
Burn 0.20.0: Unifying CPU & GPU kernels with CubeCL (burn.dev)
Loo.py: transformation-based code generation for GPUs and CPUs (arxiv.org)
Neanderthal 0.8.0: CPU and GPU support on Linux, Windows, and OS X! (neanderthal.uncomplicate.org)
SPEC CPU® 2017 (spec.org)
A new version of the SPEC CPU benchmark suite is published.
Thinking Parallel, Part II: Tree Traversal on the GPU (devblogs.nvidia.com)
Where’s all my CPU and memory gone? The answer: Slack (medium.com)
ARM Takes Wing: Qualcomm vs. Intel CPU comparison (blog.cloudflare.com)
Epic Services & Stability Update (100% CPU load increase after Meltdown mitigation) (epicgames.com)
Mastering Linux performance - CPU time and CPU usage (jaroslawr.com)
FAST: Fast Architecture Sensitive Tree Search on Modern CPUs and GPUs (2010) (webislands.net)
Abstract: "In-memory tree structured index search is a fundamental database operation. Modern processors provide tremendous computing power by integrating multiple cores, each with wide vector units. There has been much work to exploit modern processor architectures for database primitives like sca...
Effect of CPU Caches (medium.com)
Article also contains a link to a good LWN article on CPU caches.
Dissecting the NVIDIA Volta GPU Architecture via Microbenchmarking (arxiv.org)
Abstract: > Every year, novel NVIDIA GPU designs are introduced. This rapid architectural and technological progression, coupled with a reluctance by manufacturers to disclose low-level details, makes it difficult for even the most proficient GPU software designers to remain up-to-date with the t...
What optimizations you can expect from CPU? (dendibakh.github.io)
A sketch of string unescaping on GPGPU (gist.github.com)
I've been noodling for a while on the idea of doing text manipulations on the GPU. One such operation is unescaping of strings (also a primitive required for JSON parsing). Today I got around to implementing one of my ideas, gist in the link. The interaction of `"` and `\` can be boiled down to a...
A sketch of string unescaping on GPGPU (raphlinus.github.io)
A blog post about https://lobste.rs/s/10sox2/sketch_string_unescaping_on_gpgpu
Native Code Performance and Memory: The Elephant in the CPU (channel9.msdn.com)
Towards GPGPU JSON parsing (raphlinus.github.io)
This is a bit of a followup to my earlier post on string unescaping. I don't think the approach as I've written it is very practical, but I think it's an intriguing direction. I believe parsing JSON on GPU can be done, but probably requires some very clever and tricky techniques to work well with th...
Deep Packet Inspection Using GPU's (2017) (on-demand.gputechconf.com)
Why Skylake CPUs Are Sometimes 50% Slower – How Intel Has Broken Existing Code (aloiskraus.wordpress.com)
Comparing Serverless Performance for CPU Bound Tasks (blog.cloudflare.com)
GPU LSM: A Dynamic Dictionary Data Structure for the GPU (arxiv.org)
Abstract: "We develop a dynamic dictionary data structure for the GPU, supporting fast insertions and deletions, based on the Log Structured Merge tree (LSM). Our implementation on an NVIDIA K40c GPU has an average update (insertion or deletion) rate of 225 M elements/s, 13.5x faster than merging it...
AMD Announces 7nm Rome CPUs and MI60 GPUs (tomshardware.com)
The MuQSS CPU scheduler (2017) (lwn.net)
The Curious Case of BEAM CPU Usage (stressgrid.com)
Why use an FPGA instead of a CPU or GPU? (blog.esciencecenter.nl)
Incremental flattening for nested data parallelism on the GPU (futhark-lang.org)
Hyperscan: A Fast Multi-pattern Regex Matcher for Modern CPUs (branchfree.org)