🦞🌯 Lobster Roll

Thread

Crawling a billion web pages in just over 24 hours (andrewkchan.dev)

Stories related to "Crawling a billion web pages in just over 24 hours" across the full archive.

Crawling a billion web pages in just over 24 hours, in 2025 (andrewkchan.dev)
Crawling a billion web pages in just over 24 hours, in 2025 (andrewkchan.dev)
Crawling a billion web pages in just over 24 hours, in 2025 (andrewkchan.dev)
Crawling a billion web pages in just over 24 hours (andrewkchan.dev)
How to crawl a quarter billion webpages in 40 hours (michaelnielsen.org)
We rendered a million web pages to find out what makes the web slow (catchjs.com)
Why Web Pages Suck (stratechery.com)
Accelerated Mobile Pages – A new approach to web performance (ampproject.org)
How We Moved 34,000 WIRED Pages to One Site in 9 Hours (wired.com)
Why Do Websites Publish AMP Pages? (daringfireball.net)
Social Fixer malware campaign uses NPM to inject code into webpages (numin.it)
Is WEBrick Webscale? (schneems.com)
Optimizing web servers for high throughput and low latency (blogs.dropbox.com)
How 3rd Party Scripts can be performant citizens on the web (twnsnd.com)
A Guide to Faster Web App I/O and Data Operations with Streams (sitepen.com)
A Pinterest Progressive Web App Performance Case Study (medium.com)
Just because your site isn't for emerging markets, doesn't excuse you from web performance optimisation (twnsnd.com)
The Unbearable Lightness of Web Pages (thelocalyarn.com)
nuster- v1.7.9.4 is released (A web caching proxy server) (github.com)
Web cache server performance benchmark: nuster vs squid (github.com)
Smaller Lodash bundles with Webpack and Babel (nolanlawson.com)
Yesquel: Scalable SQL Storage for Web Applications (2015) (cs.nyu.edu)
Abstract: "Web applications have been shifting their storage systems from SQL to NOSQL systems. NOSQL systems scale well but drop many convenient SQL features, such as joins, secondary indexes, and/or transactions. We design, develop, and evaluate Yesquel, a system that provides performance and sca...
Web Framework Performance Comparison Round 16 (techempower.com)
Web cache server HTTP/2 performance benchmark: nuster vs nginx (github.com)
How to Hurricane-Proof a Web Server (2017) (arstechnica.com)
Fairytale about performance in web application (itnext.io)
Story describes how I fixed JS execution time from 26 to 1 sec
Conservative web development (drewdevault.com)
Staticman --- add comments to GitHub Pages without JavaScript (staticman.net)
Installing Hugo and publishing Hugo web-pages on OpenBSD server (bsdboy.ml)
Noria: dynamic, partially-stateful data-flow for high-performance web applications (usenix.org)
A while ago, I [asked](https://lobste.rs/s/cqnzl5/lobste_rs_access_pattern_statistics_for) about traffic statistics for lobste.rs for a research project. Finally, the result of that work has now been published as the linked system, which speeds up lobste.rs by ~5x over that provided by MySQL. The so...