zstd decompression should almost always be very fast. It's faster to decompress ...

19h · on March 19, 2023

Unfortunately we're not just searching for things but extracting word frequencies of every user for stylometric analysis, so we need to do custom crunching.

Splitting this task across many sub-slices of the files is annoying because the per-user frequencies add up quickly, which results in a massive amount of data.
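For illustration, here is a rough sketch of the per-slice counting and merging described above. It assumes each slice can be read as `(user, text)` pairs. The tokenizer, the function names `count_slice` and `merge_into`, and the sample data are all made up for the example and are not the actual pipeline.

```python
from collections import Counter, defaultdict
import re

# Assumption: a very crude tokenizer; real stylometry would likely need more care.
WORD_RE = re.compile(r"[a-z']+")

def count_slice(comments):
    """Build per-user word counts for one slice of (user, text) pairs."""
    freqs = defaultdict(Counter)
    for user, text in comments:
        freqs[user].update(WORD_RE.findall(text.lower()))
    return freqs

def merge_into(total, partial):
    """Fold one slice's per-user counts into the running total, in place."""
    for user, counts in partial.items():
        total[user].update(counts)
    return total

# Two hypothetical slices; in practice each would come from a chunk of the dump.
slice_a = [("alice", "the cat sat"), ("bob", "the dog")]
slice_b = [("alice", "the cat ran"), ("carol", "hello")]

total = defaultdict(Counter)
for part in (slice_a, slice_b):
    merge_into(total, count_slice(part))
```

Every slice carries its own table keyed by user and word, so the intermediate results stay large until they are merged, which is the annoying part.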

zX41ZdbW · on March 18, 2023

[flagged]

zX41ZdbW · on March 18, 2023

Two SSDs on an AWS machine only give 3800 MB/sec :(

eska · on March 19, 2023

Meanwhile, a single consumer Samsung 980 Pro 2TB for 200€ gives me a stable 7000 MB/sec