overview for epstein_files

Epstein Files Jan 30, 2026 Release - Archived from Justice.gov by epstein_files_guy in c/datahoarder@lemmy.ml

[-] epstein_files_guy@lemmy.world 2 points 4 days ago

fantastic work btw

[-] epstein_files_guy@lemmy.world 1 point 4 days ago

seems like all three gaps are covered so I'll join you on this one and see if I can get anything

[-] epstein_files_guy@lemmy.world 3 points 5 days ago

alrighty, I'm currently in the middle of the archive.org upload but I can transfer the chunks I already have over to a different machine and do it there with a new IP

[-] epstein_files_guy@lemmy.world 3 points 5 days ago

age gate > page not found

[-] epstein_files_guy@lemmy.world 3 points 5 days ago

I messaged you on the other site; I'm currently getting a "Could not determine Content-Length (got None)" error

[-] epstein_files_guy@lemmy.world 7 points 5 days ago

this method is not working for me anymore

[-] epstein_files_guy@lemmy.world 6 points 5 days ago

I'm waiting for /u/Kindly_District9380's version, but I've been slowly working backwards on this in the meantime: https://archive.org/details/dataset9_url_list

[-] epstein_files_guy@lemmy.world 4 points 5 days ago

about 25gb

[-] epstein_files_guy@lemmy.world 6 points 5 days ago

I’m using a partial download I already had and not the 48gb version but I will be gathering as many chunks as I can as well. Thanks for making this

[-] epstein_files_guy@lemmy.world 5 points 5 days ago* (last edited 5 days ago)

I'll get the first set (42k files in 31G) uploading as soon as I get it zipped up. it's the one least likely to have any new files in it since I started at the beginning like others but it's worth a shot

edit 01FEB2026 1208AM EST - 6.4/30gb uploaded to archive.org

edit 01FEB2026 0430AM EST - 13/30gb uploaded to archive.org; scrape using a different url set going backwards is currently at 75.4k files

edit 01FEB2026 1233PM EST - had an internet outage overnight and lost all progress on the archive.org upload, currently back to 11/30gb. the scrape using a previous url set seems to be getting very few new files now, sitting at 77.9k at the moment

[-] epstein_files_guy@lemmy.world 6 points 5 days ago

maybe archive.org? that way they can be torrented if others want to attempt their own merging techniques? either way it will be a long upload, my speed is not especially good. I'm still churning through one set of urls that is 1.2M lines, most are failing but I have 65k from that batch so far.
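
For anyone restarting a long pass over a big url list like this, a rough sketch of skipping what is already on disk might look like the following. It is only an illustration, not the script used here: the helper names `filename_for` and `pending_urls` are made up, it assumes each url ends in the file's name, and it assumes anything present in the download folder is a complete file (partial downloads would need a size or hash check on top).

```python
from pathlib import Path, PurePosixPath
from urllib.parse import unquote, urlsplit


def filename_for(url: str) -> str:
    """Return the decoded last path segment of a url (assumed to be the file name)."""
    return unquote(PurePosixPath(urlsplit(url).path).name)


def pending_urls(urls, download_dir):
    """Return the urls whose file name is not yet present in download_dir.

    Blank lines and repeated file names are dropped, so the result can be
    fed straight back into whatever downloader is being used.
    """
    have = {p.name for p in Path(download_dir).iterdir() if p.is_file()}
    seen = set()
    todo = []
    for line in urls:
        url = line.strip()
        if not url:
            continue
        name = filename_for(url)
        if not name or name in have or name in seen:
            continue
        seen.add(name)
        todo.append(url)
    return todo
```

Since `pending_urls` accepts any iterable of lines, an open file handle on the url list works as input, and a 1.2M-line list does not need to be loaded into memory up front.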

[-] epstein_files_guy@lemmy.world 8 points 5 days ago

looking forward to your torrent, will seed.

I have several incomplete sets of files from dataset 9 that I downloaded with a scraped set of urls - should I try to get them to you to compare as well?
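
If it helps with the comparing, here is a minimal sketch of one way to check an incomplete set against another by content rather than by file name. It is an assumption about how the comparison could be done, not anyone's actual merge tool: the function names are hypothetical, and it treats two files as the same only when their SHA-256 hashes match.

```python
import hashlib
from pathlib import Path


def hash_file(path: Path, chunk_size: int = 1 << 20) -> str:
    """Return the SHA-256 hex digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with path.open("rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def index_by_hash(root: Path) -> dict[str, Path]:
    """Map content hash to one path for every regular file under root."""
    index: dict[str, Path] = {}
    for path in sorted(root.rglob("*")):
        if path.is_file():
            index.setdefault(hash_file(path), path)
    return index


def missing_from(reference: Path, candidate: Path) -> list[Path]:
    """List files in candidate whose content appears nowhere in reference."""
    ref_hashes = set(index_by_hash(reference))
    cand_index = index_by_hash(candidate)
    return sorted(p for h, p in cand_index.items() if h not in ref_hashes)
```

Calling `missing_from(Path("set_a"), Path("set_b"))` (folder names are placeholders) would list only the files in the second set that the first one lacks, even if some files were renamed along the way.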