Modern Hell

Modern Hell #29: Echoes of Ourselves

We spent years putting everything online. Now that data is being used to create a future nobody wants.

Colin Horgan
Apr 28, 2023

Last week, the Washington Post published details of its analysis of the data sets that train Large Language Models (LLMs). The Post was unable to examine the data used to train ChatGPT because OpenAI hasn’t disclosed its sources, but it did analyze Google’s C4 data set, “a massive snapshot of the contents of 15 million websites that have been used to in…

© 2025 Colin Horgan