This plot shows an interesting result: most pages of the Talmud are not easily distinguishable from one another! Apart from Brachot + Shabbos and Nedarim + Niddah + Sotah, the amudim of most masechtot do not form isolated clusters.
We believe this is because the same common words and phrases recur throughout the Talmud.
Nevertheless, even within the large central cluster, we can still see general groupings of different masechtot and topics.
We believe this can be extremely useful as a pedagogical aid! If you've begun learning a certain page of gemara, you can search for it here and then pan over the nearby amudim to find ones with similar vocabulary and word usage.
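The same "look at the neighbors" idea can also be done programmatically. Below is a minimal sketch, assuming you have each amud's 2D plot coordinates available as a dictionary; the `coords` values and the `nearest_amudim` helper are hypothetical and made up for illustration, not taken from the actual plot data. Keep in mind that closeness in a 2D projection is only a rough proxy for similarity in vocabulary.

```python
import math

# Hypothetical 2D plot coordinates keyed by amud label.
# These values are invented for illustration, not real plot data.
coords = {
    "Brachot 2a": (0.0, 0.0),
    "Brachot 2b": (0.1, 0.2),
    "Shabbos 2a": (0.3, -0.1),
    "Nedarim 2a": (5.0, 5.0),
    "Sotah 2a": (5.2, 4.9),
}


def nearest_amudim(label, coords, k=3):
    """Return the k amud labels closest to `label` by Euclidean distance in the plot."""
    x0, y0 = coords[label]
    distances = [
        (math.hypot(x - x0, y - y0), name)
        for name, (x, y) in coords.items()
        if name != label  # skip the amud we are searching from
    ]
    # Sort by distance (ties broken by label) and keep the k closest.
    return [name for _, name in sorted(distances)[:k]]


print(nearest_amudim("Brachot 2a", coords, k=2))
```

With real coordinates, the returned labels would be the amudim you would see when panning around the selected point.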