Difference between revisions of "GeoCities URL Lists"

From Archiveteam
Line 1: Line 1:
* swebb's current url list: http://badcheese.com/~steve/only_geocities.txt.bz2 (no longer updated)
+
* swebb's current url list: http://badcheese.com/~steve/ALL-GEO-SEEDS-20090730.txt.bz2 (Same url list that archive.org is using)
 
* sods list : [http://blog.odonnell.nu/static/sites.tar.bz2] - over 700,000 unique GeoCities sites (not pages). I don't have the ability to download them; hopefully some of the downloaders can make use of this.
 
  

Revision as of 15:17, 22 October 2009

URLs drawn from specific sources

It is especially important to back up URLs linked from news sites and other projects that cared about the quality of the sites they linked to. The following URL lists were all extracted from dumps or crawls of these sites: