Difference between revisions of "Wikipedia"

From Archiveteam
(Adding much-needed Wikipedia logo.)
(41 intermediate revisions by 11 users not shown)
{{Infobox project
| title = Wikipedia
| logo = Wikipedia.png
| url = http://www.wikipedia.org/
| project_status = {{online}}
| archiving_status = {{saved}}
| irc = wikiteam
}}


'''Wikipedia''' is the largest [[wiki]] on the planet, with several million articles in English and several million more across dozens of other languages.
 
[[File:Wikipedia nostalgia.png|thumb|right|[http://nostalgia.wikipedia.org Wikipedia nostalgia], a frozen version of Wikipedia from 2001]]
[[File:Wikipedia, the free encyclopedia april fools day 2010.png|thumb|right|April Fools Day 2010]]
<center>'''No more [[Library of Alexandria|Libraries of Alexandria]] destroyed.'''</center>
 
[[File:Size of English Wikipedia in August 2010 (L).png|thumb|right|700px|English Wikipedia in August 2010, if printed.]]
 
For once, a site that recognizes the importance of third-party backups! They have a [http://dumps.wikimedia.org/ main downloads page] from which you can get XML dumps of individual wikis (the Wikimedia Foundation hosts more than 800 wikis: Wikipedias, Wiktionaries, Wikinews, Wikisources, Wikibooks, Wikiquotes, Wikiversities, Wikispecies, Wikimedia Commons, Wikivoyage, Wikidata).
 
== Tools ==
* [https://github.com/WikiTeam/wikiteam/blob/master/wikipediadownloader.py WikiTeam script] to download Wikipedia dumps from download.wikimedia.org
 
== Backups ==
As of 19:07, 10 July 2016 (EDT), dumps.wikimedia.org only has about 10 earlier versions of dumps for each wiki, generally going back to around October 2015. These earlier versions don't seem to be linked, but they are accessible directly via http://dumps.wikimedia.org/''wikiname''/ (where ''wikiname'' is the name listed on the index page).
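
The URL pattern above can be sketched in a few lines of Python. This is only an illustration, not an ArchiveTeam or Wikimedia tool: the function names are made up for this example, and the assumption that each dump run appears in the directory listing as a link to an eight-digit <code>YYYYMMDD</code> subdirectory is a guess about the listing format rather than something stated on this page.

```python
import re
from typing import List
from urllib.parse import urljoin

# Base URL quoted in the text above.
DUMPS_ROOT = "http://dumps.wikimedia.org/"

# Assumption: each dump run shows up in the listing as a link to an
# eight-digit YYYYMMDD subdirectory, e.g. <a href="20160701/">.
_DATE_LINK = re.compile(r'href="(\d{8})/"')


def dump_dir_url(wikiname: str) -> str:
    """Return the per-wiki dump directory URL for a name such as 'enwiki'."""
    return urljoin(DUMPS_ROOT, wikiname.strip("/") + "/")


def dump_dates(listing_html: str) -> List[str]:
    """Extract dated subdirectory names from a listing page, oldest first."""
    return sorted(set(_DATE_LINK.findall(listing_html)))


def dump_run_urls(wikiname: str, listing_html: str) -> List[str]:
    """Build one URL per dated dump run found in the listing page."""
    base = dump_dir_url(wikiname)
    return [urljoin(base, date + "/") for date in dump_dates(listing_html)]
```

To use it, fetch the page at <code>dump_dir_url("enwiki")</code> with any HTTP client and pass the HTML to <code>dump_run_urls</code>. Non-date entries in the listing are ignored by the regular expression.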
 
There's an old article dump (2008/03/12) [http://thepiratebay.org/torrent/4794236/enwiki-20080312-pages-articles.xml.bz2 up on The Pirate Bay] [magnet:?xt=urn:btih:5dc4df42109c8d1dbc759276d62225223ca69c53&dn=enwiki-20080312-pages-articles.xml.bz2&tr=udp%3A%2F%2Ftracker.openbittorrent.com%3A80&tr=udp%3A%2F%2Fopen.demonii.com%3A1337&tr=udp%3A%2F%2Ftracker.coppersurfer.tk%3A6969&tr=udp%3A%2F%2Fexodus.desync.com%3A6969 magnet], from the [http://thepiratebay.org/user/archiveteam/ ArchiveTeam TPB account], although it has no seeders as of 19:07, 10 July 2016 (EDT).
 
There is no current public backup of the images uploaded to [[Wikimedia Commons]], which hosts about 32 million images and other media files as of 19:07, 10 July 2016 (EDT).
 
Links:
* [http://download.wikipedia.org/ official backups site]
* http://download.wikimedia.org/archive/ - about a dozen older dumps, including [http://dumps.wikimedia.org/archive/enwiki/20060816/ one from 2006], as well as 2 from [https://dumps.wikimedia.org/archive/2001/ 2001].
* {{url|http://noc.wikimedia.org/~tstarling/wikipedia-logs-2001-08-17.7z|old wikipedia backups discovered}}
** [https://web.archive.org/web/20130522000621/http://noc.wikimedia.org/~tstarling/wikipedia-logs-2001-08-17.7z Direct Wayback link]
** {{url|http://lists.wikimedia.org/pipermail/foundation-l/2010-December/063088.html|announcement on foundation-l}}
** {{url|https://web.archive.org/web/20120306052415/http://grey.colorado.edu/wikipedia_2001/|script for parsing them}}
 
* Internet Archive results: http://www.archive.org/search.php?query=wikipedia%20dumps (223,142 results as of 20:25, 10 July 2016 (EDT))
** {{IA id|wikimediadownloads}} - Primary collection, managed by Hydriz
*** 915,108 items, with archivedates from Nov 10, 2005 through Jul 10, 2016 as of 20:34, 10 July 2016 (EDT)
** {{IA id|wikipediadumps}} - Older, somewhat forgotten collection
*** 810 items, with archivedates from April 9, 2010 through Aug 13, 2014 as of 20:25, 10 July 2016 (EDT)
*** Three sets of all or most of the different language editions of Wikipedia, from 2010-04-08, 2010-06-10 and 2011-08-08.
**** 2010-04 has an underscore between the wiki name and the date, and is missing ltwiki (Lithuanian) presumably because it was created between then and June 2010.
**** 2010-06 has the same identifier format, and contains one edition that is missing from the other two: emwiki (which appears to be the [[wikipedia:Emilian-Romagnol]] edition).
**** 2011-08 has a dash (rather than an underscore) both before and after "wiki", and is missing 7 editions that are present in the other two (ace, ckb, hu, krc, mwl, pcd, pnb) and contains 7 missing from them (ak, be_x_old, eml, fj, hz, ng, tokipona).
*** There are also 12 other miscellaneous dumps:
**** {{IA id|arwiki20110112}}
**** {{IA id|de_labswikimedia-20110904}}
**** {{IA id|de_labswikimedia-20111013}}
**** {{IA id|en_labswikimedia-20110906}}
**** {{IA id|en_labswikimedia-20111015}}
**** {{IA id|enwiki-20110620-item-1-of-2}}
**** {{IA id|enwiki-20110620-item-2-of-2}}
**** {{IA id|flaggedrevs_labswikimedia-20110907}}
**** {{IA id|flaggedrevs_labswikimedia-20111016}}
**** {{IA id|idwiki20101106}}
**** {{IA id|readerfeedback_labswikimedia-20110907}}
**** {{IA id|readerfeedback_labswikimedia-20111016}}
 
* [http://en.wikipedia.org/wiki/User:Emijrp/Wikipedia_Archive Compilation of links to Wikipedia archives]
* [http://nostalgia.wikipedia.org/wiki/HomePage A backup of Wikipedia as of Thursday, December 20, 2001]
 
=== Transferring to IA ===
[[User:Hydriz|Hydriz]] is currently transferring the dumps of all Wikimedia projects into the Internet Archive, using resources that Wikimedia itself has provided for the transfer. The results are in the {{IA id|wikimediadownloads}} collection, which is still being kept up to date as of 20:38, 10 July 2016 (EDT).
 
== Vital signs ==
 
Stable, but they seriously use a lot of tactics to get donations.
 
== Offline readers ==
* [http://www.okawix.com/ Okawix] ([http://www.okawix.com/zenos/ files])
* [http://www.kiwix.org Kiwix] ([http://download.kiwix.org/zim/ files])


== See also ==
* [[Wikimedia Commons]]
* [[Wikia]]
* [[Wikis]]
* [[Nupedia]]
* [[GNUPedia]]
* [[Citizendium]]
* [[WikiTravel]] - Not a Wikimedia project, but its content was forked to create WMF-hosted rival Wikivoyage.
* [[WikiTeam]]
== External links ==
* http://www.wikipedia.org
* http://www.wikimedia.org
* https://en.wikipedia.org/wiki/User:Emijrp/Wikipedia_Archive
{{Navigation box}}
[[Category:Wikis]]

Revision as of 12:27, 16 May 2019

Wikipedia
URL http://www.wikipedia.org/
Status Online!
Archiving status Saved!
Archiving type Unknown
IRC channel #wikiteam (on hackint)
