Difference between revisions of "Projects"

From Archiveteam
Jump to navigation Jump to search
(NEW PROJECTS PAGES LAYOUT. See: user:bzc6p/Restructuring projects pages.)
Line 1: Line 1:
{{Projects status}}
{{Projects status}}
Here's where Archive Teamsters can list the '''projects''' they are currently working on and organize new projects.


= Projects =
This page should contain, or directly link to, almost all ArchiveTeam archiving endavours, categorized.
Our [[Current Projects]] page.
* '''[[#Current Projects|Current Projects]]''': currently active, upcoming and recently finished grandiose ArchiveTeam projects.  (Extract of the next two categories.)
:''See also: [[:Category:In progress]].''
* '''[[#Warrior Projects|Warrior Projects]]''': projects that utilize(d) ArchiveTeam's distributed archiving system.
* '''[[#Manual Projects 2|Manual Projects]]''' that need(ed) much more effort than just pushing a button.
* '''[[#Small Projects|Small Projects]]''': small-scale website archiving projects usually done by a single individual.
* '''[[#Early Projects|Early Projects]]''': first archiving endavours on the dawn of ArchiveTeam, in a format nobody is apparently able/dare to touch.


== ArchiveTeam Warrior ==
(The box on the top counts projects having dedicated wiki pages, those numbers aren't complete and far don't contain all projects mentioned in the sections below.)
The [[ArchiveTeam Warrior]] is a virtual machine that will allow you to lend a hand on large archiving projects whenever they come up.


== ArchiveBot ==
If you know of a website in danger, let us know that on [[IRC]]. If it's a larger site, please also mention it on the '''[[Deathwatch]]''' page. And, after a decision is made on IRC, or if it doesn't need a decision, then, to help things kept documented and up to date, you are encouraged to add projects, or modify their status
* in the appropriate section(s),
* on the project's dedicated wiki page (if any),
* on [[Deathwatch]] and/or on [[Alive... OR ARE THEY]].


[[ArchiveBot]] is an IRC bot that automates archiving for smaller sites.
The box on the top is generated automatically from projects' dedicated wiki pages, so shouldn't be touched.


== Websites at risk ==
'''Important:''' Contents of sections below are '''embedded''' from other pages, that is, don't edit the section, nor this page, but use the "'''Edit this list'''" link! (That opens the corresponding page for editing, and after editing, you'll be forwarded to the page containing only that list: don't worry, you didn't delete the others.)


See [[Deathwatch]] and [[Alive... OR ARE THEY]].
= Current Projects =
<div class="mw-collapsible" style="width:100%; background-color: #CCFFFF; border: 1px solid; padding: 5px">
Currently active team projects you can get involved in.
<!-- TO EDIT THE LIST, GO BACK AND CLICK "Edit this list". -->
<div class="mw-collapsible-content" style="width:100%">
'''<span class="plainlinks">[http://archiveteam.org/index.php?title=Current_Projects&action=edit Edit this list]</span>'''
{{:Current Projects}}
</div>
</div>


== Ideas for projects ==
= Warrior Projects =
<div class="mw-collapsible mw-collapsed" style="width:100%; background-color: #99FF99; border: 1px solid; padding: 5px">
ArchiveTeam's past, current and future Warrior projects with details, in a table form.
<!-- TO EDIT THE LIST, GO BACK AND CLICK "Edit this list". -->
<div class="mw-collapsible-content" style="width:100%">
'''<span class="plainlinks">[http://archiveteam.org/index.php?title=Warrior_projects&action=edit Edit this list]</span>'''
{{Warrior projects}}
</div>
</div>


See [[Deathwatch]] and [[Alive... OR ARE THEY]].
= Manual Projects =
<div class="mw-collapsible mw-collapsed" style="width:100%; background-color: #CCFF99; border: 1px solid; padding: 5px">
Difficult, discussion-intensive, human-resource-intensive and audit projects.
<!-- TO EDIT THE LIST, GO BACK AND CLICK "Edit this list". -->
<div class="mw-collapsible-content" style="width:100%">
'''<span class="plainlinks">[http://archiveteam.org/index.php?title=Manual_projects&action=edit Edit this list]</span>'''
{{Manual projects}}
</div>
</div>


== Finished projects ==
= Small Projects =
<div class="mw-collapsible mw-collapsed" style="width:100%; background-color: #FFCCFF; border: 1px solid; padding: 5px">
List of smaller website rescuing projects, usually done by single individuals.
<!-- TO EDIT THE LIST, GO BACK AND CLICK "Edit this list". -->
<div class="mw-collapsible-content" style="width:100%">
'''<span class="plainlinks">[http://archiveteam.org/index.php?title=Small_projects&action=edit Edit this list]</span>'''
{{Small projects}}
</div>
</div>


This is a list of completed projects which do not have their own page on this wiki. TODO - A page will be made for them in time.
= Early Projects =
<div class="mw-collapsible mw-collapsed" style="width:100%; background-color: lightgray; border: 1px solid; padding: 5px">
List of ArchiveTeam's early endavours, for historical interest, not edited.
<!-- TO EDIT THE LIST, GO BACK AND CLICK "Edit this list". -->
<div class="mw-collapsible-content" style="width:100%">
'''<span class="plainlinks">[http://archiveteam.org/index.php?title=Early_projects&action=edit Edit this list]</span>'''
{{Early projects}}
</div>
</div>


See [[:Category:Rescued Sites]] for projects which do have their own page on this wiki.
* [http://www.archiveteam.org Archive Team] founded by [[User:Jscott|Jason Scott]] [http://archiveteam.org/index.php?title=Main_Page&diff=prev&oldid=3]
* [http://thepiratebay.org/user/archiveteam/ an archiveteam thepiratebay.org user] created by [[User:Bbot|bbot]]
** Get the password from him or Jason. (Not really a ''project'', per se.)
* ([http://mirrors.sdboyd56.com/infoanarchy/ mirror] | [http://sdboyd56.com/archives/infoanarchy_archive-201102.tar.gz 4.5MB archive]) [http://www.infoanarchy.org/en/Main_Page The infoAnarchy wiki] was archived by [[User:Sdboyd|Scott]].
** infoAnarchy was down for several months in the first part of 2011, but is back up as of May 2011.  There is now very little content updating on the site.  As of 2014-06-02, infoAnarchy has a "Revive infoanarchy.org blog & wiki" notice and a request for donations, suggesting it may not have a future.  As of 2014-06-02, a "database is locked" message will be given to logged-in users.
** If there are future updates to that archive, they may be found at http://sdboyd56.com/archives/
** FIXME - This archive has non-relative links, requiring it to be in /infoanarchy.  It needs to be redone or edited to have relative links.
** FIXME - This archive does not include the complete history, which is absolutely essential in this case, as significant editing history exists.
* ([http://mirrors.sdboyd56.com/cyberpunk_project/ mirror]) [http://project.cyberpunk.dotru The Cyberpunk Project] was archived by [[User:Sdboyd|Scott]]
** Note that this wiki does not allow the Russian TLD, so the URL will have to be edited to be visited.
** Most pages haven't been changed since 2007.  It hasn't been updated or changed since April 2010.
** FIXME - this mirror is incomplete, or its links are pointing to the live website.
* ([http://www.archive.org/details/kasabi archive]) Kasabi's data was retrieved and uploaded to archive.org by [[User:Edsu|Edsu]].
* ([https://archive.org/details/foxytunes.com-panicgrab-20130704 archive]) FoxyTunes was archived by [[User:Start|Start]]
** (it's less than 1MB!)
* ([https://archive.org/details/emulation-zone-archive archive]) Emulation Zone was archived by [[User:Start|Start]]
** FIXME - vgaa.emulationzone.org-2014-0708.warc.gz got interrupted by a crash and needs to be re-archived
== Other projects ==
* '''[[FanFiction.Net]]''' is being pre-emptively archived.
* '''[[User:ip2k|seanp2k]]''' is running [http://somaseek.com somaseek.com] and tracking all the song history for all of the internet radio stations on [http://somafm.com somafm.com] since March 2010.
* '''[[User:Ross|Ross]]''' is interviewing the sites of 2008.
* '''[[User:LesOrchard|l.m.orchard]]''' is starting work on some self-hosted web apps that will migrate and archive from other sites. (ie. [http://github.com/lmorchard/friendfeedarchiver FriendFeed], [http://github.com/lmorchard/memex/ Delicious])
* '''[[User:Sungo|sungo]]''' is archiving etherpad.
* '''[[User:Tsp|Tsp]]''' is attempting to archive the stories from fanfiction.net and fictionpress.
* '''[[User:Emijrp|emijrp]]''' is a member of [[WikiTeam]]. Also, downloading albums from [[Jamendo]]. You can know more about his projects in his userpage.
* '''[[User:jcbradley|Jean-Claude Bradley]]''' and '''[[User:romney|Andrew Lang]]''' are archiving the [http://onsbooks.wikispaces.com/ Open Notebook Science projects Reaction Attempts and the ONS Solubility Challenge].  This includes the lab notebooks and all associated raw data files.
* '''[[User:Hydriz|Hydriz]]''' is currently archiving all [http://dumps.wikimedia.org available dumps and downloads] generated by Wikimedia and uploading them to the Internet Archive (see [http://www.archive.org/details/wikimediadownloads collection]).
== Dead projects ==
* [[User:EmuWikiAdmin|EmuWikiAdmin]] created [http://www.emuwiki.com EmuWiki], a collection of all emulators, emulator documents, and hardware information that exists, regrouped in a referenced database.  Unfortunately, it [http://gbatemp.net/t230096-emuwiki-com-closes-down shut down] in May 2010 due to copyright issues.  A 20GB torrent was released, and its contents are available at https://archive.org/details/EmuWiki_Collection.
== Tools ==
* [[Software]]
* [[httrack options]]
== See also ==
* [[Archives]]


{{Navigation pager
{{Navigation pager
Line 77: Line 74:
| next = Philosophy
| next = Philosophy
}}
}}
{{Navigation box}}

Revision as of 18:05, 28 June 2015

Projects status
Online (331) · Special cases (51) · Endangered (70) · Closing (16) · Offline (424)
Rescued Sites (498) · Self-Saved (17) · Partially Rescued Sites (213) · In Progress (43) · Upcoming (11) · Not Saved Yet (409) · On hiatus (12) · Lost Sites (91)
Unknown Status (65)

This page should contain, or directly link to, almost all ArchiveTeam archiving endavours, categorized.

  • Current Projects: currently active, upcoming and recently finished grandiose ArchiveTeam projects. (Extract of the next two categories.)
  • Warrior Projects: projects that utilize(d) ArchiveTeam's distributed archiving system.
  • Manual Projects that need(ed) much more effort than just pushing a button.
  • Small Projects: small-scale website archiving projects usually done by a single individual.
  • Early Projects: first archiving endavours on the dawn of ArchiveTeam, in a format nobody is apparently able/dare to touch.

(The box on the top counts projects having dedicated wiki pages, those numbers aren't complete and far don't contain all projects mentioned in the sections below.)

If you know of a website in danger, let us know that on IRC. If it's a larger site, please also mention it on the Deathwatch page. And, after a decision is made on IRC, or if it doesn't need a decision, then, to help things kept documented and up to date, you are encouraged to add projects, or modify their status

The box on the top is generated automatically from projects' dedicated wiki pages, so shouldn't be touched.

Important: Contents of sections below are embedded from other pages, that is, don't edit the section, nor this page, but use the "Edit this list" link! (That opens the corresponding page for editing, and after editing, you'll be forwarded to the page containing only that list: don't worry, you didn't delete the others.)

Current Projects

Currently active team projects you can get involved in.

Edit this list

Archive Team recruiting

Warrior-based projects

ArchiveTeam's Choice: Telegram

Short-term, urgent projects

  • DeviantArt: Archiving custom widgets, favorites, group affiliations, countdown timers, admin forums, and admin announcements. IRC Channel #devianttart (on hackint)

Medium-term projects

(none currently)

Long-term projects

An updated Warrior virtual appliance (v3.2) is now available with better support for newer projects that utilize wget-at.

Manual projects

  • ArchiveBot: For those with lots of disk space, bandwidth and long-term commitment. IRC Channel #archivebot (on hackint).
  • Codearchiver: Dumping and archival of source code repositories and associated version control systems. IRC Channel #codearchiver (on hackint).
  • Dead people: When people die, their webpages and/or social media might go "Poof!" due to fees and other knick-knack. IRC Channel #archiveteam (on hackint)
  • WikiTeam: Saving wikis dumps (XML). And their external links for the Wayback Machine (WARC) as well as exporting MediaWiki databases. Permanent effort, everyone can help (you choose the size of your downloads). IRC Channel #wikiteam (on hackint).

Upcoming & proposed projects

Recently finished projects

  • Taringa!: Shut down on 2024-03-24 with barely two weeks lead time. IRC Channel #mataringa (on hackint).


On Hiatus

ArchiveTeam uses the hackint IRC network – ircs://irc.hackint.org:6697 (TLS required) – webchat: https://webirc.hackint.org/More info

Warrior Projects

ArchiveTeam's past, current and future Warrior projects with details, in a table form.

Manual Projects

Difficult, discussion-intensive, human-resource-intensive and audit projects.

Small Projects

List of smaller website rescuing projects, usually done by single individuals.

Early Projects

List of ArchiveTeam's early endavours, for historical interest, not edited.


Fire DrillProjectsPhilosophy