Difference between revisions of "ArchiveBot"

From Archiveteam
Jump to navigation Jump to search
(more wikipedia style formatting)
m (linkify to archives and dashboard)
Line 1: Line 1:
[[File:FuckYeahArchiveBot.png|400px|right|thumb|A 90 gigabits/second spike in a bandwidth graph.]]
[[File:FuckYeahArchiveBot.png|400px|right|thumb|A 90 gigabits/second spike in a bandwidth graph.]]


'''ArchiveBot''' is an [[IRC]] bot designed to automate the archival of smaller websites (e.g. up to a few hundred thousand URLs).  You give it a URL to start at, and it grabs all content under that URL, [[Wget_with_WARC_output|records it in a WARC]], and then uploads that WARC to ArchiveTeam servers for eventual injection into the Internet Archive (or other archive sites).
'''ArchiveBot''' is an [[IRC]] bot designed to automate the archival of smaller websites (e.g. up to a few hundred thousand URLs).  You give it a URL to start at, and it grabs all content under that URL, [[Wget_with_WARC_output|records it in a WARC]], and then uploads that WARC to ArchiveTeam servers for eventual injection into the [https://archive.org/search.php?query=archivebot%20collection%3Aarchiveteam&sort=-publicdate Internet Archive] (or other archive sites).


== Details ==
== Details ==


To use ArchiveBot, drop by [http://chat.efnet.org:9090/?nick=&channels=%23archivebot&Login=Login #archivebot] on EFNet. To interact with ArchiveBot, you [https://raw2.github.com/ArchiveTeam/ArchiveBot/master/COMMANDS issue commands] by typing it into the channel. Note you will need channel operator permissions in order to issue archiving jobs.
To use ArchiveBot, drop by [http://chat.efnet.org:9090/?nick=&channels=%23archivebot&Login=Login #archivebot] on EFNet. To interact with ArchiveBot, you [https://raw2.github.com/ArchiveTeam/ArchiveBot/master/COMMANDS issue commands] by typing it into the channel. Note you will need channel operator permissions in order to issue archiving jobs. The [http://archivebot.at.ninjawedding.org:4567 dashboard] shows the sites being downloaded currently.


ArchiveBot's source code can be found at https://github.com/ArchiveTeam/ArchiveBot. [[Dev|Contributions welcomed]]! Any isssues or feature requests may be filed at [https://github.com/ArchiveTeam/ArchiveBot/issues the issue tracker].  
ArchiveBot's source code can be found at https://github.com/ArchiveTeam/ArchiveBot. [[Dev|Contributions welcomed]]! Any issues or feature requests may be filed at [https://github.com/ArchiveTeam/ArchiveBot/issues the issue tracker].  


Follow [https://twitter.com/atarchivebot @ATArchiveBot] on [[Twitter]]!
Follow [https://twitter.com/atarchivebot @ATArchiveBot] on [[Twitter]]!

Revision as of 01:19, 1 February 2014

A 90 gigabits/second spike in a bandwidth graph.

ArchiveBot is an IRC bot designed to automate the archival of smaller websites (e.g. up to a few hundred thousand URLs). You give it a URL to start at, and it grabs all content under that URL, records it in a WARC, and then uploads that WARC to ArchiveTeam servers for eventual injection into the Internet Archive (or other archive sites).

Details

To use ArchiveBot, drop by #archivebot on EFNet. To interact with ArchiveBot, you issue commands by typing it into the channel. Note you will need channel operator permissions in order to issue archiving jobs. The dashboard shows the sites being downloaded currently.

ArchiveBot's source code can be found at https://github.com/ArchiveTeam/ArchiveBot. Contributions welcomed! Any issues or feature requests may be filed at the issue tracker.

Follow @ATArchiveBot on Twitter!

More

Like ArchiveBot? Check out our homepage and other projects!