https://wiki.archiveteam.org/api.php?action=feedcontributions&user=Megalanya0&feedformat=atomArchiveteam - User contributions [en]2024-03-29T05:47:14ZUser contributionsMediaWiki 1.37.1https://wiki.archiveteam.org/index.php?title=JPG_Magazine&diff=27397JPG Magazine2017-01-16T15:50:50Z<p>Megalanya0: MOTHERFUCKER ! ! !</p>
<hr />
<div>{{Infobox project<br />
| title = JPG Magazine<br />
| url = http://www.jpgmag.com/<br />
| project_status = {{online}}<br />
| archiving_status = {{saved}}<br />
}}<br />
'''JPG Magazine''' is a collaborative photo magazine. It seems to have gained a little more steam from community support, but it's still advisable that we [http://www.jpgmag.com/downloads/archives.html download the archives].<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== Vital Signs ==<br />
<br />
[http://jpgmag.com/blog/2009/01/jpg_magazine_says_goodbye.html Announced it would shut down] on January 5, 2009. As of Janurary 2011, jpgmag.com is still accessible.<br />
<br />
== Who's Working On It? ==<br />
<br />
Lore has downloaded the archives.<br />
<br />
{{Navigation box}}</div>Megalanya0https://wiki.archiveteam.org/index.php?title=Livemocha&diff=27396Livemocha2017-01-16T15:50:41Z<p>Megalanya0: MOTHERFUCKER ! ! !</p>
<hr />
<div>== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==</div>Megalanya0https://wiki.archiveteam.org/index.php?title=Google_Groups_Files&diff=27395Google Groups Files2017-01-16T15:50:33Z<p>Megalanya0: MOTHERFUCKER ! ! !</p>
<hr />
<div>{{Infobox project<br />
| title = Google Groups Files<br />
| image = Googleparty.jpg<br />
| project_status = {{offline}}<br />
| archiving_status = {{saved}}<br />
}}<br />
<br />
Google is challenging AT again...<br />
<br />
This notice appears on Google Groups pages:<br />
<br />
----<br />
Zipped versions of the pages and files associated with this group will be available for download until August 31, 2011. After this date, this feature and the zip file downloads will be turned off permanently.<br />
----<br />
<br />
A [http://bazaar.launchpad.net/~ndurner/+junk/at-ggz/view/head:/ggroups_zipdl.sh script] is available that searches Google Groups directories and downloads the ZIP files of individual groups. The script uses a Google App Engine hosted app for coordination.<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== Script ==<br />
=== Requirements ===<br />
(ba)sh, wget, grep, curl<br />
<br />
=== Usage ===<br />
* Normal operation<br />
<pre><br />
./ggroups_zipdl.sh<br />
</pre><br />
<br />
* Discover only (no downloads to store)<br />
<pre><br />
./ggroups_zipdl.sh discover<br />
</pre><br />
<br />
* Download only (no discovery of new groups)<br />
<pre><br />
./ggroups_zipdl.sh download<br />
</pre><br />
<br />
=== Issues ===<br />
<br />
-<br />
[[Category:Google]]<br />
<br />
{{Navigation box}}</div>Megalanya0https://wiki.archiveteam.org/index.php?title=Google_Video_Warroom&diff=27394Google Video Warroom2017-01-16T15:50:22Z<p>Megalanya0: MOTHERFUCKER ! ! !</p>
<hr />
<div>{{Infobox project<br />
| title = Google Video<br />
| image = Video logo lg.gif<br />
| description = Google Video logo<br />
| URL = http://video.google.com<br />
| project_status = {{closed}} on 2011-04-29[http://video.google.com/support/bin/answer.py?answer=1233300&hl=en]<br />
| archiving_status = {{saved}}<br />
}}<br />
[[File:Papua videos.png|thumb|right|300px|Google Video results for "Papua New Guinea" keyword.]]<br />
<br />
'''"Gentlemen. You can't fight in here. This is the War Room!"''''<br />
<br />
If you want to help archive Google Video, get some machines running and join us in [[IRC]] (EFNet [irc://irc.efnet.org/archiveteam #archiveteam] / [irc://irc.efnet.org/googlegrape #googlegrape])<br />
<br />
The automatic scripts only work on FreeBSD, Linux, Solaris, Windows, OS X, and Cygwin.<br />
<br />
Anyone can help out, but we would *really* appreciate it if you'd use an *NIX system over any thoughts of doing it on a Windows system. If you however choose to pursue the Magical World of Windows - please make sure that what you are collecting is not damaged as a consequence of running it on a Windows system. <br />
<br />
In any case, the first thing to do is to please add your name/nickname to [http://piratepad.net/gv-participants this list], along with the storage and bandwidth you have available.<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
= Seed Lists =<br />
<br />
Please send any new seedlists to underscor on IRC, rather than embarking on them yourself. He'll add them to the listerine queue.<br />
<br />
* Original Lists: http://199.48.254.90/at/seeds/<br />
<br />
== Custom searches ==<br />
* PLEASE add your custom searches and their details to this table!<br />
* Words suggestions: public domain, subtitles<br />
* Words already in the table or added to the BOINC client: conference, hack, wiki, linux, creative commons, part, interview, documentary, talk, brain, civilization, evolution, future, language, literature, mind, money, neurolinguistic, singularity<br />
<br />
== Years ==<br />
[http://www.google.com/search?q=1900+site:video.google.com&tbm=vid 1900], [http://www.google.com/search?q=1901+site:video.google.com&tbm=vid 1901], [http://www.google.com/search?q=1902+site:video.google.com&tbm=vid 1902], [http://www.google.com/search?q=1903+site:video.google.com&tbm=vid 1903], [http://www.google.com/search?q=1904+site:video.google.com&tbm=vid 1904], [http://www.google.com/search?q=1905+site:video.google.com&tbm=vid 1905], [http://www.google.com/search?q=1906+site:video.google.com&tbm=vid 1906], [http://www.google.com/search?q=1907+site:video.google.com&tbm=vid 1907], [http://www.google.com/search?q=1908+site:video.google.com&tbm=vid 1908], [http://www.google.com/search?q=1909+site:video.google.com&tbm=vid 1909], [http://www.google.com/search?q=1910+site:video.google.com&tbm=vid 1910], [http://www.google.com/search?q=1911+site:video.google.com&tbm=vid 1911], [http://www.google.com/search?q=1912+site:video.google.com&tbm=vid 1912], [http://www.google.com/search?q=1913+site:video.google.com&tbm=vid 1913], [http://www.google.com/search?q=1914+site:video.google.com&tbm=vid 1914], [http://www.google.com/search?q=1915+site:video.google.com&tbm=vid 1915], [http://www.google.com/search?q=1916+site:video.google.com&tbm=vid 1916], [http://www.google.com/search?q=1917+site:video.google.com&tbm=vid 1917], [http://www.google.com/search?q=1918+site:video.google.com&tbm=vid 1918], [http://www.google.com/search?q=1919+site:video.google.com&tbm=vid 1919], [http://www.google.com/search?q=1920+site:video.google.com&tbm=vid 1920], [http://www.google.com/search?q=1921+site:video.google.com&tbm=vid 1921], [http://www.google.com/search?q=1922+site:video.google.com&tbm=vid 1922], [http://www.google.com/search?q=1923+site:video.google.com&tbm=vid 1923], [http://www.google.com/search?q=1924+site:video.google.com&tbm=vid 1924], [http://www.google.com/search?q=1925+site:video.google.com&tbm=vid 1925], [http://www.google.com/search?q=1926+site:video.google.com&tbm=vid 1926], [http://www.google.com/search?q=1927+site:video.google.com&tbm=vid 1927], [http://www.google.com/search?q=1928+site:video.google.com&tbm=vid 1928], [http://www.google.com/search?q=1929+site:video.google.com&tbm=vid 1929], [http://www.google.com/search?q=1930+site:video.google.com&tbm=vid 1930], [http://www.google.com/search?q=1931+site:video.google.com&tbm=vid 1931], [http://www.google.com/search?q=1932+site:video.google.com&tbm=vid 1932], [http://www.google.com/search?q=1933+site:video.google.com&tbm=vid 1933], [http://www.google.com/search?q=1934+site:video.google.com&tbm=vid 1934], [http://www.google.com/search?q=1935+site:video.google.com&tbm=vid 1935], [http://www.google.com/search?q=1936+site:video.google.com&tbm=vid 1936], [http://www.google.com/search?q=1937+site:video.google.com&tbm=vid 1937], [http://www.google.com/search?q=1938+site:video.google.com&tbm=vid 1938], [http://www.google.com/search?q=1939+site:video.google.com&tbm=vid 1939], [http://www.google.com/search?q=1940+site:video.google.com&tbm=vid 1940], [http://www.google.com/search?q=1941+site:video.google.com&tbm=vid 1941], [http://www.google.com/search?q=1942+site:video.google.com&tbm=vid 1942], [http://www.google.com/search?q=1943+site:video.google.com&tbm=vid 1943], [http://www.google.com/search?q=1944+site:video.google.com&tbm=vid 1944], [http://www.google.com/search?q=1945+site:video.google.com&tbm=vid 1945], [http://www.google.com/search?q=1946+site:video.google.com&tbm=vid 1946], [http://www.google.com/search?q=1947+site:video.google.com&tbm=vid 1947], [http://www.google.com/search?q=1948+site:video.google.com&tbm=vid 1948], [http://www.google.com/search?q=1949+site:video.google.com&tbm=vid 1949], [http://www.google.com/search?q=1950+site:video.google.com&tbm=vid 1950], [http://www.google.com/search?q=1951+site:video.google.com&tbm=vid 1951], [http://www.google.com/search?q=1952+site:video.google.com&tbm=vid 1952], [http://www.google.com/search?q=1953+site:video.google.com&tbm=vid 1953], [http://www.google.com/search?q=1954+site:video.google.com&tbm=vid 1954], [http://www.google.com/search?q=1955+site:video.google.com&tbm=vid 1955], [http://www.google.com/search?q=1956+site:video.google.com&tbm=vid 1956], [http://www.google.com/search?q=1957+site:video.google.com&tbm=vid 1957], [http://www.google.com/search?q=1958+site:video.google.com&tbm=vid 1958], [http://www.google.com/search?q=1959+site:video.google.com&tbm=vid 1959], [http://www.google.com/search?q=1960+site:video.google.com&tbm=vid 1960], [http://www.google.com/search?q=1961+site:video.google.com&tbm=vid 1961], [http://www.google.com/search?q=1962+site:video.google.com&tbm=vid 1962], [http://www.google.com/search?q=1963+site:video.google.com&tbm=vid 1963], [http://www.google.com/search?q=1964+site:video.google.com&tbm=vid 1964], [http://www.google.com/search?q=1965+site:video.google.com&tbm=vid 1965], [http://www.google.com/search?q=1966+site:video.google.com&tbm=vid 1966], [http://www.google.com/search?q=1967+site:video.google.com&tbm=vid 1967], [http://www.google.com/search?q=1968+site:video.google.com&tbm=vid 1968], [http://www.google.com/search?q=1969+site:video.google.com&tbm=vid 1969], [http://www.google.com/search?q=1970+site:video.google.com&tbm=vid 1970], [http://www.google.com/search?q=1971+site:video.google.com&tbm=vid 1971], [http://www.google.com/search?q=1972+site:video.google.com&tbm=vid 1972], [http://www.google.com/search?q=1973+site:video.google.com&tbm=vid 1973], [http://www.google.com/search?q=1974+site:video.google.com&tbm=vid 1974], [http://www.google.com/search?q=1975+site:video.google.com&tbm=vid 1975], [http://www.google.com/search?q=1976+site:video.google.com&tbm=vid 1976], [http://www.google.com/search?q=1977+site:video.google.com&tbm=vid 1977], [http://www.google.com/search?q=1978+site:video.google.com&tbm=vid 1978], [http://www.google.com/search?q=1979+site:video.google.com&tbm=vid 1979], [http://www.google.com/search?q=1980+site:video.google.com&tbm=vid 1980], [http://www.google.com/search?q=1981+site:video.google.com&tbm=vid 1981], [http://www.google.com/search?q=1982+site:video.google.com&tbm=vid 1982], [http://www.google.com/search?q=1983+site:video.google.com&tbm=vid 1983], [http://www.google.com/search?q=1984+site:video.google.com&tbm=vid 1984], [http://www.google.com/search?q=1985+site:video.google.com&tbm=vid 1985], [http://www.google.com/search?q=1986+site:video.google.com&tbm=vid 1986], [http://www.google.com/search?q=1987+site:video.google.com&tbm=vid 1987], [http://www.google.com/search?q=1988+site:video.google.com&tbm=vid 1988], [http://www.google.com/search?q=1989+site:video.google.com&tbm=vid 1989], [http://www.google.com/search?q=1990+site:video.google.com&tbm=vid 1990], [http://www.google.com/search?q=1991+site:video.google.com&tbm=vid 1991], [http://www.google.com/search?q=1992+site:video.google.com&tbm=vid 1992], [http://www.google.com/search?q=1993+site:video.google.com&tbm=vid 1993], [http://www.google.com/search?q=1994+site:video.google.com&tbm=vid 1994], [http://www.google.com/search?q=1995+site:video.google.com&tbm=vid 1995], [http://www.google.com/search?q=1996+site:video.google.com&tbm=vid 1996], [http://www.google.com/search?q=1997+site:video.google.com&tbm=vid 1997], [http://www.google.com/search?q=1998+site:video.google.com&tbm=vid 1998], [http://www.google.com/search?q=1999+site:video.google.com&tbm=vid 1999]<br />
<br />
== Countries ==<br />
[http://www.google.com/search?q=AFGHANISTAN+site:video.google.com&tbm=vid AFGHANISTAN], [http://www.google.com/search?q=ÅLAND+ISLANDS+site:video.google.com&tbm=vid ÅLAND+ISLANDS], [http://www.google.com/search?q=ALBANIA+site:video.google.com&tbm=vid ALBANIA], [http://www.google.com/search?q=ALGERIA+site:video.google.com&tbm=vid ALGERIA], [http://www.google.com/search?q=AMERICAN+SAMOA+site:video.google.com&tbm=vid AMERICAN+SAMOA], [http://www.google.com/search?q=ANDORRA+site:video.google.com&tbm=vid ANDORRA], [http://www.google.com/search?q=ANGOLA+site:video.google.com&tbm=vid ANGOLA], [http://www.google.com/search?q=ANGUILLA+site:video.google.com&tbm=vid ANGUILLA], [http://www.google.com/search?q=ANTARCTICA+site:video.google.com&tbm=vid ANTARCTICA], [http://www.google.com/search?q=ANTIGUA+AND+BARBUDA+site:video.google.com&tbm=vid ANTIGUA+AND+BARBUDA], [http://www.google.com/search?q=ARGENTINA+site:video.google.com&tbm=vid ARGENTINA], [http://www.google.com/search?q=ARMENIA+site:video.google.com&tbm=vid ARMENIA], [http://www.google.com/search?q=ARUBA+site:video.google.com&tbm=vid ARUBA], [http://www.google.com/search?q=AUSTRALIA+site:video.google.com&tbm=vid AUSTRALIA], [http://www.google.com/search?q=AUSTRIA+site:video.google.com&tbm=vid AUSTRIA], [http://www.google.com/search?q=AZERBAIJAN+site:video.google.com&tbm=vid AZERBAIJAN], [http://www.google.com/search?q=BAHAMAS+site:video.google.com&tbm=vid BAHAMAS], [http://www.google.com/search?q=BAHRAIN+site:video.google.com&tbm=vid BAHRAIN], [http://www.google.com/search?q=BANGLADESH+site:video.google.com&tbm=vid BANGLADESH], [http://www.google.com/search?q=BARBADOS+site:video.google.com&tbm=vid BARBADOS], [http://www.google.com/search?q=BELARUS+site:video.google.com&tbm=vid BELARUS], [http://www.google.com/search?q=BELGIUM+site:video.google.com&tbm=vid BELGIUM], [http://www.google.com/search?q=BELIZE+site:video.google.com&tbm=vid BELIZE], [http://www.google.com/search?q=BENIN+site:video.google.com&tbm=vid BENIN], [http://www.google.com/search?q=BERMUDA+site:video.google.com&tbm=vid BERMUDA], [http://www.google.com/search?q=BHUTAN+site:video.google.com&tbm=vid BHUTAN], [http://www.google.com/search?q=BOLIVIA,+PLURINATIONAL+STATE+OF+site:video.google.com&tbm=vid BOLIVIA,+PLURINATIONAL+STATE+OF], [http://www.google.com/search?q=BONAIRE,+SAINT+EUSTATIUS+AND+SABA+site:video.google.com&tbm=vid BONAIRE,+SAINT+EUSTATIUS+AND+SABA], [http://www.google.com/search?q=BOSNIA+AND+HERZEGOVINA+site:video.google.com&tbm=vid BOSNIA+AND+HERZEGOVINA], [http://www.google.com/search?q=BOTSWANA+site:video.google.com&tbm=vid BOTSWANA], [http://www.google.com/search?q=BOUVET+ISLAND+site:video.google.com&tbm=vid BOUVET+ISLAND], [http://www.google.com/search?q=BRAZIL+site:video.google.com&tbm=vid BRAZIL], [http://www.google.com/search?q=BRITISH+INDIAN+OCEAN+TERRITORY+site:video.google.com&tbm=vid BRITISH+INDIAN+OCEAN+TERRITORY], [http://www.google.com/search?q=BRUNEI+DARUSSALAM+site:video.google.com&tbm=vid BRUNEI+DARUSSALAM], [http://www.google.com/search?q=BULGARIA+site:video.google.com&tbm=vid BULGARIA], [http://www.google.com/search?q=BURKINA+FASO+site:video.google.com&tbm=vid BURKINA+FASO], [http://www.google.com/search?q=BURUNDI+site:video.google.com&tbm=vid BURUNDI], <br />
[http://www.google.com/search?q=CAMBODIA+site:video.google.com&tbm=vid CAMBODIA], [http://www.google.com/search?q=CAMEROON+site:video.google.com&tbm=vid CAMEROON], [http://www.google.com/search?q=CANADA+site:video.google.com&tbm=vid CANADA], [http://www.google.com/search?q=CAPE+VERDE+site:video.google.com&tbm=vid CAPE+VERDE], [http://www.google.com/search?q=CAYMAN+ISLANDS+site:video.google.com&tbm=vid CAYMAN+ISLANDS], [http://www.google.com/search?q=CENTRAL+AFRICAN+REPUBLIC+site:video.google.com&tbm=vid CENTRAL+AFRICAN+REPUBLIC], [http://www.google.com/search?q=CHAD+site:video.google.com&tbm=vid CHAD], [http://www.google.com/search?q=CHILE+site:video.google.com&tbm=vid CHILE], [http://www.google.com/search?q=CHINA+site:video.google.com&tbm=vid CHINA], [http://www.google.com/search?q=CHRISTMAS+ISLAND+site:video.google.com&tbm=vid CHRISTMAS+ISLAND], [http://www.google.com/search?q=COCOS+(KEELING)+ISLANDS+site:video.google.com&tbm=vid COCOS+(KEELING)+ISLANDS], [http://www.google.com/search?q=COLOMBIA+site:video.google.com&tbm=vid COLOMBIA], [http://www.google.com/search?q=COMOROS+site:video.google.com&tbm=vid COMOROS], [http://www.google.com/search?q=CONGO+site:video.google.com&tbm=vid CONGO], [http://www.google.com/search?q=CONGO+site:video.google.com&tbm=vid CONGO], [http://www.google.com/search?q=COOK+ISLANDS+site:video.google.com&tbm=vid COOK+ISLANDS], [http://www.google.com/search?q=COSTA+RICA+site:video.google.com&tbm=vid COSTA+RICA], [http://www.google.com/search?q=CÔTE+D'IVOIRE+site:video.google.com&tbm=vid CÔTE+D'IVOIRE], [http://www.google.com/search?q=CROATIA+site:video.google.com&tbm=vid CROATIA], [http://www.google.com/search?q=CUBA+site:video.google.com&tbm=vid CUBA], [http://www.google.com/search?q=CURAÇAO+site:video.google.com&tbm=vid CURAÇAO], [http://www.google.com/search?q=CYPRUS+site:video.google.com&tbm=vid CYPRUS], [http://www.google.com/search?q=CZECH+REPUBLIC+site:video.google.com&tbm=vid CZECH+REPUBLIC], [http://www.google.com/search?q=DENMARK+site:video.google.com&tbm=vid DENMARK], [http://www.google.com/search?q=DJIBOUTI+site:video.google.com&tbm=vid DJIBOUTI], [http://www.google.com/search?q=DOMINICA+site:video.google.com&tbm=vid DOMINICA], <br />
[http://www.google.com/search?q=DOMINICAN+REPUBLIC+site:video.google.com&tbm=vid DOMINICAN+REPUBLIC], [http://www.google.com/search?q=ECUADOR+site:video.google.com&tbm=vid ECUADOR], [http://www.google.com/search?q=EGYPT+site:video.google.com&tbm=vid EGYPT], [http://www.google.com/search?q=EL+SALVADOR+site:video.google.com&tbm=vid EL+SALVADOR], [http://www.google.com/search?q=EQUATORIAL+GUINEA+site:video.google.com&tbm=vid EQUATORIAL+GUINEA], [http://www.google.com/search?q=ERITREA+site:video.google.com&tbm=vid ERITREA], [http://www.google.com/search?q=ESTONIA+site:video.google.com&tbm=vid ESTONIA], [http://www.google.com/search?q=ETHIOPIA+site:video.google.com&tbm=vid ETHIOPIA], [http://www.google.com/search?q=FALKLAND+ISLANDS+(MALVINAS)+site:video.google.com&tbm=vid FALKLAND+ISLANDS+(MALVINAS)], [http://www.google.com/search?q=FAROE+ISLANDS+site:video.google.com&tbm=vid FAROE+ISLANDS], [http://www.google.com/search?q=FIJI+site:video.google.com&tbm=vid FIJI], [http://www.google.com/search?q=FINLAND+site:video.google.com&tbm=vid FINLAND], [http://www.google.com/search?q=FRANCE+site:video.google.com&tbm=vid FRANCE], [http://www.google.com/search?q=FRENCH+GUIANA+site:video.google.com&tbm=vid FRENCH+GUIANA], [http://www.google.com/search?q=FRENCH+POLYNESIA+site:video.google.com&tbm=vid FRENCH+POLYNESIA], [http://www.google.com/search?q=FRENCH+SOUTHERN+TERRITORIES+site:video.google.com&tbm=vid FRENCH+SOUTHERN+TERRITORIES], [http://www.google.com/search?q=GABON+site:video.google.com&tbm=vid GABON], [http://www.google.com/search?q=GAMBIA+site:video.google.com&tbm=vid GAMBIA], [http://www.google.com/search?q=GEORGIA+site:video.google.com&tbm=vid GEORGIA], [http://www.google.com/search?q=GERMANY+site:video.google.com&tbm=vid GERMANY], [http://www.google.com/search?q=GHANA+site:video.google.com&tbm=vid GHANA], [http://www.google.com/search?q=GIBRALTAR+site:video.google.com&tbm=vid GIBRALTAR], [http://www.google.com/search?q=GREECE+site:video.google.com&tbm=vid GREECE], [http://www.google.com/search?q=GREENLAND+site:video.google.com&tbm=vid GREENLAND], [http://www.google.com/search?q=GRENADA+site:video.google.com&tbm=vid GRENADA], [http://www.google.com/search?q=GUADELOUPE+site:video.google.com&tbm=vid GUADELOUPE], [http://www.google.com/search?q=GUAM+site:video.google.com&tbm=vid GUAM], [http://www.google.com/search?q=GUATEMALA+site:video.google.com&tbm=vid GUATEMALA], [http://www.google.com/search?q=GUERNSEY+site:video.google.com&tbm=vid GUERNSEY], [http://www.google.com/search?q=GUINEA+site:video.google.com&tbm=vid GUINEA], [http://www.google.com/search?q=GUINEA-BISSAU+site:video.google.com&tbm=vid GUINEA-BISSAU], [http://www.google.com/search?q=GUYANA+site:video.google.com&tbm=vid GUYANA], [http://www.google.com/search?q=HAITI+site:video.google.com&tbm=vid HAITI], [http://www.google.com/search?q=HEARD+ISLAND+AND+MCDONALD+ISLANDS+site:video.google.com&tbm=vid HEARD+ISLAND+AND+MCDONALD+ISLANDS], [http://www.google.com/search?q=HOLY+SEE+(VATICAN+CITY+STATE)+site:video.google.com&tbm=vid HOLY+SEE+(VATICAN+CITY+STATE)], [http://www.google.com/search?q=HONDURAS+site:video.google.com&tbm=vid HONDURAS], [http://www.google.com/search?q=HONG+KONG+site:video.google.com&tbm=vid HONG+KONG], [http://www.google.com/search?q=HUNGARY+site:video.google.com&tbm=vid HUNGARY], [http://www.google.com/search?q=ICELAND+site:video.google.com&tbm=vid ICELAND], <br />
[http://www.google.com/search?q=INDIA+site:video.google.com&tbm=vid INDIA], [http://www.google.com/search?q=INDONESIA+site:video.google.com&tbm=vid INDONESIA], [http://www.google.com/search?q=IRAN+site:video.google.com&tbm=vid IRAN], [http://www.google.com/search?q=IRAQ+site:video.google.com&tbm=vid IRAQ], [http://www.google.com/search?q=IRELAND+site:video.google.com&tbm=vid IRELAND], [http://www.google.com/search?q=ISLE+OF+MAN+site:video.google.com&tbm=vid ISLE+OF+MAN], [http://www.google.com/search?q=ISRAEL+site:video.google.com&tbm=vid ISRAEL], [http://www.google.com/search?q=ITALY+site:video.google.com&tbm=vid ITALY], [http://www.google.com/search?q=JAMAICA+site:video.google.com&tbm=vid JAMAICA], [http://www.google.com/search?q=JAPAN+site:video.google.com&tbm=vid JAPAN], [http://www.google.com/search?q=JERSEY+site:video.google.com&tbm=vid JERSEY], [http://www.google.com/search?q=JORDAN+site:video.google.com&tbm=vid JORDAN], [http://www.google.com/search?q=KAZAKHSTAN+site:video.google.com&tbm=vid KAZAKHSTAN], [http://www.google.com/search?q=KENYA+site:video.google.com&tbm=vid KENYA], [http://www.google.com/search?q=KIRIBATI+site:video.google.com&tbm=vid KIRIBATI], [http://www.google.com/search?q=KOREA+site:video.google.com&tbm=vid KOREA], [http://www.google.com/search?q=KUWAIT+site:video.google.com&tbm=vid KUWAIT], [http://www.google.com/search?q=KYRGYZSTAN+site:video.google.com&tbm=vid KYRGYZSTAN], [http://www.google.com/search?q=LAO+site:video.google.com&tbm=vid LAO], [http://www.google.com/search?q=LATVIA+site:video.google.com&tbm=vid LATVIA], [http://www.google.com/search?q=LEBANON+site:video.google.com&tbm=vid LEBANON], [http://www.google.com/search?q=LESOTHO+site:video.google.com&tbm=vid LESOTHO], [http://www.google.com/search?q=LIBERIA+site:video.google.com&tbm=vid LIBERIA], [http://www.google.com/search?q=LIBYAN+ARAB+JAMAHIRIYA+site:video.google.com&tbm=vid LIBYAN+ARAB+JAMAHIRIYA], [http://www.google.com/search?q=LIECHTENSTEIN+site:video.google.com&tbm=vid LIECHTENSTEIN], [http://www.google.com/search?q=LITHUANIA+site:video.google.com&tbm=vid LITHUANIA], [http://www.google.com/search?q=LUXEMBOURG+site:video.google.com&tbm=vid LUXEMBOURG], [http://www.google.com/search?q=MACAO+site:video.google.com&tbm=vid MACAO], [http://www.google.com/search?q=MACEDONIA+site:video.google.com&tbm=vid MACEDONIA], [http://www.google.com/search?q=MADAGASCAR+site:video.google.com&tbm=vid MADAGASCAR], [http://www.google.com/search?q=MALAWI+site:video.google.com&tbm=vid MALAWI], [http://www.google.com/search?q=MALAYSIA+site:video.google.com&tbm=vid MALAYSIA], [http://www.google.com/search?q=MALDIVES+site:video.google.com&tbm=vid MALDIVES], [http://www.google.com/search?q=MALI+site:video.google.com&tbm=vid MALI], [http://www.google.com/search?q=MALTA+site:video.google.com&tbm=vid MALTA], [http://www.google.com/search?q=MARSHALL+ISLANDS+site:video.google.com&tbm=vid MARSHALL+ISLANDS], [http://www.google.com/search?q=MARTINIQUE+site:video.google.com&tbm=vid MARTINIQUE], [http://www.google.com/search?q=MAURITANIA+site:video.google.com&tbm=vid MAURITANIA], [http://www.google.com/search?q=MAURITIUS+site:video.google.com&tbm=vid MAURITIUS], [http://www.google.com/search?q=MAYOTTE+site:video.google.com&tbm=vid MAYOTTE], [http://www.google.com/search?q=MEXICO+site:video.google.com&tbm=vid MEXICO], [http://www.google.com/search?q=MICRONESIA,+FEDERATED+STATES+OF+site:video.google.com&tbm=vid MICRONESIA,+FEDERATED+STATES+OF], [http://www.google.com/search?q=MOLDOVA+site:video.google.com&tbm=vid MOLDOVA], [http://www.google.com/search?q=MONACO+site:video.google.com&tbm=vid MONACO], [http://www.google.com/search?q=MONGOLIA+site:video.google.com&tbm=vid MONGOLIA], [http://www.google.com/search?q=MONTENEGRO+site:video.google.com&tbm=vid MONTENEGRO], [http://www.google.com/search?q=MONTSERRAT+site:video.google.com&tbm=vid MONTSERRAT], [http://www.google.com/search?q=MOROCCO+site:video.google.com&tbm=vid MOROCCO], [http://www.google.com/search?q=MOZAMBIQUE+site:video.google.com&tbm=vid MOZAMBIQUE], [http://www.google.com/search?q=MYANMAR+site:video.google.com&tbm=vid MYANMAR], [http://www.google.com/search?q=NAMIBIA+site:video.google.com&tbm=vid NAMIBIA],<br />
[http://www.google.com/search?q=NAURU+site:video.google.com&tbm=vid NAURU], [http://www.google.com/search?q=NEPAL+site:video.google.com&tbm=vid NEPAL], [http://www.google.com/search?q=NETHERLANDS+site:video.google.com&tbm=vid NETHERLANDS], [http://www.google.com/search?q=NEW+CALEDONIA+site:video.google.com&tbm=vid NEW+CALEDONIA], [http://www.google.com/search?q=NEW+ZEALAND+site:video.google.com&tbm=vid NEW+ZEALAND], [http://www.google.com/search?q=NICARAGUA+site:video.google.com&tbm=vid NICARAGUA], [http://www.google.com/search?q=NIGER+site:video.google.com&tbm=vid NIGER], [http://www.google.com/search?q=NIGERIA+site:video.google.com&tbm=vid NIGERIA], [http://www.google.com/search?q=NIUE+site:video.google.com&tbm=vid NIUE], [http://www.google.com/search?q=NORFOLK+ISLAND+site:video.google.com&tbm=vid NORFOLK+ISLAND], [http://www.google.com/search?q=NORTHERN+MARIANA+ISLANDS+site:video.google.com&tbm=vid NORTHERN+MARIANA+ISLANDS], [http://www.google.com/search?q=NORWAY+site:video.google.com&tbm=vid NORWAY], [http://www.google.com/search?q=OMAN+site:video.google.com&tbm=vid OMAN], [http://www.google.com/search?q=PAKISTAN+site:video.google.com&tbm=vid PAKISTAN], [http://www.google.com/search?q=PALAU+site:video.google.com&tbm=vid PALAU], [http://www.google.com/search?q=PALESTINIAN+TERRITORY,+OCCUPIED+site:video.google.com&tbm=vid PALESTINIAN+TERRITORY,+OCCUPIED], [http://www.google.com/search?q=PANAMA+site:video.google.com&tbm=vid PANAMA], [http://www.google.com/search?q=PAPUA+NEW+GUINEA+site:video.google.com&tbm=vid PAPUA+NEW+GUINEA], [http://www.google.com/search?q=PARAGUAY+site:video.google.com&tbm=vid PARAGUAY], [http://www.google.com/search?q=PERU+site:video.google.com&tbm=vid PERU], [http://www.google.com/search?q=PHILIPPINES+site:video.google.com&tbm=vid PHILIPPINES], [http://www.google.com/search?q=PITCAIRN+site:video.google.com&tbm=vid PITCAIRN], [http://www.google.com/search?q=POLAND+site:video.google.com&tbm=vid POLAND], [http://www.google.com/search?q=PORTUGAL+site:video.google.com&tbm=vid PORTUGAL], [http://www.google.com/search?q=PUERTO+RICO+site:video.google.com&tbm=vid PUERTO+RICO], [http://www.google.com/search?q=QATAR+site:video.google.com&tbm=vid QATAR], [http://www.google.com/search?q=RÉUNION+site:video.google.com&tbm=vid RÉUNION], [http://www.google.com/search?q=ROMANIA+site:video.google.com&tbm=vid ROMANIA], [http://www.google.com/search?q=RUSSIAN+FEDERATION+site:video.google.com&tbm=vid RUSSIAN+FEDERATION], [http://www.google.com/search?q=RWANDA+site:video.google.com&tbm=vid RWANDA], [http://www.google.com/search?q=SAINT+BARTHÉLEMY+site:video.google.com&tbm=vid SAINT+BARTHÉLEMY], [http://www.google.com/search?q=SAINT+HELENA,+ASCENSION+AND+TRISTAN+DA+CUNHA+site:video.google.com&tbm=vid SAINT+HELENA,+ASCENSION+AND+TRISTAN+DA+CUNHA], [http://www.google.com/search?q=SAINT+KITTS+AND+NEVIS+site:video.google.com&tbm=vid SAINT+KITTS+AND+NEVIS], [http://www.google.com/search?q=SAINT+LUCIA+site:video.google.com&tbm=vid SAINT+LUCIA], [http://www.google.com/search?q=SAINT+MARTIN+(FRENCH+PART)+site:video.google.com&tbm=vid SAINT+MARTIN+(FRENCH+PART)], [http://www.google.com/search?q=SAINT+PIERRE+AND+MIQUELON+site:video.google.com&tbm=vid SAINT+PIERRE+AND+MIQUELON], [http://www.google.com/search?q=SAINT+VINCENT+AND+THE+GRENADINES+site:video.google.com&tbm=vid SAINT+VINCENT+AND+THE+GRENADINES], [http://www.google.com/search?q=SAMOA+site:video.google.com&tbm=vid SAMOA], [http://www.google.com/search?q=SAN+MARINO+site:video.google.com&tbm=vid SAN+MARINO], [http://www.google.com/search?q=SAO+TOME+AND+PRINCIPE+site:video.google.com&tbm=vid SAO+TOME+AND+PRINCIPE], [http://www.google.com/search?q=SAUDI+ARABIA+site:video.google.com&tbm=vid SAUDI+ARABIA], [http://www.google.com/search?q=SENEGAL+site:video.google.com&tbm=vid SENEGAL], [http://www.google.com/search?q=SERBIA+site:video.google.com&tbm=vid SERBIA], [http://www.google.com/search?q=SEYCHELLES+site:video.google.com&tbm=vid SEYCHELLES], [http://www.google.com/search?q=SIERRA+LEONE+site:video.google.com&tbm=vid SIERRA+LEONE], [http://www.google.com/search?q=SINGAPORE+site:video.google.com&tbm=vid SINGAPORE], [http://www.google.com/search?q=SINT+MAARTEN+(DUTCH+PART)+site:video.google.com&tbm=vid SINT+MAARTEN+(DUTCH+PART)], [http://www.google.com/search?q=SLOVAKIA+site:video.google.com&tbm=vid SLOVAKIA], [http://www.google.com/search?q=SLOVENIA+site:video.google.com&tbm=vid SLOVENIA], [http://www.google.com/search?q=SOLOMON+ISLANDS+site:video.google.com&tbm=vid SOLOMON+ISLANDS], [http://www.google.com/search?q=SOMALIA+site:video.google.com&tbm=vid SOMALIA], [http://www.google.com/search?q=SOUTH+AFRICA+site:video.google.com&tbm=vid SOUTH+AFRICA], [http://www.google.com/search?q=SOUTH+GEORGIA+AND+THE+SOUTH+SANDWICH+ISLANDS+site:video.google.com&tbm=vid SOUTH+GEORGIA+AND+THE+SOUTH+SANDWICH+ISLANDS], [http://www.google.com/search?q=SPAIN+site:video.google.com&tbm=vid SPAIN], [http://www.google.com/search?q=SRI+LANKA+site:video.google.com&tbm=vid SRI+LANKA], [http://www.google.com/search?q=SUDAN+site:video.google.com&tbm=vid SUDAN], [http://www.google.com/search?q=SURINAME+site:video.google.com&tbm=vid SURINAME], [http://www.google.com/search?q=SVALBARD+AND+JAN+MAYEN+site:video.google.com&tbm=vid SVALBARD+AND+JAN+MAYEN], [http://www.google.com/search?q=SWAZILAND+site:video.google.com&tbm=vid SWAZILAND], [http://www.google.com/search?q=SWEDEN+site:video.google.com&tbm=vid SWEDEN], [http://www.google.com/search?q=SWITZERLAND+site:video.google.com&tbm=vid SWITZERLAND], [http://www.google.com/search?q=SYRIA+site:video.google.com&tbm=vid SYRIA], [http://www.google.com/search?q=TAIWAN+site:video.google.com&tbm=vid TAIWAN], [http://www.google.com/search?q=TAJIKISTAN+site:video.google.com&tbm=vid TAJIKISTAN], [http://www.google.com/search?q=TANZANIA+site:video.google.com&tbm=vid TANZANIA], [http://www.google.com/search?q=THAILAND+site:video.google.com&tbm=vid THAILAND], [http://www.google.com/search?q=TIMOR-LESTE+site:video.google.com&tbm=vid TIMOR-LESTE], [http://www.google.com/search?q=TOGO+site:video.google.com&tbm=vid TOGO], [http://www.google.com/search?q=TOKELAU+site:video.google.com&tbm=vid TOKELAU], [http://www.google.com/search?q=TONGA+site:video.google.com&tbm=vid TONGA], [http://www.google.com/search?q=TRINIDAD+AND+TOBAGO+site:video.google.com&tbm=vid TRINIDAD+AND+TOBAGO], [http://www.google.com/search?q=TUNISIA+site:video.google.com&tbm=vid TUNISIA], [http://www.google.com/search?q=TURKEY+site:video.google.com&tbm=vid TURKEY], [http://www.google.com/search?q=TURKMENISTAN+site:video.google.com&tbm=vid TURKMENISTAN], [http://www.google.com/search?q=TURKS+AND+CAICOS+ISLANDS+site:video.google.com&tbm=vid TURKS+AND+CAICOS+ISLANDS], [http://www.google.com/search?q=TUVALU+site:video.google.com&tbm=vid TUVALU], [http://www.google.com/search?q=UGANDA+site:video.google.com&tbm=vid UGANDA], [http://www.google.com/search?q=UKRAINE+site:video.google.com&tbm=vid UKRAINE], [http://www.google.com/search?q=UNITED+ARAB+EMIRATES+site:video.google.com&tbm=vid UNITED+ARAB+EMIRATES], [http://www.google.com/search?q=UNITED+KINGDOM+site:video.google.com&tbm=vid UNITED+KINGDOM], [http://www.google.com/search?q=UNITED+STATES+site:video.google.com&tbm=vid UNITED+STATES], [http://www.google.com/search?q=UNITED+STATES+MINOR+OUTLYING+ISLANDS+site:video.google.com&tbm=vid UNITED+STATES+MINOR+OUTLYING+ISLANDS], [http://www.google.com/search?q=URUGUAY+site:video.google.com&tbm=vid URUGUAY], [http://www.google.com/search?q=UZBEKISTAN+site:video.google.com&tbm=vid UZBEKISTAN], [http://www.google.com/search?q=VANUATU+site:video.google.com&tbm=vid VANUATU], [http://www.google.com/search?q=VENEZUELA+site:video.google.com&tbm=vid VENEZUELA], [http://www.google.com/search?q=VIETNAM+site:video.google.com&tbm=vid VIETNAM], [http://www.google.com/search?q=VIRGIN+ISLANDS,+BRITISH+site:video.google.com&tbm=vid VIRGIN+ISLANDS,+BRITISH], [http://www.google.com/search?q=VIRGIN+ISLANDS,+U.S.+site:video.google.com&tbm=vid VIRGIN+ISLANDS,+U.S.], [http://www.google.com/search?q=WALLIS+AND+FUTUNA+site:video.google.com&tbm=vid WALLIS+AND+FUTUNA], [http://www.google.com/search?q=WESTERN+SAHARA+site:video.google.com&tbm=vid WESTERN+SAHARA], [http://www.google.com/search?q=YEMEN+site:video.google.com&tbm=vid YEMEN], [http://www.google.com/search?q=ZAMBIA+site:video.google.com&tbm=vid ZAMBIA], [http://www.google.com/search?q=ZIMBABWE+site:video.google.com&tbm=vid ZIMBABWE]<br />
<br />
= Progress =<br />
<br />
The following table describes the outcome of various seedlists. For the latest Listerine statistics, see [[#Get_Involved_With_Listerine]].<br />
<br />
'''Legend'''<br />
{| class="wikitable"<br />
| style="background: grey" |&nbsp;&nbsp;&nbsp;&nbsp;<br />
| Uploaded to Archive.org<br />
|-<br />
| style="background: green" |&nbsp;&nbsp;&nbsp;&nbsp;<br />
| Done/Complete with no errors<br />
|-<br />
| style="background: orange" |&nbsp;&nbsp;&nbsp;&nbsp;<br />
| Done/Complete ''with'' errors<br />
|-<br />
| style="background: yellow" |&nbsp;&nbsp;&nbsp;&nbsp;<br />
| In progress<br />
|-<br />
| style="background: turquoise" |&nbsp;&nbsp;&nbsp;&nbsp;<br />
| Partially claimed and in progress<br />
|-<br />
| style="background: red" |&nbsp;&nbsp;&nbsp;&nbsp;<br />
| Not claimed<br />
|-<br />
| style="background: purple" |&nbsp;&nbsp;&nbsp;&nbsp;<br />
| Moved to listerine<br />
|-<br />
|&nbsp;&nbsp;&nbsp;&nbsp;<br />
| Unknown status (If you know please edit)<br />
|}<br />
<br />
<center><br />
{| class="wikitable" style="text-align: center;"<br />
! Seed list !! Videos (lines) !! Downloaders !! Progress and SIZE <br />
|-<br />
| [http://bit.ly/i7CJ2h seed_videos_rhistory ] || 6949 || Jade Falcon || style="background: purple" | 7 chunks with 1000 videos each <br> ndurner: aa<br>Jade Falcon: downloading...<br />
|-<br />
| [http://pastebin.com/juJnpnU0 seed_videos_ecology ] || 890 || crackbab1 || style="background: yellow" |<br />
|-<br />
| [http://pastebin.com/dikSPMby seed_videos_meme ] || 996 || yipdw || style="background: orange" | Done (12 GB), bad IDs: -7139586667055487256, 744578668610845478, 9027107881335248661<br />
|-<br />
| [http://pastebin.com/fe2Aa9q1 seed_videos_defcon] || 822 || ndurner || style="background: green" | done<br />
|-<br />
| [http://gv.nja.im/index.php?dir=seed_videos_ml_documentary_dedupe seed_videos_ml_documentary_dedupe] || 1975 || Lightblb, Papyrus, NomDuClavier || style="background: yellow" | 3 completed chunks of 4 (4 claimed)<br/> Lightblb: aa (Complete:38GB With 1 Fail -> Rsync:Done)<br />Papyrus: ab<br />NomDuClavier: ac (complete), ad (complete)<br />
|-<br />
| [http://gv.nja.im/index.php?dir=seed_videos_ml_lecture_dedupe seed_videos_ml_lecture_dedupe] || 1898 || Lightblb, gribozavr, kn100 || style="background: yellow" | 3 completed chunks of 4 (4 claimed)<br/> Lightblb: aa ab (Done: 65G With 2 Failed -> Rsyncing)<br /><br />
gribozavr: ad (rsync done, 28Gb)<br />kn100: ac (in progress)<br />
|-<br />
| [http://gv.nja.im/index.php?dir=seed_videos_ml_atheism_dedupe seed_videos_ml_atheism_dedupe] || 698 || norc, Mqrius || style="background: green" | 2 complete of 2<br/> norc: ab done (16G), Mqrius: aa done (41GB).<br />
|-<br />
| [http://gv.nja.im/index.php?dir=seed_videos_l_interview_dedupe seed_videos_l_interview_dedupe] || 986 || Pentium100, wgfreewill || style="background: green" | aa - Done (136GB) <br/><br />
Pentium100: ab - Done (66.7GB)<br />
|-<br />
| [http://gv.nja.im/index.php?dir=evolution seed_videos_evolution_dedupe] (Long&Medium) || 1742 || Jade Falcon || style="background: purple" | downloading...<br />
|-<br />
| [http://gv.nja.im/index.php?dir=talk seed_videos_talk_dedupe] (Long&Medium) || 1795 || Jade Falcon || style="background: purple" | downloading...<br />
|-<br />
| [http://gv.nja.im/index.php?dir=money seed_videos_money_dedupe] (Long&Medium) || 1824 || leftfield || style="background: yellow" |<br />
|-<br />
| [http://gv.nja.im/index.php?dir=civilization seed_videos_civilization_dedupe] (Long&Medium) || 471 || leftfield || style="background: orange" | done one broken docid -4727094082505590423<br />
|-<br />
| seed_videos_2_a || 25,761 || swebb || style="background: yellow" | 61G, 3718/25761 files done (4/19/2011)<br><br />
89G, 5579/25761 files done (4/20/2011)<br><br />
117GB, 7252/25761 files done (4/21/2011)<br />
|-<br />
| [http://notatypewriter.com/googlegargle seed_videos_2_k] || 19,266 (24,242) || Lightblb, ARc[Clone, crackbab1, Pentium100, Mqrius, arketype, Darkstar || style="background: orange" | 49 chunks completed of 49<br /><br />
Lightblb: aa ab ac ad ae (Done: 69GB -> Rsync: Done)<br /><br />
crackbab1: af,ak,al (Done: 16GB) <br /><br />
Mqrius: Done: ag - ak, am - ao, aq - as: 81 billion bytes.<br /><br />
(Errors: 8140990496183661566, 3602820803563530100, 305824290212962756, 4662407464242191178, 1966892422853997036, 2337004030985954962, 1338452982534754821, 10726218902867294)<br /><br />
arketype: ap (Done: 17GB)<br /><br />
(Errors: 2781869234442161475, 3684594607388096414)<br /><br />
Pentium100: at-az (complete, 42.8GB), ba-bb (complete, 14.1GB)<br /><br />
Darkstar: bc bd be (complete)<br /><br />
ARc[Clone: bf bg bh bi bj bk bl bm bn bo bp bq br bs bt bu bv bw (all done)<br /><br />
|-<br />
| seed_videos_2_l || 22,641 || ndurner, wgfreewill || style="background: grey" | [http://gv.nja.im/index.php?dir=seed_videos_2_l Split] 46 chunks of 500 videos each<br />ndurner: aa done; <br /> wgfreewill - More than a TB, rsync to archive.org.<br />
|-<br />
| seed_videos_2_m || 24,465 || Jade Falcon || style="background: orange" | Jade:Done. 506G, 305 [http://www.fenixnet.net/jadefalcon/seed_videos_2_m_errors.txt error'ed IDs]. Rsyncing.<br />
|-<br />
| seed_videos_2_o || 25,049 || travelinlibrarian || style="background: yellow" | [http://gv.nja.im/index.php?dir=seed_videos_2_o Split] 51 chunks of 500 videos each<br /><br />
travelinlibrarian 376/1-500<br /><br />
perfinion done seed_videos_2_ob[n-y]<br />
perfinion grabbing seed_videos_2_[a-m]<br />
|-<br />
| [http://gv.nja.im/index.php?dir=seed_videos_2_p seed_videos_2_p] || 23,713 || oli, Xentac, db48x, otro, Mqrius, Pentium100, Darkstar, ryan__, nstrom || style="background: yellow" | 46 complete of 48 chunks (all 48 claimed)<br/><br />
oli: aa to ah (complete, 90GB) - RSYNCING<br/><br />
Mqrius: Done: ak - am: 27 GB<br/><br />
Pentium100: an-av (done, 100GB with errors)<br/><br />
Xentac: bt bu bv bp bo bm bq br bg bn bi as at au (done), bg-br, as-av<br/><br />
db48x: bu (1.44GB, uploaded), bv (187MB, uploaded), ba-bf (78GB, uploaded)<br/><br />
otro: bs (2 GB complete with errors -4129568891134205061, -863669053556310192, 1529854584895362082, -1190862519877917483)<br/><br />
nstrom: aw (complete, 15GB, uploaded to a.o)<br/><br />
ryan__: ax(WIP)/ay(WIP)/az(done, 7 missing. verifying/retrying/confirming still) <br/><br />
Darkstar: ai, aj (Complete)<br/><br />
|-<br />
| seed_videos_2_q || 17,727 || DoubleJ || style="background: grey" | Done (165GB) and uploaded to IA<br />
2 bad IDs:<br />
-3522777020956111862<br />
1920882098876352864<br />
|-<br />
| seed_videos_2_t || 25,301 || businux || style="background: yellow" | [http://gv.nja.im/index.php?dir=seed_videos_2_t Split] 51 chunks of 500 videos each 961/25,301 3.79% 33GB<br />
LietKynes going backwards, 50 threads, 310GB already<br />
|-<br />
| [http://elmundo.barbich.net/gargle/ seed_videos_2_u] || 23,528 || barbich, negge || style="background: green" | 48 chunks complete of 48<br/><br />
barbich: finished 0 to 29 (100% done, 370G)<br/><br />
negge: finished 30 to 47 (100% done, ~200G)<br />
|-<br />
| [http://gv.nja.im/index.php?dir=seed_videos_2_w seed_videos_2_w] || 21,732 || nickmoorman || style="background: yellow" | Split] 0 chunks completed of 34 (34 claimed0<br /><br />
nickmoorman: aa ab ac ad ae af ag ah ai aj<br /><br />
zachtib: ak al am an ao<br /><br />
Dr.Sweety: ap aq ar as at au av aw ax ay az ba bb bc bd be bf bg bh (In progress, currently downloading av)<br />
|-<br />
| seed_videos_2_x || 19,733 || ksh || style="background: orange" |100% / 78GB<br />
Need to check for errors!<br />
<br>After this is checked, if there are no errors, change to green and remove this line.<br />
|-<br />
| seed_videos_2_y || 20,965 || negge || style="background: green" | Done (216GB)<br />
|-<br />
| seed_videos_2_z || 18,877 || flare || style="background: yellow" | Currently in progress (38% - 104GiB)<br />
|-<br />
| seed_videos_a || 1000 || Dr.Sweety || style="background: green" | Done (84G). 9 DocIDs with 404.<br />
|-<br />
| seed_videos_a_related || This list contain errors || Dr.Sweety || style="background: orange" | Done, 44G total. ~1097 out of 1284 seem to be DocIDs, rest is text. Half of the DocIDs are broken (see "Broken DocIDs" for some examples, a complete list is here http://piratepad.net/b8VbxXCVPG). What about the errors, will there be an updated list?<br />
|-<br />
| seed_videos_b || 999 || bjwebb || style="background: yellow" | 651/999<br />
|-<br />
| seed_videos_c || 981 || dnova || style="background: grey" | Uploaded to Archive.org (40.2GB)<br />
|-<br />
| seed_videos_d || 999 || NomDuClavier || style="background: orange" | complete<br />
|-<br />
| seed_videos_e || 999 || NomDuClavier || style="background: orange" | complete<br />
|-<br />
| seed_videos_f || 999 || DoubleJ || style="background: grey" | Done (25GB)<br />
Uploaded to IA w/subtitles<br />
|-<br />
| seed_videos_g || 999 || dnova || style="background: gray" | Uploaded to Archive.org (30.9GB) <br />one bad id=7751522177274361392<br />
|-<br />
| seed_videos_h || 999 || ARc[Clone || style="background: orange" | Done<br />
|-<br />
| seed_videos_i || 999 || DeCarabas || style="background: green" | Done (58 GB)<br />
|-<br />
| seed_videos_j || 999 || joethehuman || style="background: green" | Done (36.7 GB)<br />
|-<br />
| seed_videos_k || 999 || aggroskater || style="background: orange" | Done (28.7 GB) one bad ID: -4784504756717962046<br />
|-<br />
| seed_videos_l || 999 || yipdw || style="background: gray" | Uploaded<br />
|-<br />
| seed_videos_m || 999 || TJ__ || style="background: orange" | Done (34.7GB)<br />
|-<br />
| seed_videos_n || 999 || ndurner || style="background: green" | Done (38 GB)<br />
|-<br />
| seed_videos_o || 999 || com_lab, grelbar ([http://pastebin.com/AFc4SvPV list]) || style="background: green" | ~38GB (com_lab) already uploaded, <br> ~24GB(grelbar)<br />
|-<br />
| seed_videos_p || 999 || Pneu || style="background: yellow" | <br />
|-<br />
| seed_videos_q || 996 || NomDuClavier || style="background: green" | Done (~24Gb)<br />
|-<br />
| seed_videos_r || 996 || Pentium || style="background: orange" | Done (26.5GB), two bad IDs (-6997682955012239023, -5475489738249304784)<br />
|-<br />
| seed_videos_s || 999 || Pentium || style="background: orange" | Done (48.9GB), two bad IDs (2103424227166759427, -8954969329395485241)<br />
|-<br />
| seed_videos_t || 999 || joethehuman || style="background: orange" | Done with errors below (56.8 GB)<br />
|-<br />
| seed_videos_u || 999 || perfinion, 0xDEADBEEF, norc || style="background: green" | 0xDEADBEEF 516/1000 24GB. norc 500-1000 done, 24GB. Perfinion done, 44GB.<br />
|-<br />
| seed_videos_v || 999 || masterme1 || style="background: yellow" | 497/999 (~28GB)<br />
|-<br />
| seed_videos_w || 1000 || com_lab || style="background: green" | Done (~5.7GB)<br />
|-<br />
| seed_videos_x || 1000 || Dark-Star || style="background: green" | Done (~33GB)<br />
|-<br />
| seed_videos_y || 1000 || beremat || style="background: green" | Done (~61.01GB)<br />
|-<br />
| seed_videos_z || 1000 || ksh || style="background: green" | Done (27GB)<br />
|-<br />
| [http://pastebin.com/FNSatQam "microelectronics",<br />"circuit+design",<br />"microprocessor",<br />"chiptune",<br />"electrical+engineering",<br />"hardware+hacking",<br />"unboxing",<br />"demoscene",] || 1267 || dnova || style="background: grey" | Uploaded to Archive.org (33.9GB)<br />
|-<br />
| [http://pastebin.com/9cfVyFci "transistor",<br />"tonawanda",<br />"micron",<br />"gallium",<br />"nanometer",<br />"femtosecond",<br />"qubit",<br />"integrated+circuit"] || 343 || dnova || style="background: grey" | Uploaded to Archive.org (7.1GB)<br />
|-<br />
| [http://pastebin.com/ThCuzFwu "singularity"] || 174 || db48x || style="background: green" | completed, 12.57GB (list created at 8am UTC April 18th 2011)<br />
|-<br />
| [http://pastebin.com/jMtaRuA2 "Feynman"] || 28 || db48x || style="background: green" | completed, 2.20GB (list created at 9am UTC April 18th 2011)<br />
|-<br />
| [http://pastebin.com/3HDwcsk5 "police"] || 998 || lutostag || style="background: green" | done, ~33GB (list created at 8am UTC April 18th 2011)<br />
|-<br />
| [http://pastebin.com/Hy1nYdkC "eliezer"] || 150 (1000) || norc || style="background: grey" | uploaded, 6.8G (list created at 8am UTC April 18th 2011)<br />
|-<br />
| [http://pastebin.com/me1BGvg8 "obama"] ||1000 || ryan__ || style="background: yellow" | 302/1000 as of 04-19-2011 00:51 EDT (still WIP) (list created at 8am UTC April 18th 2011)<br />
|-<br />
| [http://pastebin.com/JQfdYaX9 "cia"] || 999 || ndurner || style="background: purple" | 800 (list created at 8am UTC April 18th 2011)<br />
|-<br />
| [http://pastebin.com/yRQiqG4Q "charlie"] || 1000 || ryan__ || style="background: yellow" | 120/1000 as of 04-19-2011 00:51 EDT (still WIP) (list created at 8am UTC April 18th 2011)<br />
|-<br />
| [http://pastebin.com/sHQkmBuH IDs from the metafilter thread] || 28 || db48x || style="background: gray" | completed, 6.17GB (list created at 9am UTC April 18th 2011)<br />
|-<br />
| [http://pastebin.com/yFbtiW4b IDs from the reddit thread] || 106 || ndurner || style="background: green" | done (list created at 9am UTC April 18th 2011)<br />
|-<br />
| "rare"<br />
|rowspan=3|~3100<br />
|rowspan=3|Darkstar <br />
|rowspan=3 style="background: green" | done (~70gb)<br />
|-<br />
| "vintage"<br />
|-<br />
| "commercial"<br />
|-<br />
| [http://pastebin.com/ZkzNmwEW "douglas adams",<br />"richard dawkins",<br />"charles darwin"] || || NomDuClavier || style="background: green" | 513 videos, done ([http://pastebin.com/ZkzNmwEW one de-duped list] for the 3 terms)<br />
|-<br />
| [http://pastebin.com/4cZeF4Hc "australia history"]<br />[http://pastebin.com/L7NLX0pi "indigenous aboriginal australia"] || 1659 || oli || style="background: green" | complete - RSYNCING<br />
|-<br />
| [http://rapidpacket.com/~xtat/seed_videos_linux "linux"] || 1641 || xtat || style="background: green" | Done, 70GB, 8 failures<br />
|-<br />
| [http://pastebin.com/8Xn3ynUu "Bugs Bunny"] || 153 || stack,wgfreewill || style="background: green" | Done, 2.7GB<br />
|-<br />
| [http://pastebin.com/jMtaRuA2 "rodney mullen"] || 176 || com_lab || style="background: grey" | Done, 1.7GB<br />
|-<br />
| [http://pastebin.com/7KWMfkNn "tech talks"] || 562 || tahu || style="background: green" | completed, 562 videos, 47GB, 2011-04-20 22:07:31 UTC<br />
|-<br />
| [http://pastebin.com/iHvuYDLt "rick astley"] || 17 || db48x || style="background: gray" | completed, 272.8MB (grabbed 13:00 UTC April 18th 2011)<br />
|-<br />
| [http://pastebin.com/gf9evK3q "CERN"] || 912 || vled || style="background: green" | Done<br />
|-<br />
| [http://pastebin.com/TGgHTT05 multiple]: "michio kaku",<br />"brian cox",<br />"vernor vinge",<br />"carl sagan",<br />"simon singh" || 176 || NomDuClavier || style="background: green" | done<br />
|-<br />
| [http://pastebin.com/NWGExd7c "intel",<br />"amd"] || 1547 || leftfield || style="background: orange" | done 21.5GB one broken docid -712494279917239419 <br />
|- <br />
| [http://pastebin.com/SVGFBkSX "foia"] || 89 || com_lab || style="background: grey" | Done, 4.1GB<br />
|-<br />
| [http://pastebin.com/jagqsQru "creative commons"] || 1000 (968 d/d) || aikidork || style="background: gray" | Uploaded<br />
|-<br />
| [http://pastebin.com/tPqfuR5X "TED"] || 1000 || vled || style="background: orange" | w/ problems<br />
|-<br />
| [http://pastebin.com/wd011gVQ "programming"] || 1546 || Xentac || style="background: yellow" | In Progress<br />
|-<br />
| [http://pastebin.com/VLXeeeha "military", "army", "navy", "air force", "marine corps"] || 3108 || tj__ & ksh || style="background: orange" | Done (18GB + unknown)<br />
|-<br />
| [http://pastebin.com/7t5ibE75 "fiddle", "banjo", "old time music"] || 921 || RJL20 || style="background: orange" | Done<br />
|-<br />
| [http://pastebin.com/XgeUDbhs "silent+film"] || 1000 || dericed || style="background: yellow" | In Progress<br />
|-<br />
| [http://pastebin.com/FGJHAKN9 "industrial"] || 1584 || Archive242 || style="background: yellow" | In Progress<br />
|-<br />
| [http://pastebin.com/FbAPwZ2b (pretty much) every valid GV link on MetaFilter] || 1675 || RJL20 || style="background: orange" | Done<br />
|-<br />
| [http://gv.nja.im/index.php?dir=seed_videos_hubbestof http://hubpages.com/hub/The_Best_of_GoogleVideo] || 122 || Lightblb || style="background: grey" | Done: 7.1GB - 55 Failed - Rsync Done.<br />
|-<br />
| [http://pastebin.com/x0bv1da0 a few Olympics 1980 videos] || 4 || gribozavr || style="background: grey" | Rsync done<br />
|-<br />
| [http://pastebin.com/2Qiy40UH "kurzweil"] || 61 || NomDuClavier || style="background: green" | Completed<br />
|-<br />
| [http://pastebin.com/iyEif6U1 "human+rights"] || 2943 || witness.org,dericed || style="background: yellow" | In Progress<br />
|-<br />
| [http://pastebin.com/jY1yE6Hc "the+netherlands",<br />"nederland"] || 1650 || NomDuClavier || style="background: yellow" | In Progress<br />
|-<br />
| '''Total''' || '''>324,788''' || '''Archive Team''' || '''''>2.24 TB (Apr. 19, 11:37:13 UTC)'''''<br />
|}<br />
</center><br />
== DocID Errors ==<br />
The following table is a list of all the video document IDs that did not work.<br />
<br />
{| class="wikitable"<br />
! DocID !! Title !! list<br />
|-<br />
| -4313176927520589553 || [http://video.google.com/videoplay?docid=-4313176927520589553 Ferrari 320 km/h SelMcKenzie] || seed_videos_h<br />
|-<br />
| 710915802292429594 || [http://video.google.com/videoplay?docid=710915802292429594# Triple H-Best Pedigree Ever] || seed_videos_h<br />
|-<br />
| 919675995190477263 || 404s || seed_videos_h<br />
|-<br />
| -7433458566080701467 || 404s || seed_videos_2_k<br />
|-<br />
| 7476314005948269525 || [http://video.google.com/videoplay?docid=7476314005948269525# Tan Tay Du Ky 2 tap 1 phan 2] || seed_videos_2_k<br />
|-<br />
| 1310034078921227326 || [http://video.google.com/videoplay?docid=1310034078921227326 Presentatie H. van Garderen] || seed_videos_h<br />
|-<br />
| -8196546459051063200 || [http://video.google.com/videoplay?docid=-8196546459051063200 Ethiopia - Ethiopian Talk Show - Dr. Kinfe M Kassaye] || seed_videos_m<br />
|-<br />
| 6012309833489564165 || [http://video.google.com/videoplay?docid=6012309833489564165 I&#39;m gonna miss you forever] || seed_videos_m<br />
|-<br />
| 1006201176909432045 || [http://video.google.com/videoplay?docid=1006201176909432045 Nick "KNUCKLEHEAD" Thomas Learning to Ride A KX 65] || seed_videos_2_k_br<br />
|-<br />
| 9013618753646293166 || [http://video.google.com/videoplay?docid=9013618753646293166 TooSexii] || seed_videos_m<br />
|-<br />
| 4607644763702261746 || [http://video.google.com/videoplay?docid=4607644763702261746 Most Haunted] || seed_videos_m<br />
|-<br />
| 910327017359455024 || 404s || seed_videos_2_k_br<br />
|-<br />
| -3505183273546479430 || [http://video.google.com/videoplay?docid=-3505183273546479430# Top 10 Dunkers in Slam Dunk Contest History by www.todonba.mx.kz] || seed_videos_2_k_bu<br />
|-<br />
| 515155312540224448 || [http://video.google.com/videoplay?docid=515155312540224448 Prof. Stephen Berk - The Six Day War] -- (Only downloads 106MB & manual seek fails) || seed_videos_m<br />
|-<br />
| 8233620694803027158 || [http://video.google.com/videoplay?docid=8233620694803027158 Tien Kiem Ky Hiep 12a] || seed_videos_2_k_bs<br />
|-<br />
| -7026671761719496982 || [http://video.google.com/videoplay?docid=-7026671761719496982# KV Kortrijk - Virton: kans Vervaeke] || seed_videos_2_k_bo<br />
|-<br />
| 4744936758707683681 || 404s || seed_videos_2_k_bo<br />
|-<br />
| -4138015874145288917 || [http://video.google.com/videoplay?docid=-4138015874145288917# Irvine City Council Regular Meeting] -- content too short (expected 880173643 bytes and served 871) || seed_videos_2_k_bo<br />
|-<br />
| 1751753922865083288 || [http://video.google.com/videoplay?docid=1751753922865083288# Lou Dobbs - Bill Gates Testifies to Senate: Part 2] || seed_videos_h<br />
|-<br />
| -1847242336625060764 || 404s || seed_videos_h<br />
|-<br />
| -840074924615574683 || [http://video.google.com/videoplay?docid=-840074924615574683# H.O.T. TV EPISODE 7] || seed_videos_h<br />
|-<br />
| 5450039563312738134 || || seed_videos_2_o<br />
|-<br />
| 2740779495236816438 || || seed_videos_2_o<br />
|-<br />
| 8240553330007645065 || 404 || "rick astley"<br />
|-<br />
| 2776148046666235174 || 404 || seed_videos_d<br />
|-<br />
| 4641809537228296381 || 404 || seed_videos_<br />
|-<br />
| -4718427583805445551 || 404 || seed_videos_e<br />
|-<br />
| 5588388288256218328 || 404 || seed_videos_d<br />
|-<br />
| -1413491257698089214 || Redirects to http://www.khou.com/news/119535529.html || seed_videos_a_related<br />
|-<br />
| 1895753595163256038 || Redirects to http://tv.sky.com/martina-my-toughest-opponent || seed_videos_a_related<br />
|-<br />
| -4941694769105315227 || Redirects to http://saratoga-north.ynn.com/content/headlines/524274/governor-visit-s-nation-s-capitol/ || seed_videos_a_related<br />
|-<br />
| -7773409926173229653 || Redirects to http://www.zacks.com/commentary/15486/Value+Stock+Picks-August+24,+2010 || seed_videos_a_related<br />
|-<br />
| 7391058183663855490 || Redirects to http://www.ebaumsworld.com/video/watch/81158874/ || seed_videos_a_related<br />
|-<br />
| -4381742157481868130 || Redirects to http://arcade.modemhelp.net/play-3613-Stealing_A_Van.html || seed_videos_a_related<br />
|-<br />
| -1554641026467581780 || Redirects to http://s167.photobucket.com/albums/u158/browneydgurl1212/?action=view&current=meganstealinghashbrown.mp4 || seed_videos_a_related<br />
|-<br />
| 2353616771034791644 || Redirects to http://berkshires.ynn.com/content/headlines/523405/glens-falls-woman-accused-of-stealing-a-cat-from-pet-store/ || seed_videos_a_related<br />
|-<br />
| 9195455606734953941 || Redirects to http://abcnews.go.com/ThisWeek/video/roundtable-tragedy-tucson-12575675 || seed_videos_a_related<br />
|-<br />
| 9150764031039845836 || Redirects to http://www.ebaumsworld.com/video/watch/81298536/ || seed_videos_a_related<br />
|-<br />
| 9111781772616747857 || Redirects to http://abcnews.go.com/Politics/video/stephen-colbert-testifies-house-hearing-illegal-farm-workers-11718759 || seed_videos_a_related<br />
|-<br />
| 9106424136068226425 || Redirects to http://www.gameswelt.de/videos/videos/10349-Warhammer_Online_-_Home_Movie_Ever_Forward.html || seed_videos_a_related<br />
|-<br />
| 9106312808616607793 || Redirects to http://video.google.com/videoplay?docid=9106312808616607793 || seed_videos_a_related<br />
|-<br />
| -423230311474262633 || || seed_videos_2_k_at<br />
|-<br />
| -1989250447613793254 || || seed_videos_2_k_at<br />
|-<br />
| -1717591024529167847 || || seed_videos_2_k_au<br />
|-<br />
| -1893715945421217990 || || seed_videos_2_k_aw<br />
|-<br />
| 98954701061936704|| || seed_videos_2_k_az<br />
|-<br />
| -857514171338089705 || 871B instead of 9.9MB || seed_videos_2_k_az<br />
|-<br />
| 187959010149993716 || || seed_videos_2_k_az<br />
|-<br />
| -3761310108351243571 || || seed_videos_2_k_az<br />
|-<br />
| -5034671686367848138 || [http://video.google.com/videoplay?docid=-5034671686367848138# Umar Kalim breaks it all] content too short || seed_videos_2_k_bh<br />
|-<br />
| 3687153060611498767 || [http://video.google.com/videoplay?docid=3687153060611498767# Picnic Tables at CiCo] content too short || seed_videos_2_k_bj<br />
|-<br />
| 1010610140821179600 || || seed_videos_2_k_bf<br />
|-<br />
| 1272139449455901373 || || seed_videos_2_k_bi<br />
|-<br />
| 2154847967655726343 || || seed_videos_2_k_bj<br />
|-<br />
| 2453599535490760149 || || seed_videos_2_k_bl<br />
|-<br />
| 2525371248363122880 || || seed_videos_2_k_bf<br />
|-<br />
| -3761310108351243571 || || seed_videos_2_k_bh<br />
|-<br />
| 4549148983829940555 || 404s || seed_videos_2_k_bi<br />
|-<br />
| 7051814862620931463 || || seed_videos_2_k_bh<br />
|-<br />
| -7353344548521134361 || || seed_videos_2_k_bl<br />
|-<br />
| -817434969229495880 || || seed_videos_2_k_bh<br />
|-<br />
| 8335036545639007262 || || seed_videos_2_k_bh<br />
|-<br />
| -8653635503491974486 || || seed_videos_2_k_bh<br />
|-<br />
| -970580050717025709 || || seed_videos_2_k_bg<br />
|-<br />
| -3891054104657374974 || || seed_videos_2_k_bb<br />
|-<br />
| -5401734107040161313 || || seed_videos_2_k_bb<br />
|-<br />
| -6540216432023094075 || || seed_videos_2_k_bb<br />
|-<br />
| -1165561225258043258 || [http://video.google.com/videoplay?docid=-1165561225258043258 L'universo elegante parte 1] || seed_videos_l <br />
|-<br />
| 1922748009661857239 || [http://video.google.com/videoplay?docid=1922748009661857239 4/8 - L'histoire secrète du pétrole - Le temps des premiers craquements] || seed_videos_l <br />
|-<br />
| 300163955057959602 || [http://video.google.com/videoplay?docid=300163955057959602 6/8 - L'histoire secrète du pétrole - Le temps des magouilles] || seed_videos_l<br />
|-<br />
| -7110898118644169273 || [http://video.google.com/videoplay?docid=-7110898118644169273 Beppe Grillo e l'inceneritore] || seed_videos_l<br />
|-<br />
| -7942619273555709195 || [http://video.google.com/videoplay?docid=-7942619273555709195 Le monde selon Monsanto - Arte FR] || seed_videos_l<br />
|-<br />
| 8543705644990106023 || [http://video.google.com/videoplay?docid=8543705644990106023 José Bové à Aubagne le 7 Février.] || seed_videos_l<br />
|-<br />
| 2781869234442161475 || 404 || seed_videos_2_k_ap<br />
|-<br />
| 3684594607388096414 || 404 || seed_videos_2_k_ap<br />
|-<br />
| 4857427355245773332 || 404 || seed_videos_2_wap<br />
|-<br />
| 4818927167565306511 || 404 || seed_videos_2_wap<br />
|-<br />
| -7139586667055487256 || [http://video.google.com/videoplay?docid=-7139586667055487256# Cadru 4 : Une mission du roi Even lui même?] || meme<br />
|-<br />
| 744578668610845478 || [http://video.google.com/videoplay?docid=744578668610845478 Massieux délire (saut à poil)] || meme<br />
|- <br />
| 9027107881335248661 || 404 || meme<br />
|- <br />
| 712494279917239419 || Unavailable - Charlie Rose - Red Wine & Mice / Andy Grove & Richard Tedlow || intel amd<br />
|-<br />
| -4770095342392663956 || [http://video.google.com/videoplay?docid=-4770095342392663956# Trailer Park Boys - S03E08 - A Sh*t Leopard Can't Change Its Spots] || seed_videos_t<br />
|-<br />
| http://pastebin.com/LhR0vDFu || "Content Unavailable" or 404s || seed_videos_2_x<br />
|-<br />
| -2183089322473530253 || EOF || army seed list<br />
|-<br />
| 7899609783711363184 || EOF || army seed list<br />
|-<br />
| -8998613917213332529 || EOF || army seed list<br />
|-<br />
| -4784504756717962046 || EOF ; visiting [http://video.google.com/videoplay?docid=-4784504756717962046# 2007 K-FROG Cares Golf Classic - Part 4: Pat Green Concert] shows "video is not currently available" message || seed_videos_k<br />
|-<br />
| 7282734499247419085 || [http://video.google.com/videoplay?docid=7282734499247419085 Papell Studio Samba Serenade Printed Silk Georgette Pants - Item: 129-160] || from listerine<br />
|-<br />
| 1551984263748100534 || [http://video.google.com/videoplay?docid=1551984263748100534 ALLAMA TALIB JAUHARI - NASHTAR PARK KARACHI 2006 (PART-III)] || from listerine<br />
|-<br />
| 2769128814553569958 || [http://video.google.com/videoplay?docid=2769128814553569958 Laguna_Beach__-_Season_3_-_Episode_15_-_16.avi] || from listerine<br />
|-<br />
| 3368393825136501633 || [http://video.google.com/videoplay?docid=3368393825136501633 Magic Kingdom Hearts] || from listerine<br />
|-<br />
| -4534051497958455065 || [http://video.google.com/videoplay?docid=-4534051497958455065 Naruto Shippuuden 10 Fuuin Jutsu - Genryuu Kyuu Fuujin] || from listerine<br />
|-<br />
| -2661405767136566167 || [http://video.google.com/videoplay?docid=-2661405767136566167 marché aux animaux à Douz] || from listerine<br />
|-<br />
| -4129568891134205061 || [http://video.google.com/videoplay?docid=-4129568891134205061# 浙江化工廠釋放毒瓦斯 居民抗議遭鎮壓] || seed_videos_2_p<br />
|-<br />
| -863669053556310192 || [http://video.google.com/videoplay?docid=-863669053556310192# silencio] || seed_videos_2_p<br />
|-<br />
| 1529854584895362082 || [http://video.google.com/videoplay?docid=1529854584895362082# Dédicuce à ma Turtle Que Je Nadloveme !!] || seed_videos_2_p<br />
|-<br />
| -1190862519877917483 || [http://video.google.com/videoplay?docid=-1190862519877917483# Reportaje] || seed_videos_2_p<br />
|-<br />
| 777223614374448946 || || seed_videos_2_pan<br />
|-<br />
| -3753237639401264919 || || seed_videos_2_pan<br />
|-<br />
| 513998298993769213 || || seed_videos_2_pan<br />
|-<br />
| 4197907857130732658 || || seed_videos_2_pan<br />
|-<br />
| -7209518661908939846 || || seed_videos_2_pan<br />
|-<br />
| 1936036414289617481 || || seed_videos_2_pan<br />
|-<br />
| 1231628683306604703 || || seed_videos_2_pan<br />
|-<br />
| 8391426573583714670 || || seed_videos_2_pao<br />
|-<br />
| -5030624673313016595 || || seed_videos_2_pao<br />
|-<br />
| 2797125101537296652 || || seed_videos_2_pao<br />
|-<br />
| 1231628683306604703 || || seed_videos_2_pao<br />
|-<br />
| 765639190728070873 || || seed_videos_2_pap<br />
|-<br />
| 3106095225664799618 || || seed_videos_2_pap<br />
|-<br />
| 3824729866360231334 || || seed_videos_2_pap<br />
|-<br />
| -1011278591250373536 || || seed_videos_2_paq<br />
|-<br />
| 5017038353295770271 || || seed_videos_2_paq<br />
|-<br />
| -2103962498187129713 || || seed_videos_2_par<br />
|- <br />
| -1920063529943044649 || || seed_videos_2_par<br />
|- <br />
| -8842656122683618628 || || seed_videos_2_par<br />
|- <br />
| 3980781378957129624 || || seed_videos_2_par<br />
|- <br />
| 3168333365786153885 || || seed_videos_2_par<br />
|- <br />
| -850263308777060275 || || seed_videos_2_par<br />
|- <br />
| -2739776417348844007 || || seed_videos_2_par<br />
|- <br />
| -3693490165652585623 || || seed_videos_2_par<br />
|- <br />
| -4421953779802914087 || || seed_videos_2_par<br />
|- <br />
| -4985191518265705146 || || seed_videos_2_par<br />
|- <br />
| -5030272711619967323 || || seed_videos_2_par<br />
|- <br />
| -7480760343548282696 || || seed_videos_2_par<br />
|- <br />
| -8507025902579487785 || || seed_videos_2_par<br />
|- <br />
| -8565673568506246688 || || seed_videos_2_par<br />
|- <br />
| 7948280818830462878 || || seed_videos_2_par<br />
|- <br />
| 7111518386861929818 || || seed_videos_2_par<br />
|- <br />
| 5414116161601449115 || || seed_videos_2_par<br />
|- <br />
| 4453387956996456150 || || seed_videos_2_par<br />
|- <br />
| 3484019002795418536 || || seed_videos_2_par<br />
|- <br />
| 2599414351734791684 || || seed_videos_2_par<br />
|- <br />
| 981037964378644131 || || seed_videos_2_par<br />
|- <br />
| 503478249453792411 || || seed_videos_2_par<br />
|- <br />
| -626427952319840934 || || seed_videos_2_pas<br />
|- <br />
| 6692782035853741408 || || seed_videos_2_pas<br />
|- <br />
| -8104722695725517962 || || seed_videos_2_pas<br />
|- <br />
| 6603725717674618753 || || seed_videos_2_pas<br />
|- <br />
| -6885426254291916923 || || seed_videos_2_pas<br />
|- <br />
| 8878306115268123242 || || seed_videos_2_pas<br />
|- <br />
| 2664598798454107069 || || seed_videos_2_pas<br />
|- <br />
| -1130301863313429407 || || seed_videos_2_pas<br />
|- <br />
| 6383722209898652464 || || seed_videos_2_pas<br />
|- <br />
| 1410624060530577390 || || seed_videos_2_pat<br />
|- <br />
| 1100175904848145330 || || seed_videos_2_pat<br />
|- <br />
| 6421364272580349095 || || seed_videos_2_pat<br />
|- <br />
| 3243976296567942326 || || seed_videos_2_pat<br />
|- <br />
| 2856723628413664723 || || seed_videos_2_pau<br />
|- <br />
| -6684370625181545902 || || seed_videos_2_pau<br />
|- <br />
| -9112039128971736721 || || seed_videos_2_pau<br />
|- <br />
| -5134977928545797502 || || seed_videos_2_pau<br />
|-<br />
| 491463814477878191 || || listerine<br />
|-<br />
| 8027332670412780967 || || listerine<br />
|-<br />
| -8620028295602605989 || || listerine<br />
|-<br />
| 6793949560762919914 || unavailable || listerine<br />
|-<br />
| -4337343993095627162 || || listerine<br />
|-<br />
| -4246080235264001426 || youtube-dl errors with "unable to extract title" but video plays in browser || listerine<br />
|}<br />
<br />
=Deduplication For Those Not Using Listerine=<br />
<br />
To avoid downloading videos that have already been downloaded by others:<br />
* check if you have SQLite installed ("which sqlite3")<br />
* download the [http://bazaar.launchpad.net/~ndurner/+junk/gv-dedup/files gv-dedup] scripts<br />
* initialize a fresh database with "./gv-list-create.sh"<br />
* download all seed lists on this page (plus the [http://piratepad.net/TL7KDN8821 cherry picks]) and import them with "./gv-list-import.sh seed_file" (or "find seeds/* -exec ./gv-list-import.sh {} \;")<br />
* invoke "./gv-list-dedup.sh seed_videos_foo > list" to filter already downloaded videos from your custom seed list<br />
* also import your custom seed file with "./gv-list-import.sh list"<br />
A pre-filled database is [http://goo.gl/POIaR available].<br />
<br />
= Tools =<br />
<br />
== Youtube-DL ==<br />
* http://rg3.github.com/youtube-dl/download.html<br />
** python youtube-dl googlevideourl<br />
<br />
== DocID scripts ==<br />
* http://piratepad.net/googlevideoscript<br />
<br />
Scraping by dates uploaded:<br />
* http://www.cs.utexas.edu/~lutostag/goog-vid/datescrape.pl<br />
Check to see which dates have already been scraped at:<br />
* http://piratepad.net/lByckw7Wtn<br />
<br />
== GoogleGargle ==<br />
* http://www.textfiles.com/googlegargle<br />
<br />
== Aria2c (APT) ==<br />
* apt-add-repository ppa:t-tujikawa/ppa<br />
* apt-get update<br />
* apt-get install aria2<br />
** http://aria2.sourceforge.net/<br />
<br />
== Aria2c (RPM) ==<br />
Fedora and CentOS have RPMs available.<br />
* yum install aria2<br />
<br />
== Searcher ==<br />
Bash script to search for terms on Google Video, includes dedupe and ability to restrict search by video length.<br />
* https://github.com/norcnorc/googlegargle/blob/35a995e07508faccd1db79abd31bd702e995de88/searcher.sh<br />
<br />
== predict-download-size ==<br />
Bash script to read a docid list and find out the total size of the listed videos. Requires youtube-dl, curl.<br />
* https://github.com/norcnorc/googlegargle/blob/master/predict-download-size<br />
<br />
== Subtitles ==<br />
Some videos have subtitles which haven't been included in the download script (yet). I've created a fairly basic script which retrieves all available subtitles and stores them into the correct folder. You just need perl and a seed list (saved as "list"). You can also run it in an empty dir if you're afraid that it will mess with the videos you have downloaded so far (probably a good idea as I didn't do extensive tests yet). Once the subtitles have been downloaded, just run a "rsync -avP $subtitle_directory $video_directory" to transfer the subtitles to the corresponding video.<br />
<br />
You may grab the script at http://piratepad.net/K7wZRrxvoU. Feel free to modify it.<br />
<br />
--- For some reason it sometimes saves the file under a different name than what it outputs to the console, tested on Debian 6 -Pentium100 -> This has been corrected, the problem arose whenever there were spaces in the filename.<br />
<br />
--- Google will return a 503 if it feels like it's queried by a bot (http://www.google.com/support/websearch/bin/answer.py?hl=en&answer=86640). I have modified the script to pause for 60 seconds after 100 queries, hope that this will suffice. If not, you can either tweak the $PAUSE_AFTER or the actual pause duration in the script. Also, the script will now download multiple subtitles for one video (it didn't do that before, sorry!). -Dr.Sweety<br />
<br />
= Saving Individual Videos =<br />
The seed files do currently not include all videos, so you might want to save precious videos explicitely. To do that, add IDs (found in the docid URL parameter video) to the "list" file in the same directory as the script, for example:<br />
docid=1545969803753962248<br />
docid=1598207563000425446<br />
docid=-1679753730105404298<br />
and start ./googlegargle<br />
<br />
To request a video, add it to this list: http://piratepad.net/gvspecificrequests<br />
<br />
If you download something from that list, add its docid to http://piratepad.net/TL7KDN8821 so that others won't download those videos for the second time.<br />
<br />
=Custom Keyword Searches=<br />
<br />
==Linux==<br />
<br />
If you want to grab videos by your own custom keyword search term, you can use [https://github.com/norcnorc/googlegargle/blob/master/searcher.sh this script].<br />
<br />
Alternatively, you can use this command:<br />
<pre><nowiki><br />
SEARCH='my+search+term';for i in `seq 0 10 990 `;do curl -A "AT, Bitches" "http://www.google.com/search?q=$SEARCH+site:video.google.com&hl=en&safe=off&tbm=vid&start=$i&sa=N"|grep -o "docid=[0-9-]*"|sort -u|tee -a seed_videos_$SEARCH;done<br />
</nowiki></pre><br />
Change "my+search+term" to your search term, and remember to use a plus sign instead of spaces (and to url encode the text for other special characters).<br />
<br />
==Mac Bash Command==<br />
<br />
Uses jot instead of seq:<br />
<pre><nowiki><br />
SEARCH='my+search+term';for i in `jot - 0 990 10 `;do curl -A "AT, Bitches" "http://www.google.com/search?q=$SEARCH+site:video.google.com&hl=en&safe=off&tbm=vid&start=$i&sa=N"|grep -o "docid=[0-9-]*"|sort -u|tee -a seed_videos_$SEARCH;done<br />
</nowiki></pre><br />
Alternatively, you can get <tt>seq</tt> (and lots of other useful stuff) by installing the macports coreutils package: <tt>sudo port install coreutils</tt>. Commands are prefixed with a 'g', so <tt>seq</tt> is called <tt>gseq</tt>, but you may of course symlink it so you don't have to modify your scripts.<br />
<br />
==Searches Undertaken==<br />
<br />
Since we want to minimize overlap, here are some search terms that are already in progress of being downloaded along with the name of the downloader:<br />
<br />
*Darkstar: "rare", "vintage", "commercial"<br />
*NomDuClavier: "douglas adams", "richard dawkins", "charles darwin", "michio kaku", "brian cox", "vernor vinge", "carl sagan", "simon singh" <br />
*oli: "australia history"<br />
*dnova: "microelectronics"<br />
*Lightblb: "documentary" (medium and long videos), "lecture" (medium and long videos), "atheism" (medium & long), "interview" (long), talk (medium & long), brain (medium & long), civilization (medium & long), evolution (medium & long), future (medium & long), language (medium & long), literature (medium & long), mind (medium & long), money (medium & long), neurolinguistic (medium & long), singularity (medium & long)<br />
*ttuttle: "astronomy"<br />
*crackbab1: "ecology"<br />
*tj__: "army"<br />
*r00s: "dokumentation" (medium, long)<br />
<br />
Also check the specificrequest PiratePad under Cherry Picking on this page.<br />
<br />
= Troubleshooting =<br />
* /usr/bin/aria2c: unrecognized option '--max-connection-per-server=16'<br />
** The Aria version available in many linux distributions is not up to date and will throw errors.<br />
** To fix this remove the option from the goooglegargle script line starting with "ARIAOPTIONS="<br />
<br />
* User 'negge' on IRC reports the following ARIA command line works for Debian Squeeze with ext4 filesystem,<br />
**--max-overall-download-limit=1024M --file-allocation=falloc --max-connection-per-server=4 --min-split-size=1M --log-level=notice --remote-time=true<br />
* or for ext3 on Debian Squeeze,<br />
**--max-overall-download-limit=1024M --file-allocation=prealloc --max-connection-per-server=4 --min-split-size=1M --log-level=notice --remote-time=true<br />
<br />
{{Navigation box}}<br />
<br />
[[Category:Video hosting]]<br />
[[Category:Google]]</div>Megalanya0https://wiki.archiveteam.org/index.php?title=Posterous&diff=27393Posterous2017-01-16T15:50:10Z<p>Megalanya0: MOTHERFUCKER ! ! !</p>
<hr />
<div>{{Infobox project<br />
| title = Posterous<br />
| image = Posterous_home.png<br />
| description = <br />
| URL = http://posterous.com<br />
| project_status = {{closed}}<br />
| source = [https://github.com/ArchiveTeam/posterous-grab posterous-grab]<br />
| archiving_status = {{saved}}<br />
| irc = preposterus<br />
| tracker = [http://tracker.archiveteam.org/posterous/ here]<br />
}}<br />
<br />
'''Posterous''' was a blogging platform started in May 2008. It was acquired by [[Twitter]] on March 12, 2012 and shut down April 30, 2013.<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
==Archives==<br />
We saved it! Discussion around and details of our efforts have been archived to the [[Posterous/War room|Posterous war room]]. The final moments has been [[Posterous/Story| retold as a story]].<br />
* [http://archive.org/details/archiveteam_posterous Preposterous! The Posterous Grab] on archive.org<br />
* [http://archive.org/details/2013-02-22-posterous-hostname-list List of hostnames]<br />
<br />
===I had a Posterous blog. How can I get my files back?===<br />
<br />
There are two ways available:<br />
<br />
* Check if your blog has been ingested in the [http://archive.org/web Wayback Machine].<br />
* Extract the files from the WARC files with some [[The WARC Ecosystem|WARC tools]].<br />
** This method requires power user skills. In essence, scan each CDX index file and then extract it from the appropriate WARC files. Ask us in [[IRC]] for help.<br />
<br />
==Press==<br />
* [http://www.dailydot.com/news/archive-team-preserving-posterous/ Archive Team races to preserve Posterous before it goes dark], ''The Daily Dot'', 2013-03-13<br />
<br />
{{navigation box}}</div>Megalanya0https://wiki.archiveteam.org/index.php?title=App.net&diff=27392App.net2017-01-16T15:50:00Z<p>Megalanya0: MOTHERFUCKER ! ! !</p>
<hr />
<div>== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==</div>Megalanya0https://wiki.archiveteam.org/index.php?title=4chan/4plebs&diff=273914chan/4plebs2017-01-16T15:49:52Z<p>Megalanya0: MOTHERFUCKER ! ! !</p>
<hr />
<div>{{Infobox project<br />
| title = archive.4plebs.org<br />
| image = archive-4plebs.png<br />
| URL = [http://archive.4plebs.org archive.4plebs.org]<br />
| project_status = {{online}}<br />
| archiving_status = {{saved}}<br />
}}<br />
<br />
Status: <span style="color: green;">'''Online'''</span><br />
Saves Images?: <span style="color: green;">'''Yes'''</span><br />
<br />
4plebs is shedding all full-sized images dating before April 2014, about 240GBs worth of data, due to storage limits. We need to retrieve this data and put it on the Internet Archive for safekeeping. <br />
<br />
The Bibliotheca Anonoma has recieved the pruned images from 4plebs via tar piping, and will be uploading to the Internet Archive shortly.<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== Method 1: Web Scraping ==<br />
<br />
Using wget, we just scrape the images off the server. It's not elegant, but it works, and thankfully [http://img.4plebs.org/boards/o/image/to_be_removed_in_order.txt the admin has provided some image lists.] (change board name in URL to view another list) This will take about a month at least, and that's assuming we're scraping in parallel. The following bash script is used:<br />
<br />
<pre><br />
!/bin/bash<br />
board="tg"<br />
wget http://img.4plebs.org/boards/$board/image/to_be_removed_in_order.txt<br />
sed -e 's|^./|http://img.4plebs.org/boards/$board/image/|g' -i to_be_removed_in_order.txt<br />
wget -b --tries=10 -nc -c -i to_be_removed_in_order.txt --user-agent="Bibliotheca Anonoma Website Archiver/1.1 (+http://github.com/bibanon/bibanon/wiki)" -w 1<br />
</pre><br />
<br />
== Web Scraping ETA ==<br />
<br />
Below are rough estimates for scraping time, procedurally calculated based on the amount of images listed. <br />
<br />
These were intentionally overestimated to ensure that my VPS actually had enough space and time, but actually the time estimates are off by a factor of 9, since it only took 15 hours to scrape 5GBs of data (from /s4s/), not 6 days. Maybe it should be 1.1 seconds, rather than 2? We used a delay just to be polite.<br />
<br />
Assumes:<br />
<br />
* 2 second Average Download Time (includes 1 second delay)<br />
* 600KB Average filesize for regular boards<br />
* 3MB Average filesize for high resolution boards<br />
* 8MB Average filesize for /f/lashes<br />
<br />
=== Total ===<br />
<br />
* Total Amount of Images: 372123<br />
* Total Estimated Size: 244789 MB (or) 240 GB<br />
* Total Estimated Timespan:<br />
* Parallel: 1 month (30 days)<br />
* Sequential: 2063 hours (or) 85 days<br />
<br />
=== /adv/ ===<br />
<br />
Status: <span style="color: blue;">'''Scraping - 2015-09-20'''</span><br />
<br />
* Amount of Images: 12973<br />
* Estimated Timespan: 72 hours (or) 3 days<br />
* Estimated Size: 7601 MB (or) 8 GB<br />
** Actual Timespan: 7h 18m 10s<br />
** Actual Size: 3.3G<br />
<br />
=== /hr/ ===<br />
<br />
* Amount of Images: 11082<br />
* Estimated Timespan: 61 hours (or) 2 days<br />
* Estimated Size: 33246 MB (or) 33 GB<br />
<br />
=== /f/ ===<br />
<br />
Nothing to be pruned?<br />
<br />
=== /o/ ===<br />
<br />
* Amount of Images: 37437<br />
* Estimated Timespan: 207 hours (or) 8 days<br />
* Estimated Size: 21935 MB (or) 22 GB<br />
<br />
=== /pol/ ===<br />
<br />
* Amount of Images: 107115<br />
* Estimated Timespan: 595 hours (or) 24 days<br />
* Estimated Size: 62762 MB (or) 62 GB<br />
<br />
=== /s4s/ ===<br />
<br />
Status: <span style="color: blue;">'''Scraping - 2015-09-20'''</span><br />
<br />
* Amount of Images: 29504<br />
* Estimated Timespan: 163 hours (or) 6 days<br />
* Estimated Size: 17287 MB (or) 17 GB<br />
** Actual Timespan: 15h 30m<br />
** Actual Size: 5.7G<br />
<br />
=== /sp/ ===<br />
<br />
Nothing to be pruned?<br />
<br />
=== /tg/ ===<br />
<br />
* Amount of Images: 60556<br />
* Estimated Timespan: 336 hours (or) 14 days<br />
* Estimated Size: 35482 MB (or) 34 GB<br />
<br />
=== /trv/ ===<br />
<br />
Status: <span style="color: blue;">'''Saved! - 2015-09-21'''</span><br />
<br />
* Amount of Images: 1713<br />
* Estimated Timespan: 9 hours<br />
* Estimated Size: 1003 MB (or) 1 GB<br />
** Actual Timespan: 1h 6m 32s<br />
** Actual Size: 1.1G<br />
<br />
=== /tv/ ===<br />
<br />
* Amount of Images: 99399<br />
* Estimated Timespan: 552 hours (or) 23 days<br />
* Estimated Size: 58241 MB (or) 57 GB<br />
<br />
=== /x/ ===<br />
<br />
Status: <span style="color: green;">'''Saved! - 2015-09-20'''</span><br />
<br />
* Amount of Images: 12344<br />
* Estimated Timespan: 68 hours (or) 2 days<br />
* Estimated Size: 7232 MB (or) 7 GB<br />
** Actual Timespan: 6h 55m 29s<br />
** Actual Size: 3.4G<br />
<br />
== Method 2: tar Piping ==<br />
<br />
Web scraping does eat up bandwidth and take quite a long time. A better method is to pipe a tar archive from their host server to our (dedicated) server. Yes you heard that right, the tar backup is ''stored directly on the remote server'', not on the host server. <br />
<br />
That way, the host server doesn't have to store a redundant backup that could be massive. Instead, just spit it at our server directly.<br />
<br />
<pre><br />
tar -c /path/to/dir | ssh remote_server 'tar -xvf - -C /absolute/path/to/remotedir'<br />
</pre><br />
<br />
This would only take about a week or so to transfer 240GBs of data, and reduces the amount of overhead on the web server from requesting 330,000 files: we only send one continuous stream of data.<br />
<br />
* [http://serverfault.com/questions/18125/how-to-copy-a-large-number-of-files-quickly-between-two-servers/18142#18142 Source: StackOverflow - Tar piping]</div>Megalanya0https://wiki.archiveteam.org/index.php?title=Nifty&diff=27390Nifty2017-01-16T15:49:44Z<p>Megalanya0: MOTHERFUCKER ! ! !</p>
<hr />
<div>{{Infobox project<br />
| title = Nifty<br />
| image = Homepage.nifty.com-20160901.png<br />
| URL = [http://homepage.nifty.com/ homepage.nifty.com]<br />
| description = Japanese ISP with web hosting<br />
| project_status = {{Closed}}<br />
| archiving_status = {{saved}}<br />
| source = https://github.com/ArchiveTeam/nifty-discovery<br />
| irc = niftyjanai<br />
}}<br />
<br />
Japanese ISP providing web hosting. Will be closing about 140,000 unclaimed homepages by 2016-11-10 15:00. {{url|http://homepage.nifty.com/information/2016/01/|Termination notice}} (Japanese)<br />
<br />
<pre><br />
http://homepage1.nifty.com/USERNAME/<br />
http://homepage2.nifty.com/USERNAME/<br />
http://homepage3.nifty.com/USERNAME/<br />
</pre><br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==</div>Megalanya0https://wiki.archiveteam.org/index.php?title=Usenet&diff=27389Usenet2017-01-16T15:49:35Z<p>Megalanya0: MOTHERFUCKER ! ! !</p>
<hr />
<div>== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==</div>Megalanya0https://wiki.archiveteam.org/index.php?title=Zoocasa&diff=27388Zoocasa2017-01-16T15:49:26Z<p>Megalanya0: MOTHERFUCKER ! ! !</p>
<hr />
<div>{{Infobox project<br />
| title = Zoocasa<br />
| logo = ZoocasaLogo.png<br />
| image = <br />
| URL = http://www.zoocasa.com<br />
| project_status = {{offline}}<br />
| archiving_status = {{saved}}<br />
| tracker = [http://tracker.archiveteam.org/zoocasa/ zoocasa]<br />
| source = [https://github.com/ArchiveTeam/zoocasa-items zoocasa-items], [https://github.com/ArchiveTeam/zoocasa-grab zoocasa-grab]<br />
| irc = zoohouse<br />
}}<br />
<br />
'''Zoocasa''' was a real estate brokerage owned by Rogers. It was supposed to shut down on June 22, 2015, but shut down on June 25, 2015.<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== Archives ==<br />
https://archive.org/details/archiveteam_zoocasa<br />
<br />
{{Navigation box}}</div>Megalanya0https://wiki.archiveteam.org/index.php?title=Ello&diff=27387Ello2017-01-16T15:49:17Z<p>Megalanya0: MOTHERFUCKER ! ! !</p>
<hr />
<div>{{Infobox project<br />
| title = Ello<br />
| URL = http://ello.co<br />
| image = Ello_chromium_1412290394266.png<br />
| logo = Ello-logo.png<br />
| project_status = {{online}}<br />
| archiving_status = {{notsavedyet}}<br />
| irc = oodbye<br />
}}<br />
<br />
'''Ello''' is a social-networking website which markets itself as an ad-free [[Facebook]] alternative. Ello launched as an invite-only public beta in April 2014 and became popular in late September 2014.<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== Downloading your data ==<br />
As of October 2014, there is no export function, although it ''might'' be available in the future.<ref>https://ello.co/scottbeale/post/Z8InvSKY34keARET3oT2VQ: "@scottbeale @jayzes This isn't really something that we want to promote as any sort of API, but we'll talk about possibilities here. We'll get around to export and API functionality soon!"</ref> <br />
<br />
On October 23, 2014, one of Ello's developers tweeted that a backup feature would likely be paid feature,<ref>https://twitter.com/cacheflowe/status/525328915653722113</ref> saying "Charging a buck or two for an export feature seems far from hostile."<ref>https://twitter.com/cacheflowe/status/525340660476297216</ref><br />
<br />
<s>A raw copy of a user's posts can be retrieved by appending ".json" to their profile URL (e.g. https://ello.co/textfiles.json). User-uploaded images can be manually searched for in the json file.</s> As of February 2015, this seems to be disabled.<br />
<br />
== References ==<br />
<references /><br />
<br />
== External Links ==<br />
* [https://ello.co/waxpancake/post/oy73kFfDdhOPh8Jv9z9pFA Andy Baio's post on Ello] and [https://ello.co/waxpancake/post/Jp4o1TBtLrYytHpcEni0Kg follow-up post]<br />
* [https://twitter.com/archiveteam/status/515115988778770432 @archiveteam: (Whistling, circling January 2016 on calendar and writing "ello project")]<br />
* [https://twitter.com/textfiles/status/516659144741634049 @textfiles: Where's your export function, Ello?]<br />
<br />
{{Navigation box}}</div>Megalanya0https://wiki.archiveteam.org/index.php?title=Wget&diff=27386Wget2017-01-16T15:49:07Z<p>Megalanya0: MOTHERFUCKER ! ! !</p>
<hr />
<div>[http://www.gnu.org/software/wget/ GNU Wget] is a free utility for non-interactive download of files from the Web. Using Wget, it is possible to grab a large chunk of data, or mirror an entire website, including its (public) folder structure, using a single command. In the tool belt of the renegade archivist, Wget tends to get an awful lot of use. (Note: Some people prefer to use [http://curl.haxx.se/ cURL]. If it can back up data, it's useful.)<br />
<br />
This guide will not attempt to explain all possible uses of Wget; rather, this is intended to be a concise introduction to Wget, specifically geared towards using it to archive data such as podcasts, PDF documents, or entire websites. Dealing with issues such as user agent checks and robots.txt restrictions will be covered as well.<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== Mirroring a website ==<br />
<br />
When you run something like this:<br />
<pre><br />
wget http://icanhascheezburger.com/<br />
</pre><br />
...Wget will just grab the first page it hits, usually something like index.html. If you give it the -m flag:<br />
<pre><br />
wget -m http://icanhascheezburger.com/<br />
</pre><br />
...then Wget will happily slurp down anything within reach of its greedy claws, putting files in a complete directory structure. Go make a sandwich or something.<br />
<br />
You'll probably want to pair -m with -c (which tells Wget to continue partially-complete downloads) and -b (which tells wget to fork to the background, logging to wget-log).<br />
<br />
If you want to grab everything in a specific directory - say, the SICP directory on the mitpress web site - use the -np flag:<br />
<pre><br />
wget -mbc -np http://mitpress.mit.edu/sicp<br />
</pre><br />
<br />
This will tell Wget to not go up the directory tree, only downwards.<br />
<br />
== User-agents and robots.txt ==<br />
<br />
By default, Wget strictly follows a website's robots.txt directives. In certain situations this will lead to Wget not grabbing anything at all, if for example the robots.txt doesn't allow Wget to access the site.<br />
<br />
To avoid this: first, you should try using the <code>--user-agent</code> option:<br />
<pre><br />
wget -mbc --user-agent="" http://website.com/<br />
</pre><br />
This instructs Wget to not send any user agent string at all. Another option for this is:<br />
<pre><br />
wget -mbc -e robots=off http://website.com/<br />
</pre><br />
...which tells Wget to ignore robots.txt directives altogether.<br />
<br />
You can append <code>--wait 1</code> to add a delay of one second between requests, to lighten the server load and avoid being blocked, which might happen in certain cases if you make too many requests within too short a time.<br />
<br />
== Compression ==<br />
<br />
Wget doesn't use compression by default! This can make a big difference when you're downloading easily compressible data, like human-language HTML text, but doesn't help at all when downloading material that is already compressed, like JPEG or PNG files. To enable compression, use:<br />
<pre><br />
wget --header="accept-encoding: gzip"<br />
</pre><br />
This will produce a file (if the remote server supports gzip compression) that uses the .html extension, but is actually gzip-encoded, which can be confusing.<br />
<br />
Any vaguely modern server can sustain thousands of simultaneous text downloads, with video or large images being the big ticket items. But sites using outdated hardware, or run by habitual whiners, will complain when a site scraping uses 200 megabytes of transfer when it could have used 100.<br />
<br />
== Creating WARC with wget ==<br />
<br />
If you wish to create a WARC file (which includes an entire mirror of a site), you will want something like this:<br />
<br />
export USER_AGENT="Mozilla/5.0 (Windows; U; Windows NT 6.1; en-US) AppleWebKit/533.20.25 (KHTML, like Gecko) Version/5.0.4 Safari/533.20.27"<br />
export SAVE_HOST="example.com"<br />
export WARC_NAME="example.com-panicgrab-20130611"<br />
<br />
wget \<br />
-e robots=off --mirror --page-requisites \<br />
--waitretry 5 --timeout 60 --tries 5 --wait 1 \<br />
--warc-header "operator: Archive Team" --warc-cdx --warc-file="$WARC_NAME" \<br />
-U "$USER_AGENT" "$SAVE_HOST"<br />
<br />
You can find out more about [[Wget_with_WARC_output|Wget with WARC output]].<br />
<br />
=== You can even create a function ===<br />
<br />
<pre><br />
function quick-warc {<br />
if [ -f $1.warc.gz ]<br />
then<br />
echo "$1.warc.gz already exists"<br />
else<br />
wget --warc-file=$1 --warc-cdx --mirror --page-requisites --no-check-certificate --restrict-file-names=windows \<br />
-e robots=off --waitretry 5 --timeout 60 --tries 5 --wait 1 \<br />
-U "Mozilla/5.0 (Windows; U; Windows NT 6.1; en-US) AppleWebKit/533.20.25 (KHTML, like Gecko) Version/5.0.4 Safari/533.20.27" \<br />
"http://$1/"<br />
fi<br />
}<br />
</pre><br />
<br />
<br />
=== Forum Grab ===<br />
<br />
<pre>src/wget --save-cookies team17-cookies.txt --post-data 'vb_login_username=USERNAMEGOESHERE&vb_login_password=PASSWORDGOESHERE&securitytoken=guest&cookieuser=1&do=login' http://forum.team17.com/login.php?do=login<br />
src/wget --load-cookies team17-cookies.txt -e robots=off --wait 0.25 "http://forum.team17.com/" --mirror --warc-file="at-team17-forum"<br />
</pre><br />
<br />
=== Wordpress Grab ===<br />
<br />
<pre>wget --no-parent --no-clobber --html-extension --recursive --convert-links --page-requisites --user=<username> --password=<password> <path></pre><br />
<br />
=== Lua Scripting ===<br />
<br />
If you need fine grain behavior Wget while it downloads, use a version of [[Wget_with_Lua_hooks|Wget with Lua hooks]].<br />
<br />
== Tricks and Traps ==<br />
<br />
* A standard methodology to prevent scraping of websites is to block access via user agent string. Wget is a good web citizen and identifies itself. Renegade archivists are not good web citizens in this sense. The '''--user-agent''' option will allow you to act like something else.<br />
* Some websites are actually aggregates of multiple machines and subdomains, working together. (For example, a site called ''dyingwebsite.com'' will have additional machines like ''download.dyingwebsite.com'' or ''mp3.dyingwebsite.com'') To account for this, add the following options: '''-H -Ddomain.com'''<br />
* If you do not want Wget to download the original files while making a WARC, use [[Wget with Lua hooks]] and <code>--output-document</code> and <code>--truncate-out</code>. Use of these options treats the output document as a temporary file. For the purposes of making a WARC file, these options should be used together to prevent growing files and poor performance.<br />
* [http://www.win.tue.nl/~aeb/linux/misc/wget.html Wget mistakes certain UTF-8 characters] in the original filenames with control characters and happily escapes them, turning the filenames into garbage. If your system supports UTF-8 filenames (probably), you can turn the escaping off by using the <code>--restrict-file-names=nocontrol</code> option. Fortunately, the contents of the .warc files should be unaffected by the escaping.<br />
** Accidentally bitten by this "feature" already? Try [http://www.win.tue.nl/~aeb/linux/misc/wgetfix.c this C program] that recursively unescapes the filenames.<br />
<br />
== Parallel downloading ==<br />
http://keramida.wordpress.com/2010/01/19/parallel-downloads-with-python-and-gnu-wget/<br />
<br />
== Essays and Reading on the Use of WGET ==<br />
<br />
* [http://lifehacker.com/software/top/geek-to-live--mastering-wget-161202.php Mastering WGET] by Gina Trapani<br />
* [http://psung.blogspot.com/2008/06/using-wget-or-curl-to-download-web.html Using Wget or curl to download web sites for archival] by Phil Sung<br />
* [http://linux.about.com/od/commands/l/blcmdl1_wget.htm about.com Wget] list of commands<br />
* [http://www.delorie.com/gnu/docs/wget/wget.html#SEC_Top GNU Wget manual]<br />
<br />
[[Category:Tools]]<br />
<br />
{{Navigation box}}</div>Megalanya0https://wiki.archiveteam.org/index.php?title=Jux&diff=27385Jux2017-01-16T15:48:59Z<p>Megalanya0: MOTHERFUCKER ! ! !</p>
<hr />
<div>{{Infobox project<br />
| title = Jux<br />
| logo = Jux logo.png<br />
| image = Jux homepage.png<br />
| description = <br />
| URL = {{url|1=http://www.jux.com/}}<br />
| project_status = {{offline}}<br />
| archiving_status = {{partiallysaved}} (see [[Jux#Archives|Archives]])<br />
| source = [https://github.com/ArchiveTeam/jux-grab jux-grab], [https://github.com/ArchiveTeam/jux-items jux-items]<br />
| tracker = [http://tracker.archiveteam.org/jux/ jux]<br />
| irc = juxsux<br />
}}<br />
<br />
'''Jux''' is a creative blogging platform. It was announced that Jux would shut down on August 31, 2013, but this was later changed apparently due to financial support from one of their members. '''UPDATE:''' On November 5, 2014, Jux announced that they would be closing permanently at the end of the month on November 30, 2014.<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== Site structure ==<br />
<br />
A complete list of blogs hasn't been made yet, searching "site:jux.com" and "site:*.jux.com" on [[Google]] and Bing might help. More blogs could also be found by scraping https://jux.com/gallery.<br />
<br />
* All the images seem to be stored on Amazon CloudFront, so they might still be up after the site's shutdown.<br />
<br />
TODO: A script for easily backing up all the blogs would help with archiving Jux before it shuts down.<br />
*[http://jux.com/docs/ This page] and [http://helpers.jux.com/customer/portal/articles/246318-do-you-support-json-or-have-an-api- this page] both have details on Jux's API<br />
<br />
As of Nov. 20, the main API and site are 503ing. There is an [http://betaapi.jux.com alternative API], but this appears to be easily overwhelmed. However, the [http://user-zip-files.s3.amazonaws.com/ S3 bucket] for the zips of user files is listable, so we can grab complete users from there.<br />
<br />
* <s>Someone should do a quick grab of their [https://www.facebook.com/juxcom Facebook] and [https://twitter.com/JuxDotCom Twitter] accounts.</s> This has already been done with [[ArchiveBot]].<br />
<br />
== Download Your Data ==<br />
<br />
:"We realize that some of you have media stored on the Jux servers that you would like to preserve, so we have a link for you that will enable you to retrieve and download all your stored media files with one simple step. Sign in with your Jux credentials to export your media files."<ref>http://farewell.jux.com</ref><br />
<br />
According to Jux,<ref>http://helpers.jux.com/customer/portal/articles/1159505-how-can-i-backup-my-work-on-jux-</ref> going to ''"https://USERNAME.jux.com/!/mystuff"'' (when logged in) will allow you to backup your data. You can also backup some data in the JSON format without logging in as described [http://helpers.jux.com/customer/portal/articles/246318-do-you-support-json-or-have-an-api- here].<br />
<br />
== Archives ==<br />
<br />
Partial archive grabbed by ''computerfreak'' - approximately 865,000 posts fetched. Available at [https://archive.org/details/jux_posts_to_nov_24 https://archive.org/details/jux_posts_to_nov_24]<br />
<br />
Jux closed down their public CDN and their main website became unresponsive well before the shut down date, so we were not able to get a full archive.<br />
<br />
== References ==<br />
<references/><br />
<br />
{{Navigation box}}</div>Megalanya0https://wiki.archiveteam.org/index.php?title=500px&diff=27384500px2017-01-16T15:48:49Z<p>Megalanya0: MOTHERFUCKER ! ! !</p>
<hr />
<div>== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==</div>Megalanya0https://wiki.archiveteam.org/index.php?title=Storage_Media&diff=27383Storage Media2017-01-16T15:48:41Z<p>Megalanya0: MOTHERFUCKER ! ! !</p>
<hr />
<div>The term '''storage media''' refers to any number of objects - '''CD''', '''DVD''', '''USB flash drive''' - basically any device that can have any form of data stored on it.<br />
<br />
When deciding where to keep your data, remember that ''everything rots, and everything breaks down''. It's the way the universe works, and there's not much we can do to stop it. Therefore, you have to think about your data's lifetime in terms of months, years, or decades.<br />
<br />
Very little data most people have truly needs decades of preservation, but often, it doesn't hurt to have it around as long as possible. A minor amount of effort will help mitigate that. Specifically, expect to renew/refresh your data storage every 3 years or so - anything you don't do this with will progressively be subject to bit rot, moisture and heat damage, and being shoved into progressively unpleasant locations at home and office before someone decides it's trash.<br />
<br />
More information about bit rot can be found on the [http://en.wikipedia.org/wiki/Bit_rot Wikipedia article].<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== Hard Disk Drives (HDD) ==<br />
<br />
There are many forms of hard drives, including internal hard drives for laptops (2.5") and desktops (3.5"), and external hard drives that can run free of a computer. While external hard drives fall under the replace-every-few-years rule, data can be bought cheaply in comparison to other formats ($80 could probably buy two terabytes of storage), and so are the recommended format used for backups.<br />
<br />
=== External Hard Drives ===<br />
<br />
These are regular internal drives fitted into an enclosure, usually with a USB cable for connectivity. Functionally, they are more or less like using a thumb drive. Externals are quite cheap these days, and can be bought for as low as $80 and $130 for a 2TB and 5TB respectively. For a bit more, you can buy drives with smaller form factors and USB power. They're a very affordable way to keep your data backed up.<br />
<br />
=== Network Attached Storage (NAS) ===<br />
<br />
Unlike external hard drives which are dependent on a host PC, a NAS is more or less its own computer. They contain slots for one or more HDD (or come with them built-in), an ethernet port and such basic features as print server, file server, BitTorrent client, etc. This allows them to function independently of any PC, allowing any computer on the network access. A good NAS will set you back $150 or more depending on available drive bays, bundled drives and feature set.<br />
<br />
=== Hard Drive Docks ===<br />
<br />
HDD Docks are essentially enclosures for multiple HDDs. Like an external hard drive, they usually connect to a single computer via USB. Lacking the fancy features of a NAS, they can be significantly cheaper, and have the advantage of requiring only one power socket and USB cable for multiple drives, as opposed to the requirements of a legion of individual externals.<br />
<br />
{{Navigation pager<br />
| previous = Formats<br />
| next = Recommended Reading<br />
}}<br />
{{Navigation box}}<br />
<br />
[[Category:tools]]</div>Megalanya0https://wiki.archiveteam.org/index.php?title=Formats&diff=27382Formats2017-01-16T15:48:32Z<p>Megalanya0: MOTHERFUCKER ! ! !</p>
<hr />
<div>{{notice|1=See also [http://fileformats.archiveteam.org/ Let's Solve the File Format Problem] wiki that provides an extensive catalogue of file formats.}}<br />
<br />
A very good rule of thumb with data formats is to pick those that are ''no more complex than the data being represented'', that are ''recoverable with simple tools'' and ''widely implemented''.<br />
<br />
In general, if you have written a text document and it's not viewable or editable in a low-level text editor (Notepad, Emacs, and so on), you should probably take the time to convert it into a plain-text format - keep the rich format also.<br />
<br />
If you are backing up data in a format that's not widely understood, be sure to also keep backups of the software you use to open it and any registration keys. A file made with version 2.x of a piece of software may not open with the all new, singing and dancing version 5.x!<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== Images ==<br />
* PNG<br />
* Lossless TIFF<br />
* SVG<br />
<br />
== Audio ==<br />
Lossless:<br />
* FLAC<br />
<br />
Lossy:<br />
* OGG<br />
<br />
== Video ==<br />
* Matroska<br />
* OGV<br />
* AVI<br />
<br />
== Compression ==<br />
* [[7z]]<br />
* [[TAR]]<br />
* ZIP<br />
<br />
== Website crawls ==<br />
[[WARC]] is required for Wayback Machine integration and is highly recommended. It retains important metadata (such as request/response headers) that would otherwise be lost.<br />
<br />
== External links ==<br />
* http://en.wikipedia.org/wiki/Category:Open_formats<br />
<br />
{{Navigation pager<br />
| previous = Software<br />
| next = Storage Media<br />
}}<br />
{{Navigation box}}</div>Megalanya0https://wiki.archiveteam.org/index.php?title=Knol&diff=27381Knol2017-01-16T15:48:22Z<p>Megalanya0: MOTHERFUCKER ! ! !</p>
<hr />
<div>{{Infobox project<br />
| title = Knol<br />
| logo = Knol-128.png<br />
| image = Knol es 1304521664151.png<br />
| description = <br />
| URL = {{url|1=http://knol.google.com/k}}<br />
| project_status = {{offline}} since 2012-05-01<br />
| archiving_status = {{saved}}<br />
| irc= klol<br />
}}<br />
<br />
'''Knol''' was a portal for user-written article in many topics. It was described as a rival to [[Wikipedia]]. ({{url|http://news.cnet.com/8301-1023_3-9997426-93.html|Google's Wikipedia rival, Knol, goes public}})<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== Announcement ==<br />
Closure announced on the {{url|http://googleblog.blogspot.com/2011/11/more-spring-cleaning-out-of-season.html|official Google Blog}}:<br />
<br />
<blockquote><br />
Knol will be moving to Annotum on May 1, 2012<br/><br />
Knol will be discontinued as a service, but we've worked with Solvitor and Crowd Favorite to create Annotum, an open-source platform based upon WordPress that allows you to continue authoring and publishing scholarly articles. You can migrate your knols to WordPress and continue your work with Annotum. After May 1, you will no longer be able to create, view, enter or edit knols, but you will be able to export your knols to WordPress.com and download them to file through October 1st, 2012.</blockquote><br />
<br />
== Some random Knols ==<br />
* http://knol.google.com/k/best-knol-of-the-month#Hall_of_the_Best_Knols_2009<br />
<br />
== Lists of knols ==<br />
Using categories and language parameters:<br />
* http://knol.google.com/k/knol/Search?q=incategory%3Asociety&hl=en<br />
* http://knol.google.com/k/knol/Search?q=incategory%3Asociedad&hl=es<br />
* http://knol.google.com/k/knol/Search?&start=0&num=50&q=incategory%3Asociedad&locale=es (max results per page is 50)<br />
<br />
== Stuff to know ==<br />
WebCite doesn't archive images in knol articles (if uploaded to knol site). Images hosted in external sites and hotlinked in knol articles are archived correctly.<br />
<br />
The Wayback Machine has a good portion of Knol [http://web.archive.org/web/*/http://knol.google.com/ archived].<br />
<br />
== Metadata ==<br />
* 700,000+ knols metadata http://db.tt/GNrEh61y<br />
<br />
== See also ==<br />
* [[Knol/Twitter account]] grab<br />
<br />
== External links ==<br />
* [http://knol.google.com/k Google Knol]<br />
<br />
{{Navigation box}}<br />
[[Category:Google]]</div>Megalanya0https://wiki.archiveteam.org/index.php?title=Zapd&diff=27380Zapd2017-01-16T15:48:13Z<p>Megalanya0: MOTHERFUCKER ! ! !</p>
<hr />
<div>{{Infobox project<br />
| title = Zapd<br />
| URL = http://zapd.com/<br />
| logo = Zapd_logo.png<br />
| image = Zapd_homepage_screenshot.png<br />
| project_status = {{offline}}<br />
| archiving_status = {{saved}}<br />
| source = https://github.com/ArchiveTeam/zapd-grab<br />
| tracker = http://tracker.archiveteam.org/zapd/<br />
| irc = crapd<br />
}}<br />
<br />
“'''Zapd''' is like Tumblr, in that it makes making pretty websites super easy, but Zapd does all its web building magic from your iPhone.”<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== Shutdown ==<br />
<br />
[[File:Zapd_sunset_homepage_screenshot.png|right|frameless]]<br />
<br />
=== The News ===<br />
<br />
<blockquote><br />
<p>Fast-growing RealSelf gobbles up Zapd, names Kelly Smith chief experience officer</p><br />
<br />
<p>''September 11, 2013 at 8:26 am by John Cook''</p><br />
<br />
<p>You could call this an “acquihire.” But, in this case, it’s really just about grabbing the talents of one person.</p><br />
<br />
<p>RealSelf is buying Pressplane, the parent company of Zapd, picking up the skills of experienced entrepreneur and designer Kelly Smith in the process. Zapd will be shut down at the end of the month, though some of the technology will be carried over to RealSelf, which is building what it dubs the world’s largest community of cosmetic surgery, dermatology and dentistry.<ref>http://www.geekwire.com/2013/fastgrowing-profitabe-realself-gobbles-zapd-names-kelly-smith-chief-experience-officer/</ref></p><br />
</blockquote><br />
<br />
=== The Email ===<br />
''September 28, 2013''<br />
<blockquote><br />
<p>Zapd has been acquired!</p><br />
<p>The Zapd service will be discontinued on October 7, 2013</p><br />
<br />
<p>Today I wanted to share that Zapd has been acquired by RealSelf. RealSelf is the leading online resource for elective cosmetic medical procedures. As the new Chief Experience Designer, I'll be leveraging everything we learned at Zapd to help build a better mobile engagement experience. The Zapd website and mobile apps will stay up until October 7, 2013 and then will be shutting down.<ref>http://pastie.org/8362982</ref></p><br />
</blockquote><br />
<br />
<br />
== Site structure ==<br />
<br />
* All content is served via javascript, with javasscript disabled you just get an empty template.<br />
* They have a url shortener zapd.co<br />
* zapd.co url scheme is #.zapd.co, #[a-z].zapd.co, #[a-z][a-z].zapd.co, or [a-z]#.zapd.co . # represents a number 0-9.<br />
* Had 350k app downloads in 2011. At worst they have 500,000 users.<br />
* No API<br />
* All images are hosted on Cloudfront<br />
* Comments have no separate page or url for items. They are served dynamically<br />
* You cannot view the like information without logging into Facebook<br />
* It only shows the newest 5 comments for any item. There appears to be no way to see older comments or show all comments.<br />
* Working example url: http://anna-heimbichner.zapd.com/cake-pops<br />
* Each "story" page has all the necessary data as a json blob instead of a script section.<br />
* The json part has all the urls to images under "full_image_url" and users can be found via "Contributor" -> "url"<br />
* All! http://zapd.com/all<br />
<br />
== Crawling Process ==<br />
<br />
* We are still trying to discover urls.<br />
** We got a bunch off of /all, but more sources (e.g. Google, Bing) are welcome.<br />
* We are working on code to scrape the content.<br />
* Commoncrawl had no urls for zapd.<br />
* zapd.co shortcodes: [https://dl.dropboxusercontent.com/u/672132/archiveteam/zapd_shortcode.txt.xz zapd_shortcode.txt.xz]<br />
<br />
== How can I help? ==<br />
<br />
At time of writing, the project is not yet active in the [[Warrior]] yet (but it should be soon).<br />
<br />
If you are comfortable running the scripts manually, check out the project source GitHub repository.<br />
<br />
Join us in [[IRC]] to be kept informed of any news.<br />
<br />
== Archives ==<br />
<br />
Archives are located on the [https://archive.org/details/archiveteam_zapd archiveteam_zapd] collection.<br />
<br />
Note: The Wayback Machine will not be able to playback the artifacts properly due to JavaScript. You will need to pick apart the web pages for cloudfront.net URLs and manually enter them. Again, join us in [[IRC]] in #archiveteam for assistance.<br />
<br />
== References ==<br />
<br />
<references/><br />
<br />
{{Navigation box}}</div>Megalanya0https://wiki.archiveteam.org/index.php?title=Odysee&diff=27379Odysee2017-01-16T15:48:00Z<p>Megalanya0: MOTHERFUCKER ! ! !</p>
<hr />
<div>{{Infobox project<br />
| title = Odysee<br />
| logo = Odysee-logo.png<br />
| image = Odysee-homepage.png<br />
| URL = https://www.odysee.com<br />
| project_status = {{offline}}<br />
| archiving_status = {{lost}}<br />
| irc = spaceodysee<br />
}}<br />
<br />
Odysee is a service for syncing and sharing photos. Acquired by Google, shutting down February 23, 2015.<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==</div>Megalanya0https://wiki.archiveteam.org/index.php?title=Orkut&diff=27378Orkut2017-01-16T15:47:50Z<p>Megalanya0: MOTHERFUCKER ! ! !</p>
<hr />
<div>{{Infobox project<br />
| title = Orkut<br />
| logo = Orkut_logo.png<br />
| image = Screen_shot_2013-11-07_at_3.28.58_pm.png<br />
| description = Login page<br />
| URL = https://orkut.google.com/en.html<br />
| project_status = {{closing}}<br />
| archiving_status = {{inprogress}}<br />
| tracker = [http://tracker.archiveteam.org/orkut orkut]<br />
| source = [https://github.com/ArchiveTeam/orkut-grab orkut-grab]<br />
| irc = throatkut<br />
}}<br />
<br />
'''Orkut''' was a social networking website owned and operated by Google.<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== Site structure ==<br />
=== Public ===<br />
* Some communities are public, and can be viewed without logging in using Googlebot as user-agent.<br />
<br />
Community main page:<br />
http://www.orkut.com.br/Community?cmm={cmmid}<br />
<br />
Community forum topics:<br />
http://www.orkut.com.br/CommTopics?cmm={cmmid}<br />
<br />
Community topic posts:<br />
http://www.orkut.com.br/CommMsgs?cmm={cmmid}&tid={topicid}<br />
<br />
=== Logged-in only ===<br />
* You can view the other pages only when logged in.<br />
User profile:<br />
http://www.orkut.com.br/Main#Profile?uid={userid}<br />
<br />
User full profile:<br />
http://www.orkut.com.br/Main#FullProfile?uid={usrid}<br />
<br />
== How can I help? ==<br />
<br />
<br />
=== Running a Warrior ===<br />
<br />
You can start up a [[Warrior]] and there select ''Orkut''. (If you don't really care what you are archiving, select ''ArchiveTeam's Choice'' instead, as at some points ArchiveTeam may priorize another project.)<br />
<br />
=== Running the script manually ===<br />
<br />
If you use Linux and you're a bit familiar with it, you can try running the script directly.<br />
<br />
The instructions can be found at [https://github.com/ArchiveTeam/orkut-grab github.com/ArchiveTeam/orkut-grab].<br />
<br />
{| class="mw-collapsible mw-collapsed" style="text-align:left;"<br />
! Some additional information<br />
|-<br />
| Don't forget to replace YOURNICKHERE with your nickname.<br />
<br />
The number after <code>--concurrent</code> determines how many threads run at the same time. You can increase this number if your resources (RAM, CPU, bandwidth) are sufficient. However, if you constantly see messages about rate limiting, there is no need to increase the concurrency.<br />
<br />
If you want to stop the script, please do it gracefully if possible. To do so, create an empty file named '''STOP''' in the folder of the script (terminal command: <code>touch STOP</code>). The script finishes the current item(s) and stops only after that. (If you kill the script immediately, the items get broken, and they will need to be reassigned to another user.) – Before starting the script again, don't forget to remove the STOP file.<br />
<br />
If you see "Project code is out of date", kill the script, go to its folder (<code>cd orkut-grab</code>) and issue <code><nowiki>git pull https://github.com/ArchiveTeam/</nowiki>orkut-grab</code>. After the updating has finished, re-launch the script.<br />
|}<br />
<br />
=== Donating to the Internet Archive ===<br />
<br />
Content downloaded by the ArchiveTeam will be uploaded to the [[Internet Archive]], where it will be stored and be available – hopefully – forever. However, storing it costs thousands of dollars in the long run. So, if you can afford, please consider donating to the Internet Archive, so that this piece of history can be kept for us all. http://archive.org/donate<br />
<br />
=== Do you like our cause? ===<br />
<br />
If you want to help in other projects, want to learn more about ArchiveTeam, or even help in development in general, navigate to the [[Main Page]] of this wiki, from there you can reach a lot of information. The Team consists of volunteers working on the projects in their free time, so helping hands (and resources) are always welcome.<br />
<br />
== External Links ==<br />
<br />
* {{w|Orkut}}<br />
* [https://support.google.com/orkut/answer/6033100?p=orkut&hl=en&rd=1 "Time to say goodbye to Orkut"]<br />
<br />
== References ==<br />
<references/><br />
<br />
{{Navigation box}}<br />
[[Category:Social networks]]</div>Megalanya0https://wiki.archiveteam.org/index.php?title=Wikkii&diff=27377Wikkii2017-01-16T15:47:42Z<p>Megalanya0: MOTHERFUCKER ! ! !</p>
<hr />
<div>{{Infobox project<br />
| title = Wikkii<br />
| logo = Logo-wikkii.jpg<br />
| image = <br />
| description = <br />
| URL = http://wikkii.com<br />
| project_status = {{offline}}<br />
| archiving_status = {{saved}} at https://archive.org/search.php?query=wikkii<br />
| irc = wikiteam<br />
}}<br />
<br />
'''Wikkii''' is a [[wikifarm]]. It hosts about 3000 wikis. It also operates a few other wikifarms located in different domains to allow users to have a different domain name for their wikis.<ref>http://wikkii.com/wiki/Start_a_Wiki</ref><br />
<br />
Wikkii also operates Wikkii.net, which is a service for advanced users to install their wikis by themselves so that they can maximize <br />
the amount of customization they could do on their wikis. Wikkii.org is also operated by Wikkii, to allow users to create a few pages instead of creating a completely new wiki.<br />
<br />
For a list of wikis hosted in this wikifarm see: https://code.google.com/p/wikiteam/source/browse/trunk/listsofwikis<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== See also ==<br />
* [[List of wikifarms]]<br />
<br />
== References ==<br />
<references /><br />
<br />
== External links ==<br />
* http://wikkii.com<br />
<br />
{{Navigation box}}<br />
<br />
[[Category:Wikis]]</div>Megalanya0https://wiki.archiveteam.org/index.php?title=Neoseeker.com/Twitter_account&diff=27376Neoseeker.com/Twitter account2017-01-16T15:47:33Z<p>Megalanya0: MOTHERFUCKER ! ! !</p>
<hr />
<div>== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==</div>Megalanya0https://wiki.archiveteam.org/index.php?title=Artchive&diff=27375Artchive2017-01-16T15:47:24Z<p>Megalanya0: MOTHERFUCKER ! ! !</p>
<hr />
<div>{{Historical}}<br />
Proposal: someone (ArchiveTeam!) should archive and preserve web-based art installations.<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== What? ==<br />
<br />
Giving a definition of 'Art' is hard, but we could keep it as wide as possible. Anything that even remotely resembles art is interesting, especially if it generated any kind of hype on the web.<br />
<br />
Here's a list of examples. (A bit too restricted examples: both use crowdsourcing, which should certainly not be a requirement &mdash; and how much art is there in giraffes anyway? &mdash; but I hope you get the idea.)<br />
<br />
=== The Sheep Market (Archived) ===<br />
<br />
"The Sheep Market, is a web-based artwork that appropriated the MTurk system to implicate thousands of workers in the creation of a massive database of drawings."<br />
<br />
As one of the first web-based artworks that used crowdsourcing, I think this is worth keeping. Currently hosted by the artist.<br />
<br />
http://www.thesheepmarket.com/<br />
<br />
=== One Million Giraffes (Archiving) ===<br />
<br />
"A Norwegian man who made a bet with his friend that he could get a million giraffe pictures", well, you get the idea. I'm not sure if it's really art, but as it generated quite a hype it's worth archiving.<br />
<br />
Hosted by the 'Norwegian man', who writes about problems with hosting costs on the blog.<br />
<br />
http://www.onemilliongiraffes.com/<br />
<br />
=== The Million Dollar Homepage ===<br />
<br />
Art? Well.... Anyway, it is an bit of web history. Saved in the Wayback Machine.<br />
<br />
http://www.milliondollarhomepage.com/<br />
<br />
<br />
== Do you know more? ==<br />
<br />
There must be!<br />
<br />
{{Navigation box}}</div>Megalanya0https://wiki.archiveteam.org/index.php?title=NUjij&diff=27373NUjij2017-01-16T15:47:12Z<p>Megalanya0: MOTHERFUCKER ! ! !</p>
<hr />
<div>{{Infobox project<br />
| title = NUjij<br />
| logo = nujij-logo.png<br />
| image = nujij_screenshot.png<br />
| URL = http://nujij.nl<br />
| project_status = {{closed}}<br />
| archiving_status = {{rescued}} [https://archive.org/details/archiveteam_nujij archiveteam_nujij]<br />
| tracker = [http://tracker.archiveteam.org/nujij nujij]<br />
| source = [https://github.com/ArchiveTeam/nujij-grab nujij-grab]<br />
}}<br />
<br />
'''NUjij''' is a discussion platform for the Dutch '''NU.nl''' news website.<br />
<br />
It is being shut down on September 12, 2016.<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== How can I help? ==<br />
<br />
<br />
=== Running a Warrior ===<br />
<br />
You can start up a [[Warrior]] and there select ''NUjij''. (If you don't really care what you are archiving, select ''ArchiveTeam's Choice'' instead, as at some points ArchiveTeam may priorize another project.)<br />
<br />
=== Running the script manually ===<br />
<br />
If you use Linux and you're a bit familiar with it, you can try running the script directly.<br />
<br />
The instructions can be found at [https://github.com/ArchiveTeam/nujij-grab github.com/ArchiveTeam/nujij-grab].<br />
<br />
{| class="mw-collapsible mw-collapsed" style="text-align:left;"<br />
! Some additional information<br />
|-<br />
| Don't forget to replace YOURNICKHERE with your nickname.<br />
<br />
The number after <code>--concurrent</code> determines how many threads run at the same time. You can increase this number if your resources (RAM, CPU, bandwidth) are sufficient. However, if you constantly see messages about rate limiting, there is no need to increase the concurrency.<br />
<br />
If you want to stop the script, please do it gracefully if possible. To do so, create an empty file named '''STOP''' in the folder of the script (terminal command: <code>touch STOP</code>). The script finishes the current item(s) and stops only after that. (If you kill the script immediately, the items get broken, and they will need to be reassigned to another user.) – Before starting the script again, don't forget to remove the STOP file.<br />
<br />
If you see "Project code is out of date", kill the script, go to its folder (<code>cd nujij-grab</code>) and issue <code><nowiki>git pull https://github.com/ArchiveTeam/</nowiki>nujij-grab</code>. After the updating has finished, re-launch the script.<br />
|}<br />
<br />
=== Donating to the Internet Archive ===<br />
<br />
Content downloaded by the ArchiveTeam will be uploaded to the [[Internet Archive]], where it will be stored and be available – hopefully – forever. However, storing it costs thousands of dollars in the long run. So, if you can afford, please consider donating to the Internet Archive, so that this piece of history can be kept for us all. http://archive.org/donate<br />
<br />
=== Do you like our cause? ===<br />
<br />
If you want to help in other projects, want to learn more about ArchiveTeam, or even help in development in general, navigate to the [[Main Page]] of this wiki, from there you can reach a lot of information. The Team consists of volunteers working on the projects in their free time, so helping hands (and resources) are always welcome.<br />
<br />
{{Navigation box}}</div>Megalanya0https://wiki.archiveteam.org/index.php?title=IFTTT&diff=27371IFTTT2017-01-16T15:47:00Z<p>Megalanya0: MOTHERFUCKER ! ! !</p>
<hr />
<div>{{Infobox project<br />
| title = IFTTT<br />
| logo = IFTTT_logo.png<br />
| image = IFTTT-homepage.png<br />
| URL = {{url|1=http://www.ifttt.com}}<br />
| project_status = {{online}}<br />
| archiving_status = {{nosavedyet}}<br />
}}<br />
IFTTT (IF This Then That) is a site that allows you to create "recipes" that link actions between "channels".<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
==Using IFTTT for backup==<br />
It is very easy to use IFTTT for backup with various recipes that automatically backup your data (email, Facebook photos, etc.) to other sites (Google Drive, Dropbox, etc.).<br />
<br />
==Channels==<br />
*[[500px]]<br />
*Android Device<br />
*Android Location<br />
*Android Notifications<br />
*Android Phone Call<br />
*Android Photos<br />
*Android SMS<br />
*Android Wear<br />
*[[App.net]]<br />
*[[AppZapp]]<br />
*Automatic<br />
*Best Buy<br />
*[[URLTeam|bit.ly]]<br />
*blink(1)<br />
*[[Blogger]]<br />
*[[Box]]<br />
*[[Boxcar]] (removed)<br />
*[[Boxcar 2]]<br />
*Boxoh Package Tracking<br />
*Bttn<br />
*[[Buffer]]<br />
*[[BuzzFeed]]<br />
*Campfire<br />
*Chain<br />
*[[Craigslist]]<br />
*[[Dailymotion]]<br />
*Dash<br />
*Date & Time<br />
*[[Delicious]]<br />
*[[Digg]]<br />
*[[Diigo]]<br />
*[[Dropbox]]<br />
*[[eBay]]<br />
*Email<br />
*Email Digest<br />
*[[Entertainment Weekly]]<br />
*[[ESPN]]<br />
*[[ESPN|ESPN Olympics]] (only when the Olympics are taking place)<br />
*[[Etsy]]<br />
*[[Evernote]]<br />
*[[EyeFi Cloud]]<br />
*[[Facebook]]<br />
*[[Facebook|Facebook Groups]]<br />
*[[Facebook|Facebook Pages]]<br />
*[[Feedly]]<br />
*[[ffffound!]]<br />
*Fitbit<br />
*[[Fiverr]]<br />
*[[Flickr]]<br />
*FollowUp.cc<br />
*[[Foursquare]]<br />
*Garageio<br />
*[[Giphy]]<br />
*[[Github]]<br />
*[[Gmail]]<br />
*[[Google Calendar]]<br />
*[[Google Drive]]<br />
*Google Glass<br />
*[[Google Reader]] (removed)<br />
*[[Google Talk]] (removed)<br />
*[[GroupMe]]<br />
*[[Gumroad]]<br />
*Harmony<br />
*Homeboy<br />
*Honeywell evohome<br />
*Honeywell Single-zone Thermostat<br />
*HootSuite (removed)<br />
*IFTTT<br />
*[[Instagr.am]]<br />
*[[Instapaper]] (old channel removed)<br />
*Instapush<br />
*[[InStyle]]<br />
*iOS Contacts<br />
*iOS Location<br />
*iOS Notifications<br />
*[[iCloud|iOS Photos]]<br />
*[[iCloud|iOS Reminders]]<br />
*Is It Christmas?<br />
*JetSetMe<br />
*Kato<br />
*[[Last.fm]]<br />
*Launch Center Pro<br />
*[[Life360]]<br />
*LIFX<br />
*[[LinkedIn]]<br />
*[[littleBits]]<br />
*Lutron Caséta Wireless and Serena Shades<br />
*Manything<br />
*Misfit<br />
*[[MixRadio]]<br />
*[[Moped]] (removed)<br />
*Myfox<br />
*Nest Protect<br />
*Nest Thermostat<br />
*Netatmo Weather Station<br />
*[[NewsBlur]]<br />
*Nike+<br />
*[[NowVia]]<br />
*Numerous<br />
*[[OneDrive]] (formerly SkyDrive)<br />
*[[OneNote]]<br />
*ORBneXt<br />
*Parrot Flower Power<br />
*[[People]]<br />
*Philips Hue<br />
*Phone Call<br />
*[[Pinboard]]<br />
*[[Pocket]]<br />
*Printhug<br />
*[[Pryv]]<br />
*[[Push.co]] (removed)<br />
*[[Pushalot]]<br />
*[[Pushbullet]]<br />
*[[Pushover]]<br />
*QualityTime<br />
*Quip<br />
*Rachio Iro<br />
*[[Readability]]<br />
*[[ReadingPack]]<br />
*[[reddit]]<br />
*Revolv<br />
*RSS<br />
*[[Saga]]<br />
*Salesforce Chatter<br />
*ShopYourWay<br />
*Sighthound Video<br />
*[[Sina Weibo]]<br />
*[[Slack]]<br />
*Slice<br />
*Smappee<br />
*SmartThings<br />
*SMS<br />
*[[SoundCloud]]<br />
*Space<br />
*Spark<br />
*[[Sports Illustrated]]<br />
*Square<br />
*Stockimo<br />
*Stocks<br />
*[[Storify]]<br />
*Stripe<br />
*Sunlight Foundation<br />
*Surfline<br />
*Svpply (removed)<br />
*[[The New York Times]]<br />
*[[TIME]]<br />
*[[Todoist]]<br />
*[[Toodledoo]]<br />
*TrackIf<br />
*[[Tumblr]]<br />
*[[Twitter]]<br />
*Ubi<br />
*UP by Jawbone<br />
*[[Vimeo]]<br />
*Weather<br />
*WeMo Insight Switch<br />
*WeMo Light Switch<br />
*WeMo Maker<br />
*WeMo Motion<br />
*WeMo Switch<br />
*Whistle<br />
*Wink: Aros<br />
*Wink: Egg Minder<br />
*Wink: Nimbus<br />
*Wink: Pivot Power Genius<br />
*Wink: Porkfolio<br />
*Wink: Spotter<br />
*Withings<br />
*[[WordPress]]<br />
*[[Yahoo! Fantasy Sports]]<br />
*[[Yammer]]<br />
*[[Yo]]<br />
*[[YouTube]]<br />
*[[Zootool]] (removed)<br />
{{Navigation box}}</div>Megalanya0https://wiki.archiveteam.org/index.php?title=Infinite_Crisis&diff=27370Infinite Crisis2017-01-16T15:46:48Z<p>Megalanya0: MOTHERFUCKER ! ! !</p>
<hr />
<div>{{Infobox project<br />
| title = Infinite Crisis<br />
| image = infinitecrisis.png<br />
| URL = https://www.infinitecrisis.com<br />
| project_status = {{offline}}<br />
| archiving_status = Some parts {{partiallysaved}} through [[ArchiveBot]]<ref>http://archive.fart.website/archivebot/viewer/job/101d1</ref><br />
| irc = finite-crisis<br />
}}<br />
<br />
Infinite Crisis is a multiplayer online battle arena (or MOBA) game somewhat based on the DC Comics series of the same name.<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
=References=<br />
<references/><br />
<br />
{{Navigation box}}</div>Megalanya0https://wiki.archiveteam.org/index.php?title=Template:Internet_history&diff=27369Template:Internet history2017-01-16T15:46:37Z<p>Megalanya0: MOTHERFUCKER ! ! !</p>
<hr />
<div>== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==</div>Megalanya0https://wiki.archiveteam.org/index.php?title=Wikipedia&diff=27368Wikipedia2017-01-16T15:46:27Z<p>Megalanya0: MOTHERFUCKER ! ! !</p>
<hr />
<div>{{Infobox project<br />
| title = Wikipedia<br />
| url = http://www.wikipedia.org/<br />
| project_status = {{online}}<br />
| archiving_status = {{saved}}<br />
| irc = wikiteam<br />
}}<br />
<br />
'''Wikipedia''' is the largest [[wiki]] on the planet, with several million articles available in English and several million more in dozens of available languages.<br />
<br />
[[File:Wikipedia nostalgia.png|thumb|right|[http://nostalgia.wikipedia.org Wikipedia nostalgia], a frozen version of Wikipedia from 2001]] <br />
[[File:Wikipedia, the free encyclopedia april fools day 2010.png|thumb|right|April Fools Day 2010]]<br />
<center>'''No more [[Library of Alexandria|Libraries of Alexandria]] destroyed.'''</center><br />
<br />
[[File:Size of English Wikipedia in August 2010 (L).png|thumb|right|700px|English Wikipedia in August 2010, if printed.]]<br />
<br />
For once, a site that recognizes the importance of third-party backups! They have a [http://dumps.wikimedia.org/ main downloads page] from which you can get XML dumps from individual wikis (Wikimedia Foundation hosts more than 800 wikis: Wikipedias, Wiktionaries, Wikinews, Wikisources, Wikibooks, Wikiquotes, Wikiversities, Wikispecies, Wikimedia Commons, Wikivoyage, Wikidata).<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== Backups ==<br />
As of 19:07, 10 July 2016 (EDT), dumps.wikimedia.org only has about 10 earlier versions of dumps for each wiki, generally going back to around October 2015. They don't seem to be linked, but they are accessible via http://dumps.wikimedia.org/''wikiname''/ (where ''wikiname'' is listed on the index page).<br />
<br />
There's an old article dump (2008/03/12) [http://thepiratebay.org/torrent/4794236/enwiki-20080312-pages-articles.xml.bz2 up on The Pirate Bay] [magnet:?xt=urn:btih:5dc4df42109c8d1dbc759276d62225223ca69c53&dn=enwiki-20080312-pages-articles.xml.bz2&tr=udp%3A%2F%2Ftracker.openbittorrent.com%3A80&tr=udp%3A%2F%2Fopen.demonii.com%3A1337&tr=udp%3A%2F%2Ftracker.coppersurfer.tk%3A6969&tr=udp%3A%2F%2Fexodus.desync.com%3A6969 magnet], from the [http://thepiratebay.org/user/archiveteam/ ArchiveTeam TPB account], although it has no seeders as of 19:07, 10 July 2016 (EDT).<br />
<br />
There is no current public backup for images uploaded to [[Wikimedia Commons]], which has about 32 million images and other media files uploaded on its services as of 19:07, 10 July 2016 (EDT).<br />
<br />
Links:<br />
* [http://download.wikipedia.org/ official backups site]<br />
* http://download.wikimedia.org/archive/ - about a dozen older dumps, including [http://dumps.wikimedia.org/archive/enwiki/20060816/ one from 2006], as well as 2 from [https://dumps.wikimedia.org/archive/2001/ 2001].<br />
* {{url|http://noc.wikimedia.org/~tstarling/wikipedia-logs-2001-08-17.7z|old wikipedia backups discovered}}<br />
** [https://web.archive.org/web/20130522000621/http://noc.wikimedia.org/~tstarling/wikipedia-logs-2001-08-17.7z Direct Wayback link]<br />
** {{url|http://lists.wikimedia.org/pipermail/foundation-l/2010-December/063088.html|announcement on foundation-l}}<br />
** {{url|https://web.archive.org/web/20120306052415/http://grey.colorado.edu/wikipedia_2001/|script for parsing them}}<br />
<br />
* Internet Archive results: http://www.archive.org/search.php?query=wikipedia%20dumps (223,142 results as of 20:25, 10 July 2016 (EDT))<br />
** {{IA id|wikimediadownloads}} - Primary collection, manage by Hydriz<br />
*** 915,108 items, with archivedates from Nov 10, 2005 through Jul 10, 2016 as of 20:34, 10 July 2016 (EDT)<br />
** {{IA id|wikipediadumps}} - Older, somewhat forgotten collection<br />
*** 810 items, with archivedates from April 9, 2010 through Aug 13, 2014 as of 20:25, 10 July 2016 (EDT)<br />
*** Three sets of all or most of the different language editions of Wikipedia, from 2010-04-08, 2010-06-10 and 2011-08-08.<br />
**** 2010-04 has an underscore between the wiki name and the date, and is missing ltwiki (Lithuanian) presumably because it was created between then and June 2010.<br />
**** 2010-06 has the same identifier format, and contains one edition that is missing from the other two: emwiki (which appears to be the [[wikipedia:Emilian-Romagnol]] edition).<br />
**** 2011-08 has a dash (rather than an underscore) both before and after "wiki", and is missing 7 editions that are present in the other two (ace, ckb, hu, krc, mwl, pcd, pnb) and contains 7 missing from them (ak, be_x_old, eml, fj, hz, ng, tokipona).<br />
*** There are also 12 other misc dumps:<br />
**** {{IA id|arwiki20110112}}<br />
**** {{IA id|de_labswikimedia-20110904}}<br />
**** {{IA id|de_labswikimedia-20111013}}<br />
**** {{IA id|en_labswikimedia-20110906}}<br />
**** {{IA id|en_labswikimedia-20111015}}<br />
**** {{IA id|enwiki-20110620-item-1-of-2}}<br />
**** {{IA id|enwiki-20110620-item-2-of-2}}<br />
**** {{IA id|flaggedrevs_labswikimedia-20110907}}<br />
**** {{IA id|flaggedrevs_labswikimedia-20111016}}<br />
**** {{IA id|idwiki20101106}}<br />
**** {{IA id|readerfeedback_labswikimedia-20110907}}<br />
**** {{IA id|readerfeedback_labswikimedia-20111016}}<br />
<br />
* [http://en.wikipedia.org/wiki/User:Emijrp/Wikipedia_Archive Compilation of links to Wikipedia archives]<br />
* [http://nostalgia.wikipedia.org/wiki/HomePage A backup of Wikipedia as of Thursday, December 20, 2001]<br />
<br />
=== Transferring to IA ===<br />
[[User:Hydriz|Hydriz]] is currently transferring the dumps of all Wikimedia projects into the Internet Archive. Wikimedia itself has provided resources to me for transferring these dumps to the Internet Archive. The results are in the {{IA id|wikimediadownloads}} collection, which is still being kept up to date as of 20:38, 10 July 2016 (EDT).<br />
<br />
== Vital signs ==<br />
<br />
Stable, but they seriously use a lot of tactics to get donations.<br />
<br />
== Offline readers ==<br />
* [http://www.okawix.com/ Okawix] ([http://www.okawix.com/zenos/ files])<br />
* [http://www.kiwix.org Kiwix] ([http://download.kiwix.org/zim/ files])<br />
<br />
== See also ==<br />
* [[Wikimedia Commons]]<br />
* [[Wikia]]<br />
* [[Wikis]]<br />
* [[Nupedia]]<br />
* [[GNUPedia]]<br />
* [[Citizendium]]<br />
* [[WikiTravel]] - Not a Wikimedia project, but its content was forked to create WMF-hosted rival Wikivoyage.<br />
* [[WikiTeam]]<br />
<br />
== External links ==<br />
* http://www.wikipedia.org<br />
* http://www.wikimedia.org<br />
* https://en.wikipedia.org/wiki/User:Emijrp/Wikipedia_Archive<br />
<br />
{{Navigation box}}<br />
<br />
[[Category:Wikis]]</div>Megalanya0https://wiki.archiveteam.org/index.php?title=Projects&diff=27367Projects2017-01-16T15:46:13Z<p>Megalanya0: MOTHERFUCKER ! ! !</p>
<hr />
<div>{{Projects status}}<br />
<br />
This page should contain, or directly link to, almost all ArchiveTeam archiving endavours, categorized.<br />
* '''[[#Current projects|Current projects]]''': currently active, upcoming and recently finished grandiose ArchiveTeam projects. (Extract of the next two categories.)<br />
* '''[[#Warrior projects|Warrior projects]]''': projects that utilize(d) ArchiveTeam's distributed archiving system.<br />
* '''[[#Manual projects 2|Manual projects]]''' that need(ed) much more effort than just pushing a button.<br />
* '''[[#Small projects|Small projects]]''': small-scale website archiving projects usually done by a single individual.<br />
* '''[[#Early projects|Early projects]]''': first archiving endavours on the dawn of ArchiveTeam, in a format nobody is apparently able/dare to touch.<br />
<br />
(The box on the top counts projects having dedicated wiki pages, those numbers aren't complete and far don't contain all projects mentioned in the sections below.)<br />
<br />
If you know of a website in danger, let us know that on [[IRC]]. If it's a larger site, please also mention it on the '''[[Deathwatch]]''' page. And, after a decision is made on IRC, or if it doesn't need a decision, then, to help things kept documented and up to date, you are encouraged to add projects, or modify their status<br />
* in the appropriate section(s),<br />
* on the project's dedicated wiki page (if any),<br />
* on [[Deathwatch]] and/or on [[Alive... OR ARE THEY]].<br />
<br />
The box on the top is generated automatically from projects' dedicated wiki pages, so shouldn't be touched.<br />
<br />
'''Important:''' Contents of sections below are '''embedded''' from other pages, that is, don't edit the section, nor this page, but use the "'''Edit this list'''" link! (That opens the corresponding page for editing, and after editing, you'll be forwarded to the page containing only that list: don't worry, you didn't delete the others.)<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
= Warrior projects =<br />
<div class="mw-collapsible mw-collapsed" style="width:100%; background-color: #99FF99; border: 1px solid; padding: 5px"><br />
ArchiveTeam's past, current and future Warrior projects with details, in a table form.<br />
<!-- TO EDIT THE LIST, GO BACK AND CLICK "Edit this list". --><br />
<div class="mw-collapsible-content" style="width:100%"><br />
'''<span class="plainlinks">[http://archiveteam.org/index.php?title=Warrior_projects&action=edit Edit this list]</span>'''<br />
{{:Warrior projects}}<br />
</div><br />
</div><br />
<br />
= Manual projects =<br />
<div class="mw-collapsible mw-collapsed" style="width:100%; background-color: #CCFF99; border: 1px solid; padding: 5px"><br />
Difficult, discussion-intensive, human-resource-intensive and audit projects.<br />
<!-- TO EDIT THE LIST, GO BACK AND CLICK "Edit this list". --><br />
<div class="mw-collapsible-content" style="width:100%"><br />
'''<span class="plainlinks">[http://archiveteam.org/index.php?title=Manual_projects&action=edit Edit this list]</span>'''<br />
{{:Manual projects}}<br />
</div><br />
</div><br />
<br />
= Small projects =<br />
<div class="mw-collapsible mw-collapsed" style="width:100%; background-color: #FFCCFF; border: 1px solid; padding: 5px"><br />
List of smaller website rescuing projects, usually done by single individuals.<br />
<!-- TO EDIT THE LIST, GO BACK AND CLICK "Edit this list". --><br />
<div class="mw-collapsible-content" style="width:100%"><br />
'''<span class="plainlinks">[http://archiveteam.org/index.php?title=Small_projects&action=edit Edit this list]</span>'''<br />
{{:Small projects}}<br />
</div><br />
</div><br />
<br />
= Early projects =<br />
<div class="mw-collapsible mw-collapsed" style="width:100%; background-color: lightgray; border: 1px solid; padding: 5px"><br />
List of ArchiveTeam's early endavours, for historical interest, not edited.<br />
<!-- TO EDIT THE LIST, GO BACK AND CLICK "Edit this list". --><br />
<div class="mw-collapsible-content" style="width:100%"><br />
'''<span class="plainlinks">[http://archiveteam.org/index.php?title=Early_projects&action=edit Edit this list]</span>'''<br />
{{:Early projects}}<br />
</div><br />
</div><br />
<br />
<br />
{{Navigation pager<br />
| previous = Fire Drill<br />
| next = Philosophy<br />
}}<br />
{{Navigation box}}</div>Megalanya0https://wiki.archiveteam.org/index.php?title=ArchiveTeam_Warrior&diff=27366ArchiveTeam Warrior2017-01-16T15:46:02Z<p>Megalanya0: MOTHERFUCKER ! ! !</p>
<hr />
<div>== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== Basic usage ==<br />
<br />
The warrior runs on Windows, OS X and Linux using a virtual machine. You'll need one of:<br />
<br />
* [https://www.virtualbox.org/ VirtualBox] (recommended)<br />
* [https://www.vmware.com/products/player/ VMware workstation/player] (free-gratis for personal use)<br />
* [[#Alternative virtual machines|See below for alternative virtual machines]]<br />
<br />
<br />
=== Quick start instructions for VirtualBox ===<br />
<br />
# Download the [http://archive.org/download/archiveteam-warrior/archiveteam-warrior-v2-20121008.ova appliance] (174MB).<br />
# Launch VirtualBox<br />
# In VirtualBox, click File > Import Appliance and open the file.<br />
# Start the virtual machine.<br />
#* It will fetch the latest updates and will eventually tell you to start your web browser.<br />
# Using your regular web browser, visit http://localhost:8001/<br />
# On the left, click "Your settings".<br />
# Choose a username - we'll show your progress on the [[tracker|leaderboard]].<br />
# On the left, click "Available projects" tab and pick a project to work on.<br />
#* Even better: select "ArchiveTeam's Choice" to let your warrior work on the most urgent project.<br />
<br />
<br />
=== Start instructions for VMWare Player ===<br />
<br />
# Download the [http://archive.org/download/archiveteam-warrior/archiveteam-warrior-v2-20121008.ova appliance] (174MB).<br />
# Launch VMWare Player<br />
# In Player on the right, click "Open Virtual Machine", open the file and import the virtual machine.<br />
# Select the virtual machine and click "Edit virtual machine settings".<br />
#* Select "Hard Disk 2 (IDE)" > "Advanced..." and change it to "IDE 1:0"<br />
#* Select Network Adapter and set it to "Bridged: Connected directly to the physical network"<br />
# Start the virtual machine.<br />
#* It will fetch the latest updates and will eventually tell you to start your web browser.<br />
# Using your regular web browser, visit the address that is shown on the bottom (e.g. http://192.168.0.100:8001/)<br />
# On the left, click "Your settings".<br />
# Choose a username - we'll show your progress on the [[tracker|leaderboard]].<br />
# On the left, click "Available projects" tab and pick a project to work on.<br />
#* Even better: select "ArchiveTeam's Choice" to let your warrior work on the most urgent project.<br />
<br />
<br />
__TOC__<br />
<br />
== Alternative virtual machines ==<br />
<br />
Thanks to user-effort, there are alternatives:<br />
<br />
* [https://www.docker.io/ Docker] (Linux)<br />
** ([https://github.com/ArchiveTeam/warrior-dockerfile modified dockerfile])<br />
** ([https://hub.docker.com/r/infrequent/at-as-dockerfile modified dockerfile - for manual script execution])<br />
<br />
* [https://www.microsoft.com/en-us/server-cloud/solutions/virtualization.aspx Hyper-V] (Windows 8 Professional)<br />
** ([http://jonimoose.net/2013/archiveteam-warrior-on-hyper-v/ Hyper-V virtual machine])<br />
<br />
Please note that these alternatives are not in widespread use by our warriors, so we may not be able to help with either issues or advanced usage.<br />
<br />
==Warrior FAQ==<br />
<br />
=== Can I use whatever internet access for the warrior? ===<br />
<br />
No. We need "clean" connections. Please ensure the following:<br />
<br />
* No OpenDNS. No ISP DNS that redirects to a search page. Use non-captive DNS servers.<br />
* No ISP connections that inject advertisements into web pages.<br />
* No proxies. Proxies can return bad data. The original HTTP headers and IP address is needed for the WARC file.<br />
* No content-filtering firewalls.<br />
* No censorship. If you believe your country implements censorship, do not run a warrior. <br />
* No Tor. The server may return an error page instead of content if they ban exit nodes.<br />
* No free wifi cafe. Archiving your cafe's wifi service agreement repeatedly is not helpful.<br />
* We prefer connections from many public IP addresses if possible. (For example, if your apartment building uses a single IP address, we don't want your apartment banned.)<br />
<br />
=== Why am I seeing a message that no item was received? ===<br />
<br />
It means that there is no work available. This happens for several reasons:<br />
<br />
* There project has just finished and someone is inspecting the work done. If a problem is discovered, items may be re-queued and more work is available.<br />
* You have checked out / claimed too many items. Reduce your concurrency and let others do some of the work too.<br />
* In a rare case, you have been banned by a tracker administrator because you were requesting too much work, you were tampering with the scripts, a malfunction has occurred, or your internet connection is "unclean".<br />
<br />
=== Why am I seeing a message about rate limiting? ===<br />
<br />
Keep in mind that although downloading the internet for digital preservation and fun are the primary goals of all Archive Team activities, serious stress on the target's server may occur. The rate limit is imposed by a [[Tracker#People|tracker administrator]] and should not be subverted.<br />
<br />
(In other words, we don't want to DDoS the servers.)<br />
<br />
=== Why am I seeing a message about code being out of date? ===<br />
<br />
The warrior will update its code every hour. If you are impatient, please restart the warrior and it will download the latest code and resume work.<br />
<br />
===Help! The warrior is eating all my bandwidth!===<br />
<br />
You can limit the warrior's bandwidth quite easily for VirtualBox as long as you are running a relatively recent version. The option is not offered with a GUI however.<br />
<br />
The command <pre>VBoxManage bandwidthctl archiveteam-warrior-2 add limit --type network --limit 3m</pre> will limit the warrior instance called archiveteam-warrior-2 (the default name of the warrior vm currently) to 3Mb/s. Adjust as needed.<br />
(limit units: k=kilobit, m=megabit, g=gigabit, K=kilobyte, M=megabyte, G=gigabyte)<br />
<br />
<br />
In the latest version of VirtualBox on Windows, the syntax appears to have changed. The correct command now seems to be:<br />
<br />
<pre>VBoxManage bandwidthctl archiveteam-warrior-2 add netlimit --type network --limit 3</pre><br />
<br />
For more info, consult the [http://www.virtualbox.org/manual/ch06.html#network_bandwidth_limit VirtualBox manual (Chapter 6, Section 9)].<br />
<br />
===NAT sucks! I want directly-bridged networking!===<br />
<br />
Simples! (If you're running linux, that is.)<br />
<br />
<pre>VBoxManage modifyvm "archiveteam-warrior-2" --nic1 bridged</pre><br />
<br />
<pre>VBoxManage modifyvm "archiveteam-warrior-2" --bridgeadapter1 eth0</pre><br />
<br />
(We presume you want to bind to <code>eth0</code>. Adjust as required. :))<br />
<br />
=== I turned my warrior VM appliance off. Will those tasks be lost? ===<br />
<br />
If you've killed your warrior VM instances, then the work your warrior did has been lost, however the tasks will be returned to the pool after a period of time. If you want, you can alert the admins via IRC of what's happened, and they can clear the claims your username may have made. However, this isn't very important on most projects.<br />
<br />
=== I closed my browser or tab with the warrior's web interface. Will those tasks be lost? ===<br />
<br />
No, the web browser interface just provides, well, a user interface to the warrior. As long as the VM is not stopped, it will continue normally.<br />
<br />
=== I need to disconnect my internet / reboot my PC, but I don't want to lose work. ===<br />
<br />
If you pause/suspend the warrior instance, most projects will allow resuming of work in progress when you unsuspend the warrior instance.<br />
<br />
If you decided to use the suspend feature in VirtualBox, please note that if you keep it suspended for too long (more than a few hours), the admins will assume that the item is lost and be re-queued. Using the suspend feature so that you can reboot your computer is perfectly fine.<br />
<br />
=== I told the warrior to shutdown from the interface but nothing has changed! What gives? ===<br />
<br />
The warrior will attempt to finish the current running tasks before shutting down. If you need to shut down right away, go ahead. Your progress will be lost, however the jobs will eventually cycle out to another user.<br />
<br />
=== How much disk space will the warrior use? ===<br />
<br />
Short answer: it depends on the project.<br />
<br />
Long answer: because the way each project defines an item differently, the warrior may be downloading a small file or downloading a whole subsection of a website. The virtual machine is configured by default to use 60GB as an absolute maximum. Any unused virtual machine disk space is not used on the host computer. You may, however, run the virtual machine on less than 60GB if you like to live dangerously. We're downloading the internet after all!<br />
<br />
=== The secondary disk is using up space even though it's not running a project. ===<br />
<br />
Virtual machine disk images do not behave like a regular file. There are several ways to reclaim space:<br />
<br />
* Delete the second disk and put back an empty disk. The warrior should reformat the second disk.<br />
* Delete the entire warrior application and re-import it.<br />
* Use the [http://intgat.tigress.co.uk/rmy/uml/index.html zerofree] program and then clone the disk image. Reattach the cloned disk image.<br />
<br />
=== I can't connect to localhost. ===<br />
<br />
The application includes a configuration to set up port forwarding to the guest machine on port 8001 so you can access the interface through your web browser. If this does not happen, you may need to double check your machine's network settings.<br />
<br />
=== The warrior can't connect to the internet. ===<br />
<br />
It may be possible that the virtual machine has picked up the address of the local DNS cache on your computer which the virtual machine does not have access to. <br />
<br />
If you experience this on VirtualBox, see [http://askubuntu.com/questions/204953/virtualbox-dns-stopped-working-on-upgrade-to-12-10 this question and answer].<br />
<br />
=== I'm looking at the text scrolling by and I notice some errors. rsync is not working. ===<br />
<br />
Uh-oh! Something is not right. Notify us immediately in the appropriate [[IRC]] channel.<br />
<br />
=== The item I'm working on is downloading thousands of URLs and it's taking hours. ===<br />
<br />
See the above question and reboot the warrior as appropriate.<br />
<br />
=== I'm looking at the leaderboard. What's that icon beside the username? ===<br />
<br />
That's just the warrior logo: [[File:Archive_team.png|42px]] (click on the image for a larger version). It means that that person is using the warrior. Those without the icon are running the scripts manually.<br />
<br />
=== What's that guy doing in the logo? ===<br />
<br />
The place is on fire! But don't worry, he safely escaped with the rescued data in his arms.<br />
<br />
<br />
[[Image:Archiveteam-warrior-sticker.png|256px|right]]<br />
<br />
=== That’s awesome – can I slap this logo on my laptop to show my Internet-preservation pride? ===<br />
<br />
[http://www.redbubble.com/people/ajhajh/works/12857655-archive-team-warrior-stickers?p=sticker You sure can! The ArchiveTeam Warrior laptop sticker can start conversations about archiving, if you’re into that.]<br />
<br />
=== I want to log in to the virtual machine. How do I do this? ===<br />
<br />
Unless you know what you are doing, you should not need to do this. But if you want to, the username is <code>root</code> and the password is <code>archiveteam</code>. Then, you can execute <code>sudo -u warrior -i</code> to log in as the warrior user. <br />
<br />
Press ALT+F3 to switch to virtual console number 3. Use ALT+Left or ALT+Right to switch between virtual consoles. There are 6 virtual consoles in total. Consoles 1 and 2 are reserved for the warrior.<br />
<br />
=== Can I run multiple virtual machines at the same time? ===<br />
<br />
Yes, but you'll need to adjust the networking settings.<br />
<br />
On the machine, open up Settings → Network → Adapter 1 → Port Fowarding. You need to adjust the Host Port. For example, ensure your table looks like TCP | 127.0.0.1 | 8123 | | 8001. In this example, you can then visit http://localhost:8123/ as it maps port 8123 in your browser to port 8001 which the warrior uses.<br />
<br />
=== The warrior seems to have too much overhead. I can't run a VM in a VPS! ===<br />
<br />
You don't need to run a virtual machine.<br />
<br />
An option is running Docker containers, based on LXC the overhead is far less than running a full VM on a VPS, it should be noted if you plan on running the ([https://github.com/ArchiveTeam/warrior-dockerfile warrior-dockerfile]) to publish the port to allow access to the web interface.<br />
<pre> docker run -d -p 8001:8001 archiveteam/warrior-dockerfile </pre><br />
<br />
(Above is assumed direct mapping VPS port to container port so if you wanted say <code>port 38001</code> it would be <code>docker run -d -p 38001:8001 archiveteam/warrior-dockerfile </code> Adjust as required. :P)<br />
<br />
<br />
If you are managing a VPS, it's likely you are comfortable with some Linux stuff. '''Projects can be run manually.''' Consult the project wiki page or the source code repository readme file.<br />
<br />
(Note that multiple projects can be also run in isolated environments(containers) for rapid deployment using: ([https://hub.docker.com/r/infrequent/at-as-dockerfile at-as-dockerfile]))<br />
<br />
=== Why a virtual machine in the first place? ===<br />
<br />
The virtual machine is a quick, safe, and easy way for newcomers to help us out. It offers many features:<br />
<br />
* Graphical interface<br />
* Automatically selects which project is important to run<br />
* Self-updating software infrastructure<br />
* Allows for unattended use<br />
* In case of software faults, your machine is not ruined<br />
* Restarts itself in case of runaway programs<br />
* Runs on Windows, Mac, and Linux painlessly<br />
* Ensures consistency in the archived data regardless of your machine's quirks<br />
<br />
If you have suggestions for improving this system, please talk to us as described below.<br />
<br />
=== I'm running the scripts manually in a VPS but it says the code is out of date a while later ===<br />
<br />
It happens when a bug in the scripts is discovered. Bugs are unavoidable especially when the server is out of our control.<br />
<br />
Try the <code>--auto-update</code> option available in Seesaw version 0.8. However, please be aware that you are now executing code automatically. Be sure to run the scripts in a separate user account for safety.<br />
<br />
=== I just imported the ova image and the warrior is stuck on "Preparing the data partition" ===<br />
<br />
This issue has cropped up before and we do not know what causes it. It is recommended to just delete the warrior image and import the ova again. Testing shows that such a reimport works in the majority of cases.<br />
<br />
=== Why is the default project not working? / Why is a manual project not in the Warrior yet? ===<br />
<br />
Sorry. Sometimes the administrators are too busy...<br />
<br />
=== Why are there no projects? ===<br />
<br />
If there are no projects showing, you can help us write one. No projects does ''not'' mean there is nothing left to archive!<br />
<br />
=== The instructions to run the software/scripts are awful and they are difficult to set up. ===<br />
<br />
Well, excuuuuse me, princess!<br />
<br />
We're not a professional support team so help us help you help us all. See below for bug reports, suggestions, or contribute writing code.<br />
<br />
=== Help I'm getting errors when I try to launch the VM ===<br />
If you are receiving ''"Breakpoint has been reached (0x80000003)"'', ''"A critical error has occurred while running the virtual machine and the machine execution has been stopped."'' or VT-X errors you probably have virtualization disabled in you computer's BIOS or your CPU may not support virtualization. You can check this using [http://openlibsys.org/index-ja.html VirtualChecker]<br />
<br />
To enable virtualization reboot the computer and enter the BIOS, the virtualization setting is usually under CPU configuration or Advanced settings.<br />
<br />
=== Where can I file a bug, suggestion, or a feature request? ===<br />
<br />
If the issue is related to the warrior's web interface or the library that grab scripts are using, see [https://github.com/ArchiveTeam/seesaw-kit/issues seesaw-kit issues]. Other issues should be filed into their own [[Dev/Source_Code|repositories]].<br />
<br />
=== I'd like to help write code. Where can I find more info? ===<br />
<br />
Check out the [[Dev]] documentation for details on the infrastructure and details of the source code layout.<br />
<br />
=== I still have a question! ===<br />
<br />
Check out the [[Frequently Asked Questions|general FAQ page]]. Talk to us on [[IRC]]. Use [irc://irc.efnet.org/warrior #warrior] for specific warrior questions or [irc://irc.efnet.org/archiveteam #archiveteam] for general questions.<br />
<br />
== Projects ==<br />
<br />
See: [[Warrior projects]].<br />
<br />
== Are you a coder? ==<br />
<br />
Like the warrior? Interested in how it works under the hood? Got software skills? '''[[Dev|Help us improve it!]]'''<br />
<br />
{{Navigation box}}</div>Megalanya0https://wiki.archiveteam.org/index.php?title=Minecraft.net&diff=27365Minecraft.net2017-01-16T15:45:50Z<p>Megalanya0: MOTHERFUCKER ! ! !</p>
<hr />
<div>{{Infobox project<br />
| title = Minecraft.net<br />
| image = Minecraft.net.png<br />
| URL = http://www.minecraft.net<br />
| project_status = {{online}}<br />
| archiving_status = {{partiallysaved}}<br />
}}<br />
<br />
Minecraft.net is the website for the popular Mojang game, Minecraft.<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
==Site structure==<br />
*Minecraft jars (old launcher):<br />
**<nowiki>http://assets.minecraft.net/<version>/minecraft.jar</nowiki><br />
**<nowiki>http://assets.minecraft.net/<version>/minecraft_server.jar</nowiki><br />
<br />
*Minecraft jars (new launcher):<br />
**<nowiki>http://s3.amazonaws.com/Minecraft.Download/versions/<version>/<version>.jar</nowiki><br />
**<nowiki>http://s3.amazonaws.com/Minecraft.Download/versions/<version>/minecraft_server.<version>.jar (not present for older versions)</nowiki><br />
**<nowiki>http://s3.amazonaws.com/Minecraft.Download/versions/<version>/minecraft_server.<version>.exe (not present for older versions)</nowiki><br />
**<nowiki>http://s3.amazonaws.com/Minecraft.Download/versions/<version>/<version>.json</nowiki><br />
<br />
*<nowiki>https://minecraft.net/haspaid.jsp?user=<user></nowiki><br />
**Returns true if the user is premium (has paid for Minecraft). Case-sensitive.<br />
<br />
*<nowiki>http://s3.amazonaws.com/MinecraftSkins/<user>.png</nowiki><br />
*<nowiki>http://s3.amazonaws.com/MinecraftCloaks/<user>.png</nowiki><br />
**Returns an access denied page if the username is invalid or doesn't have a skin/cloak. Case-sensitive.<br />
<br />
*<nowiki>http://assets.minecraft.net</nowiki><br />
**Returns a list of files hosted on assets.minecraft.net.<br />
<br />
*<nowiki>http://s3.amazonaws.com/MinecraftDownload</nowiki><br />
**Returns a list of files hosted on s3.amazonaws.com/MinecraftDownload/.<br />
<br />
*<nowiki>http://s3.amazonaws.com/Minecraft.Download/versions/versions.json</nowiki><br />
**Returns an incomplete list of Minecraft versions hosted on s3.amazonaws.com/Minecraft.Download/. Snapshots 13w16a and newer are (mostly) left of the list, but still can be downloaded.<br />
<br />
*<nowiki>https://s3.amazonaws.com/Minecraft.Download/indexes/<version>.json</nowiki><br />
*<nowiki>https://s3.amazonaws.com/Minecraft.Download/indexes/legacy.json</nowiki><br />
**Returns a list of resources to download from resources.download.minecraft.net for the respective Minecraft version.<br />
<br />
==External links==<br />
* [http://www.minecraftforum.net/forums/mapping-and-modding/minecraft-mods/1285484-world-downloader-mod World Downloader Mod], useful for saving dying multiplayer servers.<br />
<br />
===Archives===<br />
* [https://archive.org/details/assets.minecraft.net-panicgrab-20140807 assets.minecraft.net]<br />
* [https://archive.org/details/mojang.com-notch-panicgrab-20140912 mojang.com/notch] (not directly related to minecraft.net, contains Minecraft 4K)<br />
* [[ArchiveBot]] crawls containing minecraft.net, its subdomains, and asset/resource servers:<br />
** [https://archive.org/details/archiveteam_archivebot_go_151 Archivebot GO Pack 151] (urls-dequis.org-all_minecraft_resources.txt, urls-dequis.org-assets.minecraft.net.txt)<br />
** [https://archive.org/details/archiveteam_archivebot_go_20140921200925 Archivebot GO Pack 20140921200925] (minecraft.net)<br />
** [https://archive.org/details/archiveteam_archivebot_go_20141011220004 Archivebot GO Pack 20141011220004] (pi.minecraft.net, stats.minecraft.net)<br />
<br />
{{Navigation box}}</div>Megalanya0https://wiki.archiveteam.org/index.php?title=The_Mail_Archive&diff=27364The Mail Archive2017-01-16T15:45:41Z<p>Megalanya0: MOTHERFUCKER ! ! !</p>
<hr />
<div>{{Infobox project<br />
| title = The Mail Archive<br />
| description = <br />
| image = Mail-archive_com_Oct13-2015.jpeg<br />
| URL = {{url|1=http://www.mail-archive.com|2=mail-archive.com}}<br />
| project_status = {{online}}<br />
| archiving_status = {{notsavedyet}}<br />
}}<br />
<br />
'''The Mail Archive''' is what it sounds like; it's an ad-supported mailing list archive that users can add arbitrary mailing lists to. Started in 1998, it currently holds 121,034,946 archived postings, on 4,314 mailing lists as of October 2015.<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==</div>Megalanya0https://wiki.archiveteam.org/index.php?title=Kephost.com&diff=27363Kephost.com2017-01-16T15:45:32Z<p>Megalanya0: MOTHERFUCKER ! ! !</p>
<hr />
<div>{{Infobox project<br />
| title = Kephost.com<br />
| URL = {{url|http://kephost.com}}<br />
| description = Kephost.com, a Hungarian image sharing site<br />
| image = Kephost_com_screenshot.png<br />
| project_status = {{Online}}<br />
| archiving_status = {{partiallysaved}} {{blue|continuously}}<br />
}}<br />
<br />
:''Not to be confused with [[Kephost.hu]].''<br />
<br />
'''Kephost.com''' is a Hungarian one-click image sharing service, one of the oldest and most popular ones besides [[kepfeltoltes.hu]]. People can easily upload files from their computers with a few clicks, and then paste the URLs of the images (or their pages) wherever they want. Here users can also choose a category to upload to, and even create an account for themselves. Pictures can be commented on through the embedded Facebook comment module.<br />
<br />
The service started in 2008. Since then, they have changed design in almost every year, and also have had several different site structures (including accessing images) – however, they seem to have not deleted older pictures since the first, 2010 redesign.<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== Site structure ==<br />
<br />
As already mentioned, they have had several different site structures. The following applies to their last, which has been in use apparently since 2014. <br />
<br />
Uploaded images can be public, and also probably private (this function needs registration). Only public pictures appear in the image browser, private ones cannot be discovered. Also, recently some older pictures don't appear in the end of the image browser. <br />
<br />
Images are given a page whose URL is <nowiki>http://www.kephost.com/image/&lt;ID></nowiki> where &lt;ID> seems to be an <s>incrementing</s><ref>Partly incrementing, partly indefinite. No general rule of handing out tokens has been found yet.</ref> base-62 number (alphabet A..Z, 0..9, a..z (in this order (?)), starting with one character). Letter 'A' acts as a leading zero when being the first character, that is, Aw = AAw = AAAw etc. The location of the actual image is <nowiki>http://kephost.com/images/&lt;YYYY>/&lt;MM>/&lt;DD>/&lt;FILENAME>.&lt;EXT></nowiki> where YYYY-MM-DD is the date the file was uploaded, FILENAME is the same as the user's file had on their computer, &lt;EXT> is its extension. This original version is always linked in with an <code>&lt;a></code> tag, as the download link. A thumbnail version is also generated for every picture, alhough not shown (just provided among the links); its filename is postfixed with .th, like &lt;FILENAME>.th.&lt;EXT>. Finally, if the image is large enough, an .md (medium) version is also generated (filename extended with .md), and '''in this case this version, otherwise the original one is shown''' on the page (with an <code>&lt;img></code> tag).<br />
<br />
=== Older images ===<br />
Archiving older images would need web scrape. Some of the older pattern prefixes:<br />
* <code>kephost.com/view3.php?filename=</code> – followed by a random alphanumeric/underscore token plus extension<br />
* <code>kephost.com/images[23]/</code> – followed by a random alphanumeric/underscore token plus extension<br />
* <code>kephost.com/images4/</code> – followed by YYYY/M/D (one or two digit month and day!) then a random alphanumeric/underscore token plus extension, the token often also contains the date in YYYY_M_D format<br />
* <s><code>kephost.com/view-NUM_TOKEN.EXT</code> – NUM might be a day counter, TOKEN might be the original filename</s> – used until 2010, images seem to have been deleted<br />
<br />
== Archiving ==<br />
<br />
{| class="wikitable" style="margin-left:auto; margin-right:auto"<br />
|+ Status table<br />
! Upload date !! Status !! Note<br />
|-<br />
| 2008 – 2010 || {{Red|Lost}} || First structure<br />
|-<br />
| 2010 – 2014-05-16 || {{Purple|Not saved yet}} || Second structure<br />
|-<br />
| 2014-05-16 – 2016-03-21 || {{green|Saved}} ([http://archive.fart.website/archivebot/viewer/job/9ajfi archive]) || 291,039 images<br />
|-<br />
| 2016-03-21 – 2016-09-04 || {{green|Saved}} ([http://archive.fart.website/archivebot/viewer/job/dgekc archive]) || 135,604 images<br />
|}<br />
<div style="text-align:center">Very first and very last date non-inclusive</div><br />
<br />
== Notes ==<br />
<references/><br />
<br />
<br />
{{Hungarian websites}}<br />
{{Navigation box}}<br />
[[Category:Image hosting]]</div>Megalanya0https://wiki.archiveteam.org/index.php?title=Frequently_Asked_Questions&diff=27362Frequently Asked Questions2017-01-16T15:45:22Z<p>Megalanya0: MOTHERFUCKER ! ! !</p>
<hr />
<div>'''How can I help?'''<br />
<br />
See [[Who We Are]], [[Deathwatch]], and [[:Category:Projects_status]]. These pages describe our projects and the things you can do to help.<br />
<br />
'''Is the Archive Team affiliated with the [[Internet Archive]] (archive.org)?'''<br />
<br />
No. A few members are affiliated, but majority of Archive Team members are volunteers who help while not busy at work or school.<br />
<br />
'''Why is ArchiveTeam crawling my site / disrespecting robots.txt? '''<br />
<br />
A detailed manifesto is located at [[Robots.txt]]. Please read it first and contact us through [[IRC]] (described in a answer below) before making harsh actions. [[Posterous/Story|We cooperate!]]<br />
<br />
If you notice the crawler's user-agent is "ArchiveBot", please see [[ArchiveBot]].<br />
<br />
'''How should I go about backing things up?'''<br />
<br />
What would you like to back up? If you want to mirror/backup a website, the de facto tool is [[Wget]] (but there's lots more, see [[Software]]!). WARC files are highly recommended as they can be ingested by the Wayback Machine.<br />
<br />
If you want to back up your personal files, {{w|List of backup software}} is an extensive list of backup software. See [[Backup Tips]] as well!<br />
<br />
'''Where do all the saved files go?'''<br />
<br />
Files are ultimately uploaded to Internet Archive on the [https://archive.org/details/archiveteam Archive Team collection]. Archive Team relies on Internet Archive for storing the files.<br />
<br />
'''What are these WARC files in the Internet Archive? How do I extract files from a WARC file?'''<br />
<br />
[http://fileformats.archiveteam.org/wiki/WARC WARC files] are de facto medium of digital preservation of the web. These WARC files are ingested by the Wayback Machine. WARC files are not simple zip files; they're designed to record metadata.<br />
<br />
There is a growing number of tools that can manipulate WARC files in [[The WARC Ecosystem]].<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
= We Are Not The Internet Archive =<br />
<br />
'''How do I upload something to the Internet Archive (archive.org)?'''<br />
<br />
You can either use the HTML5/Flash uploader or use an [[Internet_Archive#Uploading_to_archive.org|alternative method]].<br />
<br />
'''Can someone remove or fix something on the Internet Archive?'''<br />
<br />
Possibly. Keep in mind that majority of Archive Team are volunteers who are not affiliated with the Internet Archive and requests should go to staff instead. If strenuous circumstances arise, please see the question about contacting Archive Team below.<br />
<br />
'''How do I get the original file on the Wayback Machine?'''<br />
<br />
Add <code>id_</code> after the date in the URL. For example: <code>web.archive.org/web/20090119040418'''id_'''/<nowiki>http://www.archiveteam.org/index.php?title=Main_Page</nowiki></code><br />
<br />
'''How do I search the contents of the Wayback Machine?'''<br />
<br />
You can't unfortunately. However, the Internet Archive provides API access (designed for programmers and power users) to the Wayback Machine and to the CDX database.<br />
<br />
'''Why does the Wayback Machine follow robots.txt in a way that I don't like?'''<br />
<br />
Because it makes the lawyers go away. We're not the Internet Archive. Don't ask us, ask them.<br />
<br />
'''Can I upload $COPYRIGHTED_THING to the Internet Archive?'''<br />
<br />
Although the Internet Archive ''prefers'' freely-redistributable content, they also accept still-in-copyright things. If there's a valid complaint/DMCA takedown request, they'll simply make the item private, but they will '''not''' delete the data. Having said that, the Internet Archive is not [[The Pirate Bay]], so please don't treat it as such.<br />
<br />
'''Can I save big files with the Save Now feature?'''<br />
<br />
No, files larger than 200 MB will not be saved correctly.<br />
<br />
= How redundant is Archiveteam? =<br />
<br />
'''Is there a backup of the data on the archiveteam.org website? If so where can I download it?'''<br />
<br />
Two sets of backups of this wiki are available. There are backups done by the hosting provider (several, going back days and weeks as well as hours), which use the storage capability of the shared hosting to keep them automatically (no tape or disk backups being done as most people would think of them). There are similarly copies of the database kept going back months.<br />
<br />
<s>Additionally, an XML dump of the Mediawiki database (which can be imported into any MediaWiki software) is accessible at [http://www.archiveteam.org/dumps http://www.archiveteam.org/dumps]. New backups are currently pushed out once a week (and will be increased if changes on the site require it). All images are also wrapped into a images.tar.gz file.</s><ref>Doesn't seem to be working any more.</ref> Our entire images directory is available at [http://www.archiveteam.org/images http://www.archiveteam.org/images].<br />
<br />
Dumps of the ArchiveTeam Wiki are generated with [[WikiTeam]] tools and uploaded to the [[Internet Archive]] quite regularly. You can find them [https://archive.org/details/wiki-archiveteam.org here] and newer ones [https://archive.org/details/wiki-archiveteamorg here].<br />
<br />
'''Is there a mirror of the archiveteam.org website?'''<br />
<br />
There are no mirrors we know of, although we encourage our more paranoid or protective readers to maintain one based on the above dumps.<br />
<br />
= Who are y'all? =<br />
<br />
'''Does Archive Team have any social media accounts?'''<br />
<br />
Follow us on Twitter: [https://twitter.com/archiveteam @archiveteam], [https://twitter.com/at_warrior @at_warrior], [http://www.reddit.com/r/archiveteam /r/archiveteam] and [https://www.facebook.com/ArchiveTeam like us on Facebook]. These accounts are run by selected volunteers and may not be monitored for questions.<br />
<br />
(There is a [https://groups.google.com/forum/?fromgroups=#!forum/archive-team Google ArchiveTeam group] but it is not used.)<br />
<br />
'''Who's the administrator?'''<br />
<br />
For a list of administrators, see [[Tracker#People]] which has a table at the bottom of the page.<br />
<br />
'''I went through the wiki and I still have a question! How do I contact the Archive Team?'''<br />
<br />
Join us on [[IRC|IRC! (All channels and info listed here.)]] For general inquiries, visit [irc://irc.efnet.org/archiveteam #archiveteam] on EFNet. Email can be sent to [mailto:archiveteam@archiveteam.org archiveteam@archiveteam.org].<br />
<br />
This wiki is ''not'' monitored for questions.<br />
<br />
If a FAQ should appear here, please add it.<br />
<br />
= Notes =<br />
<references/><br />
<br />
<br />
{{Navigation pager<br />
| previous = Recommended Reading<br />
}}<br />
{{Navigation box}}</div>Megalanya0https://wiki.archiveteam.org/index.php?title=Wget_with_Lua_hooks&diff=27361Wget with Lua hooks2017-01-16T15:45:09Z<p>Megalanya0: MOTHERFUCKER ! ! !</p>
<hr />
<div>* New idea: add Lua scripting to wget.<br />
<br />
* Work in progress: https://github.com/alard/wget-lua/tree/lua <br />
** The Lua scripting is patched on the "lua" branch. You can use the [https://github.com/alard/wget-lua/compare/lua#files_bucket compare branch feature] on GitHub to see the differences.<br />
** Alternative location: https://github.com/ArchiveTeam/wget-lua/tree/lua.<br />
<!-- If you get errors about 'lua_open' while compiling, try applying [http://paste.archivingyoursh.it/raw/manavagose this] patch. --><br />
** If you get errors about 'wget.pod' while compiling, try applying [http://paste.archivingyoursh.it/raw/dekasuroda this] patch.<br />
<br />
* Documentation: https://github.com/alard/wget-lua/wiki/Wget-with-Lua-hooks<br />
<br />
Example usage:<br />
<pre><br />
wget http://www.archiveteam.org/ -r --lua-script=lua-example/print_parameters.lua<br />
</pre><br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==</div>Megalanya0https://wiki.archiveteam.org/index.php?title=Template:Internet_history&diff=27360Template:Internet history2017-01-16T15:44:54Z<p>Megalanya0: MOTHERFUCKER ! ! !</p>
<hr />
<div>== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==</div>Megalanya0https://wiki.archiveteam.org/index.php?title=Nolblog.hu&diff=27359Nolblog.hu2017-01-16T15:44:46Z<p>Megalanya0: MOTHERFUCKER ! ! !</p>
<hr />
<div>{{Infobox project<br />
| title = nolblog.hu<br />
| image = nolblog_screenshot.png<br />
| project_status = {{closed}}<br />
| archiving_status = {{saved}}<br />
| URL = {{URL|http://nolblog.hu}}<br />
}}<br />
<br />
'''Nolblog.hu''' was a blogging platform, primarily for readers of the leftist Hungarian newspaper [https://en.wikipedia.org/wiki/N%C3%A9pszabads%C3%A1g Népszabadság]. As of January 2016, it hosted more than 3000 blogs.<br />
<br />
On December 31, 2015 Mworks Print Zrt. announced that nolblog.hu would be discontinued on January 31, 2016.<br />
<br />
The site got cut off at 00:28 CET on February 1, 2016, but came back online about 8 hours later, and was finally shut down on February 2 in the afternoon.<br />
<br />
The above applies for the *.nolblog.hu subdomains, hosting the individual blogs. The [http://nol.hu/nol_blog nol.hu/nol_blog] main page (that is and has been the destination of the redirect from nolblog.hu) has been restructured, and now shows entries of the most popular users of the old Nolblog, so they could keep on blogging. For the other users, however, the line has been cut.<br />
<br />
Mediaworks did not tell the reason for the shutdown. The site was not used by many people, was not really updated in the last years, but there were often harsh conflicts between the users, causing need for the administrators to intervene. Also, there were no ads on the site, so running it was probably lossy.<br />
<br />
Users consisted mainly of elderly people, forming a basically good community.<br />
<br />
The site provided exporting. The exported blogs may be imported to other blogging platforms.<br />
<br />
'''MOTHERFUCKER ! ! !'''<br />
<br />
'''MOTHERFUCKER ! ! !'''<br />
<br />
'''MOTHERFUCKER ! ! !'''<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== Source ==<br />
http://hirkut.hu/index.php/vege-februarban-megszunik-a-nolblog/<br />
<br />
<br />
{{Hungarian websites}}<br />
{{Navigation box}}</div>Megalanya0https://wiki.archiveteam.org/index.php?title=IRC/Old&diff=27358IRC/Old2017-01-16T15:44:37Z<p>Megalanya0: MOTHERFUCKER ! ! !</p>
<hr />
<div>== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==</div>Megalanya0https://wiki.archiveteam.org/index.php?title=Answers.com&diff=27357Answers.com2017-01-16T15:44:26Z<p>Megalanya0: MOTHERFUCKER ! ! !</p>
<hr />
<div>== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==</div>Megalanya0https://wiki.archiveteam.org/index.php?title=Stack_Exchange&diff=27356Stack Exchange2017-01-16T15:44:17Z<p>Megalanya0: MOTHERFUCKER ! ! !</p>
<hr />
<div>== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==</div>Megalanya0https://wiki.archiveteam.org/index.php?title=MyVIP&diff=27355MyVIP2017-01-16T15:44:08Z<p>Megalanya0: MOTHERFUCKER ! ! !</p>
<hr />
<div>{{DISPLAYTITLE:myVIP}}<br />
{{Infobox project<br />
| title = myVIP<br />
| URL = {{url|http://myvip.com}}<br />
| description = Second most popular Hungarian social network<br />
| logo = myvip_logo.png<br />
| image = myvip_screenshot.png<br />
| project_status = {{Endangered}}<br />
| archiving_status = {{saved}}<br />
| irc = byevip<br />
| tracker = [http://tracker.archiveteam.org/myvip myvip]<br />
| source = [https://github.com/ArchiveTeam/myvip-grab myvip-grab]<br />
}}<br />
'''myVIP''' is an earlier popular Hungarian social network, it started in 2006 as the second of its kind. Although [[iWiW]] was very popular then, myVIP also could collect a lot of users in no time. However, [[Facebook]] took over the social network market in Hungary too, and myVIP also got deserted. And, considering that iWiW shut down with a lot more visitors than myVIP in 2014, it's a wonder that myVIP is still up.<br />
<br />
'''On 2016-07-29, someone claiming to be a myVIP employee informed us that the site may close in a month.'''<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== Status ==<br />
=== The site ===<br />
The site seems to be generally stable. Some bugs appear, but staff is – surprisingly – responsive, and usually fix the bugs.<br />
<br />
Since 2015-12-16, according to a support mail, "There is currently ongoing maintenance on our site, therefore unfortunately not all functions are available, also the login may show problems sometimes." The – fortunately – only visible problem is that some comments don't appear properly under the images. As of 2016-01-24, this hasn't been fixed yet.<br />
<br />
There is a recurring problem that sometimes users' friend listing doesn't work, shows emply pages of list instead.<br />
<br />
'''UPDATE''': Since February 2016, more often and more kinds of problems occur, e.g. certain kinds of pages don't load or return 500 errors, the whole site goes down and returns 502 for days etc. Fortunately, these either don't affect our archiving efforts, or go away after some time – but all these are, at least, worrysome.<br />
<br />
'''UPDATE''': It turned out that profiles starting from ID 4,500,000 are currently only partially archivable, the profile pages of existing users return error 500. It means that profile pages of users registered after 2013-03-30 12:21 CET are – currently – impossible to archive (images and friends lists are usually saveable, though). (That is approx. 90,000 users, but usually with little content.)<br />
<br />
==== Shutdown? ====<br />
<div style="width:100%"><pre><br />
júl 29 10:25:55 <lazlo> hi guys<br />
júl 29 10:26:07 <lazlo> i see you save myvip<br />
júl 29 10:26:22 <lazlo> i apreciate it<br />
júl 29 10:26:38 <lazlo> if you not finished i wanna warn you<br />
júl 29 10:26:57 <lazlo> i heared something bacause i work there<br />
júl 29 10:27:25 <lazlo> not 100 % but managament want to kill myvip in 1 month<br />
júl 29 10:27:51 <lazlo> boss like "nobody uses myvip, why run it?<br />
júl 29 10:32:55 <lazlo> good luck an thank you<br />
júl 29 10:32:58 <lazlo> !<br />
</pre></div><br />
<br />
Unfortunately the user left before we could get further information from him.<br />
<br />
=== The operators ===<br />
<br />
According to [http://myvip.com/impresszum.html myvip.com/impresszum.html], the site is operated by Epicenter Market Hungary Kft. (on behalf of UK-based but [http://web.archive.org/web/20150926105504/http://epicentermarket.co.uk/ apparently] Hungarian company Epicenter Market Limited) and World Web Data Kft. However, it seems that myVIP was passed from Epicenter Market Hungary Kft. to Marco Polo Magyarország Kft. This is suspected for three reasons:<br />
* the impresszum has been updated so on other websites formerly run by Epicenter,<br />
* [http://marcopolo.hu Marco Polo] claims it has those websites, including myVIP, in its portfolio since 2015-05-12,<br />
* [http://epicenter.hu epicenter.hu] is down since like that time ([http://web.archive.org/web/20150318020735/http://epicenter.hu/ last] Wayback capture).<br />
<br />
[http://epicenter.hu Epicenter] made good profit, but probably beacuse its other, successful websites. Marco Polo hasn't sent in its first balance sheet yet (as it's a newly founded company).<br />
<br />
[http://wwdh.hu World Web Data], which is assigned to replying user requests (besides the built-in support system on [http://ugyfelkapu.myvip.com ugyfelkapu.myvip.com] whose recipients are unknown), is a company offering hosting, development and marketing services. It seems to be responsible for technical aspects of running myVIP and other <s>Epicenter</s> Marco Polo websites. In 2015, it changed its name and [http://sas.hu website] to SAS. It's been in a bad financial situation and it seems to become even worse.<br />
<br />
== Archiving ==<br />
*'''User range 1 – 108,000:''' {{Green|done}} by [[user:bzc6p]] (archives: [http://archive.org/details/myvip_com_profiles_0000001_0100000 here] and [http://archive.org/details/myvip_com_profiles_0100001_0108000 here])<br />
*'''User range 108,001 – 4,627,000:''' {{green|done}} (ArchiveTeam [[DPoS]])<br />
*'''Clubs:''' {{green|done}} by [[user:bzc6p]] ([https://archive.org/details/myvip_com_clubs_0000001_1117844 archive])<br />
*'''User range 4,627,001 – :''' {{colour|#D420D0|Upcoming}} by [[user:bzc6p]]<br />
<br />
Also, an index of users {{green|has been}} built by [[user:bzc6p]], although in case of users above ID 4,500,000 (~90,000 users) not all data fields could be included due to technical problems on myVIP's side.<br />
<br />
The index will be made available in some form when necessary, until that please use myVIP's search.<br />
<br />
'''The archive of myVIP is accessible through the Wayback Machine'''; if you know the user ID you can use a URL like http://web.archive.org/web/myvip.com/profile.php?uid=1146123.<br />
<br />
The bulk [[WARC]] files are also available in the [http://archive.org/details/archiveteam_myvip archiveteam_myvip] collection.<br />
<br />
== Sources ==<br />
*http://hu.wikipedia.org/wiki/MyVIP<br />
*http://hvg.hu/tudomany/20130906_general_media<br />
<br />
<br />
{{Hungarian websites}}<br />
{{Navigation box}}<br />
[[Category:Social networks]]</div>Megalanya0https://wiki.archiveteam.org/index.php?title=TechNet&diff=27354TechNet2017-01-16T15:43:57Z<p>Megalanya0: MOTHERFUCKER ! ! !</p>
<hr />
<div>== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==</div>Megalanya0https://wiki.archiveteam.org/index.php?title=Slidecast&diff=27353Slidecast2017-01-16T15:43:49Z<p>Megalanya0: MOTHERFUCKER ! ! !</p>
<hr />
<div>== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==</div>Megalanya0https://wiki.archiveteam.org/index.php?title=Ispygames&diff=27352Ispygames2017-01-16T15:43:40Z<p>Megalanya0: MOTHERFUCKER ! ! !</p>
<hr />
<div>{| width=300px style="border: 1px solid #aaa; background-color: #f9f9f9; color: black; margin: 0.5em 0 0.5em 1em; padding: 0.2em; font-size: 90%;clear: right; float: right;"<br />
|-<br />
| colspan=2 align=center | <big>'''{{{title|Gamespy, IGN, 1up, ugo}}}'''</big><br />
|-<br />
| colspan=2 align=center | [[File:{{{logo|Dummy.png}}}|100px|{{PAGENAME}} logo]]<br />
|-<br />
| colspan=2 align=center | [[File:{{{image|gamespy.jpg}}}|280px|{{{description|}}}]]<br/>{{{description|}}}<br />
|-<br />
| width=125px | '''URL''' || {{{URL|{{{url|http://www.gamespy.com & many others}}}}}}<br />
|-<br />
| width=125px | '''Project status''' || {{{project_status|{{Closing}}}}}<br />
|-<br />
| width=125px | '''Archiving status''' || {{partiallysaved}}<br />
|-<br />
| width=125px | '''Project source''' || {{{source|{{Unknown}}}}}<br />
|-<br />
| width=125px | '''Project tracker''' || {{{tracker|{{Unknown}}}}}<br />
|-<br />
| width=125px | '''IRC channel''' || <span class="plainlinks">[http://chat.efnet.org:9090/?nick=&channels=%23{{{irc|ispygames}}}&Login=Login #{{{irc|ispygames}}}]</span><br />
|}<noinclude><br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== The Problems ==<br />
<br />
* Once you start digging around these sites you find it to be a mess of inconsistent url schemes and content everywhere. <br />
* Some files are being hosted on MediaFire.<br />
* Based on tests the larger and older a site is the more that is missed by a wget crawl due to the url scheme. <br />
<br />
== What we know ==<br />
<br />
* We already have a list of almost all the domains involved<br />
* A clean list with dups and bad domains is already being process and will be posted here when complete.<br />
* Most of the sites are not that big, but a few are huge.<br />
<br />
== The plan ==<br />
<br />
* Save the sites and related content<br />
* Backup the twitter feeds for any associated accounts. [http://www.allmytweets.net/ All my tweets] just takes a username and returns the max tweets possible.<br />
<br />
<br />
== wget test command ==<br />
This if for the gamespy sites.<br />
<pre><br />
USER_AGENT="Mozilla/5.0 (Windows; U; MSIE 9.0; Windows NT 9.0; en-US)"<br />
SAVE_HOST="http://planetdoom.gamespy.com"<br />
WARC_NAME="warc_name"<br />
<br />
wget -e robots=off --mirror --page-requisites \ <br />
--waitretry 5 --timeout 60 --tries 5 --wait 2 \<br />
--warc-header "operator: Archive Team" --warc-cdx --warc-file="$WARC_NAME" \<br />
-U "$USER_AGENT" "$SAVE_HOST" \<br />
--span-hosts --domains=$SAVE_HOST,pcmedia.gamespy.com,pnmedia.gamespy.com,pspmedia.gamespy.com,oystatic.ignimgs.com<br />
</pre><br />
<br />
Try this for the ign, ugo sites.<br />
<br />
<pre><br />
USER_AGENT="Mozilla/5.0 (Windows; U; MSIE 9.0; Windows NT 9.0; en-US)"<br />
SAVE_HOST="http://ve3d.ign.com"<br />
WARC_NAME="warc_name"<br />
<br />
wget -e robots=off --mirror --page-requisites \ <br />
--waitretry 5 --timeout 60 --tries 5 --wait 2 \<br />
--warc-header "operator: Archive Team" --warc-cdx --warc-file="$WARC_NAME" \<br />
-U "$USER_AGENT" "$SAVE_HOST"<br />
</pre><br />
<br />
== IGN domains ==<br />
<br />
=== In progress ===<br />
* http://aavault.ign.com - grabbed, checking for completeness<br />
* http://autoassaultvault.ign.com/ - grabbed, checking for completeness<br />
* http://beacon.snowball.com - already closed<br />
* http://bombergirl.ign.com - closed contest page in flash<br />
* http://o.cubemedia.ign.com - dead end<br />
* http://actionunleashed.ign.com - grabbed, checking for completeness<br />
* http://dndvault.ign.com - grabbed, checking for completeness<br />
* http://vault.ign.com - grabbing<br />
* http://ac2vault.ign.com - Smiley - done<br />
* http://acvault.ign.com - Smiley - done<br />
* http://aion.ign.com - Smiley Grabbing<br />
* http://wowvault.ign.com - Smiley grabbing<br />
* http://atvault.ign.com - Smiley grabbing<br />
* http://alli.ign.com - Smiley done<br />
* http://antis.ign.com - Smiley grabbing<br />
* http://doa.ign.com - Smiley grabbing<br />
* http://aovault.ign.com - siliconvalleypark - grabbed<br />
* http://aevans.dev.m.au.ign.com - dead<br />
* http://aevans.dev.m.ca.ign.com - dead<br />
* http://aevans.dev.m.ie.ign.com - dead<br />
* http://aevans.dev.m.ign.com - dead<br />
* http://aevans.dev.m.uk.ign.com - dead<br />
* http://cdyi.dev.m.ign.com - dead<br />
* http://ve3d.ign.com - grabbed, checking for completeness<br />
* http://flashpoint.ign.com - siliconvalleypark - grabbed<br />
* http://beaterator.ign.com - siliconvalleypark - grabbed<br />
* http://aivlev.dev.m.au.ign.com - password protected dev site<br />
* http://aivlev.dev.m.ie.ign.com - password protected dev site<br />
* http://aivlev.dev.m.ign.com - password protected dev site<br />
* http://aivlev.dev.m.uk.ign.com - password protected dev site<br />
* http://apassey.dev.m.ign.com - password protected dev site<br />
* http://bestof.ign.com - Tracer grabbing<br />
* http://www.blockbuster.ign.com - already dead<br />
* http://kaneandlynch.ign.com/ - flash based<br />
* http://wishvault.ign.com - siliconvalleypark grabbing, Smiley done<br />
* http://witchervault.ign.com - siliconvalleypark, Smiley - grabbed, done<br />
* http://www.supersmashbros.ign.com - Smiley grabbing<br />
* http://www.championshipgamingseries.com - Smiley grabbing<br />
* http://code.ign.com - Smiley grabbing<br />
* http://mevault.ign.com - siliconvalleypark grabbing<br />
* http://beacon.ign.com - Smiley, done<br />
* http://blockbuster.ign.com - 404 not found<br />
* http://broadband.ign.com - Smiley<br />
* http://browserthemes.ign.com - Smiley<br />
* http://ces2009.ign.com - Smiley, all broken links<br />
* http://championshipgamingseries.ign.com - Smiley<br />
* http://championsonline.ign.com - Smiley<br />
* http://cohvault.ign.com - Smiley<br />
* http://comiccon.ign.com - dead<br />
* http://corp.ign.com - Smiley<br />
* http://covvault.ign.com - Smiley, redirects to cohvault<br />
* http://adtools.ign.com - blank<br />
* http://crossassault.ign.com - Smiley, done<br />
* http://design.ign.com - Smiley, done<br />
* http://dragonica.ign.com - Smiley, done<br />
* http://eq2vault.ign.com - Smiley, done with some timeouts<br />
* http://eqvault.ign.com - Smiley, finished with some timeouts<br />
* http://esports.ign.com - Smiley, done<br />
* http://evevault.ign.com - Smiley<br />
* http://feeds.ign.com - Smiley, done<br />
* http://ffvault.ign.com - Smiley, finished with some timeouts<br />
* http://findit.ign.com - Smiley, done<br />
* http://gamechanger.ign.com - Smiley, done<br />
* http://gamesites.ign.com - Smiley, done<br />
* http://gamestore.ign.com - Smiley, done<br />
* http://gamingworldrecord.ign.com - Smiley, done<br />
* http://gbartone.dev.m.au.ign.com - not found<br />
* http://gbartone.dev.m.ca.ign.com - not found<br />
* http://gbartone.dev.m.ie.ign.com - not found<br />
* http://gbartone.dev.m.ign.com - not found<br />
* http://gbartone.dev.m.uk.ign.com - not found<br />
* http://grandtheftautohood.ign.com -> grandtheftauto.ign.com<br />
* http://grandtheftauto.ign.com - Smiley, done<br />
* http://gtahood.ign.com -> grandtheftauto.ign.com<br />
* http://gta.ign.com - Smiley, done<br />
* http://www.blockbuster.ign.com - already dead<br />
* http://guitarhero3.ign.com - Smiley, done<br />
* http://gwvault.ign.com - Smiley, finished with some timeouts<br />
* http://halo.ign.com - Smiley, done<br />
* http://hls.gbartone.dev.m.ign.com -> hls.gbartone.dev.m.uk.ign.com, done, smiley<br />
* http://horizonsvault.ign.com - Smiley, done<br />
* http://ie.bestof.ign.com -> uk.bestof.ign.com (localized version of sites?<br />
* http://ie.top100.ign.com - Smiley, (localized version redirected to .uk.)<br />
* http://uk.bestof.ign.com - Smiley grabbing<br />
* http://uk.corp.ign.com - Smiley grabbing<br />
* http://uk.retro.ign.com - Smiley grabbing -> broken by + in url<br />
* http://uk.sports.ign.com - Smiley grabbing -> uk.ign.com<br />
* http://uk.top100.ign.com - Done<br />
* http://iplstore.ign.com - dead, uploaded, Smiley<br />
* http://jloijens.dev.m.ign.com - Smiley, redirected to .uk. version, done<br />
* http://jtai.dev.m.ign.com - Smiley, dead<br />
* http://kaneandlynch.ign.com - Smiley, done<br />
* http://kjaniak.dev.m.ign.com - blocked by auth, Smiley<br />
* http://l2vault.ign.com - Smiley<br />
* http://labs.ign.com - Smiley, done <br />
* http://lanoire.ign.com - Smiley, done<br />
* http://links.em.ign.com - Smiley, done<br />
* http://littlebigplanet.ign.com - Smiley, done<br />
* http://live.ign.com - Smiley, done<br />
* http://lotrovault.ign.com - Smiley<br />
* http://mag.ign.com - Smiley, done<br />
* http://mail2.ign.com - Smiley, done<br />
* http://mcraft.ign.com - Smiley, done<br />
* http://memoviedia.ign.com - Smiley, dead<br />
* http://mevault.ign.com - Smiley, done<br />
* http://m.ign.com -> m.uk.ign.com, Smiley, done<br />
* http://minecraft.ign.com - Smiley, done<br />
* http://uk.video.ign.com - Smiley grabbing -> broken by redirect<br />
* http://niboppub.ign.com - Smiley, done<br />
* http://nwvault.ign.com - Smiley, done<br />
* http://m.uk.ign.com - Smiley<br />
* http://musichub.ign.com - Smiley, done<br />
* http://mxovault.ign.com - Smiley, done with some timeouts<br />
* http://nchandra.dev.m.uk.ign.com - Smiley, done<br />
* http://o.rpgvaultarchive.ign.com - Smiley, done<br />
* http://potbsvault.ign.com - Smiley, done with some timeouts<br />
* http://rotavault.ign.com - Smiley<br />
* http://rpgvaultarchive.ign.com - Smiley<br />
* http://ryzomvault.ign.com - Smiley, done, some timeouts<br />
* http://sbvault.ign.com - Smiley, dead, done<br />
* http://starcraft2.ign.com - Smiley done<br />
* http://opt-out.emailpreferences.ign.com -> mail.ign.com, Smiley, done<br />
* http://overlord.ign.com - Smiley, done<br />
* http://pawong.dev.www.ign.com - Smiley, auth failed<br />
* http://planetelderscrolls.ign.com - Smiley, done<br />
* http://play.ign.com -> gamespyarcade.com Smiley, done<br />
* http://gamespyarcade.com - Smiley, done<br />
* http://podcast.ign.com -> ign.com/index/podcasts.html, Smiley, done<br />
* http://podcasts.ign.com -> ign.com/index/podcasts.html, Smiley, done<br />
* http://primeblog.ign.com - Smiley, done<br />
* http://promotions.ign.com - Smiley, done<br />
* http://promotools.ign.com - Smiley<br />
* http://publish.ign.com - Smiley, done<br />
* http://rift.ign.com - Smiley, done<br />
* http://rmcadams.dev.m.ca.ign.com - Smiley, auth failed<br />
* http://rmcadams.dev.m.ign.com - Smiley, auth failed<br />
* http://rmcadams.dev.www.ign.com - Smiley, auth failed<br />
* http://rsullivan.dev.m.ca.ign.com - Smiley, auth failed<br />
* http://rsullivan.dev.m.ign.com - Smiley, auth failed<br />
* http://rsullivan.dev.www.ign.com - Smiley, auth failed<br />
* http://share.affiliation.com - Smiley, dead<br />
* http://share.ign.com - Smiley, done<br />
* http://shootmania.ign.com -> ign.com/ipl/shootmania, Smiley<br />
* http://s.insiderdownloads.ign.com -> http://login.ign.com/prime/landing - Need IGN login subscriber, Smiley<br />
* http://skate2.ign.com - Smiley, done<br />
* http://smashbros.ign.com -> supersmashbros.ign.com, Smiley, done<br />
* http://supersmashbros.ign.com - Breaks wget<br />
* http://smcnabb.dev.www.ign.com - Smiley, auth failed<br />
* http://sovault.ign.com -> vault.ign.com, Smiley, done<br />
* http://strangleholdcentral.ign.com - Smiley, done<br />
* http://strategyvault.ign.com -> vault.ign.com, Smiley, done<br />
* http://sts.ign.com - Smiley, done<br />
* http://studio.ign.com - Smiley, done<br />
* http://swgvault.ign.com - Smiley<br />
* http://swvault.ign.com - Smiley, done with errors<br />
* http://tabularasavault.ign.com - Smiley, done<br />
* http://tdu.ign.com - Smiley, done, broken<br />
* http://tford.dev.m.ign.com - not found, Smiley<br />
* http://tford.dev.www.ign.com - not found, Smiley<br />
* http://thhe.ign.com - Smiley, done<br />
* http://ticket.ign.com - Smiley, done<br />
* http://tickets.ign.com - Smiley, done<br />
* http://titanquestvault.ign.com - Smiley, done<br />
* http://tjohnson.dev.uk.ign.com - 404, Smiley<br />
* http://touch.ign.com - Smiley, done<br />
* http://tqvault.ign.com -> titanquestvault.ign.com, Smiley, done<br />
* http://trvault.ign.com -> tabularasavault.ign.com, Smiley, done<br />
* http://twoworldsvault.ign.com - Smiley<br />
* http://uovault.ign.com -> http://vault.ign.com/uovault.html, Smiley, done<br />
* http://vanguardvault.ign.com - Smiley, done<br />
* http://warhammervault.ign.com - Smiley, done<br />
* http://wikihub.stg.www.ign.com - 404<br />
* http://wiki.stg.www.ign.com - 404<br />
* http://wishvault.ign.com - Smiley<br />
* http://witchervault.ign.com - Smiley<br />
* http://www.antis.ign.com -> http://entertainment.ign.com/antis.html <br />
* http://www.championshipgamingseries.com - Smiley<br />
* http://www.ipl.ign.com -> http://www.ign.com/ipl/<br />
* http://www.kaneandlynch.ign.com -> kaneandlynch.ign.com, Smiley, done<br />
* http://www.mevault.ign.com - Smiley, done<br />
* http://xboxlive.ign.com -> uk.ign.com/xbox-live<br />
<br />
=== Ready to grab ===<br />
* http://au.bestof.ign.com<br />
* http://au.retro.ign.com<br />
* http://au.sports.ign.com<br />
* http://wiki.stg.www.ign.com<br />
<br />
<br />
=== untested ===<br />
* http://au.microsites.ign.com<br />
* http://au.top100.ign.com<br />
* http://au.video.ign.com<br />
* http://crose.dev.m.ca.ign.com<br />
* http://crose.dev.m.ign.com<br />
* http://m.au.ign.com<br />
* http://m.ca.ign.com<br />
* http://m.ie.ign.com<br />
* http://mobile.ign.com<br />
* http://nchandra.dev.m.au.ign.com<br />
* http://nchandra.dev.m.ie.ign.com<br />
* http://nchandra.dev.m.ign.com<br />
* http://nchandra.dev.www.ign.com<br />
* http://o.mobile.ign.com<br />
* http://open.em.ign.com<br />
* http://speedtv.ign.com<br />
* http://sslvpn.ign.com<br />
* http://staging-api.ign.com<br />
* http://store.ign.com<br />
* http://tjohnson.dev.au.ign.com<br />
* http://tjohnson.dev.ca.ign.com<br />
* http://tjohnson.dev.ie.ign.com<br />
* http://tjohnson.dev.www.ign.com<br />
* http://top100.ign.com<br />
* http://v3-api.stg.ie.ign.com<br />
* http://v3-api.stg.m.au.ign.com<br />
* http://v3-api.stg.m.ie.ign.com<br />
* http://v3-api.stg.www.ign.com<br />
* http://vgu.stg.www.ign.com<br />
* http://viashoka.dev.m.ign.com<br />
* http://viashoka.dev.www.ign.com<br />
* http://videopatch1.atio.dev.uk.ign.com<br />
* http://videoplayer.phantom.stg.www.ign.com<br />
* http://video.stg.www.ign.com<br />
* http://vnboards.ign.com<br />
<br />
<br />
=== These might be asset only hosting sites ===<br />
* http://au.media.ign.com<br />
* http://carsmedia.ign.com<br />
* http://carsmovies.ign.com<br />
* http://codesmedia.ign.com<br />
* http://cubemedia.ign.com<br />
* http://dcmedia.ign.com<br />
* http://entertainmentmedia.ign.com<br />
* http://faqsmedia.ign.com<br />
* http://faqsmovies.ign.com<br />
* http://ffmedia.ign.com <br />
* http://formenmedia.ign.com<br />
* http://guidesmedia.ign.com<br />
* http://ie.media.ign.com<br />
* http://insdermedia.ign.com <br />
* http://insiderdownloads.ign.com<br />
* http://insidermedia.ign.com<br />
* http://macmedia.ign.com<br />
* http://macmovies.ign.com<br />
* http://media.ign.com<br />
* http://moviemedia.ign.com<br />
* http://moviesmovies.ign.com<br />
* http://o.faqsmedia.ign.com<br />
* http://o.faqsmovies.ign.com<br />
* http://o.ffmovies.ign.com<br />
* http://o.guidesmedia.ign.com<br />
* http://o.insidermedia.ign.com<br />
* http://o.media.ign.com<br />
* http://o.moviesmovies.ign.com<br />
* http://o.pcmedia.ign.com<br />
* http://o.pocketmedia.ign.com<br />
* http://o.ps2media.ign.com<br />
* http://o.xboxmedia.ign.com<br />
* http://xboxlivemedia.ign.com<br />
* http://xboxlivemovies.ign.com<br />
* http://xboxmedia.ign.com<br />
* http://xboxmovies.ign.com<br />
* http://pcmedia.ign.com<br />
* http://pocketmedia.ign.com<br />
* http://ps2media.ign.com<br />
* http://psxmedia.ign.com <br />
* http://psxmovies.ign.com <br />
* http://scifimedia.ign.com<br />
* http://uk.media.ign.com<br />
* http://vaultmedia.ign.com<br />
* http://wiremedia.ign.com<br />
* http://wiremovies.ign.com<br />
* http://c.ffmovies.ign.com<br />
* http://s.faqsmovies.ign.com<br />
* http://s.ffmovies.ign.com<br />
* http://s.moviesmovies.ign.com<br />
* http://s.psxmovies.ign.com<br />
* http://s.xboxmovies.ign.com<br />
<br />
=== Redirects ===<br />
<br />
* http://cars.ign.com -> http://www.ign.com/<br />
* http://guides.ign.com -> http://www.ign.com/wikis<br />
* http://answers.ign.com -> http://www.ign.com/boards/<br />
* http://ve3dboards.ign.com -> http://www.ign.com/boards/<br />
* http://ffmovies.ign.com -> http://ffmedia.ign.com<br />
* http://ddovault.ign.com -> http://dndvault.ign.com/<br />
* http://bigworldvault.ign.com -> http://vault.ign.com<br />
* http://911.ign.com -> http://tickets.ign-inc.com/<br />
* http://bestofe3.ign.com -> http://games.ign.com/bestofe3.html<br />
* http://dsi.ign.com -> http://ds.ign.com/dsi/<br />
* http://www.ipl.ign.com -> http://www.ign.com/ipl/<br />
* http://xboxlive.ign.com -> http://www.ign.com/xbox-live<br />
* http://download.ign.com - redirect to fileplanet<br />
* http://downloads.ign.com - redirect to fileplanet<br />
* http://dsvault.ign.com - redirect to planetdungeonsiege.com<br />
* http://emailpreferences.ign.com - redirect to mail.ign.com<br />
* http://guidesarchive.ign.com -> http://uk.ign.com/wikis<br />
* http://hjvault.ign.com - redirects to vault.ign.com<br />
* http://ipl.ign.com -> ign.com/ipl<br />
* http://mac.ign.com -> http://uk.ign.com/games/reviews?platformSlug=mac<br />
* http://o.guidesarchive.ign.com -> uk.ign.com/wikis<br />
* http://starcraft.ign.com - Smiley done - redirect to starcraft2.ign.com<br />
<br />
== Gamespy Domains ==<br />
<br />
=== Ready to grab ===<br />
* http://sslvpn.gamespy.com<br />
<br />
=== In Progress ===<br />
<br />
* http://lanoirepc.d2gstore.gamespy.com - Smiley, done<br />
* http://gamespyarcade.com - Smiley, done<br />
* http://planetthemovies.gamespy.com - Smiley, done<br />
* http://planetelderscrolls.gamespy.com - Smiley<br />
* http://planetcnc.gamespy.com - grabbed, checking for completeness<br />
* http://planetthesims.gamespy.com - grabbed, checking for completeness<br />
* http://planetfrontlines.gamespy.com - grabbed, checking for completeness<br />
* http://planetcivilization.gamespy.com - grabbed, checking for completeness<br />
* http://planethalflife.gamespy.com - grabbed, checking for completeness<br />
* http://planettransformers.gamespy.com - grabbed, checking for completeness<br />
* http://planetcoh.gamespy.com - grabbed, checking for completeness<br />
* http://planetbattlefield.gamespy.com - grabbed, checking for completeness<br />
* http://planetresidentevil.gamespy.com - grabbed, checking for completeness<br />
* http://planetxmen.gamespy.com - grabbed, checking for completeness<br />
* http://planetquake.gamespy.com - grabbed, checking for completeness, grabbing again.<br />
* http://planetgrandtheftauto.gamespy.com - grabbed, checking for completeness<br />
* http://planettonyhawk.gamespy.com - grabbed, checking for completeness<br />
* http://planetunreal.gamespy.com - grabbed, checking for completeness<br />
* http://planetfallout.gamespy.com - grabbed, checking for completeness<br />
* http://planetageofempires.gamespy.com - grabbed, checking for completeness<br />
* http://planetgearsofwar.gamespy.com - grabbed, checking for completeness<br />
* http://planetcallofduty.gamespy.com - grabbed, checking for completeness<br />
* http://classicgaming.gamespy.com - grabbed, checking for completeness<br />
* http://planetdoom.gamespy.com - grabbed, checking for completeness<br />
* http://planetwwe.gamespy.com - grabbed, checking for completeness<br />
* http://pc.gamespy.com - grabbed, checking for completeness<br />
* http://psp.gamespy.com - grabbed, checking for completeness<br />
* http://ds.gamespy.com - grabbed, checking for completeness<br />
* http://xbox360.gamespy.com - grabbed, checking for completeness<br />
* http://planetfarcry.gamespy.com - grabbed, checking for completeness<br />
* http://planetcrysis.gamespy.com - grabbing omf_<br />
* http://planetmedalofhonor.gamespy.com - grabbing omf_<br />
* http://xbox.gamespy.com - grabbing Smiley<br />
* http://norad.gamespy.com - grabbing Smiley<br />
* http://bf2142portalservices.gamespy.com - grabbing Smiley<br />
* http://arena.gamespy.com - grabbing Smiley<br />
* http://www.gamespy.com - grabbing Smiley<br />
* http://bugsubmit.gamespy.com - grabbing Smiley<br />
* http://ps3.gamespy.com - Smiley, done with 503's at the end.<br />
<br />
=== Redirects ===<br />
<br />
* http://forumplanet.gamespy.com -> ign.com/boards<br />
* http://forums.gamespy.com -> ign.com/boards/categories/gamespy<br />
* http://planetdeusex.gamespy.com -> gamespy.com (actual site's at planetdeusex.com)<br />
* http://planetelderscrolls.gamespy.com -> planetelderscrolls.ign.com<br />
<br />
== 1up.com ==<br />
On 2016-05-24, http://www.1up.com has been thrown into [[ArchiveBot]] with job ident <tt>35fcc4zofjl5kg52fkbcskgus</tt>.<br />
<br />
{{Navigation box}}</div>Megalanya0https://wiki.archiveteam.org/index.php?title=BetaArchive&diff=27351BetaArchive2017-01-16T15:43:30Z<p>Megalanya0: MOTHERFUCKER ! ! !</p>
<hr />
<div>{{Infobox project<br />
| title = BetaArchive<br />
| logo = BetaArchiveLogo.png<br />
| image = BetaArchiveHomepage.png<br />
| URL = http://betaarchive.com<br />
| project_status = {{online}}<br />
| archiving_status = {{notsaved}}<br />
}}<br />
'''BetaArchive''' is a site that hosts over 24TB of betas and abandonware. It was formed after [[OSBetaArchive]] shut down.<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
==Forums==<br />
[[phpBB]] based forums for the site. In order to access the Servers Discussion forum, you need to be in the FTP Access Group, and to access the Donators Discussion Forum, you (obviously) need to be in the Donators group.<br />
<br />
==Wiki==<br />
Uses [[WikiTeam|MediaWiki]]. Contains information on betas and abandonware. <br />
<br />
==Image Uploader==<br />
Meant for uploading images to use on the forum. BetaArchive provides an archive of all the images uploaded to it [http://www.betaarchive.com/imageupload/allimages.htm here].<br />
<br />
==Screenshot Gallery==<br />
Contains screenshots of betas and abandonware.<br />
<br />
{{Navigation box}}</div>Megalanya0https://wiki.archiveteam.org/index.php?title=Vol%C3%A1n&diff=27350Volán2017-01-16T15:43:22Z<p>Megalanya0: MOTHERFUCKER ! ! !</p>
<hr />
<div>== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==</div>Megalanya0https://wiki.archiveteam.org/index.php?title=ArchiveTeam_Warrior&diff=27349ArchiveTeam Warrior2017-01-16T15:43:10Z<p>Megalanya0: MOTHERFUCKER ! ! !</p>
<hr />
<div>== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== Basic usage ==<br />
<br />
The warrior runs on Windows, OS X and Linux using a virtual machine. You'll need one of:<br />
<br />
* [https://www.virtualbox.org/ VirtualBox] (recommended)<br />
* [https://www.vmware.com/products/player/ VMware workstation/player] (free-gratis for personal use)<br />
* [[#Alternative virtual machines|See below for alternative virtual machines]]<br />
<br />
<br />
=== Quick start instructions for VirtualBox ===<br />
<br />
# Download the [http://archive.org/download/archiveteam-warrior/archiveteam-warrior-v2-20121008.ova appliance] (174MB).<br />
# Launch VirtualBox<br />
# In VirtualBox, click File > Import Appliance and open the file.<br />
# Start the virtual machine.<br />
#* It will fetch the latest updates and will eventually tell you to start your web browser.<br />
# Using your regular web browser, visit http://localhost:8001/<br />
# On the left, click "Your settings".<br />
# Choose a username - we'll show your progress on the [[tracker|leaderboard]].<br />
# On the left, click "Available projects" tab and pick a project to work on.<br />
#* Even better: select "ArchiveTeam's Choice" to let your warrior work on the most urgent project.<br />
<br />
<br />
=== Start instructions for VMWare Player ===<br />
<br />
# Download the [http://archive.org/download/archiveteam-warrior/archiveteam-warrior-v2-20121008.ova appliance] (174MB).<br />
# Launch VMWare Player<br />
# In Player on the right, click "Open Virtual Machine", open the file and import the virtual machine.<br />
# Select the virtual machine and click "Edit virtual machine settings".<br />
#* Select "Hard Disk 2 (IDE)" > "Advanced..." and change it to "IDE 1:0"<br />
#* Select Network Adapter and set it to "Bridged: Connected directly to the physical network"<br />
# Start the virtual machine.<br />
#* It will fetch the latest updates and will eventually tell you to start your web browser.<br />
# Using your regular web browser, visit the address that is shown on the bottom (e.g. http://192.168.0.100:8001/)<br />
# On the left, click "Your settings".<br />
# Choose a username - we'll show your progress on the [[tracker|leaderboard]].<br />
# On the left, click "Available projects" tab and pick a project to work on.<br />
#* Even better: select "ArchiveTeam's Choice" to let your warrior work on the most urgent project.<br />
<br />
<br />
__TOC__<br />
<br />
== Alternative virtual machines ==<br />
<br />
Thanks to user-effort, there are alternatives:<br />
<br />
* [https://www.docker.io/ Docker] (Linux)<br />
** ([https://github.com/ArchiveTeam/warrior-dockerfile modified dockerfile])<br />
** ([https://hub.docker.com/r/infrequent/at-as-dockerfile modified dockerfile - for manual script execution])<br />
<br />
* [https://www.microsoft.com/en-us/server-cloud/solutions/virtualization.aspx Hyper-V] (Windows 8 Professional)<br />
** ([http://jonimoose.net/2013/archiveteam-warrior-on-hyper-v/ Hyper-V virtual machine])<br />
<br />
Please note that these alternatives are not in widespread use by our warriors, so we may not be able to help with either issues or advanced usage.<br />
<br />
==Warrior FAQ==<br />
<br />
=== Can I use whatever internet access for the warrior? ===<br />
<br />
No. We need "clean" connections. Please ensure the following:<br />
<br />
* No OpenDNS. No ISP DNS that redirects to a search page. Use non-captive DNS servers.<br />
* No ISP connections that inject advertisements into web pages.<br />
* No proxies. Proxies can return bad data. The original HTTP headers and IP address is needed for the WARC file.<br />
* No content-filtering firewalls.<br />
* No censorship. If you believe your country implements censorship, do not run a warrior. <br />
* No Tor. The server may return an error page instead of content if they ban exit nodes.<br />
* No free wifi cafe. Archiving your cafe's wifi service agreement repeatedly is not helpful.<br />
* We prefer connections from many public IP addresses if possible. (For example, if your apartment building uses a single IP address, we don't want your apartment banned.)<br />
<br />
=== Why am I seeing a message that no item was received? ===<br />
<br />
It means that there is no work available. This happens for several reasons:<br />
<br />
* There project has just finished and someone is inspecting the work done. If a problem is discovered, items may be re-queued and more work is available.<br />
* You have checked out / claimed too many items. Reduce your concurrency and let others do some of the work too.<br />
* In a rare case, you have been banned by a tracker administrator because you were requesting too much work, you were tampering with the scripts, a malfunction has occurred, or your internet connection is "unclean".<br />
<br />
=== Why am I seeing a message about rate limiting? ===<br />
<br />
Keep in mind that although downloading the internet for digital preservation and fun are the primary goals of all Archive Team activities, serious stress on the target's server may occur. The rate limit is imposed by a [[Tracker#People|tracker administrator]] and should not be subverted.<br />
<br />
(In other words, we don't want to DDoS the servers.)<br />
<br />
=== Why am I seeing a message about code being out of date? ===<br />
<br />
The warrior will update its code every hour. If you are impatient, please restart the warrior and it will download the latest code and resume work.<br />
<br />
===Help! The warrior is eating all my bandwidth!===<br />
<br />
You can limit the warrior's bandwidth quite easily for VirtualBox as long as you are running a relatively recent version. The option is not offered with a GUI however.<br />
<br />
The command <pre>VBoxManage bandwidthctl archiveteam-warrior-2 add limit --type network --limit 3m</pre> will limit the warrior instance called archiveteam-warrior-2 (the default name of the warrior vm currently) to 3Mb/s. Adjust as needed.<br />
(limit units: k=kilobit, m=megabit, g=gigabit, K=kilobyte, M=megabyte, G=gigabyte)<br />
<br />
<br />
In the latest version of VirtualBox on Windows, the syntax appears to have changed. The correct command now seems to be:<br />
<br />
<pre>VBoxManage bandwidthctl archiveteam-warrior-2 add netlimit --type network --limit 3</pre><br />
<br />
For more info, consult the [http://www.virtualbox.org/manual/ch06.html#network_bandwidth_limit VirtualBox manual (Chapter 6, Section 9)].<br />
<br />
===NAT sucks! I want directly-bridged networking!===<br />
<br />
Simples! (If you're running linux, that is.)<br />
<br />
<pre>VBoxManage modifyvm "archiveteam-warrior-2" --nic1 bridged</pre><br />
<br />
<pre>VBoxManage modifyvm "archiveteam-warrior-2" --bridgeadapter1 eth0</pre><br />
<br />
(We presume you want to bind to <code>eth0</code>. Adjust as required. :))<br />
<br />
=== I turned my warrior VM appliance off. Will those tasks be lost? ===<br />
<br />
If you've killed your warrior VM instances, then the work your warrior did has been lost, however the tasks will be returned to the pool after a period of time. If you want, you can alert the admins via IRC of what's happened, and they can clear the claims your username may have made. However, this isn't very important on most projects.<br />
<br />
=== I closed my browser or tab with the warrior's web interface. Will those tasks be lost? ===<br />
<br />
No, the web browser interface just provides, well, a user interface to the warrior. As long as the VM is not stopped, it will continue normally.<br />
<br />
=== I need to disconnect my internet / reboot my PC, but I don't want to lose work. ===<br />
<br />
If you pause/suspend the warrior instance, most projects will allow resuming of work in progress when you unsuspend the warrior instance.<br />
<br />
If you decided to use the suspend feature in VirtualBox, please note that if you keep it suspended for too long (more than a few hours), the admins will assume that the item is lost and be re-queued. Using the suspend feature so that you can reboot your computer is perfectly fine.<br />
<br />
=== I told the warrior to shutdown from the interface but nothing has changed! What gives? ===<br />
<br />
The warrior will attempt to finish the current running tasks before shutting down. If you need to shut down right away, go ahead. Your progress will be lost, however the jobs will eventually cycle out to another user.<br />
<br />
=== How much disk space will the warrior use? ===<br />
<br />
Short answer: it depends on the project.<br />
<br />
Long answer: because the way each project defines an item differently, the warrior may be downloading a small file or downloading a whole subsection of a website. The virtual machine is configured by default to use 60GB as an absolute maximum. Any unused virtual machine disk space is not used on the host computer. You may, however, run the virtual machine on less than 60GB if you like to live dangerously. We're downloading the internet after all!<br />
<br />
=== The secondary disk is using up space even though it's not running a project. ===<br />
<br />
Virtual machine disk images do not behave like a regular file. There are several ways to reclaim space:<br />
<br />
* Delete the second disk and put back an empty disk. The warrior should reformat the second disk.<br />
* Delete the entire warrior application and re-import it.<br />
* Use the [http://intgat.tigress.co.uk/rmy/uml/index.html zerofree] program and then clone the disk image. Reattach the cloned disk image.<br />
<br />
=== I can't connect to localhost. ===<br />
<br />
The application includes a configuration to set up port forwarding to the guest machine on port 8001 so you can access the interface through your web browser. If this does not happen, you may need to double check your machine's network settings.<br />
<br />
=== The warrior can't connect to the internet. ===<br />
<br />
It may be possible that the virtual machine has picked up the address of the local DNS cache on your computer which the virtual machine does not have access to. <br />
<br />
If you experience this on VirtualBox, see [http://askubuntu.com/questions/204953/virtualbox-dns-stopped-working-on-upgrade-to-12-10 this question and answer].<br />
<br />
=== I'm looking at the text scrolling by and I notice some errors. rsync is not working. ===<br />
<br />
Uh-oh! Something is not right. Notify us immediately in the appropriate [[IRC]] channel.<br />
<br />
=== The item I'm working on is downloading thousands of URLs and it's taking hours. ===<br />
<br />
See the above question and reboot the warrior as appropriate.<br />
<br />
=== I'm looking at the leaderboard. What's that icon beside the username? ===<br />
<br />
That's just the warrior logo: [[File:Archive_team.png|42px]] (click on the image for a larger version). It means that that person is using the warrior. Those without the icon are running the scripts manually.<br />
<br />
=== What's that guy doing in the logo? ===<br />
<br />
The place is on fire! But don't worry, he safely escaped with the rescued data in his arms.<br />
<br />
<br />
[[Image:Archiveteam-warrior-sticker.png|256px|right]]<br />
<br />
=== That’s awesome – can I slap this logo on my laptop to show my Internet-preservation pride? ===<br />
<br />
[http://www.redbubble.com/people/ajhajh/works/12857655-archive-team-warrior-stickers?p=sticker You sure can! The ArchiveTeam Warrior laptop sticker can start conversations about archiving, if you’re into that.]<br />
<br />
=== I want to log in to the virtual machine. How do I do this? ===<br />
<br />
Unless you know what you are doing, you should not need to do this. But if you want to, the username is <code>root</code> and the password is <code>archiveteam</code>. Then, you can execute <code>sudo -u warrior -i</code> to log in as the warrior user. <br />
<br />
Press ALT+F3 to switch to virtual console number 3. Use ALT+Left or ALT+Right to switch between virtual consoles. There are 6 virtual consoles in total. Consoles 1 and 2 are reserved for the warrior.<br />
<br />
=== Can I run multiple virtual machines at the same time? ===<br />
<br />
Yes, but you'll need to adjust the networking settings.<br />
<br />
On the machine, open up Settings → Network → Adapter 1 → Port Fowarding. You need to adjust the Host Port. For example, ensure your table looks like TCP | 127.0.0.1 | 8123 | | 8001. In this example, you can then visit http://localhost:8123/ as it maps port 8123 in your browser to port 8001 which the warrior uses.<br />
<br />
=== The warrior seems to have too much overhead. I can't run a VM in a VPS! ===<br />
<br />
You don't need to run a virtual machine.<br />
<br />
An option is running Docker containers, based on LXC the overhead is far less than running a full VM on a VPS, it should be noted if you plan on running the ([https://github.com/ArchiveTeam/warrior-dockerfile warrior-dockerfile]) to publish the port to allow access to the web interface.<br />
<pre> docker run -d -p 8001:8001 archiveteam/warrior-dockerfile </pre><br />
<br />
(Above is assumed direct mapping VPS port to container port so if you wanted say <code>port 38001</code> it would be <code>docker run -d -p 38001:8001 archiveteam/warrior-dockerfile </code> Adjust as required. :P)<br />
<br />
<br />
If you are managing a VPS, it's likely you are comfortable with some Linux stuff. '''Projects can be run manually.''' Consult the project wiki page or the source code repository readme file.<br />
<br />
(Note that multiple projects can be also run in isolated environments(containers) for rapid deployment using: ([https://hub.docker.com/r/infrequent/at-as-dockerfile at-as-dockerfile]))<br />
<br />
=== Why a virtual machine in the first place? ===<br />
<br />
The virtual machine is a quick, safe, and easy way for newcomers to help us out. It offers many features:<br />
<br />
* Graphical interface<br />
* Automatically selects which project is important to run<br />
* Self-updating software infrastructure<br />
* Allows for unattended use<br />
* In case of software faults, your machine is not ruined<br />
* Restarts itself in case of runaway programs<br />
* Runs on Windows, Mac, and Linux painlessly<br />
* Ensures consistency in the archived data regardless of your machine's quirks<br />
<br />
If you have suggestions for improving this system, please talk to us as described below.<br />
<br />
=== I'm running the scripts manually in a VPS but it says the code is out of date a while later ===<br />
<br />
It happens when a bug in the scripts is discovered. Bugs are unavoidable especially when the server is out of our control.<br />
<br />
Try the <code>--auto-update</code> option available in Seesaw version 0.8. However, please be aware that you are now executing code automatically. Be sure to run the scripts in a separate user account for safety.<br />
<br />
=== I just imported the ova image and the warrior is stuck on "Preparing the data partition" ===<br />
<br />
This issue has cropped up before and we do not know what causes it. It is recommended to just delete the warrior image and import the ova again. Testing shows that such a reimport works in the majority of cases.<br />
<br />
=== Why is the default project not working? / Why is a manual project not in the Warrior yet? ===<br />
<br />
Sorry. Sometimes the administrators are too busy...<br />
<br />
=== Why are there no projects? ===<br />
<br />
If there are no projects showing, you can help us write one. No projects does ''not'' mean there is nothing left to archive!<br />
<br />
=== The instructions to run the software/scripts are awful and they are difficult to set up. ===<br />
<br />
Well, excuuuuse me, princess!<br />
<br />
We're not a professional support team so help us help you help us all. See below for bug reports, suggestions, or contribute writing code.<br />
<br />
=== Help I'm getting errors when I try to launch the VM ===<br />
If you are receiving ''"Breakpoint has been reached (0x80000003)"'', ''"A critical error has occurred while running the virtual machine and the machine execution has been stopped."'' or VT-X errors you probably have virtualization disabled in you computer's BIOS or your CPU may not support virtualization. You can check this using [http://openlibsys.org/index-ja.html VirtualChecker]<br />
<br />
To enable virtualization reboot the computer and enter the BIOS, the virtualization setting is usually under CPU configuration or Advanced settings.<br />
<br />
=== Where can I file a bug, suggestion, or a feature request? ===<br />
<br />
If the issue is related to the warrior's web interface or the library that grab scripts are using, see [https://github.com/ArchiveTeam/seesaw-kit/issues seesaw-kit issues]. Other issues should be filed into their own [[Dev/Source_Code|repositories]].<br />
<br />
=== I'd like to help write code. Where can I find more info? ===<br />
<br />
Check out the [[Dev]] documentation for details on the infrastructure and details of the source code layout.<br />
<br />
=== I still have a question! ===<br />
<br />
Check out the [[Frequently Asked Questions|general FAQ page]]. Talk to us on [[IRC]]. Use [irc://irc.efnet.org/warrior #warrior] for specific warrior questions or [irc://irc.efnet.org/archiveteam #archiveteam] for general questions.<br />
<br />
== Projects ==<br />
<br />
See: [[Warrior projects]].<br />
<br />
== Are you a coder? ==<br />
<br />
Like the warrior? Interested in how it works under the hood? Got software skills? '''[[Dev|Help us improve it!]]'''<br />
<br />
{{Navigation box}}</div>Megalanya0https://wiki.archiveteam.org/index.php?title=Wunderlist&diff=27348Wunderlist2017-01-16T15:42:59Z<p>Megalanya0: MOTHERFUCKER ! ! !</p>
<hr />
<div>== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==</div>Megalanya0https://wiki.archiveteam.org/index.php?title=Twitch.tv/Vinesauce/December_2013&diff=27347Twitch.tv/Vinesauce/December 20132017-01-16T15:42:51Z<p>Megalanya0: MOTHERFUCKER ! ! !</p>
<hr />
<div>(153 Streams)<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
=== Chunk 2013-12_C26 (12/30 - 12/31) ===<br />
<br />
* 2013-12-31T01:16:04Z - [http://www.twitch.tv/vinesauce/b/491443598 Now Streaming - Rev || Daggerfall and maybe hearthstone]<br />
<br />
* 2013-12-30T23:58:29Z - [http://www.twitch.tv/vinesauce/b/491421315 Now Streaming - Joel || Beefsmith 2014]<br />
<br />
* 2013-12-30T20:50:09Z - [http://www.twitch.tv/vinesauce/b/491385662 Now Streaming - Joel || Beefsmith 2014]<br />
<br />
* 2013-12-30T14:08:45Z - [http://www.twitch.tv/vinesauce/b/491312918 Mari &amp; Jen || Super Mario 64 Co-op || chat @ vinesauce.com!]<br />
<br />
* 2013-12-30T11:11:01Z - [http://www.twitch.tv/vinesauce/b/491291495 Mari &amp; Jen || Super Mario 64 Co-op || chat @ vinesauce.com!]<br />
<br />
* 2013-12-30T05:17:48Z - [http://www.twitch.tv/vinesauce/b/491241711 Hootey || Chess with Chat || chat @ vinesauce.com!]<br />
<br />
=== Chunk 2013-12_C25 (12/29 - 12/30) ===<br />
<br />
* 2013-12-30T04:28:55Z - [http://www.twitch.tv/vinesauce/b/491226038 fred]<br />
<br />
* 2013-12-29T23:30:01Z - [http://www.twitch.tv/vinesauce/b/491166704 Vinny || Tower of Guns Developer Interview]<br />
<br />
* 2013-12-29T22:15:59Z - [http://www.twitch.tv/vinesauce/b/491149548 Jen &amp; Friends || LFD2 Custom Map]<br />
<br />
* 2013-12-29T18:43:22Z - [http://www.twitch.tv/vinesauce/b/491099826 Vinny || Animal Crossing: New Leaf + NES Remix]<br />
<br />
* 2013-12-29T06:57:53Z - [http://www.twitch.tv/vinesauce/b/490993914 Vinny || Airport Simulator 2013 + More]<br />
<br />
=== Chunk 2013-12_C24 (12/28 - 12/29) ===<br />
<br />
* 2013-12-29T06:50:44Z - [http://www.twitch.tv/vinesauce/b/490987320 DireBoar dying a lot]<br />
<br />
* 2013-12-29T05:50:20Z - [http://www.twitch.tv/vinesauce/b/490981543 DireBoar dying a lot]<br />
<br />
* 2013-12-29T00:29:15Z - [http://www.twitch.tv/vinesauce/b/490914452 Now Streaming - Rev || Daggerfall || chat @ vinesauce.com!]<br />
<br />
* 2013-12-28T19:45:35Z - [http://www.twitch.tv/vinesauce/b/490849616 Jen || Harvest Moon 64 || chat @ vinesauce.com!]<br />
<br />
* 2013-12-28T08:11:06Z - [http://www.twitch.tv/vinesauce/b/490742458 Hootey || Sid Meier's Pirates! (finale) || chat @ vinesauce.com!]<br />
<br />
* 2013-12-28T08:07:06Z - [http://www.twitch.tv/vinesauce/b/490737842 Hootey || Sid Meier's Pirates! (finale) || chat @ vinesauce.com!]<br />
<br />
=== Chunk 2013-12_C23 (12/27 - 12/28) ===<br />
<br />
* 2013-12-28T05:19:38Z - [http://www.twitch.tv/vinesauce/b/490712422 Fred, KY, and Notsoserious || Ouya/Wii U games // Chat @ Vinesauce.com]<br />
<br />
* 2013-12-28T03:58:19Z - [http://www.twitch.tv/vinesauce/b/490695774 Fred, KY, and Notsoserious || Ouya/Wii U games // Chat @ Vinesauce.com]<br />
<br />
* 2013-12-28T01:13:36Z - [http://www.twitch.tv/vinesauce/b/490660800 Hootey || Story Time (chat participation encouraged!), possibly SMRPG // chat @ vinesauce.com!]<br />
<br />
* 2013-12-27T19:20:06Z - [http://www.twitch.tv/vinesauce/b/490583873 Jen || Star Ocean // Chat @ Vinesauce.com]<br />
<br />
* 2013-12-27T04:57:16Z - [http://www.twitch.tv/vinesauce/b/490455914 Vinny || Super Mario Crossover + More]<br />
<br />
=== Chunk 2013-12_C22 (12/26 - 12/27) ===<br />
<br />
* 2013-12-27T02:41:59Z - [http://www.twitch.tv/vinesauce/b/490429252 Fred Plays Ys: Memories of Celceta!]<br />
<br />
* 2013-12-27T02:39:14Z - [http://www.twitch.tv/vinesauce/b/490423063 Fred Plays Ys: Memories of Celceta!]<br />
<br />
* 2013-12-27T02:04:29Z - [http://www.twitch.tv/vinesauce/b/490421697 Hootey || Chess, possibly some Super Mario RPG to follow || Chat @ vinesauce.com!]<br />
<br />
* 2013-12-26T23:35:08Z - [http://www.twitch.tv/vinesauce/b/490391679 Now Streaming - Rev || Daggerfall]<br />
<br />
* 2013-12-26T22:02:43Z - [http://www.twitch.tv/vinesauce/b/490372676 DireBoar playing The Girl and the Robot (Alpha)]<br />
<br />
=== Chunk 2013-12_C21 (12/26) ===<br />
<br />
* 2013-12-26T16:48:50Z - [http://www.twitch.tv/vinesauce/b/490313512 Jen || Harvest Moon 64]<br />
<br />
* 2013-12-26T16:46:14Z - [http://www.twitch.tv/vinesauce/b/490308785 Jen || Harvest Moon 64]<br />
<br />
* 2013-12-26T08:17:08Z - [http://www.twitch.tv/vinesauce/b/490254302 Vinny || GTA:Online]<br />
<br />
* 2013-12-26T07:03:44Z - [http://www.twitch.tv/vinesauce/b/490245366 Vinny || GTA:Online]<br />
<br />
* 2013-12-26T03:08:42Z - [http://www.twitch.tv/vinesauce/b/490208927 Fred Plays Killer is Dead!]<br />
<br />
=== Chunk 2013-12_C20 - Christmas Special Part 2 ===<br />
<br />
* 2013-12-26T02:01:20Z - [http://www.twitch.tv/vinesauce/b/490197853 Fred Give-Away and then Plays Ys: Memories of Celceta!]<br />
<br />
* 2013-12-26T00:06:45Z - [http://www.twitch.tv/vinesauce/b/490178981 Imakuni &amp; Conome's Christmas Clusterpoop || Multiple games]<br />
<br />
* 2013-12-25T20:33:09Z - [http://www.twitch.tv/vinesauce/b/490142342 Imakuni &amp; Conome's Christmas Clusterpoop || Multiple games]<br />
<br />
* 2013-12-25T19:25:48Z - [http://www.twitch.tv/vinesauce/b/490131054 Vinny || Super Mario Crossover]<br />
<br />
=== Chunk 2013-12_C19 - Christmas Special Part 1 ===<br />
<br />
* 2013-12-25T16:02:45Z - [http://www.twitch.tv/vinesauce/b/490100516 Hootey || Die Hard (getting into the Christmas spirit!) || chat @ vinesauce.com!]<br />
<br />
* 2013-12-25T08:17:02Z - [http://www.twitch.tv/vinesauce/b/490051447 Viscera Cleanup Detail: Santa's Rampage + More with Vinny (Giving Away 20+ Games Also)]<br />
<br />
* 2013-12-25T06:02:29Z - [http://www.twitch.tv/vinesauce/b/490034428 Pikmin 3 Holiday Maps + More with Vinny (Giving Away 20+ Games Also)]<br />
<br />
* 2013-12-25T02:08:15Z - [http://www.twitch.tv/vinesauce/b/490000110 KY gives away games with the Spelunky Death Roulette! Chat @ vinesauce.com]<br />
<br />
=== Chunk 2013-12_C18 (12/24 - 12/25) ===<br />
<br />
* 2013-12-25T00:57:26Z - [http://www.twitch.tv/vinesauce/b/489989556 Fred Plays Ys: Memories of Celceta]<br />
<br />
* 2013-12-25T00:35:28Z - [http://www.twitch.tv/vinesauce/b/489983572 Fred Plays Ys: Memories of Celceta]<br />
<br />
* 2013-12-25T00:24:15Z - [http://www.twitch.tv/vinesauce/b/489981593 Fred Plays Ys: Memories of Celceta]<br />
<br />
* 2013-12-25T00:06:58Z - [http://www.twitch.tv/vinesauce/b/489979767 Fred Plays Ys: Memories of Celceta]<br />
<br />
* 2013-12-24T20:15:01Z - [http://www.twitch.tv/vinesauce/b/489943916 Fred Plays Ys: Memories of Celceta]<br />
<br />
* 2013-12-24T05:34:15Z - [http://www.twitch.tv/vinesauce/b/489827246 Fred Plays Ys: Memories of Celceta]<br />
<br />
* 2013-12-24T05:31:55Z - [http://www.twitch.tv/vinesauce/b/489821628 Fred Plays Ys: Memories of Celceta]<br />
<br />
=== Chunk 2013-12_C17 (12/22 - 12/24) ===<br />
<br />
* 2013-12-24T00:20:56Z - [http://www.twitch.tv/vinesauce/b/489766371 Rev Plays Daggerfall]<br />
<br />
* 2013-12-23T20:57:03Z - [http://www.twitch.tv/vinesauce/b/489723991 Hootey plays Super Mario RPG: Legend of the Seven Stars || chat at vinesauce.com! || Happy Holidays!]<br />
<br />
* 2013-12-23T19:16:28Z - [http://www.twitch.tv/vinesauce/b/489703345 Limes || Lego Marvel]<br />
<br />
* 2013-12-22T23:16:44Z - [http://www.twitch.tv/vinesauce/b/489512848 Vinny plays Wierd Games then NES Remix and Super Mario 3D World]<br />
<br />
* 2013-12-22T19:15:01Z - [http://www.twitch.tv/vinesauce/b/489459217 Joel plays F-Zero X]<br />
<br />
=== Chunk 2013-12_C16 (12/22) ===<br />
<br />
* 2013-12-22T17:55:07Z - [http://www.twitch.tv/vinesauce/b/489442375 Limes || Lego Marvel]<br />
<br />
* 2013-12-22T17:54:31Z - [http://www.twitch.tv/vinesauce/b/489436381 Limes || Lego Marvel]<br />
<br />
* 2013-12-22T14:44:45Z - [http://www.twitch.tv/vinesauce/b/489407838 Fred plays Sega Genesis Games]<br />
<br />
* 2013-12-22T14:20:19Z - [http://www.twitch.tv/vinesauce/b/489403018 Fred plays Sega Genesis Games]<br />
<br />
* 2013-12-22T14:09:08Z - [http://www.twitch.tv/vinesauce/b/489398925 Fred plays Sega Genesis Games]<br />
<br />
* 2013-12-22T14:07:11Z - [http://www.twitch.tv/vinesauce/b/489397304 Fred plays Sega Genesis Games]<br />
<br />
* 2013-12-22T13:48:34Z - [http://www.twitch.tv/vinesauce/b/489397238 Fred plays Sega Genesis Games]<br />
<br />
* 2013-12-22T05:19:29Z - [http://www.twitch.tv/vinesauce/b/489324636 Fred plays Shaq Fu, Battles Toad, and other crappy games]<br />
<br />
* 2013-12-22T05:17:11Z - [http://www.twitch.tv/vinesauce/b/489318713 Fred plays Shaq Fu, Battles Toad, and other crappy games]<br />
<br />
* 2013-12-22T05:14:15Z - [http://www.twitch.tv/vinesauce/b/489318439 Fred plays Shaq Fu, Battles Toad, and other crappy games]<br />
<br />
=== Chunk 2013-12_C15 (12/20 - 12/21) ===<br />
<br />
* 2013-12-21T17:05:19Z - [http://www.twitch.tv/vinesauce/b/489168583 Joel || Hardcore Saturday? ( Half Life 2 in one sitting )]<br />
<br />
* 2013-12-21T02:46:11Z - [http://www.twitch.tv/vinesauce/b/489027887 KY plays Spelunky Death Roulette! Chat @ vinesauce.com]<br />
<br />
* 2013-12-21T02:43:28Z - [http://www.twitch.tv/vinesauce/b/489021292 KY plays Spelunky Death Roulette! Chat @ vinesauce.com]<br />
<br />
* 2013-12-21T00:37:20Z - [http://www.twitch.tv/vinesauce/b/489001153 KY plays Chrono Cross, blind run! Chat @ vinesauce.com]<br />
<br />
* 2013-12-20T22:40:47Z - [http://www.twitch.tv/vinesauce/b/488976552 Hootey plays Super Mario RPG || Chat at vinesauce.com!]<br />
<br />
=== Chunk 2013-12_C14 (12/19 - 12/20) ===<br />
<br />
* 2013-12-20T22:13:42Z - [http://www.twitch.tv/vinesauce/b/488965695 freddy]<br />
<br />
* 2013-12-20T08:23:12Z - [http://www.twitch.tv/vinesauce/b/488839012 KY plays Contraption Maker (successor to The Incredible Machine!) Chat @ vinesauce.com]<br />
<br />
* 2013-12-20T04:59:46Z - [http://www.twitch.tv/vinesauce/b/488808607 Vinny plays Just Cause 2 Multiplayer + More?]<br />
<br />
* 2013-12-20T01:52:47Z - [http://www.twitch.tv/vinesauce/b/488769916 Fred plays Ys: Memories of Celceta]<br />
<br />
* 2013-12-19T22:13:17Z - [http://www.twitch.tv/vinesauce/b/488724573 Joel || GTA late night fuckery + more]<br />
<br />
=== Chunk 2013-12_C13 (12/18 - 12/19) ===<br />
<br />
* 2013-12-19T14:41:29Z - [http://www.twitch.tv/vinesauce/b/488637672 Fred plays Video Games]<br />
<br />
* 2013-12-19T12:07:16Z - [http://www.twitch.tv/vinesauce/b/488617963 Fred plays Video Games]<br />
<br />
* 2013-12-19T06:52:28Z - [http://www.twitch.tv/vinesauce/b/488582081 Vinny plays Walking Dead Season 2]<br />
<br />
* 2013-12-19T03:29:15Z - [http://www.twitch.tv/vinesauce/b/488545084 Vinny plays NES Remix + Walking Dead Season 2]<br />
<br />
* 2013-12-18T21:27:24Z - [http://www.twitch.tv/vinesauce/b/488469614 Now Streaming - Rev and Tilde || Don't Starve Multiplayer Mod]<br />
<br />
=== Chunk 2013-12_C12 (12/18) ===<br />
<br />
* 2013-12-18T16:02:09Z - [http://www.twitch.tv/vinesauce/b/488403083 Limes || Bayonetta 'n Stuff]<br />
<br />
* 2013-12-18T15:58:28Z - [http://www.twitch.tv/vinesauce/b/488398019 Limes || Bayonetta 'n Stuff]<br />
<br />
* 2013-12-18T05:00:20Z - [http://www.twitch.tv/vinesauce/b/488313599 Vinny plays Zelda: A Link Between Worlds]<br />
<br />
* 2013-12-18T03:20:57Z - [http://www.twitch.tv/vinesauce/b/488292608 Hootey plays Super Mario RPG: Legend of the Seven Star || Chat at vinesauce.com!]<br />
<br />
* 2013-12-18T02:06:30Z - [http://www.twitch.tv/vinesauce/b/488274996 Conome streaming Noctorne]<br />
<br />
* 2013-12-18T00:50:49Z - [http://www.twitch.tv/vinesauce/b/488259034 Conome streaming Noctorne]<br />
<br />
=== Chunk 2013-12_C11 (12/17) ===<br />
<br />
* 2013-12-17T20:58:39Z - [http://www.twitch.tv/vinesauce/b/488207506 Limes || Bayonetta and Marvel]<br />
<br />
* 2013-12-17T20:53:42Z - [http://www.twitch.tv/vinesauce/b/488200101 Limes || Bayonetta and Marvel]<br />
<br />
* 2013-12-17T08:04:10Z - [http://www.twitch.tv/vinesauce/b/488086102 Vinny || Super Mario Crossover (Lost Levels)]<br />
<br />
* 2013-12-17T05:12:38Z - [http://www.twitch.tv/vinesauce/b/488059231 KY plays Legend of Dungeon! Chat (with developers?) @ vinesauce.com]<br />
<br />
* 2013-12-17T02:19:02Z - [http://www.twitch.tv/vinesauce/b/488025134 Fred plays Ys: Memories of Celceta!]<br />
<br />
=== Chunk 2013-12_C10 (12/16) ===<br />
<br />
* 2013-12-16T23:21:45Z - [http://www.twitch.tv/vinesauce/b/487989626 Limes || Bayonetta]<br />
<br />
* 2013-12-16T15:51:46Z - [http://www.twitch.tv/vinesauce/b/487902090 Limes || Bayonetta]<br />
<br />
* 2013-12-16T15:13:19Z - [http://www.twitch.tv/vinesauce/b/487896506 Limes || Bayonetta]<br />
<br />
* 2013-12-16T08:12:27Z - [http://www.twitch.tv/vinesauce/b/487850520 Vinny and Friends play GTA Online]<br />
<br />
* 2013-12-16T08:03:18Z - [http://www.twitch.tv/vinesauce/b/487846998 Vinny and Friends play GTA Online]<br />
<br />
* 2013-12-16T05:50:51Z - [http://www.twitch.tv/vinesauce/b/487832155 Vinny and Friends play GTA Online]<br />
<br />
* 2013-12-16T05:20:06Z - [http://www.twitch.tv/vinesauce/b/487823320 Vinny and Friends play GTA Online]<br />
<br />
=== Chunk 2013-12_C09 (12/15 - 12/16) ===<br />
<br />
* 2013-12-16T00:35:54Z - [http://www.twitch.tv/vinesauce/b/487774808 Fred plays Spelunky]<br />
<br />
* 2013-12-15T22:50:53Z - [http://www.twitch.tv/vinesauce/b/487754099 Vinny plays Super Mario 3D World]<br />
<br />
* 2013-12-15T14:37:50Z - [http://www.twitch.tv/vinesauce/b/487652180 Marisa and Jen play Starbound! Chat @ Vinesauce.com]<br />
<br />
* 2013-12-15T10:46:13Z - [http://www.twitch.tv/vinesauce/b/487618159 Fred plays Killer is Dead]<br />
<br />
* 2013-12-15T10:44:29Z - [http://www.twitch.tv/vinesauce/b/487614193 Fred plays Killer is Dead]<br />
<br />
* 2013-12-15T08:26:40Z - [http://www.twitch.tv/vinesauce/b/487598917 Fred plays Tearaway!(Finale?)]<br />
<br />
=== Chunk 2013-12_C08 (12/14 - 12/15) ===<br />
<br />
* 2013-12-15T08:17:20Z - [http://www.twitch.tv/vinesauce/b/487593610 Corruptions with Vinny]<br />
<br />
* 2013-12-15T04:00:19Z - [http://www.twitch.tv/vinesauce/b/487547310 Vinny Plays Horrible Games &amp; Good Games]<br />
<br />
* 2013-12-15T00:01:07Z - [http://www.twitch.tv/vinesauce/b/487492357 Fred plays Tearaway!]<br />
<br />
* 2013-12-14T23:51:09Z - [http://www.twitch.tv/vinesauce/b/487483830 Fred plays Tearaway!]<br />
<br />
* 2013-12-14T19:38:22Z - [http://www.twitch.tv/vinesauce/b/487427889 Joel || Harcore Fridays ( Saturday(???) ) Doom 20th Anniversary]<br />
<br />
=== Chunk 2013-12_C07 (12/14) ===<br />
<br />
* 2013-12-14T18:48:06Z - [http://www.twitch.tv/vinesauce/b/487416073 Now Streaming - Jen || Harvest Moon 64]<br />
<br />
* 2013-12-14T18:32:42Z - [http://www.twitch.tv/vinesauce/b/487408416 Now Streaming - Jen || Harvest Moon 64]<br />
<br />
* 2013-12-14T18:21:37Z - [http://www.twitch.tv/vinesauce/b/487404800 Now Streaming - Jen || Harvest Moon 64]<br />
<br />
* 2013-12-14T17:52:33Z - [http://www.twitch.tv/vinesauce/b/487402494 Now Streaming - Jen || Harvest Moon 64]<br />
<br />
* 2013-12-14T15:31:06Z - [http://www.twitch.tv/vinesauce/b/487370514 Now Streaming - Rev || Daggerfall]<br />
<br />
=== Chunk 2013-12_C06 (12/13 - 12/14) ===<br />
<br />
* 2013-12-14T13:09:49Z - [http://www.twitch.tv/vinesauce/b/487351180 Fred plays Killer is Dead!]<br />
<br />
* 2013-12-14T11:50:57Z - [http://www.twitch.tv/vinesauce/b/487339545 Fred plays Killer is Dead!]<br />
<br />
* 2013-12-14T09:42:53Z - [http://www.twitch.tv/vinesauce/b/487322658 Fred plays Starcraft 2 HoTS!]<br />
<br />
* 2013-12-14T03:58:34Z - [http://www.twitch.tv/vinesauce/b/487260998 Fred plays Spelunky Speed Runs and may More!]<br />
<br />
* 2013-12-14T02:17:19Z - [http://www.twitch.tv/vinesauce/b/487232927 Fred plays Ys: Memories of Celceta]<br />
<br />
* 2013-12-13T20:15:43Z - [http://www.twitch.tv/vinesauce/b/487152796 Now Streaming - Rev || Daggerfall Chat @ Vinesauce.com]<br />
<br />
=== Chunk 2013-12_C05 (12/13) ===<br />
<br />
* 2013-12-13T18:19:32Z - [http://www.twitch.tv/vinesauce/b/487132791 Jen plays Risk of Rain with Vinebros! Chat @ Vinesauce.com]<br />
<br />
* 2013-12-13T08:01:47Z - [http://www.twitch.tv/vinesauce/b/487048322 Fred plays Killer is Dead]<br />
<br />
* 2013-12-13T06:40:57Z - [http://www.twitch.tv/vinesauce/b/487037437 Fred plays Killer is Dead]<br />
<br />
* 2013-12-13T02:40:26Z - [http://www.twitch.tv/vinesauce/b/486991625 Vinny plays Starbound, Zelda LBW and Mario Party 3DS (All at the same time! Not really...)]<br />
<br />
* 2013-12-13T01:10:43Z - [http://www.twitch.tv/vinesauce/b/486971975 Now Streaming - Rev || More Daggerfall]<br />
<br />
* 2013-12-13T01:01:42Z - [http://www.twitch.tv/vinesauce/b/486965547 Now Streaming - Rev || More Daggerfall]<br />
<br />
=== Chunk 2013-12_C04 (12/10 - 12/12) ===<br />
<br />
* 2013-12-12T19:58:28Z - [http://www.twitch.tv/vinesauce/b/486906858 Joel || Short physics simulator]<br />
<br />
* 2013-12-12T16:23:06Z - [http://www.twitch.tv/vinesauce/b/486866156 Now Streaming - Rev || Finishing an Arena run and Daggerfall]<br />
<br />
* 2013-12-12T06:30:06Z - [http://www.twitch.tv/vinesauce/b/486793991 Vinny - Mario 3D World and More]<br />
<br />
* 2013-12-12T05:40:54Z - [http://www.twitch.tv/vinesauce/b/486786032 Vinny - Mario 3D World and More]<br />
<br />
* 2013-12-11T21:44:09Z - [http://www.twitch.tv/vinesauce/b/486691677 Joel fucks around in a physics simulator, Rev gets hit by a storm of swarm of dick]<br />
<br />
* 2013-12-10T19:07:39Z - [http://www.twitch.tv/vinesauce/b/486418008 Now Streaming - Joel || Tekken TAG 2 with annoying delay + Arcade Madness]<br />
<br />
=== Chunk 2013-12_C03 (12/08 - 12/10) ===<br />
<br />
* 2013-12-10T08:17:02Z - [http://www.twitch.tv/vinesauce/b/486328997 Vinny plays Starbound (Newest Update)]<br />
<br />
* 2013-12-10T07:33:44Z - [http://www.twitch.tv/vinesauce/b/486323790 Vinny plays Starbound (Newest Update)]<br />
<br />
* 2013-12-09T03:15:06Z - [http://www.twitch.tv/vinesauce/b/486052702 Vinny || GTA Online]<br />
<br />
* 2013-12-09T01:38:19Z - [http://www.twitch.tv/vinesauce/b/486033675 Vinny || Zelda: Link Between Worlds and GTA Online]<br />
<br />
* 2013-12-08T21:16:03Z - [http://www.twitch.tv/vinesauce/b/485979495 Joel || Tekken TAG 2 + Arcade Madness]<br />
<br />
=== Chunk 2013-12_C02 (12/05 - 12/07) ===<br />
<br />
* 2013-12-07T22:33:29Z - [http://www.twitch.tv/vinesauce/b/485720532 Vinny || Super Mario 3D World]<br />
<br />
* 2013-12-07T20:13:33Z - [http://www.twitch.tv/vinesauce/b/485687306 Vinny || Super Mario 3D World]<br />
<br />
* 2013-12-07T04:27:57Z - [http://www.twitch.tv/vinesauce/b/485522699 Vinny plays Starbound]<br />
<br />
* 2013-12-06T04:36:09Z - [http://www.twitch.tv/vinesauce/b/485275709 Hootey plays Long Live The Queen || Chat @ vinesauce.com]<br />
<br />
* 2013-12-05T18:56:15Z - [http://www.twitch.tv/vinesauce/b/485156656 Darren plays Starbound || Chat @ vinesauce.com]<br />
<br />
<br />
=== Chunk 2013-12_C01 (12/01 - 12/04) ===<br />
<br />
* 2013-12-04T18:54:34Z - [http://www.twitch.tv/vinesauce/b/484921881 Joel plays Brutal Nature and plays GTA online for the first time]<br />
<br />
* 2013-12-04T04:19:57Z - [http://www.twitch.tv/vinesauce/b/484806918 Vinny || AVGN Game + Zelda: Link Between Worlds]<br />
<br />
* 2013-12-03T03:42:14Z - [http://www.twitch.tv/vinesauce/b/484579391 Vinny || Godus, Zelda + More]<br />
<br />
* 2013-12-02T03:14:51Z - [http://www.twitch.tv/vinesauce/b/484360603 GTA Online with Vinny and friends]<br />
<br />
* 2013-12-01T21:48:08Z - [http://www.twitch.tv/vinesauce/b/484298888 Glitch Cricket with Vinny]<br />
<br />
* 2013-12-01T03:31:46Z - [http://www.twitch.tv/vinesauce/b/484111533 Rev || Daggerfall]<br />
<br />
* 2013-12-01T00:37:26Z - [http://www.twitch.tv/vinesauce/b/484076742 Mario Party 3DS with Vinny (+More)]</div>Megalanya0https://wiki.archiveteam.org/index.php?title=Blogger&diff=27346Blogger2017-01-16T15:42:41Z<p>Megalanya0: MOTHERFUCKER ! ! !</p>
<hr />
<div>{{Infobox project<br />
| title = Blogger<br />
| logo = Blogger-logo.png<br />
| image = Blogger- Crea tu blog gratuito 1303511108785.png<br />
| description = <br />
| URL = http://www.blogger.com/<br />
| project_status = {{online}}<br />
| archiving_status = {{notsavedyet}}<br />
| source = [https://github.com/ArchiveTeam/blogger-discovery blogger-discovery]<br />
| tracker = [http://tracker.archiveteam.org/bloggerdisco/ bloggerdisco]<br />
| irc = frogger<br />
}}<br />
<br />
'''Blogger''' is a blog hosting service. On February 23, 2015, they announced that "sexually explicit" blogs would be restricted from public access in a month. But soon they withdrew their plan, and said they wouldn't change their existing policies.<ref>https://support.google.com/blogger/answer/6170671?p=policy_update&hl=en&rd=1</ref><br />
<br />
'''ArchiveTeam did a discovery between February and May 2015, but actual content has not been downloaded yet.'''<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== '''MOTHERFUCKER ! ! !''' ==<br />
<br />
== Country Redirect ==<br />
<br />
Accessing http://whatever.blogspot.com will usually redirect to a country-specific subdomain depending on your IP address (e.g. whatever.blogspot.co.uk, whatever.blogspot.in, etc) which in some cases may be censored or edited to meet local laws and standards - this can be bypassed by requesting http://whatever.blogspot.com/ncr as the root URL.<ref>https://support.google.com/blogger/answer/2402711?hl=en</ref> <ref>http://www.bbc.co.uk/news/technology-16852920</ref><br />
<br />
== Downloading a single blog with Wget ==<br />
These Wget parameters can download a BlogSpot blog, including comments and any on-site dependencies. It should also reject redundant pages such as the /search/ directory and any multiple occurrences of the same page but with different query strings. It has only be tested on blogs using a Blogger subdomain (e.g. http://foobar.blogspot.com), not custom domains (e.g. http://foobar.com). Both instances of [URL] should be replaced with the same URL. A simple Perl wrapper is available [http://pastebin.com/2QUuH26L here].<br />
<br />
<tt>wget --recursive --level=2 --no-clobber --no-parent --page-requisites --continue --convert-links --user-agent="" -e robots=off --reject "*\\?*,*@*" --exclude-directories="/search,/feeds" --referer="[URL]" --wait 1 [URL]</tt><br />
<br />
'''UPDATE''':<br />
<br />
Use this improved bash script instead, in order to bypass the adult content confirmation. BLOGURL should be in <code><nowiki>http://someblog.blogspot.com</nowiki></code> format.<br />
<br />
<pre style="white-space: pre-wrap"><br />
#!/bin/bash<br />
blogspoturl="BLOGURL"<br />
wget -O - "blogger.com/blogin.g?blogspotURL=$blogspoturl" | grep guestAuth | cut -d'"' -f 4 | wget -i - --save-cookies cookies.txt --keep-session-cookies<br />
wget --load-cookies cookies.txt --recursive --level=2 --no-clobber --no-parent --page-requisites --continue --convert-links --user-agent="" -e robots=off --reject "*\\?*,*@*" --exclude-directories="/search,/feeds" --referer="$blogspoturl" --wait 1 $blogspoturl<br />
</pre><br />
<br />
== Export XML trick ==<br />
Add this to a blog url and it will download the most recent 499 posts (that is the limit): /atom.xml?redirect=false&max-results=<br />
<br />
== Your own blogs ==<br />
<br />
Download them at https://takeout.google.com/settings/takeout<br />
<br />
We've not tested whether the output is suitable for importing in any other software such as Wordpress.<br />
<br />
== External links ==<br />
* {{url|1=http://www.blogger.com/|2=Blogger}}<br />
<br />
== References ==<br />
<references/><br />
<br />
{{Navigation box}}<br />
<br />
[[Category:Google]]</div>Megalanya0