Difference between revisions of "User:Sanqui"

From Archiveteam
Jump to navigation Jump to search
m (typo)
(add my IM template)
(20 intermediate revisions by the same user not shown)
Line 1: Line 1:
Email: gsanky@gmail.com
Hi, I am Sanqui!  I consider myself an amateur digital archivist and have been helping Archive Team on and off for several years.  My other topics of interest are birds and nature conservation at wide, old technology and video games, reverse engineering and ROM hacking, internet and furry culture, and linguistics.  Anything that intersects these topics is likely to capture my attention as far as archival goes.  I'm fluent in Czech and English and I have basic skills that will assist in archiving sites in German, Estonian, and Japanese.  If you believe I could help '''you''' with preserving content at risk, don't hesitate to contact me:
* IRC - efnet, Sanqui!~SanquiG@ghorland.net
* Telegram - @Sanqui
* Discord - Sanqui#3248
* Email - me@sanqui.net


On #archiveteam as Sanqui
Get in touch!


Further contact info on my homepage: <nowiki>http://sanqui.&#xfeff;rustedlogic.net</nowiki> <!-- spam filter workaround, doesn't need to be clickable anyway -->
Homepage: [https://sanqui.net sanqui.net]


== Some works ==


----
{{Czech websites}}
{{Instant messengers}}


* Twitch Plays Pokémon logs: https://archive.org/details/tpp_logs
* [[Internet Centrum]] - Czech freehost, gone 2015-03-03
* [[Webzdarma]] - Another Czech freehost, keep an eye on it
* [[Retrospring]] - Q&A, gone 2016-06-12
* [[Nifty]] - Japanese freehost, gone 2016-11-10
* [[Romhacking.net]]
* [[Spolužáci.cz]]
* [[Floraverse]]
* [[Discord]]
* [[Duelyst]]


I archived Twitch Plays Pokémon logs: <nowiki>https://archive.&#xfeff;org/details/tpp_logs</nowiki>
Perpetually feeding interesting small to medium sized websites to [[ArchiveBot]].  Mainly related to old technology, video games, ROM hacking, internet and furry culture.


Currently trying to organize saving the sites hosted for free on [[Internet Centrum]].
[https://archive.fart.website/bin/irclogger_log_search_a/archivebot?search=%28Sanqui%7CSanky%29.*%21a%5B%28rchive%29o+%5D&action=search&error=0 My ArchiveBot commands]


* [https://www.linfoxdomain.com/ linfoxdomain.com] (archived 2015-10-14) - DS flashcart firmware files and more
* Lots of Japanese Pokémon fansites
* [https://www.ocf.berkeley.edu/~jdonald/pokemon/ hanzou's pokemon stuff] (archived 2015-04-17)
* [http://free-smiley-faces.de/ free-smiley-faces.de] (archived 2015-04-13) - a legend
* [http://www.avians.net/ avians.net] (archived 2015-12-19)
* [http://forum.gbadev.org/ forum.gbadev.org] (archived 2016-03-20)
* [http://www.projectspark.com projectspark.com], [http://forums.projectspark.com forums.projectspark.com] (archived 2016-05-14) - gone
* [http://pokefactory.pokemology.com/ pokefactory.pokemology.com] (archived 2016-08-10)
* [http://otp22.referata.com otp22.referata.com] (archived 2016-08-11)
* [http://forum.machinaesupremacy.com/ forums.machinaesupremacy.com] (archived 2016-10-22)
* [https://devkitpro.org/ devkitpro.org] (archived 2016-11-28)
* [https://zdoom.org/ zdoom.org], [https://forums.zdoom.org/ forums.zdoom.org] (archived 2017-01-08)
* [http://lilymud.net lilymud.net] (archived 2017-03-04)
* [http://vgmrips.net vgmrips.net] (archived 2017-03-23)


=== Parallel wget ===
== Todo ==
<pre>
#!/bin/sh
cat $LIST | xargs -n 1 -P $PARALLEL -I % \
wget \
-mc --waitretry 5 --timeout 60 --tries 5 \
--user-agent="Mozilla/5.0 (X11; Ubuntu; Linux i686; rv:28.0) Gecko/20100101 Firefox/28.0"\
-e robots=off \
--warc-header "operator: Archive Team (Sanqui)" --warc-cdx --warc-file="sanqui_00_%" \
-o "sanqui_00_%.log" \
%
</pre>


=== Common/useful czech words ===
* recordings of Game Page, a czech video game TV program.  Needs something custom to scrape videos. http://www.ceskatelevize.cz/porady/1095870977-game-page/205562242600016/
* [https://handheld.allepaginas.nl/ https://handheld.allepaginas.nl/]
 
== Mini list of freehosts and blog hosts ==
 
* [[Webzdarma]]
* sweb.cz
* https://x10hosting.com/
* dreamwidth.org
* http://www.geocities.ws/
* weebly.com
 
== Possible future projects ==
 
* ArchiveBot helper, something to watch the logs and suggest ignores
* Semantic Mediawiki for archiveteam.org
* Cooperation with TCRF and Hidden Palace
* tilde servers (tad bit too late)
* Invisionfree, Zetaboards, and other free forum hosts, possibly including registrations to scrape more data
* MU*: archive what's left of MUD/MUSH/MUCK/MOO servers using a semi-automated process; record people's memories
* Minecraft: automatically visit and archive Minecraft servers (classic and modern)
* A search service indexing only freehosts
 
== Alt search engines ==
* https://millionshort.com/
* https://wiby.me/
* https://pinboard.in/search/
* https://private.sh/
 
== Common/useful czech words ==
<pre>
<pre>
cislo (číslo) = number
cislo (číslo) = number

Revision as of 18:59, 17 May 2020

Hi, I am Sanqui! I consider myself an amateur digital archivist and have been helping Archive Team on and off for several years. My other topics of interest are birds and nature conservation at wide, old technology and video games, reverse engineering and ROM hacking, internet and furry culture, and linguistics. Anything that intersects these topics is likely to capture my attention as far as archival goes. I'm fluent in Czech and English and I have basic skills that will assist in archiving sites in German, Estonian, and Japanese. If you believe I could help you with preserving content at risk, don't hesitate to contact me:

  • IRC - efnet, Sanqui!~SanquiG@ghorland.net
  • Telegram - @Sanqui
  • Discord - Sanqui#3248
  • Email - me@sanqui.net

Get in touch!

Homepage: sanqui.net

Some works

     Czech and Slovak websites     
Webhosting WebzdarmaInternet CentrumEStránky.czwebnode.cz

Sweb.czblog.czhostuju.czszm.comwebgarden.cz (jex.cz)

home.tiscali.cz

Social media Spolužáci.czLidé
Photo and file hosting uloz.torajce.net
Message boards okoun.cznyx.cznyx.czHOFYLAND.CZLopuch.cz
Fandoms Pikachu.czPJZ.czHOCZBronies.czfurry.czFurrici.info
Instant messengers
'80s

talkIRC

'90s

ICQAIMYahoo! MessengerMSN MessengerJabber/XMPPQQ

'00s

SkypeGoogle TalkFacebook MessengerWhatsApp

'10s

KikViberSnapchatLINETelegramSlackGitter
KeybaseSignalMusical.ly/TikTokMatrixDiscordInstagram


Perpetually feeding interesting small to medium sized websites to ArchiveBot. Mainly related to old technology, video games, ROM hacking, internet and furry culture.

My ArchiveBot commands

Todo

Mini list of freehosts and blog hosts

Possible future projects

  • ArchiveBot helper, something to watch the logs and suggest ignores
  • Semantic Mediawiki for archiveteam.org
  • Cooperation with TCRF and Hidden Palace
  • tilde servers (tad bit too late)
  • Invisionfree, Zetaboards, and other free forum hosts, possibly including registrations to scrape more data
  • MU*: archive what's left of MUD/MUSH/MUCK/MOO servers using a semi-automated process; record people's memories
  • Minecraft: automatically visit and archive Minecraft servers (classic and modern)
  • A search service indexing only freehosts

Alt search engines

Common/useful czech words

cislo (číslo) = number
hlas, hlasovat, hlasování = vote
jmeno (jméno) = name
nahled (náhled) = preview, thumbnail
obr, obrazek (obrázek) = picture
posli (pošli), poslat = send
sprava (správa) = administration
stahuj, stáhnout, stahovat = download
tisk = print
ukaz (ukaž) = show
zprava (zpráva) = message