URLTeam

From Archiveteam
Revision as of 21:16, 5 November 2015 by HCross (talk | contribs)
Jump to: navigation, search
Urlteam
URLTeam logo
url shortening was a fucking awful idea
url shortening was a fucking awful idea
URL http://urlte.am
Project status Online!
Archiving status In progress...
Project source Old: urlteam-stuff tinyback tinyarchive

New: terroroftinytown-client-grab terroroftinytown

Project tracker http://tracker.archiveteam.org:1337/ (HTTPS)
IRC channel #urlteam (on EFnet)
Project lead Unknown

TinyURL, bit.ly and other similar services allow long URLs to be converted to smaller ones on their specific service; the small URL is visited by a consumer and their web browser is redirected to the long URL.

Such services are a ticking timebomb. If they go away, get hacked or sell out millions of links will be lost (see Wikipedia: Link Rot). Archive.org/301Works is acting as an escrow for URL shortener databases, but they rely on URL shorteners to actually give them their databases. Even 301Works founding member bit.ly does not actually share their databases and most other big shorteners don't share theirs either.

301Work cooperation

301works logo.jpg

The fine folks at archive.org have provided us with upload permissions to the 301Works archive: http://www.archive.org/details/301utm. They unfortunately do not want to make them downloadable, but the same data is in our torrents too, just in a different format (we use pipe-delimited, xz-compressed files while 301works uses comma-delimited uncompressed files).

Tools

Terror of Tiny Town

The easiest way to help with scraping is to run the Warrior and select the URLTeam 2 project. You can also run ToTT outside the warrior; to do so, follow the instructions at https://github.com/ArchiveTeam/terroroftinytown-client-grab.

URL shorteners

New table

The new table includes shorteners we have already started to scrape.

Name Est. number of shorturls Scraping done by Status Comments
http://tinyurl.com tinyurl_7 +/- 10,000,000,000 Warrior In progress (more than 2,490,442,400 checked: earliest incremental dump 2014-11-22)

done: sequential to zzzzzz

current: non-sequential, 7 characters
http://tinyurl.hu tinyurl-hu_4 +/- ? Warrior In progress (more than 20,188,300 checked: earliest incremental dump 2015-08-13)
http://bit.ly bitly_6 +/- 50,000,000,000 Warrior In progress (more than 3,816,453,100 checked; earliest incremental dump 2014-11-22)

done: non-sequential 6 characters

current: non-sequential, 6 characters
http://goo.gl ? User:Scumola started (2011-03-04) goo.gl throttles pulls
http://is.gd isgd_6 +/- 934,134,706 (2013-05-20) Warrior In progress (more than 2,053,365,800 checked; earliest incremental dump 2014-11-22)

done: sequential up to ZZZZZ

new shorturls: non-sequential, 6 characters
http://ff.im ? User:Chronomex only used by FriendFeed, no interface to shorten new URLs
http://4url.cc 1279 (2009-08-14)[1] User:Chronomex dead (2011-02-15)
http://litturl.com 17096 (2010-04-15)[2] User:Chronomex dead (2010-11-18)
http://xs.md 3084 (2009-08-15)[3] User:Chronomex done dead (2010-11-18)
http://url.0daymeme.com 14867 (2009-08-14)[4] User:Chronomex done dead (2010-11-18)
http://tr.im (old) 1990425 - got what we could dead (2011-12-31)
http://tr.im/ (new) tr-im_gravity4 +/- ? Warrior checked 42,813,150 between 2014-12-01 and 2014-12-12; done: sequential to 42pzz dead (2014-JUL-17) ; Appears incremental - Ex: http://tr.im/44tn2 http://tr.im/44tn4
visibli (hex) 16777216 User:Chfoo Warrior In progress
Done. 15104865 301MB
Using links.sharedby.co/links/ as URL prefix. These results are already in the torrent. Latest version uploaded to IA
http://sharedby.co sharedby-co_6 +/- Warrior In progress (more than 40,759,850 checked; earliest incremental dump 2015-01-29) (Also see http://vsb.li. Double redirects via USERNAME.sharedby.co/share/XXXXXX ) (and http://shrd.by )
http://ur1.ca ur1-ca +/- ? Warrior 10,874,150 checked between 2014-12-17 and 2014-12-20 new shorturls: sequential ; FOSS, run by StatusNet; claims to offer a download of their database, but it just contains garbage
http://snipurl.com snipurl +/- snipurl_range2 +/- ? Warrior First range: 181,015,750 checked between 2015-01-24 and 2015-03-06; range2: 293,372,600 checked between 2015-03-06 and 2015-04-20 new shorturls: sequential ; snipr.com / snipurl.com / snurl.com - Appears incremental - Ex: http://snipr.com/27nvst http://snipr.com/27nvtt. snipr.com and snipurl.com work but appear infected with malware.
http://post.ly (Posterous) ? Warrior/EC2 done dead
http://vbly.us (formerly vb.ly) vbly-us +/- ? Warrior 624,000 checked between 2015-01-11 and 2015-01-12 new shorturls: sequential
http://arseh.at arseh-at +/- ? Warrior Appears down (1,842,350 checked between 2015-01-11 and 2015-01-15) new shorturls: sequential
http://zapd.co Zapd 326592 User:Chfoo Done. 144093 1.7M xxxx.zapd.co. Uploaded to IA
http://bre.ad Bre.ad 120932351 User:Chfoo Incomplete (59771889 examined). 54506 1.2MB de.ad (2013-11-18). Uploaded to IA

Got what I can without overloading their EC2 instance.

http://1r.hu 1r-hu +/- Warrior Done (1,346,050 checked on 2014-12-02)
http://2jump.info 2jump-info +/- Warrior Banned (298,949 checked on 2014-12-02)
http://adjix.com adjix +/- Warrior Appears down (​8,615,100 checked between 2014-12-13 and 2015-1-24)
http://alturl.com alturl-com +/- Warrior 39,367,900 checked between 2015-03-06 and 2015-05-27 Appears to redirect to http://shorturl.com ; Probably sequential/loweralpha - Ex: http://alturl.com/wqok
http://ar.gy ar-gy +/- Warrior Appears down (3,303,100 checked between 2014-12-17 and 2014-12-18) Argyle Social, main page 404s, existing urls still work
http://awe.sm awe-sm +/- Warrior 967,591,000 checked between 2014-12-24 and 2015-04-04 main page redirects, doesn't allow for new urls to be publicly shortened, existing urls still work
http://burl.se burl-se +/- Warrior 3,050 checked on 2014-11-06 Incremental. Ex: http://burl.se/428
http://feedly.com/e/ feedly_8 +/- Warrior Banned (129,762,450 checked between 2015-01-03 and 2015-01-03)
http://kcy.me kcy-me +/- Warrior 2,952,100 checked on 2014-11-06
http://korta.nu korta-nu +/- Warrior 12,767,850 checked between 2015-04-11 and 2015-05-27
http://mysp.ac mysp-ac +/- Warrior 16,245,049 checked between 2015-02-09 and 2015-04-04
http://nig.gr nig-gr +/- Warrior 279,800 checked on 2014-11-06
http://ow.ly ow-ly +/- Warrior 367,074,900 checked between 2015-01-20 and 2015-03-10 new shorturls: sequential ; (aliases: http://htl.li & http://ht.ly )
http://ph.ly ph-ly +/- Warrior 989,290,500 checked between 2014-12-13 and 2015-04-04 Related to the pond called Philadelphia, where links are born and raised, doesn't allow for new urls to be publicly shortened, existing urls still work
http://piciurl.hu piciurl-hu +/- Warrior 255,200 checked on 2015-04-11 Incremental
http://pub.vitrue.com pub-vitrue-com +/- Warrior 15,067,550 checked between 2014-11-16 and 2014-11-22 Now part of Oracle
http://shar.es shar-es +/- Warrior Down (1,031,784,400 checked between 2014-12-13 and 2015-04-08) Still resolves URLs, but the homepage is 404; related to http://sharethis.com
http://shrt.st shrt-st +/- Warrior Appears down (738,600 checked on 2014-11-06) doesn't allow new urls to be shortened, existing urls still work. Appears incremental - Ex: http://shrt.st/vpz
http://srtn.us srtn-us +/- Warrior 54,100 checked on 2014-11-06 still resolves URLs, but site just shows blank page
http://t7.hu t7-hu +/- Warrior 585,800 checked on 2014-12-13 Doesn't make any more shorturls
http://tighturl.com tighturl-com +/- Warrior 3,123,300 checked between 2014-11-16 and 2014-11-22 Appears incremental: http://tighturl.com/30xu http://tighturl.com/30xv
http://trap.it trap-it +/- Warrior 3,130,300 checked between 2015-01-01 and 2015-02-20
http://u.to u-to +/- Warrior In progress (more than 48,782,049 checked: earliest incremental dump 2015-05-02)
http://u4.hu u4-hu +/- Warrior Appears down (178,250 checked on 2014-11-16)
http://v.gd vgd_6 +/- ? Warrior In progress (more than 1,645,279,750 checked; earliest incremental dump 2015-02-14)
http://viddy.it viddy viddy-it +/- Warrior partial; 832,351,700 checked between 2014-11-16 and 2014-12-18 dead
http://waa.ai waa-ai +/- Warrior 2,151,400 checked on 2014-11-16
http://x.co xco +/- Warrior 80,319,800 checked between 2014-11-06 and 2014-11-22 Appears incremental - Ex: http://x.co/1IxUV http://x.co/1IxUW; but custom ones also exist (up to 10 characters)
http://xrl.us xrl-us +/- xrl-us_lowercase +/- Warrior self-saved
xrl-us: 161,601,700 checked between 2014-12-12 and 2015-01-10
xrl-us_lowercase: 28,675,900 checked between 2015-01-10 and 2015-01-14
Thank you Metamark for the database dump!
http://y.ahoo.it y-ahoo-it_5 +/- y-ahoo-it_6 +/- y-ahoo-it_8 +/- Warrior Partial
y-ahoo-it_5: 982,090,300 checked between 2014-11-06 and 2015-02-25
y-ahoo-it_6: 1,670,279,150 checked between 2014-11-06 and 2015-04-03
y-ahoo-it_8: 1,952,022,300 checked between 2014-11-06 and 2015-04-04
Dead
http://yatuc.com yatuc +/- Warrior 597,150 checked on 2014-12-13 Not accepting new urls.
http://yoolink.to yoolink-to +/- Warrior 275,300 checked between 2014-11-16 and 2014-11-22
Name Number of shorturls Scraping done by Status Comments

For the latest TinyTown updates, please see chfoo's spreadsheet.

Alive

Last verified 2014-12-07. Original list last updated 2009-08-14.[5]

  • adf.ly - Ex: http://adf.ly/bnpYL
  • adfoc.us
  • ask.fm - Ex: ask.fm/a/40k05kgp
  • bc.vc
  • budurl.com - Appears non-incremental
  • buff.ly - Buffer App
  • ccl.hu
  • cf.ly (CashFly.com)
  • cli.gs - Appears non-incremental
  • cl.ly - CloudApp
  • cmt.com - Country Music Television
  • cur.lv (CoinURL.com)
  • decenturl.com - Not at all easy to scrape.
  • del.ly - sprinklr
  • df4.us - daringfireball.net
  • dld.bz - "private URL shortening service"
  • dlvr.it - Requires free login; then requires connecting to another service; URLs are shortened when sent through. ( as of 01:36, 2 November 2015 (EST))
  • doiop.com - Appears non-incremental
  • dwurl.hu - Allows public shortening; appears to give 6 character, mixed case alphabetic (no digits), non-incremental URLs, e.g. http://dwurl.hu/gMEtiA ( as of 01:36, 2 November 2015 (EST))
  • easyurl.net - Appears non-incremental. Ex: http://easyurl.net/afd2f
  • fav.me - Used by DeviantArt. Ex: http://fav.me/d31sfml
  • flip.it - Flipboard
  • flpbd.it - Flipboard
  • fnd.us (See offical shorteners)
  • fos.hu – incremental alphanumeric, but shares pattern with an image sharing service
  • fwdurl.net
  • gyar.eu
  • dft.ba
  • jdem.cz - Incremental with random (?) last digit - Ex: http://jdem.cz/bw388
  • kics.it – Restricted access to shourturl creation
  • linkbucks.com
  • ln.is - linkis.com
  • lnq.me
  • m112.hu
  • me2.hu
  • mgnet.me - for torrent magnet URIs.
  • migre.me - 5 character, mixed case alphanumeric, incremental, currently around rZIfF (as of 02:00, 2 November 2015 (EST))
  • miniurl.hu
  • moourl.com – Random
  • my.dot.tk/tweak - Appears non-incremental
  • nblo.gs
  • news.me
  • nohref.hu – Allows custom shorturl
  • notlong.com - Appears to be alpha-only - Ex: http://yeitoo.notlong.com/
  • nutshellurl.com - Appears incremental. 301s to a redirector script, which then 301s you to the destination.
  • owl.li
  • p.pw
  • pear.ly - Used by pearltrees.com. Ex: http://pear.ly/6J1H
  • pnut.co - see nutshellurl.com Ex: http://pnut.co/3a
  • po.st
  • prsm.tc - getprismatic.com
  • r.ebay.com
  • rod.gs - up to 3 characters, alphanumeric, creating new ones appears to hang (as of 02:14, 2 November 2015 (EST))
  • sdai.ly – Allows custom shorturl
  • shorl.com - Doesn't appear guessable - Ex: http://shorl.com/tisikestibahu
  • shorte.st
  • shrinkurl.us - Still resolves, but does not allow creating new URLs ("The URL you entered was not valid or did not exist.")
  • smarturl.eu / joturl.com - Doesn't appear guessable, HTML redirect.
  • smarturl.it - smartURL
  • soa.li - Gigya inc.
  • soc.li - Gigya inc.
  • spne.ws - Silicon Prairie News
  • spnsr.tw - sponsoredtweets.com
  • surl.co.uk - Many shortening options.
  • techme.me - Techmeme
  • tinyarrows.com / ta.gd / ri.ms / ➡.ws / ➨.ws / ➯.ws / ➔.ws / ➞.ws / ➽.ws / ➹.ws / ✩.ws / ✿.ws / ❥.ws / ›.ws / ⌘.ws / ‽.ws / ☁.ws - Appears non-incremental: uses user-defined words for URLs (e.g. http://➡.ws/URLTEAM)
  • tiny.cc - Appears non-incremental
  • totesz.hu/x – Allows custom shorturl
  • trib.al -- Does not appear to allow public creation of new short-URLs; owned by SocialFlow
  • twitthis.com
  • urlcut.com - "We are not currently accepting new redirects at this time." ; existing ones seem to still work, e.g. http://urlcut.com/1xvha (as of 02:09, 2 November 2015 (EST))
  • usite.hu/link.php – Numeric incremental, public database
  • vk.cc
  • y2u.be - meant for YouTube videos
  • yep.it

"Official" shorteners

  • abcn.ws - ABC News
  • bln.gs - Blingee (format: bln.gs/b/28fss0 and bln.gs/b/1)
  • bull.hn - Bullhorn Reach (format: bull.hn/l/19JQE/)
  • CokeURL.com - Coca-Cola
  • db.tt - Dropbox
  • di.sn - Disney
  • fb.me - Facebook
  • flic.kr - Flickr
  • fnd.us - Fundrazr.com
  • fxn.ws - Fox News
  • g.co - Google (used for Google products and services)
  • getpocket.com/s/ - Pocket
  • goo.gl - Google
  • go.usa.gov - USA Government (and since they control the Internets, it doesn't get much more official than this)
  • git.io - GitHub only URLs
  • gty.im - Getty Images (format: gty.im/488068439; links by editorial number)
  • gu.com - The Guardian (weird format - https://gu.com/p/3f7ca )
  • hub.me - HubPages
  • ift.tt - IFTTT
  • igg.me - Indiegogo
  • lnkd.in - LinkedIn
  • mfi.re - MediaFire
  • msft.it - Microsoft (or maybe something called "Sprinklr"?)
  • mysp.ac - Myspace
  • nydn.us - New York Daily News
  • off365.ms - Office 365
  • pocket.co - Pocket
  • post.ly - Posterous
  • redd.it - Reddit
  • reut.rs - Reuters
  • rsg.ms - Rockstar Games
  • skfb.ly - Sketchfab
  • spoti.fi - Spotify
  • stanford.io - Stanford University
  • su.pr - StumbleUpon
  • sx3.se - swedishstartupspace.se
  • t.co - Twitter
  • ti.me - Time Magazine
  • tmblr.co - Tumblr
  • uoft.me - University of Toronto
  • upl.nu - Ung Pirat (Youth Pirate Party, Sweden)
  • vstphl.ly - Visit Philly
  • wapo.st - Washington Post
  • wh.gov - White House (format: wh.gov/i3lXR)
  • wp.me - Wordpress.com
  • youtu.be - YouTube
  • hrts.me - University of Hertfordshire. Seems to be 5 characters long. a-z with usage of capitals and non capitals. Includes numbers. Mainly used on https://twitter.com/UniofHerts
bit.ly aliases

A bit.ly alias works just like a bit.ly URL. The shortcode is the same, it sets the same bit.ly cookie, and DNS resolving the address shows the IP addresses are the same as bit.ly. The homepage may be different however.

  • 1.usa.gov - USA Government
  • 4sq.com - Foursquare
  • 757live.ga
  • ada.ms
  • aje.me - Aljazeera
  • amzn.to - Amazon
  • arfo.sk
  • atfp.co - Foreign Policy
  • bbc.in - BBC
  • bbnew.be
  • bbybgrl.com
  • binged.it - Bing (bonus points for being longer than bing.com)
  • bnkrpt.am - Bankrupting America
  • bzfd.it - Buzzfeed
  • calltrack.es
  • canva.link
  • carrot.cr - Carrot Creative
  • cb.com - Career Builder
  • chzb.gr - Cheezeburger
  • cmplx.it - Complex Magazine
  • cnet.co - CNET
  • cnnmon.ie - CNN Money
  • conta.cc - Constant Contact Inc.
  • corb.is - Corbis Images
  • cot.ag
  • cpurl.net - Current Photographer.com
  • curbed.cc - Curbed.com
  • dag.gy
  • dennysd.in - Denny's Restaurants
  • dtoid.it - Destructoid
  • dwqn.me
  • econ.st - The Economist
  • emarketee.rs - Emarketeers
  • engri.sh - Engrish.com
  • eonli.ne - E! Online
  • es.pn - ESPN
  • fakes.pn - The Fake ESPN (at lockerdome.com)
  • fanpa.ge - Fanpage.it
  • feedly.com/k/ - redirect, see below for their own
  • fencetimesweep.com
  • flts.tk
  • fltsim.me
  • gaw.kr - Gawker
  • geekiss.im - Geekismo
  • grd.to - The Grid TO
  • grn.bz - GreenBiz
  • gtg.lu - GetGlue
  • hoblu.es - House of Blues
  • hub.am - HubSpot
  • huff.to - Huffington Post
  • ift.tt - IFTTT
  • j.mp - bit.ly[6]
  • joga.bo
  • jrnl.to - thejournal.ie
  • kck.st - Kickstarter
  • m.ly
  • marsdd.it - MaRS Discovery District
  • mbist.ro - MediaBistro
  • mojo.ly - Mother Jones
  • muo.fm - MakeUseOf
  • mwne.ws - MarketWired News
  • ncl.uz
  • nie.mn - Neiman Journalism Lab
  • nilegui.de
  • njlle.me
  • nokia.ly - Nokia
  • nyti.ms - New York Times
  • onforb.es - Forbes
  • onion.com - The Onion
  • pops.ci - Popular Science
  • popu.pe - Pop-Up Pantry
  • proof.ly
  • propub.ca - ProPublica
  • psxs.us
  • read.bi - Business Insider
  • rseo.co - realseo
  • s831.us - Studio831 - whatever that is
  • saggia.me
  • sbn.to - sbnation
  • sens.sc
  • shr.li
  • skygrid.me - SkyGrid
  • slackers.co - slackers.com
  • smle.us
  • squid.us - Laughing Squid
  • s.shr.lc - shareaholic - Naive, redirects any shortcode to bit.ly
  • stay.am
  • stjo.es - St. Joseph Media
  • tag.my
  • tcrn.ch - Techcrunch
  • theatln.tc - The Atlantic
  • tnw.co - The Next Web
  • tom.hn - Tom Hillenbrand
  • toms.sh - TOMS Shoes
  • tvt.ag - tvtag.com
  • txpr.de - TexasStore
  • unr.ly - Unruly media
  • usat.ly - USA Today Newspaper
  • vrge.co - The Verge
  • wkdb.it
  • yhoo.it - Yahoo! (not to be confused with y.ahoo.it, their non-bitly public url shortener)
  • zite.to - Zite

Dead or Broken

  • 1link.in - Website dead
  • 6url.com - HTML redirect, Error 500
  • ad.vu - mirror of adjix.com, application not found
  • bacn.me
  • be.vc
  • biglnk.com - dead, replaced with unrelated blog
  • bwtm.co - DNS fails to resolve.
  • calyp.co - Server error. 403 - Forbidden: Access is denied.
  • canurl.com - Website dead
  • chod.sk - Appears non-incremental, not resolving
  • come.to - Related to various .to shorteners. Started in 1997, killed in 2013 after parent company died.
  • catchyurl.co
  • da.co - Parked.
  • digg.com - discontinued - [1]
  • dwarfurl.com - Website dead/Numeric, appears incremental: http://dwarfurl.com/08041
  • easy.tc - DNS not resolving.
  • easyuri.com - Website dead/Appears hex incremental with last digit random/checksum: http://easyuri.com/1339f , http://easyuri.com/133a3
  • eqent.me - Improper redirect to bitly.
  • feedzil.la - Domain parked.
  • go2cut.com - Website dead
  • gob.li - Golbin Ridge Limited. Timed out
  • gonext.org - not resolving
  • go.to - sold its domains on Sedo apparently.
  • go2.me - everything 404s
  • gx.si
  • hashonomy.com - Timed out
  • href.hu
  • htcdev.net - DNS not resolving.
  • iawtp.me - DNS not resolving
  • icymi.me - DNS not resolving
  • ilix.in - domain parked
  • imfy.us - requires a recaptcha to get to the linked site, and avast goes nuts. DNS fails to resolve.
  • inspr.in - Inspired Beta. Can't find server
  • ix.it - Not resolving
  • jijr.com - Doesn't appear to be a shortener, now parked
  • joomlagyar.hu/usb - DNS not resolving
  • jump.to - dead as of February 1, 2013
  • kissa.be - "Kissa.be url shortener service is shutdown"
  • kl.am - "kl.am Closes its Shell"
  • kuijt.nu - replaced with unrelated site
  • kurl.us - Parked.
  • lk.to
  • lnkurl.com - Website dead
  • marv.ly - DNS fails to resolve.
  • mash.to - Cannot connect.
  • memurl.com - Pronounceable. Broken.
  • me.lt - Connection refused.
  • mens.hm - Not responding (timeout)
  • miklos.dk - Doesn't appear guessable: http://miklos.dk/!z7bA6a - "Vi arbejder på sagen..."
  • mindless.co
  • minilien.com - Doesn't appear guessable: http://minilien.com/?9nyvwnA0gh - Website dead
  • minim.in - Times out
  • minurl.org - Presently in ERROR 404
  • ms.me - Parked.
  • msplinks.com - Used by Myspace[2]
  • mtw.tl - everything 403s
  • muhlink.com - Not resolving
  • myloc.me
  • mytinyurl.com - redirects to an unrelated image
  • myurl.us - cpanel frontend
  • myurl.in
  • myv.bz - Not resolving
  • nyturl.com - NY Times (bonus points for being longer than nyt.com, which they own). Taken by squatters
  • onvzi.com - DNS fails to resolve.
  • otf.me - Empty WordPress site
  • ping.fm - Fails to resolve.
  • pln.so - Not working.
  • plzretwt.me - Fails to resolve.
  • pnt.me - Doesn't appear guessable, too big a space to bruteforce: http://pnt.me/FzAblc
  • pulsene.ws - Expired. Parked by GoDaddy.
  • qurlyq.com - Javascript redirect. Appears sequential: http://qurlyq.com/5nf. Domain parked.
  • re.ad - Fails to resolve.
  • redirx.com - Lowercase alpha only, appears sequential or guessable - Ex: http://redirx.com/?wyok. Website still online but does not resolve existing URLs nor does it allow creating new ones (responds with the message: blame the spammers)
  • see.sc - Fails to resolve.
  • s.me - Domain parked.
  • say.ly - redirects to unrelated site
  • s3nt.com - Probably sequential. http://s3nt.com/aa goes somewhere different from /ab . Domain parked.
  • shortlinks.co.uk - Working again. Maybe not.
  • short.to - Domain is parked - Probably sequential/loweralpha: http://short.to/msmp
  • shrinklink.co.uk - Doesn't appear sequential: http://www.shrinklink.co.uk/45bmx , www.shrinklink.co.uk/npk6xp . Domain parked.
  • shrtn.us - myshorturls.appspot.com. 404, does not resolve
  • simurl.com - Doesn't appear guessable - Ex: http://simurl.com/panpes. Website is blank; does not resolve URLs ("This SimURL is now inactive")
  • smf.is - DNS not resolving.
  • sns.mx - SNS Analytics, domain parked
  • sq.com - Now redirects to Singapore Airlines.
  • surl.hu
  • tiny.ly - DNS not resolving.
  • tm.to - Twtmore has "flown away"
  • to.gg - Global Giving, everything 503s
  • traceurl.com - DNS fails to resolve.
  • tr.im (1st generation) - "Be back soon!"
  • tweetburner.com / twurl.nl - Appears incremental, everything 404s
  • twixar.com - "Estamos fora do ar por algum tempo, mas estamos trabalhando para voltar a oferecer o serviço para encurtar URLs longa em breve!"
  • twthpr.co - DNS not resolving.
  • twitpwr.com - Domain parked.
  • twitt.hu
  • u.mavrev.com - Stopped accepting new urls. Now times out
  • u.nu - "The shortest URLs. period." Website dead since at least 1st of october 2010 (http://web.archive.org/web/20100104023208/http://u.nu/)
  • url9.com - Sequential, alphanumeric. Leading 0s are significant. "The site is working correctly."
  • urlborg.com - 404 Not Found.
  • urlcover.com - Domain parked.
  • urlhawk.com - Domain parked.
  • url-press.com - Suspended by web host.
  • urlsinn.com - DNS not resolving.
  • urlsmash.com - DNS not resolving.
  • urltea.com - Dreamhost's coming soon page.
  • urlvi.be - Domain parked.
  • urlx.org - Owner has agreed to share his database
  • uxp.in - still resolves URLs, but site just shows blank page. Domain parked.
  • vibemag.co - Vibe Magazine. Times out
  • vsb.li / links.visibli.com/links/ - The latter uses truncated md5 hex string. See sharedby.co.
  • w3t.org - 403 Forbidden.
  • wlink.us - Domain parked.
  • wl.tl - DNS not resolving.
  • wwy.me
  • xaddr.com - Domain parked.
  • xil.in - Under construction.
  • x.se - Cannot resolve, but www.x.se works.
  • xym.kr - Gibberish (?) Korean text blog.
  • y.ahoo.it - Yahoo
  • yweb.com - Suspicious iframe with long url and fake loading gif image.
  • zi.ma - DNS not resolving.
  • zip.sm - was a redirect to joturl.com. Now times out

Discontinued

  • adjix.com - Still resolves URLs, but site does not work: "The requested application was not found on this server." - Is static host on AWS service.[7]
  • feedly.com/e/ - realized that URL shorteners were bad [8]. Non-cooperative.
  • metamark.net / xrl.us - no longer allowing new urls to be shortened, existing urls still work (Ex. http://xrl.us/bfabog). Uploaded a database dump to Internet archive.
  • urlbrief.com - co-operates with 301Works.org

Hueg list

[3]

Archives

Check out Audit2014 and help audit the archives. In particular, the stuff not on Internet Archive needs to be uploaded.

References

Weblinks

Common URL shortening software

Ha-ha! Please don't run a URL shortening service.


v · t · e         Archive Team
Current events

Alive... OR ARE THEY · Deathwatch · Projects

Archiveteam.jpg
Archiving projects

APKMirror · Archive.is · BetaArchive · Government Backup (#datarefuge · ftp-gov· Gmane · Internet Archive · It Died · Megalodon.jp · OldApps.com · OldVersion.com · OSBetaArchive · TEXTFILES.COM · The Dead, the Dying & The Damned · The Mail Archive · UK Web Archive · WebCite · Vaporwave.me

Blogging

Blog.pl · Blogger · Blogster · Blogter.hu · Freeblog.hu · Fuelmyblog · Jux · LiveJournal · My Opera · Nolblog.hu · Open Diary · ownlog.com · Posterous · Powerblogs · Proust · Roon · Splinder · Tumblr · Vox · Weblog.nl · Windows Live Spaces · Wordpress.com · Xanga · Yahoo! Blog · Zapd

Cloud hosting/file sharing

aDrive · AnyHub · Box · Dropbox · Docstoc · Google Drive · Google Groups Files · iCloud · Fileplanet · LayerVault · MediaCrush · MediaFire · Mega · MegaUpload · MobileMe · OneDrive · Pomf.se · RapidShare · Ubuntu One · Yahoo! Briefcase

Corporations

Apple · IBM · Google · Loblaw · Lycos Europe · Microsoft · Yahoo!

Events

Arab Spring · Great Ape-Snake War · Spanish Revolution

Font Repos

DaFont · Google Web Fonts · GNU FreeFont · Fontspace

Forums/Message boards

4chan · Captain Luffy Forums · College Confidential · DSLReports · ESPN Forums · forums.starwars.com · HeavenGames · Invisionfree · NeoGAF · The Classic Horror Film Board · Yahoo! Messages · Yahoo! Neighbors · Yuku.com

Gaming

Atomicgamer · Bazaar.tf · City of Heroes · Club Nintendo · Counter-Strike: Global Offensive · CS:GO Lounge · Desura · Dota 2 · Dota 2 Lounge · Emulation Zone · ESEA · GameBanana · GameMaker Sandbox · GameTrailers · Halo · HLTV.org · HQ Trivia · Infinite Crisis · joinDOTA · League of Legends · Liquipedia · Minecraft.net · Player.me · Playfire · Raptr · Steam · SteamDB · Team Fortress 2 · TF2 Outpost · Warhammer · Xfire

Image hosting

500px · AOL Pictures · Blipfoto · Blingee · Canv.as · Camera+ · Cameroid · DailyBooth · Degree Confluence Project · deviantART · Demotivalo.net · Flickr · Fotoalbum.hu · Fotolog.com · Fotopedia · Frontback · Geograph Britain and Ireland · GTF Képhost · ImageShack · Imgh.us · Imgur · Inkblazers · Instagram · Kepfeltoltes.hu · Kephost.com · Kephost.hu · Kepkezelo.com · Keptarad.hu · Madden GIFERATOR · MLKSHK · Microsoft Clip Art · Microsoft Photosynth · Nokia Memories · noob.hu · Odysee · Panoramio · Photobucket · Picasa · Picplz · Pixiv · Portalgraphics.net · PSharing · Ptch · puu.sh · Rawporter · Relay.im · ScreenshotsDatabase.com · Snapjoy · Streetfiles · Tabblo · Tinypic · Trovebox · TwitPic · Wallbase · Wallhaven · Webshots · Wikimedia Commons

Knowledge/Wikis

arXiv · Citizendium · Clipboard.com · Deletionpedia · EditThis · Encyclopedia Dramatica · Etherpad · Everything2 · infoAnarchy · GeoNames · GNUPedia · Google Books (Google Books Ngram· Horror Movie Database · Insurgency Wiki · Knol · Lost Media Wiki · Neoseeker.com · Notepad.cc · Nupedia · OpenCourseWare · OpenStreetMap · Orain · Pastebin · Patch.com · Project Gutenberg · Puella Magi · Referata · Resedagboken · SongMeanings · ShoutWiki · The Internet Movie Database · TropicalWikis · Uncyclopedia · Urban Dictionary · Urban Exploration Resource · Webmonkey · Wikia · Wikidot · WikiHow · Wikkii · WikiLeaks · Wikipedia (Simple English Wikipedia· Wikispaces · Wikispot · Wik.is · Wiki-Site · WikiTravel · Word Count Journal

Magazines/Blogs/News

Cyberpunkreview.com · Game Developer Magazine · Gigaom · Hardware Canucks · Helium · JPG Magazine · Make Magazine · Polygamia.pl · San Fransisco Bay Guardian · Scoop · Regretsy · Yahoo! Voices

Microblogging

Heello · Identi.ca · Jaiku · Mommo.hu · Plurk · Sina Weibo · Twitter · TwitLonger

Music/Audio

AOL Music · Audimated.com · Cinch · digCCmixter · Dogmazic.net · Earbits · exfm · Free Music Archive · Gogoyoko · Indaba Music · Instacast · Jamendo · Last.fm · Music Unlimited · MOG · PureVolume · Reverbnation · ShareTheMusic · SoundCloud · Soundpedia · This Is My Jam · TuneWiki · Twaud.io · WinAmp

People

Aaron Swartz · Michael S. Hart · Steve Jobs · Mark Pilgrim · Dennis Ritchie · Len Sassaman Project

Protocols/Infrastructure

FTP · Gopher · IRC · Usenet · World Wide Web
BitTorrent DHT

Q&A

Askville · Answerbag · Answers.com · Ask.com · Askalo · Baidu Knows · Blurtit · ChaCha · Experts Exchange · Formspring · GirlsAskGuys · Google Answers · Google Baraza · JustAnswer · MetaFilter · Quora · Retrospring · StackExchange · The AnswerBank · The Internet Oracle · Uclue · WikiAnswers · Yahoo! Answers

Recipes/Food

Allrecipes · Epicurious · Food.com · Foodily · Food Network · Punchfork · ZipList

Social bookmarking

Addinto · Backflip · Balatarin · BibSonomy · Bkmrx · Blinklist · BlogMarks · BookmarkSync · CiteULike · Connotea · Delicious · Designer News · Digg · Diigo · Dir.eccion.es · Evernote · Excite Bookmark · Faves · Favilous · folkd · Freelish · Getboo · GiveALink.org · Gnolia · Google Bookmarks · Hacker News · HeyStaks · IndianPad · Kippt · Knowledge Plaza · Licorize · Linkwad · Menéame · Microsoft Developer Network · myVIP · Mister Wong · My Web · Mylink Vault · Newsvine · Oneview · Pearltrees · Pinboard · Pocket · Propeller.com · Reddit · sabros.us · Scloog · Scuttle · Simpy · SiteBar · Slashdot · Squidoo · StumbleUpon · Twine · Vizited · Yummymarks · Xmarks · Yahoo! Buzz · Zootool · Zotero

Social networks

Bebo · BlackPlanet · Classmates.com · Cyworld · Dogster · Dopplr · douban · Ello · Facebook · Flixster · FriendFeed · Friendster · Friends Reunited · Gaia Online · Google+ · Habbo · hi5 · Hyves · iWiW · LinkedIn · Miiverse · mixi · MyHeritage · MyLife · Myspace · myVIP · Netlog · Odnoklassniki · Orkut · Plaxo · Qzone · Renren · Skyrock · Sonico.com · Storylane · Tagged · tvtag · Upcoming · Viadeo · Vine · Vkontakte · WeeWorld · Weibo · Wretch · Yahoo! Groups · Yahoo! Stars India · Yahoo! Upcoming · more sites...

Shopping/Retail

Alibaba · AliExpress · Amazon · Apple Store · Barnes & Noble · DirectCanada · eBay · Kmart · NCIX · Printfection · RadioShack · Sears · Sears Canada · Target · The Book Depository · ThinkGeek · Toys "R" Us · Walmart

Software/code hosting

Android Development · Alioth · Assembla · BerliOS · Betavine · Bitbucket · BountySource · Codecademy · CodePlex · Freepository · Free Software Foundation · GNU Savannah · GitHost  · GitHub · GitHub Downloads · Gitorious · Gna! · Google Code · ibiblio · java.net · JavaForge · KnowledgeForge · Launchpad · LuaForge · Maemo · mozdev · OSOR.eu · OW2 Consortium · Openmoko · OpenSolaris · Ourproject.org · Ovi Store · Project Kenai · RubyForge · SEUL.org · SourceForge · Stypi · TestFlight · tigris.org · Transifex · TuxFamily · Yahoo! Downloads

Television/Radio

ABC · Austin City Limits · BBC · CBC · CBS · Computer Chronicles · CTV · Fox · G4 · Global TV · Jeopardy! · NBC · NHK · PBS · Penn & Teller: Bullshit! · The Howard Stern Show · TV News Archive (Understanding 9/11)

Torrenting/Piracy

ExtraTorrent · EZTV · isoHunt · KickassTorrents · The Pirate Bay · Torrentz · Library Genesis

Video hosting

Academic Earth · Bambuser · Blip.tv · Epic · Google Video · Justin.tv · Niconico · Nokia Trailers · Oddshot.tv · Plays.tv · Qwiki · Skillfeed · Stickam · TED Talks · Ticker.tv · Twitch.tv · Ustream · Videoplayer.hu · Viddler · Viddy · Vidme · Vimeo · Vine · Vstreamers · Yahoo! Video · YouTube · Famous Internet videos (Me at the zoo)

Web hosting

Angelfire · Brace.io · BT Internet · CableAmerica Personal Web Space · Claranet Netherlands Personal Web Pages · Comcast Personal Web Pages · Extra.hu · FortuneCity · Free ProHosting · GeoCities (patch· Google Business Sitebuilder · Google Sites · Internet Centrum · MBinternet · MSN TV · Nifty · Nwnyet · Parodius Networking · Prodigy.net · Saunalahti Iso G · Swipnet · Telenor · Tripod · University of Michigan personal webpages · Verizon Mysite · Verizon Personal Web Space · Webzdarma · Virgin Media

Web applications

Mailman · MediaWiki · phpBB · Simple Machines Forum · vBulletin

Information

A Million Ways to Die on the Web · Backup Tips · Cheap storage · Collecting items randomly · Data compression algorithms and tools · Dev · Discovery Data · DOS Floppies · Fortress of Solitude · Keywords · Naughty List · Nightmare Projects · Rescuing floppy disks · Rescuing optical media · Site exploration · The WARC Ecosystem · Working with ARCHIVE.ORG

Projects

ArchiveCorps · Audit2014 · Emularity · Faceoff · FlickrFckr · Froogle · INTERNETARCHIVE.BAK (Internet Archive Census· IRC Quotes · JSMESS · JSVLC · Just Solve the Problem · NewsGrabber · Project Newsletter · Valhalla · Web Roasting (ISP Hosting · University Web Hosting· Woohoo

Tools

ArchiveBot · ArchiveTeam Warrior (Tracker· Google Takeout · HTTrack · Video downloaders · Wget (Lua · WARC)

Teams

Bibliotheca Anonoma · LibreTeam · URLTeam · Yahoo Video Warroom · WikiTeam

Other

800notes · AOL · Akoha · Ancestry.com · April Fools' Day · Amplicate · AutoAdmit · Bre.ad · Circavie · Cobook · Co.mments · Countdown · Distill · Dmoz · Easel · Eircode · Electronic Frontier Foundation · FanFiction.Net · Feedly · Ficlets · Forrst · FunnyExam.com · FurAffinity · Google Helpouts · Google Moderator · Google Reader · ICQmail · IFTTT · Jajah · JuniorNet · Lulu Poetry · Mobile Phone Applications · Mochi Media · Mozilla Firefox · MyBlogLog · NBII · Neopets · Quantcast · Quizilla · Salon Table Talk · Shutdownify · Slidecast · SOPA blackout pages · starwars.yahoo.com · TechNet · Toshiba Support · USA-Gov · Volán · Widgetbox · Windows Technical Preview · Wunderlist · YTMND · Zoocasa

About Archive Team

Introduction · Philosophy · Who We Are · Our stance on robots.txt · Why Back Up? · Software · Formats · Storage Media · Recommended Reading · Films and documentaries about archiving · Talks · In The Media · FAQ