Talk:SourceForge

From Archiveteam
Revision as of 11:50, 14 November 2015 by Its notjack (talk | contribs)

BerliOS seems to still be online (?) its_notjack 06:50, 14 November 2015 (EST)


Script to download files: https://github.com/SpiritQuaddicted/sourceforge-file-download


Some discovery of the various parts and software packages a project can have follows. The URL patterns are shown with the random project names where they were found; imagine $projectname in their place.

Issue trackers can have multiple, arbitrary names:

http://sourceforge.net/p/scummvm/bugs/

http://sourceforge.net/p/scummvm/feature-requests/

http://sourceforge.net/p/dungeonsofdecay/tickets/

  • The admin interface lets you create any number of issue trackers (called "tickets" in the interface) with arbitrary names. We'll need to find them by parsing the summary page ( http://sourceforge.net/p/scummvm )
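
A rough sketch of that parsing step, assuming the summary page links to each tool under /p/$projectname/<name>/ (which matches the URLs listed here but has not been checked against the real markup). The names ToolLinkParser and find_tool_names are made up for illustration. Note it collects every tool linked under the project, not only trackers; telling trackers apart from forums, wikis and so on would need a closer look at the page.

```python
from html.parser import HTMLParser
from urllib.parse import urlparse


class ToolLinkParser(HTMLParser):
    """Collect the first path segment after /p/<project>/ from every <a href>."""

    def __init__(self, project):
        super().__init__()
        self.prefix = f"/p/{project}/"
        self.tools = set()

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            return
        # href may be missing or valueless, so fall back to an empty string
        href = dict(attrs).get("href") or ""
        path = urlparse(href).path
        if path.startswith(self.prefix):
            name = path[len(self.prefix):].split("/", 1)[0]
            if name:  # skip the bare project root /p/<project>/
                self.tools.add(name)


def find_tool_names(summary_html, project):
    """Return the sorted tool names linked from a project's summary page HTML."""
    parser = ToolLinkParser(project)
    parser.feed(summary_html)
    return sorted(parser.tools)
```

Feeding it the downloaded HTML of http://sourceforge.net/p/scummvm with project "scummvm" should give back names such as bugs and feature-requests, provided the links really are laid out that way.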

http://sourceforge.net/p/scummvm/mailman/

http://sourceforge.net/p/dvdstyler/discussion/

http://sourceforge.net/p/doublecmd/forum/

  • This appears to be a standard instance of phpBB (which we hopefully know how to archive?)

http://sourceforge.net/p/doublecmd/news/

http://sourceforge.net/p/doublecmd/code

  • this looks like browsable repos; we probably don't want to scrape these

http://sourceforge.net/p/scummvm/patches/

  • we don't want feeds, so reject: patches/[0-9]+/feed\.(atom|rss)
  • attachments are in scummvm/patches/_discuss/thread/
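
As a small sketch of how that reject pattern could be applied when filtering a URL list: the function name should_fetch is made up, and the pattern is exactly the one above, so it only covers trackers named "patches". Other tracker names would need their own patterns.

```python
import re

# Reject pattern from the note above: per-ticket Atom/RSS feeds under patches/
FEED_RE = re.compile(r"patches/[0-9]+/feed\.(atom|rss)")


def should_fetch(url):
    """Return False for feed URLs matched by the reject pattern, True otherwise."""
    return FEED_RE.search(url) is None
```

With this, a URL ending in patches/123/feed.atom is skipped, while patches/123/ and anything under patches/_discuss/thread/ is kept.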

http://sourceforge.net/projects/dvdstyler/reviews

  • these are simply some pages selected with a ? query string, not a directory

http://sourceforge.net/p/dvdstyler/wiki/

Donation links appear to be just redirects to a PayPal URL and seem to be of the form: http://sourceforge.net/p/scummvm/donate/
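
For what it's worth, the patterns found so far could be expanded for any project along these lines. This is only a sketch: URL_PATTERNS and candidate_urls are made-up names, and treating these tool names as fixed is an assumption, since a given project may lack some of them or mount them under other names. Issue trackers are left out because their names are arbitrary.

```python
from string import Template

# URL patterns as observed above, with the example project replaced by $projectname.
# Issue trackers (bugs, tickets, patches, ...) are omitted: their names are arbitrary.
URL_PATTERNS = [
    "http://sourceforge.net/p/$projectname/mailman/",
    "http://sourceforge.net/p/$projectname/discussion/",
    "http://sourceforge.net/p/$projectname/forum/",
    "http://sourceforge.net/p/$projectname/news/",
    "http://sourceforge.net/p/$projectname/code",
    "http://sourceforge.net/projects/$projectname/reviews",
    "http://sourceforge.net/p/$projectname/wiki/",
    "http://sourceforge.net/p/$projectname/donate/",
]


def candidate_urls(projectname):
    """Return the candidate URLs to probe for one project (not all will exist)."""
    return [Template(p).substitute(projectname=projectname) for p in URL_PATTERNS]
```

Each candidate would still need to be checked, since a missing tool presumably returns an error page or a redirect.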

wiki might be hosted elsewhere

homepage might be hosted elsewhere


domains from which files are served


It's somewhat lower priority, but a download stats API seems to be documented here: http://sourceforge.net/p/forge/documentation/Download%20Stats%20API/