I actually joined this forum after being a lurker at the old boards for something like 2 years or so. Frankly, I find them a fascinating wealth of information. When I read that they were being deleted i was like
and then
. As a result, I went on a search to hunt down a way to save the interesting and useful information, especially since it seem that there is an interest in getting rid of the old forums. So here are my personal ideas:
1)
Anyway, contact them at info@archive.org
They're datahoarders - I'm fairly sure they'll take it.
The archive is a great resource, but I think it can take a few months (like 6 or so, maybe up to 2 years) before everything gets ported over.
2)
So write a script to read old post text, and post it to the new system as data. Hell, you could even call some Bayesian library functions to filter the spam inline.
Should be too hard if you have an XML dump (or similar) of the old stuff.
This sounds nontrivial. Is this nontrivial? I have no idea. If it would work, that'd be great, but, again, to my under-educated ear, it sounds nontrivial.
I don't think this should be too terribly hard. I did some digging and I *think* Forumotion
http://www.phpbb.com/community/viewtopic.php?f=65&t=861505 would be able to get this done. However, according to the documentation, it works best when you have:
A Web Server with [preferably] MySQL.
Forum Software installed on this web server, configured and ready to go with phpBB or SMF
Database Access - need to be able to insert, delete and truncate
Python (versions 2.5 to 2.6 will work (probably 2.7 as well), but Python 3 will NOT work!) -- download from
http://www.python.orgI haven't had time to look into this, but people at phpbb.com state that it can work without needing access to the database, you just don't get all the user information/profiles/etc., but the directories, threads, and posts should remain intact (or so I understand).
3) Dump the threads/directories that don't matter so much and don't really contribute to creativity bulk information. In my opinion, this would include (feel free to disagree, just my opinion, no offense to others intended) almost everything under: "Board Business", "Show Stuff", "General Gab", "Playtime!", and "Trash bin". This would eliminate ~176000 of the ~429000 posts or about 41% of them. Some of the most heavily externally linked information that I have found is located under "Min/Max It!" and especially it's children boards "Handbooks" and "You Break It, You Buy It". If it wouldn't be too terribly expensive (I have no understanding of the costs involved), but many of these boards could be copied over, locked, and placed under an "archived" section of the new forums or just dropped wholly into an appropriate forum over here. If you locked all of them, it would preserve their locations and prevent zombifying any of the threads from the old site as well.
At the very least it would be nice to keep the optimization information, handbooks, and "special" overly broken builds to prevent people form having to reinvent the wheel, as these section took ALOT of work to pull together by a dedicated community over a fairly long period of time.
Just some thoughts, I might have time this weekend to play around a bit.