I am curious to hear from anyone, particularly smaller institutions, that have implemented DIY solutions with Archivematica, RODA, etc. We have a non-trivial amount of data (300TB - 400TB) that would be cost prohibitive in vendor-based pure cloud scenario using S3 and Glacier.
It seems to me the most cost effective method is to have data on-site in duplicate and replicated to Amazon Glacier, perhaps using a commercial system for the added support. However, its seems (in my ambitious mind) possible to manage a system like RODA in-house and handle the replication ourselves, needing only to pull from Glacier if the fixity checking reports an error. Can someone disabuse me of my fanciful notions? Any horror stories or success stories? Am I right to assume that as long as the bags are stored in Glacier we could migrate those to another system later on if we found maintaining our own system too unwieldy? How horrifying is it to all of you to rely on Glacier as a fail safe option?