What naming convention do you use for S3 keys?

uruzseven · on April 3, 2008

I think this is dependant on what you're doing with S3. I'm only using it for backup and use the tarball name and date/timestamp as the key name. This ensures that it's unique and easy to understand when you need to restore.

If I were going to use them for hosting my videos or something, I'd probably just use the GUID without the need for the MD5. Just make sure there's not chance of duplicates in the database schema and it should work.

I use Bacula for network backups and it does the same thing. All paths and filenames are stored as unique ID numbers.

sam · on April 3, 2008

Yeah, using the GUID works, but then it opens us up to someone coming in and just scraping every file we have on S3 which would be bad for us. I'm leaning towards encrypting/obfuscating the ids... any suggestions for python libraries suited to this task?

sam · on April 3, 2008

Hmm. I just realized that if they key is the MD5 digest, then I can't independently store/delete identical files. Maybe the best bet is to encrypt the GUID.