Firesteel is a great metaphor for for system clustering, HA and DR. Firesteel or “ferrocerium rod” is a metal rod with properties similar to flint and steel. The purpose is to make fire. These rods are used in two parts. The rod and the striker. The striker is very simple and can be similar to a tab of a poptop can or the spine of any knife (blade too but that’s not a great idea even though the rod is a softer metal)The metaphor…Firesteel comes in a number of lengths and sizes. So the question is which size is the right size?First of all the firesteel should be just one of the three combustion options you have in a true DR situation. A normal lighter, firesteel and magnifying lens are ideal. You should also know how to build a bow drill and a compression stick.But choosing the right length, width, use-count, and packaging are all very important.the longer and wider the rod the more it costs. You can pay $30 for a bare 6in long 1/2in wide rod.some smaller rods are connected on some cordage to keep the striker close to the rod.some rods have plastic holders for ease of use and will also have a whistlesome are just meant to be worn around your neck or attached to you sheath.not all rods are the same. They vary in the number of strikes (regardless if there are sparks) 12,000, 10,000, 3000, 1500. I have noticed that the 1500 generates bigger sparks and the rod is softer so I must be shaving off more material per strike,And while all of this is reasonable there is one big weakness of the rods. Saltwater. I’m told that saltwater is a firesteel’s mortal enemy. If the rod is submerged in saltwater it will not strike or I assume degraded.So as a metaphor for system DR and HA this is it.Have a few extra rods with you. Put some in ziplocks in your pack, hang on your knife or belt and accessible on your pack. Assume you’re going to lose them and have a backup like a lighter or bowdrill.Have a few extra machines in your network. plan to share the workload, keep some hardware in reserve, do not put all your money into just a few machines.STORY: I have a cluster of CoreOS machines in my PAN (private area network). I use them to develop my CoreOS and container ideas as well as staging work product for production. Something went very wrong last night when my wireless and wired networks became partitioned. I’m still not sure what happened but it’s easy to recognize that my [a] and [b] plans failed. If the system had not corrected itself this morning I would have been moving to [c] which would allow me to get back to work but I would have lost a few days of R&D. I once had an idea that I should “commit everything”. I think that plan is coming back so that plan [c] becomes plan [d] and the new [c] move the work to a different site.This system failure and my DR/HA plan is a perfect metaphor.