Skip to main content

What uptime really means

Previously I have written that it is not possible to improve uptime but only minimize the impact of major incidents (read about the major incident process in IT here). This philosophy results in an improvement in uptime and is not a new idea, the avionics industry has been using it for decades to improve flight safety. It is thus clear that the investigation of accidents in IT will result in IT safety. Every time a plane falls out the sky, no stone is left unturned until the precise reason is known. IT is not as diligent and obviously not as safety conscious. However, safety in IT is more than measuring the power availability to a server in an arbitrary data centre using the "how many 9s" technique!

Read the full article on LinkedIn's Pulse here.




Comments

Popular posts from this blog

LDWin: Link Discovery for Windows

LDWin supports the following methods of link discovery: CDP - Cisco Discovery Protocol LLDP - Link Layer Discovery Protocol Download LDWin from here.

Battery Room Explosion

A hydrogen explosion occurred in an Uninterruptible Power Source (UPS) battery room. The explosion blew a 400 ft2 hole in the roof, collapsed numerous walls and ceilings throughout the building, and significantly damaged a large portion of the 50,000 ft2 building. Fortunately, the computer/data center was vacant at the time and there were no injuries. Read more about the explosion over at hydrogen tools here .

STG (SNMP Traffic Grapher)

This freeware utility allows monitoring of supporting SNMPv1 and SNMPv2c devices including Cisco. Intended as fast aid for network administrators who need prompt access to current information about state of network equipment. Access STG here (original site) or alternatively here .