IT infrastructure is a crucial aspect of most operations, and the data center is the heart of IT resources. Your servers and routing equipment are essential to the standard activities that many employees and clients may take for granted: accessing their data, communicating with each other, accessing the internet, or, in some cases, even dialing an outside line. So, if the data center suffers a failure of some sort, a server dies equipment overheats, or power is lost, users down the line, your clients, colleagues, and managers, will also have a problem. And if you're even remotely involved with the data center, you'll hear about it.
To prevent outages and long conversations with disgruntled parties affected by outages, you'll need to monitor your data center. Data center monitoring systems help keep you informed of the goings-on in your data center, and can alert you to problems before they become service-affecting, so you can keep everything running smoothly. Or, even if a problem does result in an outage, monitoring systems can help you pinpoint the problem, so you can minimize the length or seriousness of an outage.
To keep fully aware of what happens in your data center, you'll want to monitor the equipment itself, and also the environment. Equipment monitoring consists mostly of tracking equipment to make sure that any single piece doesn't malfunction. Environmental monitoring involves using analog sensors to check on the heat, humidity, and air flow in your data center to ensure that your HVAC and other environmental systems are working properly to prevent equipment malfunction.
Of course, you'll have to make sure that all of your data center monitoring systems report to the appropriate personnel. According to the Uptime Institute, even 1st tier data centers are expected to maintain 99.671% availability, so you'll need a fail-proof way to make sure that you always know what's going on in your data center, without being glued to a terminal. And when something does go wrong, you can't afford to waste time searching for the location or the exact nature of the problem. Typically this means you'll need to employ a master station in to bring in all the equipment and environmental monitoring data and report it in quickly and meaningful ways.
Monitoring Starts at the Equipment Level
The equipment in the data center is essentially your employer. If it goes out of business, so do you. So monitoring your equipment is a good place to start. You'll use remote telemetry devices (RTU) in your racks to report back on the status of your equipment. Most RTU's are easily rack mountable and fit in line with your equipment. The RTU's discrete points, connected to equipment in your racks, will report back critical, major, and minor alarms.
Your RTU should also have analog inputs that you can use to measure equipment temperatures, air flow, and other environmental data. If an exhaust fan in a cabinet goes out, a simple air flow sensor can clue you in to the problem, so you can respond early to prevent a high temperature alarm or equipment damage resulting in an outage. If racks are liquid cooled, analog sensors can help you monitor for humidity and condensation.
Any RTU you install should also be able to report ping alarms. You can configure ping alarms to ping various network equipment and ensure that communications are still active (or working). If not, the RTU will set an alarm and inform you of the issue before somebody else can.

Remotes like the NetGuardian series can monitor analog sensors, discrete contacts, and ping targets, operate control relays, and report events via SNMP or email, keeping you aware of the condition of your data center while you attend to more important things.
DPS Telecom recommends their NetGuardian series remotes for monitoring. In addition to the above features, NetGuardians have simple web interfaces, so you can simply punch in the IP address, login, and configure or monitor your equipment. From the web interface, you can set notifications, so when an alarm occurs, the RTU can send you an email or send an SNMP trap. You can also monitor the RTU's discrete points and analog inputs, and even operate control relays. You don't necessarily want to use the web interface as a full-time monitoring solution for your RTUs (because keeping 20 browsing windows open to monitor equipment can be a hassle), but you can use it to check up on your alarms when you're on the go or if you receive an alarm notification. Even from your BlackBerry™ or other smartphone, you can keep aware of what's going on in your data center.
Environmental Monitoring Helps Protect Equipment
Beyond monitoring equipment for problems, monitoring your data center's controlled environment can help clue you into problems before they result in equipment alarms or service-affecting damage. Your data center likely has computer room air conditioners (CRAC's) and other sorts of HVAC equipment to keep the server room from getting to hot or humid. Data center cooling typically relies on air flow as well, cool air in, hot air out. (Or something to this affect. You'll usually want to recycle your hot air, because hot return air helps maintain a high differential for the heat exchanger, which results in cooler air coming out of the CRAC units and into the data center.)
To monitor your data center's environment, you'll need to monitor your data center's crucial HVAC equipment with discrete contacts to make sure that your environmental control systems are working. It's also helpful to place analog sensors throughout the data centers in areas that will give you an indication of whether or not your environmental systems are functioning properly. Measure airflow in hot and cold aisles to make sure that the right kind of air is moving in the right places throughout the data center. Place temperature sensors in hot spots around the room to give you an indication of when equipment is in danger.
A device like DPS Telecom's TempDefender can help with analog data center monitoring. It takes up very little rack space and can monitor up to 16 analog sensors. Sensors for the TempDefender can be daisy-chained together, and the chain can run up to 600 feet from the small RTU. The TempDefender's range helps ensure that you can monitor every corner of your data center without much regard for RTU placement.

TempDefender monitors your data center's cooling and environmental controls to ensure the safety of your equipment and optimal operating conditions. Preventing hardware failures and shutdowns due to poor data center conditions keeps costs down.
The Master Station Ties it all Together Because there's so much to monitor in your data center, across multiple RTUs, you'll need a way to organize and report back on the status on your data center. A master station polls the network monitoring systems in your data center and provides a standard, unified interface from which you can view your monitoring data or alarms. When there is a problem, the master station will bring it to the forefront, so you can view the alarm without directly accessing the interface for the RTU.
DPS Telecom recommends the T/Mon alarm master series to collect your alarms. T/Mon can poll and interpret alarms from equipment with integrated proprietary or legacy monitoring systems in addition to standard RTU's, so you're not limited to deploying a whole new fleet of RTU's. You can use existing monitoring paths and deploy new RTUs to cover blind spots, and all of your monitoring equipment can report to a single, top-level master. This simplifies things for you while maximizing visibility. In addition to consolidating alarms, advanced master stations like T/Mon can help you sort your alarms, so you can quickly narrow down a problem to its root cause when it does occur.

T/Mon mediates protocols from a multitude of devices, and can report out SNMP or T1 to a higher-level master
The T/Mon alarm master can also report alarms the same way RTUs do, via pager, email or SNMP trap. With an accessory, you can even report alarms via voice call and acknowledge alarms via DTMF. If you always need to be aware of your network, than you should have multiple ways for the data center NOC to get in touch with you and let you know what sort of problems you've got.
With a capable alarm master station, you can monitor your alarms locally or remotely, and with multiple programs. T/Mon offers virtual access, a web interface, and a map-based interface that can be access remotely. The T/GFX map based interface for T/Mon allows you to view your data center on a floor-plan level, with your equipment mapped out, so you can see exactly where the problem is when something goes wrong. For added convenience, you can even access the interface for devices connected to your RTU's terminal server through the graphical interface.
The Bottom Line
A robust monitoring system with the right monitoring equipment and strategy can essentially automate data center monitoring and will make maintaining uptime easier, so you can spend less time in the data center and more time focusing on increasing performance. The key to automating data center management is making sure that you're monitoring your equipment, environment, and consolidating reporting with an advanced master station.
Click Here to Learn About Data Center Cooling and Management
|
Request More Info
(simple online form) |
|
Next Page >> most related to this one |
Give Us a Call!
To find out more about this and other DPS applications, give us a call at our toll-free number and talk to one of our network specialists. They'll help you put together a perfect fit solution for your network!





RSS