Case Studies

CSI's Paladin Monitoring Saves One Client From a Major Email Outage and Lets Us Proactively Address Another's

CSI's Paladin Remote Monitoring solution caught two major Exchange 2010 email crises within a single hour on a Wednesday. One affected a little under 700 users and the other about 1,350 users.

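The first event was an Exchange alert from Paladin called "back pressure". This is where Exchange decides, based on the rate at which server resources (RAM and disk space) are being consumed, that it will not be able to do its job. Exchange then attempts to protect the core. It usually starts by shutting down email flow into and out of the Exchange server. This raises the questions, "How do you know if emails you sent are not being delivered? How do you know if emails sent to you are not being received?" Paladin knows. Because CSI actively watches the alert console rather than relying solely on automated alerts to our clients, we did it the "old-fashioned way": we picked up the phone and talked to the right people, who had no idea they were not getting emails. We worked with them to resolve the issue. After a simple resource allocation change in their virtual environment and a quick reboot, those 700 users could go on doing what they do without having to wonder, "Why is email down?"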
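The back pressure behavior described above can be sketched as a simple threshold check. This is only an illustration of the idea, not Exchange's actual logic: the threshold values, the `ResourceSample` type, and the function names are all hypothetical.

```python
from dataclasses import dataclass

# Hypothetical thresholds for illustration only; not Exchange's real values.
MEDIUM_THRESHOLD = 0.85
HIGH_THRESHOLD = 0.95


@dataclass
class ResourceSample:
    """Fraction (0.0 to 1.0) of each resource currently in use."""
    memory_used: float
    disk_used: float


def pressure_level(sample: ResourceSample) -> str:
    """Classify resource pressure by the most constrained resource."""
    worst = max(sample.memory_used, sample.disk_used)
    if worst >= HIGH_THRESHOLD:
        return "high"
    if worst >= MEDIUM_THRESHOLD:
        return "medium"
    return "normal"


def mail_flow_allowed(sample: ResourceSample) -> bool:
    """In this sketch, mail flow continues only under normal pressure."""
    return pressure_level(sample) == "normal"
```

A monitoring tool that polls these values can raise an alert as soon as `mail_flow_allowed` turns false, which is the point at which users stop receiving mail without knowing it.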

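When we combine Paladin Monitoring with Paladin Email Defense, we can do one better. Paladin Email Defense gives us 24x7x365 SMS text alerts when mail flow into or out of an email server stops and starts. If the outage is due to a true disaster situation, Paladin Email Defense immediately switches to disaster recovery mode, in which client email that cannot be delivered to the mail server is immediately available via the web. The mail server may be dead or the building destroyed, but if you can find internet access somewhere, you can still send and receive important email until whatever went wrong is resolved. If the situation is temporary, Paladin Email Defense automatically restarts inbound and outbound mail flow once the connection is re-established, then notifies everyone via SMS that normal mail flow is working again.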

The second Exchange event happened an hour after the first. An Exchange server that serves approximately 1,350 users developed a high CPU condition, which degraded performance for those users. There was no warning. One minute it was normal. The next minute it was in a bad place. Paladin alerted us. By the time the client called to report odd performance problems in Exchange, we were already investigating the outage. In this instance we couldn't prevent a degradation in performance. No one can do that all the time. However, we knew before our client knew that there was an urgent issue. We were proactively working to resolve the issue as fast as possible to minimize downtime. About 20 minutes after the event started we had it resolved and everyone went back to work. Our response time from alert to action on this critical alert was about three minutes.

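You can't possibly know everything that is happening, or about to happen, on your network. By overlaying 24x7x365 Paladin remote monitoring, we can give you visibility into your network that you could not get on your own. By overlaying Paladin Email Defense, we can provide an additional layer of disaster recovery protection for your critical email communications. How do you know what you don't know about your network?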

CSI’s Paladin Monitoring Saves Another Client From Excessive Downtime

CSI's Paladin Remote Monitoring solution had an impressive save in the last couple of days.

Last week we had an ISP go on-site after hours to do a routine hardware upgrade/swap. The outage was planned and expected. It was to be a quick in and out and back on-line. Paladin saw the client site go off-line (as planned). However, the site never came back. Time went by and it still never came back. Hours went by. It was obvious that something had gone horribly wrong. If this continued until morning, bad things were going to happen for our client. There were 2,100 users sitting behind this connection, and many of them would have been very upset if the problem had not been resolved. We called the right people after hours, and at around 10:45 PM the ISP went back to the client site and quickly fixed the connectivity problem the upgrade had introduced. The end users never even knew the outage had occurred. The folks in charge of that site knew because Paladin was monitoring that site 24x7x365 whether they were standing there or not. We knew because we do not rely solely on automated "you're down" alerts; we work hard to stay in active conversation with our clients and to keep their networks healthy. In this case it was some after-hours, live "person" monitoring, just to make sure that everything came out okay. You can't possibly know everything that is happening, or about to happen, on your network.

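By overlaying 24x7x365 Paladin remote monitoring, we can give you visibility into your network that you could not get on your own. There is simply too much data to sift through. In both cases, we were able to spot substantial problems and deal with them before they turned into major crises with large numbers of unhappy users.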

How do you know what you don't know about your network?

CSI Monitors Our Clients' Networks Through Hurricane Irene

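As Hurricane Irene approached New York, CSI used our 24x7x365 Paladin Monitoring service to help our clients prepare their computers and networks for the coming hurricane. We were able to quickly identify every uninterruptible power supply (i.e., battery backup) under management that had a bad battery or other hardware problem. Equipment plugged into those battery units was subject to greater-than-normal power fluctuations.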

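One client site was intending to shut down their entire operations during the storm. Before they shut their equipment down, we found a server that was compromised by a bad drive in its RAID array along with other hardware problems. Our concern was that, because this critical server already had a failed redundant component as well as other issues, it might be shut off and never come back online.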

Realizing that time was of the essence in repairing the server, we were able to use Paladin's remote management tools at 12 AM on Saturday, as the storm approached, to rebuild the redundant drive and re-establish full redundancy before the server was actually shut down. The client never had to get out of bed. No one had to show up to let us into the building, turn off the alarm and unlock the multiple doors required to get to the server closet. After our work was completed, the server went down as the client had planned and came up fine after the storm.

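During the storm we proactively monitored our clients' networks and provided running status updates as we watched buildings and servers across the region go down because of power outages. By reviewing earlier alerts and querying the power supplies, we were able to tell the difference between "no power" and an actual equipment failure.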
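The distinction between "no power" and a real equipment failure can be sketched roughly as follows. This is an illustrative guess at the reasoning, not Paladin's actual logic: the `DeviceObservation` fields, the status labels, and the function name are hypothetical.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class DeviceObservation:
    """Hypothetical snapshot of what monitoring knows about one device."""
    device_reachable: bool
    # True if an earlier alert reported the site's UPS running on battery.
    earlier_on_battery_alert: bool
    # Result of querying the UPS now; None if the UPS itself does not answer.
    ups_has_utility_power: Optional[bool]


def classify(obs: DeviceObservation) -> str:
    """Return 'online', 'no power', 'possible equipment failure', or 'unknown'."""
    if obs.device_reachable:
        return "online"
    if obs.ups_has_utility_power is True:
        # Power is present but the device is still down.
        return "possible equipment failure"
    if obs.ups_has_utility_power is False or obs.earlier_on_battery_alert:
        return "no power"
    return "unknown"
```

Devices that land in the "possible equipment failure" bucket are the ones worth sending a person to look at once the site has power again.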

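After the storm subsided on Sunday, we were able to determine exactly which buildings in the region were offline. Then, as those buildings came back online, we were able to determine exactly which devices in each building had not restarted. From there we had a list of devices that either the client's staff or CSI's staff could investigate.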

On Sunday night I was personally watching our clients' networks through the Paladin Monitoring console. In the middle of that I lost power at home. I walked outside and started up my generator. I then turned on the Verizon wireless card in my laptop and didn't miss a beat.

CSI's office has an adequate backup generator and a good internet connection, so our 24x7x365 monitoring continued regardless of storm conditions.

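Once Monday morning came, despite the flooding, road closures, and widespread power outages in some areas, most of our clients went back to work with their computer networks running just as they had been when they left for the weekend on Friday.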

That is what CSI's Paladin Monitoring does 24x7x365.