05-25-07 06:18 AM
On May 24, 8:55 am, efff...@f-m.fm wrote:
> Company I work for has a SAN of about 50 TB.
> It is configured as 4 logical disks. So when there is a
> failure, that logical disk is out of action while
> the RAID rebuilds itself. "Hot swapping" doesn't
> help much when 1/4 of the system is paralysed for
> hours afterwards. It seems about once a month that
> one hard drive shits itself, and has to be replaced,
> triggering the fiasco again.
> Then system engineer says it would be good idea to
> run complete diagnostics. That means taking all offline
> for 172800 seconds = gazillions of dollars lost.
Either the storage system is configured very badly or it's a very poor
design. A single disk failure should not have such a significant
impact on performance. You should be able to replace the drive and let
the system rebuild it in the background and still allow user/
application access to the logical disks. For that quantity of storage
and for the cost of down-time (given your mention of lost revenue)
this storage should be a highly available enterprise level solution.
If that's what you've paid for, it certainly sounds like that's not
what you've got. Care to elaborate on what systems you're actually
running?
Graeme
[ Post a follow-up to this message ]
|