× The internal search function is temporarily non-functional. The current search engine is no longer viable and we are researching alternatives.
As a stop gap measure, we are using Google's custom search engine service.
If you know of an easy to use, open source, search engine ... please contact support@midrange.com.



Got busy this morning and forgot to follow up.

We pulled the "time out error" drive around 5pm last night. The SAN
immediately started the RAID 5 rebuild with the hot spare. Then we
inserted the cold spare and configured as the new hot spare.

The RAID 5 rebuild finished around 10:20pm at which time the SAN then
verified the hot spare. That concluded about midnight.

Today all is well except for needing to replace the cold spare.



On Mon, Mar 10, 2014 at 3:39 PM, Jeff Crosby <jlcrosby@xxxxxxxxxxxxxxxx>wrote:

During save portion of backup. Prior to verify stage.


On Mon, Mar 10, 2014 at 3:36 PM, Chris Bipes <chris.bipes@xxxxxxxxxxxxxxx>wrote:

Heal itself, NO. But a timeout is not necessarily a failure. Did this
happen during some sort of heavy IO like a backup? After the timeout, the
RAID controller may have initiated a rebuild and now the data is rebuilt
and the drive is OK. I would replace it just to be safe. If it erred
once, it will do so again, and at the worst time, after another drive fails.

--
Chris Bipes
Director of Information Services
CrossCheck, Inc.


-----Original Message-----
From: pctech-bounces@xxxxxxxxxxxx [mailto:pctech-bounces@xxxxxxxxxxxx]
On Behalf Of Jeff Crosby
Sent: Monday, March 10, 2014 12:32 PM
To: PC Technical Discussion for iSeries Users
Subject: [PCTECH] Can a hard drive fix itself

Cybernetics SAN, 8 500gb drives, RAID 5 w/hot spare. Drives are
S.M.A.R.T.
Hosting 5 virtual servers.

At 3:20 this morning I got a text of a drive issue: "ERROR - Drive
timeout
detected: port=5"

Talked to the hardware guy at that time. He checked in and said the RAID
5
was still functioning and the hot spare was still available. Suggested we
wait till business hours since there was a hot spare.

Around 9am we decided we would do the replacement around 4:30pm since
things would be a lot less busy for the RAID 5 rebuild. The RAID 5 was
still functioning and if one didn't drill down in all the way, you
wouldn't
even see the status DEVICE ERROR on that drive.

I checked it out about an hour ago and the DEVICE ERROR is gone, replaced
by OK which is what it should say. Hardware guy says he has seen this
before simply by reseating the "problem" drive.

But we did no such thing. Haven't touched it at all. Hardware guy
suggests waiting a day and see what happens.

Anybody seen this before? A drive heals itself?

--
This is the PC Technical Discussion for IBM i (AS/400 and iSeries) Users
(PcTech) mailing list
To post a message email: PcTech@xxxxxxxxxxxx
To subscribe, unsubscribe, or change list options,
visit: http://lists.midrange.com/mailman/listinfo/pctech
or email: PcTech-request@xxxxxxxxxxxx
Before posting, please take a moment to review the archives
at http://archive.midrange.com/pctech.




--
Jeff Crosby
VP Information Systems
UniPro FoodService/Dilgard
P.O. Box 13369
Ft. Wayne, IN 46868-3369
260-422-7531
www.dilgardfoods.com

The opinions expressed are my own and not necessarily the opinion of my
company. Unless I say so.





As an Amazon Associate we earn from qualifying purchases.

This thread ...

Replies:

Follow On AppleNews
Return to Archive home page | Return to MIDRANGE.COM home page

This mailing list archive is Copyright 1997-2024 by midrange.com and David Gibbs as a compilation work. Use of the archive is restricted to research of a business or technical nature. Any other uses are prohibited. Full details are available on our policy page. If you have questions about this, please contact [javascript protected email address].

Operating expenses for this site are earned using the Amazon Associate program and Google Adsense.