I still think Larry's conjecture about a device checking out for a short
while and coming back on line is very valid.
Have them look to see if the HMC is reporting any errors (brown icon on very
bottom left of the panel) or any yellow bang (!) indicators. I'm guessing
you'll be able to look in there and see if the idea has any merit.
Based on what you have described I suspect some sort of an I/O failure with
the DASD controller or several disk units that were recoverable. Also check
to be sure the cache batteries are not in a failed state. No cache, and a
moderate I/O load = really slow (appears to be hung) system.
--
Jim Oberholtzer
Chief Technical Architect
Agile Technology Architects
-----Original Message-----
From: MIDRANGE-L [mailto:midrange-l-bounces@xxxxxxxxxxxx] On Behalf Of James
H. H. Lampert
Sent: Thursday, October 02, 2014 1:19 PM
To: Midrange Systems Technical Discussion
Subject: Re: Customer box periodically becoming completely unresponsive --
we can't figure out why
New information:
According to the customer, the HMC was showing "all zeros and the partition
was running."
(Since we don't have any HMC-equipped boxes [our E4A uses a LAN Console, and
our older boxes are all Twinax] I'm not entirely sure what that means.)
I've passed on my observations about the three QPMxxxxxxx jobs that ran
without incident last night, and asked the customer whether the HMC showed
anything out of the ordinary.
On the other hand, while the three QPMxxxxxxx jobs ran, I think there might
have been a nightly scheduled backup that didn't run.
--
JHHL
--
This is the Midrange Systems Technical Discussion (MIDRANGE-L) mailing list
To post a message email: MIDRANGE-L@xxxxxxxxxxxx To subscribe, unsubscribe,
or change list options,
visit:
http://lists.midrange.com/mailman/listinfo/midrange-l
or email: MIDRANGE-L-request@xxxxxxxxxxxx Before posting, please take a
moment to review the archives at
http://archive.midrange.com/midrange-l.
As an Amazon Associate we earn from qualifying purchases.