From: Stephan Windmüller on
Hello!

About two weeks ago I installed the current Solaris OS on two v240s to
form a cluster. For the first install I used ZFS, but that was
incompatible with the cluster software. So I reinstalled the OS, chose
"UFS" as the filesystem and mirrored all partitions onto the second
drive in each machine with the Solaris Volume Manager.

Today I wanted to patch these machines and rebooted one of them into
single user mode. Instead of booting correctly, I got this message:

| WARNING: The following files in / differ from the boot archive:
|
| changed /kernel/drv/did.conf
|
| The recommended action is to reboot to the failsafe archive to correct
| the above inconsistency. To accomplish this, on a GRUB-based platform,
| reboot and select the "Solaris failsafe" option from the boot menu.
| On an OBP-based platform, reboot then type "boot -F failsafe". Then
| follow the prompts to update the boot archive. Alternately, to continue
| booting at your own risk, you may clear the service by running:
| "svcadm clear system/boot-archive"

So I booted the machine into failsafe mode and used the instructions
from http://docs.sun.com/app/docs/doc/817-1985/gglbw?a=view to update
the boot archive. During this procedure I was confused because my first
disk was listed as c0t0d0s0 and not c1t0d0s0. Also, metastat did not
work, and the system asked me whether I wanted to mount a zpool that it
found during startup. But there should not be any zpool left now; all
partitions are UFS.

The reboot did not work with c0t0d0s0 in the vfstab, so I rebooted once
again into failsafe. From this point on, the whole system has been
corrupted. I am no longer able to boot. The OpenBoot prompt tells me this:

| Rebooting with command: boot
| Boot device: disk File and args:
| ERROR: Last Trap: Fast Data Access MMU Miss
|
| {1} ok boot
| ERROR: /pci@1c,600000: Last Trap: Fast Data Access MMU Miss
|
| {1} ok

I am unsure how to proceed from this point. The second machine gives me
exactly the same error about "/kernel/drv/did.conf", so I do not think
that a hard drive failed in both servers simultaneously.

Regards
Stephan
From: Richard B. Gilbert on
The error message you get on attempting to boot strongly suggests a
hardware problem. Call Sun and get it fixed!

I hope you have a backup of your disk from before you started tinkering!
You may need that after you get your MMU problem straightened out.
From: ndjilinski on
Hi,

Not sure what exactly you have done on those servers, but the error
you got (boot_archive and GRUB) is related to the x86 architecture, and
you have a v240 (SPARC).

Now, if I were in your shoes, I would boot off the OS CD or the net into
single-user mode and:
* restore the bootblk on the internal disks
* mount your root FS from the internal disks and look into the
following:
/etc/system
/etc/vfstab
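
For the /etc/vfstab check, here is a minimal sketch, assuming the root FS
from the internal disk is mounted somewhere like /a. The function names
root_device and root_device_kind are made up for illustration. It only
reports which device vfstab mounts at "/": with an SVM-mirrored root you
would normally expect a /dev/md/dsk/... metadevice there rather than a
plain c?t?d?s? slice.

```shell
#!/bin/sh
# Hypothetical helpers: inspect the root entry of a vfstab file.
# vfstab fields: device-to-mount device-to-fsck mount-point fstype ...

# Print the device that the given vfstab file mounts at "/".
root_device() {
    awk '$1 !~ /^#/ && $3 == "/" { print $1 }' "$1"
}

# Classify that device: SVM metadevice, plain disk slice, or neither.
root_device_kind() {
    dev=`root_device "$1"`
    case "$dev" in
        /dev/md/dsk/*) echo "svm" ;;
        /dev/dsk/*)    echo "slice" ;;
        "")            echo "missing" ;;
        *)             echo "other" ;;
    esac
}
```

Usage would be something like: root_device_kind /a/etc/vfstab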

If you get the same error (MMU Miss) when booting off CD or net, open
a call with Sun support.

Cheers,
Nikolai
From: ndjilinski on
Oops, sorry, I was wrong. As I see it, this must be Solaris 10 U6, so the
boot_archive warnings can appear on this particular configuration
after patching, because the boot_archive file is not updated after a
patch has been applied.
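
A rough sketch of what refreshing a stale boot archive looks like from a
failsafe or CD-booted shell. The function name archive_fix_commands is
made up, and the slice and mount point you pass in are your own values.
It only prints the commands so you can review them first. Note that on an
SVM-mirrored root, writing to one underlying slice directly can leave the
two mirror halves out of sync, so treat this as an outline, not a recipe.

```shell
#!/bin/sh
# Hypothetical helper: print (do not run) the commands that refresh the
# boot archive of a root file system mounted at an alternate root.
#   $1 = root slice, for example c1t0d0s0
#   $2 = alternate root mount point, for example /a
archive_fix_commands() {
    slice=$1
    altroot=$2
    echo "mount /dev/dsk/$slice $altroot"
    echo "bootadm update-archive -R $altroot"
    echo "umount $altroot"
}
```

For example, archive_fix_commands c1t0d0s0 /a lists the three steps for
the first internal disk.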

Anyway, before booting off CD or net, try to reset the server.

From OBP:
setenv auto-boot? false
reset-all

Then try to boot off the local disk with the -F failsafe option (I would
try from both disks). If that doesn't work, boot off cdrom or net (if you
have a JumpStart server).

If you still get the MMU Miss, log a call with Sun.

Cheers,
Nikolai