Appendix A. Troubleshooting

This appendix explains

Checking OPS Host and CHALLENGE RAID SCSI IDs

This section explains how to make sure that the two OPS hosts (or other SCSI devices on the same bus) do not have the same SCSI ID number. Follow these steps:

  1. On the first OPS host, enter the command nvram. This command's output should contain the line

    scsihostid=
    

    This output means that the SCSI host ID for this OPS host is 0.

  2. Repeat step 1 in the second window, for the second OPS host. The output should contain the line

    scsihostid=2
    

    This output means that the SCSI host ID for the second OPS host is 2.

  3. If the host SCSI IDs are not 0 and 2, respectively, reboot and start the System Maintenance menu; choose item 5, the Command Monitor.

  4. In the Command Monitor, set the SCSI ID for this system:

    setenv scsihostid 2
    

  5. To verify that the SCSI ID was set, enter

    printenv
    

    The output should include the line scsihostid=2.

  6. Exit the System Maintenance menu and restart the system.

  7. Enter the command hinv on each server and compare the output.


    Caution: No SCSI device on any bus on the second server (OPS2) should have SCSI ID 2.


Table A-1 charts SCSI ID switch settings for the CHALLENGE RAID storage system. Note that the settings do not conform to frequently used numbering schemes.

Table A-1. CHALLENGE RAID SCSI ID Switch Settings

SCSI ID Number

Switch Number

 

 

 

Comment

 

ID 0

ID 1

ID 2

ID 3

 

1

Off

On

On

On

 

2

On

Off

On

On

Do not use

3

Off

Off

On

On

 

4

On

On

Off

On

 

5

Off

On

Off

On

 

6

On

Off

Off

On

Shown in Figure A-1 as A

7

Off

Off

Off

On

Shown in Figure A-1 as B

8

On

On

On

Off

 

9

Off

On

On

Off

 

10

On

Off

On

Off

 

11

Off

Off

On

Off

 

12

On

On

Off

Off

 

13

Off

On

Off

Off

 

14

On

Off

Off

Off

 

15

Off

Off

Off

Off

 

Figure A-1 shows a typical OPS cabling scheme.

Figure A-1. Example OPS Cabling Scheme


Checking Disk Partitions

Never create disk partitions 8, 9, and 10 manually. If partition 9 on the drive has been put on the drive at Silicon Graphics or during the installation process, do not reuse it or the sectors that it spans. Thus, 13 usable partitions per LUN remain: 0, 1, 2, 3, 4, 5, 6, 7, 11, 12, 13, 14, and 15.

Replacing a CHALLENGE RAID Disk Module

To replace a failed disk module in the CHALLENGE RAID, follow instructions in the CHALLENGE RAID-5 Owner's Guide.

Use only CHALLENGE RAID disk modules as replacements; only they contain the correct device firmware. The replacement 2 GB drive part number is 9410113; the replacement 4.3 GB drive part number is 9410114. Other disk modules, even those from other Silicon Graphics equipment, will not work. Do not mix disk modules of different capacities within one array.

After replacing a failed disk module, update the firmware on the CHALLENGE RAID storage-control processor. Type as root:

raid5 -d device firmware /usr/raid5/flare7.67.0.prom.7.99.bin 


Caution: You must use this command every time you replace a disk module.

The image in the file given in the command contains microcode that runs on the storage-control processor and possibly also a microcode image destined for the storage-control processor PROM, which runs the power-on diagnostics.

This command has no output.