小型机预防性维护_第1页
小型机预防性维护_第2页
小型机预防性维护_第3页
小型机预防性维护_第4页
小型机预防性维护_第5页
已阅读5页,还剩11页未读 继续免费阅读

下载本文档

版权说明:本文档由用户提供并上传,收益归属内容提供方,若内容存在侵权,请进行举报或认领

文档简介

1、HP服务器预防性维护1.指示灯状态32.分区状态(仅限于cell-based系统)33.内存(dmesg/cstm)44.磁盘(ioscan fnkCdisk)75.I/O (ioscan fnk)86.网络(#ioscan nfkClan,#lanscan,#neistat -in)107.dmesg 输出118.系统启动日志129.系统运行日志1310.系统shutdown 日志1311.系统事件日志1312.文件系统1413.系统近期有无HPMC1514.系统近期有无Core Dump1515.根盘镜像状态1516.集群的运行状态1717.CPU 平均使用率( #sar )1918.空闲

2、的物理内存( #top )2019.交换区使用情况( #swapinfo atm)2120.系统核心资源使用情况(#sar v)211. 指示灯状态服务器各个部件都会有指示灯,一般绿灯属于部件正常,黄灯和红灯就有故常。2. 分区状态(仅限于cell-based系统)在操作系统上还可以用parstatus命令来检查Cell状态,RIO connection等。以下是一个正常输出的例子:sisdba/modelia64 hp server rx7620sisdba/parstatusNote: No action specified. Default behavior is display all

3、.Complex Complex Name : Complex 01 Complex Capacity Compute Cabinet (2 cell capable) : 1 Active MP Location : cabinet 0 Original Product Name : server rx7620 Original Serial Number : SGH434934T Current Product Order Number : A7027A OEM Manufacturer : Complex Profile Revision : 1.0 The total number o

4、f partitions present : 1Cabinet Cabinet I/O Bulk Power Backplane Blowers Fans Supplies Power Boards OK/ OK/ OK/ OK/Cab Failed/ Failed/ Failed/ Failed/Num Cabinet Type N Status N Status N Status N Status MP= = = = = = =0 2 cell slot 4/0/N+ 6/0/N+ 2/0/N+ - Active Notes: N+ = There are one or more spar

5、e items (fans/power supplies). N = The number of items meets but does not exceed the need. N- = There are insufficient items to meet the need. ? = The adequacy of the cooling system/power supplies is unknown. HO = Housekeeping only; The power is in a standby state. NA = Not Applicable.Cell CPU Memor

6、y Use OK/ (GB) Core OnHardware Actual Deconf/ OK/ Cell Next ParLocation Usage Max Deconf Connected To Capable Boot Num= = = = = = = =cab0,cell0 Absent - - - - - - cab0,cell1 Active Core 2/0/4 16.0/0.0 cab0,bay0,chassis1 yes yes 0 Notes: * = Cell has no interleaved memory.Chassis Core Connected ParHa

7、rdware Location Usage IO To Num= = = = =cab0,bay0,chassis0 Powered off ? ? ? cab0,bay0,chassis1 Active yes cab0,cell1 0 PartitionPar # of # of I/ONum Status Cells Chassis Core cell Partition Name (first 30 chars)= = = = = =0 Active 1 1 cab0,cell1 Partition 0 3. 内存(dmesg/cstm)#dmesgMemory Information

8、: physical page size = 4096 bytes, logical page size = 4096 bytes Physical: 16755960 Kbytes, lockable: 12532456 Kbytes, available: 14552680 Kbytes物理内存大小#cstm用map 命令列出硬件列表,用sel dev 2选中内存项,然后用il显示内存信息。webb/cstmRunning Command File (/usr/sbin/stm/ui/config/.stmrc).- Information -Support Tools ManagerPr

9、oduct Number B4708AA(C) Copyright Hewlett Packard Co. 1995-2005All Rights ReservedUse of this program is subject to the licensing restrictions describedin "Help->On Version". HP shall not be liable for any damages resultingfrom misuse or unauthorized use of this program.cstm>cstm>

10、map webb Dev Last Last Op Num Path Product Active Tool Status = = = = = 1 system system (1003) Information Successful 2 memory IPF_MEMORY (1003) Information Successful 3 0 Bus Adapter (103c1229) Information Successful 4 0/0 PCI Bus Adapter (103c122e Information Successful 5 0/0/1/0 RS-232 Interface

11、(103c129 Information Successful 6 0/0/1/1 RS-232 Interface (103c104 Information Successful 7 0/0/2/0 PCI SCSI Interface (10000 Information Successful 8 0/0/2/0.0.0 SCSI Disk (HP73.4GMAS3735 Information Successful 9 0/0/2/0.2.0 SCSI Disk (HP73.4GMAS3735 Information Successful 10 0/0/2/1 PCI SCSI Inte

12、rface (10000 Information Successful 11 0/0/2/1.3.0 Optical Storage Device (H Information Warning 12 0/0/4/0 PCI Bus Adapter (8086b154 Information Successful 13 0/0/4/0/4/0 USB Open Host Controller 14 0/0/4/0/4/0.0 Generic USB Interface (60 15 0/0/4/0/4/1 USB Open Host Controller 16 0/0/4/0/4/2 USB E

13、nhanced Host Control 17 0/0/4/0/5/0 Graphics Interface (GRAPH Information Warning 18 0/1 PCI Bus Adapter (103c122e Information Successful 19 0/1/1/0 PCI Bus Adapter (101401a7 Information Successful 20 0/1/1/0/1/0 PCI SCSI Interface (10000 Information Successful 21 0/1/1/0/1/0.5.0 SCSI Tape (HPC5683A

14、) 22 0/1/1/0/1/1 PCI SCSI Interface (10000 Information Successful 23 0/1/1/0/1/1.0.0 SCSI Disk (HP73.4GMAS3735 Information Successful 24 0/1/1/0/1/1.2.0 SCSI Disk (HP73.4GMAS3735 Information Successful 25 0/1/1/0/4/0 Core PCI 1000Base-T Link Information Successful 26 0/2 PCI Bus Adapter (103c122e In

15、formation Successful 27 0/2/1/0 PCI Gigabit Ethernet Link Information Successful 28 0/3 PCI Bus Adapter (103c122e Information Successful 29 0/3/1/0 PCI Gigabit Ethernet Link Information Successful 30 0/4 PCI Bus Adapter (103c122e Information Successful 31 0/5 PCI Bus Adapter (103c122e Information Su

16、ccessful 32 0/5/1/0 Fibre Channel Interface ( Information Successful 33 0/5/1/0.1 Fibre Channel Driver (Mas 34 0/5/1/55.2.7 Virtual Array 7110 (HPA61 35 0/5/1/55.2.10 Virtual Array 7110 (HPA61 36 0/5/1/0.2 Fibre Channel Driver (Mas 37 0/5/1/55.0.0. SCSI Tape (HPUltrium) 38 0/5/1

17、/55.0.0. SCSI Device (HPNS) 39 0/5/1/55.2.7 Virtual Array 7110 (HPA61 40 0/5/1/0.3 Fibre Channel Driver (Mas 41 0/5/1/55.2.1 Virtual Array 7110 (HPA61 42 0/6 PCI Bus Adapter (103c122e Information Successful 43 0/7 PCI Bus Adapter (103c122e Information Successful 44 120 CPU (10

18、03) 45 121 CPU (1003) 46 250 Core I/O Adapter (fffffff 47 250/0 IPMI Controller (49504930 Information Successful 48 250/1 ACPI Device (41435049) Information Successful cstm>sel dev 2cstm>il- Converting a (3300) byte raw log file to text. -Preparing the Information Tool Log for IPF_MEMORY on pa

19、th memory File .46;1H0K. webb : 2 . - Information Tool Log for IPF_MEMORY on path memory -Log creation time: Sun Jul 17 23:03:21 2011Hardware path: memoryBasic Memory Description Module Type: MEMORY Page Size: 4096 Bytes Total Physical Memory: N/A Total Configured Memory: 10240 MB Total De

20、configured Memory: N/A Memory Board Inventory DIMM Location Size(MB) DIMM Location Size(MB) - - - - Ext 0 DIMM 0A 1024 Ext 0 DIMM 0B 1024 Ext 0 DIMM 0C 1024 Ext 0 DIMM 0D 1024 Ext 0 DIMM 1A 512 Ext 0 DIMM 1B 512 Ext 0 DIMM 1C 512 Ext 0 DIMM 1D 512 Ext 0 DIMM 2A 256 Ext 0 DIMM 2B 256 Ext 0 DIMM 2C 25

21、6 Ext 0 DIMM 2D 256 Ext 0 DIMM 3A 256 Ext 0 DIMM 3B 256 Ext 0 DIMM 3C 256 Ext 0 DIMM 3D 256 Ext 0 DIMM 4A 256 Ext 0 DIMM 4B 256 Ext 0 DIMM 4C 256 Ext 0 DIMM 4D 256 Ext 0 DIMM 5A 256 Ext 0 DIMM 5B 256 Ext 0 DIMM 5C 256 Ext 0 DIMM 5D 256 Ext 0 Total: 10240 (MB) =Memory Error Log Summary The memory err

22、or log is empty.Page Deallocation Table (PDT) The Page Deallocation Table is empty.46;1H0K7mStandard inputPress space to continue, q to quit, h for helpm46;1H46;1H0K PDT Entries Used: 0 PDT Entries Free: 100 PDT Total Size: 100有报错:Memory Error Log Summary DIMM Location Error Address Error Type Page

23、Count - - - - - Cab 0 Cell 1 DIMM 3A 0x334271180 Single-Bit 0x334271 1 System start: Tue Mar 22 23:18:43 2011. 假如这个数值比较大就建议跟换内存 Last error detected: Tue Mar 22 23:18:43 2011. Logging interval: 900 seconds. 1 address(es) with errors logged in memory error log. The Logtool Utility provides full detail

24、s about the memory error log.4. 磁盘(ioscan fnkCdisk)#ioscan -fnCdisk | moreClass I H/W Path Driver S/W State H/W Type Description=disk 0 0/0/1/1.2.0 sdisk CLAIMED DEVICE SEAGATE ST39204LC/dev/dsk/c1t2d0 /dev/rdsk/c1t2d0disk 1 0/0/2/1.2.0 sdisk CLAIMED DEVICE HP DVD-ROM 305 /dev/dsk/c3t2d0 /dev/rdsk/c

25、3t2d0disk 2 0/6/0/ sdisk CLAIMED DEVICE HP A6189B/dev/dsk/c8t0d0 /dev/rdsk/c8t0d0disk 3 0/6/0/ sdisk CLAIMED DEVICE HP A6189B/dev/dsk/c8t0d1 /dev/rdsk/c8t0d1disk 4 0/6/0/ sdisk CLAIMED DEVICE HP A6189B/dev/dsk/c8t0d2 /dev/rdsk/c8t0d2disk 5 0/6/0/10

26、.1.0.0 sdisk CLAIMED DEVICE HP A6189B/dev/dsk/c10t0d0 /dev/rdsk/c10t0d0disk 6 0/6/0/ sdisk NO_HW DEVICE HP A6189B/dev/dsk/c10t0d1 /dev/rdsk/c10t0d1disk 7 0/6/0/ sdisk NO_HW DEVICE HP A6189B/dev/dsk/c10t0d2 /dev/rdsk/c10t0d2Ø 在上面的例子中,磁盘状态是“NO_HW“代表此盘在主机最初启动时是正常的,可被系

27、统正常访问;但现在系统核心已找不到这个物理盘体。造成此状态的具体原因有可能是:1. 物理磁盘损坏。2. 到这个磁盘的硬件连接通道有问题(SCSI卡,SCSI 线,光纤卡,光纤线,光纤交换机)。3. 这个磁盘被在线移掉。5. I/O (ioscan fnk)#ioscan -fn | moreClass I H/W Path Driver S/W State H/W Type Description=root 0 root CLAIMED BUS_NEXUS ioa 0 0 sba CLAIMED BUS_NEXUS System Bus Adapter (582)ba 0 0/0 lba CL

28、AIMED BUS_NEXUS Local PCI Bus Adapter (782)lan 0 0/0/0/0 btlan CLAIMED INTERFACE HP PCI 10/100Base-TX Core /dev/diag/lan0 /dev/ether0 /dev/lan0 ext_bus 0 0/0/1/0 c720 CLAIMED INTERFACE SCSI C896 Ultra Wide Single-Endedtarget 0 0/0/1/0.3 tgt CLAIMED DEVICE tape 0 0/0/1/0.3.0 stape NO_HW DEVICE HP C15

29、37Atarget 1 0/0/1/0.7 tgt CLAIMED DEVICE ctl 0 0/0/1/0.7.0 sctl CLAIMED DEVICE Initiator /dev/rscsi/c0t7d0ext_bus 1 0/0/1/1 c720 CLAIMED INTERFACE SCSI C896 Ultra Wide Single-Endedtarget 2 0/0/1/1.7 tgt CLAIMED DEVICE ctl 1 0/0/1/1.7.0 sctl CLAIMED DEVICE Initiator /dev/rscsi/c1t7d0target 3 0/0/1/1.

30、15 tgt CLAIMED DEVICE disk 0 0/0/1/1.15.0 sdisk CLAIMED DEVICE SEAGATE ST318404LC /dev/dsk/c1t15d0 /dev/rdsk/c1t15d0ext_bus 2 0/0/2/0 c720 CLAIMED INTERFACE SCSI C87x Fast Wide Single-Endedtarget 4 0/0/2/0.3 tgt CLAIMED DEVICE disk 1 0/0/2/0.3.0 sdisk CLAIMED DEVICE HP DVD-ROM 305 /dev/dsk/c2t3d0 /d

31、ev/rdsk/c2t3d0target 5 0/0/2/0.6 tgt CLAIMED DEVICE ctl 2 0/0/2/0.6.0 sctl CLAIMED DEVICE Initiator /dev/rscsi/c2t6d0ext_bus 3 0/0/2/1 c720 CLAIMED INTERFACE SCSI C87x Ultra Wide Single-Endedtarget 6 0/0/2/1.7 tgt CLAIMED DEVICE ctl 3 0/0/2/1.7.0 sctl CLAIMED DEVICE Initiator /dev/rscsi/c3t7d0target

32、 7 0/0/2/1.15 tgt CLAIMED DEVICE disk 2 0/0/2/1.15.0 sdisk CLAIMED DEVICE HP 36.4GATLAS10K3_36_SCA /dev/dsk/c3t15d0 /dev/rdsk/c3t15d0unknown -1 0/0/3/0 UNCLAIMED UNKNOWN PCItty 0 0/0/4/0 asio0 CLAIMED INTERFACE PCI Serial (103c1048) /dev/GSPdiag1 /dev/mux0 /dev/tty0p1 /dev/diag/mux0 /dev/tty0p0 /dev

33、/tty0p2 tty 1 0/0/5/0 asio0 CLAIMED INTERFACE PCI Serial (103c1048) /dev/GSPdiag2 /dev/mux1 /dev/diag/mux1 /dev/tty1p1 memory 0 8 memory CLAIMED MEMORY Memoryprocessor 0 160 processor CLAIMED PROCESSOR ProcessorØ 在 “ioscan fn“ 的输出结果中,造成I/O 设备的状态是Unclaimed或 Unknown 的原因是:1. 此设备的Driver 没有加载到核心里;所以

34、操作系统无法识别和驱动这个I/O 设备。2. 也可能是由于没有安装相应I/O 设备的Patch.Ø 在“ioscan fn“的输出结果中,I/O设备的状态是“NO_HW“代表此I/O外设在主机最初启动时是正常的,可被系统正常访问;但现在系统核心已找不到这个I/O设备。1. I/O 设备物理损坏。2. 到这个I/O外设的硬件连接通道有问题。3. 这个I/O外设被在线移掉(例如:磁带机,DVD-ROM 被在线拔走)。6. 网络(#ioscan nfkClan,#lanscan,#neistat -in)Ø 服务器网络部分的检查,可分三步进行:1. 用命令 #ioscan nfC

35、lan,确认所有网络设备(网卡)的状态都是“Claimed“。Class I H/W Path Driver S/W State H/W Type Description=lan 0 0/0/0/0 btlan CLAIMED INTERFACE HP PCI 10/100Base-TX Core/dev/diag/lan0 /dev/ether0 /dev/lan02. 用命令#lanscan 查看网卡是否启动。Hardware Station Crd Hdw Net-InterfaceNM MAC HP-DLPI DLPIPath Address In# State NamePPA ID

36、Type Support Mjr#0/0/0/0 0x00306E0C194A 0 UP lan0 snap0 1 ETHER Yes 119硬件路径 网卡MAC地址 网卡的状态:UP - 启动DOWN 未启动3. 用命令netstat -in确认网络地址及状态。 Name Mtu Network Address Ipkts Opkts lo0 4136 838 838 lan0 1500 160952 111715网卡号(有*表明IP层不通)子网地址IP地址 接收包数量 发送包数量7. dmesg 输出Ø

37、; 运行命令dmesg 是一个即简单又快捷的方法来查看系统硬件及文件系统有无报错。dmesg 的工作原理是直接从系统的缓冲器(buffer)中读取系统最近一段时期内的硬件状态。Ø 命令dmesg 的缺点是输出结果中没有时间标志, 同时因为缓冲器的容量有限,近期的内容会覆盖缓冲器里以前的内容。Ø 服务器没有硬件报错时,dmesg的标准输出是:May 14 10:38gate64: sysvec_vaddr = 0xc0002000 for 2 pagesNOTICE: autofs_link(): File system was registered at index 3.N

38、OTICE: cachefs_link(): File system was registered at index 5.NOTICE: nfs3_link(): File system was registered at index 6.0 sba0/0 lba0/0/0/0 btlan0/0/1/0 c7200/0/1/0.7 tgt0/0/1/0.7.0 sctl0/0/1/1 c7200/0/1/1.2 tgt0/0/1/1.2.0 sdisk0/0/1/1.7 tgt0/0/1/1.7.0 sctl0/0/2/0 c7200/0/2/0.7 tgt0/0/2/0.7.0 sctl0/

39、0/2/1 c7200/0/2/1.2 tgt0/0/2/1.2.0 sdisk0/0/2/1.7 tgt0/0/2/1.7.0 sctl0/0/4/0 asio00/0/5/0 asio00/1 lba0/2 lba0/2/0/0 c7200/2/0/0.0 tgt0/2/0/0.0.0 schgr0/2/0/0.1 tgt0/2/0/0.1.0 stape0/2/0/0.7 tgt0/2/0/0.7.0 sctl0/3 lba0/4 lbac8xx BUS: 5 SCSI C1010 Ultra Wide LVD assigned CPU: 00/4/0/0 c8xx0/4/0/0.6 t

40、gt0/4/0/0.6.0 sctl0/5 lba0/5/0/0 c7200/5/0/0.2 tgt0/5/0/0.2.0 stape0/5/0/0.7 tgt0/5/0/0.7.0 sctl0/6 lba0/6/0/0 tdtd: claimed Tachyon XL2 Fibre Channel Mass Storage card at 0/6/0/00/6/0/0.8 fcp0/6/0/10.0 fcparray0/6/0/10.0.0 tgt0/6/0/ sdisk0/6/0/ sdisk0/6/0

41、/ sdisk0/6/0/10.1 fcparray0/6/0/10.1.0 tgt0/6/0/ sdisk0/6/0/ sdisk0/6/0/ sdisk0/6/0/55.6 fcpdev0/6/0/55.6.14 tgt0/6/0/ sctl0/7 lbac8xx BUS: 7 SCSI C1010 Ultra Wide LVD assigned CPU: 10/7/0/0 c8xx0

42、/7/0/0.6 tgt0/7/0/0.6.0 sctl8 memory160 processor166 processorbtlan: Initializing 10/100BASE-TX card at 0/0/0/0. System Console is on the Built-In Serial InterfaceLogical volume 64, 0x3 configured as ROOTLogical volume 64, 0x2 configured as SWAPLogical volume 64, 0x2 configured as DUMP Swap device table: (start & size given in 512-byte blocks) entry 0 - major is 64, minor is 0x2; start = 0, size = 5242880 Dump device table: (start & size given in 1-Kbyte blocks) entry 0000000000000000

温馨提示

  • 1. 本站所有资源如无特殊说明,都需要本地电脑安装OFFICE2007和PDF阅读器。图纸软件为CAD,CAXA,PROE,UG,SolidWorks等.压缩文件请下载最新的WinRAR软件解压。
  • 2. 本站的文档不包含任何第三方提供的附件图纸等,如果需要附件,请联系上传者。文件的所有权益归上传用户所有。
  • 3. 本站RAR压缩包中若带图纸,网页内容里面会有图纸预览,若没有图纸预览就没有图纸。
  • 4. 未经权益所有人同意不得将文件中的内容挪作商业或盈利用途。
  • 5. 人人文库网仅提供信息存储空间,仅对用户上传内容的表现方式做保护处理,对用户上传分享的文档内容本身不做任何修改或编辑,并不能对任何下载内容负责。
  • 6. 下载文件中如有侵权或不适当内容,请与我们联系,我们立即纠正。
  • 7. 本站不保证下载资源的准确性、安全性和完整性, 同时也不承担用户因使用这些下载资源对自己和他人造成任何形式的伤害或损失。

评论

0/150

提交评论