• Huawei Kirin (麒麟) server: hard disk problem


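    A record of the server I dealt with today:

    Situation: a Linux system that suddenly became unusable for no known reason (reportedly there was a power outage a while ago, although the machine room has an emergency power supply).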

    System environment:

    Server: Huawei RH2288H V3

    Operating system: Linux, Anolis OS 8.4 (龙蜥)

    Disks: two 300 GB hard disks configured as RAID 1

    The fault LEDs on both disks are lit.

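    None of the options in this screen can be selected:

    The logs follow:

    Could the experts help analyze what actually caused this?

    1. This is the app_debug_log log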
    2. 2023-11-17 21:10:37 Payload ERROR: payload_hs.c(1005): hse_activate_completed:hse_fru_activate_policy
    3. 2023-11-17 21:10:38 Payload ERROR: payload_hs.c(1019): hse_activate_completed:hs_send_evt(FRU_ACTIVATED_DEACTIVATED)
    4. 2023-11-17 21:10:38 Payload ERROR: payload_hs.c(1030): hse_activate_completed:hs_send_evt(FRU_ACTIVATED_COMPLETED)
    5. 2023-11-17 21:10:38 Payload ERROR: payload_pwr.c(767): Detect fru270868 payload power dropped.hotswap:M767 m_pwr_state:0 Hardware:1
    6. 2023-11-17 21:10:38 Payload ERROR: payload_hop.c(261): pwrpg_status:old_tmp=00,tmp=01
    7. 2023-11-17 21:10:38 Payload ERROR: payload_hs.c(177): move M1 to M2
    8. 2023-11-17 21:10:38 Payload ERROR: payload_hs.c(636): send activate event at M1
    9. 2023-11-17 21:10:38 Payload ERROR: payload_hs.c(948): hse_fru_activate:sending active event
    10. 2023-11-17 21:10:38 Payload ERROR: payload_pwr.c(1110): detect a host reset occured, start the host checker...
    11. 2023-11-17 21:10:38 Payload ERROR: payload_hs.c(213): move M2 to M3
    12. 2023-11-17 21:10:38 Payload ERROR: payload_hs.c(676): call pp_fru_pwr_ctrl(fru_id:0, POWER_ON)
    13. 2023-11-17 21:10:38 Payload ERROR: payload_hs.c(273): move M3 to M4
    14. 2023-11-17 21:10:38 Payload ERROR: payload_hop.c(1364): hop_on:already power on.
    15. 2023-11-17 21:10:38 Payload ERROR: payload_hs.c(1030): hse_activate_completed:hs_send_evt(FRU_ACTIVATED_COMPLETED)
    16. 2023-11-17 21:10:38 CpuMem ERROR: cpu.c(2880): Get cpu architecture failed!
    17. 2023-11-17 21:12:15 CpuMem ERROR: cpu.c(868): Cpu1:get processor_sn failed !
    18. 2023-11-17 21:12:15 CpuMem ERROR: cpu.c(990): Cpu2:get manufacturer failed !
    19. 2023-11-17 21:12:15 CpuMem ERROR: cpu.c(957): Cpu2:get processor_family failed !
    20. 2023-11-17 21:12:15 CpuMem ERROR: cpu.c(897): Cpu2:get processor_version failed !FRU
    21. 2023-11-17 21:12:15 CpuMem ERROR: cpu.c(868): Cpu2:get processor_sn failed !
    22. 2023-11-17 21:12:15 CpuMem ERROR: cpu.c(837): Cpu2:get processor_assettag failed !
    23. 2023-11-17 21:12:18 Payload ERROR: payload_pwr.c(1191): host start successfully, host checker exit.
    24. 2023-11-17 21:13:01 sensor_alarm ERROR: sel.c(626): NO matching Sel Filter, sensor_type is 0x1f, reading_type is 0x6f, event_data_1 is 0x7
    25. 2023-11-17 21:14:12 sensor_alarm ERROR: sel.c(626): NO matching Sel Filter, sensor_type is 0x1f, reading_type is 0x6f, event_data_1 is 0xa
    26. 2023-11-17 21:14:38 Payload ERROR: payload_pwr.c(5909): .... restart cause=0
    27. 2023-11-17 21:14:38 Payload ERROR: payload_pwr.c(1110): detect a host reset occured, start the host checker...
    28. 2023-11-17 21:15:59 CpuMem ERROR: cpu.c(868): Cpu1:get processor_sn failed !
    29. 2023-11-17 21:15:59 CpuMem ERROR: cpu.c(990): Cpu2:get manufacturer failed !
    30. 2023-11-17 21:15:59 CpuMem ERROR: cpu.c(957): Cpu2:get processor_family failed !
    31. 2023-11-17 21:15:59 CpuMem ERROR: cpu.c(897): Cpu2:get processor_version failed !
    32. 2023-11-17 21:15:59 CpuMem ERROR: cpu.c(868): Cpu2:get processor_sn failed !
    33. 2023-11-17 21:15:59 CpuMem ERROR: cpu.c(837): Cpu2:get processor_assettag failed !
    34. 2023-11-17 21:16:17 Payload ERROR: payload_pwr.c(1191): host start successfully, host checker exit.
    35. 2023-11-17 21:16:43 sensor_alarm ERROR: sel.c(626): NO matching Sel Filter, sensor_type is 0x1f, reading_type is 0x6f, event_data_1 is 0x7
    36. 2023-11-17 21:28:10 sensor_alarm ERROR: sel.c(626): NO matching Sel Filter, sensor_type is 0x1f, reading_type is 0x6f, event_data_1 is 0x8
    37. 2023-11-17 21:28:42 Payload ERROR: payload_pwr.c(5909): .... restart cause=0
    38. 2023-11-17 21:28:42 Payload ERROR: payload_pwr.c(1110): detect a host reset occured, start the host checker...
    39. 2023-11-17 21:29:57 CpuMem ERROR: cpu.c(868): Cpu1:get processor_sn failed !
    40. 2023-11-17 21:29:57 CpuMem ERROR: cpu.c(990): Cpu2:get manufacturer failed !
    41. 2023-11-17 21:29:57 CpuMem ERROR: cpu.c(957): Cpu2:get processor_family failed !
    42. 2023-11-17 21:29:57 CpuMem ERROR: cpu.c(897): Cpu2:get processor_version failed !
    43. 2023-11-17 21:29:57 CpuMem ERROR: cpu.c(868): Cpu2:get processor_sn failed !
    44. 2023-11-17 21:29:57 CpuMem ERROR: cpu.c(837): Cpu2:get processor_assettag failed !
    45. 2023-11-17 21:30:22 Payload ERROR: payload_pwr.c(1191): host start successfully, host checker exit.
    46. 2023-11-17 21:31:08 Payload ERROR: payload_hop.c(208): fru0 acpi_status:old_tmp=01,tmp=00
    47. 2023-11-17 21:31:08 Payload ERROR: payload_pwr.c(706): pp_check_pwr_mutation 706:Detect fru0 payload power dropped.hotswap:M4 m_pwr_state:1 Hardware:0
    48. 2023-11-17 21:31:08 Payload ERROR: payload_hop.c(261): pwrpg_status:old_tmp=01,tmp=00
    49. 2023-11-17 21:31:08 Payload ERROR: payload_hs.c(394): move M4 to M6
    50. 2023-11-17 21:31:09 Payload ERROR: payload_pwr.c(706): pp_check_pwr_mutation 706:Detect fru0 payload power dropped.hotswap:M6 m_pwr_state:0 Hardware:0
    51. 2023-11-17 21:31:09 Payload ERROR: payload_hs.c(548): move M6 to M1
    52. 2023-11-17 21:31:09 Payload ERROR: payload_pwr.c(706): pp_check_pwr_mutation 706:Detect fru0 payload power dropped.hotswap:M1 m_pwr_state:0 Hardware:0
    53. 2023-11-17 21:37:34 Payload ERROR: payload_hop.c(208): fru0 acpi_status:old_tmp=00,tmp=01
    54. 2023-11-17 21:37:34 Payload ERROR: payload_hs.c(1005): hse_activate_completed:hse_fru_activate_policy
    55. 2023-11-17 21:37:34 Payload ERROR: payload_hs.c(1019): hse_activate_completed:hs_send_evt(FRU_ACTIVATED_DEACTIVATED)
    56. 2023-11-17 21:37:34 Payload ERROR: payload_hs.c(1030): hse_activate_completed:hs_send_evt(FRU_ACTIVATED_COMPLETED)
    57. 2023-11-17 21:37:34 Payload ERROR: payload_pwr.c(767): Detect fru270868 payload power dropped.hotswap:M767 m_pwr_state:0 Hardware:1
    58. 2023-11-17 21:37:34 Payload ERROR: payload_hop.c(261): pwrpg_status:old_tmp=00,tmp=01
    59. 2023-11-17 21:37:34 Payload ERROR: payload_hs.c(177): move M1 to M2
    60. 2023-11-17 21:37:34 Payload ERROR: payload_hs.c(636): send activate event at M1
    61. 2023-11-17 21:37:34 Payload ERROR: payload_hs.c(948): hse_fru_activate:sending active event
    62. 2023-11-17 21:37:34 Payload ERROR: payload_pwr.c(1110): detect a host reset occured, start the host checker...
    63. 2023-11-17 21:37:35 Payload ERROR: payload_hs.c(213): move M2 to M3
    64. 2023-11-17 21:37:35 Payload ERROR: payload_hs.c(676): call pp_fru_pwr_ctrl(fru_id:0, POWER_ON)
    65. 2023-11-17 21:37:35 Payload ERROR: payload_hs.c(273): move M3 to M4
    66. 2023-11-17 21:37:35 Payload ERROR: payload_hop.c(1364): hop_on:already power on.
    67. 2023-11-17 21:37:35 Payload ERROR: payload_hs.c(1030): hse_activate_completed:hs_send_evt(FRU_ACTIVATED_COMPLETED)
    68. 2023-11-17 21:39:13 Payload ERROR: payload_pwr.c(1191): host start successfully, host checker exit.
    69. 2023-11-17 21:46:16 Payload ERROR: payload_hop.c(208): fru0 acpi_status:old_tmp=01,tmp=00
    70. 2023-11-17 21:46:16 Payload ERROR: payload_pwr.c(706): pp_check_pwr_mutation 706:Detect fru0 payload power dropped.hotswap:M4 m_pwr_state:1 Hardware:0
    71. 2023-11-17 21:46:16 Payload ERROR: payload_hop.c(261): pwrpg_status:old_tmp=01,tmp=00
    72. 2023-11-17 21:46:16 Payload ERROR: payload_hs.c(394): move M4 to M6
    73. 2023-11-17 21:46:16 Payload ERROR: payload_pwr.c(706): pp_check_pwr_mutation 706:Detect fru0 payload power dropped.hotswap:M6 m_pwr_state:0 Hardware:0
    74. 2023-11-17 21:46:16 Payload ERROR: payload_hs.c(548): move M6 to M1
    75. 2023-11-17 21:46:17 Payload ERROR: payload_pwr.c(706): pp_check_pwr_mutation 706:Detect fru0 payload power dropped.hotswap:M1 m_pwr_state:0 Hardware:0
    76. 2023-11-17 21:46:25 Payload ERROR: payload_hop.c(208): fru0 acpi_status:old_tmp=00,tmp=01
    77. 2023-11-17 21:46:25 Payload ERROR: payload_hs.c(1005): hse_activate_completed:hse_fru_activate_policy
    78. 2023-11-17 21:46:25 Payload ERROR: payload_hs.c(1019): hse_activate_completed:hs_send_evt(FRU_ACTIVATED_DEACTIVATED)
    79. 2023-11-17 21:46:25 Payload ERROR: payload_hs.c(1030): hse_activate_completed:hs_send_evt(FRU_ACTIVATED_COMPLETED)
    80. 2023-11-17 21:46:25 Payload ERROR: payload_pwr.c(767): Detect fru270868 payload power dropped.hotswap:M767 m_pwr_state:0 Hardware:1
    81. 2023-11-17 21:46:25 Payload ERROR: payload_hop.c(261): pwrpg_status:old_tmp=00,tmp=01
    82. 2023-11-17 21:46:25 Payload ERROR: payload_hs.c(177): move M1 to M2
    83. 2023-11-17 21:46:25 Payload ERROR: payload_hs.c(636): send activate event at M1
    84. 2023-11-17 21:46:25 Payload ERROR: payload_hs.c(948): hse_fru_activate:sending active event
    85. 2023-11-17 21:46:25 Payload ERROR: payload_pwr.c(1110): detect a host reset occured, start the host checker...
    86. 2023-11-17 21:46:25 Payload ERROR: payload_hs.c(213): move M2 to M3
    87. 2023-11-17 21:46:25 Payload ERROR: payload_hs.c(676): call pp_fru_pwr_ctrl(fru_id:0, POWER_ON)
    88. 2023-11-17 21:46:25 Payload ERROR: payload_hop.c(1364): hop_on:already power on.
    89. 2023-11-17 21:46:26 Payload ERROR: payload_hs.c(273): move M3 to M4
    90. 2023-11-17 21:46:26 Payload ERROR: payload_hs.c(1030): hse_activate_completed:hs_send_evt(FRU_ACTIVATED_COMPLETED)
    91. 2023-11-17 21:48:06 Payload ERROR: payload_pwr.c(1191): host start successfully, host checker exit.
    92. 2023-11-17 21:49:15 Payload ERROR: payload_pwr.c(5909): .... restart cause=0
    93. 2023-11-17 21:49:15 Payload ERROR: payload_pwr.c(1110): detect a host reset occured, start the host checker...
    94. 2023-11-17 21:50:55 Payload ERROR: payload_pwr.c(1191): host start successfully, host checker exit.
    95. 2023-11-17 21:52:28 Payload ERROR: payload_pwr.c(5909): .... restart cause=0
    96. 2023-11-17 21:52:28 Payload ERROR: payload_pwr.c(1110): detect a host reset occured, start the host checker...
    97. 2023-11-17 21:54:07 Payload ERROR: payload_pwr.c(1191): host start successfully, host checker exit.
    98. 2023-11-17 21:55:19 Payload ERROR: payload_pwr.c(5909): .... restart cause=0
    99. 2023-11-17 21:55:19 Payload ERROR: payload_pwr.c(1110): detect a host reset occured, start the host checker...
    100. 2023-11-17 21:57:00 Payload ERROR: payload_pwr.c(1191): host start successfully, host checker exit.
    101. 2023-11-17 21:58:23 Payload ERROR: payload_pwr.c(5909): .... restart cause=0
    102. 2023-11-17 21:58:23 Payload ERROR: payload_pwr.c(1110): detect a host reset occured, start the host checker...
    103. 2023-11-17 22:00:02 Payload ERROR: payload_pwr.c(1191): host start successfully, host checker exit.
    104. 2023-11-17 22:08:07 Payload ERROR: payload_pwr.c(5909): .... restart cause=0
    105. 2023-11-17 22:08:07 Payload ERROR: payload_pwr.c(1110): detect a host reset occured, start the host checker...
    106. 2023-11-17 22:09:48 Payload ERROR: payload_pwr.c(1191): host start successfully, host checker exit.
    107. 2023-11-19 09:14:17 Payload ERROR: payload_hop.c(208): fru0 acpi_status:old_tmp=01,tmp=00
    108. 2023-11-19 09:14:17 Payload ERROR: payload_hs.c(394): move M4 to M6
    109. 2023-11-19 09:14:17 Payload ERROR: payload_pwr.c(706): pp_check_pwr_mutation 706:Detect fru0 payload power dropped.hotswap:M6 m_pwr_state:1 Hardware:0
    110. 2023-11-19 09:14:17 Payload ERROR: payload_hop.c(261): pwrpg_status:old_tmp=01,tmp=00
    111. 2023-11-19 09:14:18 Payload ERROR: payload_pwr.c(706): pp_check_pwr_mutation 706:Detect fru0 payload power dropped.hotswap:M6 m_pwr_state:0 Hardware:0
    112. 2023-11-19 09:14:18 Payload ERROR: payload_hs.c(548): move M6 to M1
    113. 2023-11-19 09:14:18 Payload ERROR: payload_pwr.c(706): pp_check_pwr_mutation 706:Detect fru0 payload power dropped.hotswap:M1 m_pwr_state:0 Hardware:0
    114. 2023-11-19 09:15:41 Payload ERROR: payload_hop.c(208): fru0 acpi_status:old_tmp=00,tmp=01
    115. 2023-11-19 09:15:41 Payload ERROR: payload_hs.c(1005): hse_activate_completed:hse_fru_activate_policy
    116. 2023-11-19 09:15:41 Payload ERROR: payload_hs.c(1019): hse_activate_completed:hs_send_evt(FRU_ACTIVATED_DEACTIVATED)
    117. 2023-11-19 09:15:41 Payload ERROR: payload_hs.c(1030): hse_activate_completed:hs_send_evt(FRU_ACTIVATED_COMPLETED)
    118. 2023-11-19 09:15:41 Payload ERROR: payload_pwr.c(767): Detect fru270868 payload power dropped.hotswap:M767 m_pwr_state:0 Hardware:1
    119. 2023-11-19 09:15:41 Payload ERROR: payload_hs.c(177): move M1 to M2
    120. 2023-11-19 09:15:41 Payload ERROR: payload_hop.c(261): pwrpg_status:old_tmp=00,tmp=01
    121. 2023-11-19 09:15:41 Payload ERROR: payload_hs.c(636): send activate event at M1
    122. 2023-11-19 09:15:41 Payload ERROR: payload_hs.c(948): hse_fru_activate:sending active event
    123. 2023-11-19 09:15:41 Payload ERROR: payload_pwr.c(1110): detect a host reset occured, start the host checker...
    124. 2023-11-19 09:15:42 Payload ERROR: payload_hs.c(213): move M2 to M3
    125. 2023-11-19 09:15:42 Payload ERROR: payload_hs.c(676): call pp_fru_pwr_ctrl(fru_id:0, POWER_ON)
    126. 2023-11-19 09:15:42 Payload ERROR: payload_hop.c(1364): hop_on:already power on.
    127. 2023-11-19 09:15:42 Payload ERROR: payload_hs.c(273): move M3 to M4
    128. 2023-11-19 09:15:42 Payload ERROR: payload_hs.c(1030): hse_activate_completed:hs_send_evt(FRU_ACTIVATED_COMPLETED)
    129. 2023-11-19 09:17:21 Payload ERROR: payload_pwr.c(1191): host start successfully, host checker exit.
    130. 2023-11-19 09:19:46 Payload ERROR: payload_hop.c(208): fru0 acpi_status:old_tmp=01,tmp=00
    131. 2023-11-19 09:19:46 Payload ERROR: payload_pwr.c(706): pp_check_pwr_mutation 706:Detect fru0 payload power dropped.hotswap:M4 m_pwr_state:1 Hardware:0
    132. 2023-11-19 09:19:46 Payload ERROR: payload_hop.c(261): pwrpg_status:old_tmp=01,tmp=00
    133. 2023-11-19 09:19:47 Payload ERROR: payload_hs.c(394): move M4 to M6
    134. 2023-11-19 09:19:47 Payload ERROR: payload_pwr.c(706): pp_check_pwr_mutation 706:Detect fru0 payload power dropped.hotswap:M6 m_pwr_state:0 Hardware:0
    135. 2023-11-19 09:19:47 Payload ERROR: payload_hs.c(548): move M6 to M1
    136. 2023-11-19 09:19:47 Payload : ERROR: payload_pwr.c(706): pp_check_pwr_mutation 706:Detect fru0 payload power dropped.hotswap:M6 m_pwr_state:0 Hardware:0 (repeated 2 times)
    137. 2023-11-19 09:19:47 Payload ERROR: payload_pwr.c(706): pp_check_pwr_mutation 706:Detect fru0 payload power dropped.hotswap:M1 m_pwr_state:0 Hardware:0
    138. 1970-01-01 00:00:35 Payload ERROR: payload_hop.c(191): get fru1 object fail!(result=-2009)
    139. 1970-01-01 00:00:35 Payload ERROR: payload_hop.c(191): get fru2 object fail!(result=-2009)
    140. 1970-01-01 00:00:35 Payload ERROR: payload_hop.c(191): get fru3 object fail!(result=-2009)
    141. 1970-01-01 00:00:35 Payload ERROR: payload_hop.c(191): get fru4 object fail!(result=-2009)
    142. 1970-01-01 00:00:35 Payload ERROR: payload_hop.c(191): get fru5 object fail!(result=-2009)
    143. 1970-01-01 00:00:35 Payload ERROR: payload_hop.c(191): get fru6 object fail!(result=-2009)
    144. 1970-01-01 00:00:35 Payload ERROR: payload_hop.c(191): get fru7 object fail!(result=-2009)
    145. 1970-01-01 00:00:35 Payload ERROR: payload_hop.c(191): get fru8 object fail!(result=-2009)
    146. 1970-01-01 00:00:35 Payload ERROR: payload_hop.c(191): get fru9 object fail!(result=-2009)
    147. 1970-01-01 00:00:35 Payload ERROR: payload_hop.c(191): get fru10 object fail!(result=-2009)
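
A listing this long is easier to read once it is tallied. The following is a minimal sketch, assuming each entry has the shape `date time module ERROR: file.c(line): message` seen above, with the optional `N. ` list prefix. The names `ENTRY`, `parse_entries`, `count_per_day` and `SAMPLE` are illustrative, not part of any Huawei tool. It counts how often a given message appears per day, for example the host resets and the payload power drops.

```python
import re
from collections import Counter

# One app_debug_log entry. The "N. " list prefix and the odd
# "Payload : ERROR:" variant from the listing are both tolerated.
ENTRY = re.compile(
    r"^\s*(?:\d+\.\s+)?"
    r"(?P<date>\d{4}-\d{2}-\d{2}) (?P<time>\d{2}:\d{2}:\d{2}) "
    r"(?P<module>\w+)(?: :)? ERROR: "
    r"(?P<source>[\w.]+)\((?P<line>\d+)\): (?P<message>.*)$"
)


def parse_entries(text):
    """Return one dict per log line that matches ENTRY; other lines are skipped."""
    entries = []
    for raw in text.splitlines():
        match = ENTRY.match(raw)
        if match:
            entries.append(match.groupdict())
    return entries


def count_per_day(entries, needle):
    """Count entries whose message contains needle, grouped by date."""
    return Counter(e["date"] for e in entries if needle in e["message"])


# A few lines copied from the listing above.
SAMPLE = """
26. 2023-11-17 21:14:38 Payload ERROR: payload_pwr.c(5909): .... restart cause=0
27. 2023-11-17 21:14:38 Payload ERROR: payload_pwr.c(1110): detect a host reset occured, start the host checker...
34. 2023-11-17 21:16:17 Payload ERROR: payload_pwr.c(1191): host start successfully, host checker exit.
47. 2023-11-17 21:31:08 Payload ERROR: payload_pwr.c(706): pp_check_pwr_mutation 706:Detect fru0 payload power dropped.hotswap:M4 m_pwr_state:1 Hardware:0
123. 2023-11-19 09:15:41 Payload ERROR: payload_pwr.c(1110): detect a host reset occured, start the host checker...
136. 2023-11-19 09:19:47 Payload : ERROR: payload_pwr.c(706): pp_check_pwr_mutation 706:Detect fru0 payload power dropped.hotswap:M6 m_pwr_state:0 Hardware:0 (repeated 2 times)
"""

entries = parse_entries(SAMPLE)
resets = count_per_day(entries, "detect a host reset")
drops = count_per_day(entries, "payload power dropped")
print("host resets per day:", dict(resets))
print("power drops per day:", dict(drops))
```

Feeding the whole listing to `parse_entries` instead of `SAMPLE` gives the per-day totals, which makes the repeated reset and power-cycle pattern easier to see than scrolling through the raw entries.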

    This is the fdm_log log

    1. This is the fdm_log log
    2. [Hardware Error Log]:NO.1 SMI Serial NO.0
    3. collect:bios(smi) time: 2023-11-17 21:49:35 GMT flag:0x00
    4. CPU:0 (socket:CPU1) LogType: IIO AER module:PCIe ADDITIONAL
    5. DEV:(0x00:0x01.0x00)
    6. First Error type: Non-Fatal ERROR Error Code: Received PCIe completion with UR (80)
    7. Error type: corrected errors Error Code: PCIe link bandwidth changed (76)
    8. ------iio pcie additional reg dump:------
    9. errpin_ctl: 0x00000000
    10. errpin_stat: 0x00000000
    11. g_sys_ctl: 0x00000000
    12. g_sys_stat: 0x00000000
    13. sys_map: 0x00000120
    14. g_err_ctl: 0x00000000
    15. g_ferr_stat: 0x00000000
    16. g_nerr_stat: 0x00901140
    17. g_cerr_stat: 0x00101140
    18. g_f_ferr_stat: 0x00000000
    19. g_n_ferr_stat: 0x00000000
    20. g_f_nferr_stat: 0x00100000
    21. g_n_nferr_stat: 0x00801140
    22. g_f_cerr_stat: 0x00100000
    23. g_n_cerr_stat: 0x00001140
    24. pcie_uncorrectable_err_detect_mask: 0x00000000
    25. pcie_correctable_err_detect_mask: 0x00000000
    26. pcie_uncorrectable_err_stat: 0x00000040
    27. pcie_correctable_err_stat: 0x00000001
    28. pcie_uncorrectable_err_mask: 0x00000000
    29. pcie_correctable_err_mask: 0x00000000
    30. pcie_uncorrectable_err_ptr: 0x00000006
    31. pcie_uncorrectable_err_sv: 0x00000002
    32. pcie_global_err_stat: 0x0000
    33. pcie_global_f_err_ptr: 0x0000
    34. [Hardware Error Log]:NO.2 SMI Serial NO.0
    35. collect:bios(smi) time: 2023-11-17 21:49:35 GMT flag:0x00
    36. CPU:0 (socket:CPU1) LogType: IIO AER module:PCIe ADDITIONAL
    37. DEV:(0x00:0x02.0x00)
    38. First Error type: Non-Fatal ERROR Error Code: Received PCIe completion with UR (80)
    39. Error type: corrected errors Error Code: PCIe link bandwidth changed (76)
    40. ------iio pcie additional reg dump:------
    41. errpin_ctl: 0x00000000
    42. errpin_stat: 0x00000000
    43. g_sys_ctl: 0x00000000
    44. g_sys_stat: 0x00000000
    45. sys_map: 0x00000120
    46. g_err_ctl: 0x00000000
    47. g_ferr_stat: 0x00000000
    48. g_nerr_stat: 0x00901140
    49. g_cerr_stat: 0x00101140
    50. g_f_ferr_stat: 0x00000000
    51. g_n_ferr_stat: 0x00000000
    52. g_f_nferr_stat: 0x00100000
    53. g_n_nferr_stat: 0x00801140
    54. g_f_cerr_stat: 0x00100000
    55. g_n_cerr_stat: 0x00001140
    56. pcie_uncorrectable_err_detect_mask: 0x00000000
    57. pcie_correctable_err_detect_mask: 0x00000000
    58. pcie_uncorrectable_err_stat: 0x00000040
    59. pcie_correctable_err_stat: 0x00000001
    60. pcie_uncorrectable_err_mask: 0x00000000
    61. pcie_correctable_err_mask: 0x00000000
    62. pcie_uncorrectable_err_ptr: 0x00000006
    63. pcie_uncorrectable_err_sv: 0x00000002
    64. pcie_global_err_stat: 0x0000
    65. pcie_global_f_err_ptr: 0x0000
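
The status registers in the dump are bit fields, so the first step in reading them is to see which bits are set. The sketch below does only that: the meaning of each bit is platform-specific and is not given anywhere in this post, so the output is a list of bit positions to look up in the platform's register documentation, not a decoded error name. `set_bits` and `REGISTERS` are illustrative names.

```python
def set_bits(value):
    """Return the positions (0 = least significant) of the bits set in value."""
    return [i for i in range(value.bit_length()) if (value >> i) & 1]


# Values copied from the dump above (identical in entries NO.1 and NO.2).
REGISTERS = {
    "g_nerr_stat": 0x00901140,
    "g_cerr_stat": 0x00101140,
    "pcie_uncorrectable_err_stat": 0x00000040,
    "pcie_correctable_err_stat": 0x00000001,
}

for name, value in REGISTERS.items():
    print(f"{name}: 0x{value:08x} -> bits {set_bits(value)}")
```

Each of the two per-port status registers has exactly one bit set, which is consistent with the log reporting one non-fatal error and one corrected error per device.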

     The mass_operate_log log

    1. 2022-09-10 05:06:27 IPMI,Unknown@Unknown,ipmi_app,Send message(CH-6) [ 2C B8 1C 20 0E D5 57 01 00 00 3C 00 69 ]
    2. 2022-09-10 05:26:57 IPMI,Unknown@Unknown,ipmi_app,Send message(CH-6) [ 2C B8 1C 20 0E D5 57 01 00 00 3C 00 69 ]
    3. 2023-11-18 05:16:37 IPMI,Unknown@Unknown,ipmi_app,Send message(CH-6) [ 2C B8 1C 20 0E D5 57 01 00 00 3C 00 69 ]
    4. 2023-11-18 05:19:06 IPMI,Unknown@Unknown,ipmi_app,Send message(CH-6) [ 2C B8 1C 20 0E D5 57 01 00 00 3C 00 69 ]
    5. 2023-11-18 05:21:36 IPMI,Unknown@Unknown,ipmi_app,Send message(CH-6) [ 2C B8 1C 20 0E D5 57 01 00 00 3C 00 69 ]
    6. 2023-11-18 06:24:38 IPMI,Unknown@Unknown,ipmi_app,Send message(CH-6) [ 2C B8 1C 20 0E D5 57 01 00 00 3C 00 69 ]
    7. 2023-11-18 06:26:25 IPMI,Unknown@Unknown,ipmi_app,Send message(CH-6) [ 2C B8 1C 20 0E D5 57 01 00 00 3C 00 69 ]
    8. 2023-11-18 06:35:17 IPMI,Unknown@Unknown,ipmi_app,Send message(CH-6) [ 2C B8 1C 20 0E D5 57 01 00 00 3C 00 69 ]
    9. 2023-11-18 07:21:40 IPMI,Unknown@Unknown,ipmi_app,Send message(CH-6) [ 2C B8 1C 20 0E D5 57 01 00 00 3C 00 69 ]
    10. 2023-11-18 07:24:42 IPMI,Unknown@Unknown,ipmi_app,Send message(CH-6) [ 2C B8 1C 20 0E D5 57 01 00 00 3C 00 69 ]
    11. 2023-11-17 18:46:05 IPMI,Unknown@Unknown,ipmi_app,Send message(CH-6) [ 2C B8 1C 20 0E D5 57 01 00 00 3C 00 69 ]
    12. 2023-11-17 18:46:28 IPMI,Unknown@Unknown,ipmi_app,Send message(CH-6) [ 2C B8 1C 20 0E D5 57 01 00 00 3C 00 69 ]
    13. 2023-11-17 18:51:20 IPMI,Unknown@Unknown,ipmi_app,Send message(CH-6) [ 2C B8 1C 20 0E D5 57 01 00 00 3C 00 69 ]
    14. 2023-11-17 19:28:14 IPMI,Unknown@Unknown,ipmi_app,Send message(CH-6) [ 2C B8 1C 20 0E D5 57 01 00 00 3C 00 69 ]
    15. 2023-11-17 19:45:00 IPMI,Unknown@Unknown,ipmi_app,Send message(CH-6) [ 2C B8 1C 20 0E D5 57 01 00 00 3C 00 69 ]
    16. 2023-11-17 19:48:57 IPMI,Unknown@Unknown,ipmi_app,Send message(CH-6) [ 2C B8 1C 20 0E D5 57 01 00 00 3C 00 69 ]
    17. 2023-11-17 19:50:23 IPMI,Unknown@Unknown,ipmi_app,Send message(CH-6) [ 2C B8 1C 20 0E D5 57 01 00 00 3C 00 69 ]
    18. 2023-11-17 20:39:49 IPMI,Unknown@Unknown,ipmi_app,Send message(CH-6) [ 2C B8 1C 20 0E D5 57 01 00 00 3C 00 69 ]
    19. 2023-11-17 20:45:42 IPMI,Unknown@Unknown,ipmi_app,Send message(CH-6) [ 2C B8 1C 20 0E D5 57 01 00 00 3C 00 69 ]
    20. 2023-11-17 21:12:15 IPMI,Unknown@Unknown,ipmi_app,Send message(CH-6) [ 2C B8 1C 20 0E D5 57 01 00 00 3C 00 69 ]
    21. 2023-11-17 21:15:59 IPMI,Unknown@Unknown,ipmi_app,Send message(CH-6) [ 2C B8 1C 20 0E D5 57 01 00 00 3C 00 69 ]
    22. 2023-11-17 21:29:57 IPMI,Unknown@Unknown,ipmi_app,Send message(CH-6) [ 2C B8 1C 20 0E D5 57 01 00 00 3C 00 69 ]
    23. 2023-11-17 21:51:40 IPMI,Unknown@Unknown,ipmi_app,Send message(CH-6) [ 2C B8 1C 20 0E D5 57 01 00 00 3C 00 69 ]
    24. 2023-11-17 21:54:50 IPMI,Unknown@Unknown,ipmi_app,Send message(CH-6) [ 2C B8 1C 20 0E D5 57 01 00 00 3C 00 69 ]
    25. 2023-11-17 21:57:51 IPMI,Unknown@Unknown,ipmi_app,Send message(CH-6) [ 2C B8 1C 20 0E D5 57 01 00 00 3C 00 69 ]
    26. 2023-11-19 09:40:44 IPMI,Unknown@Unknown,ipmi_app,Send message(CH-6) [ 2C B8 1C 20 0E D5 57 01 00 00 3C 00 69 ]
    27. 2023-11-19 10:18:33 IPMI,Unknown@Unknown,ipmi_app,Send message(CH-6) [ 2C B8 1C 20 0E D5 57 01 00 00 3C 00 69 ]
    28. 2023-11-19 10:52:12 IPMI,Unknown@Unknown,ipmi_app,Send message(CH-6) [ 2C B8 1C 20 0E D5 57 01 00 00 3C 00 69 ]
    29. 2023-11-19 10:56:25 IPMI,Unknown@Unknown,ipmi_app,Send message(CH-6) [ 2C B8 1C 20 0E D5 57 01 00 00 3C 00 69 ]
    30. 2023-11-19 11:02:53 IPMI,Unknown@Unknown,ipmi_app,Send message(CH-6) [ 2C B8 1C 20 0E D5 57 01 00 00 3C 00 69 ]
    31. 2023-11-19 11:34:47 IPMI,Unknown@Unknown,ipmi_app,Send message(CH-6) [ 2C B8 1C 20 0E D5 57 01 00 00 3C 00 69 ]
    32. 2023-11-19 11:41:43 IPMI,Unknown@Unknown,ipmi_app,Send message(CH-6) [ 2C B8 1C 20 0E D5 57 01 00 00 3C 00 69 ]
    33. 2023-11-19 11:44:44 IPMI,Unknown@Unknown,ipmi_app,Send message(CH-6) [ 2C B8 1C 20 0E D5 57 01 00 00 3C 00 69 ]
    34. 2023-11-19 12:08:41 IPMI,Unknown@Unknown,ipmi_app,Send message(CH-6) [ 2C B8 1C 20 0E D5 57 01 00 00 3C 00 69 ]

     The disk log follows

    1. 161 Normal 2023-11-18 Saturday 05:14:25 ACPI State Power on state 2200FFFF Asserted
    2. 162 Normal 2023-11-18 Saturday 05:14:25 DIMM010 Presence detected, dimm is 0/1/0 0C06FFFF Asserted
    3. 163 Normal 2023-11-18 Saturday 05:14:31 SysRestart System Restart [Power button][LOCAL] 1D0703FF Asserted
    4. 164 Normal 2023-11-18 Saturday 05:16:50 SysRestart System Restart [Unknown][IPMB] 1D0700FF Asserted
    5. 165 Normal 2023-11-18 Saturday 05:19:44 SysRestart System Restart [Unknown][IPMB] 1D0700FF Asserted
    6. 166 Major 2023-11-18 Saturday 05:21:56 CPU1 Prochot State Asserted 0341FFFF Asserted
    7. 167 Normal 2023-11-18 Saturday 06:22:18 SysRestart System Restart [Unknown][IPMB] 1D0700FF Asserted
    8. 168 Major 2023-11-18 Saturday 06:22:20 CPU1 Prochot State Asserted 03C1FFFF Deasserted
    9. 169 Normal 2023-11-18 Saturday 06:25:00 SysRestart System Restart [Unknown][IPMB] 1D0700FF Asserted
    10. 170 Normal 2023-11-18 Saturday 06:34:02 SysRestart System Restart [Unknown][IPMB] 1D0700FF Asserted
    11. 171 Normal 2023-11-18 Saturday 07:20:25 SysRestart System Restart [Unknown][IPMB] 1D0700FF Asserted
    12. 172 Normal 2023-11-18 Saturday 07:23:28 SysRestart System Restart [Unknown][IPMB] 1D0700FF Asserted
    13. 173 Normal 2023-11-17 Friday 18:43:54 Eth1 Link Down Slot is Disabled 2108FFFF Asserted
    14. 174 Normal 2023-11-17 Friday 18:44:36 SysRestart System Restart [Unknown][IPMB] 1D0700FF Asserted
    15. 175 Normal 2023-11-17 Friday 18:49:42 SysRestart System Restart [Unknown][IPMB] 1D0700FF Asserted
    16. 176 Normal 2023-11-17 Friday 19:18:27 DISK0 Hard disk presence 0D80FFFF Deasserted
    17. 177 Major 2023-11-17 Friday 19:18:44 DISK0 In Failed Array 0D06FFFF Asserted
    18. 178 Normal 2023-11-17 Friday 19:18:52 DISK0 Hard disk presence 0D00FFFF Asserted
    19. 179 Major 2023-11-17 Friday 19:19:02 DISK0 Hard disk drive fault 0D01FFFF Asserted
    20. 180 Major 2023-11-17 Friday 19:19:02 DISK0 In Failed Array 0D86FFFF Deasserted
    21. 181 Normal 2023-11-17 Friday 19:19:24 DISK0 Hard disk presence 0D80FFFF Deasserted
    22. 182 Normal 2023-11-17 Friday 19:19:27 DISK0 Hard disk presence 0D00FFFF Asserted
    23. 183 Normal 2023-11-17 Friday 19:19:30 DISK0 Hard disk presence 0D80FFFF Deasserted
    24. 184 Normal 2023-11-17 Friday 19:19:32 DISK0 Hard disk presence 0D00FFFF Asserted
    25. 185 Normal 2023-11-17 Friday 19:19:54 DISK0 Hard disk presence 0D80FFFF Deasserted
    26. 186 Normal 2023-11-17 Friday 19:19:57 DISK0 Hard disk presence 0D00FFFF Asserted
    27. 187 Normal 2023-11-17 Friday 19:25:01 DISK0 Hard disk presence 0D80FFFF Deasserted
    28. 188 Normal 2023-11-17 Friday 19:25:03 DISK0 Hard disk presence 0D00FFFF Asserted
    29. 189 Normal 2023-11-17 Friday 19:25:09 DISK0 Hard disk presence 0D80FFFF Deasserted
    30. 190 Major 2023-11-17 Friday 19:25:18 DISK0 Hard disk drive fault 0D81FFFF Deasserted
    31. 191 Major 2023-11-17 Friday 19:25:19 DISK0 In Failed Array 0D06FFFF Asserted
    32. 192 Normal 2023-11-17 Friday 19:25:59 DISK0 Hard disk presence 0D00FFFF Asserted
    33. 193 Major 2023-11-17 Friday 19:26:08 DISK0 Hard disk drive fault 0D01FFFF Asserted
    34. 194 Major 2023-11-17 Friday 19:26:08 DISK0 In Failed Array 0D86FFFF Deasserted
    35. 195 Normal 2023-11-17 Friday 19:26:55 SysRestart System Restart [Unknown][IPMB] 1D0700FF Asserted
    36. 196 Major 2023-11-17 Friday 19:27:07 DISK0 Hard disk drive fault 0D81FFFF Deasserted
    37. 197 Major 2023-11-17 Friday 19:28:09 DISK0 Hard disk drive fault 0D01FFFF Asserted
    38. 198 Normal 2023-11-17 Friday 19:42:20 Power Button Power button pressed 1400FFFF Asserted
    39. 199 Normal 2023-11-17 Friday 19:42:24 ACPI State Power off state 2206FFFF Asserted
    40. 200 Normal 2023-11-17 Friday 19:42:25 DIMM000 Presence detected, dimm is 0/0/0 0C86FFFF Deasserted
    41. 201 Normal 2023-11-17 Friday 19:42:26 DIMM010 Presence detected, dimm is 0/1/0 0C86FFFF Deasserted
    42. 202 Major 2023-11-17 Friday 19:42:35 DISK0 Hard disk drive fault 0D81FFFF Deasserted
    43. 203 Normal 2023-11-17 Friday 19:42:37 DISK0 Hard disk presence 0D80FFFF Deasserted
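
    1. "Eth1 Link Down" means the link on Ethernet port 1 went down.
    2. "SysRestart" is a system restart event.
    3. "Hard disk presence" reports whether the hard disk is detected as present.
    4. "Hard disk drive fault" indicates a hard disk drive fault.
    5. "In Failed Array" means the array (RAID) that the disk belongs to is in a failed state.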

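    In these logs, Normal marks an ordinary event and Major marks a more serious one, such as a disk fault.

    According to these logs, the system went through several hardware problems: an Ethernet link going down, system restarts, and changes in the disk's presence and fault state. These events can affect normal operation and need further diagnosis and handling.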
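
The event types described above can be counted rather than read by eye. This is a rough sketch, assuming every event line has the shape `id severity date weekday time description code Asserted|Deasserted` shown in the disk log, with the optional `N. ` list prefix. `SEL_LINE`, `parse_sel` and `summarize` are hypothetical helper names, and the sample is a handful of lines copied from the log.

```python
import re
from collections import Counter

# One event line from the disk log; the leading "N. " list prefix is optional.
SEL_LINE = re.compile(
    r"^\s*(?:\d+\.\s+)?"
    r"(?P<id>\d+) (?P<severity>\w+) "
    r"(?P<date>\d{4}-\d{2}-\d{2}) \w+ (?P<time>\d{2}:\d{2}:\d{2}) "
    r"(?P<text>.+) (?P<code>[0-9A-F]{8}) (?P<state>Asserted|Deasserted)$"
)


def parse_sel(text):
    """Return one dict per line that matches SEL_LINE; other lines are skipped."""
    return [m.groupdict() for m in map(SEL_LINE.match, text.splitlines()) if m]


def summarize(events):
    """Count events by (description, Asserted/Deasserted)."""
    return Counter((e["text"], e["state"]) for e in events)


SAMPLE = """
16. 176 Normal 2023-11-17 Friday 19:18:27 DISK0 Hard disk presence 0D80FFFF Deasserted
17. 177 Major 2023-11-17 Friday 19:18:44 DISK0 In Failed Array 0D06FFFF Asserted
18. 178 Normal 2023-11-17 Friday 19:18:52 DISK0 Hard disk presence 0D00FFFF Asserted
19. 179 Major 2023-11-17 Friday 19:19:02 DISK0 Hard disk drive fault 0D01FFFF Asserted
21. 181 Normal 2023-11-17 Friday 19:19:24 DISK0 Hard disk presence 0D80FFFF Deasserted
22. 182 Normal 2023-11-17 Friday 19:19:27 DISK0 Hard disk presence 0D00FFFF Asserted
"""

events = parse_sel(SAMPLE)
summary = summarize(events)
major = [e for e in events if e["severity"] == "Major"]

for (text, state), count in sorted(summary.items()):
    print(f"{count:2d}  {text} [{state}]")
print("major events:", [e["id"] for e in major])
```

Counting `("DISK0 Hard disk presence", "Deasserted")` over the full log shows how many times the disk dropped out and came back, which a disk that failed once and stayed failed would not do.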

    The logs show that the system went down completely after 19:00 on the 17th. What else can the experts read from these logs?

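    Here is how we handled it: we contacted Huawei after-sales support, who confirmed that the hard disks had failed. The warranty has already expired, so we have to find a third party to deal with the disks ourselves.

    Huawei support told us to follow this procedure:

    This is the procedure document for manually recovering a RAID array when both member disks have failed: https://support.xfusion.com/support/#/zh/docOnline/EDOC1100080944?path=zh-cn_topic_0000001134131193&relationId=EDOC1100080946&mark=40

    This operation carries a risk of data loss, so proceed with caution.

    But my situation is different from the one in that document: none of the options can be clicked.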

  • Original article: https://blog.csdn.net/z13615480737/article/details/134497130