Re: is my power supply dieing?

Author: Brian Cluff via PLUG-discuss
Date:  
To: plug-discuss
CC: Brian Cluff
Subject: Re: is my power supply dieing?
Looks like your drive probably isn't the problem.
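
A minimal sketch of that reading of the smartctl output, assuming the usual ten-column attribute table. The names `WATCHED`, `parse_attributes`, and `suspicious` are hypothetical helpers, not part of smartmontools, and the choice of which attributes to watch is a rule of thumb rather than a vendor specification. The parser tolerates mail quoting and rows that a mail client wrapped onto a second line.

```python
# Hypothetical helper: flag SMART attributes whose raw count hints at media
# or cabling trouble. The WATCHED list is a rule of thumb, not a standard.
WATCHED = (
    "Reallocated_Sector_Ct",
    "Reallocated_Event_Count",
    "Current_Pending_Sector",
    "Offline_Uncorrectable",
    "UDMA_CRC_Error_Count",
)


def parse_attributes(text):
    """Return {attribute_name: raw_value} from `smartctl --all` style text."""
    rows = []
    for line in text.splitlines():
        fields = line.lstrip("> \t").split()  # drop mail-quote markers
        if not fields:
            continue
        # A new row starts with a numeric ID, then a name, then a hex flag.
        if len(fields) >= 3 and fields[0].isdigit() and fields[2].startswith("0x"):
            rows.append(fields)
        elif rows and len(rows[-1]) < 10:
            # Continuation of a row that was wrapped by the mail client.
            rows[-1].extend(fields)
    attrs = {}
    for fields in rows:
        # Column 10 (index 9) is RAW_VALUE in the standard layout.
        if len(fields) >= 10 and fields[9].isdigit():
            attrs[fields[1]] = int(fields[9])
    return attrs


def suspicious(attrs):
    """Return the watched attributes whose raw value is above zero."""
    return {name: attrs[name] for name in WATCHED if attrs.get(name, 0) > 0}


# A few rows copied from the output quoted in this thread.
SAMPLE = """\
> ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE
> UPDATED  WHEN_FAILED RAW_VALUE
>    5 Reallocated_Sector_Ct   0x0033   200   200   140    Pre-fail
> Always       -       0
>    9 Power_On_Hours          0x0032   019   019   000    Old_age
> Always       -       59593
> 197 Current_Pending_Sector  0x0032   200   200   000    Old_age
> Always       -       0
"""

if __name__ == "__main__":
    print(suspicious(parse_attributes(SAMPLE)) or "no watched attribute is above zero")
```

An empty result only means these particular counters are clean. It does not rule out other causes of the freezes.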

Brian Cluff

On 7/16/20 11:54 AM, Michael via PLUG-discuss wrote:
> On Thu, Jul 16, 2020 at 3:37 AM Brian Cluff via PLUG-discuss
> <> wrote:
>> Also check the health of your hard drive with smartctl. I've had
>> systems freeze on me because a hard drive was dying, and then appear to
>> work again for a while after a reboot. Basically, the drive starts
>> throwing errors, the system eventually remounts it read-only, and
>> then the system freaks out and freezes because it can't write to
>> the drive anymore.
> I sure hope this is the problem. It'll give me an excuse to put an ssd
> in the machine.
> This is from live media boot. I'll see if I can freeze the system
> again.... it did when I tried it before but it hasn't frozen for a
> couple of days now.
> sudo smartctl --all /dev/sda
> smartctl 7.1 2019-12-30 r5022 [x86_64-linux-5.4.0-26-generic] (local build)
> Copyright (C) 2002-19, Bruce Allen, Christian Franke, www.smartmontools.org
>
> === START OF INFORMATION SECTION ===
> Model Family:     Western Digital Caviar Blue (SATA)
> Device Model:     WDC WD3200AAKS-75L9A0
> Serial Number:    WD-WMAV2S110171
> LU WWN Device Id: 5 0014ee 057775228
> Firmware Version: 02.03E02
> User Capacity:    320,072,933,376 bytes [320 GB]
> Sector Size:      512 bytes logical/physical
> Device is:        In smartctl database [for details use: -P show]
> ATA Version is:   ATA8-ACS (minor revision not indicated)
> SATA Version is:  SATA 2.5, 3.0 Gb/s
> Local Time is:    Thu Jul 16 18:33:20 2020 UTC
> SMART support is: Available - device has SMART capability.
> SMART support is: Enabled

>
> === START OF READ SMART DATA SECTION ===
> SMART overall-health self-assessment test result: PASSED
>
> General SMART Values:
> Offline data collection status:  (0x84)    Offline data collection activity
>                      was suspended by an interrupting command from host.
>                      Auto Offline Data Collection: Enabled.
> Self-test execution status:      (   0)    The previous self-test routine completed
>                      without error or no self-test has ever
>                      been run.
> Total time to complete Offline
> data collection:         ( 6000) seconds.
> Offline data collection
> capabilities:              (0x7b) SMART execute Offline immediate.
>                      Auto Offline data collection on/off support.
>                      Suspend Offline collection upon new
>                      command.
>                      Offline surface scan supported.
>                      Self-test supported.
>                      Conveyance Self-test supported.
>                      Selective Self-test supported.
> SMART capabilities:            (0x0003)    Saves SMART data before entering
>                      power-saving mode.
>                      Supports SMART auto save timer.
> Error logging capability:        (0x01)    Error logging supported.
>                      General Purpose Logging supported.
> Short self-test routine
> recommended polling time:      (   2) minutes.
> Extended self-test routine
> recommended polling time:      (  73) minutes.
> Conveyance self-test routine
> recommended polling time:      (   5) minutes.
> SCT capabilities:            (0x3037)    SCT Status supported.
>                      SCT Feature Control supported.
>                      SCT Data Table supported.

>
> SMART Attributes Data Structure revision number: 16
> Vendor Specific SMART Attributes with Thresholds:
> ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
>   1 Raw_Read_Error_Rate     0x002f   200   200   051    Pre-fail  Always       -       0
>   3 Spin_Up_Time            0x0027   138   133   021    Pre-fail  Always       -       4075
>   4 Start_Stop_Count        0x0032   089   089   000    Old_age   Always       -       11090
>   5 Reallocated_Sector_Ct   0x0033   200   200   140    Pre-fail  Always       -       0
>   7 Seek_Error_Rate         0x002e   100   253   000    Old_age   Always       -       0
>   9 Power_On_Hours          0x0032   019   019   000    Old_age   Always       -       59593
>  10 Spin_Retry_Count        0x0032   100   100   000    Old_age   Always       -       0
>  11 Calibration_Retry_Count 0x0032   100   100   000    Old_age   Always       -       0
>  12 Power_Cycle_Count       0x0032   098   098   000    Old_age   Always       -       2282
> 192 Power-Off_Retract_Count 0x0032   199   199   000    Old_age   Always       -       1044
> 193 Load_Cycle_Count        0x0032   197   197   000    Old_age   Always       -       11090
> 194 Temperature_Celsius     0x0022   105   080   000    Old_age   Always       -       38
> 196 Reallocated_Event_Count 0x0032   200   200   000    Old_age   Always       -       0
> 197 Current_Pending_Sector  0x0032   200   200   000    Old_age   Always       -       0
> 198 Offline_Uncorrectable   0x0030   200   200   000    Old_age   Offline      -       0
> 199 UDMA_CRC_Error_Count    0x0032   200   200   000    Old_age   Always       -       0
> 200 Multi_Zone_Error_Rate   0x0008   200   200   000    Old_age   Offline      -       0
> 240 Head_Flying_Hours       0x0032   024   024   000    Old_age   Always       -       55990
> 241 Total_LBAs_Written      0x0032   200   200   000    Old_age   Always       -       78243499218
> 242 Total_LBAs_Read         0x0032   200   200   000    Old_age   Always       -       174621575474
>
> SMART Error Log Version: 1
> No Errors Logged
>
> SMART Self-test log structure revision number 1
> Num  Test_Description    Status                  Remaining  LifeTime(hours)  LBA_of_first_error
> # 1  Short offline       Completed without error       00%     14352         -
> # 2  Short offline       Completed without error       00%         0         -
>
> SMART Selective self-test log data structure revision number 1
>   SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
>      1        0        0  Not_testing
>      2        0        0  Not_testing
>      3        0        0  Not_testing
>      4        0        0  Not_testing
>      5        0        0  Not_testing
> Selective self-test flags (0x0):
>    After scanning selected spans, do NOT read-scan remainder of disk.
> If Selective self-test is pending on power-up, resume after 0 minute delay.

>
> Well, I have been running glmark2, watch, and stress-ng for about one
> round of glmark2 with no freeze, all from live media.
> --
> :-)~MIKE~(-:
> ---------------------------------------------------
> PLUG-discuss mailing list -
> To subscribe, unsubscribe, or to change your mail settings:
> https://lists.phxlinux.org/mailman/listinfo/plug-discuss

