Install psensor and watch if something is getting hot consistently.  I did this a lot trying to figure out what was causing me grief.

This is where your friend said "quit whining and go buy a new pc".  You can chase weird instability and countless hours spent in doing so, or you just cut your losses and buy something with a warranty.  Swap parts if it remains unstable, or get weary enough to move on.  At some point time is money, and you need to move on, at least it is for folks that make money doing this for a living.

I like to tinker, but not to the point I wake up cursing the damn thing.  Reason my desktop is sitting powered off at the moment and i use my laptop.

-mb


On Thu, Jul 16, 2020 at 4:50 PM Michael via PLUG-discuss <plug-discuss@lists.phxlinux.org> wrote:
there is a fan on it and it maxes at 73 with the blue kitty in glmark2
but usually stays below 68

On Thu, Jul 16, 2020 at 5:48 PM Brian Cluff via PLUG-discuss

<plug-discuss@lists.phxlinux.org> wrote:
>
> Hows the temperature of your video card?  Does it have a fan on it?
>
> Brian
>
> On 7/16/20 1:41 PM, Michael via PLUG-discuss wrote:
>
>
> Might the problem still be the he
>
> On Thu, Jul 16, 2020 at 4:37 PM Stephen Partington via PLUG-discuss <plug-discuss@lists.phxlinux.org> wrote:
>>
>> 68C under load is good. 80C+ idle is bad.
>>
>>
>> On Thu, Jul 16, 2020 at 12:35 PM Michael via PLUG-discuss <plug-discuss@lists.phxlinux.org> wrote:
>>>
>>> lets get back to heat... last time I was running glx (with the horse
>>> and various other pictures) and watch, and stress ng the temp was like
>>> 68. I don't think that is too bad.
>>>
>>> On Thu, Jul 16, 2020 at 3:19 PM Brian Cluff via PLUG-discuss
>>> <plug-discuss@lists.phxlinux.org> wrote:
>>> >
>>> > Looks like your drive probably isn't the problem.
>>> >
>>> > Brian Cluff
>>> >
>>> > On 7/16/20 11:54 AM, Michael via PLUG-discuss wrote:
>>> > > On Thu, Jul 16, 2020 at 3:37 AM Brian Cluff via PLUG-discuss
>>> > > <plug-discuss@lists.phxlinux.org> wrote:
>>> > >> Also check the health of your hard drive with smartctl. I've hard
>>> > >> systems freeze on me because my hard drive was dying that then appear to
>>> > >> work again for a while after a reboot.  Basically your drive starts
>>> > >> throwing errors and eventually the system remounts it as read only and
>>> > >> your system eventually freaks out and freezes because it can't write to
>>> > >> the drive anymore.
>>> > > I sure hope this is the problem. It'll give me an excuse to put an ssd
>>> > > in the machine.
>>> > > This is from live media boot. I'll see if I can freeze the system
>>> > > again.... it did when I tried it before but it hasn't frozen for a
>>> > > couple of days now.
>>> > >   sudo smartctl --all /dev/sda
>>> > > smartctl 7.1 2019-12-30 r5022 [x86_64-linux-5.4.0-26-generic] (local build)
>>> > > Copyright (C) 2002-19, Bruce Allen, Christian Franke, www.smartmontools.org
>>> > >
>>> > > === START OF INFORMATION SECTION ===
>>> > > Model Family:     Western Digital Caviar Blue (SATA)
>>> > > Device Model:     WDC WD3200AAKS-75L9A0
>>> > > Serial Number:    WD-WMAV2S110171
>>> > > LU WWN Device Id: 5 0014ee 057775228
>>> > > Firmware Version: 02.03E02
>>> > > User Capacity:    320,072,933,376 bytes [320 GB]
>>> > > Sector Size:      512 bytes logical/physical
>>> > > Device is:        In smartctl database [for details use: -P show]
>>> > > ATA Version is:   ATA8-ACS (minor revision not indicated)
>>> > > SATA Version is:  SATA 2.5, 3.0 Gb/s
>>> > > Local Time is:    Thu Jul 16 18:33:20 2020 UTC
>>> > > SMART support is: Available - device has SMART capability.
>>> > > SMART support is: Enabled
>>> > >
>>> > > === START OF READ SMART DATA SECTION ===
>>> > > SMART overall-health self-assessment test result: PASSED
>>> > >
>>> > > General SMART Values:
>>> > > Offline data collection status:  (0x84)    Offline data collection activity
>>> > >                      was suspended by an interrupting command from host.
>>> > >                      Auto Offline Data Collection: Enabled.
>>> > > Self-test execution status:      (   0)    The previous self-test
>>> > > routine completed
>>> > >                      without error or no self-test has ever
>>> > >                      been run.
>>> > > Total time to complete Offline
>>> > > data collection:         ( 6000) seconds.
>>> > > Offline data collection
>>> > > capabilities:              (0x7b) SMART execute Offline immediate.
>>> > >                      Auto Offline data collection on/off support.
>>> > >                      Suspend Offline collection upon new
>>> > >                      command.
>>> > >                      Offline surface scan supported.
>>> > >                      Self-test supported.
>>> > >                      Conveyance Self-test supported.
>>> > >                      Selective Self-test supported.
>>> > > SMART capabilities:            (0x0003)    Saves SMART data before entering
>>> > >                      power-saving mode.
>>> > >                      Supports SMART auto save timer.
>>> > > Error logging capability:        (0x01)    Error logging supported.
>>> > >                      General Purpose Logging supported.
>>> > > Short self-test routine
>>> > > recommended polling time:      (   2) minutes.
>>> > > Extended self-test routine
>>> > > recommended polling time:      (  73) minutes.
>>> > > Conveyance self-test routine
>>> > > recommended polling time:      (   5) minutes.
>>> > > SCT capabilities:            (0x3037)    SCT Status supported.
>>> > >                      SCT Feature Control supported.
>>> > >                      SCT Data Table supported.
>>> > >
>>> > > SMART Attributes Data Structure revision number: 16
>>> > > Vendor Specific SMART Attributes with Thresholds:
>>> > > ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE
>>> > > UPDATED  WHEN_FAILED RAW_VALUE
>>> > >    1 Raw_Read_Error_Rate     0x002f   200   200   051    Pre-fail
>>> > > Always       -       0
>>> > >    3 Spin_Up_Time            0x0027   138   133   021    Pre-fail
>>> > > Always       -       4075
>>> > >    4 Start_Stop_Count        0x0032   089   089   000    Old_age
>>> > > Always       -       11090
>>> > >    5 Reallocated_Sector_Ct   0x0033   200   200   140    Pre-fail
>>> > > Always       -       0
>>> > >    7 Seek_Error_Rate         0x002e   100   253   000    Old_age
>>> > > Always       -       0
>>> > >    9 Power_On_Hours          0x0032   019   019   000    Old_age
>>> > > Always       -       59593
>>> > >   10 Spin_Retry_Count        0x0032   100   100   000    Old_age
>>> > > Always       -       0
>>> > >   11 Calibration_Retry_Count 0x0032   100   100   000    Old_age
>>> > > Always       -       0
>>> > >   12 Power_Cycle_Count       0x0032   098   098   000    Old_age
>>> > > Always       -       2282
>>> > > 192 Power-Off_Retract_Count 0x0032   199   199   000    Old_age
>>> > > Always       -       1044
>>> > > 193 Load_Cycle_Count        0x0032   197   197   000    Old_age
>>> > > Always       -       11090
>>> > > 194 Temperature_Celsius     0x0022   105   080   000    Old_age
>>> > > Always       -       38
>>> > > 196 Reallocated_Event_Count 0x0032   200   200   000    Old_age
>>> > > Always       -       0
>>> > > 197 Current_Pending_Sector  0x0032   200   200   000    Old_age
>>> > > Always       -       0
>>> > > 198 Offline_Uncorrectable   0x0030   200   200   000    Old_age
>>> > > Offline      -       0
>>> > > 199 UDMA_CRC_Error_Count    0x0032   200   200   000    Old_age
>>> > > Always       -       0
>>> > > 200 Multi_Zone_Error_Rate   0x0008   200   200   000    Old_age
>>> > > Offline      -       0
>>> > > 240 Head_Flying_Hours       0x0032   024   024   000    Old_age
>>> > > Always       -       55990
>>> > > 241 Total_LBAs_Written      0x0032   200   200   000    Old_age
>>> > > Always       -       78243499218
>>> > > 242 Total_LBAs_Read         0x0032   200   200   000    Old_age
>>> > > Always       -       174621575474
>>> > >
>>> > > SMART Error Log Version: 1
>>> > > No Errors Logged
>>> > >
>>> > > SMART Self-test log structure revision number 1
>>> > > Num  Test_Description    Status                  Remaining
>>> > > LifeTime(hours)  LBA_of_first_error
>>> > > # 1  Short offline       Completed without error       00%     14352         -
>>> > > # 2  Short offline       Completed without error       00%         0         -
>>> > >
>>> > > SMART Selective self-test log data structure revision number 1
>>> > >   SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
>>> > >      1        0        0  Not_testing
>>> > >      2        0        0  Not_testing
>>> > >      3        0        0  Not_testing
>>> > >      4        0        0  Not_testing
>>> > >      5        0        0  Not_testing
>>> > > Selective self-test flags (0x0):
>>> > >    After scanning selected spans, do NOT read-scan remainder of disk.
>>> > > If Selective self-test is pending on power-up, resume after 0 minute delay.
>>> > >
>>> > > Well, I am running glmark2, watch, and stress-ng for about (one round
>>> > > of glmark2) with no freeze.  all from live media
>>> > > --
>>> > > :-)~MIKE~(-:
>>> > > ---------------------------------------------------
>>> > > PLUG-discuss mailing list - PLUG-discuss@lists.phxlinux.org
>>> > > To subscribe, unsubscribe, or to change your mail settings:
>>> > > https://lists.phxlinux.org/mailman/listinfo/plug-discuss
>>> >
>>> > ---------------------------------------------------
>>> > PLUG-discuss mailing list - PLUG-discuss@lists.phxlinux.org
>>> > To subscribe, unsubscribe, or to change your mail settings:
>>> > https://lists.phxlinux.org/mailman/listinfo/plug-discuss
>>>
>>>
>>>
>>> --
>>> :-)~MIKE~(-:
>>> ---------------------------------------------------
>>> PLUG-discuss mailing list - PLUG-discuss@lists.phxlinux.org
>>> To subscribe, unsubscribe, or to change your mail settings:
>>> https://lists.phxlinux.org/mailman/listinfo/plug-discuss
>>
>>
>>
>> --
>> A mouse trap, placed on top of your alarm clock, will prevent you from rolling over and going back to sleep after you hit the snooze button.
>>
>> Stephen
>>
>> ---------------------------------------------------
>> PLUG-discuss mailing list - PLUG-discuss@lists.phxlinux.org
>> To subscribe, unsubscribe, or to change your mail settings:
>> https://lists.phxlinux.org/mailman/listinfo/plug-discuss
>
> --
> :-)~MIKE~(-:
>
> ---------------------------------------------------
> PLUG-discuss mailing list - PLUG-discuss@lists.phxlinux.org
> To subscribe, unsubscribe, or to change your mail settings:
> https://lists.phxlinux.org/mailman/listinfo/plug-discuss
>
>
> ---------------------------------------------------
> PLUG-discuss mailing list - PLUG-discuss@lists.phxlinux.org
> To subscribe, unsubscribe, or to change your mail settings:
> https://lists.phxlinux.org/mailman/listinfo/plug-discuss



--
:-)~MIKE~(-:
---------------------------------------------------
PLUG-discuss mailing list - PLUG-discuss@lists.phxlinux.org
To subscribe, unsubscribe, or to change your mail settings:
https://lists.phxlinux.org/mailman/listinfo/plug-discuss