Print

Print


Quyen,

I will try swapping out the memory and power supply and see if that changes anything.

Thanks!

On Thu, May 30, 2019 at 2:05 PM Quyen Hoang <[log in to unmask]> wrote:
Hi Abhiram,
We had a workstation behaving similarly couple years ago.
It turned to be a bad power supply.
Like yours, it worked fine most of the time, but shut down when running a certain program under full load.
First we changed the motherboard, swapped video cards, RAMs, and finally the power supply.
We have used power supplies from a few different brands and have been happy with Corsair.

Cheers,
Quyen

Quyen Hoang, PhD
Associate Professor
Biochemistry and Molecular Biology
Adjunct Associate Professor of Neurology
Primary Investigator of the Stark Neuroscience Research Institute
Indiana University School of Medicine
635 Barnhill Drive, MS0013C
Indianapolis, IN, 46202
(317)274-4371

On May 30, 2019, at 4:39 PM, Abhiram Chintangal <[log in to unmask]> wrote:

Clara, 

In my case, the system just powers off. So I am quite puzzled. 

I hooked up a watt-meter to the machine to see if I am stressing the power supply, but it doesn’t look like it. 

I am yet to check if there is a new bios for the board( Gigabyte X399 Aorus Extreme) 

Anything specific to PCie settings that I should be watching out for?

Thanks!

Abhiram 




On Thu, May 30, 2019 at 1:22 PM Dr. Clara Cai <[log in to unmask]> wrote:
You mean the operation system crashes or RELION is crashing? 
Have you checked if the motherboard BIOS has a new version? Pay attention to the memory and PCIe settings especially. Good luck!


On Thu, May 30, 2019 at 11:48 AM Abhiram Chintangal <[log in to unmask]> wrote:
Hey all, 

I am troubleshooting one of our new thread ripper based workstations which crashes when using all four GPU's with Relion. The machine is running a thread-ripper 2990x with 4x2080ti's and its powered by a 1500W power supply. 

Anyone here run into something similar? 

Initially I was under the impression that its a power-supply problem, but the machine seems quite stable when running gpu/cpu bechmarks together that saturate all the cores and gpu's at the same time.

To rule out memory, I also ran memtest86 on the machine. 

Any ideas are appreciated? 

Thanks! 

--
Abhiram Chintangal
QB3 Nogales Lab 
Bioinformatics Specialist @ Howard Hughes Medical Institute
University of California Berkeley 
708D Stanley Hall, Berkeley, CA 94720
Phone (510)666-3344


To unsubscribe from the CCPEM list, click the following link:
https://www.jiscmail.ac.uk/cgi-bin/webadmin?SUBED1=CCPEM&A=1

--
Abhiram Chintangal
QB3 Nogales Lab
Bioinformatics Specialist @ Howard Hughes Medical Institute
University of California at Berkeley
708D Stanley Hall, Berkeley, CA 94720
Phone (510) 666-3344


To unsubscribe from the CCPEM list, click the following link:
https://www.jiscmail.ac.uk/cgi-bin/webadmin?SUBED1=CCPEM&A=1




--
Abhiram Chintangal
QB3 Nogales Lab 
Bioinformatics Specialist @ Howard Hughes Medical Institute
University of California Berkeley 
708D Stanley Hall, Berkeley, CA 94720
Phone (510)666-3344


To unsubscribe from the CCPEM list, click the following link:
https://www.jiscmail.ac.uk/cgi-bin/webadmin?SUBED1=CCPEM&A=1