On 8/6/19 4:32 PM, Maarten Litmaath wrote:
> Maybe a local/regional/... network issue that affected your site?
Thank you Maarten,
I ask the CCIN2P3 who deals with our network provider (RENATER) and
had confirmation that 2 relatively short network cuts occured at the
same time :
Tue Aug 6 15:44:35 2019 alarm Ping Nantes-Cisco down
Tue Aug 6 15:48:43 2019 alarm Ping Nantes-Cisco up
Tue Aug 6 16:00:27 2019 alarm Ping Nantes-Cisco down
Tue Aug 6 16:01:43 2019 alarm Ping Nantes-Cisco up
This was enough to create I/O errors in cvmfs on worker nodes,
probably because some data transfers were aborted by our squids
as the logs show. We have nagios probes on the worker nodes
detecting this.
JM
--
------------------------------------------------------------------------
Jean-michel BARBET | Tel: +33 (0)2 51 85 84 86
Laboratoire SUBATECH Nantes France | Fax: +33 (0)2 51 85 84 79
CNRS-IN2P3/IMT-Atlantique/Univ.Nantes | E-Mail: [log in to unmask]
------------------------------------------------------------------------
########################################################################
To unsubscribe from the LCG-ROLLOUT list, click the following link:
https://www.jiscmail.ac.uk/cgi-bin/webadmin?SUBED1=LCG-ROLLOUT&A=1
|