OVHcloud Network Status

Current status
Legend
  • Operational
  • Degraded performance
  • Partial Outage
  • Major Outage
  • Under maintenance
FS#8232 — 5.135.136.0/24 and 176.31.224.0/24
Incident Report for Network & Infrastructure
Resolved
We have a weird error on pairs of N5:

Compatibility check is done:
Module bootable Impact Install-type Reason
------ -------- -------------- ------------ ------
1 yes disruptive reset SFP uController needs upgrade
100 yes disruptive reset SFP uController needs upgrade
101 yes disruptive reset SFP uController needs upgrade
102 yes disruptive reset SFP uController needs upgrade
103 yes disruptive reset SFP uController needs upgrade
104 yes disruptive reset SFP uController needs upgrade
105 yes disruptive reset SFP uController needs upgrade
106 yes disruptive reset SFP uController needs upgrade
107 yes disruptive reset SFP uController needs upgrade
108 yes disruptive reset SFP uController needs upgrade
109 yes disruptive reset SFP uController needs upgrade
111 yes disruptive reset SFP uController needs upgrade



Update(s):

Date: 2013-03-13 11:45:15 UTC
The FEX is online again.

Date: 2013-03-13 11:44:56 UTC
One of the FEX is not recognised.

We are reloading it.

--- -------- Offline N2K-C2248TP-E-1GE

Date: 2013-03-13 06:01:58 UTC
5.135.136.0/24: done
176.31.238.0/24: done

Date: 2013-03-13 06:01:48 UTC
The A is back, everything is alright, and now we are updating the B quickly.

Date: 2013-03-13 06:00:54 UTC
done

sw.176.31.238.248#
Broadcast message from root (console) (Tue Mar 12 21:17:01 2013):

The system is going down for reboot NOW!

Date: 2013-03-13 06:00:41 UTC
It goes well. Then, we will try to make the A crash:

sw.176.31.238.248# show inter trans detail
Ethernet1/1
transceiver is present

Date: 2013-03-13 05:59:53 UTC
It seems togo well.
If so, on 176.31.238.0/24 we will try to make the A crash in the same way and upgrade the FEX by B.
It does not cause failure.

Date: 2013-03-13 05:57:43 UTC
It goes well, the B updated the FEX.

Date: 2013-03-13 05:57:19 UTC
We are updating the B.

Date: 2013-03-13 05:56:59 UTC
Reason: Reset triggered due to HA policy of Reset
System version: 5.2(1)N1(2a)
Service: ethpc hap reset

Date: 2013-03-13 05:56:47 UTC
On 5.135.136.0/24 we typed a command and the switch A has crashed.. !?

sw.5.135.136.249# show inter trans detail
sw.5.135.136.249# show inter trans detail
Ethernet1/1
transceiver is present
...
Ethernet1/22
transceiver is not present

error sending request
unknown error 0x20
--More--
Broadcast message from root (console) (Tue Mar 12 20:52:08 2013):

The system is going down for reboot NOW!
Connection closed by foreign host.
Posted Mar 13, 2013 - 05:55 UTC