Thursday, 2020-12-31

*** akrpan-pure has quit IRC00:07
*** tosky has quit IRC00:16
*** DSpider has quit IRC01:53
*** sshnaidm|rover has quit IRC01:59
*** sshnaidm has joined #opendev02:00
openstackgerritMatthew Thode proposed opendev/glean master: add hosts entries and ssh keys only once  https://review.opendev.org/c/opendev/glean/+/76878502:15
*** ysandeep|away is now known as ysandeep|ruck03:41
*** whoami-rajat__ has joined #opendev04:36
prometheanfireman mock feels so arcane04:55
*** ykarel has joined #opendev05:12
prometheanfireis there a reason we don't use magicmock?05:46
prometheanfireiirc I submitted a patch a while ago that was rejected because of it05:46
prometheanfiremock's open does not use iter05:46
openstackgerritMatthew Thode proposed opendev/glean master: add hosts entries and ssh keys only once  https://review.opendev.org/c/opendev/glean/+/76878506:01
prometheanfirewell, I'll let someone tell me not to use it (and suggest something themselves)06:01
*** ykarel_ has joined #opendev06:21
*** ykarel has quit IRC06:23
*** ykarel_ is now known as ykarel07:36
*** slaweq has joined #opendev08:05
*** slaweq has quit IRC08:40
*** ykarel is now known as ykarel|lunch09:08
*** DSpider has joined #opendev09:15
*** zbr has quit IRC10:02
*** zbr has joined #opendev10:02
*** ozzzo has quit IRC10:03
*** ssbarnea has joined #opendev10:06
*** zbr has quit IRC10:07
*** zbr has joined #opendev10:08
*** zbr has quit IRC10:08
*** zbr has joined #opendev10:10
*** brinzhang0 has quit IRC10:29
*** tosky has joined #opendev10:50
*** ykarel|lunch is now known as ykarel11:13
*** ssbarnea has quit IRC11:43
*** ysandeep|ruck is now known as ysandeep|away12:35
*** ozzzo has joined #opendev14:36
openstackgerritMatthew Thode proposed opendev/glean master: add hosts entries and ssh keys only once  https://review.opendev.org/c/opendev/glean/+/76878515:06
*** whoami-rajat__ has quit IRC15:45
fungiseeing an unusual level of post_failure results with no manifest uploaded16:35
fungii'll see if i can get something from the executor logs16:35
prometheanfirethanks16:41
* prometheanfire smashes the recheck button16:41
fungiconnection timeouts hitting https://auth.cloud.ovh.net/16:42
fungiyeah, i can't reach that16:43
fungiconnection timeouts from anywhere i've tried, so the problem doesn't seem to be localized16:46
prometheanfirehere too16:48
fungiwow, my traceroute to it from the east coast of north america hops through sydney au16:50
prometheanfireheh, someone advertise something they shouldn't?16:52
fungimebbe16:53
fungiand yeah, i can't connect to ovh with openstackclient16:53
prometheanfireya, I went through au as well, from dfw to iad to lax to syd16:53
prometheanfiredid end up at the target I think16:54
prometheanfire16  ip247.ip-51-68-65.eu (51.68.65.247)  214.068 ms  214.480 ms  214.411 ms16:54
fungimore or less same here... dc to lax to syd16:54
fungiyup, the ip address in dns for it responds to icmp echo16:54
prometheanfireheh, neat16:54
fungiand to udp traceroute probes16:54
fungijust not on 443/tcp16:54
fungii'm waffling on emergency removing all our ovh regions from log uploads temporarily to at least keep builds from failing16:55
prometheanfireasked in #ovh16:56
prometheanfirenot sure if we have someone specific to reach out to16:56
*** hamalq has joined #opendev17:00
fungiearliest occurrence i find in our executor logs is 16:10 utc, so it's been down almost an hour at this point17:01
fungii can only seem to find public status/maintenance details for ovh cloud usa. i'll see if logging into the dashboard gives me any more useful info17:02
fungihttp://travaux.ovh.net/17:04
fungisyd<>lax backbone incident in progress since 17:36 (i guess that's cest?)17:05
fungiour overall ci system utilization is very low at the moment, so i'm tempted to just leave it and see if they resolve the problem soon17:06
fungithe incident details indicate " We have currently a saturation on this link , we are cheking to offload some traffic"17:08
prometheanfiresome stuff here17:10
prometheanfirehttps://twitter.com/OVH_Status17:10
fungiyep, okay, so same stuff17:11
fungii guess i'll give them a little longer before i force in an emergency base-jobs change to stop uploading there17:11
prometheanfireack17:12
fungiif this were a typical thursday i would have just done it17:14
fungiooh, i just got a response out of the api17:21
fungiit's working fairly reliably now17:22
fungii'll give it a few more minutes of stability before i get my hopes up too much17:22
prometheanfirewise17:24
fungibut yeah, traceroute's still going through their sydney peer17:24
fungistill seems to be working17:47
fungi17:05:22 was the last post_failure for that17:51
fungisurveying all the executor logs17:51
fungionly 23 builds were impacted, i think17:54
fungibut i'll status notice it just in case17:54
*** ykarel has quit IRC17:56
*** hamalq has quit IRC18:04
*** hamalq has joined #opendev18:16
*** auristor has quit IRC18:19
*** hamalq has quit IRC18:20
*** auristor has joined #opendev18:20
fungilooks like they closed the incident out at 19:05 cest, so ~40 minutes ago18:43
fungi#status notice An OVH network outage caused Zuul to report POST_FAILURE results with no summary/logs for some builds completing between 16:10 and 17:06 UTC; these should be safe to recheck now18:43
openstackstatusfungi: sending notice18:43
-openstackstatus- NOTICE: An OVH network outage caused Zuul to report POST_FAILURE results with no summary/logs for some builds completing between 16:10 and 17:06 UTC; these should be safe to recheck now18:43
openstackstatusfungi: finished sending notice18:46
*** auristor has quit IRC19:48

Generated by irclog2html.py 2.17.2 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!