Monday, 2019-07-08

*** jamesmcarthur has joined #openstack-infra00:01
*** cgoncalves has joined #openstack-infra00:09
*** rcernin has joined #openstack-infra00:13
*** jamesmcarthur has quit IRC00:23
*** trident has joined #openstack-infra00:33
*** jamesmcarthur has joined #openstack-infra00:47
*** jamesmcarthur has quit IRC01:00
*** jamesmcarthur has joined #openstack-infra01:02
*** imacdonn has quit IRC01:13
*** imacdonn has joined #openstack-infra01:13
*** slaweq has joined #openstack-infra01:14
*** jamesmcarthur has quit IRC01:17
*** slaweq has quit IRC01:24
*** altlogbot_2 has quit IRC01:28
*** altlogbot_0 has joined #openstack-infra01:29
*** jamesmcarthur has joined #openstack-infra01:45
*** jamesmcarthur has quit IRC01:51
*** apetrich has quit IRC01:58
*** jamesmcarthur has joined #openstack-infra02:05
*** armax has quit IRC02:05
*** rfolco has quit IRC02:09
*** bobh has joined #openstack-infra02:12
*** yamamoto has joined #openstack-infra02:16
*** slaweq has joined #openstack-infra02:16
*** bobh has quit IRC02:23
*** slaweq has quit IRC02:24
*** bhavikdbavishi has joined #openstack-infra02:25
*** bobh has joined #openstack-infra02:26
*** bhavikdbavishi1 has joined #openstack-infra02:28
*** bhavikdbavishi has quit IRC02:30
*** bhavikdbavishi1 is now known as bhavikdbavishi02:30
*** hwoarang has quit IRC02:36
*** hwoarang has joined #openstack-infra02:38
*** rkukura has joined #openstack-infra02:40
*** bhavikdbavishi has quit IRC02:59
*** slaweq has joined #openstack-infra03:15
*** yamamoto has quit IRC03:18
*** slaweq has quit IRC03:24
*** jamesmcarthur has quit IRC03:25
*** psachin has joined #openstack-infra03:27
*** rh-jelabarre has joined #openstack-infra03:48
*** zhurong has quit IRC03:49
*** ykarel has joined #openstack-infra03:49
*** bobh has quit IRC03:53
*** bobh has joined #openstack-infra03:58
*** bobh has quit IRC04:02
*** udesale has joined #openstack-infra04:09
*** bhavikdbavishi has joined #openstack-infra04:10
*** slaweq has joined #openstack-infra04:11
*** factor has quit IRC04:13
*** factor has joined #openstack-infra04:14
*** SotK has quit IRC04:14
*** SotK has joined #openstack-infra04:15
*** Teolupus has joined #openstack-infra04:16
*** slaweq has quit IRC04:24
*** rh-jelabarre has quit IRC04:29
TeolupusOut of curiosity, why on Neutron changes like this one https://review.opendev.org/#/c/162088/ I can see the third party vendors' CI checks, whereas in more recent changes like https://review.opendev.org/#/c/414251/ I only see their CI systems in the reviewers list? Are third party vendors still testing every code change to Neutron on their non-voting CIs and feeding that back upstream?04:33
*** ykarel has quit IRC04:53
*** bobh has joined #openstack-infra05:09
*** slaweq has joined #openstack-infra05:11
*** bobh has quit IRC05:13
AJaeger_Teolupus: I see in 414251 the CI checks as well...05:17
*** AJaeger_ is now known as AJaeger05:17
*** ykarel has joined #openstack-infra05:17
AJaegerTeolupus: Those are still reporting. Btw. 414251 is one year old, look at newer ones. To show up in the list of reviews, the format needs to follow what we parse, if a CI system invents a new format, bad luck ;(05:19
*** ociuhandu has joined #openstack-infra05:20
openstackgerritIan Wienand proposed opendev/system-config master: Add letsencrypt documentation  https://review.opendev.org/66960205:21
*** slaweq has quit IRC05:25
*** ociuhandu has quit IRC05:27
*** ociuhandu_ has joined #openstack-infra05:27
*** whoami-rajat has joined #openstack-infra05:30
*** ociuhandu_ has quit IRC05:33
*** Tengu has quit IRC05:38
*** Tengu has joined #openstack-infra05:43
*** raukadah is now known as chandankumar05:51
AJaegerconfig-core, could you review the stack at https://review.opendev.org/#/c/668955 for testing of zuul-jobs, please?05:52
*** ccamacho has joined #openstack-infra06:00
*** slaweq has joined #openstack-infra06:11
*** pcaruana has joined #openstack-infra06:12
*** jbadiapa has joined #openstack-infra06:19
*** slaweq has quit IRC06:26
*** xek_ has joined #openstack-infra06:30
*** slaweq has joined #openstack-infra06:39
*** ginopc has joined #openstack-infra06:55
*** Teolupus has quit IRC07:01
*** iurygregory has joined #openstack-infra07:07
*** rcernin has quit IRC07:12
*** pgaxatte has joined #openstack-infra07:14
*** ricolin has joined #openstack-infra07:18
*** psachin has quit IRC07:19
*** sshnaidm|ruck is now known as sshnaidm|afk07:21
*** witek has joined #openstack-infra07:23
*** apetrich has joined #openstack-infra07:32
*** kjackal has joined #openstack-infra07:33
*** iokiwi has quit IRC07:38
*** adriant has quit IRC07:38
*** lucasagomes has joined #openstack-infra07:39
*** ykarel is now known as ykarel|lunch07:44
*** ociuhandu has joined #openstack-infra07:45
*** ralonsoh has joined #openstack-infra07:55
*** dchen has quit IRC07:57
*** iurygregory has quit IRC07:57
*** bobh has joined #openstack-infra08:02
*** pkopec has joined #openstack-infra08:03
*** rpittau|afk is now known as rpittau08:03
*** bobh has quit IRC08:06
openstackgerritMerged zuul/zuul master: Fix multi-tenant caching of extra config files  https://review.opendev.org/66900808:06
openstackgerritMerged opendev/irc-meetings master: Add Image Encryption Popup-Team Meeting  https://review.opendev.org/66915108:08
*** tosky has joined #openstack-infra08:11
*** iokiwi has joined #openstack-infra08:20
*** iurygregory has joined #openstack-infra08:24
*** AJaeger has quit IRC08:26
*** psachin has joined #openstack-infra08:33
*** sshnaidm|afk is now known as sshnaidm|ruck08:38
*** ykarel|lunch is now known as ykarel08:42
*** iurygregory has quit IRC08:51
*** derekh has joined #openstack-infra08:54
*** iurygregory has joined #openstack-infra08:56
*** lpetrut has joined #openstack-infra09:00
*** dayou_ has quit IRC09:01
*** dayou_ has joined #openstack-infra09:01
*** Lucas_Gray has joined #openstack-infra09:05
*** priteau has joined #openstack-infra09:08
*** ociuhandu has quit IRC09:11
*** bobh has joined #openstack-infra09:17
*** bobh has quit IRC09:21
*** ykarel is now known as ykarel|meeting09:31
*** pkopec has quit IRC09:34
*** AJaeger has joined #openstack-infra09:40
*** panda is now known as panda|bbl09:48
*** ociuhandu has joined #openstack-infra09:48
*** electrofelix has joined #openstack-infra10:29
*** pkopec has joined #openstack-infra10:29
*** ykarel_ has joined #openstack-infra10:31
*** pkopec has quit IRC10:31
*** panda|bbl has quit IRC10:33
iceyis it possible to get zuul to land a change on a retired project? It doesn't seem to be showing up in gate jobs? https://review.opendev.org/#/c/669618/10:33
*** ykarel|meeting has quit IRC10:33
*** panda has joined #openstack-infra10:37
*** bhavikdbavishi has quit IRC10:38
AJaegericey: without jobs configured, it is not possible - and we have ACLs as readonly, so nobody has access to +1/+2A10:38
AJaegericey: so, add noop-jobs in project-config...10:39
AJaegericey: and add charm-ceph to zuul/main.yaml and update ACLs - and then revert back...10:39
iceyAJaeger: that's a bit heavy handed seeming to clean up the old, retired project :-P10:40
openstackgerritChris MacNaughton (icey) proposed openstack/project-config master: Add charm-ceph to allow cleaning up retired project  https://review.opendev.org/66963210:42
*** priteau has quit IRC10:42
openstackgerritChris MacNaughton (icey) proposed openstack/project-config master: Remove charm-ceph as it's a retired project  https://review.opendev.org/66963310:43
iceyAJaeger: something like that, but with a `recheck` on the retired project between landing?10:43
openstackgerritChris MacNaughton (icey) proposed openstack/project-config master: Remove charm-ceph as it's a retired project  https://review.opendev.org/66963310:44
icey(updated with a depends on to the desired change)10:44
AJaegericey: yes, recheck will be needed...10:45
AJaegericey: not enough, let me comment...10:45
iceythanks AJaeger - it's not super high priority for me, but I thought it'd be nice to get it "properly" retired, given the ML thread10:45
AJaegericey: I'll stop reviewing that insanity for a single .gitignore ;( Let me send a reply on the mailing list10:48
iceythanks for the comment AJaeger - in the meantime, I'll abandon these changes then10:52
*** pkopec has joined #openstack-infra11:00
AJaegericey: ok - let's see where the discussion goes...11:00
*** udesale has quit IRC11:00
iceyAJaeger: indeed!11:00
*** ykarel_ is now known as ykarel|meeting11:01
*** priteau has joined #openstack-infra11:08
*** priteau has quit IRC11:16
*** tesseract has joined #openstack-infra11:17
*** yamamoto has joined #openstack-infra11:18
*** tesseract has quit IRC11:20
*** adriancz has joined #openstack-infra11:20
*** tesseract has joined #openstack-infra11:21
*** rh-jelabarre has joined #openstack-infra11:22
*** yamamoto has quit IRC11:22
*** ricolin has quit IRC11:26
openstackgerritChandan Kumar (raukadah) proposed openstack/diskimage-builder master: [RHEL] Set _clear_old_files=0 in install-pip element  https://review.opendev.org/66964111:32
*** Lucas_Gray has quit IRC11:37
*** happyhemant has joined #openstack-infra11:37
*** rfolco has joined #openstack-infra11:38
*** Lucas_Gray has joined #openstack-infra11:40
*** bhavikdbavishi has joined #openstack-infra11:43
*** Lucas_Gray has quit IRC11:48
*** ykarel_ has joined #openstack-infra11:49
*** ykarel|meeting has quit IRC11:52
*** Lucas_Gray has joined #openstack-infra11:52
*** bobh has joined #openstack-infra11:53
*** iurygregory has quit IRC12:04
*** bobh has quit IRC12:06
*** ykarel_ is now known as ykarel12:06
*** smarcet has joined #openstack-infra12:06
*** ykarel is now known as ykarel|afk12:07
*** ykarel|afk has quit IRC12:11
*** goldyfruit has quit IRC12:22
*** rlandy has joined #openstack-infra12:29
*** ijw has joined #openstack-infra12:30
*** ijw has quit IRC12:35
*** ykarel|afk has joined #openstack-infra12:40
*** ykarel|afk is now known as ykarel12:43
*** ekultails has joined #openstack-infra12:54
*** rfarr__ has joined #openstack-infra12:54
*** iurygregory has joined #openstack-infra13:07
*** bhavikdbavishi has quit IRC13:08
*** bhavikdbavishi has joined #openstack-infra13:09
*** goldyfruit has joined #openstack-infra13:10
*** tesseract has quit IRC13:14
*** yolanda has quit IRC13:14
*** tesseract has joined #openstack-infra13:16
*** lseki has joined #openstack-infra13:20
*** mriedem has joined #openstack-infra13:23
*** aaronsheffield has joined #openstack-infra13:32
*** sreejithp has joined #openstack-infra13:35
AJaegerinfra-root, seems our retirement ACLs are not working - https://review.opendev.org/#/admin/projects/openstack/charm-ceph,access lists the old ACLs, not the retired ones. But project-config/gerrit/projects.yaml points to the retired ones...13:41
AJaegerthis is beneficial, if we need to merge https://review.opendev.org/#/c/669618/ ;) But points out a problem...13:42
*** haleyb|away is now known as haleyb13:44
*** electrofelix has quit IRC13:45
AJaegersame problem here : https://review.opendev.org/#/admin/projects/opendev/puppet-releasestatus,access .13:45
AJaegerOr do I misunderstand the ACLs process?13:45
clarkbAJaeger: that is a known issue with jeepyb. I have a patch up somewhere for it but we need to test it and manually apply it once merged13:46
AJaegerclarkb: Ok, thanks13:47
*** bobh has joined #openstack-infra13:47
*** jamesmcarthur has joined #openstack-infra13:47
AJaegerclarkb: but since repo is read-only, people won't be able to merge anything, correct?13:47
clarkbThe readonly change is part of the acl update I think, so it may not flip to read only either13:48
*** eharney has joined #openstack-infra13:49
AJaegeroops13:50
AJaegerbut since we remove it from zuul, zuul will not merge - so, leaves the loophole that an infra-root could send a +213:50
AJaegerno real problem I guess ;)13:50
*** bobh has quit IRC13:52
*** jamesmcarthur has quit IRC13:52
*** jcoufal has joined #openstack-infra13:53
*** ykarel is now known as ykarel|afk13:53
*** michael-beaver has joined #openstack-infra13:53
corvusi'm going to restart zuul to pick up the config caching fix13:57
*** sgw has quit IRC13:57
AJaeger\o/13:57
AJaegercorvus: good morning! I updated your zuul-jobs stack, could you check later whether it's fine? Everything passes now :) https://review.opendev.org/#/c/66895513:58
corvusAJaeger: thanks, will do13:58
johnsomFYI, https://zuul.openstack.org/status all I get is "Something went wrong."13:59
*** smarcet has quit IRC14:00
AJaegerjohnsom: corvus is restarting zuul...14:00
AJaegerjohnsom: grab a coffee/tea/whatever and try again, please ;)14:00
corvusAJaeger: we have jobs which use "fedora-latest" nodesets; are you saying they are broken?14:01
johnsomlol, just did grab my coffee.14:01
AJaegercorvus: nodesets are fine - but not as LABEL14:01
*** priteau has joined #openstack-infra14:02
corvusAJaeger: oh, i see...14:02
*** smarcet has joined #openstack-infra14:02
*** sthussey has joined #openstack-infra14:04
*** FlorianFa has quit IRC14:05
*** smarcet has quit IRC14:06
corvus#status log restarted all of zuul on commit 5b851c14f2bd73039748fca71b5db3b05b697f7f14:07
openstackstatuscorvus: finished logging14:07
*** rfarr__ has quit IRC14:08
*** rfarr has joined #openstack-infra14:08
*** dmellado has quit IRC14:09
*** stevebaker has quit IRC14:09
*** dmellado has joined #openstack-infra14:11
*** Lucas_Gray has quit IRC14:12
*** stevebaker has joined #openstack-infra14:14
*** Lucas_Gray has joined #openstack-infra14:14
*** yolanda has joined #openstack-infra14:15
openstackgerritJames E. Blair proposed zuul/zuul-jobs master: Don't use -latest nodesets in integration jobs  https://review.opendev.org/66967514:17
openstackgerritJames E. Blair proposed zuul/zuul-jobs master: Change the repo used in some base-roles tests  https://review.opendev.org/66967614:18
corvusAJaeger: your updates look good, thanks; those are the small changes i'd like to make ^14:18
elodhi, i have a question regarding a patch, which was merged in but gerrit showed it as open and now after a recheck it got (2nd time?) merged, but disappeared: https://review.opendev.org/66772114:18
elodshould/can I do anything with it?14:19
openstackgerritChandan Kumar (raukadah) proposed openstack/diskimage-builder master: [RHEL-8] Set _clear_old_files=0 in install-pip element  https://review.opendev.org/66964114:20
AJaegercorvus: will review later - thanks14:22
*** ykarel|afk is now known as ykarel14:23
corvuselod: if the change appears in the git repo and the review history isn't critical, then i wouldn't worry about it.  i'll look into it to see if there's something systemically wrong, but if it's not super important, i may not try to recover it.14:23
*** dpawlik has quit IRC14:24
*** armax has joined #openstack-infra14:24
elodcorvus: i don't think it's superimportant :) just looked weird and i didn't know what to do with it :) thanks for the answer!14:25
corvusthis is the db record for that change: http://paste.openstack.org/show/754160/14:26
corvuselod: ^ that look right?14:26
elodcorvus: the change_key and change_id looks ok, i mean that's the one14:28
corvuselod: project and branch right?14:28
elodcorvus: manila and stable/queens, yes14:29
corvuselod: thanks14:29
elodcorvus: thanks, too!14:30
*** stevebaker has quit IRC14:30
*** dmellado has quit IRC14:30
*** dmellado has joined #openstack-infra14:32
*** chandankumar is now known as raukadah14:35
*** ricolin has joined #openstack-infra14:35
corvushttps://review.opendev.org/#/c/667721/4 works, patchsets 5 and 6 are in the database and the refs are on disk, their commits are present.  they have the same sha, which is curious.  i believe ps6 is the merge from zuul, but i don't understand why gerrit would create a new ps with the same sha for a merge.14:36
*** stevebaker has joined #openstack-infra14:37
*** bdodd has joined #openstack-infra14:38
corvusnone of the patchsets are drafts14:38
corvusi'm going to stop here and see if fungi or clarkb have any ideas about whether we need to dig further14:40
elodcorvus: patch 5 was merged on July 4, but I saw that it's still open in gerrit, so I commented a recheck, which resulted in this strange issue :S14:40
elodcorvus: ok, thanks again!14:40
clarkblooking at gitea it was only merged once14:42
openstackgerritJames E. Blair proposed zuul/zuul-jobs master: Don't use -latest nodesets in integration jobs  https://review.opendev.org/66967514:45
openstackgerritJames E. Blair proposed zuul/zuul-jobs master: Change the repo used in some base-roles tests  https://review.opendev.org/66967614:45
*** smarcet has joined #openstack-infra14:46
*** gouthamr has quit IRC14:47
*** josephrsandoval has joined #openstack-infra14:48
openstackgerritJames E. Blair proposed zuul/zuul-jobs master: Add test job for ensure-sphinx  https://review.opendev.org/66895814:49
openstackgerritJames E. Blair proposed zuul/zuul-jobs master: Update the sphinx python package sets  https://review.opendev.org/66895414:49
*** stevebaker has quit IRC14:49
corvuscloudnull: ^ we needed a recheck anyway, so i rebased your change on the one that adds the test14:49
*** dmellado has quit IRC14:49
*** smarcet has quit IRC14:50
clarkbinfra-root can I get a second review on https://review.opendev.org/#/c/669007/ to finish up the fortnebula mirror work. Currently working with donnyd to diagnose an image upload problem then hopefully we are in a spot to turn on some test nodes14:51
*** dmellado has joined #openstack-infra14:53
*** jamesmcarthur has joined #openstack-infra14:54
cloudnullthanks corvus14:55
*** gouthamr has joined #openstack-infra14:56
*** iurygregory_ has joined #openstack-infra14:57
*** iurygregory has quit IRC14:57
*** stevebaker has joined #openstack-infra15:01
*** josephrsandoval has quit IRC15:07
*** dklyle has joined #openstack-infra15:13
*** sgw has joined #openstack-infra15:15
*** bobh has joined #openstack-infra15:15
openstackgerritMerged opendev/system-config master: Add apache restart handler for fortnebula LE setup  https://review.opendev.org/66900715:17
*** smarcet has joined #openstack-infra15:17
*** gyee has joined #openstack-infra15:17
openstackgerritMerged openstack/cookiecutter master: Add a testenv for lower-constraints  https://review.opendev.org/66073215:19
*** bobh has quit IRC15:20
*** apetrich has quit IRC15:24
*** dklyle has quit IRC15:28
*** dklyle has joined #openstack-infra15:28
*** iurygregory has joined #openstack-infra15:30
*** rlandy has quit IRC15:31
*** ykarel is now known as ykarel|away15:32
fungiokay, back from errands, catching up again15:32
*** iurygregory_ has quit IRC15:32
openstackgerritClark Boylan proposed openstack/project-config master: Add test nodes in fortnebula cloud  https://review.opendev.org/66970515:34
clarkbimages are uploading. If we don't want to approve ^ until images are uploaded I'll happily approve that change then if yall want to review it15:35
fungicorvus: so it looks like we did not see the scheduler memory leak last week while i was away... maybe i'm the cause? ;)15:35
AJaegerfungi, clarkb, could you review the zuul-jobs self-testing stack starting at https://review.opendev.org/#/c/668955 , please? That is work by corvus and updated by me - so, I'm not voting...15:35
clarkbAlso I checked and ssh keys and security groups seem to have updated so fixes for cloud launcher seem to have worked15:35
*** rlandy_ has joined #openstack-infra15:36
fungicorvus: as for repl, we can enable it on demand now using the rpc client right? and don't need to do that until we're ready to connect and inspect?15:36
*** rlandy_ is now known as rlandy15:37
clarkbdonnyd: https://review.opendev.org/669705 if you want to give that a look over too15:37
*** rlandy has quit IRC15:38
corvusAJaeger: i think you should vote on that stack15:39
corvusfungi: yes15:39
fungiawesome, thanks!15:40
donnydclarkb: will nodepool just figure out what image to use. All the images that are loaded have what looks like a date attached to the name, but the play only has a generic name15:42
AJaegercorvus: with +2? or just +1?15:42
clarkbdonnyd: ya there is a lookup table in a zookeeper database it maintains15:42
clarkbdonnyd: so we use logical names in the yaml there but it finds the current actual image for that logical name in the db15:43
donnydmakes perfect sense to me15:43
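The lookup clarkb describes can be pictured with a small Python sketch. It is not nodepool's code: the record fields (`image`, `external_name`, `state`, `timestamp`), the sample names, and the `current_image` helper are all hypothetical, and a plain list stands in for the ZooKeeper data. It only illustrates resolving a logical image name to the newest ready upload.

```python
# Hypothetical stand-in for the upload records nodepool keeps in ZooKeeper.
# Field names and values are made up for illustration.
SAMPLE_UPLOADS = [
    {"image": "centos-7", "external_name": "centos-7-1562400000",
     "state": "ready", "timestamp": 1562400000},
    {"image": "centos-7", "external_name": "centos-7-1562500000",
     "state": "ready", "timestamp": 1562500000},
    {"image": "centos-7", "external_name": "centos-7-1562600000",
     "state": "uploading", "timestamp": 1562600000},
    {"image": "ubuntu-bionic", "external_name": "ubuntu-bionic-1562450000",
     "state": "ready", "timestamp": 1562450000},
]


def current_image(uploads, logical_name):
    """Return the cloud-side name of the newest ready upload for a logical name."""
    ready = [
        u for u in uploads
        if u["image"] == logical_name and u["state"] == "ready"
    ]
    if not ready:
        raise LookupError(f"no ready upload for {logical_name!r}")
    # The most recent ready upload wins; uploads still in progress are ignored.
    return max(ready, key=lambda u: u["timestamp"])["external_name"]
```

In this sketch the YAML configuration would only ever mention `centos-7`; the dated name is resolved at launch time, which is why the provider config can stay generic.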
*** rlandy has joined #openstack-infra15:44
*** rlandy is now known as rlandy_15:45
*** rlandy has joined #openstack-infra15:45
*** tesseract has quit IRC15:46
AJaegercorvus: want to +2 on https://review.opendev.org/#/c/669147 ?15:47
*** tesseract has joined #openstack-infra15:48
*** smarcet_ has joined #openstack-infra15:48
*** smarcet has quit IRC15:48
*** smarcet_ is now known as smarcet15:48
*** lpetrut has quit IRC15:50
*** pgaxatte has quit IRC15:53
*** smarcet has quit IRC15:54
*** kjackal has quit IRC15:55
*** ginopc has quit IRC15:57
clarkbdown to two images remaining to upload for https://review.opendev.org/669705 corvus fungi maybe you can take a look at that change?16:04
openstackgerritStephen Finucane proposed openstack/project-config master: Initiate retirement of openstack/docs-specs  https://review.opendev.org/66885316:05
openstackgerritStephen Finucane proposed openstack/project-config master: Retire docs-specs  https://review.opendev.org/66885416:05
corvusclarkb: any other fn changes i should look at?16:05
clarkbcorvus: I think that is it right now16:06
clarkbthe fix for LE apache handler got in already and that was the other one I had in the queue16:06
*** iurygregory has quit IRC16:07
*** lucasagomes has quit IRC16:09
*** mattw4 has joined #openstack-infra16:09
*** smarcet has joined #openstack-infra16:09
clarkbcorvus: reading the change AJaeger linked above, https://review.opendev.org/#/c/668955, is the tr cleanup something we have a problem with or just boilerplate from elsewhere? trying to figure out what the double newline thing is for16:10
*** whoami-rajat has quit IRC16:10
fungielod: corvus: the situation with 667721 is strange indeed. looks like ps5 was the result of a modification through the webui uncommenting a job name in .zuul.yaml and removing some associated todo comment lines. the timing of that edit is around when our daily database backups happen i think? so this may have been gerrit timing out an update query while waiting on the lock to be released16:10
fungiwe probably still have logs from that date, so going to see what i can find in them16:11
fungioh, this is fun...16:12
fungi[2019-07-04 00:16:08,506] [HTTP-83] ERROR com.google.gerrit.httpd.restapi.RestApiServlet : Error in POST /changes/667721/edit:publish16:13
fungias usual, the associated java backtrace is a mile long16:13
corvusclarkb: boilerplate from project-config -- ruamel has the entire file too much indented.16:14
fungibut seems like it's bubbling up from in com.google.gwtorm.jdbc.JdbcAccess.update16:14
fungiCaused by: com.google.gwtorm.server.OrmConcurrencyException: Concurrent modification detected16:14
fungiso that does sound like it could indeed be related to db backups16:14
corvusfungi: oh good catch16:15
*** whoami-rajat has joined #openstack-infra16:15
openstackgerritMerged openstack/project-config master: Add test nodes in fortnebula cloud  https://review.opendev.org/66970516:15
clarkbAll images have uploaded so ^ should be all set16:16
*** smarcet has quit IRC16:16
elodfungi: thanks for the investigation! the result sounds interesting :-o16:17
*** bhavikdbavishi has quit IRC16:17
donnydis that all that we need? just those 13 or so images16:17
*** tesseract has quit IRC16:18
*** Lucas_Gray has quit IRC16:18
*** smarcet has joined #openstack-infra16:24
corvusclarkb: is there a grafana dashboard?16:24
clarkbcorvus: not yet16:26
clarkbdonnyd: ya that is all of them16:26
corvusclarkb: want me to write that?16:26
*** jamesmcarthur has quit IRC16:27
clarkbcorvus: that would be great, thanks16:27
clarkbI'm going to work through this zuul-jobs stack then do a bike ride. Will keep an eye on fortnebula instances too16:28
donnydenjoy clarkb16:29
donnydI have my eyes peeled16:29
openstackgerritJames E. Blair proposed openstack/project-config master: grafana: add FortNebula dashboard  https://review.opendev.org/66971016:30
*** jamesmcarthur has joined #openstack-infra16:30
*** dtantsur is now known as dtantsur|afk16:34
*** jamesmcarthur has quit IRC16:35
*** igordc has joined #openstack-infra16:37
mnaserAJaeger: is https://review.opendev.org/#/c/668853/ okay to land now? you had a -1 earlier16:45
mnasertrying to move a stack in openstack/governance which depends on the retirement of this16:45
*** rpittau is now known as rpittau|afk16:50
openstackgerritJames E. Blair proposed zuul/zuul master: WIP: Run jobs when their own config changes  https://review.opendev.org/66972216:53
openstackgerritMerged zuul/zuul-jobs master: Add a script to make platform-specific versions of jobs  https://review.opendev.org/66895516:54
AJaegermnaser: that one is fine finally16:55
mnaserAJaeger: yay great16:55
mnaserAJaeger: i'16:55
*** psachin has quit IRC16:55
mnaseri have +2 and i will let you +A then i will recheck to get the +1 noop on the retirement patch16:55
mnaserand chase down some reviews :)16:55
*** jamesmcarthur has joined #openstack-infra16:56
AJaegermnaser: done...16:57
AJaegerfungi, thanks for reviews - please review https://review.opendev.org/#/c/669147 as final change...16:58
fungiyep, i'm on that one now16:59
*** derekh has quit IRC17:00
AJaegerthanks, fungi17:00
fungii was just trying to check the dnm triggering change for it first17:00
AJaegerah17:00
fungibut gertty was crashing on me so took a moment to pull it up17:00
AJaegerfungi: it will merge conflict - I'll rebase once the existing stack is in...17:02
openstackgerritMerged openstack/project-config master: grafana: add FortNebula dashboard  https://review.opendev.org/66971017:03
*** diablo_rojo has joined #openstack-infra17:04
openstackgerritMerged openstack/project-config master: Initiate retirement of openstack/docs-specs  https://review.opendev.org/66885317:05
openstackgerritAndreas Jaeger proposed zuul/zuul-jobs master: Add Gentoo integration tests  https://review.opendev.org/66914717:06
AJaegerfungi, corvus, could you review again, please? ^17:06
* AJaeger had to rebase17:06
*** ykarel|away has quit IRC17:06
*** igordc has quit IRC17:10
openstackgerritMerged zuul/zuul-jobs master: Add base role integration jobs  https://review.opendev.org/66806117:11
openstackgerritAlex Schultz proposed openstack/project-config master: Retire tempest-tripleo-ui  https://review.opendev.org/66794917:14
AJaegerconfig-core, could you review https://review.opendev.org/#/c/668958 for zuul-jobs testing of sphinx role, please?17:14
AJaegermnaser: want to review 667949 , please?17:14
openstackgerritMerged zuul/zuul-jobs master: Add multi-node integration jobs  https://review.opendev.org/66876717:15
openstackgerritMerged zuul/zuul-jobs master: Don't use -latest nodesets in integration jobs  https://review.opendev.org/66967517:15
*** ralonsoh has quit IRC17:19
fungicorvus: repeatable crash in gertty by passing an integer topic to the search: http://paste.openstack.org/show/754170/17:21
*** priteau has quit IRC17:21
funginot sure if that's already a known issue17:21
corvusfungi: neat, didn't know that.17:23
fungithere are a couple newer commits in master than the version i've got installed, but none which look like they would address that17:26
*** vankalice has joined #openstack-infra17:28
corvusfungi: yeah, it repros locally and i'm running master17:28
corvusAJaeger: it turns out the only role left in ozj is mirror-info, and we moved that to opendev/base-jobs, so the real version of that role is elsewhere (and untested, since it's a config-project).17:30
corvusit's the role that creates the /etc/ci/mirror_info.sh script.  is that something we want to keep doing in opendev?  or is that a legacy-openstack-only thing?17:31
fungigrey area... it's legacy for sure, whether it's openstack-only i'm less certain17:32
AJaegerIt looks worthwhile for opendev as well17:32
AJaegerbut it's something that not every Zuul user might want17:32
corvusi see 3 options for that: 1) keep doing it for all of opendev, in which case we should move it to an untrusted project in the opendev tenant to regain the testing we had in ozj.  2) make it openstack-only -- in which case we delete it from opendev base jobs, and make a new openstack base job which includes it from ozj; 3) generalize it and support it in zuul-jobs for everyone.17:32
*** jamesmcarthur has quit IRC17:34
fungii think with the bindep-fallback and fetch-zuul-cloner changes recently we treated "legacy" zuul v2 transitional bits as openstackisms17:34
fungiso i'm leaning toward option 2 there17:35
donnydclarkb: I see some nodes spinning up fyi17:35
corvusdonnyd, clarkb: http://grafana.openstack.org/d/3Bwpi5SZk/nodepool-fortnebula?orgId=117:35
AJaegerfungi, corvus, I see mirror support as important - but /etc/ci/mirror_info.sh sounds legacy...17:36
AJaegersorry, can't think this through completely right now ;/17:36
corvusyeah, pulling on this thread is going to unravel a bit17:37
*** jamesmcarthur has joined #openstack-infra17:37
corvusdonnyd: the graph is showing errors; i'll track down the launcher and see what they are17:38
fungiyes, we used to have nodepool install that file in the ready script i think? and then added the ansible role during the v3 transition to support the auto-converted jobs which were using it17:38
corvusyeah.  i *think* in the long run, we can expect jobs to do whatever configuration they need via ansible variables, so i *think* deprecating it will be viable17:39
*** ijw has joined #openstack-infra17:39
*** ijw_ has joined #openstack-infra17:40
donnydit looks like all the instances are available17:41
*** igordc has joined #openstack-infra17:41
corvusdonnyd, clarkb: http://paste.openstack.org/show/754171/17:41
openstackgerritMerged zuul/zuul-jobs master: Change the repo used in some base-roles tests  https://review.opendev.org/66967617:42
openstackgerritMerged zuul/zuul-jobs master: Add Gentoo integration tests  https://review.opendev.org/66914717:42
fungihttp://codesearch.openstack.org/?q=mirror_info.sh shows it being used by tripleo (quickstart and ci), dib, devstack (optional i think?), openstack-ansible, devstack-gate and some d-g hooks in other projects (legacy i guess), helm's loci image builder, also puppet-openstack-integration and windmill look like they optionally use it17:42
*** igordc has quit IRC17:43
*** weifan has joined #openstack-infra17:44
*** ijw has quit IRC17:44
*** roman_g has quit IRC17:45
donnydcorvus: I am running it down17:46
*** SpamapS has quit IRC17:46
*** igordc has joined #openstack-infra17:46
openstackgerritJames E. Blair proposed openstack/openstack-zuul-jobs master: Remove base role integration testing  https://review.opendev.org/66972717:47
openstackgerritJames E. Blair proposed openstack/openstack-zuul-jobs master: Remove base role integration testing  https://review.opendev.org/66972717:48
donnydcorvus: clarkb looks like the centos image is not bringing up ipv617:48
donnydI am using SLAAC ATM17:48
corvusdonnyd: ah.  it does look like nodes for other images are going into ready / in-use states, which further points the finger at something centos-7 specific17:49
donnydwell the fedora image is doing the same. I can swap over ipv6 advertisement methods to see if the images will work another way17:50
corvus(also, i just discovered to much amusement that "gentos" is a dvorak typo for "centos")17:50
*** jamesmcarthur has quit IRC17:52
*** SpamapS has joined #openstack-infra17:55
corvusdonnyd: in case you're not aware, we use glean on our images: https://opendev.org/opendev/glean#glean17:55
corvusas long as there's no network info on a config drive, it should get out of the way17:55
corvusbut otherwise, it could be a factor17:56
clarkbcatching up17:58
*** jamesmcarthur has joined #openstack-infra17:58
clarkbI believe that fedora and centos images rely on router advertisements for ipv617:58
clarkbbecause glean doesn't know how to configure them statically (eg like on rax)17:58
*** jamesmcarthur has quit IRC17:59
clarkbI can boot a !centos test node, ssh into that then ssh into a centos test node via private ipv4 to see what is going on17:59
donnydI can swap the networking over to dhcpv6-stateless18:01
*** ociuhandu_ has joined #openstack-infra18:01
clarkblogan-: ^ is that what we've got in limestone?18:01
logan-checking18:03
logan-yep https://opendev.org/limestone/ci-cloud-config/src/branch/master/network_bootstrap/bootstrap-neutron.yml#L87-L8818:03
*** ociuhandu has quit IRC18:03
donnydThat seems like an easier fix to me18:04
donnydand it would be better if fn was set up like the others as best I can18:04
*** ociuhandu_ has quit IRC18:06
clarkbok I see a problem (my fault). fix on its way18:11
clarkbunrelated to ipv618:11
fungiwas it stateful dhcp6 in fn previously?18:13
openstackgerritClark Boylan proposed openstack/project-config master: Use opendev mirror in fortnebula cloud  https://review.opendev.org/66973318:13
donnydWell I swapped it out anyways18:14
*** jamesmcarthur has joined #openstack-infra18:14
clarkbfungi: we didn't use it previously18:14
donnydfungi: no it was SLAAC18:14
clarkbfungi: and ubuntu handles all the cases with glean just fine iirc18:14
*** jamesmcarthur has quit IRC18:14
clarkbinfra-root https://review.opendev.org/669733 is needed to make jobs work in fn (a thing I missed with the mirror setup in zuul)18:14
donnydits dhcpv6-stateless now18:14
AJaegerclarkb, fungi, could either of you review https://review.opendev.org/668958 , please?18:14
fungiahh18:14
*** jamesmcarthur has joined #openstack-infra18:15
clarkbAJaeger: looking18:15
fungiwell, stateless dhcp6 still relies on slaac18:15
AJaegerthanks, clarkb. btw. +2 on 73318:15
fungiit's just a means of handling dns servers and the like18:15
donnydfungi: understood, but its what limestone is using.18:16
donnydso might as well be consistent18:16
fungisure, not disagreeing18:16
clarkbfungi: donnyd ya neutron's terms for things don't always necessarily map to what you expect either18:16
fungijust curious what makes that problematic for centos/fedora18:16
clarkbso it could be that dhcpv6 stateless does something unexpectedly different18:16
fungioh, right, i forgot neutron has its own definitions for those terms ;)18:17
clarkbfungi: the difference on our images is that glean does not configure ipv6 via config drive data18:17
AJaegerconfig-core, if anybody wants to review, please look at https://review.opendev.org/667949 https://review.opendev.org/665910 and https://review.opendev.org/66870818:17
clarkbfungi: when on red hat distros18:17
clarkbfungi: its possible that slaac comes with config drive data and dhcpv6 stateless does not18:17
clarkbbecause that data is expected from dhcp in that config18:17
fungigot it18:17
logan-originally limestone was set up with dhcpv6-stateful and it was causing problems in the nodes https://opendev.org/limestone/ci-cloud-config/commit/d59ba2a74d4661d0868ca43d9f6744b91c8d06ad -- i don't remember the details though18:17
*** witek has quit IRC18:20
*** whoami-rajat has quit IRC18:20
yoctozeptogerrit masters, could you take a look at why url in this commit message displays weirdly in gerrit summary: https://review.opendev.org/#/c/669730/118:21
clarkblogan-: ya I think it came down to different neutron behaviors interacting with glean18:21
*** priteau has joined #openstack-infra18:22
*** tdasilva has joined #openstack-infra18:23
*** vankalice has quit IRC18:23
donnydlooks like the gentoo image is having issues on ipv6 as well18:29
clarkbyoctozepto: our gerrit config injects url links for changeids18:29
openstackgerritMerged zuul/zuul-jobs master: Add test job for ensure-sphinx  https://review.opendev.org/66895818:30
clarkbyoctozepto: if you replace that gerrit url with just the change id or a url to a change number and not id the weird render should go away18:30
yoctozeptoclarkb: thanks for explanation!18:30
*** weifan has quit IRC18:32
*** weifan has joined #openstack-infra18:32
*** weifan has quit IRC18:32
*** weifan has joined #openstack-infra18:32
*** weifan has quit IRC18:34
*** weifan has joined #openstack-infra18:34
openstackgerritMerged zuul/zuul-jobs master: Update the sphinx python package sets  https://review.opendev.org/66895418:34
openstackgerritMerged openstack/project-config master: Use opendev mirror in fortnebula cloud  https://review.opendev.org/66973318:34
*** weifan has quit IRC18:35
*** weifan has joined #openstack-infra18:35
*** weifan has quit IRC18:36
*** weifan has joined #openstack-infra18:36
*** weifan has quit IRC18:36
*** weifan has joined #openstack-infra18:37
*** weifan has quit IRC18:37
*** weifan has joined #openstack-infra18:37
*** weifan has quit IRC18:38
*** ijw_ has quit IRC18:43
*** ijw has joined #openstack-infra18:45
*** sshnaidm|ruck is now known as sshnaidm|afk18:46
*** ociuhandu has joined #openstack-infra18:47
*** weifan has joined #openstack-infra18:49
donnydseems like fedora is still not working on dhcpv6-stateless18:51
*** ijw has quit IRC18:52
*** whoami-rajat has joined #openstack-infra18:57
clarkbI'll boot a couple test nodes (ubuntu to ipv6 in and centos to examine?)18:58
clarkbdonnyd: Quota exceeded for cores: Requested 8, but already used 64 of 64 cores (HTTP 403) (Request-ID: req-6e9ed64a-c33d-41ba-843a-b62c51611aa3)19:00
clarkbdonnyd: we'll need to bump that up. cpus, ram, instances, ports and disk19:00
donnydclarkb: done19:01
donnydi removed quota for everything that isn't cpu's, instances and memory19:02
clarkbnow hitting the core limit of 8 (I need 10 * 8 for nodepool + 2 * 8 for my test nodes = 96)19:04
clarkbalright two test nodes building. Lets see how they do19:05
*** weifan has quit IRC19:11
openstackgerritJeff Liu proposed zuul/zuul-operator master: Create tasks that spin up kubernetes and run the operator  https://review.opendev.org/66802919:12
donnydhrm... seems like the bionic instance is stuck19:15
clarkbdonnyd: should I delete it and try again?19:15
clarkbor maybe leave it and try again (so that it can be debugged)?19:16
donnydseems like something a little deeper is going on19:16
openstackgerritBrian Haley proposed opendev/irc-meetings master: Create a meeting for Networking OVN project  https://review.opendev.org/66801319:19
*** ricolin has quit IRC19:21
*** yamamoto has joined #openstack-infra19:22
*** cloudnull is now known as cloudkiller19:25
*** yamamoto has quit IRC19:26
clarkbdonnyd: I'm gonna pop out for a few but will leave the test node that is stuck as is so you can poke at it19:29
donnydok thanks19:29
clarkblet me know if you think I should boot a second test node or if I should just wait19:29
*** factor has quit IRC19:30
donnydYou can just leave it19:30
*** factor has joined #openstack-infra19:30
*** jtomasek has joined #openstack-infra19:31
openstackgerritJeff Liu proposed zuul/zuul-operator master: Create tasks that spin up kubernetes and run the operator  https://review.opendev.org/66802919:33
*** icarusfactor has joined #openstack-infra19:45
*** eharney has quit IRC19:46
*** factor has quit IRC19:47
*** pkopec_ has joined #openstack-infra19:49
*** weifan has joined #openstack-infra19:50
*** pkopec has quit IRC19:51
*** ijw has joined #openstack-infra19:56
donnydclarkb: seems like the only thing that is working via ipv6 is bionic and xenial19:56
funginot even debian?19:57
donnyddoesn't look like it19:57
fungistrange19:57
fungibionic and buster should be fairly similar in the parts that would matter to boot and glean19:58
clarkbya I've got another test instance booting so that I can poke at the centos node19:58
clarkbI think we'll just have to see what is going on19:58
fungiyeah, does seem like this is deeper than just glean and ipv619:59
*** factor has joined #openstack-infra20:01
*** ociuhandu has quit IRC20:02
*** icarusfactor has quit IRC20:03
donnydlmk what you find out clarkb20:03
clarkbdonnyd: fungi 2001:470:e045:1:a0ac:2bb:a371:7c3 seems to be the addr it wants to use (generated from mac addr with your prefix I think)20:05
clarkbthat is what ifconfig reports at least20:05
clarkbnow to check interface configs20:05
clarkber rather the addr in neutron is based on the mac addr with prefix I think20:07
clarkbthe address above does not seem to be?20:07
*** jcoufal has quit IRC20:08
donnydwhat mac is it showing?20:10
openstackgerritJeff Liu proposed zuul/zuul-operator master: Create tasks that spin up kubernetes and run the operator  https://review.opendev.org/66802920:11
clarkbdonnyd: ether fa:16:3e:f0:ba:8520:12
clarkbinet6 2001:4800:7817:103:be76:4eff:fe04:5a1d  prefixlen 64  scopeid 0x0<global>20:12
clarkboh interesting that address keeps changing20:13
donnydthat is the link-local address isn20:13
clarkbwait20:13
donnydisn't it20:13
clarkbwrong terminal and host20:13
clarkbinet6 2001:470:e045:1:a0ac:2bb:a371:7c3  prefixlen 64  scopeid 0x0<global>20:13
clarkbthere20:13
clarkbthe mac addr matches the prefix + mac based address from neutron20:13
donnydcan you give me the output from ``` ip a```20:14
clarkbhrm paste is not working?20:16
clarkbdonnyd: https://gist.github.com/cboylan/ebaf1104fce9ae666266871558b6eff920:17
clarkbI'm digging through logs now to see if I can find what assigned the address it has20:17
donnyd 2001:470:e045:1:f816:3eff:fef0:ba8520:17
donnydno idea where this little fella is coming from 2001:470:e045:1:a0ac:2bb:a371:7c3/6420:18
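For reference, the address donnyd expects above is the traditional EUI-64 SLAAC form: the /64 prefix plus an interface identifier built from the MAC. A minimal sketch of that derivation follows; the function name `eui64_address` is made up for illustration and is not part of glean or neutron. It assumes a /64 prefix and a colon-separated 48-bit MAC.

```python
import ipaddress


def eui64_address(prefix: str, mac: str) -> str:
    """Build the EUI-64 SLAAC address for a MAC inside a /64 prefix.

    Hypothetical helper for illustration only.
    """
    net = ipaddress.IPv6Network(prefix)
    if net.prefixlen != 64:
        raise ValueError("EUI-64 SLAAC addressing assumes a /64 prefix")

    octets = bytearray(int(part, 16) for part in mac.split(":"))
    if len(octets) != 6:
        raise ValueError("expected a 48-bit MAC address")

    # Flip the universal/local bit of the first octet (fa -> f8).
    octets[0] ^= 0x02
    # Insert ff:fe between the two halves of the MAC.
    iid = bytes(octets[:3]) + b"\xff\xfe" + bytes(octets[3:])

    # Add the 64-bit interface identifier to the network address.
    return str(net.network_address + int.from_bytes(iid, "big"))


print(eui64_address("2001:470:e045:1::/64", "fa:16:3e:f0:ba:85"))
```

With the prefix and MAC quoted in the channel this yields the address donnyd pasted, which is why the a0ac:2bb:a371:7c3 address stands out as not MAC-derived.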
clarkbI'm going to try restarting networking to see if that changes anything20:20
donnydhttps://www.irccloud.com/pastebin/hz2mbZRq/20:21
donnydhttps://www.irccloud.com/pastebin/a01IAmg7/20:21
clarkbdonnyd: this is running with network manager so I'm not sure the dhcpv6 config helps?20:21
donnydis NM set to yes or no?20:21
clarkbyes20:22
donnydhrm20:24
*** icarusfactor has joined #openstack-infra20:25
fungithat looks like a discrepancy between traditional eui-64 slaac addressing and "privacy extensions" addressing20:25
fungiff:fe as the middle bytes of the host half of the address indicate eui-6420:26
clarkbfungi: thats the wrong address (I got it off bridge accidentally)20:26
fungiahh20:26
*** factor has quit IRC20:27
clarkbrestarting the network caused it to come up with the correct network address20:28
donnyddid you put the params in from above?20:30
clarkbdonnyd: no20:30
donnydor just restart the network20:30
clarkbjust systemctl restart network20:30
clarkbpossibly a race with glean and network manager then20:30
donnydwell that restarts network, not NetworkManager20:30
clarkbit should restart network manager too doesn't it?20:31
clarkbsince network manager provides network?20:31
donnydthe network service and NetworkManager are not the same, but I could be mistaken20:31
donnydcan you burn that instance and try doing just a network manager restart?20:32
clarkbyup20:32
*** Lucas_Gray has joined #openstack-infra20:32
*** smarcet has quit IRC20:34
fungifor centos i think the network service feeds the /etc/network-scripts/ifcfg-foo files into ifconfig (or more likely iproute2 these days)20:34
clarkbfungi: ya /etc/sysconfig/network-scripts/*20:35
clarkbwhich is where things were configured and I thought networkmanager also read that config20:35
fungiseems like network-manager knows how to read those files, but i agree restarting one doesn't necessarily notify the other20:35
clarkbwe should be able to restart network manager easily enough in a few minutes20:35
openstackgerritJeff Liu proposed zuul/zuul-operator master: Create tasks that spin up kubernetes and run the operator  https://review.opendev.org/66802920:35
clarkbrestarting network manager does not fix it20:38
*** rlandy has quit IRC20:39
clarkbThis makes me wonder if network manager just doesn't work right with glean20:39
*** rlandy_ is now known as rlandy20:39
fungii wouldn't be shocked to learn that's the case20:40
*** ianychoi has joined #openstack-infra20:41
clarkbdo we need IPV6_AUTOCONF=yes ?20:42
*** priteau has quit IRC20:44
fungii find references to setting IPV6_AUTOCONF=no for disabling slaac addressing20:45
fungisuggesting that it's likely defaulting to yes20:46
donnydIPV6_AUTOCONF = Enable IPv6 autoconf configuration for this interface (an IPv6 address will be requested using Neighbor Discovery (ND)20:46
clarkbI added that config to the ifcfg-eth0 file and restarted network manager to no change. Then rebooted (I think restarting network wouldve been fine)20:46
clarkband that updated the address20:46
clarkbI think network manager is just oblivious to ipv620:46
openstackgerritJames E. Blair proposed zuul/zuul master: WIP: Run jobs when their own config changes  https://review.opendev.org/66975220:46
clarkbI'm not sure what the intended mode of operation is here :/20:47
clarkbpersonally I've always used nm and /etc/network/interfaces type config separately20:48
fungiit may be that nm expects ipv6 to "just work" so ignores related settings20:48
clarkband use only one or the other20:48
donnydhttps://developer.gnome.org/NetworkManager/stable/settings-ipv6.html20:49
clarkbif I had to guess at a failure mode it is this: glean and network manager units are configured to happen in the right order but not glean and network. So if nm isn't managing ipv6 the glean written config which includes things like "here is your mac address" fails to configure stuff properly20:49
*** priteau has joined #openstack-infra20:49
donnydNM_SETTING_IP6_CONFIG_ADDR_GEN_MODE_EUI64 (0) or NM_SETTING_IP6_CONFIG_ADDR_GEN_MODE_STABLE_PRIVACY  (1)20:49
donnydand the default is of course 120:49
fungii honestly have no idea how neutron deals with privacy extensions20:50
donnydNot sure what the difference is between stable privacy mode and EUI64 other than EUI64 looks to be what SLAAC uses20:50
*** pcaruana has quit IRC20:50
donnyddoes this work in limestone?20:51
clarkbdonnyd: I don't think we actually know. The assumption is that it worked, but the NM changes in glean may have broken limestone + red hat distros and we didn't notice20:51
slaweqfungi: hi, if You are asking about IPv6 privacy extension, then neutron doesn't support it AFAICT20:52
fungithanks for confirming, slaweq!20:52
slaweqyw fungi :)20:52
clarkbhttps://developer.gnome.org/NetworkManager/stable/nm-settings-ifcfg-rh.html is the relevant redhat config to nm stuff20:53
fungiyeah, the switch to nm in our images was a very recent one, i think?20:53
clarkbfungi: relatively recent yes20:53
donnyd``` OpenStack currently doesn’t support the privacy extensions defined by RFC 4941. The interface identifier and DUID used must be directly derived from the MAC as described in RFC 2373. ```20:54
donnydis there a way to provide the config that is required to glean?20:54
clarkbdonnyd: glean already has that data via config-drive. The larger issue is that no one has written the code that configures interfaces for ipv6 on red hat distros20:55
donnydYea, I was a little late on that one.20:56
clarkbso in general we rely on do the right thing behavior of distros with it just working on boot20:56
donnydit looks to me like the only distro that works as we expect is bionic and xenial though20:57
clarkbreading these docs the default is to do what we want20:57
clarkbwith network manager specifically20:57
clarkbdonnyd: its possible the other distros have similarly broken20:58
donnydI also saw the gentoo image does the same20:58
clarkbgentoo relies on systemd-network or whatever its called iirc20:59
fungisystemd-networkd20:59
clarkbxenial and bionic are /etc/network/interfaces (as should be stretch and buster though buster is brand new)21:00
*** whoami-rajat has quit IRC21:00
donnydthe debian image also exhibits the same behavior21:00
clarkbdonnyd: is it buster or stretch or both?21:00
clarkbmaybe thats the thing to do then. Start checking the other distros and see what the common thread is here /me boots a stretch image21:01
*** mattw4 has quit IRC21:02
donnydyea, that may be worth while. If it's more common to not work oob, then maybe we should handle it a different way21:02
*** mattw4 has joined #openstack-infra21:03
*** nicolasbock has joined #openstack-infra21:04
clarkbalso when ianw wakes he may be able to help debug as he added the NM support to glean21:08
clarkbstretch instance is on its way up21:09
openstackgerritBrian Haley proposed opendev/irc-meetings master: Create a meeting for Networking OVN project  https://review.opendev.org/66801321:12
*** goldyfruit has quit IRC21:14
donnydclarkb: I looks like its the same deal as centos21:14
donnydit*21:14
donnydalso, how are you beating zuul.. I am apparently not fast enough to get an instance scheduled21:14
clarkbdonnyd: I had to delete the centos node to make a slot21:15
clarkbwhich is fine because I already "corrupted" it by rebooting it21:16
clarkbstill trying to get into the stretch instance21:16
clarkbit may not be configuring ipv4 there either21:16
fungiseems like we ought to just set max-servers there to 0 until this is figured out?21:16
clarkbfungi: I think the xenial and bionic nodes are working and nodepool won't give zuul a broken centos. So depends on whether or not it is more valuable to have easier time booting test instances vs zuul instances21:17
clarkbI can go either way on that21:17
*** jamesmcarthur has quit IRC21:17
*** smarcet has joined #openstack-infra21:21
clarkbok stretch does work21:22
clarkbits just super slow to boot?21:22
clarkbin any case ipv6 there configured properly it just took a while for network (v4 or v6) to come up21:23
* clarkb tries another21:23
*** mattw4 has quit IRC21:26
*** priteau has quit IRC21:27
*** mattw4 has joined #openstack-infra21:27
*** priteau has joined #openstack-infra21:27
*** cloudkiller is now known as cloudnull21:27
fungithat could be other services not finding sufficient entropy at boot21:28
*** priteau has quit IRC21:29
clarkbopensuse 15 image also works with ipv621:36
*** jtomasek has quit IRC21:36
clarkbdouble checking buster now21:38
clarkbthen gentoo21:38
openstackgerritJames E. Blair proposed zuul/zuul master: Run jobs when their own config changes  https://review.opendev.org/66975221:39
*** smarcet has quit IRC21:39
*** sreejithp has quit IRC21:41
clarkbbuster works21:42
openstackgerritJames E. Blair proposed opendev/system-config master: Remove .zuul.yaml file matchers  https://review.opendev.org/66976221:44
openstackgerritJames E. Blair proposed opendev/system-config master: Remove .zuul.yaml file matchers  https://review.opendev.org/66976221:48
*** smarcet has joined #openstack-infra21:49
fungias slow to boot as stretch?21:53
*** mriedem has quit IRC21:53
clarkbno, but that may be due to image caching too?21:57
clarkbstill waiting on gentoo to connect via ipv4 or v621:58
clarkbconsole log shows nothing21:58
clarkb(that may be a bug in our image)21:58
fungioh, yeah if that image hadn't been booted on that hypervisor host yet21:59
clarkbI'll give gentoo another couple minutes before I move on22:04
clarkbstill no connectivity over either ip version and no console log22:04
ianwhello, so basically ipv6 on centos not working with our images on fortnebula?22:06
clarkbianw: ya22:07
clarkbianw: and a reboot seems to fix it22:07
*** slaweq has quit IRC22:07
clarkbI'm doing a survey of our other images currently22:07
ianwhrm, yeah ordering is always a suspect ...22:08
clarkbon the off chance gentoo's problem is the image upload I have deleted it and will let nodepool reupload it22:09
ianwi feel deja-vu something pabelanger was looking at22:09
ianwi'm not sure we ever got to the bottom of it22:10
*** tjgresha has joined #openstack-infra22:10
clarkbBooting fedora-29 now22:10
clarkblets see if it acts like centos22:10
*** pkopec__ has joined #openstack-infra22:11
*** pkopec_ has quit IRC22:14
clarkbya fedora29 is the same22:18
clarkbcomes up with a bad/wrong ipv6 address first boot but on second boot is correct22:18
ianwbut a globally addressed bad address (not fe80:: etc?)22:19
clarkbcorrect22:19
clarkbbad: inet6 2001:470:e045:1:3a36:438:3e36:2e0d good: inet6 2001:470:e045:1:f816:3eff:feca:65122:19
clarkbis it possible that the mac addr isn't the same?22:20
clarkbI mean the actual ports mac address is different than fa:16:3e:ca:06:51 and then we change it?22:20
clarkbmaybe sysfs will tell me22:20
*** slaweq has joined #openstack-infra22:23
fungithe first (bad) one is a privacy extensions address, the second is a eui-64 slaac address22:27
fungilikely something is starting out with privacy extensions addressing turned on22:28
*** slaweq has quit IRC22:28
clarkbfungi: I thought you said the bridge.o.o address was the one that was privacy extensioned? (the one I mispasted)22:29
clarkbfungi: is the bad address above also a privacy extensioned addr?22:29
fungiyes22:29
clarkbah ok22:29
ianwyeah if i put the "bad" one into http://silmor.de/ipaddrcalc.html#ip6 it doesn't come back with a MAC (::3a36:438:3e36:2e0d)22:29
clarkbin that case why would a reboot change the behavior?22:29
fungi2001:470:e045:1:3a36:438:3e36:2e0d does not have a ff:fe in the middle of the host portion of the address22:29
fungiso is not eui-6422:30
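The check fungi and ianw describe (look for ff:fe in the middle of the host half, then read the MAC back out) can be sketched as below. The helper name `mac_from_eui64` is hypothetical. The ff:fe test is a heuristic: a randomly generated interface identifier could contain those bytes by coincidence.

```python
import ipaddress
from typing import Optional


def mac_from_eui64(address: str) -> Optional[str]:
    """Return the MAC embedded in an EUI-64 style IPv6 address, else None.

    Hypothetical helper for illustration only.
    """
    # The interface identifier is the low 64 bits of the address.
    iid = ipaddress.IPv6Address(address).packed[8:]

    # EUI-64 identifiers carry ff:fe in the middle two bytes.
    if iid[3:5] != b"\xff\xfe":
        return None

    # Undo the universal/local bit flip and drop the ff:fe filler.
    mac = bytes([iid[0] ^ 0x02]) + iid[1:3] + iid[5:]
    return ":".join(f"{octet:02x}" for octet in mac)


print(mac_from_eui64("2001:470:e045:1:f816:3eff:feca:651"))
print(mac_from_eui64("2001:470:e045:1:3a36:438:3e36:2e0d"))
```

Applied to the two fedora-29 addresses clarkb pasted, the "good" one maps back to the port's fa:16:3e:ca:06:51 MAC and the "bad" one yields nothing, matching what the online calculator reported.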
clarkbreading fedora bugs seems that NM doesn't have a default for privacy extensions and instead relies on sysctl22:31
clarkbwhich makes this weirder because we don't change sysctl with glean and a reboot shouldn't change that I don't think?22:31
fungiprivacy extensions stable addressing is likely enabled in the kernel by default on those platforms, though not sure why the eui-64 addressing gets added on second boot22:31
clarkbI'm going to look at sysctl on my current working/fixed instance and then boot a new one that should be broken and check that one22:32
ianwafaik glean has never ventured in to this area22:33
*** ekultails has quit IRC22:34
*** jamesmcarthur has joined #openstack-infra22:34
clarkbnet.ipv6.conf.ens3.use_tempaddr = 022:35
fungiprivacy extensions temporary addressing and privacy extensions stable addressing likely use different sysctls22:35
fungitempaddrs never really caught on because they make network management problematic22:36
fungiwhich is why stable addressing came about later as more or less a replacement22:36
clarkbfungi: I don't see any sysctls for it otherwise22:36
clarkbalso I found a clue22:36
clarkbgetting a link22:37
*** Lucas_Gray has quit IRC22:37
*** Lucas_Gray has joined #openstack-infra22:37
clarkbhttps://opendev.org/openstack/project-config/src/branch/master/nodepool/elements/nodepool-base/install.d/99-disable-rfc304122:37
clarkbthat may explain why things potentially change after a reboot22:37
auristorprivacy addresses should be transient and be flagged as temporary.  the rule is that when present, new outgoing connections default to the temporary address, unless the socket is bound to the permanent address22:37
clarkbapparently those sysctls are applied after networking is bootstrapped22:38
clarkbnow to boot a fresh node and compare22:38
auristortemporary addresses should not replace permanent addresses.22:38
clarkbauristor: ya I think what fungi is describing are stable permanent addrs not derived from mac addr22:39
ianwhttps://binblog.info/2017/09/21/ipv6-privacy-stable-addressing-roundup/ good link22:40
ianwLinux -> It's complicated22:40
ianw:)22:40
fungiyep, those22:40
fungitempaddrs are still relevant for things like mobile devices22:41
clarkbNetworkManager defaults to using addr-gen-mode=stable-privacy in the [ipv6] section of /etc/NetworkManager/system-connections/<Connection>. that may be it then22:41
fungifor servers they're aggravating22:41
clarkbok where is my link of settings for sysconfig to map into nm22:41
clarkbhttps://developer.gnome.org/NetworkManager/stable/nm-settings-ifcfg-rh.html22:42
clarkbthat says the default is eui6422:42
ianwclarkb: so the thought is that on first boot ipv6 has come up before nm or something (ordering?) ... then we configure it, then second boot it uses the stable address?22:42
clarkbianw: ya22:43
clarkbI still don't know how or why though22:43
clarkbone thing I've noticed is that network manager starts before glean, then glean updates the sysconfig interface file which network manager notices then updates the interface config22:44
clarkbpossible that NM isn't fully updating the interface config on that update (because not everything can be updated?) but then second boot the files are already in place and it does the correct thing then?22:45
openstackgerritJames E. Blair proposed zuul/zuul master: Run jobs when their own config changes  https://review.opendev.org/66975222:47
clarkbhrm the new fedora-29 I booted seems to actually have the correct IP22:49
clarkbso a race at boot seems likely22:49
ianwanother link -> https://arusso.io/Disabling_IPv6_Privacy_Extensions/ ... same sort of issue.  host uses SLAAC and if NM makes up an address it doesn't work22:49
clarkbit does seem like a really bad default for servers22:50
ianwi wonder if a "nmcli conn down/up" cycle as a hack works ... not very nice22:51
*** bdodd has quit IRC22:51
auristorthere can be three types of permanent non-link-local addresses: SLAAC+RDNSS and SLAAC+Stateless DHCPv6, and Stateful DHCPv6.  It is possible for a host to have more than one of them.22:51
clarkbwe are using the second22:52
fungitempaddrs are annoying for servers. cga (privacy stable) are a fine choice *if* your environment can support them since it hides your mac22:52
clarkbfungi: right but seems like none of these hosted envs support it22:53
clarkbfungi: that blog above isn't even openstack :)22:53
fungineutron wants to be able to know what address the guest will come up with, so yeah needs something (pre)deterministic22:53
*** jamesmcarthur has quit IRC22:53
auristoryou don't want temp addresses on servers unless you know the software that is running knows how to properly bind to permanent addresses and always reply using the same interface the request came in on.22:54
fungialso leaking the mac is less of a concern on virtual machines since they're fabricated and don't reflect the hardware manufacturer et cetera22:54
auristorAuriStorFS servers go to a lot of effort to ensure that we never use or advertise temporary addresses22:54
fungiauristor: in this case they're not tempaddrs, they're cga22:55
fungiso the main problem is that the virtual machine hosting platform can't guess in advance what cga will be generated by the guest on first boot22:55
auristorIf SLAAC+Stateless DHCPv6 is in use, any chance there is more than one DHCPv6 server ?22:55
*** goldyfruit has joined #openstack-infra22:56
fungiought to be able to tell from the syslog, though i'm curious how stateless would impact addressing at all22:57
clarkbauristor: I don't think so its just generating the private addr instead of the mac based one22:58
clarkblooking at fedora29 syslog it seems that glean writes the ifcfg-ens3 file before NM processes that interface and it works22:58
auristorfor CGA, where / when is the public key being generated?  does that happen at after first boot?22:58
clarkbI need to boot fedora-29 instances until I catch a broken one then compare syslogs I guess22:58
funginetworkmanager handles the address generation as a bit of a black box. from what i gather it creates the address each time it sees an interface with a new mac and then stashes that for future use23:00
*** jamesmcarthur has joined #openstack-infra23:00
fungi(not that it embeds the mac in the address, just uses it for indexing and handling hotplug events)23:00
*** tkajinam has quit IRC23:01
*** tkajinam has joined #openstack-infra23:01
ianwfrom my notes -> http://paste.openstack.org/show/754182/ ... i think that was what i was thinking of with the "reboot fixes it"23:01
fungianyway, the long and short of it is that the environment these images are booted in can't support cga at all, so we need to turn it off for playforms where it defaults to on23:01
fungis/playforms/platforms/23:02
fungilikely this came about with the switch to using networkmanager on those images23:02
auristoras I understand it the benefit of a cga address is that openssh client or server can use it and its possible to prove the client and server's public keys match that used to generate the cga23:02
fungithat was the design goal for that rfc, yes, though i don't know of any actual implementations that do it that way23:03
fungiinstead they just generate a random seed and use that to create the addresses23:03
fungiit was supposed to tie in with send (secure neighbor discovery)23:03
fungibut it has mostly ended up being used as a replacement for eui-64 addressing where folks want autoconfiguration but don't want to expose their interface macs23:04
fungiand who want to make it harder to brute-force the address space too23:05
auristorthe same chicken and egg problem exists.   which come first after initial boot, bringing up the interface or generating the key material?23:05
ianwfor reference; i would say that https://review.opendev.org/#/c/618964/9..17/glean/init/glean-nm%2540.service was the ordering change i referenced in the notes23:06
*** tosky has quit IRC23:06
ianwi'm sure i had a good reason for using local-fs in there, although not sure i can remember what it was :/23:06
clarkbianw: it has to write the config to disk23:07
fungii agree it's a bit of a chicken-and-egg problem, but i also have never been concerned with exposing my servers interface macs or having brute-forcible address space in my networks23:07
clarkband then glean seems to wait for the link to go active23:07
clarkbheh 2/2 working fedora-29 boots now23:08
clarkbdonnyd: ^ did you change anything cloud side? or maybe this is just a really weird race and we are gonna hit it sometimes23:09
* clarkb tries centos7 again23:09
donnydno23:09
donnydjust tinkering with the storage and the air handler23:09
ianwmaybe this is also supposed to have a network-* target somewhere?  https://opendev.org/opendev/glean/src/branch/master/glean/init/glean-nm@.service23:10
ianwdiscussing this is like groundhog day ...23:10
*** dchen has joined #openstack-infra23:11
*** weifan has quit IRC23:12
ianwas puppet seems to have shown too, computers may be good at serialising DAG's, but the humans configuring them can very quickly create very difficult to maintain things with them23:12
lifelessianw: like k8s ? :P23:13
ianwi'm 99% certain that if that "Before=local-fs" in the glean nm .service file gets changed to "Before=network-pre" (which does seem logical) it won't boot in the CI tests; that was the original issue23:14
*** andreaf has quit IRC23:15
*** andreaf has joined #openstack-infra23:15
clarkbyup comparing working fedora-29 syslog to broken centos7 the glean ifcfg-$interface writes happen before NetworkManager edits that interface and it works. Happens after and it fails23:18
clarkbwhich would explain why a reboot fixes it (because that ifcfg file is already in place on reboot)23:18
*** lseki has quit IRC23:19
clarkbwe don't really change any settings in that file that I would expect to affect this though23:19
*** goldyfruit has quit IRC23:19
clarkboh except that the file may not exist at all on very first boot so NM uses NM defaults and on second boot it uses the sysconfig plugin which depends on those defaults?23:20
clarkbmaybe an easy fix here is to write out ifcfg files? (problem with that is biosdevname is a thing now)23:20
clarkbI'm going to bet that that is the difference here. The sysconfig plugin likely tries to be server sane and NM without it is trying to handle the laptop case23:21
clarkbianw: can we do a Before NetworkManager ?23:22
ianwclarkb: i'm wondering if23:23
ianwBefore=network-pre.target23:23
ianwAfter=local-fs.target23:23
clarkbah23:23
ianwmight be the spot?  i'm not sure i tested that combo23:23
clarkbthat looks good :)23:23
ianwi think i went straight from "network-pre.target" not working to "local-fs.target" working in -> https://review.opendev.org/#/c/618964/9..17/glean/init/glean-nm%2540.service23:23
clarkbI think I can test that by rm'ing the ifcfg file. update systemd config then reboot23:23
* clarkb tries that23:24
ianwMaybe Before=network-pre.target, Wants=network-pre.target, After=local-fs.target ?23:24
clarkbtrying that23:24
ianwit's still got to have that DefaultDependencies=no as well23:25
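The ordering ianw proposes can be written down as a [Unit] section and read back to check the directives. This is only a sketch: `UNIT_SKETCH` contains just the four settings named in the channel and is not the real glean-nm@.service file, and `unit_directives` is a hypothetical helper. Python's configparser is used as a convenient reader here and does not implement systemd's full unit syntax.

```python
import configparser

# Only the directives mentioned in the discussion; not the actual glean unit.
UNIT_SKETCH = """\
[Unit]
DefaultDependencies=no
Before=network-pre.target
Wants=network-pre.target
After=local-fs.target
"""


def unit_directives(text: str) -> dict:
    """Parse the [Unit] section of a unit-file-like string into a dict."""
    parser = configparser.ConfigParser(interpolation=None, strict=False)
    # Keep directive names case-sensitive instead of lowercasing them.
    parser.optionxform = str
    parser.read_string(text)
    return dict(parser["Unit"])


directives = unit_directives(UNIT_SKETCH)
for name, value in directives.items():
    print(f"{name}={value}")
```

The intent is that glean writes the ifcfg file once the local filesystem is mounted but before anything ordered after network-pre.target, which is meant to include NetworkManager, touches the interface.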
clarkbI edited the unit in place (can just delete the VM and try again if necessary)23:26
clarkbso didn't change anything but Before and Wants23:26
clarkbianw: I think that did it23:26
clarkbinterface has correct ipv6 addr and I see in syslog that glean ran before the NM config of the interface23:27
clarkbianw: want to propose that change and see if CI is equally happy with it?23:27
clarkb(it was your idea)23:27
ianw... not sure i'd take credit for it :)23:28
clarkbI'm happy to push a change up and start some testing too23:29
* clarkb does this23:29
openstackgerritClark Boylan proposed opendev/glean master: Update NM unitfile to run after fs and before NM  https://review.opendev.org/66977223:31
clarkbsomething like that23:31
clarkbnow booting gentoo again with new image upload to see if the issue there was a bad image23:33
*** weifan has joined #openstack-infra23:35
*** rcernin has joined #openstack-infra23:35
*** rlandy has quit IRC23:36
openstackgerritIan Wienand proposed opendev/glean master: network-manager: add network-pre dependencies  https://review.opendev.org/66977323:36
ianwoh heh, great minds :)23:36
clarkbyour commit message is much better than mine (this is good though, it will get us twice as many test results)23:36
*** jamesmcarthur has quit IRC23:37
ianwyeah, let's let them run and make sure we get green23:37
ianwwith hindsight, this ordering now seems logical.  but i've thought the ordering was correct many times before :)23:38
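For reference, the ordering discussed above would look roughly like the following [Unit] fragment. This is only a sketch assembled from the directives named in the conversation, not the actual contents of glean-nm@.service: the Description line is a placeholder, and the [Service] and [Install] sections are omitted. The real changes are at https://review.opendev.org/669772 and https://review.opendev.org/669773.

```ini
[Unit]
# Placeholder description (assumption); the real unit's text may differ.
Description=Glean for interface %I with NetworkManager

# Opt out of the implicit dependencies systemd adds to ordinary services,
# so the unit is allowed to run this early in boot.
DefaultDependencies=no

# Run once local filesystems are mounted, so the ifcfg file can be written...
After=local-fs.target

# ...but before any network management software (NetworkManager here) starts.
Before=network-pre.target

# network-pre.target is a passive target; something has to pull it in
# for the Before= ordering above to have any effect.
Wants=network-pre.target
```

The intent, as described in the log, is that glean writes the ifcfg file before NetworkManager first touches the interface, so the first boot behaves like a reboot where the file already exists.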
*** smarcet has quit IRC23:39
*** pkopec_ has joined #openstack-infra23:40
*** gyee has quit IRC23:42
*** pkopec__ has quit IRC23:42
clarkbok confirmed that gentoo doesn't work either. glean is not running for any interfaces there23:43
clarkbah ok it may not run per interface like on other distros23:44
clarkbit may be racing the growfs23:45
clarkbgrowfs completes after glean (though not much after)23:45
*** mattw4 has quit IRC23:45
clarkbprometheanfire: ^ any ideas? the logging isn't great (I don't see any logs from glean other than mounting the config drive) and there is no net file in /etc/conf.d/net23:47
clarkbif the writes failed I would've expected tracebacks in journalctl -u glean though and don't have that23:47
openstackgerritNate Johnston proposed opendev/irc-meetings master: Capture artifacts from ical generation  https://review.opendev.org/66977523:50
clarkbI'll leave the gentoo instance up in case prometheanfire is around and able to debug at some point23:50
clarkbbut other than waiting for ci results I think this is good23:51
*** michael-beaver has quit IRC23:53
prometheanfirehi?23:54
*** weifan has quit IRC23:54
*** sthussey has quit IRC23:54
prometheanfireso glean is having a problem with growfs?23:54
prometheanfirealso, gentoo systemd or basic system?23:55
*** gyee has joined #openstack-infra23:57
*** jamesmcarthur has joined #openstack-infra23:57
clarkbprometheanfire: gentoo systemd is not configuring networking23:57
clarkbwith glean23:57
clarkbthere is no net.ens3 file in /etc/conf.d23:58
prometheanfireit shouldn't be23:58
donnydclarkb: so are we cooking with gas now?23:58
clarkb(so one idea I had is fs/growroot related)23:58
prometheanfireit should be setting things up in /etc/systemd/network23:58
clarkbprometheanfire: oh where is the config then?23:58
prometheanfiregentoo-systemd uses networkd23:58
clarkbok I must've misread the glean code then23:58
clarkboh I see23:58
clarkbtwo different paths23:58
prometheanfireyep23:59
clarkbdonnyd: ya we think fix is up and getting tested now23:59
clarkbprometheanfire: ok I'll debug that later then. dinner prep now23:59
prometheanfire:D23:59
