Monday, 2018-10-08

sfbenderLogan V created software-factory/sf-config master: Allow LECM to renew certificates  https://softwarefactory-project.io/r/13872 00:14
*** rcarrillocruz has quit IRC02:51
sfbenderMerged software-factory/sf-config master: Allow LECM to renew certificates  https://softwarefactory-project.io/r/13872 03:21
*** rcarrillocruz has joined #softwarefactory04:16
*** nilashishc has joined #softwarefactory04:20
*** rcarrillocruz has quit IRC04:21
*** rcarrillocruz has joined #softwarefactory04:25
*** nilashishc has quit IRC04:29
*** nilashishc has joined #softwarefactory04:35
*** jangutter has joined #softwarefactory07:29
*** jangutter_ has joined #softwarefactory07:32
*** jangutter_ has quit IRC07:32
*** jangutter_ has joined #softwarefactory07:33
*** nilashishc has quit IRC07:34
*** jangutter has quit IRC07:36
*** nilashishc has joined #softwarefactory07:36
*** chandankumar has joined #softwarefactory07:40
*** jpena|off is now known as jpena07:44
*** zoli is now known as zoli|afk-fbcI08:15
*** zoli|afk-fbcI is now known as zoli08:15
spredzytristanC: o/08:26
spredzyIf you have write permission, would you mind approving https://github.com/ansible/zuul-config/pull/30 ?08:26
tristanCspredzy: sure, done08:34
spredzythanks :)08:34
spredzytristanC: https://ansible.softwarefactory-project.io/zuul/status.html 08:39
spredzyI got an internal server error08:40
tristanChum, let me check08:44
tristanCarg, Out of memory: Kill process 4440 (zuul-scheduler) score 818 or sacrifice child08:47
spredzy:/08:51
*** chandankumar has quit IRC08:51
spredzyI don't have the Internal Server Error anymore, just Tenant ansible isn't ready08:52
spredzyI suppose I need to wait a lil08:52
tristanCspredzy: yes, scheduler is restarting08:52
spredzyNow it's back08:52
spredzy\o/08:52
spredzytristanC: Question regarding https://github.com/ansible/awx/pull/2309 09:00
spredzyI made it Depends-On: https://github.com/ansible/zuul-jobs/pull/16 09:00
spredzyBut apparently the change hasn't been picked up09:00
spredzyWould you know what I am doing wrong ?09:00
tristanCspredzy: according to status, it seems like the changes are stacked up correctly09:02
spredzyBut the jobs run are not the ones from the change09:03
spredzyie. tox-api-lint vs. tox-awx-api-lint09:03
spredzyThe content of the #16 change doesn't seem to be applied09:03
tristanCspredzy: indeed, not sure what's going on09:06
spredzytristanC: I can merge it, see if it still applies and then revert the change.09:09
spredzy(If needed)09:09
spredzytristanC: Can it be possible that the git repo for zuul-jobs is somewhat out-of-sync ?09:10
spredzyOn friday I made a mistake and merged a commit manually on zuul-jobs rather than tag it 'mergeit'09:10
spredzyNot sure if it can be related09:10
tristanCspredzy: it shouldn't, zuul should be force pull09:11
spredzyack. So no idea about why it isn't applying the change09:11
spredzyI'll merge zuul-jobs PR and see if it does what's expected09:12
spredzytristanC: so yes, I really have the feeling that the latest ansible/zuul-jobs is not taken into consideration09:18
spredzytristanC: https://ansible.softwarefactory-project.io/zuul/status.html 09:18
tristanCspredzy: here are the logs: https://ansible.softwarefactory-project.io/paste/show/F4CARz8bc8hatZiIWaI1/ 09:18
tristanCspredzy: i think there is an issue with zuul-jobs branch, there should only be a master branch09:19
tristanCspredzy: as you can see from that paste, the project-template is applied from 3 different branches, master, awx_template and clean_static_branch09:20
tristanCspredzy: you shouldn't create and merge such branch, it's confusing zuul09:20
spredzyLet me remove them from Github, and let the team know to work on their fork and not directly push the branches in ansible/zuul-jobs09:21
tristanCspredzy: we could also setup branch protection, as documented here: https://ansible.softwarefactory-project.io/docs/user/zuul_user.html#configure-branch-protection 09:23
tristanCthen in zuul, we can make it exclude unprotected branch09:23
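A rough sketch of the tenant-level setting tristanC describes, assuming Zuul's `exclude-unprotected-branches` tenant option and a GitHub connection named `github.com`. The connection name and project list are assumptions, not taken from the actual deployment:

```yaml
# Hypothetical tenant definition; only the exclude-unprotected-branches
# line reflects the idea discussed above.
- tenant:
    name: ansible
    # Ignore configuration found on branches without GitHub branch protection.
    exclude-unprotected-branches: true
    source:
      github.com:              # assumed connection name
        untrusted-projects:    # assumed project classification
          - ansible/zuul-jobs
```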
spredzyYep, I agree with this proposal09:25
spredzytristanC: ansible/zuul-jobs has only master now, but the configuration is still not caught up09:29
spredzyZuul is applying old config to pipeline09:30
*** zoli is now known as zoli|lunch09:30
*** zoli|lunch is now known as zoli09:30
tristanCspredzy: hum, it seems like zuul still has the branches... let me reload it09:31
spredzy1ok09:33
spredzytristanC: looks better :)09:35
*** sshnaidm is now known as sshnaidm|lnch09:40
gundalowtristanC: spredzy do we need to enable `Require pull request reviews before merging`:09:42
gundalow> When enabled, all commits must be made to a non-protected branch and submitted via a pull request with the required number of approving reviews and no changes requested before it can be merged into a branch that matches this rule.09:42
tristanCgundalow: it's up to you, we just removed the requirements in zuul gate pipeline09:43
spredzytristanC: do you see any issue with those job definitions ? https://github.com/ansible/zuul-jobs/blob/master/zuul.d/jobs.yaml 09:46
spredzy2018-10-08 09:37:11.749693 | TASK [tox : Require tox_envlist variable]09:46
spredzy2018-10-08 09:37:11.791933 | static | skipping: Conditional result was False09:46
spredzyI see that in the log, not really understanding why09:46
tristanCspredzy: check ara-report ?09:46
spredzyHmm. OSError: [Errno 2] No such file or directory: '/home/zuul/src/github.com/ansible/awx'09:48
tristanCspredzy: the skip is fine, the task is actually a "fail" module09:49
spredzytristanC: Looking at the sequence here https://ansible.softwarefactory-project.io/logs/09/2309/e0c7a7becea574aefe32dd9964c9d033a0223751/check/tox-awx-api-lint/5593fd2/job-output.txt.gz#_2018-10-08_09_37_09_030982 09:51
spredzyI don't see tox (not in ensure mode) being triggered09:51
spredzyBecause the fail module failed in chdir'ing09:52
tristanCspredzy: https://ansible.softwarefactory-project.io/logs/09/2309/e0c7a7becea574aefe32dd9964c9d033a0223751/check/tox-awx-api-lint/5593fd2/ara-report/file/8fde4eae-ccef-487e-8c6a-2aabfef3832c/#line-1 09:52
spredzyOSError: [Errno 2] No such file or directory: '/home/zuul/src/github.com/ansible/awx'09:52
tristanCspredzy: yes, the directory doesn't exist because you remove it in https://github.com/ansible/zuul-jobs/blob/master/playbooks/clean-static-node.yaml#L10 09:52
tristanCperhaps we should clean before the prepare-workspace here: https://github.com/ansible/zuul-config/blob/master/playbooks/base-minimal/pre.yaml#L13 09:54
spredzyOops sorry I haven't seen that (paid closer attention)09:54
*** sshnaidm|lnch is now known as sshnaidm09:59
spredzytristanC: base-minimal only applies to runc-fedora, right?10:00
spredzyhttps://github.com/ansible/zuul-config/blob/master/zuul.d/jobs.yaml#L43-L46 10:00
tristanCspredzy: nodeset can be changed by child jobs10:04
tristanCspredzy: the cleaning should check for nodepool static label10:05
*** jpena is now known as jpena|off10:07
spredzyI see10:10
spredzyJust trying something real quick and if it doesn't work will go down the path you just suggested10:10
*** nilashishc has quit IRC10:14
*** nilashishc has joined #softwarefactory10:15
spredzytristanC: https://ansible.softwarefactory-project.io/zuul/status.html job has been stuck for 15min (linters + awx), anything currently happening ?10:19
tristanCspredzy: i think it's because all executors are at capacity, i had to re-enqueue the job lost after the oom10:28
tristanCspredzy: tomorrow we'll spin more executors10:28
tristanChttps://softwarefactory-project.io/grafana/d/000000001/zuul-status?panelId=66&fullscreen&orgId=1 10:28
spredzyOh ok nice. I always forget about those dashboards. Really need to get used to checking them out10:30
spredzytristanC: the change I submitted seems to have been enough for the use-case. All green11:09
spredzyThanks a lot for this morning's assistance. Really appreciated. Merci :)11:09
tristanCspredzy: you're welcome, anytime :)11:14
*** jangutter_ has quit IRC12:27
*** jangutter has joined #softwarefactory12:27
pabelangertristanC: re: OOM how much does zuul-scheduler server have? Are we running a swap there also?14:39
pabelangerI _think_ that is shared with other services, maybe with recent quota bump we can move to dedicated server also14:39
spredzypabelanger: hey. I have a question wrt to zuul15:06
spredzyFor some reason it doesn't seem to pull the proper PR15:06
spredzyhttps://github.com/ansible/awx/pull/2266 15:06
spredzywhen I do a `cat .git/HEAD`15:06
spredzyI get refs/heads/devel rather than the proper PR content15:07
spredzyWould you have a pointer on where I should start looking ?15:07
spredzyproper PR reference*15:07
pabelangerspredzy: where are you doing cat .git/HEAD ?15:08
spredzy chdir: "{{ ansible_user_dir }}/{{ zuul.project.src_dir }}"15:09
pabelangerspredzy: sorry, do you have log from zuul showing that15:10
spredzypabelanger: So that is the PR from zuul-jobs https://github.com/ansible/zuul-jobs/pull/22/files 15:11
spredzypabelanger: here are the logs https://ansible.softwarefactory-project.io/logs/66/2266/24e2a9ace08d5e5fa05f5f6d95df40cf6ab087d7/check/tox-awx-api/c9d076f/ 15:12
spredzyAWX - Depends-On: zuul-jobs PR that displays cat .git/HEAD15:13
spredzytristanC | spredzy: it shouldn't, zuul should be force pull15:15
spredzypabelanger: tristanC said that this morning, so I am not very sure about whats going on15:16
pabelangerforce pull for what, the github PR?15:16
spredzyWe were talking about zuul-jobs, but I was supposing this also applies for Github PR15:17
pabelangerspredzy: okay, can you state the issue again. Got lost in the weeds trying to understand what the awx jobs are doing. What is the issue you are seeing in zuul15:18
*** nilashishc has quit IRC15:20
spredzySo in awx, we have a static-node for our nodeset. So, the issue is that when zuul catches the recheck event, the content of the PR is not pulled15:20
spredzyit references refs/heads/devel when it should reference the proper ref, something like refs/pull/226615:20
pabelangerspredzy: how is the static node content cleaned up between job runs?15:21
pabelangereg: how do we know if the job before is not affecting the current15:21
spredzyCurrently, the folder "{{ ansible_user_dir }}/{{ zuul.project.src_dir }}" is not removed15:21
spredzyBecause I thought zuul will git pull force the PR15:21
pabelangerno15:21
spredzybut I might be missing it15:21
pabelangerwell15:21
pabelangerwhen a zuul-executor starts a job, one of the first tasks it does is prepare-workspace which uses synchronize to push the git content to the remote node15:22
spredzyhttps://github.com/openstack-infra/zuul-jobs/blob/master/roles/prepare-workspace/tasks/main.yaml 15:22
spredzyYep15:23
pabelangerin our case, we are using ephemeral nodes, so we know there is nothing on the far side.15:23
pabelangermy question here is, how do we know there is nothing on the far side with a static node?15:23
pabelangeris it possible prepare-workspace is having an issue15:23
pabelangerand not getting the right git content on to the node15:23
spredzyWe need to think that "{{ ansible_user_dir }}/{{ zuul.project.src_dir }}" will always be present in the static node case15:24
spredzyjust not on the right refs15:24
pabelangerwell, prepare-workspace will do that15:25
pabelangerhowever, it mostly has only been used with nodes from nodepool, so maybe we are hitting a bug15:25
pabelangerbut, you could also ensure that directory is first absent when a job starts / ends15:26
pabelangerThe other question I had, why does it need to be a static node? Why not use a VM from nodepool which we know is clean from start15:26
spredzyyep, that's what tristanC suggested this morning but I thought I had found a way around it but it doesn't seem like it15:26
spredzypabelanger: I don't have the knowledge yet to answer this question. But will be able to answer by end of week15:27
pabelangerspredzy: I talked a little with matburt at ansiblefest last week and offered to help a little. I still think defaulting to a vm first from nodepool, then run your containers on that is a great first step. It sounds like some of the things you are running into are either edge cases to static nodes or containers, which is fine. But remember you are likely one of the first people to try this, which is15:30
pabelangergoing to slow down development on awx.15:30
spredzyI realize that, I'll try to grab more knowledge this week on why things are done this way and see what's preventing us from going to "fresh" env for each job15:49
*** sshnaidm is now known as sshnaidm|afk15:51
spredzypabelanger: do you know how I can access hosts.static.nodepool.label from https://ansible.softwarefactory-project.io/logs/25/2125/e7e6ccb181898dea46d9ec5a67f61087fcef7f2b/check/tox-awx-api/3b205a7/zuul-info/inventory.yaml ?16:01
pabelangerspredzy: access how?16:01
spredzyfrom within an ansible task16:01
spredzyie do this when: hosts.static.nodepool.label == 'static-ansible'16:02
pabelangerif you are using the inventory, you can just reference static16:02
pabelanger- hosts: static16:02
pabelanger  tasks:16:02
pabelangeretc16:02
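One way this could look, assuming the `nodepool` dictionary shown in inventory.yaml is available as a host variable on each node, is a cleanup task guarded by the label. The label value and path come from the discussion above; the play layout is only a sketch:

```yaml
- hosts: all
  tasks:
    - name: Remove stale project checkout on static nodes only
      file:
        path: "{{ ansible_user_dir }}/{{ zuul.project.src_dir }}"
        state: absent
      # 'nodepool' is assumed to be set per host by the Zuul inventory.
      when: nodepool is defined and nodepool.label == 'static-ansible'
```

As noted earlier in the log, a task like this would need to run before prepare-workspace, otherwise it deletes the checkout the job needs.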
spredzyit's from all, here is what I am trying to do https://github.com/ansible/zuul-config/pull/32 16:03
spredzylet me know if you think this isn't the proper way of addressing the issue we talked about earlier16:04
spredzys/state/file16:04
pabelangerleft comment16:10
spredzythanks /me reads16:11
*** zoli is now known as zoli|gone16:34
*** zoli|gone is now known as zoli16:34
spredzypabelanger: Thanks, I'll work on that tomorrow. Will have to go soon.16:36
spredzyDo you have access to our zuul system?16:36
pabelangerspredzy: I don't have SSH access anymore, I did when I was on SF team16:37
spredzyack16:37
pabelangerbut, I've been pushing on changes needed for ansible-network tenant16:37
spredzyActually I didn't ask, but when did your move happen?16:47
*** sshnaidm|afk is now known as sshnaidm16:48
pabelanger3 weeks ago16:48
spredzySo we moved in Ansible ~ the same time16:50
spredzySept. 17 for me16:50
*** sshnaidm is now known as sshnaidm|afk17:32
spredzypabelanger: re: We should can create a new job, say awx-base, as untrusted, that is zuul-jobs. have it parent to base (which is also untrusted). Then, in the post-run logic, after log collections, we can add logic to clean up static nodes. In zuul, we've also discussed the idea of a clean up handler, but don't think it is landed.17:46
spredzyCurrently all jobs are defined here https://github.com/ansible/zuul-jobs/blob/master/zuul.d/jobs.yaml 17:47
spredzythey all inherit from tox (from openstack-infra/zuul-jobs)17:47
spredzyIf I created an awx-base as suggested, copy/paste base-minimal in ansible/zuul-jobs, how would I make tox inherit from it?17:48
spredzyor simply redefining - job: name: tox parent: awx-base would be enough?17:49
*** nilashishc has joined #softwarefactory18:04
pabelangerspredzy: yah, we'd need to create a new job called awx-tox here since today we cannot modify tox to inherit from another parent18:09
pabelangerlong term, we've talked about this use case, and should be able to do it once we have per project jobs18:09
pabelangerright now tox job can only be defined once18:09
spredzySo I should copy-paste tox from openstack-infra/zuul-jobs18:11
spredzyname it: awx-tox18:12
spredzyand have it inherit awx-base ?18:12
spredzypabelanger: is my understanding correct ?18:12
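The layout being asked about could be sketched roughly as below. Job names come from the discussion; the playbook paths are hypothetical, and the run playbook would have to exist in the repository that defines the job:

```yaml
- job:
    name: awx-base
    parent: base
    # Hypothetical playbook holding the static-node cleanup.
    post-run: playbooks/awx-base/post.yaml

- job:
    name: awx-tox
    parent: awx-base
    # Hypothetical copy of the tox run playbook.
    run: playbooks/awx-tox/run.yaml
```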
pabelangerspredzy: let me think about it, because there is also unittests which tox parent too18:13
*** nilashishc has quit IRC18:13
spredzyCurrent PR looks like https://github.com/ansible/zuul-jobs/pull/22/files 18:13
pabelangerspredzy: this is why I said VMs will be much easier :) Now we need to redesign jobs18:13
spredzyYep I understand, and agree. But just dealing with what I have to deal with atm.18:14
spredzyBe sure I'll push toward this way - this is extra-headache we could save ourselves from18:14
pabelangerspredzy: okay, so for now, you are just using tox jobs right?18:15
pabelangerlet me work on etherpad for flow and will share for tomorrow18:15
spredzyYes, just tox jobs18:16
pabelangerk18:16
spredzypabelanger: currently it fails with "Unable to freeze job graph: Job tox-awx-api-lint does not specify a run playbook"18:28
spredzyif I try to use tox/parent or copy over without the dependencies18:28
spredzyjust so you know18:28
pabelangerspredzy: can you link PR?18:28
spredzyPR is https://github.com/ansible/awx/pull/2266 18:29
spredzybut it depends on https://github.com/ansible/zuul-jobs/pull/22 18:29
spredzyThis is the one where I am trying all the things18:29
pabelangerspredzy: okay, will dig more in a bit. As a test to your clone issue, if you used the label ansible-fedora-28-4vcpu from nodepool, is the git state correct?18:31
pabelangeryou could also use ansible-fedora-28-1vcpu18:31
spredzylemme try it and let you know18:32
spredzypabelanger: do we have access to it ?18:35
spredzyThe nodeset "ansible-fedora-28-1vcpu" was not found.18:36
pabelangerspredzy: yah, we should let your tenant load nodesets from ansible-network18:37
pabelangerbut you can just add: https://github.com/ansible-network/ansible-zuul-jobs/blob/master/zuul.d/nodesets.yaml#L25 18:37
pabelangerfor now18:37
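A nodeset definition of the kind being pointed to might look like the sketch below. The nodeset name and label come from the discussion; the node name is an assumption, and the linked file remains the authoritative version:

```yaml
- nodeset:
    name: ansible-fedora-28-1vcpu
    nodes:
      - name: fedora-28              # assumed node name
        label: ansible-fedora-28-1vcpu
```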
pabelangerspredzy: that will either run in limestone or vexxhost18:38
pabelangerworking to bring online a 3rd provider18:38
* spredzy googles limestone18:39
spredzywho's the 3rd name?18:39
pabelangerhttps://www.limestonenetworks.com/ 18:40
pabelangerspredzy: some capacity we are testing thanks to logan-18:41
spredzyCool18:42
spredzyWhich zone are we using ?18:43
spredzypabelanger: https://ansible.softwarefactory-project.io/zuul/status.html 18:43
pabelangerspredzy: both18:43
spredzyThey end up in error18:43
spredzyhere https://ansible.softwarefactory-project.io/zuul/builds.html 18:44
pabelangerspredzy: for limestone?18:44
pabelangerwill need to check launcher18:44
spredzyah no, wait18:44
spredzytype18:44
spredzytypo18:45
spredzypabelanger: seems ok with the Fedora node18:50
spredzyie. proper file content is in there. Let me check the git commit18:51
pabelangerspredzy: k, that is good news18:51
pabelangerthen we know something is wrong with static18:51
pabelangerand not zuul18:51
spredzyso the issue is: with a static-node kind, how to make sure "{{ ansible_user_dir }}/{{ zuul.project.src_dir }}" is up to date and/or cleaned18:52
spredzypabelanger: yes def. proper git sha1 on the fedora nodes18:54
* spredzy goes dinner18:56
spredzylet me know if you come up with any idea Paul18:56
pabelangerspredzy: yah, lets chat more tomorrow, maybe with matburt too19:10
matburtI'm definitely happy to do that19:11
