Saturday, 2020-05-02

*** dzho has joined #opendev01:17
*** escarluka has joined #opendev01:33
*** escarluka has quit IRC04:28
openstackgerritMerged zuul/zuul-jobs master: Only run tests for ensure-bazel when it is updated  https://review.opendev.org/72493305:48
*** redrobot has quit IRC05:48
*** hiep_mq has joined #opendev05:54
*** hiep_mq has quit IRC06:10
*** DSpider has joined #opendev06:26
*** rchurch has quit IRC06:44
*** rchurch has joined #opendev06:47
*** sgw has quit IRC07:22
openstackgerritAndreas Jaeger proposed openstack/project-config master: Fix irc-meetings publishing  https://review.opendev.org/72496407:25
openstackgerritAlbin Vass proposed zuul/zuul-jobs master: ansible-lint: use matchplay instead of matchtask  https://review.opendev.org/72491008:36
openstackgerritAlbin Vass proposed zuul/zuul-jobs master: ansible-lint: use matchplay instead of matchtask  https://review.opendev.org/72491008:43
openstackgerritAlbin Vass proposed zuul/zuul-jobs master: ansible-lint: use matchplay instead of matchtask  https://review.opendev.org/72491008:48
*** tosky has joined #opendev08:48
openstackgerritAlbin Vass proposed zuul/zuul-jobs master: ansible-lint: use matchplay instead of matchtask  https://review.opendev.org/72491008:54
openstackgerritAlbin Vass proposed zuul/zuul-jobs master: ansible-lint: use matchplay instead of matchtask  https://review.opendev.org/72491008:55
openstackgerritAlbin Vass proposed zuul/zuul-jobs master: ansible-lint: use matchplay instead of matchtask  https://review.opendev.org/72491009:17
openstackgerritAlbin Vass proposed zuul/zuul-jobs master: Check blocks recursively for loops  https://review.opendev.org/72496709:30
openstackgerritAlbin Vass proposed zuul/zuul-jobs master: Check blocks recursively for loops  https://review.opendev.org/72496709:47
*** iurygregory has quit IRC09:56
openstackgerritAlbin Vass proposed zuul/zuul-jobs master: Check blocks recursively for loops  https://review.opendev.org/72496712:06
openstackgerritAlbin Vass proposed zuul/zuul-jobs master: Check blocks recursively for loops  https://review.opendev.org/72496712:14
openstackgerritAlbin Vass proposed zuul/zuul-jobs master: Check blocks recursively for loops  https://review.opendev.org/72496712:19
AJaegerinfra-root, could you check https://review.opendev.org/724964 - I noticed the irc-meetings job is failing every hour...12:30
openstackgerritAlbin Vass proposed zuul/zuul-jobs master: Check blocks recursively for loops  https://review.opendev.org/72496713:01
openstackgerritMerged openstack/project-config master: Fix irc-meetings publishing  https://review.opendev.org/72496413:21
openstackgerritAlbin Vass proposed zuul/zuul-jobs master: Check blocks recursively for loops  https://review.opendev.org/72496713:35
openstackgerritAlbin Vass proposed zuul/zuul-jobs master: Check blocks recursively for loops  https://review.opendev.org/72496713:40
openstackgerritAlbin Vass proposed zuul/zuul-jobs master: Check blocks recursively for loops  https://review.opendev.org/72496713:48
openstackgerritAlbin Vass proposed zuul/zuul-jobs master: use zj_image instead of image as loopvar  https://review.opendev.org/72501213:48
openstackgerritAlbin Vass proposed zuul/zuul-jobs master: use zj_log_file instead of item as loop_var  https://review.opendev.org/72501313:48
openstackgerritAlbin Vass proposed zuul/zuul-jobs master: Check blocks recursively for loops  https://review.opendev.org/72496713:53
openstackgerritAlbin Vass proposed zuul/zuul-jobs master: Update ansible-lint-rules testsuite to only test with the relevant rule  https://review.opendev.org/72501413:58
openstackgerritAlbin Vass proposed zuul/zuul-jobs master: Update ansible-lint-rules testsuite to only test with the relevant rule  https://review.opendev.org/72501414:07
openstackgerritAlbin Vass proposed zuul/zuul-jobs master: Update ansible-lint-rules testsuite to only test with the relevant rule  https://review.opendev.org/72501414:13
AJaegerinfra-root, publishing irc-meetings fails now in the next step - see https://zuul.opendev.org/t/openstack/build/62ecffee3a734a77b6a7c03d11f0b0e514:35
AJaegeroh, found the typo...14:36
fungirsync: mkdir "/src/yaml2ical/calendars"14:36
fungihrm14:36
AJaegersrc -> srv14:37
openstackgerritAndreas Jaeger proposed openstack/project-config master: Fix typo in playbooks/yaml2ical/post.yaml  https://review.opendev.org/72501514:37
AJaegerhere's the fix ^14:37
fungigood eye!14:38
AJaegerinfra-root, infra-prod-run-accessbot fails as well - nothing visible from job output. See https://zuul.opendev.org/t/openstack/build/ba0b9eeb95074961b4832ad018c75a3414:38
fungii'll check /var/log/ansible/run-accessbot.yaml.log  on bridge.o.o14:39
fungithe log wasn't much help, but i see there's something like 10 hanging ansible runs of that playbook, some nearly a week old14:41
fungiroot     15871  0.0  0.5 238828 47760 ?        S    Apr26   0:03 /usr/bin/python3 /usr/local/bin/ansible-playbook -v -f 5 /home/zuul/src/opendev.org/opendev/system-config/playbooks/run-accessbot.yaml14:41
fungithat has a child process which is doing an ssh to 2001:4800:7818:104:be76:4eff:fe04:488714:43
fungireverse dns says it's14:43
fungieavesdrop01.openstack.org14:43
fungilooking through the process list on master, we've also got some stuck manage-projects runs from weeks ago14:45
fungimordred: is there a safe way to time out that sort of stuff?14:46
fungii wonder if intermittent network issues are causing remote ansible calls to get stuck indefinitely and pile up14:46
fungiall of these i'm finding, whether for accessbot or manage-projects, have hung doing this over ssh connections: /bin/sh -c 'python3 && sleep 0'14:48
fungiand they're setting -o ControlMaster=auto -o ControlPersist=60s14:50
fungiso maybe these are the persistent sockets never getting reaped?14:50
fungicould be this is just a red herring14:50
openstackgerritMerged openstack/project-config master: Fix typo in playbooks/yaml2ical/post.yaml  https://review.opendev.org/72501514:52
*** zbr has quit IRC15:05
*** zbr has joined #opendev15:05
AJaegersuccess - publish-irc-meetings works - https://zuul.opendev.org/t/openstack/build/9ec8f7d1bbec4980ba8402379b920a4a and eavesdrop has current timestop15:14
openstackgerritAlbin Vass proposed zuul/zuul-jobs master: dhall-diff: add new job  https://review.opendev.org/71869415:18
fungii've hup'd the older hung ssh processes on bridge.o.o associated with manage-projects and accessbot runs for the past few weeks15:19
AJaegerthx, fungi15:23
fungiwell, i doubt that's going to solve whatever's going on with the accessbot runs15:25
fungiso i'm going to see if i can find a record of what's going on with it on eavesdrop now15:25
fungithe last thing it logged in /var/log/accessbot/ on eavesdrop01.openstack.org was 2020-04-02 22:19:0215:27
fungibut syslog says ansible is running it regularly:15:29
fungiMay  2 15:18:54 eavesdrop01 python3[8448]: ansible-command Invoked with _uses_shell=False removes=None strip_empty_ends=True chdir=None creates=None warn=True stdin_add_newline=True executable=None stdin=None _raw_params=/usr/local/bin/accessbot argv=None15:29
fungi/usr/local/bin/accessbot is a wrapper script which does: exec docker run --rm --net=host -v/etc/accessbot:/etc/accessbot -v/var/log/accessbot:/var/log/accessbot docker.io/opendevorg/accessbot15:31
*** elod has quit IRC15:34
fungithis could be related though...?15:35
fungiMay  2 15:18:19 eavesdrop01 cc3116109c7b[6092]: /bin/sh: 1: Syntax error: Unterminated quoted string15:35
*** elod has joined #opendev15:35
fungilooks like maybe that's getting syslogged from inside a container15:35
AJaegeris that happening regularly?15:42
fungieach time it's invoked by ansible, i think15:42
fungiso probably something's not right inside that container15:42
fungii'm going to try running it from a command-line on eavesdrop and see if it gives me anything useful that's getting swallowed by indirection15:43
AJaegerI manually inspected docker/accessbot/accessbot.py - that looks fine15:43
AJaegerfungi: good idea15:43
fungii did it from a root shell without the exec15:44
fungiyep, that does indeed seem to be where the error's coming from:15:44
fungiroot@eavesdrop01:~# docker run --rm --net=host -v/etc/accessbot:/etc/accessbot -v/var/log/accessbot:/var/log/accessbot docker.io/opendevorg/accessbot15:44
fungi/bin/sh: 1: Syntax error: Unterminated quoted string15:45
AJaegerfungi: I have an idea - one moment15:47
openstackgerritAndreas Jaeger proposed opendev/system-config master: Remove incomplete args from accessbot/Dockerfile  https://review.opendev.org/72502115:49
AJaegerfungi, is that the culprit ? ^15:49
AJaegercount the " chars...15:50
fungihttps://opendev.org/opendev/system-config/src/branch/master/docker/accessbot/Dockerfile#L21 has an extra ]15:51
fungioh, yeah, and a bare "15:52
fungiAJaeger: i wonder if that was meant to append to /var/log/accessbot/accessbot.log and was merely unfinished15:52
fungimordred: that ^ appears to have been introduced by a change of yours, so maybe you know?15:53
openstackgerritAndreas Jaeger proposed opendev/system-config master: Remove incomplete args from accessbot/Dockerfile  https://review.opendev.org/72502115:57
AJaegerfungi:  /var/log/accessbot/accessbot.log would be inside the container, so question is how mordred envisoned the logging.15:57
AJaegermy change above should fix it - not sure about the logging. Something to iterate on ;)15:58
fungioh, yep, it's running this which already redirects: https://opendev.org/opendev/system-config/src/branch/master/docker/accessbot/accessbot.sh#L1715:58
mordredfungi, AJaeger - yeah - we should be mounting /var/log/accessbot as well as redirecting to it15:59
AJaegerthe dockerfile calls docker/accessbot/accessbot.sh and that does the redirection, doesn't it?16:00
AJaegerI think there's duplication between accessbot.sh and the Dockerfile16:00
mordredyeah - but I thnik that error about unterminated string is bad16:01
mordredand also that16:01
AJaegeraccessbot.sh has : "exec python /usr/local/bin/accessbot.py -c /etc/accessbot/accessbot.config -l /etc/accessbot/channels.yaml >> /var/log/accessbot/accessbot.log 2>&1"16:01
mordredwhy don't we just make the Dockerfile run the script16:01
AJaegermordred, yeah - on it...16:01
mordredI think that was likely the idea originally16:01
mordredand then - wow16:01
openstackgerritAndreas Jaeger proposed opendev/system-config master: Fix accessbot/Dockerfile  https://review.opendev.org/72502116:02
AJaegerthis way ^16:02
mordredAJaeger: yes. - that16:03
fungii arrived at the same conclusion, +216:03
AJaegerwill either of you +A now that you both gave +2, please?16:04
fungijust did16:04
AJaegerthanks16:04
openstackgerritAlbin Vass proposed zuul/zuul-jobs master: tox: Add regex for matching ansible-lint output  https://review.opendev.org/72502716:04
fungithanks for fixing!16:05
AJaegermordred: infra-prod-remote-puppet-else is failing occasionally - and outputs quite a few ansible warnings. https://zuul.opendev.org/t/openstack/build/5ce68b5b2b2442f3a0ff67770c1c8a7e16:05
AJaegerno idea on that one...16:05
openstackgerritAlbin Vass proposed zuul/zuul-jobs master: tox: Add regex for matching ansible-lint output  https://review.opendev.org/72502716:06
openstackgerritAlbin Vass proposed zuul/zuul-jobs master: DNM: Test ansible-lint commenting  https://review.opendev.org/72502816:06
openstackgerritAlbin Vass proposed zuul/zuul-jobs master: tox: Add regex for matching ansible-lint output  https://review.opendev.org/72502716:12
openstackgerritAlbin Vass proposed zuul/zuul-jobs master: DNM: Test ansible-lint commenting  https://review.opendev.org/72502816:12
openstackgerritAlbin Vass proposed zuul/zuul-jobs master: tox: update pep8 regex to not require column  https://review.opendev.org/72503016:29
openstackgerritAlbin Vass proposed zuul/zuul-jobs master: DNM: Test ansible-lint commenting  https://review.opendev.org/72502816:29
openstackgerritMerged opendev/system-config master: Fix accessbot/Dockerfile  https://review.opendev.org/72502117:09
AJaeger\o/ infra-prod-service-eavesdrop passed as deploy job after the merge17:17
fungithanks again for fixing, AJaeger!!!17:19
AJaegerthat was colaboration - thanks fungi and mordred as well17:31
openstackgerritAlbin Vass proposed zuul/zuul-jobs master: tox: update pep8 regex to not require column  https://review.opendev.org/72503017:34
openstackgerritAlbin Vass proposed zuul/zuul-jobs master: DNM: Test ansible-lint commenting  https://review.opendev.org/72502817:34
*** iurygregory has joined #opendev18:49
AJaegereavesdrop - but we've been looking at accessbot ;(18:51
AJaegerArgh ;(18:51
AJaegerhttps://zuul.opendev.org/t/openstack/build/eb449333e8ba470fb5f646ba98f880d7 failed for accessbot. any ideas why this time?18:52
AJaegermordred: "ansible-playbook -v -f 5 /home/zuul/src/opendev.org/opendev/system-config/playbooks/run-accessbot.yaml >> /var/log/ansible/run-accessbot.yaml.log18:53
AJaegerand the file accessbot.sh redirects as well - that's again duplications, isn't it?18:54
AJaegermordred: but you mentioned mounting the log directory inside - could you look and fix on Monday. This can wait ;)18:55
fungii'll take another peek and see if there's better logged errors this time18:57
fungi"TypeError: unsupported operand type(s) for +: 'dict_items' and 'dict_items'" maybe a python3 incompatibility? http://paste.openstack.org/show/79300219:01
fungiprobably need to recast them as list() before concatenating19:02
fungii'm in the midst of weekend yardwork, but can try to whip up a patch and check for any other py3k problems later today if nobody beats me to it19:02
openstackgerritAndreas Jaeger proposed opendev/system-config master: Fix py3 problem in accessbot.py  https://review.opendev.org/72503620:12
AJaegerhere's the patch - at least that function works now ^20:12
AJaegerthanks, fungi.20:12
fungibigger thanks for pushing a patch!20:14
AJaeger;)20:15
AJaegeruntil the next one ...20:15
AJaegerI still think the log files needs reviewing but leave that to others to check..20:17
fungiwell, the logs are on the server, and the script crashes on the first exception, so without running it on each successive fix the next bug won't become apparent20:20
fungiideally we'd have some means of exercising it in a test, but i suspect that would depend on setting up ircseven and charybdis in an integration test20:20
fungiunless someone puts together some good mocks20:21
openstackgerritMerged opendev/system-config master: Fix py3 problem in accessbot.py  https://review.opendev.org/72503621:18
*** DSpider has quit IRC22:07
*** tosky has quit IRC23:28

Generated by irclog2html.py 2.17.2 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!