Monday, 2013-11-18

*** jamesmcarthur has quit IRC00:00
*** matsuhashi has joined #openstack-infra00:02
*** thomasem has quit IRC00:06
*** michchap has quit IRC00:06
*** michchap has joined #openstack-infra00:06
lifelessclarkb: if around - https://review.openstack.org/#/c/54152/ needs second +2.00:06
lifelessmordred: if you have suggestions on how I could have communicated what I wanted better ^ I'd appreciate them - I was asking for the same stuff for the last 3 iterations or so00:07
lifeless:)00:07
clarkblifeless: any idea why that valueerror exception might be triggered? we already ignore blank lines and comments00:09
lifelessclarkb: urls00:11
lifelessclarkb: requirements.txt can also include index urls, bizarrely enough00:11
clarkbo_O00:12
clarkbapproved00:12
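A minimal sketch of the parsing issue discussed above, assuming a hypothetical helper named requirement_names: besides blank lines and comments, a requirements.txt file can carry option lines (such as an index URL) and bare URLs, none of which parse as a "name plus version specifier". This is not the code from the change under review, only an illustration of skipping those lines instead of raising ValueError.

```python
import re

# Matches the leading project name of a plain requirement line.
_NAME_RE = re.compile(r"[A-Za-z0-9][A-Za-z0-9._-]*")


def requirement_names(lines):
    """Yield the project name from each plain requirement line.

    Hypothetical helper: blank lines, comments, option lines
    (anything starting with '-', e.g. '-i <url>' or '--index-url <url>')
    and bare URLs are skipped rather than parsed.
    """
    for raw in lines:
        line = raw.strip()
        if not line or line.startswith("#"):
            continue  # blank line or comment
        if line.startswith("-") or "://" in line:
            continue  # pip option or URL, not a named requirement
        match = _NAME_RE.match(line)
        if match:
            yield match.group(0).lower()
```

For example, passing the lines of a file that mixes an index URL with ordinary requirements yields only the ordinary project names.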
*** slong has quit IRC00:13
*** matsuhashi has quit IRC00:14
*** matsuhashi has joined #openstack-infra00:15
*** matsuhashi has quit IRC00:15
*** matsuhashi has joined #openstack-infra00:15
*** wenlock has quit IRC00:16
*** herndon has quit IRC00:22
*** matsuhashi has quit IRC00:29
*** matsuhashi has joined #openstack-infra00:29
*** dcramer_ has quit IRC00:30
*** sarob has joined #openstack-infra00:32
openstackgerritKhai Do proposed a change to openstack-infra/config: proposal to allow additional people to release jjb  https://review.openstack.org/5682300:32
*** matsuhashi has quit IRC00:34
zaroclarkb: if you have time I would like this to merge.. https://review.openstack.org/#/c/5671500:34
*** senk has joined #openstack-infra00:36
*** boris-42 has joined #openstack-infra01:03
*** markwash has joined #openstack-infra01:05
*** talluri has quit IRC01:06
cyeohmikal: sorry about that - I don't know the guy personally. Please feel free to reply to him saying you don't respond to off-list emails. I'll try to chase up what is going on01:06
*** senk has quit IRC01:07
*** matsuhashi has joined #openstack-infra01:11
*** boris-42 has quit IRC01:17
*** nati_ueno has joined #openstack-infra01:20
*** ljjjustin has joined #openstack-infra01:22
*** sarob has quit IRC01:23
*** sarob has joined #openstack-infra01:23
*** nosnos has joined #openstack-infra01:24
*** matsuhashi has quit IRC01:24
*** matsuhashi has joined #openstack-infra01:25
*** sarob has quit IRC01:29
*** matsuhas_ has joined #openstack-infra01:29
*** amotoki has joined #openstack-infra01:32
*** matsuhashi has quit IRC01:32
*** xchu has joined #openstack-infra01:35
*** matsuhas_ has quit IRC01:37
*** matsuhashi has joined #openstack-infra01:38
*** bingbu has joined #openstack-infra01:40
*** ljjjustin has quit IRC01:43
*** ljjjustin has joined #openstack-infra01:44
*** matsuhas_ has joined #openstack-infra01:45
*** matsuhashi has quit IRC01:48
*** wenlock has joined #openstack-infra01:51
*** nati_ueno has quit IRC01:59
*** mriedem has joined #openstack-infra02:00
*** yaguang has joined #openstack-infra02:05
openstackgerritA change was merged to openstack-infra/reviewstats: Add python-solumclient subproject  https://review.openstack.org/5681002:06
jheskethclarkb: ping02:16
clarkbpong02:17
jheskethwould you mind taking a look at why I can't push changes to gerrit for turbo hipster please?02:18
jheskethclarkb: https://review.openstack.org/#/c/56849/02:18
jheskeththey aren't merging somehow02:18
clarkbugh I know what the problem is, zuul cloned the empty repo again so has a bad local cache02:19
clarkbjhesketh: if you promise not to push more patchsets for a bit I will fix it02:19
*** michchap has quit IRC02:19
jheskethdeal02:19
jheskeththanks clarkb02:19
*** yamahata has joined #openstack-infra02:21
clarkbI have to ninja fix it under zuul and don't want it interfering with me which it will do if new patchsets show up :)02:21
*** michchap has joined #openstack-infra02:22
jheskethwhat is broken?02:22
clarkbjhesketh: it is a race between the repo being populated in gerrit and zuul cloning from gerrit02:22
clarkbjhesketh: what happens is zuul clones an empty repo because the project is created in gerrit without any content02:22
clarkbthen we rerun the manage projects script which populates the repo and everything is good02:23
*** paul-- has quit IRC02:23
clarkbbut zuul is left in the old state02:23
clarkbjhesketh: ok try it now (recheck should work)02:24
jheskethah okay02:24
clarkbturbo-hipster jobs are queued I think that means I fixed it02:26
*** bingbu has quit IRC02:26
jheskethyep, saw them get queued02:27
jheskeththanks for that :-)02:27
clarkbnot the underlying problem though, we really need to decouple github stuff from populating gerrit in manage-projects02:27
clarkbnp02:27
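The race described above (a clone taken while the project was still empty, never refreshed after the repo was populated) could be detected along these lines. This is only a sketch under stated assumptions: the function names are hypothetical, the input is assumed to be the text printed by `git ls-remote` (one "<sha><TAB><ref>" pair per line), and nothing here reflects how zuul or the manage-projects script actually work.

```python
def parse_ls_remote(text):
    """Parse `git ls-remote` style output into a {ref: sha} dict."""
    refs = {}
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        sha, ref = line.split(None, 1)
        refs[ref] = sha
    return refs


def cache_is_stale(local_refs, remote_refs):
    """Return True when the local clone has no refs but the remote does.

    That is the state left behind when the clone was made before the
    project had any content.
    """
    return not local_refs and bool(remote_refs)
```

A check like this would only flag the bad cache; replacing it with a fresh clone would still be a separate step.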
*** cody-somerville has joined #openstack-infra02:28
*** sarob has joined #openstack-infra02:34
*** dcramer_ has joined #openstack-infra02:35
clarkbrussellb: wow how many changes did you just update?02:37
russellbclarkb: a couple02:38
russellb... dozen02:38
russellbi don't know02:38
clarkbrussellb: :)02:38
*** sarob has quit IRC02:39
russellb28 i guess02:39
*** bingbu has joined #openstack-infra02:43
clarkbrussellb: I had just noticed that a bunch of nova changes showed up and were using all of the slaves02:43
*** cody-somerville has quit IRC02:44
russellbmy bad :)02:46
russellbi don't feel that bad.02:47
* russellb feels powerful02:47
*** sjing has joined #openstack-infra02:51
sjingHello -infra02:53
sjingI just wanted to update the message I got from Claire about the slides for the talk presentations in HK summit02:53
sjingin case you have the same question02:53
sjingher reply is :02:54
sjingAll of the videos are now live on the website - http://www.openstack.org/summit/openstack-summit-hong-kong-2013/session-videos/.02:54
sjingHowever we only have presentation slides from the speakers who are responsive and willing to give them to us to post on the website.  Hopefully in the next week or two we will get more of the slides.02:54
*** dcramer_ has quit IRC02:57
amotokihi, something is wrong with a patch from Jenkins to update global requirements. https://review.openstack.org/#/c/56163/02:57
*** guohliu has joined #openstack-infra02:58
amotokiit tried to update the requirement file to old ones and gate-neutron-requirements always fails....02:58
clarkbamotoki: yes, there is a bug for that somewhere, basically the script was proposing changes made to stable requirements to master branches of projects02:58
clarkbwe have disabled that job02:59
*** paul-- has joined #openstack-infra02:59
*** bingbu has quit IRC02:59
amotokiclarkb: thanks. I -2'ed the patch.03:01
clarkbamotoki: fwiw jenkins won't let it merge03:01
clarkbamotoki: we should probably abandon all of those changes, fungi and I discussed it but haven't done that yet03:02
*** nati_ueno has joined #openstack-infra03:09
*** bingbu has joined #openstack-infra03:12
*** nati_ueno has quit IRC03:14
openstackgerritMichael Chapman proposed a change to openstack-infra/config: Add stackforge project: puppet_openstack_builder  https://review.openstack.org/5685403:15
*** talluri has joined #openstack-infra03:19
*** markwash has joined #openstack-infra03:23
lifelessclarkb: you're slacking off http://russellbryant.net/openstack-stats/all-reviewers-30.txt03:24
lifeless:)03:24
clarkblifeless: ya, I have been distracted by too many other things03:26
pleia2clarkb: but jeblair comes back tomorrow! :D03:26
*** markwash has quit IRC03:28
*** mriedem has quit IRC03:28
*** wenlock has quit IRC03:42
*** chandankumar has quit IRC03:45
*** chandankumar has joined #openstack-infra03:45
*** matsuhas_ has quit IRC03:46
*** matsuhashi has joined #openstack-infra03:47
*** ben_duyujie has joined #openstack-infra03:48
*** boris-42 has joined #openstack-infra03:49
*** matsuhashi has quit IRC03:51
*** rcleere has joined #openstack-infra03:52
*** paul-- has quit IRC03:52
*** boris-42 has quit IRC03:54
*** jamesmcarthur has joined #openstack-infra04:02
*** nati_ueno has joined #openstack-infra04:09
*** senk has joined #openstack-infra04:10
*** senk has quit IRC04:11
*** senk has joined #openstack-infra04:12
*** markwash has joined #openstack-infra04:16
*** jamesmcarthur has quit IRC04:16
*** matsuhashi has joined #openstack-infra04:20
*** CaptTofu has quit IRC04:24
*** CaptTofu has joined #openstack-infra04:24
*** markwash has quit IRC04:24
jheskethclarkb: How do you get past the docs gate when you have dependencies like graphviz/dot?04:26
clarkbjhesketh: is the problem that the build fails because we don't have graphviz installed on the slaves?04:26
jheskethmore or less04:26
jheskethI was also using sphinxcontrib.programoutput but since that failed I just removed it04:27
jheskethclarkb: eg: http://logs.openstack.org/49/56849/3/check/gate-turbo-hipster-docs/7bf90d0/console.html04:27
clarkbI thought I had a change proposed to add graphviz to the slaves, wonder what happened to it04:27
clarkbjhesketh: I can't find it, you can propose a change to openstack-infra/config/modules/jenkins/manifests/slave.pp and params.pp to add graphviz04:29
clarkbprogramoutput should work though, you may need to list it in one of your requirements files, JJB uses it iirc04:30
jheskethI think the problem with programoutput is that turbo-hipster has dependencies that aren't installed04:31
jheskethI'll look at adding graphviz :-)04:31
clarkbnon python dependencies?04:32
clarkbnon python dependencies like graphviz can be installed via puppet like I mentioned. python deps should go in a requirements or test-requirements file04:32
jheskethnope, but I didn't look closely04:32
jheskethit might be my requirements file is wrong04:32
jheskethI'll play with a virtualenv04:33
*** wenlock has joined #openstack-infra04:34
clarkbreally weird that I can't find my graphviz change04:37
clarkbmaybe I wrote it and never pushed it, and to look I am on the wrong machine04:38
*** matsuhashi has quit IRC04:42
*** matsuhashi has joined #openstack-infra04:42
*** paul-- has joined #openstack-infra04:45
openstackgerritJoshua Hesketh proposed a change to openstack-infra/config: Install graphviz on jenkins slaves for docs  https://review.openstack.org/5685704:45
jheskethclarkb: ^04:45
jheskethso I didn't ensure that it's actually installed because 1) I couldn't see other packages doing that and 2) It's not a very important package04:46
clarkbthe ensure happens further down in the code, what you have is fine, but can you update the comment in slave.pp? I will leave a comment in gerrit04:47
jheskethright, I meant ensure that /usr/bin/dot exists04:48
jheskethbut I didn't see the package ensure which is good04:48
jheskethah yes, copy and paste fail04:48
openstackgerritJoshua Hesketh proposed a change to openstack-infra/config: Install graphviz on jenkins slaves for docs  https://review.openstack.org/5685704:49
jheskethclarkb: fixed. Sorry about that. I didn't mention sphinx doc building as it can apply more generically if we have other places that want graphs04:49
clarkbyup wfm04:50
clarkbis dot provided by the graphviz package?04:50
jheskethyes04:51
clarkbmordred: ^ I think we are safe depending on EPL licensed projects to produce data for us04:56
clarkbmordred: but a second set of eyes would be nice04:56
*** wenlock has quit IRC05:00
*** dstanek has quit IRC05:01
*** sjing has quit IRC05:04
*** sjing has joined #openstack-infra05:05
*** dstanek has joined #openstack-infra05:08
zaroohh graphviz.  clarkb, does that mean graphviz will be on all slaves?05:10
clarkbzaro: if that change merges05:11
*** sarob has joined #openstack-infra05:11
*** senk has quit IRC05:12
*** yamahata has quit IRC05:12
zaroi was thinking of **maybe** building dependency graph of our build jobs.05:12
zarothey are so dang crazy to follow.05:12
clarkbpackage dependencies?05:13
zarono build jobs.05:13
clarkblike for the plugins? (I guess I don't know what "build" jobs are and what dependencies we are talking about)05:15
clarkbmost of our jenkins jobs don't have any dependencies, they just run05:16
clarkbthe jobs related to publishing have a small dependency list as the sdist must be built first then pushed to pypi then mirror updates05:16
clarkbit's also late on sunday night and my brain is probably just not working05:17
zaroyeah, i guess what i mean is gate and release jobs in layout.yaml.05:18
zaroalso, are there dependencies defined in the jenkins jobs themselves?05:18
clarkbno we don't define dependencies in the jenkins jobs directly05:19
clarkbI think at one time we might have, but now we rely on zuul05:19
zaroohh.  no upstream/downstream jobs?05:19
clarkbnot anymore I think, you can grep in jjb to double check05:20
*** talluri has quit IRC05:20
zaromaybe tomorrow, my brain not cooperating with my eyes. too late.05:21
zarobut that's good to hear.05:21
clarkbthe big reason for that is multiple jenkins05:22
clarkbthat doesn't work too well with downstream jobs05:22
zaroso maybe just a graph of dependent jobs from yaml file might be interesting.  not sure if you can generate dependency graphs from yaml05:22
zaroguess you not feeling that.05:25
clarkbI am tired too :) you should be able to build a graph based on the yaml. may need massaging though05:26
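The graph idea could be sketched roughly as below. It assumes the job list has already been loaded from YAML into plain Python lists and dicts, where a bare string is a standalone job and a one-key dict maps a job to the list of jobs that run after it; that shape, the function names and the job names in the usage note are all assumptions for illustration, not taken from the real layout.yaml. The output is Graphviz DOT text that `dot` can render.

```python
def job_edges(jobs, parent=None):
    """Collect (upstream, downstream) pairs from a nested job list."""
    edges = []
    for entry in jobs:
        if isinstance(entry, dict):
            # {job: [dependents...]} means the dependents run after job.
            for name, children in entry.items():
                if parent is not None:
                    edges.append((parent, name))
                edges.extend(job_edges(children, name))
        elif parent is not None:
            # A bare string nested under a parent is a leaf dependent.
            edges.append((parent, entry))
    return edges


def to_dot(edges):
    """Render edge pairs as a Graphviz digraph in DOT syntax."""
    lines = ["digraph jobs {"]
    lines += ['    "%s" -> "%s";' % edge for edge in edges]
    lines.append("}")
    return "\n".join(lines)
```

With a made-up publishing chain such as `[{"demo-tarball": [{"demo-pypi-upload": ["demo-mirror-update"]}]}]`, this produces two edges, tarball to upload and upload to mirror update. Jobs with no dependents are left out of the graph, which fits the observation that most jobs simply run on their own.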
zarojhesketh: +1 for graphviz.  thanks.05:28
zaroclarkb: can't wait to hear about your weekend IRCing..05:29
clarkbzaro: it was great, I watched football and had an irc client open05:29
clarkbI did that + sleep that is about it05:30
zaroahhh football.. espn.. those were the days..05:31
* zaro dreams about clarkb days tonight.05:32
*** dstanek has quit IRC05:33
*** talluri has joined #openstack-infra05:35
*** nosnos_ has joined #openstack-infra05:36
*** nosnos has quit IRC05:36
clarkbzaro: it might be neat to fiddle with the puppet graphing stuff too05:38
clarkbI think it uses graphviz to show the catalog graph, which probably isn't too interesting as a whole but might be neat in smaller subsets05:39
clarkbbut that doesn't show you inter host stuff which is actually interesting05:39
*** aardvark has quit IRC05:40
*** aardvark has joined #openstack-infra05:40
*** DennyZhang has joined #openstack-infra05:40
*** afazekas has quit IRC05:40
*** nati_ueno has quit IRC05:41
*** boris-42 has joined #openstack-infra05:44
*** jhesketh has quit IRC05:44
clarkbzaro: the JJB change lgtm after a quick look, but I need to spend some time to go through all of the new files05:46
*** paul-- has quit IRC05:48
*** rcleere has quit IRC05:49
*** SergeyLukjanov has joined #openstack-infra05:54
*** jhesketh has joined #openstack-infra05:57
*** sarob has quit IRC05:57
*** sarob has joined #openstack-infra05:57
*** sarob has quit IRC06:02
*** boris-42 has quit IRC06:03
*** DennyZhang has quit IRC06:04
*** boris-42 has joined #openstack-infra06:06
jog0fungi mordred lifeless mikal clarkb: do any of you take weekend off?06:06
clarkboccasionally06:07
jog0fungi: the risk of doing auto-recheck for things that elastic-recheck flags is that we may get stuck in an infinite loop.06:07
jog0for a bad query06:07
clarkbjog0: I think fungi was thinking that we would require a bug in e-r to recheck06:07
jog0clarkb: I like that idea06:07
clarkbI mostly didn't work this weekend. I was watching football06:07
jog0clarkb: football + IRC = ?06:08
clarkbfun06:08
jog0I do think we could eventually do auto recheck based on e-r if we have more precautions in place06:08
jog0(long term)06:08
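The precautions mentioned above might look something like this sketch: only recheck automatically when a known bug matched, and cap how many times the same change is rechecked for the same bug so a bad query cannot loop forever. The class name, the default limit of 3 and the identifiers in the usage note are hypothetical; this is not how elastic-recheck behaves.

```python
from collections import Counter


class RecheckGuard:
    """Decide whether an automatic recheck is allowed (illustrative only)."""

    def __init__(self, max_rechecks=3):
        self.max_rechecks = max_rechecks
        self._counts = Counter()  # (change, bug) -> rechecks issued so far

    def allow(self, change, bug):
        """Return True and record the recheck if it is within the limits."""
        if not bug:
            return False  # no matching bug, so a human should look
        key = (change, bug)
        if self._counts[key] >= self.max_rechecks:
            return False  # cap reached; stop to avoid an endless loop
        self._counts[key] += 1
        return True
```

A caller would ask `guard.allow("56849", "1251920")` before posting a recheck comment and fall back to a person once it returns False.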
clarkbthe next few weekends I won't be working much06:10
jog0thats what you say now ...06:10
clarkbno really. going to be in portland for the next one06:10
clarkbwhich means hanging out with people and not working06:10
jog0mikal: nice find for the console error06:10
jog0want to file a patch to elastic-recheck for it?06:11
jog0mikal: https://github.com/openstack-infra/elastic-recheck/blob/master/queries.yaml06:12
jog0clarkb: so I didn't read the entire backlog about rechecks but, it sounds like we all agree its a problem, which is good06:12
clarkblet me try and tl;dr for you06:12
clarkbcurrent problem appears to be we add new problems quicker than we remove them so what do we do?06:13
jog0clarkb: easy, cry06:13
clarkbdo we disable problematic tests? remove ability to recheck? run many tests to have higher chance of catching things? educate our devs? etc06:13
clarkbI don't think we came away with any hard answers but we do need to do something06:13
jog0clarkb: I think this sounds like a fun ML fodder06:14
jog0well sdague and I have at least one idea06:14
jog0but I have to go so that will have to wait for the ML thread that I am hoping someone will start06:15
clarkbI think sdague or mordred may be best to start it06:15
jog0clarkb: ++06:15
jog0sdague mordred: ^06:15
jog0anyway back to my weekend06:15
clarkbbecause TC and all that06:16
lifelessjog0: I totally did06:17
lifelessjog0: I got nothing done06:17
*** markwash has joined #openstack-infra06:20
*** marun has joined #openstack-infra06:27
*** sarob has joined #openstack-infra06:36
*** SergeyLukjanov has quit IRC06:41
*** matsuhashi has quit IRC06:54
*** pblaho has joined #openstack-infra06:56
*** matsuhashi has joined #openstack-infra06:56
*** nosnos_ has quit IRC06:56
*** nosnos has joined #openstack-infra06:56
mikaljog0: sure, I'll take a look at elastic recheck after dinner07:01
*** zoresvit has joined #openstack-infra07:01
*** nati_ueno has joined #openstack-infra07:06
*** nati_ueno has quit IRC07:08
*** ljjjusti1 has joined #openstack-infra07:14
*** ljjjustin has quit IRC07:18
*** sarob has quit IRC07:19
*** sarob has joined #openstack-infra07:19
*** sarob has quit IRC07:24
*** jcoufal has joined #openstack-infra07:27
*** yolanda has joined #openstack-infra07:32
*** boris-42 has quit IRC07:37
*** DinaBelova has joined #openstack-infra07:42
*** michchap has quit IRC07:43
*** michchap has joined #openstack-infra07:43
*** DinaBelova has quit IRC07:46
*** flaper87 has quit IRC07:48
*** flaper87 has joined #openstack-infra07:48
*** DinaBelova has joined #openstack-infra07:51
*** alexpilotti has joined #openstack-infra07:53
*** uvirtbot has joined #openstack-infra07:53
*** SergeyLukjanov has joined #openstack-infra08:02
*** matsuhashi has quit IRC08:13
*** matsuhashi has joined #openstack-infra08:14
*** matsuhashi has quit IRC08:15
*** matsuhashi has joined #openstack-infra08:15
*** osanchez has joined #openstack-infra08:24
*** che-arne has quit IRC08:29
*** che-arne has joined #openstack-infra08:34
*** shardy_afk is now known as shardy08:36
*** sjing has quit IRC08:36
*** sjing has joined #openstack-infra08:38
*** fbo_away is now known as fbo08:47
*** zoresvit has quit IRC08:48
*** sjing has quit IRC08:48
*** hashar has joined #openstack-infra08:49
*** ljjjusti1 has quit IRC08:49
*** zoresvit has joined #openstack-infra08:52
*** ilyashakhat has joined #openstack-infra08:52
*** guohliu has quit IRC08:54
openstackgerritMichael Still proposed a change to openstack-infra/elastic-recheck: Add bug 1251920 to our list of bugs we suggest for recheck  https://review.openstack.org/5688308:54
uvirtbotLaunchpad bug 1251920 in nova "Tempest failures due to failure to return console logs from an instance" [Undecided,New] https://launchpad.net/bugs/125192008:54
*** nosnos has quit IRC08:59
*** nosnos has joined #openstack-infra08:59
*** andreaf has quit IRC09:00
*** andreaf has joined #openstack-infra09:01
*** nosnos has quit IRC09:03
*** hashar has quit IRC09:05
*** che-arne has quit IRC09:06
openstackgerritJulien Danjou proposed a change to openstack/requirements: Bump sqlalchemy-migrate to 0.7.3  https://review.openstack.org/5688809:07
*** davidhadas has quit IRC09:09
*** hashar has joined #openstack-infra09:09
*** jpich has joined #openstack-infra09:10
*** Ryan_Lane has quit IRC09:10
*** che-arne has joined #openstack-infra09:14
*** yassine has joined #openstack-infra09:18
*** bingbu has quit IRC09:27
*** rpodolyaka has joined #openstack-infra09:29
mikalAny adult supervision around?09:34
mikalMy elastic recheck patch just failed its check, but the links to logs are 40409:34
mikalWhich is a bit... special09:34
*** rpodolyaka has left #openstack-infra09:39
openstackgerritYuuichi Fujioka proposed a change to openstack-dev/hacking: Add metaclass for Python3 compatibility  https://review.openstack.org/5689009:41
*** Ryan_Lane has joined #openstack-infra09:41
*** dpyzhov has left #openstack-infra09:41
*** markmc has joined #openstack-infra09:43
*** davidhadas has joined #openstack-infra09:44
*** derekh has joined #openstack-infra09:45
*** saschpe_ has joined #openstack-infra09:45
*** saschpe has quit IRC09:46
*** boris-42 has joined #openstack-infra09:49
*** Ryan_Lane has quit IRC09:50
*** soren has joined #openstack-infra09:50
*** che-arne has quit IRC09:50
*** SergeyLukjanov has quit IRC09:51
*** Dafna has joined #openstack-infra09:54
*** SergeyLukjanov has joined #openstack-infra09:54
*** BobBall is now known as BobBallAway09:55
*** matsuhashi has quit IRC09:55
*** adarazs has joined #openstack-infra09:55
*** matsuhashi has joined #openstack-infra09:56
*** matsuhashi has quit IRC10:01
*** matsuhashi has joined #openstack-infra10:05
*** dizquierdo has joined #openstack-infra10:17
*** xchu has quit IRC10:21
*** marun has quit IRC10:21
*** afazekas has joined #openstack-infra10:25
*** amotoki has quit IRC10:37
openstackgerritChangBo Guo proposed a change to openstack-dev/hacking: Clean up how test env variables are parsed  https://review.openstack.org/5689810:37
*** jhesketh has quit IRC10:38
*** yassine has quit IRC10:39
*** dizquierdo has quit IRC10:41
*** marun has joined #openstack-infra10:42
*** marun has quit IRC10:46
*** matsuhashi has quit IRC10:47
*** Ryan_Lane has joined #openstack-infra10:47
*** matsuhashi has joined #openstack-infra10:47
*** nsaje has joined #openstack-infra10:49
openstackgerritChangBo Guo proposed a change to openstack-dev/hacking: Clean up how test env variables are parsed  https://review.openstack.org/5689810:49
*** Ryan_Lane has quit IRC10:51
*** zoresvit has quit IRC10:51
*** rfolco has joined #openstack-infra10:51
jd__I think Jenkins is having some issues :(10:52
*** jhesketh has joined #openstack-infra10:52
jd__some tests fail but I can't even retrieve the log10:52
*** matsuhashi has quit IRC10:52
*** zoresvit has joined #openstack-infra10:53
*** marun has joined #openstack-infra10:53
*** matsuhashi has joined #openstack-infra10:54
*** fifieldt has joined #openstack-infra10:57
*** nsaje_ has joined #openstack-infra10:57
*** nsaje has quit IRC10:57
*** ArxCruz has joined #openstack-infra11:01
openstackgerritNikita Konovalov proposed a change to openstack-infra/storyboard: Added task ordering  https://review.openstack.org/5602611:07
*** johnthetubaguy has joined #openstack-infra11:11
*** yaguang has quit IRC11:12
*** michchap has quit IRC11:12
*** matsuhashi has quit IRC11:13
*** michchap has joined #openstack-infra11:13
*** pblaho has quit IRC11:13
*** matsuhashi has joined #openstack-infra11:14
*** jcoufal has quit IRC11:14
*** lcestari has joined #openstack-infra11:14
*** arata has joined #openstack-infra11:15
*** ilyashakhat has quit IRC11:17
*** ilyashakhat has joined #openstack-infra11:18
*** matsuhashi has quit IRC11:19
*** DinaBelova has quit IRC11:19
*** pcm_ has joined #openstack-infra11:20
*** pcm_ has joined #openstack-infra11:20
*** michchap has quit IRC11:22
*** michchap has joined #openstack-infra11:23
arataThere is something wrong with jenkins0111:23
aratanot responding.11:24
*** SergeyLukjanov has quit IRC11:24
*** yamahata_ has joined #openstack-infra11:26
*** SergeyLukjanov has joined #openstack-infra11:27
*** nsaje_ has quit IRC11:29
*** matsuhashi has joined #openstack-infra11:32
*** dizquierdo has joined #openstack-infra11:34
*** fifieldt has quit IRC11:34
*** nsaje has joined #openstack-infra11:35
*** DinaBelova has joined #openstack-infra11:36
*** ben_duyujie has quit IRC11:39
*** dkranz has quit IRC11:41
*** talluri has quit IRC11:47
*** Ryan_Lane has joined #openstack-infra11:48
openstackgerritSoren Hansen proposed a change to openstack-infra/config: Add BasicDB to stackforge  https://review.openstack.org/5551911:52
*** Ryan_Lane has quit IRC11:52
*** adalbas has joined #openstack-infra11:57
*** nsaje has quit IRC11:59
*** DinaBelova has quit IRC12:01
*** jcoufal has joined #openstack-infra12:01
*** davidhadas_ has joined #openstack-infra12:03
*** davidhadas has quit IRC12:06
*** yassine has joined #openstack-infra12:06
mordredarata: I'll look in to it in a bit - dealing with my personal network being bonghits first12:12
mordred-bash: fork: Cannot allocate memory12:12
mordredwow12:12
mordredthat's exciting12:12
*** michchap has quit IRC12:14
*** michchap has joined #openstack-infra12:15
mordredok. jenkins01.openstack.org was in a world of hurt - it could not fork processes, I could not even cat things in /proc12:17
mordredso I nova reboot-ed it12:17
mordredjd__, arata, mikal ^^12:18
jd__thanks mordred12:18
*** yamahata_ has quit IRC12:19
* mordred wishes he had more information about why it was broken, but given that /proc was broken, diagnosing was a bit rough :)12:20
aratamordred: although still getting ssl_error_rx_record_too_long, thanks anyway12:23
*** hashar has quit IRC12:28
*** arata has left #openstack-infra12:29
mordredah. great. jenkins01 is even more borked than I thought12:29
mordredfungi: when you wake up ^^12:30
mordredfungi: root@jenkins01:~# puppet agent --test12:30
mordredExiting; no certificate found and waitforcert is disabled12:30
mordredjenkins01 has gotten REALLY unhappy12:30
ttxmordred: when you have 5 minutes I'd like to exchange a bit on your Thunderbird setup12:33
mordredttx: I always have 5 minutes for you!12:33
*** nsaje has joined #openstack-infra12:34
mordredttx: it's really simple, I have a folder for openstack-dev, and in that folder, I go to the View menu, sort by, threaded12:34
*** mattymo has quit IRC12:34
ttxmordred: I already used threaded view but was using "N" to jump to next unread, which still makes the sheer number a pain to process. What do you use and how do you go through it ?12:34
mordredmy mouse12:34
mordredalso \ is helpful for 'collapse all'12:35
ttxso.. you select interesting threads and expand them ? then once done you mark folder as read ?12:35
mordredttx: yes. or I mark the thread as read12:35
mordredwhich is r when the thread is highlighted12:35
mordredthat's how I ignore threads I don't care about12:36
*** sandywalsh has joined #openstack-infra12:36
ttxbut then it still comes back whenever a new message is posted to it. that said "ignore thread" is a bit extreme12:37
mordredwell, yeah - there's 'k' for ignore thread12:37
mordredand 'w' for watch thread12:37
mordred(which I've never used, but am going to)12:37
ttx't' shows promise12:38
ttx"Go to Next Unread Thread (and mark current thread as read)"12:38
ttxnot sure what "watch thread" actually does12:38
mordredme either12:38
ttxdo you use starring at all ?12:39
mordredttx: ah - watch thread is probably not useful to us at all12:40
mordredyou can watch a thread, and then switch to a view where you only see watched threads12:40
ttxanyway, most of it is about accepting I can't read every message. My old workflow (using "n") was still pretending to read them all12:40
mordredI do not use starring12:40
ttxwhich made the prospect of parsing 192 messages in the morning not that exciting12:40
mordredand yeah , I think it's more about accepting that than anything12:40
ttxI recently switched my reading of openstack@l.o.o to "read thread subjects only" and it works well12:41
mordredyah. subject filtering by eyeball is pretty effective12:42
ttxmordred: do you manage to get TB to always start in collapsed threads mode, or do you abuse \ regularly to force it to submission ?12:42
mordredabuse \12:43
ttxok, same here12:43
*** dkliban has quit IRC12:45
*** hashar has joined #openstack-infra12:47
*** weshay has joined #openstack-infra12:48
*** Ryan_Lane has joined #openstack-infra12:48
*** che-arne has joined #openstack-infra12:52
openstackgerritMonty Taylor proposed a change to openstack-infra/devstack-gate: Remove duplicate entries  https://review.openstack.org/5692012:53
*** SergeyLukjanov is now known as _SergeyLukjanov12:53
*** Ryan_Lane has quit IRC12:53
*** _SergeyLukjanov has quit IRC12:53
*** hashar has quit IRC12:54
openstackgerritMonty Taylor proposed a change to openstack-dev/pbr: Enable wheel processing in the tests  https://review.openstack.org/5681712:55
openstackgerritMonty Taylor proposed a change to openstack-dev/pbr: Clean up integration script  https://review.openstack.org/5681612:55
openstackgerritMonty Taylor proposed a change to openstack-dev/pbr: Use wheels for installation  https://review.openstack.org/4880312:55
*** hashar has joined #openstack-infra12:58
*** dkranz has joined #openstack-infra13:03
*** yamahata_ has joined #openstack-infra13:05
*** matsuhashi has quit IRC13:05
*** matsuhashi has joined #openstack-infra13:05
*** CaptTofu has quit IRC13:05
*** CaptTofu has joined #openstack-infra13:06
*** dkliban has joined #openstack-infra13:06
openstackgerritYassine Lamgarchal proposed a change to openstack-infra/config: Add tooz  https://review.openstack.org/5692713:18
*** dkliban has quit IRC13:21
openstackgerritYassine Lamgarchal proposed a change to openstack-infra/config: Add tooz  https://review.openstack.org/5692713:22
*** dkliban has joined #openstack-infra13:26
*** dkliban has quit IRC13:30
*** thomasem has joined #openstack-infra13:33
*** nati_ueno has joined #openstack-infra13:35
*** dstanek has joined #openstack-infra13:36
*** nati_ueno has quit IRC13:37
*** nati_ueno has joined #openstack-infra13:38
ArxCruzmordred: what need to have this https://review.openstack.org/#/c/53432/ merged? :)13:39
*** nsaje has quit IRC13:39
ArxCruzwhat do I need to do*13:40
mordredArxCruz: nothing. need another review on it13:40
mordredArxCruz: we've been short staffed last week, so the review queue is a bit long13:40
ArxCruzfungi: https://review.openstack.org/#/c/53432/ :D ?13:40
ArxCruzmordred: okay :)13:40
*** nsaje has joined #openstack-infra13:40
*** nsaje has quit IRC13:42
*** SergeyLukjanov has joined #openstack-infra13:44
*** yamahata_ has quit IRC13:47
*** yamahata_ has joined #openstack-infra13:49
*** Ryan_Lane has joined #openstack-infra13:49
*** Ryan_Lane has quit IRC13:54
*** dprince has joined #openstack-infra13:54
*** matsuhashi has quit IRC13:56
*** lascii is now known as alaski13:56
*** matsuhashi has joined #openstack-infra13:57
*** matsuhas_ has joined #openstack-infra13:58
openstackgerritChuck Short proposed a change to openstack/requirements: Add HACKING.rst  https://review.openstack.org/5590913:58
*** matsuhashi has quit IRC13:59
*** sdake__ is now known as sdake14:01
hasharif any Gerrit folks are around, one of our volunteers found a WIP plugin for Gerrit: https://github.com/davido/gerrit-wip-plugin (by David Ostrovsky)14:03
hasharmordred: ^^14:03
mordredhashar: yup!14:04
mordredhashar: _david_ wrote that for us14:04
hashar\O/14:04
*** ryanpetrello has joined #openstack-infra14:04
mordredhashar: we need to get one patch landed to upstream gerrit, aiui14:04
hasharnow that it is a plugin, we will be able to add it on our installation \O/14:05
openstackgerritYuuichi Fujioka proposed a change to openstack-dev/hacking: Add metaclass for Python3 compatibility  https://review.openstack.org/5689014:05
hasharmordred: ^d and qchris from Wikimedia might be able to assist in review / +1.  Although both are working on other projects nowadays.14:05
mordredhashar: w00t!14:07
mordredhashar: I find it very funny that we can't get WIP mainlined14:07
mordredbut, that's a topic for beer14:08
hasharI guess Google folks are only interested in maintaining features they are actively using14:09
hasharleaving third parties with plugins.14:09
*** jpeeler has joined #openstack-infra14:09
*** julim has joined #openstack-infra14:12
*** cody-somerville has joined #openstack-infra14:16
*** mfer has joined #openstack-infra14:16
*** mriedem has joined #openstack-infra14:21
*** davidhadas_ has quit IRC14:22
*** julim has quit IRC14:23
*** dkliban has joined #openstack-infra14:24
*** julim has joined #openstack-infra14:26
anteayafungi teh jenkins is sick14:27
anteayamordred thinks jenkins01 needs some care14:27
*** nsaje has joined #openstack-infra14:28
*** Dafna has quit IRC14:28
*** dkranz has quit IRC14:30
*** SergeyLukjanov is now known as _SergeyLukjanov14:32
*** _SergeyLukjanov has quit IRC14:32
*** nsaje has quit IRC14:32
*** nsaje has joined #openstack-infra14:34
*** afazekas has quit IRC14:36
*** alcabrera has joined #openstack-infra14:37
*** Loquacity has quit IRC14:38
*** Loquacity has joined #openstack-infra14:39
*** nsaje has quit IRC14:44
*** jamesmcarthur has joined #openstack-infra14:45
*** dkranz has joined #openstack-infra14:45
*** Dafna has joined #openstack-infra14:48
*** afazekas has joined #openstack-infra14:49
openstackgerritRyan Petrello proposed a change to openstack-infra/config: Run tox tests for pecan to gate against WSME, Ceilometer, and Ironic.  https://review.openstack.org/5433314:50
*** xeyed4good has joined #openstack-infra14:51
*** ftcjeff has joined #openstack-infra14:53
*** ftcjeff has quit IRC14:55
*** damnsmith is now known as dansmith14:56
*** matsuhas_ has quit IRC14:57
*** matsuhashi has joined #openstack-infra14:57
fungianteaya: having a look14:59
*** davidhadas has joined #openstack-infra14:59
*** oubiwann has joined #openstack-infra15:01
*** SergeyLukjanov has joined #openstack-infra15:02
fungilooks like mordred rebooted it several hours ago15:03
mordredfungi: indeed. but now it's in a land of unhappy15:03
fungiyeah, ssl errors on 44315:03
mordredfungi: the jenkins service is not in a happy place - AND - running puppet agent --test on it15:03
mordredreturns missing cert15:03
mordredwhich makes me think that it's bonghits15:03
*** matsuhashi has quit IRC15:04
*** alcabrera is now known as alcabrera|afk15:04
*** thedodd has joined #openstack-infra15:04
*** nsaje has joined #openstack-infra15:04
*** matsuhashi has joined #openstack-infra15:04
*** SergeyLukjanov has quit IRC15:04
fungithat does seem very un-right15:05
fungicomparing against jenkins02 now15:05
*** jgrimm has joined #openstack-infra15:05
fungipuppet's fine on jenkins02. good15:07
fungiExiting; no certificate found and waitforcert is disabled15:07
*** alcabrera|afk has quit IRC15:07
fungithat's so great15:07
*** Dafna1 has joined #openstack-infra15:08
*** Dafna has quit IRC15:08
*** Loquacity has quit IRC15:08
*** matsuhashi has quit IRC15:08
*** Loquacity has joined #openstack-infra15:09
*** ryanpetrello_ has joined #openstack-infra15:10
mordredright?15:10
mordredI mean15:10
*** SergeyLukjanov has joined #openstack-infra15:10
mordredI was really excited by it myself15:10
*** ryanpetrello has quit IRC15:12
*** ryanpetrello_ is now known as ryanpetrello15:12
fungiyeah, i'm trying to determine what, if anything, is up with the puppet cert for it now15:14
*** Dafna1 has quit IRC15:15
*** CaptainTacoSauce has joined #openstack-infra15:15
*** rwsu-pto is now known as rwsu15:16
*** alcabrera|afk has joined #openstack-infra15:16
*** alcabrera|afk is now known as alcabrera15:16
*** jpeeler1 has joined #openstack-infra15:17
*** jpeeler has quit IRC15:17
*** pblaho has joined #openstack-infra15:19
mordredfungi: I blame gremlins15:19
*** davidhadas has quit IRC15:19
mordredfungi: btw - not important, but I self approved this: https://review.openstack.org/#/c/56920/ this morning15:21
mordredfungi: my cleanup of the pbr integration test caused it to start throwing errors on duplicates in the PROJECTS list15:21
mordredI figure, fix the duplicates, then land the pbr change, then no more duplicates can sneak in - profit!15:21
*** herndon has joined #openstack-infra15:23
fungipuppet cert on jenkins01 seems to validate against the ca, and it's the same ca which signed the 02 cert15:24
*** thedodd has quit IRC15:26
*** dcramer_ has joined #openstack-infra15:26
*** Dafna has joined #openstack-infra15:29
mordredfungi: somehow that makes me less happy15:30
fungiyeah, the syslog also has "puppet-agent[1128]: Did not receive certificate" over and over since reboot15:31
anteayalooking at the test nodes graphic in status.openstack.org/zuul, something happened at about 6am test node time15:32
*** dcramer_ has quit IRC15:32
anteayawas that the reboot perhaps?15:32
*** dkranz has quit IRC15:33
*** jpeeler1 is now known as jpeeler15:33
*** jpeeler has joined #openstack-infra15:33
fungianteaya: roughly15:33
*** datsun180b has joined #openstack-infra15:36
*** mgagne has joined #openstack-infra15:37
anteayak15:37
*** davidhadas has joined #openstack-infra15:39
*** markwash has quit IRC15:40
fungiran through everything in http://docs.puppetlabs.com/pe/latest/trouble_comms.html just to be sure, but it all checks out15:41
*** wenlock has joined #openstack-infra15:41
jd__fungi, mordred got a minute to review https://review.openstack.org/#/c/56927/ ? the devs would like to start pushing patches soon :)15:42
fungidoing a debsums run on it now to see if maybe we've got any obviously corrupted executables15:44
funginothing out of the ordinary there15:46
anteayajd__: fungi will be awhile he is tending to the jenkins15:46
*** dkranz has joined #openstack-infra15:46
jd__bad bad Jenkins :)15:48
anteayaor something15:48
anteayaso far no clues what is causing the sickness15:48
fungiwell, more like too many clues, leading in different directions15:48
anteayaoh15:48
anteayaeven worse15:48
anteayawhat are you seeing15:49
fungihere's a possible bite... puppet.conf was changed today at 09:23z15:50
fungii bet unattended upgrades are to blame15:51
anteayaoh15:51
anteayathat would bring us to our knees15:51
anteayacan we revert the puppet.conf change?15:51
funginope. today's didn't run on that server until 15:42z15:52
fungidime to start diffing15:52
anteayahuh15:52
fungier, time15:52
anteayak15:52
*** dcramer_ has joined #openstack-infra15:54
fungisomehow it got certname=undef15:54
fungishould have its fqdn in there15:54
anteayais that an easy correction?15:55
fungiyeah, now i'm trying to figure out how it happened though15:56
fungiand here we go...15:56
fungiNov 18 09:23:42 jenkins01 puppet-agent[1123]: (/Stage[main]/Openstack_project::Base/File[/etc/puppet/puppet.conf]/content) content changed '{md5}db139fe80a2db5dfe350a9a5eaa0ccae' to '{md5}bf3bd06b3c035a33167619450190f861'15:56
* fungi checks for recently approved changes to that file15:57
anteayaI am not seeing a change to openstack-infra/config for Nov 1816:01
* anteaya isn't sure where else a change to puppet.conf could come from16:02
openstackgerritSergey Lukjanov proposed a change to openstack-infra/reviewstats: Add storyboard project  https://review.openstack.org/5697216:03
*** senk has joined #openstack-infra16:04
*** julim has quit IRC16:04
*** julim has joined #openstack-infra16:05
*** jergerber has joined #openstack-infra16:05
*** pblaho1 has joined #openstack-infra16:06
*** dripton has quit IRC16:07
*** nsaje has quit IRC16:08
*** pblaho has quit IRC16:08
zaromgagne: jjb release is imminent.16:08
openstackgerritNikita Konovalov proposed a change to openstack-infra/storyboard: Added task ordering  https://review.openstack.org/5602616:09
*** mrodden has joined #openstack-infra16:09
zaromgagne: https://review.openstack.org/#/c/5682316:09
*** dripton has joined #openstack-infra16:10
*** SergeyLukjanov has quit IRC16:13
*** saschpe_ has quit IRC16:13
*** saschpe has joined #openstack-infra16:14
fungianteaya: mordred: doesn't look like anything changed in our puppet config to cause it... seems this was puppet spontaneously breaking itself by falling back on a (presumably very old) cached config... http://paste.openstack.org/show/53536/16:15
fungianteaya: mordred: my best guess so far is that jenkins01 lost contact with the puppet master somehow for an extended period between 08:43 and 09:23, but possibly related to a too-many-open-processes issue? (note the cron can't fork error in there)16:16
fungier, either too many open file descriptors or too many processes16:17
* anteaya clicks16:17
fungiwould be my guess16:17
anteayaany way of knowing how many processes were running on that jenkins at the time?16:18
funginot unless mordred took notes before he rebooted it16:18
fungicould also have been an oom condition, but i'd have expected to see that reflected in the logs16:18
*** afazekas has quit IRC16:19
zarohooray for multiple masters!16:20
anteayazaro: I'm wondering if this means we need more16:21
anteayaand a script that sets a maximum of processes per master16:21
zaromemory setting might be problem, always is with java.16:21
anteayaif we don't already have that16:21
*** katyafervent has joined #openstack-infra16:22
fungianteaya: we have separate issues as a cascading effect from this which is causing us to be starved for d-g slaves (note the purported deleting, building and in use states on the graph many of which i suspect are tied to the dead jenkins master)16:23
zarofungi: all jobs that were running on that master were probably lost.  do the logs show what was running?16:23
*** dprince has quit IRC16:23
*** boris-42 has quit IRC16:23
fungiand also causing some jobs to be blocking the gate because they never reported back as finished, i think16:23
fungizaro: it might, i haven't looked yet16:23
ryanpetrellothere was talk at the summit of moving to wheels for pypi uploads16:24
ryanpetrelloI'm guessing stackforge projects will be doing this too?16:24
anteayafungi yes I have been watching that graph with interest16:24
ryanpetrellowondering what steps packages should start taking to prepare, e.g., https://gist.github.com/kennethreitz/748960316:24
zaroohh wait, doesn't zuul keep those jobs or at least tell us which jobs timed out?16:24
*** dkranz has quit IRC16:25
*** afazekas has joined #openstack-infra16:25
anteayaryanpetrello mordred is working on wheels16:25
ryanpetrelloyea, I'll poke him about it16:26
ryanpetrellojust want to do what I can to prepare :)16:26
fungiokay, jenkins01 is back up and running jobs again. going to see what i can do to start cleaning up the deleting (in nodepool)/offline (in jenkins) d-g slave mess now16:28
*** pabelanger has quit IRC16:29
anteayaryanpetrello: good plan16:30
*** krotscheck has joined #openstack-infra16:30
*** rnirmal has joined #openstack-infra16:31
*** pabelanger has joined #openstack-infra16:33
*** pabelanger has quit IRC16:33
*** pabelanger has joined #openstack-infra16:33
*** ben_duyujie has joined #openstack-infra16:33
*** ^d has joined #openstack-infra16:33
pabelangerfungi, anteaya: You can stop puppet from using the cached catalog to prevent that exact behaviour.16:34
*** afazekas has quit IRC16:34
anteayado you have a command for that?16:34
anteayaor is that a config option?16:34
pabelangerusecacheonfailure = false16:35
pabelangerfor agents16:35
anteayathank you16:35
*** dstanek has quit IRC16:35
*** ben_duyujie has quit IRC16:35
fungipabelanger: awesome! so i guess "puppet agent eats itself" is a known issue with cached configs and keeping your puppet.conf in puppet16:37
pabelangerfungi, ya, I hit that same issue many moons ago.  My nodes couldn't download the catalog for some reason, puppet basically nuked my production boxes systematically across my network.  Not a fun day that was explaining it to customers16:38
fungiof course, if usecacheonfailure = false is an option which goes in the agent config, we won't be able to apply it with puppet without blowing away the certname option on all of them at the same time16:38
fungiso adding that option will take some planning, i suspect16:39
*** dstanek has joined #openstack-infra16:39
pabelangerwhy do you need to blow away certname?16:39
*** reed has joined #openstack-infra16:39
fungiwe don't need to. that's just what i expect puppet will do if we make a change to that file16:39
* pabelanger goes to check how puppet.conf is built16:39
fungiright now the base one in our config repo has certname=undef in it16:40
annegentlejeblair: around?16:40
fungiand i suspect changing any part of that file will set them all to undef when it reapplies the updated file16:40
*** dkranz has joined #openstack-infra16:41
pabelangerfungi, Oh, I see16:41
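
The concern above (adding usecacheonfailure without resetting certname to undef) could be guarded against with a check like the following. This is only a minimal sketch: the function name render_agent_conf, the section layout, and the example hostname are assumptions for illustration, not how the openstack-infra config repo actually builds puppet.conf.

```python
def render_agent_conf(certname, use_cache_on_failure=False):
    """Return puppet.conf-style text, refusing an unset certname.

    Hypothetical helper: the [main]/[agent] layout is an assumption.
    """
    # Guard against the failure seen in the log: certname ending up as "undef".
    if not certname or certname == "undef":
        raise ValueError("certname must be the node's FQDN, got %r" % (certname,))
    lines = [
        "[main]",
        "certname = %s" % certname,
        "",
        "[agent]",
        # The option pabelanger suggested; false means a failed catalog
        # fetch does not fall back to the cached catalog.
        "usecacheonfailure = %s" % ("true" if use_cache_on_failure else "false"),
    ]
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    # "jenkins01.example.org" is a placeholder hostname.
    print(render_agent_conf("jenkins01.example.org"))
```

Failing loudly on an unset certname means a bad template is rejected before it reaches an agent, rather than being discovered after the agent can no longer find its certificate.
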
*** mdenny has joined #openstack-infra16:41
annegentlefungi: just got another reminder email from Holly at O'Reilly, should we just schedule something and hope jeblair can make it?16:42
fungiannegentle: i'm uncomfortable doing that--he seemed particularly displeased that i didn't push them to fully integrate with our existing systems on that first call where i was the only infra representative, so i'd really rather not repeat that without him around to make his desires very clear16:43
annegentlefungi: yep, makes sense. oh jeblair jeblair where are you?16:43
*** UtahDave has joined #openstack-infra16:44
*** hashar has quit IRC16:46
*** danger_fo_away is now known as dangers16:47
mgagnezaro: cool =)16:47
fungistill over 100 nodes showing in a delete status in nodepool after jenkins01 has had a chance to settle, and the graph suggests they all ended up in that state following its nosedive earlier, so i'm going to 'nodepool delete' those16:51
*** branen has joined #openstack-infra16:51
zaromgagne: if you have time i would really like this to merge.. https://review.openstack.org/#/c/5671516:55
mgagnezaro: checking16:55
zaromgagne: what did gf think of the food you ate?16:55
*** pblaho1 has quit IRC16:57
*** krotscheck has quit IRC16:57
mgagnezaro: was ok with it, I guess she was hoping for much more, especially desserts :P16:58
*** UtahDave has quit IRC16:58
*** markmc has quit IRC16:59
*** SergeyLukjanov has joined #openstack-infra17:01
*** weshay has quit IRC17:02
*** dkranz has quit IRC17:02
mgagnezaro: my coworker found a bakery in central hk but I couldn't find it again later. We do have some chinese bakeries here but it's probably not the same =(17:04
*** hogepodge has joined #openstack-infra17:05
mgagnezaro: jenkins disagrees: https://review.openstack.org/#/c/56715/17:07
zaromgagne: egg tarts were the thing in HK, but are kinda difficult to find these days.  not sure what's up with that :(17:07
Alex_Gaynormordred: pbr now has more installs from pypi than psycopg2 :]17:08
Alex_Gaynorerr, than pymongo17:08
*** nsaje has joined #openstack-infra17:08
*** nsaje has quit IRC17:11
*** nsaje has joined #openstack-infra17:12
*** pblaho has joined #openstack-infra17:13
*** dkranz has joined #openstack-infra17:14
*** julim has quit IRC17:15
jlkit's all the hipster python coders thinking it's a beer library17:15
dstufftI thought all the hipsters used node17:16
jlkwas more a joke about "PBR" as in Pabst Blue Ribbon17:17
*** hogepodge has quit IRC17:18
openstackgerritSergey Lukjanov proposed a change to openstack-infra/reviewstats: Add storyboard project  https://review.openstack.org/5697217:18
dstufftjlk: :]17:18
*** julim has joined #openstack-infra17:19
*** senk has quit IRC17:20
openstackgerritKevin proposed a change to openstack-infra/jenkins-job-builder: publishers: added runAfterFinalised to copy-to-master  https://review.openstack.org/5698917:20
openstackgerritKevin proposed a change to openstack-infra/jenkins-job-builder: Added template-test command & run-after-finalized  https://review.openstack.org/5499317:25
openstackgerritKhai Do proposed a change to openstack-infra/jenkins-job-builder: update doc and add new JJB unit tests  https://review.openstack.org/5671517:26
*** pblaho has quit IRC17:26
*** pblaho has joined #openstack-infra17:27
fungittx: did you want to push signed folsom-eol tags on the tip of stable/folsom for that list of projects i posted to the openstack-stable-maint ml last week, or should someone else do it (one of the other stable release managers)?17:27
*** xeyed4good has quit IRC17:27
*** guohliu has joined #openstack-infra17:28
*** pblaho has quit IRC17:31
*** pblaho has joined #openstack-infra17:31
*** jcoufal has quit IRC17:31
*** pblaho has quit IRC17:32
*** pblaho has joined #openstack-infra17:33
*** dprince has joined #openstack-infra17:33
*** pblaho has quit IRC17:33
*** dizquierdo has quit IRC17:36
*** dizquierdo has joined #openstack-infra17:37
*** flaper87 is now known as flaper87|afk17:37
*** dizquierdo has quit IRC17:38
*** dstanek has quit IRC17:39
*** jpich has quit IRC17:40
openstackgerritRussell Bryant proposed a change to openstack-infra/reviewstats: Add storyboard project  https://review.openstack.org/5697217:42
salv-orlandoI do apologise for disturbing, but if I can get another infra-core reviewing this: https://review.openstack.org/#/c/56722/ we might be able to make some serious progress towards neutron parallel testing (doing that locally now, but it would be useful testing against the gate as well)17:42
openstackgerritA change was merged to openstack-infra/reviewstats: Add storyboard project  https://review.openstack.org/5697217:42
*** yassine has quit IRC17:45
*** hogepodge has joined #openstack-infra17:48
*** dangers is now known as danger_fo17:49
*** osanchez has quit IRC17:51
*** markwash has joined #openstack-infra17:51
*** danger_fo is now known as dangers17:51
salv-orlandothanks fungi!17:52
fungisalv-orlando: no problem. i definitely don't want to stand in the way of neutron testing/gating improvements17:52
*** johnthetubaguy has quit IRC17:54
clarkbmorning17:55
*** pmathews has joined #openstack-infra17:55
clarkbpoor jenkins0117:56
olaphmorning clarkb17:56
openstackgerritA change was merged to openstack-infra/config: Add job for neutron parallel testing w/tenant isolation  https://review.openstack.org/5672217:56
openstackgerritKhai Do proposed a change to openstack-infra/jenkins-job-builder: fix jjb scp publisher example  https://review.openstack.org/5699817:57
*** dcramer_ has quit IRC17:58
fungiclarkb: now that the deleting nodes are mostly cleared, i'm starting to become less and less convinced that the available count is correct either17:58
*** harlowja has joined #openstack-infra17:59
pleia2good morning17:59
clarkbfungi: interesting17:59
*** derekh has quit IRC18:00
clarkbfungi: is it convinced it has more nodes than actually exist?18:00
openstackgerritKhai Do proposed a change to openstack-infra/jenkins-job-builder: fix jjb scp publisher example  https://review.openstack.org/5699818:00
fungiclarkb: i suspect so--haven't checked yet. also jenkins01 has hundreds of what i assume are old devstack slaves in an offline state18:01
fungiwhereas jenkins02 has maybe half a dozen18:01
clarkbfungi: may need to manually GC those. I can help with that if we determine that nodepool has forgotten about them18:01
fungii wanted to wait until the deletes finished so i could confirm18:02
clarkbwfm18:02
fungii think those are just about wrapped up now18:02
*** Dafna has left #openstack-infra18:04
markwashcan I use gerrit to track two major-release series of python-glanceclient? Like, suppose I want to create a 1.0 branch and start making backwards incompatible changes. . .18:06
markwashmeanwhile maintaining a 0.0 branch18:06
*** sarob has joined #openstack-infra18:07
clarkbmarkwash: you can create branches in gerrit like that and tag commits on them with arbitrary tags. So gerrit won't enforce what you describe but would allow it. Doing so would change glanceclient's release process compared to the other clients (not necessarily a bad thing but something to consider)18:10
*** gyee has joined #openstack-infra18:10
clarkbfungi I know you have thought a lot about python client release and compatibility schemes. any thoughts on ^18:10
markwashclarkb: so the way to get gerrit to track another branch for python-glanceclient is just to push a branch to the gerrit remote? and who will end up with core powers on that branch?18:11
fungiclarkb: markwash: well, getting a release management opinion on it would help (ttx?)18:11
*** alexpilotti has quit IRC18:11
*** melwitt has joined #openstack-infra18:12
fungibut also, keep in mind that it multiplies what testing we do, and we'd need to decide how the two independent versions of glanceclient would be tested adequately18:12
pleia2zaro: thanks for the jjb docs updates \o/18:12
markwashfungi: yeah, it seems there are lot of complicating implications18:12
*** nsaje has quit IRC18:12
fungimarkwash: clarkb: and then there's the vulnerability management concern of having to track backports for security fixes which affect both branches18:13
*** nsaje has joined #openstack-infra18:13
clarkbfungi: I knew there was a reason I asked for your thoughts :)18:13
clarkbbut yeah changing snowflakes usually comes with all sorts of fun stuff to work through18:13
fungithere are almost certainly more issues which aren't springing to mind just yet18:14
clarkbmarkwash: gerrit ACLs are weird and in this case you would create the branch directly in gerrit. the release mgmt team would have rights to non master branches iirc18:14
clarkbso more reason to bring this up with ttx18:15
zaropleia2: np. i've actually found a lot more of those.  it was driving me crazy because jjb doesn't tell you that anything is wrong.18:15
*** dstanek has joined #openstack-infra18:17
*** nsaje has quit IRC18:18
*** ArxCruz has quit IRC18:23
*** Loquacity has quit IRC18:26
*** Loquacity has joined #openstack-infra18:26
*** guohliu has quit IRC18:26
*** dstanek has quit IRC18:30
*** nsaje has joined #openstack-infra18:32
clarkbfungi: let me know if I can help with the jenkins situation. also https://review.openstack.org/#/c/56720/ for when jenkins is happy18:34
*** oubiwann has quit IRC18:35
*** oubiwann has joined #openstack-infra18:35
*** mrmartin has joined #openstack-infra18:36
fungiclarkb: deletes seem to have finished, so i'm going to manually gc jenkins01 offline devstack nodes starting from the bottom of the list and working my way up (since nodepool lists no "delete" state nodes on 01 now)18:36
*** Ryan_Lane has joined #openstack-infra18:37
clarkbfungi: I can start at the top18:37
fungiclarkb: awesome--thanks!18:37
openstackgerritBrant Knudson proposed a change to openstack-infra/reviewstats: Morgan Fainberg is Keystone core  https://review.openstack.org/5700818:39
openstackgerritA change was merged to openstack-infra/reviewstats: Morgan Fainberg is Keystone core  https://review.openstack.org/5700818:42
openstackgerritVijendar Komalla proposed a change to openstack/requirements: Update python-troveclient version  https://review.openstack.org/5213718:42
openstackgerritHunter Haugen proposed a change to openstack-infra/config: Pass $mysql_password through to gerrit class  https://review.openstack.org/5630618:43
*** alcabrera is now known as alcabrera|afk18:49
*** pabelanger has quit IRC18:54
clarkbfungi: this is quite the manual task. I probably should learn to use the jenkins api for things like this18:54
*** pabelanger has joined #openstack-infra18:54
mordredAlex_Gaynor: wow. that's pretty crazy18:55
Alex_Gaynormordred: 22nd most downloaded package of all time, it'll overtake psycopg2 in a few days18:55
*** hogepodge has quit IRC18:56
*** alcabrera|afk is now known as alcabrera18:57
*** Guest79045 is now known as Vivek18:57
*** Vivek has quit IRC18:57
*** Vivek has joined #openstack-infra18:57
*** hogepodge has joined #openstack-infra18:57
*** rnirmal has quit IRC18:58
openstackgerritA change was merged to openstack-infra/jenkins-job-builder: fix jjb scp publisher example  https://review.openstack.org/5699819:00
*** vipul is now known as vipul-away19:01
*** sarob has quit IRC19:01
*** sarob has joined #openstack-infra19:01
*** weshay has joined #openstack-infra19:04
*** sarob has quit IRC19:06
clarkbAlex_Gaynor: mordred: that almost makes me wonder if our test slaves are hitting pypi.python.org :)19:06
mordredclarkb: makes me wonder something similar19:08
openstackgerritKhai Do proposed a change to openstack-infra/jenkins-job-builder: update doc and add new JJB unit tests  https://review.openstack.org/5671519:08
Alex_GaynorI think maybe people just use this openstack thing a lot19:08
clarkbfungi: I left those slaves in the az1 670k series there because those seem new enough that nodepool should know about them19:10
clarkbfungi: but they aren't going away very quickly either19:10
fungiclarkb: okay19:10
fungiotherwise, seems we've met in the middle19:10
*** vipul-away is now known as vipul19:10
EmilienMhi, i would like a review on Neutron Metering agent in devstack > https://review.openstack.org/#/c/48042/19:10
*** hogepodge has quit IRC19:11
mordredAlex_Gaynor: bah. I hear CloudStack is where it's at19:11
clarkbEmilienM: https://review.openstack.org/#/admin/groups/50,members is the group of folks that can give you +2s19:11
*** hogepodge has joined #openstack-infra19:12
fungiEmilienM: also the devstack developers tend to be more numerous on the #openstack-qa channel19:12
clarkbmordred: now that they have a code review system and one apparent way to submit patches we should watch out :) (seriously though good for them changing the code submission process)19:12
EmilienMclarkb: fungi : i already asked on qa channel, this patch takes a while to merge, i don't understand19:12
openstackgerritMatt Riedemann proposed a change to openstack-infra/elastic-recheck: Add query for bug 1251516  https://review.openstack.org/5701619:12
mordredclarkb: I had no idea that they'd done any of this19:12
uvirtbotLaunchpad bug 1251516 in tempest "test_get_console_output fails with "Console output was empty"" [Undecided,New] https://launchpad.net/bugs/125151619:12
*** senk has joined #openstack-infra19:13
clarkbmordred: I checked recently because I was complaining to someone about how bad it was (email patch, attach to jira, pull request) and was told that isn't the case anymore and sure enough they use reviewboard iirc19:13
EmilienMclarkb: can i add you as reviewer ?19:13
clarkbEmilienM: sure, but I can't help approve that patch19:13
mordredwell good or them19:13
mordredfor19:13
clarkbI don't have +219:13
EmilienMok19:14
clarkbEmilienM: really you need another devstack-core to take a look at it19:14
EmilienMclarkb: thx :)19:14
EmilienMi'll ping them19:14
clarkbfungi: it is really weird that jenkins01 has so few slaves now. I guess we need to wait for nodepool to shift pool growth from jenkins02 to jenkins0119:15
fungiclarkb: perhaps. i think we may also be waiting for nodepool to notice that all those slaves on jenkins01 weren't really available19:16
anteayajog0 around today?19:16
fungiclarkb: since there are plenty of devstack jobs queued but the test nodes graph claims ~150-200 nodes available19:16
*** herndon has quit IRC19:17
*** derekh has joined #openstack-infra19:19
clarkbfungi: jenkins02 has a large number of offline nodes too, though most of those look recent19:19
clarkbI wonder if we should kick nodepool19:19
jog0anteaya: yup19:20
clarkboooohhh you know what19:20
clarkbI wonder if hpcloud is hating us right now19:20
*** jamesmcarthur has quit IRC19:20
*** hogepodge has quit IRC19:21
clarkbthe rackspace nodes seem to be coming up and doing work but there aren't as many hpcloud nodes as I would expect19:21
*** jamesmcarthur has joined #openstack-infra19:21
clarkbfungi: mordred: yup there seem to be some fun stacktraces in the nodepool debug log. ssh timeouts and RAM limits exceeded19:23
clarkbI wonder if we have attempted to allocate a bunch of servers that are not coming up within the timeout and as a result are hitting quota limits19:23
anteayajog0: prepping for the neutron meeting and the tempest topic: https://wiki.openstack.org/wiki/Network/Meetings#Neutron_Tempest_.28anteaya.2919:23
fungiclarkb: possible19:23
anteayacan we discuss this in -neutron so we have current info, using logstash?19:23
fungialso possible some of those entries are me running nodepool delete too quickly and hitting api rate limit throttles19:24
jog0anteaya: sure19:24
*** dcramer_ has joined #openstack-infra19:24
fungiclarkb: i'm in favor of giving nodepool a restart after what we've just put it through. should be nondisruptive19:25
*** jerdfelt has joined #openstack-infra19:25
clarkbfungi: it shouldn't hurt but I don't think it will help either19:25
mordredclarkb: wow. that's fantastic19:26
clarkbinstead I think we may want to bump the ssh timeout and see if that fixes the quota problems19:26
jerdfeltclarkb: hi. do i remember correctly that on Friday you said rackspace already had a third-party testing account in gerrit?19:26
fungii think we need to compare the nodes it thinks are available with ones which actually exist in jenkins19:26
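
The comparison fungi describes amounts to a set difference between two node lists. A minimal sketch, assuming the node names have already been collected from each side by some other means; compare_node_views and the sample names are hypothetical, and no nodepool or Jenkins API is called here.

```python
def compare_node_views(nodepool_ready, jenkins_online):
    """Compare the nodes nodepool believes are ready with the slaves
    Jenkins actually has online. Inputs are iterables of node names."""
    ready = set(nodepool_ready)
    online = set(jenkins_online)
    return {
        # nodepool counts these as available, but Jenkins cannot use them
        "phantom": sorted(ready - online),
        # Jenkins has these, but nodepool has lost track of them
        "untracked": sorted(online - ready),
    }


if __name__ == "__main__":
    # Placeholder node names for illustration only.
    report = compare_node_views(
        ["node-1", "node-2", "node-3"],
        ["node-2", "node-3", "node-4"],
    )
    print(report)
```

Nodes in the "phantom" list would inflate the available count on the graph, while "untracked" ones are candidates for manual cleanup.
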
*** dstanek has joined #openstack-infra19:26
clarkbjerdfelt: I am not sure if mikal has that configured yet, but I believe it was/is his eventual goal19:26
jerdfeltclarkb: i was trying to track down the account information. but i didn't see anything at https://review.openstack.org/#/admin/groups/91,members that looked like rackspace19:27
fungiclarkb: the "delete" status node count associated with jenkins02 is falling, so i suspect nodepool is working through those okay19:27
jerdfelti've also sent an email to mikal to find out what he knows and has done19:27
*** blamar has quit IRC19:28
*** sarob has joined #openstack-infra19:28
clarkbfungi: cool. I see more hpcloud nodes on jenkins02 now19:28
clarkbjerdfelt: they may not have gotten that far then19:29
clarkbfungi: still not many on jenkins01, may still be working through the shift of load19:29
fungiclarkb: i wonder if it would be wise to put jenkins01 in shutdown mode, then nodepool delete all nodes associated with it, then bring it back up and do the same on jenkins0219:29
jerdfeltclarkb: is it possible that it may not be listed in that gerrit group or is that group accurate?19:30
*** hogepodge has joined #openstack-infra19:31
clarkbjerdfelt: the group should be accurate19:31
fungijerdfelt: that group is how those accounts get permission to do a verify column vote19:31
*** dstanek has quit IRC19:31
jerdfeltclarkb, fungi: cool, thanks. i'll get to the bottom of what mikal has done. thanks!19:32
fungijerdfelt: jhesketh may also have some awareness of what's going on there, not sure19:32
*** jamesmcarthur has quit IRC19:33
jerdfeltfungi: i'll follow up with him too19:34
clarkbfungi: I think that may be a good idea. There is a large discrepancy between ready nodes in nodepool and available slaves in jenkins19:35
fungiclarkb: since jenkins01 isn't pulling its weight yet anyway, let's hit it fist i guess?19:36
fungier, hit it first19:36
clarkbfungi: yup19:36
fungiokay, putting jenkins01 into shutdown mode. i fear the longer things run at the current limited capacity the bigger the pile-up in zuul will become19:38
openstackgerritKhai Do proposed a change to openstack-infra/jenkins-job-builder: update doc and add new JJB unit tests  https://review.openstack.org/5671519:38
clarkbya best to start fixing this now19:38
* mordred in meeting - only partially useful - ping me if I can be of help19:38
fungionce jenkins01 finishes quiescing, i'll nodepool delete any of its remaining nodes nodepool knows about and then manually delete any straggler d-g nodes through the jenkins ui, then fire it back up19:39
clarkbsounds good19:40
*** ArxCruz has joined #openstack-infra19:40
jog0mordred: do you want to start a ML thread about the discussion over the weekend (re: that it's bad that we rely on recheck so much)?19:40
mordredjog0: kinda. I also kinda want jeblair to tell me I'm dumb a little bit first19:40
*** jcooley_ has joined #openstack-infra19:41
jog0mordred: when does jeblair return?19:41
*** dcramer_ has quit IRC19:41
mordredjog0: he arrived back in the US a few hours ago19:42
mordredso, hopefully tomorrow?19:42
*** senk has quit IRC19:43
openstackgerritA change was merged to openstack-infra/elastic-recheck: Add bug 1251920 to our list of bugs we suggest for recheck  https://review.openstack.org/5688319:43
uvirtbotLaunchpad bug 1251920 in nova "Tempest failures due to failure to return console logs from an instance" [Medium,Confirmed] https://launchpad.net/bugs/125192019:43
jog0mordred: cool19:45
clarkbman that zuul backlog is impressive19:45
jog0clarkb: whoa, just looked. what happened?19:45
clarkbjog0: jenkins01 fell over and we are still dealing with the fallout19:46
clarkbfungi: I don't see any hpcloud nodes running jobs on jenkins01, you can probably safely start deleting hpcloud nodes on jenkins0119:46
jog0clarkb: ahhh19:46
fungiclarkb: good call. doing now19:46
*** senk has joined #openstack-infra19:48
*** sarob has quit IRC19:50
*** sarob has joined #openstack-infra19:50
zaromordred: have time for this one? https://review.openstack.org/#/c/5682319:50
*** afazekas has joined #openstack-infra19:51
*** SergeyLukjanov has quit IRC19:52
*** sarob_ has joined #openstack-infra19:52
*** sarob has quit IRC19:52
*** jamesmcarthur has joined #openstack-infra19:52
fungiclarkb: i split the jenkins01 hpcloud node list 5 ways and have parallel deletes running on them19:53
fungiso should hopefully finish fairly quickly19:54
*** dcramer_ has joined #openstack-infra19:54
*** hogepodge has quit IRC19:54
*** nsaje has quit IRC19:55
*** nsaje has joined #openstack-infra19:55
*** dstanek has joined #openstack-infra19:58
*** nsaje has quit IRC20:00
*** mestery_ has joined #openstack-infra20:01
*** sarob_ has quit IRC20:02
jamesmcarthurfungi: thanks for the feedback on https://review.openstack.org/#/c/53644/  Just an FYI Sebastian is on vacation for the next week, so it might be a bit before we get back to you to address your concerns.  I think the Redis version was something Sebastian addressed earlier, but I'll have to discuss with him to get the details.20:02
*** sarob has joined #openstack-infra20:02
*** mestery has quit IRC20:02
*** dstanek has quit IRC20:03
clarkbfungi: down to 25 jenkins01 hpcloud nodes20:03
*** mgagne has quit IRC20:03
*** nsaje has joined #openstack-infra20:04
*** nsaje has quit IRC20:04
fungijamesmcarthur: right, if memory serves we were looking to switch it to use the ubuntu packages for redis, but the current puppet module in that change seems to want to install redis from source instead i think20:04
*** mgagne has joined #openstack-infra20:04
*** mgagne has quit IRC20:04
*** mgagne has joined #openstack-infra20:04
*** nsaje has joined #openstack-infra20:05
jamesmcarthurfungi: yeah, i believe that's correct20:05
*** jcooley_ has quit IRC20:05
fungijamesmcarthur: but anyway, mostly i just wanted to give the current state of the change an initial test drive to spot where it's broken, so we can start to iterate through it once he's back at the wheel there20:06
clarkbfungi: the rackspace nodes are done running tests on jenkins01 too20:06
jamesmcarthurfungi: roger that. thank you sir!20:06
fungiclarkb: cool, i'll start popping them as well20:06
*** rfolco has quit IRC20:06
*** sarob has quit IRC20:07
*** mestery_ is now known as mestery20:08
*** senk has quit IRC20:09
*** sandywalsh has quit IRC20:09
*** nsaje has quit IRC20:09
clarkbfungi: there appears to be one stubborn hpcloud node associated with jenkins0120:09
clarkboh it is gone now \o/20:09
fungiclarkb: i saw it and added it to the pile for this round20:10
*** dcramer_ has quit IRC20:13
*** senk has joined #openstack-infra20:13
openstackgerritMatt Riedemann proposed a change to openstack-infra/elastic-recheck: Add query for bug 1251512  https://review.openstack.org/5703820:17
uvirtbotLaunchpad bug 1251512 in tempest "test_get_console_output fails in gate with MismatchError" [Undecided,New] https://launchpad.net/bugs/125151220:17
*** rnirmal has joined #openstack-infra20:18
clarkbfungi: down to 8 rax slaves20:20
fungistill a couple delete threads going but i think they're almost done20:21
*** sandywalsh has joined #openstack-infra20:21
*** jgrimm has quit IRC20:21
*** sdake_ has quit IRC20:21
*** yolanda has quit IRC20:22
*** herndon_ has joined #openstack-infra20:24
fungithis seems to have done wonders for the available nodes accuracy on the graph now20:26
*** jamesmcarthur has quit IRC20:27
*** dcramer_ has joined #openstack-infra20:27
openstackgerritEdward Raigosa proposed a change to openstack-infra/config: Make pip install from upstream better  https://review.openstack.org/5142520:27
*** dprince has quit IRC20:30
*** jamesmcarthur has joined #openstack-infra20:31
openstackgerritJoe Gordon proposed a change to openstack-infra/devstack-gate: Run df after gate  https://review.openstack.org/5629820:32
clarkbfungi: woot20:33
clarkbfungi: there is one extra node on jenkins0120:33
clarkbthe UI looks fine though so just needs to be killed in nodepool20:33
*** hogepodge has joined #openstack-infra20:35
*** hashar has joined #openstack-infra20:36
*** hogepodge has quit IRC20:36
clarkbfungi: woot we are down to 0 now20:36
clarkbfungi: should we reenable jenkins01?20:36
*** derekh has quit IRC20:36
*** nsaje has joined #openstack-infra20:37
*** nsaje has quit IRC20:37
*** nsaje has joined #openstack-infra20:38
fungiyeah, all clear, deletes finished20:38
fungireenabling now20:38
fungionce nodepool builds it back up to a reasonable level of use, i'll do the same to jenkins0220:39
clarkb++ I am going to run out and grab lunch and stuff quickly while we wait for things to shift20:40
fungik20:40
*** nsaje has quit IRC20:42
*** sarob has joined #openstack-infra20:44
*** jcooley_ has joined #openstack-infra20:45
*** jcooley_ has quit IRC20:46
fungiwe've got 70 nodes on jenkins01 now20:53
*** mrmartin has quit IRC20:53
fungior at least associated with it, probably still all building since i don't see them in its webui just yet20:53
*** Ryan_Lane has quit IRC20:55
fungilooks like they're starting to appear in the webui and run jobs now20:55
*** Ryan_Lane has joined #openstack-infra20:55
anteayait has been a colourful test node graph day20:56
mordredclarkb, fungi: thank you for fixing that - sorry I've been absentish20:56
*** hogepodge has joined #openstack-infra20:57
anteaya-neutron meeting is about to start20:58
*** denis_makogon_ has joined #openstack-infra20:59
*** marun has quit IRC20:59
*** marun has joined #openstack-infra21:00
*** thedodd has joined #openstack-infra21:01
*** amotoki has joined #openstack-infra21:02
*** denis_makogon_ is now known as denis_makogon21:03
mikalSo... I'd really like to try and sneak https://review.openstack.org/#/c/56158/ in soon as it will unblock some stuff we want to do. Who do I need to beg to do jeepyb reviews?21:03
clarkbmikal: I can take a look since you are so important in openstack21:06
mikalclarkb: "kind of big in openstack" you mean21:07
mikalclarkb: and thanks21:07
fungimikal: now you just need a shirt which says "i'm kind of a big deal in openstack"21:08
mikalfungi: that is already in progress...21:08
mikalI was thinking I should have one for the next summit21:09
pabelangerfungi, I thought russellb was the only big deal?21:14
russellbo.O21:14
openstackgerritA change was merged to openstack-infra/elastic-recheck: Add query for bug 1251512  https://review.openstack.org/5703821:15
uvirtbotLaunchpad bug 1251512 in tempest "test_get_console_output fails in gate with MismatchError" [Undecided,New] https://launchpad.net/bugs/125151221:15
*** dstanek has joined #openstack-infra21:17
*** julim has quit IRC21:19
lifelessmordred: jenkins hates you https://review.openstack.org/#/c/54152/521:19
fungii suspect it's mutual21:19
mikalmriedem / jog0: that bug (1251512) looks like a special case of 1251920 to me21:20
mikalI've been seeing that failure and marking it as 1251920 when I see it21:20
mikalUntil we can get better console log debugging in the libvirt driver21:20
fungii think a good litmus test for jenkins02 shutdown/cleanup will be once demand starts to fall for centos6 slaves, since that'll mean that we're mostly blocking on devstack slave capacity at that point21:25
*** hogepodge has quit IRC21:25
*** ilyashakhat has quit IRC21:26
fungiany earlier and we'll be hurting gate throughput by holding up jobs which could have used unit test slaves associated with jenkins0221:26
*** ilyashakhat has joined #openstack-infra21:26
*** senk has quit IRC21:27
*** jhesketh has quit IRC21:27
fungiand after that, i'll take a look at the state of alien nodes which look like something nodepool once created. we might have some of those wasting part of our quotas, i suspect21:29
*** alcabrera has quit IRC21:30
fungiyeah, looks like a couple dozen. maybe i'll just go ahead and clean those up now21:32
clarkbfungi: I don't think we need to worry too much about waiting that long21:33
clarkbbetter to get this done with than drag it out imo21:33
lifelesszuul is a bit sick atm right ?21:35
fungilifeless: it's mainly better now, just catching up (and there are a few "stuck" jobs i think from earlier when jenkins01 went awol)21:36
fungier, stuck changes i mean21:36
lifelesskk thanks21:36
lifelesslots of gate-tempest-devstack-vm-full: queued21:36
lifelessthat means we've hit our quota right ?21:37
clarkblifeless: sort of, jenkins01 went sideways and nodepool added a bunch of capacity that the jenkins slaves can't see21:37
*** jamesmcarthur has quit IRC21:37
clarkbso we are at "capacity" according to nodepool but not jenkins. we are slowly fixing that21:37
fungilifeless: when jenkins01 tanked, it left nodepool thinking we had a lot of available machines even though we didn't. that's cleaned up now and new slaves are being spun up and put to work as fast as we can manage21:37
lifelessgotchya21:38
lifelessis nodepool single threaded in that regard?21:38
lifelessor concurrent?21:38
clarkbconcurrent21:38
clarkbwell its python threads21:39
fungihowever our providers also limit how quickly we can make api calls21:39
openstackgerritMichael Still proposed a change to openstack-infra/elastic-recheck: Tell people to do a recheck  https://review.openstack.org/5611821:39
fungiand we further throttle that so that we don't get smacked down by their rate limiting, i believe21:40
clarkbfungi: I would say put jenkins02 in shutdown mode now so that load starts shifting, the currently running jobs will take a bit of time to get through anyways21:40
fungiwill do21:40
*** jcooley_ has joined #openstack-infra21:43
openstackgerritVijendar Komalla proposed a change to openstack/requirements: Update python-troveclient version  https://review.openstack.org/5213721:44
*** blamar has joined #openstack-infra21:50
clarkbmikal: reviewed21:52
*** ryanpetrello has quit IRC21:53
*** wenlock has quit IRC21:54
*** wenlock has joined #openstack-infra21:55
*** emagana has joined #openstack-infra21:56
clarkbfungi: the results queue in zuul is a little scary (we should keep an eye on that and make sure it falls at some point)21:56
clarkbI mean it should right? we don't have enough slaves to run all of the tests :)21:56
*** jhesketh has joined #openstack-infra21:57
*** melwitt has quit IRC21:57
reedthe core-non-spider discussion gets more and more complicated21:57
*** melwitt has joined #openstack-infra21:57
zaromordred: hope this works for you.. https://review.openstack.org/#/c/5676021:58
*** vipul is now known as vipul-away21:59
clarkbfungi: it just fell by about 200 events21:59
*** wenlock has quit IRC22:00
*** wenlock has joined #openstack-infra22:00
clarkbfungi: only rax slaves are running jobs on 02 now. if you want to start deleting the hpcloud nodes22:00
*** nati_uen_ has joined #openstack-infra22:00
fungii am doing so now22:01
mriedemmikal: yeah, different failures though22:01
mriedemin one case the output is empty, in one it's not the number of lines expected22:01
clarkbreed: which discussion? I am failing at context switching right now22:01
reedthe badly titled "Re: [Openstack] [Foundation Board] Resolutions from the Technical Committee"22:02
*** mfer has quit IRC22:02
*** dkliban has quit IRC22:02
zaroclarkb, fungi: there's a major fix to jenkins that reduces the number of threads by 75%.  do you think we should upgrade just for that?22:03
clarkbzaro: I think we should, but I don't think I want to deal with that while sorting out the nodepool problems22:03
fungizaro: i definitely think we should discuss it, and probably test it out on jenkins-dev at least22:03
clarkbzaro: I think we should spend some time this week and attempt to upgrade if everyone is happy with it though22:03
* fungi agrees22:03
clarkbin fact I can upgrade jenkins-dev right now if we want22:04
*** melwitt has quit IRC22:04
*** nati_ueno has quit IRC22:04
zarois upgrading jenkins-dev something i can help with?22:04
clarkblet me do that so that we have that step out of the way22:04
*** melwitt has joined #openstack-infra22:04
clarkbzaro: you can follow along over my shoulder22:04
zaroclarkb: coolio.22:04
clarkbzaro: pleia2: the basic process isn't too exciting. stop puppet on jenkins-dev, copy jenkins war (it is in /usr/share/jenkins) to a version specific war that we can use to roll back if necessary, apt-get update, apt-get install jenkins and I think that restarts the service22:05
clarkbthen restart puppet, really the only thing I do that is special is backup the old war for super easy revert22:06
clarkbI am sure apt magic can make that easy enough as well but apt magic is dark voodoo22:06
*** ryanpetrello has joined #openstack-infra22:06
fungiall hpcloud devstack nodes associated with jenkins02 have been nodepool deleted22:10
*** vipul-away is now known as vipul22:13
clarkboh stop jenkins on jenkins-dev before updating it because it doesn't always stop cleanly22:13
reedoh, more opinions on the mailing list split!22:13
clarkbI am stopping jenkins on jenkins-dev now22:13
clarkbfungi: ^22:13
fungiclarkb: sounds good22:13
clarkbfungi: pleia2: zaro and I have noted a couple things from the release notes of jenkins that may bite us. There are new REST API permissions around creating, deleting and updating nodes. and the ssh-slaves and credentials plugins which are bundled in jenkins have been upgraded a couple times22:15
fungitoo fun22:16
clarkbtrying to sort out how to run the jenkins-test script in the config repo22:21
fungii've not done it22:21
*** sarob_ has joined #openstack-infra22:22
clarkbit looks simple, just trying to sort out what sort of config it wants22:23
*** CaptainTacoSauce has quit IRC22:23
*** sarob has quit IRC22:25
*** sarob_ has quit IRC22:27
fungijenkins02 is idle now. nodepool deleting the remaining nodes from it22:31
mriedemare we using different cirros images lately in the full gate jobs?22:31
clarkbmriedem: I think we cache them once a day. If upstream cirros has updated we would get that within 24 hours22:32
mriedemclarkb: ok, looks like the latest cirros release was february 8th, so i guess nothing recent there22:34
mriedemhow about libvirt?22:34
mriedemdo we use 1.1.4?22:35
clarkbfungi: that test script relies on old d-g and zuul which had interaction with jenkins. need to rewrite it to use nodepool libs instead. I am not going to worry about that now as there are more pressing things happening. Starting puppet on jenkins-dev now22:35
fungik22:36
clarkbmriedem: libvirt we get from cloud archive iirc22:37
*** dangers is now known as danger_fo_away22:37
clarkbfungi: down to two nodes on 0222:37
mriedemclarkb: ok, trying to figure out what if anything could be recently changing to cause so many test_get_console_output failures lately, it got real flaky in the last week or so22:38
fungiclarkb: yeah, just one delete thread still running and on its last entry22:38
*** jcooley_ has quit IRC22:38
clarkbmriedem: it is possible that libvirt upgraded for the havana release in cloud archive22:39
fungiclarkb: jenkins02 nodepool list is now empty, and the aliens-list for devstack-precise-<providers> has been manually cleaned out as well22:39
*** nsaje has joined #openstack-infra22:39
*** mriedem has quit IRC22:40
fungiready for me to start jenkins02?22:40
clarkbfungi: woot, go for it22:40
fungihopefully things will start to pick up again now once it gets some new nodes22:40
*** ryanpetrello has quit IRC22:44
*** xeyed4good has joined #openstack-infra22:44
*** nsaje has quit IRC22:44
clarkbfungi: then if zuul catches up we can restart it to have it forget about those changes that have been around for 10 hours22:46
*** lcestari has quit IRC22:47
*** loq_mac has joined #openstack-infra22:47
*** julim has joined #openstack-infra22:49
clarkbfungi: so uh22:50
openstackgerritKhai Do proposed a change to openstack-infra/jenkins-job-builder: fix jjb job template documenation  https://review.openstack.org/5706222:50
clarkbwe seem to have run into the same problem again22:50
clarkbmaybe nodepool really does need to be kicked22:51
*** rcleere has joined #openstack-infra22:52
clarkbfungi: I think the event stream from jenkins01 may not be working as expected22:54
clarkbfungi: looking at logstash to see if I can find any jenkins01 jobs22:54
clarkbfungi: I think that is the problem22:56
clarkbnetstat shows tcp connections are established though22:57
clarkbfungi: I am getting the events locally22:59
*** pcm_ has quit IRC23:00
mordredzaro: thank you23:00
clarkbso the event stream is fine, I think it must be on the client side23:00
mgagnezaro: which job/script is testing import order in python projects? I suspect JJB isn't tested against it.23:00
*** julim has quit IRC23:01
*** thomasem has quit IRC23:02
*** fifieldt has joined #openstack-infra23:03
*** amotoki has quit IRC23:04
clarkbfungi: I think we should restart nodepoold23:05
clarkbthen hit everything with a hammer again (though maybe more aggressively and deal with the fallout)23:05
zaromgagne: import order?  not sure what you mean.23:06
* clarkb goes ahead and restarts nodepoold23:07
mgagnezaro: I think it's a lint/pep8 check23:07
zaromgagne: while you are here.  can you please take another look at https://review.openstack.org/#/c/5671523:07
mgagnezaro: "import sys\nimport os" would throw an error about import not being alphabetically ordered.23:07
fungimmm23:08
clarkbfungi: it has been restarted23:08
fungiokay23:08
clarkbfungi: I think what was happening is gearman plugin put the slaves in offline mode23:08
zaromgagne: hmm.. i think pep8 is run against jjb project.  let me check.23:08
clarkbfungi: but nodepool never saw the nodes as being used because the finished events were not coming through23:09
clarkbmgagne: zaro: pep8 is run but not hacking23:09
mgagnezaro: or is it hacking ?23:09
mgagnezaro: right23:09
mgagneclarkb, zaro: how hard would it be to add it?23:09
clarkbmgagne: depends on how unclean jjb is :)23:10
fungiclarkb: makes sense. and explains the giant mass of "available" nodes doing nothing but offline in jenkins23:10
clarkbchanging the tox.ini is simple, but all of the failures need to be fixed too23:10
zaromgagne: not difficult to add, but what clarkb said.23:10
clarkbfungi: so now I think we should put jenkins01 in shutdown mode again, wait for it to finish then kill all of its nodes23:10
clarkbfungi: or just start manually pewpewing nodes on that host23:10
fungiclarkb: we ought to be able to nodepool delete them this time, i think23:10
clarkbfungi: ok23:10
mgagnezaro: would fixing hacking failures be worth it?23:10
* fungi ponders23:11
clarkbfungi: it looks like doing hpcloud is safe23:11
mgagnezaro: or we add hacking as a non-voting job until failures are fixed?23:11
clarkbbut not rax23:11
clarkbmgagne: zaro: I would just do it in one go23:11
zaromgagne: what did you want from hacking?23:11
clarkbrather than wait for all failures to be fixed23:11
mgagnezaro: import order is bothering me :D23:12
fungiclarkb: i think all the "ready" nodes in nodepool could just be blown away at this point. checking that assumption now23:12
*** dkranz has quit IRC23:12
*** hogepodge has joined #openstack-infra23:12
zaromgagne: hmmm,  i think i can help with that. but don't want to do it myself since it doesn't bother me ;)23:12
fungimaybe not. 86 in a "building" state, 288 "ready"23:13
mgagnezaro: if someone can explain me how to enable hacking, I don't mind taking a look23:14
*** ^d is now known as ^demon|busy23:14
fungiand a lot more "used" in jenkins than nodepool seems to think23:14
*** beekneemech is now known as bnemec23:14
zaromgagne: first clone openstack-infra/config23:14
blamaranyone know off the top of their head if it's possible to escape braces ({}) in jjb templates so they don't get substituted? will dive into the code next…23:14
mgagnezaro: next: bootstrap whole CI infra on your laptop :D23:14
mgagneblamar: {{}}23:14
clarkbblamar: yes, use {{}}23:14
blamarty23:15
blamarty23:15
blamarall of you!23:15
fungiclarkb: so i think nodepool delete blindly on ready nodes is probably not safe, since i get the impression some of those may actually be doing work23:16
fungibest to go back through putting jenkins01 in shutdown i think23:16
zaromgagne: haha!  it's not that bad.23:16
clarkbfungi: ok23:16
fungithen i can safely delete any nodes associated with it (again)23:16
zaromgagne: look in modules/openstack_project/files/zuul/layout.yaml23:17
*** julim has joined #openstack-infra23:17
mgagnezaro: ok, the idea is to look for examples in existing projects =)23:17
clarkbfungi: when it is in shutdown mode and we are waiting you should review the havana jobs change :)23:18
zaromgagne: yup.  that's where all the projects are, there are already many that use hacking.  just add it to jjb.23:18
funginote to self. when a jenkins master goes toes-up, nodepool can lose track of the event stream and needs a restart23:18
morganfainbergdid we decide at the summit that 1 core for transifex is the general policy?23:19
clarkbfungi: fwiw it shouldn't do that, 0mq is supposed to retry on an interval and according to netstat it had an established connection23:19
morganfainbergi remember something along those lines23:19
clarkbpossible bug in 0mq or pyzmq23:19
clarkbmorganfainberg: that sounds right, you basically just want to have a sanity check that only translation files were updated instead of code and so on23:19
*** hogepodge has quit IRC23:20
morganfainbergclarkb, figured as much.  *pokes jenkins with a stick again*  we have one w/ like 33k changes... i personally want to have those go in a bit more frequently :P23:20
*** yamahata_ has quit IRC23:21
clarkbfungi: I think it is safe to start deleting hpcloud jenkins01 slaves23:22
*** datsun180b has quit IRC23:22
zaromgagne: arrg!!! good catch on that filename. i'm really bad with english.23:23
mgagnezaro: if you think I'm gooder than you ;)23:24
clarkbfungi: then if we finally get this all to settle out, I can restart zuul tonight once things have caught up assuming they catch up23:25
*** denis_makogon has quit IRC23:26
*** vipul is now known as vipul-away23:28
fungik23:29
*** yjiang5 has joined #openstack-infra23:29
fungideleting hpcloud nodes from jenkins01 now23:29
mgagneguys, finalised or finalized ?23:31
clarkbuse the zed look23:32
clarkb*luke wow I fail hard23:32
*** vipul-away is now known as vipul23:32
mgagnegoogle says 2.5M vs 13.2M results :O23:32
clarkbmordred: fungi: ordering the new hpcloud zone changes23:35
clarkbmordred: fungi: I think the one that adds az support to nodepool needs to go in first? then we can update the nodepool config23:35
clarkbdoes that sound right?23:35
pabelangermordred, reading your comment on https://review.openstack.org/#/c/56371/ always wondered why tox -edocs was never created23:37
pabelangervs people adding a Makefile23:37
fungimgagne: depends on which side of the atlantic you source your english23:38
mgagnefungi: is JJB british or american? :D23:38
clarkbit is both! I am pretty sure we don't nit pick those things23:38
mgagneclarkb: lets support both configs! :D23:39
clarkboh is this for the config itself? then I guess this matters in different ways23:39
clarkbfor docs I just go meh23:39
fungimgagne: well, he was written by americans but has a british name, so could go either way23:39
clarkbfungi: LinuxJedi wrote the first pass23:39
clarkband he speaks the wrong english23:39
fungid'oh, right!23:39
fungier, i mean, wrong!23:40
fungii would tend to gravitate to config options which don't suffer from alternate spellings, personally. pick a different word if possible23:40
*** boris-42 has joined #openstack-infra23:41
fungithough if it's mirroring a config option in jenkins, then do whatever it does i guess23:41
mgagnefungi: jenkins config is finalised, jjb doc/jjb config/commit is finalized ^^'23:42
fungiyeah, in this case i'd do what jenkins does23:42
mgagnealright, thanks!23:42
fungibut i'm just one voice, after all23:42
openstackgerritDan Prince proposed a change to openstack-infra/devstack-gate: Bump grenade master to use stable/havana.  https://review.openstack.org/5706623:42
*** vipul is now known as vipul-away23:44
*** vipul-away is now known as vipul23:44
*** weshay has quit IRC23:44
funginodepool delete is about halfway through the hpcloud nodes on jenkins0123:45
*** zjdriver has joined #openstack-infra23:46
fungias soon as the last couple rax jobs finish i'll start some threads deleting those too23:46
*** boris-42 has quit IRC23:47
*** thedodd has quit IRC23:51
openstackgerritEdward Raigosa proposed a change to openstack-infra/config: Make pip install from upstream better  https://review.openstack.org/5142523:51
*** vipul is now known as vipul-away23:54
*** vipul-away is now known as vipul23:54
*** vipul is now known as vipul-away23:55
*** vipul-away is now known as vipul23:55
wenlockmordred, that commit works behind our firewall .... if you can review it when you get some time23:55
openstackgerritKhai Do proposed a change to openstack-infra/jenkins-job-builder: use jjb tests as the examples  https://review.openstack.org/5706823:57
