Friday, 2013-10-11

clarkbmordred: I left a comment on the change, I think I managed to express my concern properly, but let me know if it isn't clear00:02
openstackgerritDan Bode proposed a change to openstack-infra/config: Add stackforge project: puppet_openstack_builder
clarkbmordred: if I +2 that do you want to babysit an approval?00:15
clarkbI need to drop offline here for a bit in order to get move stuff done prior to the weekend00:16
* clarkb AFKs to do that. I did +2 the change. I think it just needs a sanity check once in so that the next gerrit restart doesn't go sideways00:17
openstackgerritA change was merged to openstack-infra/config: Document how to delete a pad from Etherpad Lite
openstackgerritDan Bode proposed a change to openstack-infra/config: Add stackforge project: puppet_openstack_builder
openstackgerritA change was merged to openstack-infra/jenkins-job-builder: Add repo scm
openstackgerritA change was merged to openstack-infra/devstack-gate: Improve fallback to master branch
openstackgerritA change was merged to openstack-infra/devstack-gate: Revert "Revert "Enable q-vpn service""
openstackgerritA change was merged to openstack-infra/devstack-gate: Conditionally override PyPI for reqs integration
*** nosnos has joined #openstack-infra00:44
stevebakerhey, is there some permissions I need to review heat proposals on ?01:18
*** sarob has quit IRC02:15
*** guohliu has joined #openstack-infra02:43
lifelessmordred: you might care about
uvirtbotLaunchpad bug 1222306 in tripleo "can't install keystone with pypi mirror" [Medium,Triaged]02:57
lifelessmordred: or
uvirtbotLaunchpad bug 1222308 in tripleo "can't install cinderclient with pypi mirror" [Medium,Triaged]02:57
clarkblifeless: we really should require <0.8alpha or whatever the lowest 0.8 version is02:59
lifelessclarkb: of requests?03:00
clarkblifeless: sqlalchemy03:00
clarkbit's silly we can't just say <0.803:01
mordredah. fascinating03:01
mordredclarkb: we can with pip 1.403:01
lifelessclarkb: oh right, there are two distinct bugs03:01
clarkbmordred: right, but everyone else doesn't do new pip03:01
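A minimal sketch of the point above, assuming a simplified version ordering in which pre-releases sort before their final release. Under that ordering a plain "<0.8" bound still admits a 0.8 beta unless pre-releases are excluded, which pip 1.4 does by default. The helper names and the candidate list are hypothetical and purely illustrative; this is not pip's actual resolver.

```python
import re

# Pre-release tags sort below the final release (rank 3).
_PRE_RANK = {"a": 0, "b": 1, "rc": 2}
_FINAL_RANK = 3


def sort_key(version):
    """Return a comparable key for versions like '0.7.10' or '0.8.0b2'."""
    m = re.fullmatch(r"(\d+(?:\.\d+)*)(?:(a|b|rc)(\d+))?", version)
    if m is None:
        raise ValueError("unsupported version string: %r" % version)
    release = [int(part) for part in m.group(1).split(".")]
    # Drop trailing zeros so that 0.8 and 0.8.0 compare equal.
    while len(release) > 1 and release[-1] == 0:
        release.pop()
    if m.group(2):
        pre = (_PRE_RANK[m.group(2)], int(m.group(3)))
    else:
        pre = (_FINAL_RANK, 0)
    return (tuple(release), pre)


def is_prerelease(version):
    return sort_key(version)[1][0] < _FINAL_RANK


def best_below(candidates, upper, allow_prereleases):
    """Pick the highest candidate strictly below `upper`."""
    limit = sort_key(upper)
    matching = [
        v for v in candidates
        if sort_key(v) < limit and (allow_prereleases or not is_prerelease(v))
    ]
    return max(matching, key=sort_key) if matching else None


# Illustrative candidate list, not a real index listing.
candidates = ["0.7.9", "0.7.10", "0.8.0b2", "0.8.0"]
print(best_below(candidates, "0.8", allow_prereleases=True))
print(best_below(candidates, "0.8", allow_prereleases=False))
```

With pre-releases allowed the beta wins, which is why a bound such as "<0.8alpha" comes up as the workaround for older pip.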
lifelessmordred: yeah, I found this testing --offline with a fresh mirror03:01
lifelessmordred: so this is in the 'stuff we don't mirror in' category03:01
lifelessthe problem is global requirements doesn't list all the different requirements all releases of clients had03:02
mordredlifeless: yah.,n,z03:02
mordredclarkb: I think we have a script bug:
mordredlook at the commit message03:03
lifelessmordred: I'm not sure how that will fix the issue03:03
lifelessmordred: we're installing releases03:03
lifelessmordred: when we pip install nova trunk03:04
lifelessmordred: we get a release of python-neutronclient03:04
* mordred bats eyelashes03:04
lifelessmordred: if the current requirements rules don't bring down versions that match the requirements when the release of that client was cut03:05
mordredall of the projects should merge all of those changes and then cut releases03:05
clarkbmordred: thanks. I also made sure to document why that horrible read into a variable trick is used03:05
mordredhrm. ok03:05
mordredlifeless: I grok what you are saying03:05
clarkbbecause I keep forgetting why we did that and I don't want to have to remember03:05
lifelessmordred: I don't claim to have an answer yet03:05
lifelessmordred: just thought you should have it in your thinking cap03:05
openstackgerritA change was merged to openstack-infra/config: Use a single change ID per requirement proposal.
mordredlifeless: I think this may fall in to the category of things that jeblair was worried about in terms of enabling use of our mirror for non-gate activities03:05
mordredlifeless: which is to say, I think it may have some design holes03:06
lifelessmordred: we're not using your mirror yet03:06
lifelessmordred: this is a fresh run-mirror'd mirror03:06
mordredlifeless: yup. I grok. but the mirror script is designed to keep a running mirror03:06
lifelessmordred: right, ack.03:06
mordredlifeless: thinking cap on - btw03:06
mordredthis is my way of thinking03:06
lifelessonce we get sophisticated enough in our CI03:07
lifelesswe'll spin up new mirrors as part of the test03:07
lifelessand detect this03:07
mordredI will be honest - my most recent thinking has been to investigate use of devpi03:07
lifelesss/the test/a test/03:07
lifelessmordred: fully offline is very attractive for dc bringup stories03:07
mordredyup. devpi has fully offline03:07
lifelessmordred: so I'm not super keen on devpi03:07
lifelessmordred: I thought it only captured what you used?03:08
mordredit also has pockets03:08
mordredso you can have a "mirror upstream" pocket, and a "my local stuff" which depends on the "mirror upstream"03:08
lifelessmordred: so devpi would demonstrate the same failure mode03:08
mordredso pointing at my local stuff will get you both03:08
mordredlifeless: yes. I'm just saying03:08
lifelessok, tangent, sure.03:08
mordredI've been thinking that richer implementation scripting might be better served at this point by devpi instead of pypi-mirror03:09
mordredI support the goal you are expressing03:09
mordredsort of03:09
mordredI mean03:09
lifelessso I suspect we're going to be gating a different scenario than the gate currently does03:09
lifelessI'm thinking I should mail the list when we're in sight of success03:09
lifelessand get discussion03:09
lifelessand/or a session in the CFP at the project level I guess03:09
clarkbmordred: are you thinking we should use devpi for our mirror too?03:10
*** dims has quit IRC03:10
mordredclarkb: toying with the idea03:10
mordredclarkb: the fact that it supports multiple sets of things03:10
mordredclarkb: and local uploads03:10
mordredbut also linking things03:10
mordredis very attractive03:11
mordreddownside: it serves things from python instead of apache03:11
clarkbright, I was just going to ask about that03:11
mordredyup. that's the asinine part03:11
mordredbut also the part that allows you to describe sets that depend on other sets03:11
mordredso, you know, feature. bug.03:11
mordredalso - I'm thrilled that 3rd party testing has finally caught on03:13
mordredit only took a year03:13
mordredmaybe a year and a half03:13
clarkbmordred: so I was thinking about swift logs and realized we should just put our mirror in swift too03:13
mordredhow long have we been doing this?03:13
mordredclarkb: totes03:13
clarkbmordred: then we can manage a single index.html file03:13
clarkband maybe not even that03:13
clarkbmordred: nova is requiring it for their hypervisors03:14
clarkbmordred: I think ssh will always be the way to go for third party testing (because event stream > polling)03:15
*** wenlock_ has joined #openstack-infra03:18
*** wenlock has quit IRC03:19
*** wenlock_ is now known as wenlock03:19
mordredamazing how russellb telling people they have to do it or they're going to get dropped gets further than us offering that they can do it and people can track the quality of their driver03:20
*** matsuhashi has joined #openstack-infra03:24
*** matsuhashi has quit IRC03:31
*** matsuhashi has joined #openstack-infra03:32
openstackgerritTom Fifield proposed a change to openstack-infra/config: Fix Doc Location for Transifex
clarkbfifieldt: you around?04:21
fifieldtyessir clarkb04:22
fifieldtthe sun is up and doing well04:22
clarkbfifieldt: cool. We would like to add Ironic to transifex and I figured I should figure out how you would like to go about adding new projects04:22
fifieldtright, yes, that procedure should be documented04:22
fifieldtI take it you're most interested in the transifex side of things?04:23
clarkbI think I have sufficient permissions to do it, but didn't want to be sidestepping things04:23
clarkbfifieldt: right04:23
clarkbfifieldt: I can send an email or submit a bug or whatever is best for you04:23
fifieldtif you want, we can step through it now and just do it?04:23
fifieldtand I can update the wiki at the same time04:23
fifieldtso, we start in the OpenStack "organisation" on transifex04:23
fifieldtat the top of the projects list is the "+ NEW" button04:24
fifieldtwe type in a name, and description as appropriate04:24
clarkbyup, I have clicked the NEW button04:24
fifieldtand importantly: set the source language to English (en)04:24
clarkbfifieldt: and the name is the project less openstack/ ?04:24
fifieldtthe openstack organisation provides the openstack bit04:25
fifieldtchoose "Permissive Open Source" as the license04:25
fifieldtand paste the URL for the source (either github or in the "source code URL" box04:25
fifieldtonce you have created the project, go to its page and click the "Manage" button04:26
clarkbfifieldt: does the URL for the source need to be a clonable path?04:26
clarkbor is that just a handy link for humans?04:26
fifieldtjust a handy link for humans04:26
clarkbok I am on the manage page04:26
*** basha has joined #openstack-infra04:27
fifieldtfeel free to fill out a long description, home page, if you want,04:27
fifieldtbut the important bit here is maintainers04:27
fifieldtnot maintainers04:27
fifieldtaccess control04:27
fifieldtset the "Project Type" to "Outsourced project"04:27
clarkbfifieldt: under features is a TM check box. should I check that?04:27
fifieldtand "Outsource Access to" OpenStack04:27
fifieldtyes, that is a good idea clarkb04:28
clarkbok TM check box checked and project outsourced to openstack04:28
clarkbnow I need to add maintainers04:28
fifieldtin theory, that is done through the OpenStack organisation04:29
fifieldtbut you can add anyone you think is relevant to an individual project04:29
clarkbfifieldt: can you check if you have management perms on Ironic?04:29
clarkbyou haven't been explicitly added but are part of the project hub04:29
fifieldtI do indeed04:29
fifieldtso no problems with permissions04:29
clarkbcool I will leave it as is then04:29
fifieldtyay :)04:29
clarkbis that it for the transifex side?04:30
clarkbawesome thanks04:30
fifieldtthere is one thing I'm not 100% sure of04:30
fifieldtthat is whether there's a need to manually create the "Resources" the first time04:30
fifieldtI think the client can do that04:30
fifieldtbut I'm not 100% sure04:30
clarkbI think the client can do that too04:30
fifieldtthen yes, that should be everything04:31
clarkbas other new projects haven't needed to do anything under resources, instead jenkins jobs push to them and they are automagically added04:31
fifieldtit's good to get confirmation on that04:31
clarkbfifieldt: I will try to remember and double check ironic once the jenkins jobs are in place04:31
clarkbbut I haven't heard complaining about it not working so it must work right? :)04:31
fifieldtright :)04:32
fifieldt <-- though, speaking of failing jenkins jobs, how do you feel about this? :) I'd like to get manuals working again :(04:32
clarkbdevananda: ^ you are ready for the jenkins jobs04:32
clarkbfifieldt: 51112 lgtm +2'd04:32
* fifieldt wonders who else he can bother at this insane timezone04:33
fifieldtdammit clarkb, now I have to check every project to make sure that TM box is ticked :D04:40
fifieldtit must be a new option04:42
fifieldtthey weren't04:42
fifieldtnice job on the discovery :)04:42
*** senk has joined #openstack-infra04:42
*** DennyZhang has quit IRC04:46
*** senk has quit IRC04:47
clarkbfifieldt: :)04:47
*** changbl has quit IRC04:47
*** changbl has joined #openstack-infra04:51
*** DennyZhang has joined #openstack-infra04:56
*** sarob has quit IRC05:02
*** sarob has joined #openstack-infra05:02
*** sarob has quit IRC05:07
*** SergeyLukjanov has quit IRC05:08
*** afazekas has quit IRC05:11
*** senk has joined #openstack-infra05:43
*** senk has quit IRC05:48
openstackgerritEndre Karlson proposed a change to openstack-infra/config: Add pypi job to python-libraclient
*** yolanda has joined #openstack-infra06:11
*** sarob has joined #openstack-infra06:13
openstackgerritEndre Karlson proposed a change to openstack-infra/config: Add / Change python-libraclient jobs
*** sarob has quit IRC06:18
*** mkoderer_ is now known as mkoderer06:37
*** senk has joined #openstack-infra06:45
*** senk has quit IRC06:50
*** mancdaz has joined #openstack-infra07:15
openstackgerritMasashi Ozawa proposed a change to openstack/requirements: Set boto minimum version
openstackgerritMasashi Ozawa proposed a change to openstack/requirements: Set boto minimum version
bauzashi all07:22
bauzasI'm having trouble with the py27 build for a review :
bauzasmy own tox -r -epy27 works like a charm07:23
bauzasbut the oslo config on the Jenkins VM is incorrect07:24
bauzasI checked both Jenkins and tox venvs07:24
bauzasand the pip freeze is slightly different07:24
*** fbo_away is now known as fbo07:25
bauzasoslo.config is the same 1.2.107:25
*** senk has quit IRC07:51
openstackgerritLucas Alvares Gomes proposed a change to openstack/requirements: Added lower version boundary for netaddr
openstackgerritMasashi Ozawa proposed a change to openstack/requirements: Set boto minimum version
silehtthx fungi I have seen the pypi mirror updated !09:13
openstackgerritMehdi Abaakouk proposed a change to openstack-infra/jenkins-job-builder: Allow macro is dict key
* ttx juggles with CIVS since it doesn't allow more than 1000 voters10:17
ttxfungi, jeblair, mordred: multiple failures downloading deps on various jobs10:23
ttxlooks like network issues10:24
ttxdoesn't hit the same dep every time10:24
* ttx lunches10:24
sdaguettx: it doesn't allow more than 1000 voters?10:26
*** branen has quit IRC10:30
openstackgerritEndre Karlson proposed a change to openstack-infra/config: Add / Change python-libraclient jobs
openstackgerritQiu Yu proposed a change to openstack-infra/jeepyb: Print help message and exit if no config file by default
sorenttx: CIVS is free software, IIRC. You might be able to install it somewhere and crank that limit up to eleven... thousand.11:17
sdaguemight need that for next go around. the ATC growth being what it is11:32
ttxsoren: yes, it's a bit weird but I ran it locally recently to test the ability to rerun ballots with alternative algorithms11:35
ttxsdague: you can actually send voters in multiple batches of <100011:35
ttxfungi, jeblair, mordred: ^ may or may not be related with the network issues we're experiencing fetching deps11:42
ttxAt this rate we'll reach universe entropy in 67 minutes11:42
ttxsdague: ever saw something like it ?11:43
sdagueyeh, that looks crazy11:44
sdagueI wonder if the network timeouts are preventing the node builds11:44
sdaguewhich would make sense11:44
ttxsdague: yes, definitely started to appear at around the same time11:44
sdagueso they enter that state, but stall out11:44
sdagueand the system is correctly trying to build more, because it's not getting any out the other side11:45
sdaguebecause we are definitely backed up on devstack nodes11:45
ttxit's like watching a train wreck in slow motion11:45
ttxgood thing I got most of my patches merged earlier.11:46
sdaguewho knew that skynet would need this much care and feeding11:46
ttxsdague: I was thinking of issuing a statusbot alert.11:47
sdagueprobably fair11:47
ttxon it11:47
ttx#status notice Gate is currently stuck (probably due to networking issues preventing new test nodes from being spun)11:49
*** senk has joined #openstack-infra11:49
ttxI like how every time I need to use that bot it miserably fails11:50
ttxwhere the heck is openstackstatus bot11:50
*** basha has quit IRC11:50
*** senk has quit IRC11:53
-ttx- Top issues right now: (1) test node starvation (2) networking issues fetching dep (might be the cause of 1) and (3) no statusbot to warn people11:54
ttxsdague: wondering if we are not past the peak of network issues and starting to gradually recover11:58
ttxlooking at the graph and the status of the very few tests that run11:58
sdagueyeh, could be11:59
*** cody-somerville has joined #openstack-infra12:22
mordredyay! things have fixed themselves before I woke up?12:23
BobBallthey knew you were coming12:23
BobBalland were scared...12:23
mordredBobBall: ++12:24
thomasm'Tis a good day.12:26
ttxmordred: gate is totally wedged right now.12:43
fifieldtthat looks awesome12:44
bauzassdague: ping ?12:44
fifieldtbut the amount of scrolling was annoying to get to the graphs ;)12:44
ttxfifieldt: if I didn't need it urgently for RC2 production I would probably find it funny too12:44
bauzassdague: I'm now at my office, still broken about my oslo.config version12:44
bauzasbtw, maybe ppl could help me ?12:45
fifieldtsorry ttx :) 2345 here and the brain is off, it seems12:45
* ttx sees his weekend vanish12:45
bauzasoslo-config got pulled from Jenkins while it shouldn't12:45
bauzasmy own tox venv on my laptop doesn't get this pretty old oslo-config beta version12:45
*** dafter has quit IRC12:46
bauzasthe gate should be fine12:46
mordredwhy are we timing out on fetches from pypi.o.o ?12:46
*** jhesketh has quit IRC12:46
ttxmordred: you tell me12:46
*** openstackstatus has joined #openstack-infra12:47
mordredok. there's statusbot12:47
ttxyay, a bot12:47
ttx#status notice Gate is currently stuck (probably due to networking issues preventing new test nodes from being spun)12:48
openstackstatusNOTICE: Gate is currently stuck (probably due to networking issues preventing new test nodes from being spun)12:48
*** basha has joined #openstack-infra12:48
openstackgerritRenat Akhmerov proposed a change to openstack-infra/config: Add configuration for Mistral project
*** ChanServ changes topic to "The Infrastructure team is working through some devstack node starvation issues which is currently holding up gating and slowing checks. ETA 1600 UTC"14:14
fungiso i think starting around thenish, hpcloud ceased acting on any nova delete or create calls. maybe nodepool lost a persistent connection and didn't realize/retry?14:48
fungiit has established https sockets (so it thinks) to addresses very similar to what the hpcloud service endpoint resolves to. sniffing now to see if those are actually dead connections14:52
*** julim has quit IRC20:59
jeblairboris-42: ok.  you can look at the swift-devstack-vm-functional jobs for an example of how to do something like that21:00
boris-42jeblair thank you!21:01
boris-42jeblair will try on next week=)21:01
clarkbsdague: I can try. reading is a good place to start21:01
*** melwitt has joined #openstack-infra21:01
jeblairclarkb: that's the job before, the neutron job, and the job after21:02
jeblairtimestamps don't seem to overlap21:02
jeblairclarkb: and neither the job before or after did that21:02
jeblairi'm leaning toward 'jenkins got busy' or 'jenkins got semi-deadlocked' or 'jenkins garbage collected' or, well, in general, just blaming jenkins for being jenkins.21:03
clarkbjeblair: this is weird. ya jenkins for being jenkins seems plausible21:03
*** jerryz has quit IRC21:04
jeblairmarkmcclain, sdague: so it looks like 15 min of that runtime is jenkins derping.  let's call that a fluke for the moment, unless it happens with significant regularity.21:04
sdaguejeblair: sounds fair21:04
sdagueclarkb: ok, I'm assuming this will live in openstack-infra/ and will propose a patch accordingly21:05
jeblairsdague, clarkb: ++21:05
*** senk has quit IRC21:06
clarkbsdague yup. the stackforge page is a decent template for what you need though21:07
*** miqui has quit IRC21:08
*** matty_dubs is now known as matty_dubs|gone21:09
sdaguewhat's included in python-jobs?21:10
clarkbpep8 pythonXX and pypy21:10
clarkbalso gate-*-docs21:10
clarkband coverage21:10
*** lcestari has quit IRC21:14
*** sarob has joined #openstack-infra21:14
openstackgerritSean Dague proposed a change to openstack-infra/config: add os-loganalyze to gerrit & zuul
sdagueso, that about right?21:15
*** CaptTofu has quit IRC21:17
*** CaptTofu has joined #openstack-infra21:17
clarkbsdague the pep8 and python jobs are just gate-* no check-*21:17
sdaguelet me fix that quick21:18
openstackgerritSean Dague proposed a change to openstack-infra/config: add os-loganalyze to gerrit & zuul
*** anteaya has joined #openstack-infra21:19
anteayaclarkb: I am meeting all sorts of elastic search people21:19
*** SergeyLukjanov has quit IRC21:19
dkranzclarkb: My tempest job watcher thinks only four tempest gate jobs have finished in the past few hours. Is it wrong or did I just pick a bad time to start with this?21:19
anteayado you have a list of bugs or an etherpad that outlines your current pain points with logstash and elastic search so I can read up and ask intelligent questions21:19
*** SergeyLukjanov has joined #openstack-infra21:20
anteayaand maybe find out something useful for you?21:20
clarkbanteaya: I don't, they are fairly nebulous around scaling21:20
clarkbsdague lgtm21:20
anteayaclarkb: yeah that is what I understood21:20
clarkbanteaya I need to upgrade to latest next week21:20
clarkbnewer versions are supposed to be better21:20
anteayado you think that will address some of the current scaling issues?21:20
anteayaI'll ask about versions tomorrow21:21
*** mrodden has quit IRC21:21
anteayawhat version of logstash and elastic search are we using right now21:21
anteayaand what do you want to go to next week?21:21
clarkbyes es memory use is much better in 0.90.X apparently21:21
sdagueclarkb: next time you are in logstash, I have requests for 2 pieces of metadata to get added to the runs21:21
sdague1) cloud-az21:21
sdague2) branch21:22
*** CaptTofu has quit IRC21:22
clarkbdkranz: I don't know. currently on a poor connection.21:23
anteayaclarkb: the one bit of info I got from my after dinner walk around Budapest companions, who just happen to have an elastic search as a service company - what luck - is that they run many small clusters rather than large clusters21:23
clarkbsdague: noted21:23
sdagueclarkb: thanks :)21:24
dkranzclarkb: ok, given that my patch is still hung in zuul almost 4 hours later perhaps it is just slow21:24
anteayaI'm not sure how the size of our cluster would be characterized21:24
clarkbanteaya: interesting I wonder how they shard across clusters21:24
anteayaI can ask21:24
*** esker has quit IRC21:27
sdagueclarkb: ok, jenkins did a +1 -
*** vipul is now known as vipul-away21:28
*** vipul-away is now known as vipul21:28
sdaguejeblair, you got a sec to check that out as well?21:28
sdagueI'd like to get this over so I can at least call that part good before the weekend, if possible :)21:29
*** SergeyLukjanov is now known as _SergeyLukjanov21:32
*** _SergeyLukjanov is now known as SergeyLukjanov21:32
*** SergeyLukjanov is now known as _SergeyLukjanov21:33
*** _SergeyLukjanov is now known as SergeyLukjanov21:33
*** SergeyLukjanov is now known as _SergeyLukjanov21:33
*** _SergeyLukjanov is now known as SergeyLukjanov21:33
*** SergeyLukjanov is now known as _SergeyLukjanov21:34
*** _SergeyLukjanov is now known as SergeyLukjanov21:34
*** blamar has joined #openstack-infra21:39
*** vipul is now known as vipul-away21:43
openstackgerritA change was merged to openstack-dev/pbr: Do not pass unicode where byte strings are wanted
fungisdague: you still have teh typoz21:50
*** anteaya has quit IRC21:51
*** vipul-away is now known as vipul21:54
*** mgagne has quit IRC21:56
fungiso as far as the py26 unit test timeout, i see that jenkins02 is in the midst of one of those use-all-the-things fits and is well on its way to memory exhaustion as a result...
fungii give it 30-60 minutes before available ram is full21:59
fungithough looking at the swap graph, yesterday's oom condition didn't happen until it reached around 0.5g swap used and then suddenly spiked in a matter of 10-20 minutes until it was up to 2g swap22:01
clarkb:/ is there a newer version of jenkins out? we could try upgrading22:02
jeblairfungi can you gracefully stop and restart it?22:02
fungijeblair: i definitely can22:02
fungiwas wondering if we wanted to troubleshoot further first, since we've caught it in this state22:02
fungii'm checking the thread count real quick22:02
*** senk has joined #openstack-infra22:03
jeblairi am afk and not useful22:03
*** dkranz has quit IRC22:03
fungino worries--i'm collecting what details i can first22:03
fungibut will definitely try to cycle it here in a moment and see if that helps22:03
*** gyee has quit IRC22:03
*** pcm_ has quit IRC22:04
clarkbI cant help for a bit but should have proper wifi in about an hour22:04
*** SergeyLukjanov has quit IRC22:04
fungithread count is highish but reasonable. not like that other time where it went batty22:05
*** SergeyLukjanov has joined #openstack-infra22:05
fungiThreads on Number = 1,935, Maximum = 3,390, Total started = 106,69922:05
Steely_Spamis that hanging out because it got a -1 during check?22:05
Steely_SpamI didn't think that was a thing22:06
*** SergeyLukjanov is now known as _SergeyLukjanov22:06
fungifor comparison...22:06
fungiThreads on Number = 1,422, Maximum = 19,590, Total started = 807,51722:06
*** _SergeyLukjanov is now known as SergeyLukjanov22:06
*** senk has quit IRC22:07
clarkbno it should clear the -1 and move on. that is why zuul leaves a gate jobs starting comment22:07
Steely_Spamclarkb: okay, I thought so...22:08
fungiclarkb: Steely_Spam: though in this case i'm not finding it on the zuul status page22:08
Steely_Spamfungi: right, it's not in the queue for some reason22:09
fungiit got a new patchset upload after it was approved but before it merged, then got approved again22:09
*** jerryz has joined #openstack-infra22:09
Steely_Spammaybe a reverify would kick it?22:09
fungiit's possible it was re-approved while the previous patchset was still in the process of waiting to be kicked out of today's extremely slow gate22:09
fungiSteely_Spam: so, yes, try to reverify and see if jenkins leaves a new "starting gating" comment on it after that22:10
* Steely_Spam tries22:10
jerryzfungi: it is still not unusual for me to run into this bug:
uvirtbotLaunchpad bug 1225664 in openstack-ci "tempest.api.volume.test_volumes_actions.VolumesActionsTestXML flakey failure" [High,Triaged]22:10
Steely_Spamfungi: related question: can I put a recheck/reverify command on the first line and more comment below it, or does the whole comment have to be just the command in order to work?22:10
fungijerryz: did you hit it recently?22:11
jerryzfungi: i also see in e-r status report several reviews also fail due to that bug22:11
Steely_Spamfungi: yes, that kicked it and like ten behind it, thanks :)22:11
fungiSteely_Spam: no, it's a very strict match right now, no comments in the same post. i usually leave a second comment with my details22:11
Steely_Spamfungi: okay, I've been doing the same, just wondering22:11
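A rough sketch of what "a very strict match" means in practice: the whole comment has to be the command, so any extra text in the same post stops it from triggering. The pattern below is a hypothetical illustration, including the optional bug-reference form, and is not the expression Zuul is actually configured with.

```python
import re

# Hypothetical pattern: the entire comment must be the command,
# optionally followed by "no bug" or "bug <number>".
COMMAND = re.compile(
    r"\s*(recheck|reverify)( (no bug|bug \d+))?\s*",
    re.IGNORECASE,
)


def is_trigger(comment):
    """True only when the comment is nothing but the command."""
    return COMMAND.fullmatch(comment) is not None


print(is_trigger("reverify bug 1225664"))
print(is_trigger("reverify bug 1225664\nhit the volume test failure again"))
```

Because `fullmatch` must consume the entire comment, the second call does not trigger, which matches the habit described above of leaving the details in a separate comment.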
jerryzfungi: my code base tested should be two or three days ago22:11
jerryzfungi: but in e-r 's report, recent reviews also hit similar failure22:12
fungijerryz: it's also possible the elastic-recheck criteria for matching that issue are too vague and catching more than one problem under that umbrella22:12
fungijerryz: link to a recent failure or the report you're talking about?22:13
jerryzAffecting changes: 42523, 46696, 46479, 46206, 46598, 45306, 46738, 46777, 46219, 46792, 4224022:13
*** dcramer_ has quit IRC22:14
*** tvb|afk has quit IRC22:14
fungijerryz: thanks--i'll try to take a look in a bit once i've got jenkins02 back under control22:14
fungiheh... top reports the jvm on jenkins02 is using 40g of virtual memory. it doesn't have but 32g including swap22:15
fungimust be shared22:15
fungiresident is 26g though22:16
fungiokay, jenkins02 is preparing for shutdown. i'll restart the service once all jobs complete22:17
fungiprobably about 30 minutes22:18
jeblairfungi i think nodepool is running the new code that should shift load to jenkins01.  you may want to keep an eye on jenkins01.22:19
fungiyeah, as of this morning's restart. i was thinking about that as well22:20
jeblairsince theres a lot of untested stuff going on.22:20
jeblairif jenkins01 gets overloaded we may need to add a cap in nodepool.22:21
fungijerryz: okay, i see that's the swift storage cap being exceeded? you might see if afazekas wants to work on enlarging that since he did the past couple of changes for it (or propose a similar one?)22:21
fungijeblair: definitely agree22:21
jgriffithjerryz: question on that...22:21
jgriffithjerryz: which case of it are you seeing?22:21
jeblairfungi if there is a prob you can adjust provider max values in nodepool.yaml to quickly get a similar effect.22:22
fungijeblair: noted--thanks!22:22
*** SergeyLukjanov has quit IRC22:22
jerryzjgriffith:  and
jerryzthose are recent failures22:23
fungii'll afk for a few minutes while jenkins02 finishes up and brb22:24
jgriffithjerryz: interesting... 500 failure back from the glance client22:26
*** thedodd has quit IRC22:30
*** CaptTofu has joined #openstack-infra22:30
lifelessjeblair: I'd like to offer all TripleO ATC's accounts on this cloud; I could just mail -dev but I'm pondering whether something more directed (e.g. direct email) would be good22:31
*** sarob has quit IRC22:31
BobBallfungi: Is there any way to access a vnc console or similar for VMs in the HP cloud?22:33
openstackgerritSean Dague proposed a change to openstack-infra/config: add os-loganalyze to gerrit & zuul
sdaguefungi: oops, thanks22:33
*** changbl has quit IRC22:35
fungilifeless: if it's decided that an e-mail list of tripleo atcs is warranted, i can generate one on whatever set of repositories and timeframe you want, basically same as we would for a tripleo ptl election22:36
*** rcleere has quit IRC22:36
jeblairlifeless recommend -dev for now as i'd want to carefully consider giving out email addrs22:36
fungiagreed. i'm hesitant as well, but it's a technical possibility22:37
lifelessjeblair: ack22:37
jeblairi personally think this is a fine use, but i dont want to surprise anyone or break any implied trusts22:37
fungi(and you can always just scrape the git commit logs, but that's got the same privacy concerns)22:37
fungiBobBall: i believe so, but it's been a while since i needed console access to an hpcloud vm22:38
jeblairso lets separately come up with some policy for the future22:38
sdaguefungi: can I get another look from you on the os_loganalyze add - ?22:38
*** datsun180b has quit IRC22:38
fungisdague: yep, was about to pull it back up22:39
BobBallfungi: any ideas how I might do that? the web interface doesn't seem to give me a clue...22:39
*** CaptTofu has quit IRC22:39
fungisdague: keep in mind i only -1'd you to game my review positivity stats ;)22:39
*** CaptTofu has joined #openstack-infra22:40
clarkbBobBall I am not sure you can. I had the same problem last I tried22:40
fungithat's sucky22:41
openstackgerritDan Nguyen proposed a change to openstack/requirements: Add pwtools to requirements for password generator
* BobBall sighs deeply22:41
BobBallthat's a real shame...22:41
lifelessjeblair: cool, thanks22:41
fungion the other hand, it seems like a good chunk of nova denial of service issues were related to novnc, so maybe disallowing access there is a defensive measure22:42
* BobBall bangs his head against the soft fluffy HP cloud22:42
lifelessBobBall: oh?22:42
BobBallStruggling trying to get Xen booting nested so we can look at gating tests... and the lack of VNC access means I can't play with boot parameters - once I set them, and it fails, I have to reinstall the machine22:43
BobBallit's a right pain22:43
*** CaptTofu has quit IRC22:44
lifelessBobBall: oh :)22:44
fungii know rackspace provides a console. on the down side the reason i know that is because of having to frequently try to troubleshoot crashed/hung/dead virtual machines22:44
lifelessBobBall: erm, I meant oh :(22:44
lifelessBobBall: do you have xen booting locally using kvm ?22:44
lifelessBobBall: could you just upload a custom image?22:44
BobBallwe've had it working, yes22:44
fungilifeless: via that awesome glance service they offer their customers ;)22:45
lifelessfungi: yup, we have that22:45
BobBallnot seen that upload a custom image?22:45
fungilifeless: is it no longer in beta?22:45
lifelessfungi: it's in public beta still I believe22:45
jerryzjgriffith: can i file a bug?22:46
fungiwell, public beta is way better than secret beta. that's something rackspace still hasn't provided22:46
BobBalllifeless: how would I do that?22:46
jgriffithjerryz: the bug that you pointed to is valid.  Just need to add cinder and possibly glance but not sure yet22:46
jgriffithjerryz: I'll have to get back to it here when I have some more time22:47
BobBallfungi: RS cloud is even less fun - in theory it's doable but in practice we need an HVM linux guest which is a pain to get hold of with RS cloud :P22:47
* fungi nods22:47
jgriffithjerryz: feel free to add Cinder to the projects, I don't think it's an infra bug that's for sure22:47
BobBallthis is the joy of nested virt...22:48
lifelessBobBall: hardware assisted virt will be disabled in the kvm vms though surely22:48
lifelessBobBall: go to
lifelessBobBall: select us east in the beta section and request access22:49
lifelessBobBall: then once you get that, you can ask for glance access too22:50
BobBallgreat, thanks lifeless22:50
lifelessBobBall: it was about 24 hour turnaround when I got it enabled on the -infra account22:50
lifelessthough I don't think they've done anything with it:P22:50
BobBallbeta request sent :)22:50
lifelessBobBall: I'd be delighted to help you get a physical test environment up, if you guys have machines - we should be able to use nova baremetal + nodepool to get you d-g style instances of actual xen deployed pretty easily22:52
BobBallwe do - although not nearly the number of machines that -infra use for the gate :)22:54
sdagueclarkb, jeblair: either of you good with putting this through ? then we could get the gerrit core team set, and I can make changes on that side22:54
BobBallvirtualisation should work - it _really_ should...22:55
jgriffithjerryz: cool.. thanks!22:56
*** dcramer_ has joined #openstack-infra22:56
fungisdague: clarkb seemed basically okay with the previous patchset in irc. i'm okay approving it and will troubleshoot whatever i might overlook22:56
sdaguefungi: that would be awesome22:57
lifelessBobBall: how many concurrent vm's does a gate run need though?22:57
sdaguethen add me + infra-core to the core team in gerrit22:57
lifelessBobBall: say one for d-g itself, and some N concurrent test instances: one solid xen machine should be able to support at least 5 or 6 concurrent d-g style tests.22:58
lifelessBobBall: (without slowing each test down, I mean)22:58
sdagueI'm on for about the next 20 mins22:58
sdaguethen it's off to Plan 9 -
BobBallPerhaps - although I figured we needed one host per VM that's running tests - just to ensure there aren't any cross-interactions which might cause problems?23:00
BobBallalthough maybe I don't understand what d-g style tests are :P23:00
lifelessBobBall: d-g runs devstack which you'd want configured to talk to xen23:00
lifelessBobBall: I don't know xen well; could you have multiple devstacks talking to one xen ?23:00
BobBallin theory, sure23:01
BobBallbut if you have it then there is a risk of one set of tests interacting with another23:01
*** boris-42 has quit IRC23:01
BobBalle.g. if you break the xenserver in a horrible way (or the plugins don't match...) then it might show up as a failure when it shouldn't have23:01
lifelessperhaps have it just run nova gates?23:01
BobBallThat'd be easier for sure23:01
BobBallso how many hosts do you think might be needed?23:02
openstackgerritA change was merged to openstack-infra/config: add os-loganalyze to gerrit & zuul
lifelessnova is a pretty big fraction of the changes23:03
lifelessI don't have a gut feel - clarkb / fungi may well23:03
*** senk has joined #openstack-infra23:03
lifelessthe full gate, remembering my back of envelope figures23:04
lifelesswas 400 changes in one day23:04
lifelessat 30m each23:04
BobBallI see23:05
openstackgerritJoe Gordon proposed a change to openstack-infra/elastic-recheck: Change test_queries from logical AND to OR
sdaguefungi: so now that it's merged, we just wait for the next puppet update to trigger the import?23:06
BobBalloh rubbish - just realised it's midnight23:06
BobBallI really should get some sleep23:06
lifelessBobBall: we figured 40 concurrent test environments is sufficient23:06
lifelessBobBall: so 40 small machines for xen23:06
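[Editor's note: a rough sketch of the arithmetic behind the figures quoted above (400 changes a day, 30 minutes each, 40 concurrent environments). The log does not say how 40 was derived. This assumes one test environment per change and load spread evenly over 24 hours; both are simplifications, and the function names are illustrative only.]

```python
# Back-of-envelope gate capacity, using only the figures quoted in the log.
CHANGES_PER_DAY = 400          # "400 changes in one day"
MINUTES_PER_RUN = 30           # "at 30m each"
PROVISIONED_ENVIRONMENTS = 40  # "40 concurrent test environments"


def average_concurrency(changes_per_day, minutes_per_run):
    """Mean number of busy environments if runs were spread evenly over a day.

    Assumption (not from the log): one environment per change.
    """
    busy_minutes = changes_per_day * minutes_per_run
    return busy_minutes / (24 * 60)


def headroom_factor(provisioned, average):
    """How many times larger the provisioned pool is than the average load."""
    return provisioned / average


avg = average_concurrency(CHANGES_PER_DAY, MINUTES_PER_RUN)
factor = headroom_factor(PROVISIONED_ENVIRONMENTS, avg)
print(f"average busy environments: {avg:.2f}")
print(f"headroom over average: {factor:.1f}x")
```

[Under these assumptions the average load is well below 40, so the quoted figure presumably leaves room for bursts rather than describing steady-state use.]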
fungisdague: yup, and then i'll add you as the initial core group member, and add the infra core group as included23:06
sdaguefungi: cool23:06
lifelessBobBall: perhaps a moonshot chassis fully loaded?23:07
BobBallthat'd be a very nice way to do it23:08
*** senk has quit IRC23:08
fungilifeless: BobBall: if you're just talking about gating load, have a look at and note that each job listed for a change is using an 8gb vm with 4x vcpu23:09
lifelessfungi: moonshot is dual core + hyperthreads with 8GB23:09
fungiso depending on the project you're gating, maybe around 10ish servers in parallel23:09
fungilifeless: sounds comparable23:10
lifelessfungi: right, it's why I suggested it.23:10
fungiis that the arm hardfloat version or the atom one?23:10
lifelessfungi: there will be higher density cartridges in future, of course23:10
BobBall10 doesn't sound enough to me if I'm honest23:10
lifelessfungi: atom, it even has VTx23:10
fungiBobBall: i meant 10ish per change you want to test in parallel23:10
BobBallI'm very tempted by the moonshot idea23:11
lifelessBobBall: fungi means 10 * - 10 servers per commit, but I think he's wrong :)23:11
BobBalloh I see23:11
BobBallwhy 10 per commit?23:11
fungii may be. checking the veracity of my assertion now23:11
lifelessfungi: do you mean 'tempest runs 10 sub-vm's ?23:11
lifelessfungi: or do you mean 'zuul schedules 10 jobs' ?23:11
sdagueare you guys talking about devstack/tempest runs?23:11
sdaguebecause our experience is the cpu does matter quite a bit23:11
BobBallWe're talking about adding a devstack/tempest/xenapi run somehow :)23:12
sdaguewhich is why the rax nodes aren't used23:12
sdagueso atom... not a great idea :)23:12
fungijust talking about jobs in general. if you were to replicate *all* of our gating, we use 9 virtual machines in parallel for each iteration of attempting to gate a nova change, for example23:12
lifelesssdague: mmm, I'd seriously consider native atom over virtualised $other :>23:12
lifelessfungi: right, so thats the wrong way to look at it23:13
BobBallahhh ok23:13
funginot sure what metric BobBall was looking for there23:13
lifelessfungi: the way to look at it is we're adding one more job to that set.23:13
fungioh, in that case one per change tested in parallel23:13
lifelessfungi: so from 9 vm's to 10, one of which BobBall would be providing in a dedicated xen-capable-environment.23:13
BobBallI'm not sure I know either :)23:13
sdaguelifeless: it seems pretty cpu bound, so virtualized doesn't have much overhead23:13
lifelesssdague: tempest is running against qemu vm's23:14
sdaguelifeless: the qemu vm start times isn't really the issue23:14
fungijenkins02 has been gracefully restarted and is coming up now23:14
*** sarob has joined #openstack-infra23:14
lifelesssdague: ok; I'll defer to data here.23:14
lifelesssdague: just that even cirros can't make the vm's do their stuff well >23:14
lifelesssdague: I would want to investigate a xen-on-moonshot test before writing it off23:15
sdaguefair, just saying what I've seen.23:15
lifelesssdague: these aren't the atoms most folk have seen23:15
sdagueok, well even the amd chips in rax give us a 40% slow down compared to the intel chips at hp23:16
BobBallwell I'd question whether we'd need to run the full set of tempest tests as well - they all pass of course, so that's not the issue, but some of them are entirely independent of the hypervisor driver23:16
lifeless!tab=specs <- the cartridges I'm referring to23:16
BobBallthe rax chips you were testing on are a fair bit older than the intel ones at HP though23:16
sdaguelifeless: what's the L3 look like on those?23:16
sdagueBobBall: fair23:16
sdagueI'd say get some data on a real system first though23:17
lifelesssdague: 1 MB23:17
lifelesssdague: yes, +1 on getting real data23:17
sdagueso, I'd be suspicious then. We've seen some pretty strong correlation between L3 size and speed here.23:18
sdaguebut some runs would be good23:18
BobBallDo you have access to a moonshot system lifeless?  I can probably get access but it's likely to take a while23:18
sdagueok, movie time23:18
lifelessBobBall: not at the moment, but I know folk who do :/23:18
*** fifieldt has joined #openstack-infra23:19
fungisdague: if it's wood's original plan 9, one of my favorites ;)23:19
BobBallokay I'll check with our HP blokey23:19
lifelessBobBall: I would suggest, if doing this is a real possibility, that we go in the front door and get a sales person involved - the sales folk have ready access to moonshot for customer evaluations23:19
lifelessBobBall: (e.g. fully populated 45 cartridge + two switch chassis)23:19
BobBallMaybe.  I know someone who has been talking about moonshot so I'll have a few words with him first23:20
BobBalland try the glance upload route too :)23:21
BobBallall sorts of fun!23:21
lifelessif you run into a wall, let me know23:21
lifelessI have some interactions with moonshot teams23:21
BobBallperfect, thanks.23:21
*** BobBall is now known as BobBallAway23:22
*** marktraceur is now known as FreeThaiFood23:23
fungijeblair: anecdotal but worth watching for next time, we had a great many more devstack jobs end up on jenkins02 as soon as it came up than were running on jenkins01. like it got favored for some reason (maybe accumulated shares from while it was unreachable?)23:26
fungiat the moment there are about 5 devstack jobs running on jenkins01 and nearly 50 on jenkins0223:27
fungibut jobs still seem to be running and completing successfully23:29
fungii'll check back in on it in a bit23:29
*** pentameter has quit IRC23:33
*** mriedem has joined #openstack-infra23:37
*** nati_uen_ has joined #openstack-infra23:45
*** nati_ueno has quit IRC23:46
*** FreeThaiFood is now known as marktraceur23:48
*** hogepodge has quit IRC23:49
*** rnirmal has quit IRC23:54
*** vipul is now known as vipul-away23:57
