Thursday, 2015-03-19

mordredStevenK: http://git.openstack.org/cgit/openstack-infra/system-config/tree/modules/openstack_project/manifests/proposal_slave.pp00:00
harlowja_clarkb let me know when u think i can recheck https://review.openstack.org/#/c/164836/ and it might work00:01
*** alexpilotti has quit IRC00:01
pleia2I can confirm with AJaeger that script-wise that's the only place it's needed00:01
clarkbharlowja_: I have no idea, I was hoping mordred would have some great insight for us00:01
pleia2we'll need to know where to put this puppet change anyway00:01
harlowja_k00:01
harlowja_mordred great insight required!00:02
mordredclarkb: why would i have insight?00:02
harlowja_i don't think any py26 supporting project can update its requirements with those '~=' style until that gets addressed00:02
*** claudiub has joined #openstack-infra00:02
*** alexpilotti has joined #openstack-infra00:02
clarkbmordred: because you wrote all the recent changes to that pin00:02
mordredclarkb: oh - that was the pin  thing00:03
clarkbharlowja_: maybe we shouldn't use ~= then00:03
clarkbharlowja_: because we can't garuntee everyone using this sfotware has pip 6.000:03
mordredclarkb: is that pinning to older or newer?00:03
clarkbmordred: older00:03
harlowja_clarkb u won't get disagreement from me :-P00:03
mordredclarkb: yes, we can. we do not make any guarantees about non-most-recent pip00:03
*** Ryan_Lane has quit IRC00:03
mordredclarkb: in fact, we explicitly don't work on it, i believe00:03
harlowja_https://github.com/openstack/requirements/commit/25ef10 put a bunch in though, lol00:03
*** alexpilotti has quit IRC00:03
clarkbmordred: we do we know this because centos600:04
*** yamamoto has joined #openstack-infra00:04
pleia2I need to run off to an evening event, thanks jeblair, StevenK and mordred00:04
mordredclarkb: no - if you're installing from pip, you need to install latest pip00:04
*** jp_at_hp has joined #openstack-infra00:04
mordredwe do not support the pip in centos600:04
clarkbmordred: that is not how we test00:04
mordredclarkb: then we're testing wrong00:04
clarkbmordred: thats fine we can fix that00:04
mordredclarkb: woot00:04
clarkbmordred: just pointing out that your assertion is false as of right now00:04
*** sdake_ has joined #openstack-infra00:04
*** alexpilotti has joined #openstack-infra00:04
mordredclarkb: cool. well - I refuse to support old pip except for transitionally00:05
mordredbecause it makes absolutely no sense00:05
clarkbthe reason I suggest not using ~= is because >= and < should also work fine00:05
clarkbso there is little benefit in supporting fancy new symbol that isn't available everwhere00:05
mordredclarkb: right - but openstack is adopting ~= across the board pretty aggressively00:05
clarkbmordred: yes but there is no reason to00:05
clarkbmordred: why would we do that?00:05
mordredclarkb: there is no reason not to00:05
mordredclarkb: supporting out of date pip has no point00:05
clarkbmordred: there is00:05
mordredthere is literally no benefit00:06
clarkbmordred: code change churn00:06
clarkbthere is no benefit to change to ~= so lets just leave it be00:06
mordredtehn don't pip install from master00:06
*** alexpilotti has quit IRC00:06
mordredclarkb: not our call here00:06
clarkbmordred: this has nothing to do with pip00:06
* harlowja_ has to go fix anvil to convert ~= -> rpm versions, lol, (code churn there, lol)00:06
mordredsure it does00:06
clarkbmordred: and everything to do with we don't need hundreds of requirements updates to support ~= unnecessarily00:06
clarkbmordred: it just happens that old pip also fails at this00:06
mordredI'm not going to police that00:06
mordredif the project wants to adopt it, great00:06
clarkbsure, I haven't -1'd anything00:07
mordredit fits in with my understanding of our pip support00:07
clarkbjust pointing out that there are valid reasons to not do this00:07
mordredwhich is "anything other than latest pip is insane"00:07
mordredsure00:07
jeblairclarkb, mordred: 155072 includes a formal adoption of it00:07
mordredI'm just saying that if we're pinning to an out of date venv - sorry, that's my bad00:07
mordredand we should stop doing it00:08
*** sdake has quit IRC00:08
mordredbecause we test with latest pip in devstack00:08
*** mtanino has quit IRC00:08
mordredand testing with older pip elsewhere because I suck is a bad policy :)00:08
harlowja_mordred please suck less, thx00:08
mordredharlowja_: listen man - there is only so much suck I can remove on a given day00:08
jrollwhoa, pip has ~= now? what does that do?00:08
* jroll not sure if he wants to know00:09
clarkbalso  Istill don't know why trusty works00:09
clarkbmordred: ^ which was mostly I think what I wanted clarification on before we changed anything00:09
harlowja_jroll https://www.python.org/dev/peps/pep-0440/#compatible-release00:09
clarkbit appears the only place we install virtualenv is in the base.pp manifest00:09
clarkband yet trusty nodes don't seem to get downgraded00:09
*** jamielennox is now known as jamielennox|away00:09
*** liusheng has quit IRC00:10
jrollhuh00:10
mordredclarkb: hrm00:10
harlowja_lol, jroll same response, idk either :-P00:10
*** liusheng has joined #openstack-infra00:10
jroll00:06:11          clarkb | there is no benefit to change to ~= so lets just leave it be <- I like this argument00:10
*** SumitNaiksatam has quit IRC00:11
*** jeffgdl has joined #openstack-infra00:11
jrollbut then again I would rather == would work00:11
*** garyh has quit IRC00:12
*** ddieterly has joined #openstack-infra00:12
harlowja_i start to feel that someone was taking a little to many drugs when making up  ~=  as a version identifier00:12
harlowja_lol00:12
*** mrmartin has quit IRC00:12
jrollI've only ever seen that in npm things00:12
jrollso.... you may be right00:12
harlowja_maybe they share(d) drugs then00:12
harlowja_ha00:12
*** otter768 has joined #openstack-infra00:13
*** dims__ is now known as dims00:13
harlowja_i00:13
*** sdake has joined #openstack-infra00:13
harlowja_*i'd rather have the versions be simple, have a nice solid dependency resolver, and then maybe start adding these edge-cases00:13
ianwfungi: is anyone actively looking at https://review.openstack.org/#/c/165620/ (Temporarily stop adding devstack-f20 nodes in rax)?00:14
jrollharlowja_: yeah00:14
dimsfungi: clarkb: some oslo.messaging -docs, -tarball are stuck in the -post queue for a while. do i need to re-jiggle them some how?00:14
*** achanda has quit IRC00:15
clarkbdims: we need more nodes, I am currently cleaning up leaked nodes in hpcloud to hopefully free up capacity00:15
openstackgerritSpencer Krum proposed openstack-infra/puppet-zuul: Modernize template syntax  https://review.openstack.org/15713600:16
dimsclarkb: thanks (i saw the "zuul jobs" and "gerrit events" charts go south and thought i may have to do something)00:17
*** otter768 has quit IRC00:17
anteayadims: thanks for keeping an eye on that00:17
*** sdake_ has quit IRC00:17
dimsanteaya: :)00:18
anteayadims: when the gate shifts and zuul needs to calculate the merge-check queue it can spike for a few minutes00:18
anteayaI've seen event go up over 800000:18
greghaynesasselin: Hey, im working on some fakes testing and im hitting the bug you patch https://review.openstack.org/#/c/162313 fixes, id like to base a test off it but I need to rebase your change off master if thats ok?00:18
dimsgotcha00:18
openstackgerritNan Liu proposed openstack-infra/puppet-elasticsearch: Use archive to fetch/checksum instead of wget.  https://review.openstack.org/16490800:18
anteayabut they don't last long00:18
anteayadims: but you are looking in the right spots00:18
asselingreghaynes, sure go ahead00:19
dimsthanks anteaya00:20
anteaya:)00:21
*** rfolco has quit IRC00:21
openstackgerritJeremy Stanley proposed openstack-infra/project-config: Add docs jobs for openstack-infra/openstackid repo  https://review.openstack.org/16566100:21
fungiianw: i have no idea. it was agreed as a good thing to try, but got no reviews once uploaded00:22
*** jamielennox|away is now known as jamielennox00:23
ianwfungi: i mean the failure to boot; just trying to grab some logs on this now00:23
clarkbfungi: maybe just apply by hand and see if it makes any difference?00:24
fungiianw: we get no logs. we never get an instance returned by nova when we request it to boot00:24
fungiclarkb: i was tempted to do that, but then worried i might accidentally leave puppet disabled on nodepool so thought better of it00:25
fungiclarkb: so if i was going to apply it by hand, better to just self-approve that change i guess (i proposed a wip revert along with it for everyone's convenience)00:25
fungiwhich i guess i can do00:26
*** tkelsey has joined #openstack-infra00:26
*** gyee has quit IRC00:30
*** tkelsey has quit IRC00:31
*** __TheDodd__ has quit IRC00:32
*** MarkAtwood has quit IRC00:32
ianwfungi: alright, well if i can help i will, obviously console logs, if it gets that far, would be good00:32
*** jeffgdl has quit IRC00:33
*** ZZelle_ has quit IRC00:34
*** tonytan4ever has quit IRC00:34
*** Sukhdev has quit IRC00:34
*** markvoelker has quit IRC00:37
fungiianw: i connected to the console of one at random. i'm getting "boot: No DEFAULT or UI configuration directive found!" looping over and over there00:37
fungibut i'm about to knock off for the night. getting lateish here00:37
*** e0ne has quit IRC00:38
*** tonytan4ever has joined #openstack-infra00:38
fungiianw: i'm guessing it's something not quite right with the bootloader installed to the image00:39
patrickeasthi, i have a nodepool question… is there a way to setup nodes with multiple floating ip’s each coming from a different pool?00:41
patrickeasti can make nodes with multiple networks, and a single floating ip from the pool specified in the providers ‘pool’ setting00:41
patrickeastbut didn’t see anything for other nics that are configured00:42
anteayapatrickeast: did you get a private network for nodepool?00:42
ianwfungi: alright, i'll try to replicate a little00:42
patrickeastanteaya: heh still waiting on that, in the mean time i am playing with setting up iscsi multipathing to our flash array on test nodes00:43
patrickeastanteaya: should be soon, waiting on some new cables or something they needed to put it into the dmz00:43
fungiianw: thanks!00:43
*** baoli has quit IRC00:43
anteayapatrickeast: okay00:43
anteayapatrickeast: and I have no idea how to answer your original question00:44
*** marun has quit IRC00:45
*** asettle is now known as asettle-lunch00:45
clarkbalien nodes are cleaned up in hpcloud now00:45
clarkbpatrickeast: there is a way to specify multiple networks I don't think you can specify multiple floating ips though00:46
patrickeastclarkb: yea that lines up with what i’m seeing in the docs00:46
patrickeastfigured I would check before hacking away at the code00:47
anteayaclarkb: yay00:47
clarkbpatrickeast: what are you trying to accomplish with multiple floating ips?00:47
patrickeastwe need to route traffic through a handful of 10Gb nic’s on the compute nodes to the test nodes so they can reach the iscsi ports on our flash array to test out multipathing00:48
patrickeastso each node would have 3 networks, management and 2x data00:48
*** cdent has quit IRC00:48
openstackgerritMerged openstack-infra/system-config: Temporarily stop adding devstack-f20 nodes in rax  https://review.openstack.org/16562000:48
patrickeastall 3 of which are on different “external” networks as far as openstack is concerned00:48
*** mfink_ has joined #openstack-infra00:49
clarkbpatrickeast: couldn't you do that all without floating ips?00:49
clarkbpatrickeast: just put them on the appropriate network00:49
*** claudiub has quit IRC00:50
*** ibiris is now known as ibiris_away00:50
patrickeastclarkb: mm that would be easier, can I put instances directly on a neutron network/subnet that is external w/ a bridge?00:51
ianwfungi: yeah, just upgrading kernel & rebooting f20 on rax it's not happy with that error, so that's kind of good in that it's not specific ...00:52
* patrickeast goes back to reading neutron docs00:52
fungiianw: awesome--glad it's not hard to reproduce. hopefully shortly we'll know whether hpcloud does any better in that regard00:53
clarkbpatrickeast: I am not sure how neutron does it but if neutron can set up NAT rules in iptables for internal:external mappings then it should be able to just use IPs on those networks direclty00:53
*** nanliu has quit IRC00:53
clarkbI certainly hope that neutron allows that :/00:53
*** yamamoto has quit IRC00:53
fungiianw: is the depends-on in https://review.openstack.org/164901 a typo?00:53
patrickeastclarkb: hehe yea i would think so too00:54
*** bhuvan has quit IRC00:54
openstackgerritMerged openstack-infra/jenkins-job-builder: Add support to configure boolean parameter in PBT  https://review.openstack.org/14666500:55
ianwfungi: oh, that got abandoned ... should be the previous one; just a tick00:55
fungiotherwise lgtm00:55
*** ddieterl_ has joined #openstack-infra00:55
openstackgerritIan Wienand proposed openstack-infra/system-config: Add nodepool config-validation environment  https://review.openstack.org/16490100:55
ianwfungi: ^ updated, thanks00:56
*** Rockyg has quit IRC00:56
*** ddieterly has quit IRC00:56
fungii see a devstack-f20 node building in hpcloud-b5 for 0.8 hours, so i think it's just unhappy everywheres00:57
fungieverwheres00:57
*** isq has quit IRC00:57
fungii'll go ahead and approve the revert00:58
*** jp_at_hp has quit IRC00:58
mordredianw: so - you enjoy diskimage-builder, right?00:58
mordred:)00:58
mordredianw: I made this a while back: http://git.openstack.org/cgit/openstack/diskimage-builder/tree/elements/ubuntu-minimal ... and a similar one http://git.openstack.org/cgit/openstack/diskimage-builder/tree/elements/centos-minimal00:59
ianwmordred: well ... sometimes.  my last change took about 6 months to merge00:59
mordredianw: :)00:59
mordredianw: the ubuntu one does what Id expect - which is make a super minimal base image that's similar in size and scope to the ubuntu docker image01:00
mordredianw: the centos one is ... larger - and I think I did it wrong ... and I don't have a fedora one at all01:00
*** bknudson has joined #openstack-infra01:00
*** Sukhdev has joined #openstack-infra01:00
mordredianw: if you get bored at some point and wanted to make a better one for centos/fedora that wound up with something tiny like a docker image ... I will send a freshly dried alpaca to your house01:00
ianwmordred: so one thing that stands out immediately is http://git.openstack.org/cgit/openstack/diskimage-builder/tree/elements/centos-minimal/pkg-map <- redhat-lsb brings in a whole bunch of stuff, X and cups etc01:01
ianwthat should probably be redhat-lsb-core01:01
mordredianw: yeah - it's the only way I could figure out how to get the lsb_release command ... AH01:01
mordredsee - this is you knowing more things than me01:01
*** tonytan4ever has quit IRC01:01
mordredianw: I was also told that perhaps rinse is not actually a great way to accomplish this ... and perhaps something about just using rpm with an alternate root *me waves hands wildly*01:02
fungimordred: also `rpm -qf lsb_release`01:02
fungithat is assuming you already have the packages installed, that will tell you which one is actually providing it01:03
fungii'm sure yum has something akin to apt-file for searching filenames of packages available on the package mirror but that you haven't installed, i just don't know the cli01:04
ianwmordred: so don't we have the whole atomic thing?  is there overlap here?01:04
mordredianw: it's not quite the same thing ... I don't actually want docker01:05
mordredI want actual base images - just small without the things that my distro providers think I want in a "cloud image"01:05
mordredbecause I have this whole puppet infrastructure where I declare the dependencies I actually want :)01:05
patrickeastclarkb: hah nice, thanks for the suggestion… I was making that much more painful than it had to be, although now we need to setup the /etc/network/interface.d/ script for the node with its static ip (the floating ips let us use the neutron dhcp), do you know if the /etc/nodepool/node_private file has *all* of the private ip addresses for a node so we can set it up with a ready script?01:05
mordredpatrickeast: fyi - I'm in the middle of replacing the openstack calls in nodepool with the shade library - so if you do find that there are some places where you need enhancements to the openstack interactions - like the floating-ips thing or richer handling of multiple nics - it's probably functionality we want to add to shade too ...01:06
*** bhunter71 has joined #openstack-infra01:07
patrickeastmordred: oo nice, is there a launchpad or something for it where I could document any feature requests?01:08
ianwmordred: you don't want a bootable image though?01:09
mordredpatrickeast: https://storyboard.openstack.org/#!/project/76001:09
*** Longgeek has joined #openstack-infra01:09
mordredianw: no, bootable is good - but there are already other elements to handle things like grub and whatnot01:09
mordredianw: the 'vm' element handles that01:10
ianwmordred: right, ok, so rinse gives you even less than downloading the upstream .qcow01:11
*** isq has joined #openstack-infra01:11
ianwupstream .qcow cloud images01:11
*** garyh has joined #openstack-infra01:12
*** pelix has quit IRC01:14
*** YorikSar has quit IRC01:15
*** achanda has joined #openstack-infra01:15
*** markvoelker has joined #openstack-infra01:17
fungioh! not approving the revert yet. we've got a used devstack-f20 node in hpcloud now01:17
fungiso it does seem we can still boot them there01:17
ianwfungi: so it's saying "boot: No DEFAULT or UI configuration directive found!" because there's no default configuration directive.  who'd have thunk ...01:18
ianwfungi: this looks like rackspace fiddling with extlinux.conf again ... similar thing happened with f2101:18
fungiianw: yeah, i've also had problems with them replacing grub2 with pv-grub in the mbr of my personal instances as well01:19
mordredianw, fungi: is this file-injection feeling things?01:20
funginot sure, but it would be nice if they would just not feel obligated to mess with bootloaders01:20
mordredianw, fungi: I think Ng got glance turned on on his rax account - so hopefully we shoudl have the non-nova-agent based element soon01:20
ianwmordred: no, i think they edit /boot/extlinux.conf for some reason, and then the scripts in the rpm files that update it don't work properly01:20
*** achanda has quit IRC01:21
mordredoh - you mean in the images they make for  us01:21
ianwyep01:21
mordredyeah - I'm very much looking forward to getting our cloud providers out of the pipeline for making images for us altogether01:21
* mordred promises he's working on that01:22
*** markvoelker has quit IRC01:22
ianwfungi: i'll communicate to jroll the issue, i can put a work-around into the prepare scripts01:22
*** mwagner_lap has joined #openstack-infra01:22
*** garyh has quit IRC01:23
jrollianw: ?01:23
fungithat was fast ;)01:24
jroll>.>01:24
* jroll runs away01:24
ianwjroll: the rax f20 image is broken in a similar way to the f21 where extlinux.conf gets mangled by kernel updates.  i'll get exact details and email01:24
jrollianw: shoot me an email or else I'll forget01:24
jrollyeah01:24
jrollthanks!01:24
*** mwagner_lap has quit IRC01:27
*** bhuvan has joined #openstack-infra01:28
*** garyh has joined #openstack-infra01:28
fungithanks as always jroll!01:30
jroll:)01:34
openstackgerritJoshua Hesketh proposed openstack-infra/nodepool: Move nodepool creation in tests to common method  https://review.openstack.org/16558101:35
openstackgerritSukhdev Kapur proposed openstack-infra/project-config: Adding create permissions for networking-arista  https://review.openstack.org/16567301:36
*** garyh has quit IRC01:38
*** mwagner_lap has joined #openstack-infra01:40
*** asettle-lunch is now known as asettle01:43
*** ivar-laz_ has joined #openstack-infra01:43
Sukhdevfungi: Hi there - I just pushed the patch based upon your suggestion. Can you please give it a review - https://review.openstack.org/#/c/165673/01:44
*** spzala has quit IRC01:46
*** ivar-lazzaro has quit IRC01:46
*** ivar-laz_ has quit IRC01:47
*** isq has quit IRC01:48
*** ChuckC has joined #openstack-infra01:49
*** unicell has quit IRC01:51
rluciohey guys, i linked my ci to the sandbox repo to test, and i checked back and both of my slaves are offline saying "offline due to gearman request"... anyone know what that means?01:52
fungirlucio: it means you've copied our configuration and are using an OFFLINE_NODE_WHEN_COMPLETE parameter in your jobs. we do this because we don't reuse workers, but delete them after a job completes and use a fresh one for each new job01:53
*** unicell has joined #openstack-infra01:54
*** unicell has quit IRC01:54
*** unicell has joined #openstack-infra01:54
rluciofungi: ah ha, cool.  yea i have static workers so i'll have to change that01:55
Sukhdevanteaya: Ping01:55
rluciofungi: thanks!01:55
fungirlucio: we have some static workers too, and for those we have a "reusable_node" parameter function which unsets that parameter for us01:55
*** isq has joined #openstack-infra01:55
rluciofungi: yea i see that defined in zuul/openstack_functions.py01:56
*** marun has joined #openstack-infra01:56
fungirlucio: if you look at our zuul/layout.yaml you'll see where we set it for jobs we're running on reusable nodes01:57
*** bhunter71 has quit IRC01:57
*** mwagner_lap has quit IRC01:58
SukhdevHi Infra cores - I need one more blessing on this patch, please - https://review.openstack.org/#/c/165673/01:58
fungirlucio: i take that back. we just apply it based on pattern matches of the job names these days. we used to set it from layout.yaml but i guess not any longer01:59
rluciofungi: right, just looking at that now, in the jobs: section of layout.yaml01:59
*** SumitNaiksatam has joined #openstack-infra02:00
rluciofungi: simple enough to fix :)02:00
openstackgerritMerged openstack-infra/nodepool: Fix Nodepool DIB Uploads to Glance w/o region_name  https://review.openstack.org/16231302:00
*** marun has quit IRC02:01
fungirlucio: yep. or just don't bother with the parameter function script we use. depending on your situation it's like a lot of unnecessary complication02:01
fungis/like/likely/02:01
*** ajmiller has joined #openstack-infra02:01
*** unicell has quit IRC02:02
*** ddieterly has joined #openstack-infra02:06
*** ddieterl_ has quit IRC02:06
*** mwagner_lap has joined #openstack-infra02:11
*** Sukhdev has quit IRC02:11
*** harlowja_ is now known as harlowja_away02:14
*** otter768 has joined #openstack-infra02:14
*** Alexandra_ has joined #openstack-infra02:16
*** asettle has quit IRC02:16
*** markvoelker has joined #openstack-infra02:18
*** otter768 has quit IRC02:18
*** reed has quit IRC02:20
*** macjack1 has joined #openstack-infra02:22
*** markvoelker has quit IRC02:23
*** macjack1 has quit IRC02:23
*** yamahata has quit IRC02:24
*** achanda has joined #openstack-infra02:27
*** sigmavirus24_awa is now known as sigmavirus2402:27
*** tkelsey has joined #openstack-infra02:28
*** baoli has joined #openstack-infra02:28
*** dims has quit IRC02:28
greghaynesclarkb: we need a good novaclient fake for this to work02:29
greghaynes:(02:29
*** tkelsey has quit IRC02:32
*** kaisers has joined #openstack-infra02:33
*** kaisers1 has quit IRC02:33
*** zz_dimtruck is now known as dimtruck02:35
*** dmorita_ has quit IRC02:36
*** otter768 has joined #openstack-infra02:38
*** unicell has joined #openstack-infra02:40
*** macjack1 has joined #openstack-infra02:42
*** unicell has quit IRC02:44
*** unicell has joined #openstack-infra02:45
*** macjack1 has quit IRC02:48
ianwjroll: looks like it's a fedora bug02:49
jrollianw: cool, I don't feel so bad now!02:50
* jroll deflects to redhat employees02:50
*** erlon has quit IRC02:51
* ianw curses anyone who thinks "i've got a need to open and close a bunch of files and mangle a lot of strings ... i think i'll write this in C"02:52
jrolls/open and close a bunch of files and mangle a lot of strings/do things with computers/02:52
jroll(mostly joking)02:52
*** mrmartin has joined #openstack-infra02:55
*** harlowja_at_home has joined #openstack-infra02:56
*** patrickeast has quit IRC02:58
*** spzala has joined #openstack-infra03:01
*** coolsvap|afk is now known as coolsvap03:04
*** stevemar has joined #openstack-infra03:08
*** harlowja_at_home has quit IRC03:11
*** spzala has quit IRC03:11
*** otter768 has quit IRC03:13
dougwighi infra.  is there an easy way to see a list of a particular job?  i tried here, but it's empty: https://jenkins.openstack.org/job/check-neutron-lbaas-dsvm-api/    i'm wanting to monitor it for stability.03:14
*** mmedvede has quit IRC03:17
*** Somay has joined #openstack-infra03:19
*** sdake_ has joined #openstack-infra03:22
ianwdougwig: i think you probably want to consult http://graphite.openstack.org/03:25
*** sdake has quit IRC03:26
dougwigianw: ty03:28
*** camunoz_gone is now known as camunoz03:29
*** coolsvap is now known as coolsvap|afk03:32
*** camunoz has joined #openstack-infra03:33
*** changbl has joined #openstack-infra03:34
*** sputnik13 has joined #openstack-infra03:36
openstackgerritIan Wienand proposed openstack-infra/system-config: Work-around grubby boot failure on Fedora 20  https://review.openstack.org/16568103:38
*** camunoz has quit IRC03:38
*** dimtruck is now known as zz_dimtruck03:39
*** coolsvap|afk is now known as coolsvap03:39
ianwfungi: ^ when you look at scrollback03:40
*** yamamoto has joined #openstack-infra03:40
openstackgerritIan Wienand proposed openstack-infra/system-config: Revert "Temporarily stop adding devstack-f20 nodes in rax"  https://review.openstack.org/16562103:41
*** achanda has quit IRC03:42
*** greghaynes has quit IRC03:43
*** achanda has joined #openstack-infra03:44
*** macjack has joined #openstack-infra03:46
*** mrmartin has quit IRC03:47
*** ddieterly has quit IRC03:47
openstackgerritgreghaynes proposed openstack-infra/nodepool: Move nodepool creation in tests to common method  https://review.openstack.org/16558103:49
openstackgerritgreghaynes proposed openstack-infra/nodepool: Monkeypatch Fake Clients for tests  https://review.openstack.org/16568203:49
openstackgerritgreghaynes proposed openstack-infra/nodepool: Add node deletion tests  https://review.openstack.org/16558203:50
*** woodster_ has quit IRC03:50
*** camunoz has joined #openstack-infra03:51
openstackgerritgreghaynes proposed openstack-infra/nodepool: Move nodepool creation in tests to common method  https://review.openstack.org/16558103:54
*** garyh has joined #openstack-infra03:54
*** tqtran has quit IRC03:55
*** sdake has joined #openstack-infra03:55
*** macjack has quit IRC03:56
*** tqtran has joined #openstack-infra03:56
openstackgerritgreghaynes proposed openstack-infra/nodepool: Monkeypatch Fake Clients for tests  https://review.openstack.org/16568203:56
openstackgerritgreghaynes proposed openstack-infra/nodepool: Add node deletion tests  https://review.openstack.org/16558203:56
*** sigmavirus24 is now known as sigmavirus24_awa03:57
*** sdake_ has quit IRC03:59
*** garyh has quit IRC04:00
*** sdake has quit IRC04:04
*** achanda has quit IRC04:05
openstackgerritgreghaynes proposed openstack-infra/nodepool: Move nodepool creation in tests to common method  https://review.openstack.org/16558104:06
*** jamielennox is now known as jamielennox|away04:06
openstackgerritgreghaynes proposed openstack-infra/nodepool: Monkeypatch Fake Clients for tests  https://review.openstack.org/16568204:06
*** VijayTripathi has joined #openstack-infra04:09
*** baoli has quit IRC04:10
*** mmedvede has joined #openstack-infra04:10
openstackgerritgreghaynes proposed openstack-infra/nodepool: Add node deletion tests  https://review.openstack.org/16558204:11
*** Alexandra_ has quit IRC04:15
*** dims has joined #openstack-infra04:19
*** markvoelker has joined #openstack-infra04:19
*** koolhead17 has joined #openstack-infra04:22
*** markvoelker has quit IRC04:24
*** carl_baldwin has joined #openstack-infra04:26
*** dims has quit IRC04:27
*** Sukhdev has joined #openstack-infra04:31
*** markvan has quit IRC04:33
openstackgerritKhai Do proposed openstack-infra/system-config: Fix jquery setup on Gerrit server.  https://review.openstack.org/16514504:33
*** mmedvede has quit IRC04:34
*** markvan has joined #openstack-infra04:35
*** achanda has joined #openstack-infra04:37
*** sdake has joined #openstack-infra04:38
*** dboik_ has quit IRC04:39
*** amotoki has joined #openstack-infra04:40
openstackgerritgreghaynes proposed openstack-infra/nodepool: Monkeypatch Fake Clients for tests  https://review.openstack.org/16568204:40
*** achanda has quit IRC04:41
openstackgerritgreghaynes proposed openstack-infra/nodepool: Move nodepool creation in tests to common method  https://review.openstack.org/16558104:45
*** dboik has joined #openstack-infra04:46
*** harlowja_at_home has joined #openstack-infra04:47
*** ddieterly has joined #openstack-infra04:48
openstackgerritgreghaynes proposed openstack-infra/nodepool: Monkeypatch Fake Clients for tests  https://review.openstack.org/16568204:51
openstackgerritgreghaynes proposed openstack-infra/nodepool: Add node deletion tests  https://review.openstack.org/16558204:51
*** ddieterly has quit IRC04:53
*** garyh has joined #openstack-infra05:00
*** __afazekas is now known as afazekas05:03
*** yamamoto has quit IRC05:07
*** achanda has joined #openstack-infra05:08
*** garyh has quit IRC05:09
openstackgerritKhai Do proposed openstack-infra/puppet-gerrit: Make mysql clients assume utf8 character encoding  https://review.openstack.org/16426405:11
*** sdake has quit IRC05:12
*** otter768 has joined #openstack-infra05:14
*** otter768 has quit IRC05:18
*** markvoelker has joined #openstack-infra05:20
*** jyuso1 has joined #openstack-infra05:25
*** markvoelker has quit IRC05:25
*** liusheng has quit IRC05:26
*** liusheng has joined #openstack-infra05:26
*** armax has quit IRC05:27
*** mmedvede has joined #openstack-infra05:28
*** yfried|afk is now known as yfried05:31
*** yamamoto has joined #openstack-infra05:32
*** dannywilson has quit IRC05:34
*** greghaynes has joined #openstack-infra05:39
*** carl_baldwin has quit IRC05:39
*** harlowja_at_home has quit IRC05:40
*** coolsvap is now known as coolsvap|afk05:41
*** ChuckC has quit IRC05:45
*** MarkAtwood has joined #openstack-infra05:45
*** coolsvap|afk is now known as coolsvap05:48
*** ddieterly has joined #openstack-infra05:49
*** ddieterly has quit IRC05:53
*** MarkAtwood has quit IRC05:55
*** ibiris_away is now known as ibiris05:58
*** yfried is now known as yfried|afk05:59
*** stevemar has quit IRC06:00
*** sarob has quit IRC06:05
*** subscope_ has joined #openstack-infra06:09
*** garyh has joined #openstack-infra06:10
*** BharatK has joined #openstack-infra06:10
*** yfried|afk has quit IRC06:12
*** dims has joined #openstack-infra06:13
*** dims has quit IRC06:19
*** garyh has quit IRC06:20
*** markvoelker has joined #openstack-infra06:21
*** bswartz has quit IRC06:24
*** markvoelker has quit IRC06:26
*** achanda has quit IRC06:28
*** tkelsey has joined #openstack-infra06:29
*** Sukhdev has quit IRC06:31
*** bswartz has joined #openstack-infra06:32
*** pcaruana has quit IRC06:33
*** tkelsey has quit IRC06:33
*** mfink_ has quit IRC06:35
*** mfink_ has joined #openstack-infra06:37
*** ddieterly has joined #openstack-infra06:50
*** kiseok7_ has joined #openstack-infra06:50
*** achanda has joined #openstack-infra06:52
*** jcoufal has joined #openstack-infra06:53
*** ddieterly has quit IRC06:55
*** achanda has quit IRC07:00
*** ujuc has joined #openstack-infra07:04
*** achanda has joined #openstack-infra07:04
*** sarob has joined #openstack-infra07:05
*** subscope_ has quit IRC07:07
*** jyuso1 has quit IRC07:08
*** pblaho has joined #openstack-infra07:09
*** sarob has quit IRC07:11
*** kiseok7_ has quit IRC07:11
openstackgerritFatih Degirmenci proposed openstack-infra/jenkins-job-builder: Pass through git revision to triggered jobs  https://review.openstack.org/16314907:12
openstackgerritAndreas Jaeger proposed openstack-infra/project-config: Fix failure detection in irc access check  https://review.openstack.org/16550807:12
openstackgerritAndreas Jaeger proposed openstack-infra/project-config: Fix failure detection in irc access check  https://review.openstack.org/16550807:13
*** tnovacik has joined #openstack-infra07:13
*** otter768 has joined #openstack-infra07:15
*** bhuvan has quit IRC07:18
*** otter768 has quit IRC07:19
*** garyh has joined #openstack-infra07:20
*** markvoelker has joined #openstack-infra07:22
*** yfried|afk has joined #openstack-infra07:25
*** markvoelker has quit IRC07:26
openstackgerritMerged openstack-infra/project-config: Move barbican client experimental job to voting gate job  https://review.openstack.org/16553407:27
openstackgerritMerged openstack-infra/project-config: Adding create permissions for networking-arista  https://review.openstack.org/16567307:28
*** yfried has joined #openstack-infra07:30
*** yfried|afk has quit IRC07:30
*** garyh has quit IRC07:31
*** scheuran has joined #openstack-infra07:33
*** VijayTripathi has quit IRC07:35
*** achanda has quit IRC07:37
*** HeOS has quit IRC07:38
*** ildikov has quit IRC07:38
*** mpaolino has joined #openstack-infra07:42
*** Bsony has joined #openstack-infra07:45
*** chlong has quit IRC07:47
*** mpavone has joined #openstack-infra07:50
*** ddieterly has joined #openstack-infra07:51
*** SumitNaiksatam has quit IRC07:51
*** belmoreira has joined #openstack-infra07:53
*** ominakov has joined #openstack-infra07:54
*** yfried is now known as yfried|afk07:54
*** ominakov has quit IRC07:54
*** ominakov_ has joined #openstack-infra07:54
*** ddieterly has quit IRC07:55
*** teran_ has joined #openstack-infra07:56
*** flaper87 has quit IRC07:56
*** flaper87 has joined #openstack-infra07:56
*** teran has quit IRC07:57
*** scheuran has quit IRC07:58
*** scheuran has joined #openstack-infra07:58
*** mrda is now known as mrda-away07:58
*** yfried|afk is now known as yfried08:01
*** mpaolino has quit IRC08:02
*** mpaolino has joined #openstack-infra08:03
*** dims has joined #openstack-infra08:04
*** markus_z has joined #openstack-infra08:07
*** dims has quit IRC08:10
*** hashar has joined #openstack-infra08:11
*** e0ne has joined #openstack-infra08:14
*** ildikov has joined #openstack-infra08:14
*** koolhead17 has quit IRC08:15
*** e0ne has quit IRC08:15
GheRiveromornig all08:16
*** koolhead17 has joined #openstack-infra08:16
*** jp_at_hp has joined #openstack-infra08:17
*** arxcruz has joined #openstack-infra08:17
*** jp_at_hp has quit IRC08:18
*** yfried is now known as yfried|afk08:19
*** koolhead17 has quit IRC08:21
*** tkelsey has joined #openstack-infra08:28
*** deepakcs has joined #openstack-infra08:29
*** dboik has quit IRC08:31
*** garyh has joined #openstack-infra08:31
*** mpaolino has quit IRC08:36
*** jistr has joined #openstack-infra08:37
*** mpaolino has joined #openstack-infra08:38
*** spredzy|afk is now known as spredzy08:39
*** yfried|afk is now known as yfried08:39
*** teran_ has quit IRC08:40
*** garyh has quit IRC08:41
*** fabo has quit IRC08:43
*** mfink_ has quit IRC08:49
openstackgerrityolanda.robla proposed openstack-infra/puppet-cgit: Add configurable setting for MaxClients  https://review.openstack.org/16088508:53
*** shardy_z is now known as shardy08:55
*** jlanoux has joined #openstack-infra08:56
*** flip214 has joined #openstack-infra08:57
*** andreykurilin_ has joined #openstack-infra08:58
*** nilasae has joined #openstack-infra08:59
flip214clarkb: ping09:00
*** ujuc has quit IRC09:02
*** koolhead17 has joined #openstack-infra09:04
*** YorikSar has joined #openstack-infra09:05
*** fabo has joined #openstack-infra09:05
*** tqtran has quit IRC09:09
*** mfink_ has joined #openstack-infra09:11
*** koolhead17 has quit IRC09:13
*** skolekonov has joined #openstack-infra09:14
*** otter768 has joined #openstack-infra09:16
openstackgerritAnthony Michon proposed openstack-infra/project-config: Add cerberus-dashboard, python-cerberusclient, sticks, sticks-dashboard, python-sticksclient  https://review.openstack.org/16387609:19
*** otter768 has quit IRC09:20
*** zz_johnthetubagu is now known as johnthetubaguy09:20
*** dims has joined #openstack-infra09:22
*** markvoelker has joined #openstack-infra09:23
*** HeOS has joined #openstack-infra09:23
*** dims has quit IRC09:26
*** Longgeek has quit IRC09:27
*** andreykurilin_ has quit IRC09:27
*** andreykurilin_ has joined #openstack-infra09:28
*** markvoelker has quit IRC09:28
*** freerunner has quit IRC09:28
*** Longgeek has joined #openstack-infra09:29
*** freerunner has joined #openstack-infra09:29
*** freerunner has quit IRC09:29
*** freerunner has joined #openstack-infra09:31
*** Longgeek has quit IRC09:31
*** freerunner has quit IRC09:31
*** freerunner has joined #openstack-infra09:33
*** mfink_ has quit IRC09:33
*** freerunner has quit IRC09:33
*** Somay has quit IRC09:35
*** ihrachyshka has joined #openstack-infra09:35
*** freerunner has joined #openstack-infra09:37
*** derekh has joined #openstack-infra09:37
*** freerunner has quit IRC09:37
*** Longgeek has joined #openstack-infra09:37
*** freerunner has joined #openstack-infra09:38
*** freerunner has quit IRC09:38
*** freerunner has joined #openstack-infra09:39
*** freerunner has quit IRC09:39
*** teran has joined #openstack-infra09:40
*** freerunner has joined #openstack-infra09:40
*** freerunner has quit IRC09:40
*** liusheng has quit IRC09:41
*** garyh has joined #openstack-infra09:42
*** liusheng has joined #openstack-infra09:42
*** teran_ has joined #openstack-infra09:43
*** ZZelle has quit IRC09:43
*** mfink_ has joined #openstack-infra09:43
*** ZZelle has joined #openstack-infra09:43
*** ihrachyshka has quit IRC09:44
*** freerunner has joined #openstack-infra09:44
*** teran has quit IRC09:45
*** freerunner has quit IRC09:45
openstackgerrityolanda.robla proposed openstack-infra/puppet-graphite: Add ability to configure retention policy  https://review.openstack.org/16167909:45
*** ihrachyshka has joined #openstack-infra09:45
*** dims has joined #openstack-infra09:45
*** freerunner has joined #openstack-infra09:46
*** freerunner has quit IRC09:46
*** rlucio has quit IRC09:47
*** andreykurilin_ has quit IRC09:47
*** freerunner has joined #openstack-infra09:47
*** freerunner has quit IRC09:48
*** amotoki has quit IRC09:48
*** freerunner has joined #openstack-infra09:50
*** Trozz has quit IRC09:50
*** mpaolino has quit IRC09:52
*** ddieterly has joined #openstack-infra09:52
*** garyh has quit IRC09:52
*** Trozz has joined #openstack-infra09:53
*** mpavone has quit IRC09:53
*** dtantsur|afk is now known as dtantsur09:54
*** yamamoto has quit IRC09:54
*** Hal1 has joined #openstack-infra09:54
*** ujuc has joined #openstack-infra09:54
*** Trozz is now known as Trozz_09:55
*** Trozz_ is now known as Trozz09:55
*** ssam2 has joined #openstack-infra09:56
*** ddieterly has quit IRC09:57
*** pelix has joined #openstack-infra10:02
*** ghostpl_ has joined #openstack-infra10:04
openstackgerrityolanda.robla proposed openstack-infra/puppet-jenkins: Add extension settings in jjb  https://review.openstack.org/16215210:04
*** zzz_pczesno has joined #openstack-infra10:10
*** wznoinsk_ has joined #openstack-infra10:10
openstackgerrityolanda.robla proposed openstack-infra/puppet-user: Ensure home and home/.ssh folders are created  https://review.openstack.org/16502910:10
*** hichihara has quit IRC10:11
*** alex_xu has quit IRC10:12
*** pczesno has quit IRC10:12
*** Longgeek has quit IRC10:13
*** Longgeek has joined #openstack-infra10:13
*** wznoinsk has quit IRC10:13
*** alex_xu has joined #openstack-infra10:14
*** ihrachyshka has quit IRC10:17
*** mmedvede has quit IRC10:18
*** yamamoto has joined #openstack-infra10:18
*** claudiub has joined #openstack-infra10:18
*** pc_m has joined #openstack-infra10:19
*** e0ne has joined #openstack-infra10:23
*** markvoelker has joined #openstack-infra10:24
*** Hal1 has quit IRC10:25
*** Hal1 has joined #openstack-infra10:26
*** markvoelker has quit IRC10:29
*** rlandy has joined #openstack-infra10:29
*** ihrachyshka has joined #openstack-infra10:29
*** rlucio has joined #openstack-infra10:33
*** yfried is now known as yfried|afk10:34
*** mpavone has joined #openstack-infra10:39
*** ujuc_ has joined #openstack-infra10:40
*** ujuc has quit IRC10:40
*** coolsvap is now known as coolsvap|afk10:41
rluciohey guys, are there any other options for hosting ci logs besides rolling your own server? like can you use dropbox or ftp or something?  just wondering10:46
*** dalgaaf has joined #openstack-infra10:49
*** mmedvede has joined #openstack-infra10:51
*** garyh has joined #openstack-infra10:53
*** ddieterly has joined #openstack-infra10:53
*** yfried|afk is now known as yfried10:54
*** mpaolino has joined #openstack-infra10:57
*** ddieterly has quit IRC10:58
*** mmedvede has quit IRC10:58
*** Hal1 has quit IRC10:59
*** Hal2 has joined #openstack-infra10:59
*** amotoki has joined #openstack-infra11:00
*** mfink_ has quit IRC11:00
*** mpaolino has quit IRC11:02
*** mpaolino has joined #openstack-infra11:03
*** jaypipes has joined #openstack-infra11:04
*** garyh has quit IRC11:04
*** pcaruana has joined #openstack-infra11:05
*** ominakov_ has quit IRC11:07
*** ominakov_ has joined #openstack-infra11:07
*** sambetts has quit IRC11:09
*** mpaolino has quit IRC11:11
*** sambetts has joined #openstack-infra11:11
*** mattfari_ has joined #openstack-infra11:11
*** mattfari_ has quit IRC11:12
*** macjack has joined #openstack-infra11:13
*** otter768 has joined #openstack-infra11:16
*** Somay has joined #openstack-infra11:17
*** otter768 has quit IRC11:21
*** mwagner_lap has quit IRC11:21
wznoinsk_rlucio: I haven't heard about any restrictions on where you want to host the artifacts, as long as it follows the requirements from http://ci.openstack.org/third_party.html11:24
*** wznoinsk_ is now known as wznoinsk11:24
*** yamamoto has quit IRC11:25
*** markvoelker has joined #openstack-infra11:25
*** Somay has quit IRC11:28
*** markvoelker has quit IRC11:30
*** Somay has joined #openstack-infra11:30
*** deepakcs has quit IRC11:30
*** Hal2 has quit IRC11:30
*** Hal1 has joined #openstack-infra11:30
mordredmorning GheRivero11:33
*** coolsvap|afk is now known as coolsvap11:36
openstackgerritAdrian Moisey proposed openstack-infra/jenkins-job-builder: Update the nofifications plugin to support 1.9  https://review.openstack.org/16578511:36
sdaguejhesketh: you still up and about?11:37
*** e0ne is now known as e0ne_11:37
sdagueI can no longer run os-loganalyze unit tests locally due to the swift code not failing gracefully11:38
openstackgerritAdrian Moisey proposed openstack-infra/jenkins-job-builder: Add missing '  https://review.openstack.org/16578611:38
*** e0ne_ has quit IRC11:42
*** mmedvede has joined #openstack-infra11:47
*** macjack has quit IRC11:47
*** e0ne has joined #openstack-infra11:48
*** spredzy is now known as spredzy|afk11:49
*** yamamoto has joined #openstack-infra11:50
openstackgerritAntoine Musso proposed openstack-infra/jenkins-job-builder: Stop mkdir output on each job write  https://review.openstack.org/16578911:50
sdaguemordred: any idea why the # of usable test nodes seems so low?11:52
sdaguelooks like we've only got about 150 in flight11:52
mordredsdague: we've been having cloud issues with HP - we may have leaked zombies again ... lemme look11:53
*** koolhead17 has joined #openstack-infra11:53
mordredoh. my11:53
mordredthat's ... interesting11:53
mordrednovaclient.exceptions.ClientException: An unexpected error prevented the server from fulfilling your request: [Errno 12] Cannot allocate memory (Disable debug mode to suppress these details.) (HTTP 500)11:53
*** weshay has joined #openstack-infra11:54
*** markvoelker has joined #openstack-infra11:54
mordredsdague: is that actually passed through from the server? ^^ ??11:54
*** ddieterly has joined #openstack-infra11:54
sdagueapparently if debug is enabled?11:55
sdaguethat's interesting11:55
*** mmedvede has quit IRC11:55
mordrednevermind - that's not hpcloud11:57
mordredthat's hp1.tripleo11:57
*** kgiusti has joined #openstack-infra11:58
*** ddieterly has quit IRC11:59
*** cdent has joined #openstack-infra11:59
sdagueheh11:59
*** lucasagomes has joined #openstack-infra11:59
*** aysyd has joined #openstack-infra12:00
*** woodster_ has joined #openstack-infra12:00
lucasagomeshi, if you have some time please take a look at https://review.openstack.org/#/c/163344/ it's adding a gate check for openstack/ironic-python-agent (non-voting) to test PXE drivers using the IPA ramdisk12:00
*** Hal1 has quit IRC12:01
lucasagomeswe want IPA to be our default deploy ramdisk, and it's already being tested on the Ironic gate. So it would be good to also check on the ironic-python-agent gate so we can capture failures quicker12:01
*** Hal2 has joined #openstack-infra12:01
*** garyh has joined #openstack-infra12:04
openstackgerritMonty Taylor proposed openstack-infra/nodepool: Don't die while doing alien list  https://review.openstack.org/16579212:04
*** claudiub_ has joined #openstack-infra12:05
*** Somay has quit IRC12:07
*** mwagner_lap has joined #openstack-infra12:07
*** claudiub has quit IRC12:07
*** rfolco has joined #openstack-infra12:10
*** ibiris is now known as ibiris_away12:13
*** dprince has joined #openstack-infra12:13
*** chlong has joined #openstack-infra12:14
*** garyh has quit IRC12:14
*** adalbas has joined #openstack-infra12:15
alex_xuianw: good morning! I updated this patch https://review.openstack.org/160068 hope you can take a look at, just left few hours for freeze~ And the patch depend on this passed the ci tests https://review.openstack.org/150267  Thanks in advance!12:19
*** Longgeek has quit IRC12:20
*** Longgeek has joined #openstack-infra12:20
mordredsdague: ok - I'm deleting a bunch of alien nodes on hpcloud12:20
sdaguemordred: thanks12:20
mordredclarkb, fungi, jeblair: I'm a bit concerned about these aliens - I do not see the pattern clarkb was talking about yesterday where they'd been deleted and then came back12:21
mordredI see no delete calls for them in teh logs at all, in fact - but they sure are not things nodepool groks12:21
mordredsdague: btw - I just learned about block_device_mapping_v212:22
openstackgerritJaroslav Henner proposed openstack-infra/jenkins-job-builder: Add checking of multijob phase condition.  https://review.openstack.org/16466912:23
mordredsdague: a) dude what a terrible name ... but b) exposes the type of capability I really want in things, so awesome12:23
*** ajmiller has quit IRC12:23
*** sdake has joined #openstack-infra12:24
*** bknudson has quit IRC12:25
mordredsdague: honestly, if there was a similar parameter for floating_ips, like "floating_ip_mapping = [{ delete_on_terminate: True, id = <floating_ip_uuid>, pool = <pool name>}] and if id was left out it would just make one for you - I think I'd have everything I need12:25
mordredsdague: well, except for the ssh hostkey thing - but I'll write that in the next cycle12:25
*** sdake_ has joined #openstack-infra12:25
mordredsdague: do I need to be writing a spec for that for liberty already? or is it still too early (never written a nova spec before)12:25
*** chlong has quit IRC12:26
sdaguespec is fine at this point, I think they are effectively already open for liberty12:26
*** spredzy|afk is now known as spredzy12:29
openstackgerritSean Dague proposed openstack-infra/os-loganalyze: fix supports_sev matching  https://review.openstack.org/16554212:29
openstackgerritSean Dague proposed openstack-infra/os-loganalyze: let tests be run from test file location  https://review.openstack.org/16579912:29
*** sdake has quit IRC12:29
sdagueclarkb: I respun your fix with some test cases just to convince myself that \b would work in this case12:30
*** BobBall_AWOL is now known as BobBall12:31
openstackgerritAdrian Moisey proposed openstack-infra/jenkins-job-builder: Update the nofifications plugin to support 1.9  https://review.openstack.org/16578512:32
*** Hal2 has quit IRC12:32
*** Hal1 has joined #openstack-infra12:32
*** dkranz has joined #openstack-infra12:32
*** e0ne is now known as e0ne_12:33
*** gordc has joined #openstack-infra12:34
*** ghostpl_ has quit IRC12:35
*** gordc has joined #openstack-infra12:37
*** Longgeek_ has joined #openstack-infra12:37
openstackgerritAdrian Moisey proposed openstack-infra/jenkins-job-builder: Update the nofifications plugin to support 1.9  https://review.openstack.org/16578512:39
*** Longgeek has quit IRC12:41
*** mrmartin has joined #openstack-infra12:41
*** achanda has joined #openstack-infra12:42
*** chlong has joined #openstack-infra12:43
*** bswartz has quit IRC12:43
*** unicell1 has joined #openstack-infra12:43
*** e0ne_ has quit IRC12:43
*** unicell has quit IRC12:44
*** bknudson has joined #openstack-infra12:45
*** achanda has quit IRC12:47
*** ujuc_ has quit IRC12:47
*** eharney has quit IRC12:53
*** mfink_ has joined #openstack-infra12:53
*** ddieterly has joined #openstack-infra12:55
*** ddieterly has quit IRC12:59
openstackgerritMonty Taylor proposed openstack-infra/system-config: Remove HP TripleO region  https://review.openstack.org/16581413:00
*** dims has quit IRC13:01
*** sdake_ has quit IRC13:01
*** dims has joined #openstack-infra13:02
*** dboik has joined #openstack-infra13:03
mordredNg: any luck with rax glance?13:04
openstackgerritMonty Taylor proposed openstack-infra/shade: Add 'rebuild' to shade  https://review.openstack.org/16106713:05
*** mjturek1 has joined #openstack-infra13:06
*** jistr has quit IRC13:08
*** ldnunes has joined #openstack-infra13:08
*** jistr has joined #openstack-infra13:09
*** sdake has joined #openstack-infra13:09
*** sdake_ has joined #openstack-infra13:12
*** dustins has joined #openstack-infra13:12
*** bswartz has joined #openstack-infra13:14
*** garyh has joined #openstack-infra13:15
*** sdake has quit IRC13:16
*** mrmartin has quit IRC13:17
*** marcusvrn1 has joined #openstack-infra13:17
*** ddieterly has joined #openstack-infra13:17
*** otter768 has joined #openstack-infra13:17
*** marcusvrn has quit IRC13:17
sdaguemordred: any luck in freeing up servers?13:18
*** dboik has quit IRC13:18
sdaguelooks like changes are waiting about an hour to get nodes in check right now, would be nice to help with that13:19
sdagueespecially as very few folks are awake atm13:19
*** ddieterly has quit IRC13:19
pabelangerafazekas, rumor has it you've done some work with multi-node jobs with nodepool?  I was curious if you didn't mind pointing to some code for that13:19
*** ibiris_away is now known as ibiris13:19
*** chlong has quit IRC13:21
*** mrmartin has joined #openstack-infra13:21
*** otter768 has quit IRC13:22
ttxsdague: some check jobs have been in queue for 13 hours now13:22
openstackgerritMonty Taylor proposed openstack-infra/system-config: Remove tripleo-test-cloud-hp1 from nodepool config  https://review.openstack.org/15661513:23
ttxand jenkins04 fails to respond13:23
sdagueoh, that's a different issue then13:23
ttxwe might have a stuck jenkins instance13:24
mordredsdague: I'm not convinced the thing I was looking at was the thing13:24
ttxall stuck jobs seem to be on jenkins0413:24
mordredyah13:24
mordredjenkins04 is hanging when I try to ssh to  it13:24
*** sdake has joined #openstack-infra13:25
ttxyeah, confirmed all jobs >12hours wait are all on jenkins0413:25
ttxyeah, confirmed all jobs >12hours wait are all on jenkins0413:25
*** garyh has quit IRC13:25
*** ilyashakhat has quit IRC13:25
*** peristeri has joined #openstack-infra13:26
anteayastuck ttx?13:26
sdaguejenkins crash probably13:27
*** e0ne has joined #openstack-infra13:27
* mordred poking it with a stick13:27
ttxanteaya: some check jobs have been waiting for test results for 14 hours now13:28
*** sdake_ has quit IRC13:28
anteayattx: you posted the same thing twice13:29
*** ghostpl_ has joined #openstack-infra13:29
anteayattx: my lame attempt at making a funny13:29
Ngmordred: nope. starting to get quite annoyed at it. AFAICS I have the latest python-glanceclient installed fine in a venv, I'm exporting all the environment stuff I can, and still just 401. If you have a working environment, can we figure out what's different about mine from yours? like redact your password from whatever environment stuff you source, and a13:29
Ngpip freeze?13:29
anteayattx: and yes looking at the status page I confirm your findings13:29
mordredNg: yup - did you get my paste yesterday?13:29
*** garyh has joined #openstack-infra13:29
devanandaI think https://jenkins04.openstack.org/job/check-tempest-dsvm-ironic-pxe_ipa-nv/40/ is stuck ?13:29
Ngmordred: hrm, I must have missed that13:30
mordreddevananda: yes. everything on jenkins04 went south13:30
devanandais there a way that it can be ignored at this point, as a non-voting job13:30
devanandatha tis now blocking a patch13:30
mordreddevananda: not really - sorry - it's going to have to get rekicked13:30
devanandamordred: ah. lovely13:30
* mordred just hard-rebooted jenkins0413:30
devanandagotcha13:30
mordredthere will be a bit of fallout from that13:30
mordredNg: you're probably missing export OS_IMAGE_API_VERSION=213:30
Ngmordred: I definitely have that exported13:31
devanandamordred: so, how can I initiate a recheck, when zuul hasn't finished the last job run?13:31
mordredok. let me give you redacted13:31
Ngta13:31
*** baoli has joined #openstack-infra13:32
devanandaor will zuul figure that out in a minute on its own?13:32
Ng-(cmsj@stonker)-(~)- glance --version13:32
Ng0.16.113:32
Ng-(cmsj@stonker)-(~)- echo $OS_IMAGE_API_VERSION13:32
Ng213:32
mordreddevananda: well, you could do something silly like change a punctuation in the commit message ttw13:32
devanandagah. right.13:32
mordredNg: OS_TENANT_NAME ?13:32
* devananda forgets these silly tricks sometimes13:32
Ngmordred: I have that set to my account number13:33
mordredgreat. like 6 digits or whatever?13:33
Ngyeah13:33
mordredNg: http://paste.openstack.org/show/193563/13:34
*** Longgeek_ has quit IRC13:35
mordredNg: and 'glance image-list' works fine for me with those on glanceclient 0.16.113:36
mordredNg: if you do 'keystone catalog' - do you get an entry for Service: image ?13:36
mordredttx: we're definitely feature freeze week, aren't we? :)13:37
ttxmordred: even FF day. The day I need that queue thing to work :)13:38
Ngmordred: nup, keystone is 401ing all over town too. Comparing to my env, I'm on a different region (their UK one). I'm wondering if they're not actually all the same13:38
Ngso far the only api client I've managed to get working with it, is their hacked up version of novaclient13:39
Ng(rackspace-novaclient in pypi)13:39
*** dboik has joined #openstack-infra13:40
mordredoh gross. do you have access to the non-UK regions? like, can you try doing against DFW?13:40
mordredttx: the day you need it to work is the day it never will13:40
*** nelsnelson has joined #openstack-infra13:42
Ngmordred: same. I wonder if the reverse would work - if you set OS_REGION_NAME=LON and OS_AUTH_URL=https://lon.identity.api.rackspacecloud.com/v2.0/13:43
*** sarob has joined #openstack-infra13:43
mordredlet me try13:43
*** mrmartin has quit IRC13:44
Ngthe web console seems to suggest taht accounts are region-locked, in that I can't do stuff in DFW via that, it tells me to create a new account13:44
mordredNg: HEADDESK13:44
NgI'm gonna go back to the support people and say I know for a fact this works against DFW, and if this somehow isn't possible against LON, can I have my account switched over/recreated13:45
mordredpublic endpoint for image service in LON region not found13:45
mordredis what I get when I try london13:45
*** tnovacik has quit IRC13:46
mordredjroll, JayF: you both have magical rackspace powers, right? pvo isn't ever on IRC anymore, so I can't poke him ...13:46
*** ddieterly has joined #openstack-infra13:46
*** mtanino has joined #openstack-infra13:47
*** dkliban_afk is now known as dkliban13:47
anteayamordred: what is the status on jenkins04?13:48
*** sarob has quit IRC13:48
anteayasince you rebooted13:48
anteayaand it looks to me like we might need to clean up more alien nodes like clarkb did last night13:49
ttxzuul still shows them as stuck13:49
anteayamordred: do you know how to do that?13:49
* mordred is cleaning up alien nodes13:49
anteayathank you, that will help13:50
mordredhowever, there may be additional cleanup tasks that jeblair and clarkb suggest13:50
anteayadoing what you can is appreciated13:50
anteayait being feature freeze day and all13:51
mordredyah13:51
anteayathanks13:51
*** amitgandhinz has joined #openstack-infra13:52
*** eharney has joined #openstack-infra13:53
*** larsks has joined #openstack-infra13:53
*** mfink__ has joined #openstack-infra13:53
anteayathe number of zuul workers looks funny to me13:54
anteayain that it looks like there aren't many13:54
*** mfink_ has quit IRC13:54
fungiskimmed scrollback. is jenkins04 still in need of help?13:55
anteayaor perhaps those are available workers, so the number is low if they aren't available13:55
anteayafungi: I'm uncertain13:55
fungiahh, looks like it got restarted but its slave list was not evacuated13:55
*** ddieterly has quit IRC13:56
fungiso it has many hundreds of offline slaves according to its webui13:56
anteayaawesome13:56
jrollmordred: what's up13:56
fungiokay, maybe not many hundreds, but a few hundred13:56
*** mriedem_away is now known as mriedem13:57
*** mfink_ has joined #openstack-infra13:57
jrollmordred: the only special rackspace power I have is internal irc, but if you have a paste or whatever with some details I can link it to people13:57
*** ddieterly has joined #openstack-infra13:58
*** aysyd has quit IRC13:58
openstackgerritMonty Taylor proposed openstack-infra/nodepool: Record interesting info into nova metadata  https://review.openstack.org/12662113:58
*** wuhg has joined #openstack-infra13:59
jrollmordred: reading again this looks like an incoming rant about uk vs us accounts... I can't help with that but it is being worked on(tm)13:59
mordredfungi: yes - jenkins04 was completely dead13:59
*** aysyd has joined #openstack-infra13:59
mordredjroll: awesome - so, is the best bet just for Ng  to poke support and get his account move - or to get a new account created?13:59
*** koolhead17 has quit IRC13:59
*** spzala has joined #openstack-infra13:59
anteayamordred: so fungi appears to be suggesting that evacuating a jenkins slave list prior to a reboot might be something to consider for next time14:00
mordredjroll: it seems that the UK region has different operational semantics too - as in python-glanceclient does not work there14:00
mordredanteaya: sure - but the server was completely hung and it was not possible to connect to it14:00
fungiany more i've taken to killing the jenkins process, deleting all the lines inside <slaves></slaves> from the config.xml, marking everything in the nodepool database assigned to that jenkins as delete, then starting the jenkins process again14:00
fungiannoying but effective14:00
mordredfungi: it was not possible to ssh in to the box14:00
*** mfink__ has quit IRC14:00
fungioh marvellous14:00
mordredyeah.14:00
mordredthis is why I hard rebooted it14:00
jrollmordred: yeah, it's a huge problem. I hate it.14:00
mordredit was stuck very badly14:00
fungianything interesting on the virtual console, or did you have a chance to check?14:00
mordredfungi: the virtual console also did not work :)14:00
jrollmordred: Ng: I'm not sure if poke support or make a new account is the best route, support can tell you :P14:01
fungiwow awesome14:01
mordredfungi: yah. it was pleasing14:01
Ngjroll: thanks. I shall see what my options are with support :)14:01
*** asselin_ has joined #openstack-infra14:01
Nghopefully all of my options involve fanaticism of some kind14:01
*** zz_dimtruck is now known as dimtruck14:01
*** yfried is now known as yfried|afk14:01
fungii'm going what i can to get as much cleaned up for nodepool as possible, and giving the other jenkins masters a health check14:02
*** sigmavirus24_awa is now known as sigmavirus2414:02
anteayafungi: cool, thanks14:02
jrollNg: IMO this separate account thin is strictly non-fanatical :)14:02
fungijroll: Ng: fantastical!14:02
*** dkranz has quit IRC14:03
jrollheh14:03
fungijenkins01 is pretty annoyed too14:04
anteaya:(14:04
fungibut not dead yet14:04
anteayayay for life14:04
fungii'll get it into a graceful restart cycle as soon as possible14:04
fungiand clean it up while it's restarted for good measure14:05
mordredfungi: I've got a loop going deleting things from alien-list btw14:05
* anteaya nods14:05
fungimordred: awesome. i hadn't gotten that far yet, cleaning up stale tracked nodes in nodepool first14:05
*** mfink__ has joined #openstack-infra14:05
openstackgerritMerged openstack-infra/system-config: Work-around grubby boot failure on Fedora 20  https://review.openstack.org/16568114:06
fungisince there seemed to be rather a few in ready for far longer than is likely, and no sign of them on the jenkins masters to which they claimed to belong14:06
fungithat has a tendency to throw off the demand calculations pretty badly14:06
fungioh yeah jenkins01 was not much longer for this life. took several minutes just to get logged into the webui14:06
pabelangerWas there a mid-cycle infra meeting this time around?  If so, where could I look to locate the discussions?  I don't see much on the infra wiki page14:07
*** achuprin has joined #openstack-infra14:07
anteayafungi: glad you caught it14:07
fungipabelanger: we don't usually have mid-cycle meetups14:07
fungipabelanger: we prefer to do virtual sprints and similar sorts of activities where more people can participate without needing to travel14:07
pabelangerfungi, Oh, perhaps I'm thinking of informal meetings.  Didn't something happen in NYC and Australia?14:08
anteayanyc was in 2013 in june, we had an onboarding14:09
*** yfried|afk is now known as yfried14:09
anteayaaustralia was lca and we saw each other there14:09
pabelangerMakes sense14:09
mordredpabelanger: yeah - no mid-cycle this time14:09
anteayathe only thing that took place there that had an etherpad was I locked markmcclain and gus in a room to come up with a direction for nova-net to neutron migration14:09
fungiyeah, it's more that we have opportunistic gatherings when enough people already expect to be around14:09
*** radez_g0n3 is now known as radez14:09
anteayayes that14:09
fungithe other jenkins masters look like they're doing okay for now. once jenkins01 is back to normal i'll gracefully restart 04 so that i can delete its pile of cruft slaves14:11
*** mattfarina has joined #openstack-infra14:12
*** claudiub has joined #openstack-infra14:12
*** mfink__ has quit IRC14:12
*** mfink_ has quit IRC14:12
*** mfink_ has joined #openstack-infra14:13
*** claudiub_ has quit IRC14:14
*** ajmiller has joined #openstack-infra14:14
*** stevemar has joined #openstack-infra14:14
openstackgerritClaudiu Belu proposed openstack-infra/project-config: Adds compute-hyperv in StackForge  https://review.openstack.org/16561114:14
fungimordred: wow. we have 555 unassociated floating ips in hpcloud. cleaning those up now14:15
fungithough these may be from churn... some of them are already coming up not found when deleting14:17
devanandais jenkins06 having trouble now?14:17
fungidevananda: i don't think so--seems to be running jobs14:18
fungidevananda: what makes you ask?14:18
* devananda is watching https://jenkins06.openstack.org/job/check-heat-dsvm-functional-mysql/662/14:19
devanandaperhaps just slow ...14:19
*** dhellmann has quit IRC14:19
*** dhellmann has joined #openstack-infra14:19
devanandaah, yep. it's alive still14:20
fungimostly dead is still partly alive!14:20
devanandaindeed!14:20
devananda(and it passed. yay!)14:20
*** timcline has joined #openstack-infra14:21
fungii'm rerunning my fip cruft check with a 5-minute separation instead of 114:21
fungihopefully this will be marginally more accurate14:21
*** e0ne is now known as e0ne_14:22
*** ayoung has quit IRC14:22
*** e0ne_ is now known as e0ne14:23
*** _nadya_ has joined #openstack-infra14:24
*** mrmartin has joined #openstack-infra14:25
*** erlon has joined #openstack-infra14:26
*** yamamoto has quit IRC14:27
openstackgerritMerged openstack-infra/system-config: Remove tripleo-test-cloud-hp1 from nodepool config  https://review.openstack.org/15661514:28
*** sushilkm has joined #openstack-infra14:28
fungimordred: something else is wrong with floating ips in hpcloud14:29
*** jcoufal_ has joined #openstack-infra14:29
sushilkmThe gating jobs for trove seem to stuck14:29
sushilkmhttps://jenkins04.openstack.org/job/check-grenade-dsvm/28125/console14:29
sushilkmI think it needs to be restarted14:29
sushilkmOr some fix14:29
mordredfungi: sweet!14:29
fungimordred: `nova floating-ip-delete 01178e82-74c6-4da4-8a60-3315a3a4be7c` returns "ERROR (CommandError): Floating IP 01178e82-74c6-4da4-8a60-3315a3a4be7c not found." but `nova floating-ip-list` says it exists14:30
openstackgerritSean Dague proposed openstack-infra/os-loganalyze: extract static methods  https://review.openstack.org/16585014:30
openstackgerritSean Dague proposed openstack-infra/os-loganalyze: unwind test class multiple inheritance  https://review.openstack.org/16585114:30
fungimordred: nevermind. looks like nova can't delete floating ips by uuid any longer, only by ip address14:30
anteayasushilkm: jenkins04 was stuck and has been rebooted14:32
mordredfungi: neutron can delete them by uuid14:32
*** sdake_ has joined #openstack-infra14:32
*** sdake_ has quit IRC14:32
mordredfungi: and it's more efficient to delete them with neutron, fwiw14:32
*** jcoufal has quit IRC14:32
anteayasushilkm: when jenkins comments on your patch you may need to recheck to send it through again14:32
fungisushilkm: yes, i think we're probably going to have to restart zuul to clear those up. it doesn't seem to know that the jobs which were running on jenkins04 before it died are no longer running14:32
*** sdake__ has joined #openstack-infra14:33
sushilkmYes its stuck since 13 hrs14:33
*** yamamoto has joined #openstack-infra14:33
*** sdake has quit IRC14:34
sdaguefungi: or push a new patch14:34
sdaguesushilkm: edit the commit message to kick it back into the pool14:35
fungisdague: yeah, i was just considering that. there's only 13 in that state14:35
sdaguebecause they'll need to rerun for new tests anyway14:35
fungiso maybe better than restarting the whole world14:35
sdagueyeh14:35
*** achanda has joined #openstack-infra14:35
anteayayes14:35
anteayafungi: do you have a list of the 13? I can edit and get them back in14:35
sushilkmWud the recheck not work14:35
fungianteaya: i can see them in the top-left corner of the zuul status page14:36
sdagueit will not14:36
fungianteaya: the 13 in the check pipeline with times of 13+ hours in queue14:36
anteayafungi: ah okay I will get them14:36
fungii mean, they're not doing any harm being there14:37
anteayasushilkm: recheck only works if jenkins is finished and has reported back14:37
anteayasushilkm: it won't work if they are stuck in the queue14:37
fungiso if people who own those 13 changes care they're also welcome to just push a new patchset, tweak the commit message in the webui, whatever14:37
anteayaoh okay14:37
fungiit's an independent pipeline, so it's not tying up any other changes14:37
anteayawas just going to be neighbourly14:37
anteayabut I don't have to14:37
fungii'm currently more concerned about getting our worker capacity back to sanity14:38
sdagueit might be good to wait on flushing them all until we have more ready nodes14:38
sdaguefungi: ++14:38
mordredanteaya: when I want to be neighborly, I tend to give people fluffy large animals14:38
*** wenlock has joined #openstack-infra14:38
sdaguewe're at < 100 in use nodes14:38
mordredanteaya: although I've been considering adding sea mammals to the mix recently14:38
anteayamordred: I see that14:38
sdaguewhich is quite bad for throughput14:38
anteayamordred: oh like what?14:39
fungiyou're not kidding14:39
mordredanteaya: your approach is likely more productive than mine14:39
mordredanteaya: I  mean, dolphins are a good start - but manatees could also be fun14:39
anteayamordred: yours makes more friends though14:39
anteayamordred: my approach doesn' tend to make friends14:39
*** tonytan4ever has joined #openstack-infra14:40
mordredanteaya: maybe you should try distributing mooses?14:40
anteayanot sure that would achieve the goal14:40
mordredanteaya: probably not14:40
anteayameese tend to take up quite a bit of space14:41
anteayagot chased by a moose once, that was exciting14:41
mordredanteaya: but they also tend to keep home invaders at bay14:41
*** sushilkm has left #openstack-infra14:41
sdagueah, the openssl vuln is just a DoS attack, not an exploit - https://www.openssl.org/news/vulnerabilities.html14:41
anteayathey consider anyone not them to be a home invader14:42
sdaguehopefully that means no need to take down services for patching14:42
*** baoli has quit IRC14:42
mtreinishsdague: oh good, I wasn't really looking forward to having to do that again14:42
sdagueoh, wait, there are a ton of CVEs in that list today14:42
*** spzala has quit IRC14:43
*** baoli has joined #openstack-infra14:43
*** mfink_ has quit IRC14:43
*** achanda has quit IRC14:43
*** mfink_ has joined #openstack-infra14:43
sdaguethere are 12 CVEs today actually14:43
anteaya:(14:43
sdagueit looks like they are all DoS or segfault14:44
*** sarob has joined #openstack-infra14:44
fungiyeah, it's a fun batch14:45
fungii'm still mulling it over14:45
mordredanteaya: yes - I mean, they also keep you away from your home - but your home will certainly be safe from other people if you're worried about that sort of thing14:45
anteayaright14:45
*** baoli has quit IRC14:45
anteayaif my home happens to be a muskeg swamp14:46
anteayait will be nice and safe14:46
mordredanteaya: I don't want to judge people's home choices14:46
anteayaI did live close for a period of time14:46
anteayawas nice14:46
anteayahence the experience being chased14:46
*** baoli has joined #openstack-infra14:47
*** dkranz has joined #openstack-infra14:47
*** yfried is now known as yfried|afk14:48
*** timcline has quit IRC14:48
jeblairfungi: only 5 leaked ports14:48
anteayamorning jeblair14:48
*** mrmartin has quit IRC14:48
*** timcline has joined #openstack-infra14:49
*** sarob has quit IRC14:49
jeblairanteaya: morning14:49
jeblairfungi: are you working on jenkins01?14:49
*** prad has joined #openstack-infra14:49
*** flip214 has left #openstack-infra14:50
jeblairah yeh, it's down to 2 jobs14:50
*** MarkAtwood has joined #openstack-infra14:50
*** bhunter71 has joined #openstack-infra14:51
*** jlanoux has quit IRC14:51
*** garyh has quit IRC14:51
*** mrunge has quit IRC14:52
*** yfried|afk is now known as yfried14:52
*** carl_baldwin has joined #openstack-infra14:54
fungijeblair: yeah waiting for jobs to wrap up there14:54
jeblairfungi: most of the errors i'm seeing are timeouts on server deletion; i'm not actually seeing any build errors14:55
fungitwo more running, almost complete14:55
jeblairi've double checked the ssh timeouts i see, and those are actually just nodes that were in the process of being deleted but got picked up by the periodic check14:55
fungiinterestingly, we have a crazy high number of nodes in a building state proportional to everything else14:56
*** marcusvrn has joined #openstack-infra14:56
*** sarob has joined #openstack-infra14:56
openstackgerritAnthony Michon proposed openstack-infra/project-config: Add cerberus related projects  https://review.openstack.org/16387614:57
jeblairwe also have quite a few ready nodes on jenkins01 and 04.  i suspect those are errors.  i will delete them14:57
*** timcline has quit IRC14:57
*** timcline has joined #openstack-infra14:57
fungisounds like a good idea. i'm also planning to blow away everything assigned to jenkins01 here momentarily, as soon as it's down14:58
fungiand then circle back around to do the same for jenkins0414:58
fungiwe have 82 nodes in a building state for more than an hour, all in hpcloud14:59
*** armax has joined #openstack-infra14:59
fungi320 for more than 30 minutes, also all in hpcloud15:00
anteaya:(15:00
fungisomething's gumming up the works there15:00
anteayasame region?15:00
fungiwe only use one region, but i'll correlate to az15:00
anteayaoh15:00
anteayasorry15:01
*** dboik_ has joined #openstack-infra15:01
mordredanteaya: no need to be sorry - it was the right question15:01
*** mrunge has joined #openstack-infra15:01
anteayak15:01
fungi78% are in az315:01
anteayawow15:01
fungimordred: jeblair: ^15:01
mordredfungi: excellent15:02
jeblairfungi: we target 63% of our nodes to be in az315:02
jeblairfungi: so that may not be off-mark15:02
*** yfried is now known as yfried|afk15:02
fungiyeah, so maybe slightly inflated ratio but not crazy then15:03
fungiso may just be roughly evenly distributed by assigned quota15:03
*** garyh has joined #openstack-infra15:03
*** dboik has quit IRC15:04
*** pblaho_ has joined #openstack-infra15:04
*** asselin_ has quit IRC15:05
*** pblaho has quit IRC15:05
*** dkranz has quit IRC15:05
*** mrunge has quit IRC15:06
*** sputnik13 has quit IRC15:07
*** sdake has joined #openstack-infra15:07
*** timcline has quit IRC15:08
*** sputnik13 has joined #openstack-infra15:09
*** timcline has joined #openstack-infra15:09
*** sputnik13 has quit IRC15:09
*** wayneeseguin has quit IRC15:10
sdagueanteaya: what's the policy on things like this random 'noop' comment - https://review.openstack.org/#/c/165863/115:11
sdaguelooks like a poorly setup test system15:11
*** spzala has joined #openstack-infra15:11
*** wayneeseguin has joined #openstack-infra15:11
*** sdake__ has quit IRC15:11
anteayasdague: policy is they get disabled15:12
fungiokay, jenkins01 has finally quiesced so i'm stopping it and cleaning up all its workers now15:12
sdagueanteaya: could you do that please? thanks in advance15:12
anteayasdague: dims wanted a chance to reach out to them, unless you would like them disabled now15:12
anteayasdague: sure15:12
anteayacan someone please disable gerrit account id 13431: http://paste.openstack.org/show/193589/ https://review.openstack.org/#/c/165863/1 personal account posting ci comments15:15
*** mika has quit IRC15:15
jeblairanteaya: done15:16
anteayajeblair: thank you15:16
dimsanteaya: sdague: just emailed them15:16
anteayaI'll tell them they are disabled15:16
dimsanteaya: ack thanks15:16
*** dkranz has joined #openstack-infra15:17
*** otter768 has joined #openstack-infra15:18
jd__can I haz an ETA on https://review.openstack.org/#/c/162146/ because it's kinda "blocking" us to release?15:18
anteayadims: keep an eye on them for me will you? we might have to clear cache if they have an open connection to prevent them from continuing to post comments15:19
*** hashar has quit IRC15:19
anteayajd__: we are dealing with multiple challenges today15:20
anteayawe can't give ETA's15:20
*** masayukig_ has joined #openstack-infra15:20
fungijenkins01 is on its way back up now. all nodepool nodes for it have been marked delete and all slave entries have been deleted from its config15:20
anteayawe are however doing everything we can to help developers and release mangers achieve their goals15:20
anteayawe have a shortage of workers to run tests and are doing what we can to make more available15:21
anteayafungi: yay15:21
*** dannywilson has joined #openstack-infra15:21
clarkbrenaming a project requires a scheduled gerriy downtime15:21
clarkbjd__ why is that blocking a release? cant you release gnocchi from the stackforge repo?15:22
*** Hal1 has quit IRC15:22
*** mika has joined #openstack-infra15:22
*** otter768 has quit IRC15:23
*** yfried|afk is now known as yfried15:23
*** masayukig_ has quit IRC15:23
sdagueclarkb: https://review.openstack.org/#/c/165542/ - I redid your fix of my fix with a test to convince myself that your fix was the right one15:23
fungijd__: best guess on the project renames is that we'll batch them up a week from saturday, since we don't want to add potential confusion to this saturday's gerrit os upgrade/ip address change15:24
clarkbsdague thanks15:24
sdaguethere is also a stack of test refactorings above that which are not urgent, but while I was in the code I did it anyway.15:24
fungijd__: but we'll probably discuss the plan in greater detail in tuesday's meeting15:24
*** ChuckC has joined #openstack-infra15:24
*** dannywil_ has joined #openstack-infra15:24
mordredfungi: my loop of cleaning up all of the aliens has finished, btw15:25
fungimordred: awesome. stale floating ip deletion is still going, though jenkins01 is starting to get workers again15:25
jeblairfungi, mordred: i'm seeing a lot of 2015-03-19 15:24:18,583 ERROR nodepool.NodeLauncher: EOFError launching node id: 1365036 in provider: hpcloud-b3 error:15:25
jeblaircorvus@fuligin:~$ ssh 15.126.138.9315:25
jeblairConnection closed by 15.126.138.9315:25
fungihere in a moment i'll set jenkins04 to prepare for shutdown in preparation to clean it up15:25
*** dannywilson has quit IRC15:26
jd__clarkb: I can release but I wanted to test things like PyPI etc at the same time as it's our first release; so it's not "blocking" (note the "") it's just less fun ;)15:26
mordredso - I've got a bunch of things in alien list again15:26
jd__fungi: ok, thanks :)15:26
mordredI think something is going south there15:26
clarkb04 appears to have leaked threads according to melody so the ssh-slaves plugin update does not correct that behavior :/15:26
fungijeblair: great. that could be anything from broken filesystem to sshd child processes getting killed to network issues15:26
jeblairmordred: make sure you take 2 snapshots and compare15:27
jeblairmordred: new nodes briefly show up in alien lists during builds15:27
mordredgotcha15:27
mordredjeblair: Conflict: Instance 2a478e27-ab9e-48c7-8d14-14190a95b9ed is locked (HTTP 409) (Request-ID: req-e0f296ad-4306-4e6c-a3ba-7df1dd87cc80)15:27
mordredwhat does that mean?15:27
clarkbhttps://jenkins04.openstack.org/monitoring?part=graph&graph=threadCount15:27
jeblairmordred: no idea; last time i asked the noc about it i didn't get a response15:27
clarkbmordred its nova locked15:28
*** masayukig_ has joined #openstack-infra15:28
clarkbbut nova unlock doesnt work on it15:28
anteayaclarkb: :/ for threads15:28
jeblairwhat does "nova locked" mean?15:28
jeblairand how does something become "nova locked"?15:28
*** AJaeger has joined #openstack-infra15:28
clarkbjeblair `nova lock` is the command15:28
mordredclarkb: but we don't run that anywhere, do we?15:28
AJaegerfungi, could you review jeblair's irrc check access patch again, please? I rebased it... https://review.openstack.org/#/c/165508/15:29
clarkbthe help text is minimal and I think it says "lock an instance" so the actual semantics are fuzzy there15:29
clarkbmordred no15:29
*** masayukig has quit IRC15:29
*** jlanoux has joined #openstack-infra15:29
jeblairyeah, so i'm no closer to knowing what that means15:29
mordredclarkb, jeblair: I'm asking the noc about 2a478e27-ab9e-48c7-8d14-14190a95b9ed15:29
mordredwhich our logs show us as having requested deletion for but is in active state15:30
*** sdake_ has joined #openstack-infra15:30
*** Hal1 has joined #openstack-infra15:30
AJaegerfungi, thanks!15:31
*** masayukig_ has quit IRC15:31
*** pblaho_ has quit IRC15:33
*** pblaho_ has joined #openstack-infra15:33
dimsanteaya: ack15:34
*** sdake has quit IRC15:34
*** masayukig has joined #openstack-infra15:34
jeblairfungi: i have found a console log difference between a host with EOFError and one without15:34
fungioh! i've not had tons of luck with nova console yet15:35
jeblairfungi: http://paste.openstack.org/show/193593/15:35
jeblairfungi: working node on top, failing node on bottom15:36
fungijeblair: nice! theory is lack of entropy at boot, no hostkey generated15:36
*** ayoung has joined #openstack-infra15:36
fungialso why is /etc/sysconfig/network-scripts/ifcfg-* missing?15:37
fungioh, it does that on both15:37
*** gyee has joined #openstack-infra15:37
fungiright, not red hat15:37
*** reed has joined #openstack-infra15:37
AJaegerjeblair, fungi, anteaya: My understanding of a discussion here on IRC some days ago (don't remember when exactly) is that we prefer if projects have a single lint target. In that spirit, I merged for project-config the two targets bashate and pep8 in patch https://review.openstack.org/164531 - and confused anteaya with it. Could you confirm whether this is the right direction, please?15:37
jeblairAJaeger: i'll have to look into that later15:38
mordredjeblair: you know how to open hp support tickets, yeah?15:38
jeblairmordred: through the web ui?15:38
mordredk15:39
AJaegerjeblair, ok, thanks. There's no urgency - just like to get direction15:39
jeblairmordred: and then go check infra-root's mailbox because they send the response there and forgot to copy-paste it into the support ticket tracker?15:39
*** Bsony has quit IRC15:39
anteayaAJaeger: yeah we are trying to get more workers available for jobs right now15:39
clarkbI prefered to email for that reason15:40
jeblairat least, that's the most recent procedure15:40
clarkbsupport@hpcloud.com iirc15:40
openstackgerritMerged openstack-infra/project-config: Fix failure detection in irc access check  https://review.openstack.org/16550815:40
anteayaAJaeger: perhaps we can discuss it tomorrow and thanks for bring it up15:40
AJaegeranteaya, feel free to discuss when you have time and give me a pointer if I'm not around ;) I can read the logs on eavesdrop then15:40
anteayayou can bring it up, it is your issue, just another time is better15:41
*** unicell1 has quit IRC15:41
*** unicell has joined #openstack-infra15:42
jeblairfungi: i've confirmed the eoferror and working nodes i looked at were built from the same image in the same region, and within less than 1 second of each other15:43
fungijeblair: was there any console output earlier during boot about host key generation breaking?15:43
*** Trozz has quit IRC15:46
*** bhuvan has joined #openstack-infra15:46
*** Trozz has joined #openstack-infra15:47
jeblairfungi: it's missing more things like Your identification has been saved in /etc/ssh/ssh_host_rsa_key.15:47
jeblairGenerating public/private rsa key pair.15:47
jeblairfungi: check ~corvus/okaylog and ~corvus/eoflog on puppetmaster15:47
*** dangers_away is now known as dangers15:47
fungijeblair: awesome, thanks15:47
fungii wonder if it's a filesystem problem15:47
jeblairfungi: 2015-03-19 14:57:14,832 - util.py[WARNING]: Failed fetching metadata from url http://169.254.169.254/2009-04-04/meta-data15:48
jeblairfungi: that seems to be where it starts to diverge15:49
fungiwe rely on the metadata server after all?15:49
*** marun has joined #openstack-infra15:49
*** bdpayne has joined #openstack-infra15:49
fungifor some reason i thought we'd managed to not need it15:49
jeblairme too,  but it seems somehow tied into cloud-init's setting up of ssh keys?15:49
jeblairmordred, SpamapS: ^ can you shed light on this?15:50
mordredthis is hp or rax?15:50
fungihp15:50
mordredyeah - that's how keypair extension works15:50
mordredthis is why I want to not install cloud-init15:50
mordredand just bake in keys15:50
mordredI am very sorry that is taking so long15:51
fungiwell, it seems to be the underlying reason for a majority of boot failures in hpcloud15:51
*** ashleighfarnham has joined #openstack-infra15:51
*** tjones1 has joined #openstack-infra15:51
jeblair397 in the past few hours15:51
clarkbcan we config drive insyead?15:51
mordredas a stop-gap we could switch to config-drive and we could change our base image cloud-init config to ignore ec2 metadata service15:51
fungii wonder if cloud-init couldn't just be coaxed into retrying that for longer as a stopgap?15:52
fungior that15:52
mordredor - since it's hp - we _could_ just uninstall cloud-init in the base image and bake a nodepool key ni to it15:52
clarkbfungi metadata is likely notnroutable15:52
clarkbfungi so timeouts wont help15:52
anteayawhat changed, something changed about 24 hours ago to result in these errors15:52
mordredwe don't have to fix the problem for real yet - uninstalling cloud-init will not break rax15:52
fungiclarkb: it's linklocal. that's never routable. you mean it's not responding?15:52
mordredyou want I should make a patch to do that real quick?15:53
clarkbfungi no the way the metadata service works is it gets nated amd must be routable15:53
clarkbfungi my experience is this is fragile and breaks a lot15:53
clarkbmordred does nodepool support that?15:53
mordredclarkb: it doesn't need to15:54
fungiclarkb: where do those nat rules live? something has to arp reply for that linklocal address on the local broadcast segment obviously15:54
clarkbmordred initial boot relies on cloud init iirc15:54
*** dhritishikhar has joined #openstack-infra15:54
mordredclarkb: no15:54
clarkbfungi nova/neutron set them up15:54
mordredI mean, image build step will wonky15:54
mordredbut that doesn't affect node throughput15:54
clarkbfungi it lives on the router device iirc15:54
mordredlemme make a patch - it'll be 5 minutes15:54
fungiclarkb: so nova/neutron is failing to present that interface or a nat rule attached to it into the bridge to which the instance gets attached?15:55
clarkbmordred if image build doesnt work then you get no images with your fix...15:55
rhe00I am experiencing an apparent race condition between nodepool and jenkins. Job #91 finishes up and nodepool receives the onCompleted and onFinalized notification, before nodepool has removed the node from the jenkins pool, the node gets picked up by Jenkins for a new job (#92). #92 quickly fails because the test can't grab the FC devices (the same node is still holding on to them). This all repeats for job #93.15:55
jeblairclarkb, mordred: i don't think zmq from jenkins04 is working15:55
mordredjeblair: awesome15:55
mordredclarkb: we can boot images _some_ times15:55
clarkbfungi my experience was that it was presented ok but packet return path is easily borked because they only DNAT15:55
mordredclarkb: all we need to do boot is one node during image build15:55
mordredfor snapshot images15:55
mordredonce we do that, all the rest of the nodes booted from taht one will be fine15:56
fungioh lovely, and now freenode is falling over on me15:56
mordredI think it's worth a try15:56
clarkbmordred ok15:56
anteayafungi: :(15:56
*** gampel has quit IRC15:56
rhe00does anyone have any ideas what might be going on? I am about to dig into nodepool and see if I can close the race window.15:56
anteayarhe00: at the moment we are all focused on getting working nodes for our zuul queues15:57
openstackgerritAdrian Moisey proposed openstack-infra/jenkins-job-builder: Update the Nofifications plugin to support 1.9  https://review.openstack.org/16578515:57
anteayarhe00: we might have more time to help in a few hours15:57
fungirhe00: we set our jobs to pass an OFFLINE_NODE_WHEN_COMPLETE parameter for the job with a value of 1 so that the gearman plugin offlines the node in jenkins before it can pick up a new job15:58
rhe00anteaya: ok, I can wait. I will dig some more on my own. thanks!15:58
anteayarhe00: thanks for understanding15:58
rhe00fungi: ok, I will look into that.15:58
fungii've got jenkins04 in prepare for shutdown, and will clean up its slave cruft once i can safely stop it15:59
zaromorning15:59
openstackgerritJerry Zhao proposed openstack-infra/nodepool: add option to use ipv6 for image update and node launching  https://review.openstack.org/15617815:59
fungirhe00: we use a zuul parameter function to add that to our jobs by default15:59
anteayamorning zaro15:59
clarkbjeblair its likely the sockets werent closed on the other ends so they never reconnected15:59
jeblairclarkb: i think so15:59
clarkbwe will probably need to restart nodepool and logstash client16:00
jeblairfungi, clarkb, mordred: i suspect nodepool wil need to be restarted to correct the zmq error with 0416:00
jeblairya16:00
JayFmordred: I don't have magical powers, but I do know an array of people to prod if something is wrong16:00
jeblair04 is in shutdown mode now, it should prevent the problem from getting worse.  i will delete more 04 nodes16:00
*** esker has joined #openstack-infra16:00
fungijeblair: i can restart nodepoold once i have jenkins04 stopped16:01
JayFmordred: nevermind, I see jroll "resolved" your issue16:01
*** rhsu has joined #openstack-infra16:01
*** baoli has quit IRC16:02
clarkbwe can try killing the tcp connection on the clients but iirc that never works well16:03
*** baoli has joined #openstack-infra16:03
openstackgerritMonty Taylor proposed openstack-infra/system-config: Get rid of cloud-init on single-use-slaves  https://review.openstack.org/16588916:03
mordredclarkb, fungi, jeblair: ^^16:03
mordredthat's what I'm thinking to deal with the metadata service instability16:03
*** koolhead17 has joined #openstack-infra16:03
devanandagertty has started dumping urllib3 warnings this morning. making it somewhat non functional ...16:04
clarkbmordred that will break because the distros want you to use not root16:04
*** LinuxJed_ has joined #openstack-infra16:04
clarkbmordred I thibk we ssh as jenkins anyways so maybe unnecessary16:04
mordredclarkb: oh right. so I need to add it to each of the distro users16:04
mordredclarkb: when we boot, don't we do an ssh in to test things?16:05
mordredas the nodepool user16:05
clarkbmordred or maybe we dont need it at all if nodepool sshs in as jenkins16:05
mordredto the distro user16:05
mordredit does not16:05
mordredit ssh's in as the nodepool user to the appropriate distro user16:05
mordredmodified patch coming16:05
clarkbthen ya it needs to be the distro user16:05
jeblairi'm deleting all nodes on 04 which are not running a job16:05
fungithanks16:06
fungithat should help16:06
*** ashleighfarnham has quit IRC16:06
*** tnovacik has joined #openstack-infra16:07
*** Somay has joined #openstack-infra16:07
*** jcoufal_ has quit IRC16:08
*** LinuxJedi has quit IRC16:08
mordredtaht's ubuntu on ubuntu, cloud-user on centos and fedora on fedora?16:08
clarkbsounds right, there is a list in nodepool itself16:09
mordredit's not mapped - we just try each of them :)16:09
mordredoh - centos is root16:09
openstackgerritKhai Do proposed openstack-infra/puppet-gerrit: Make mysql clients assume utf8 character encoding  https://review.openstack.org/16426416:10
zarofungi: question about this ^16:10
openstackgerritLars Kellogg-Stedman proposed openstack-infra/project-config: Add "heat-coe-templates" project  https://review.openstack.org/16480616:10
openstackgerritMonty Taylor proposed openstack-infra/system-config: Get rid of cloud-init on single-use-slaves  https://review.openstack.org/16588916:11
*** _nadya_ has quit IRC16:11
zarofungi: do you think the same code to manage my.cnf should also be duplicated in gerrit::mysql ??16:11
mordredclarkb: ^^ that should fix it16:11
*** belmoreira has quit IRC16:12
fungizaro: well, i'm not sure that ! defined(Class['gerrit::mysql']) will actually ensure that gerrit::mysql isn't used, just that it hasn't been defined before that gets tested. but someone with better puppet mojo should confirm16:12
clarkbmordred the linter may not like the extra whitespace but other than that it looks good16:12
SpamapSjeblair: just now sat down. Reading backscroll.16:12
openstackgerritMonty Taylor proposed openstack-infra/system-config: Get rid of cloud-init on single-use-slaves  https://review.openstack.org/16588916:12
mordredfungi, jeblair, clarkb: ^^ whitespace fixed16:13
mordredI mean, given the state of the queue, we may need ninja-merge that16:13
zaronibalizer: I've adding refreshonly and subscribe to this https://review.openstack.org/#/c/165145/6/modules/openstack_project/manifests/gerrit.pp16:13
clarkbmordred ya16:13
mordredso I'd love to get eyes on it16:13
fungizaro: as far as doing something similar in the gerrit module's mysql.pp, if you want to do that you probably need to implement it via the mysqldb::config class (i think that's what it's called)16:13
zaronibalizer: was wondering if i should remove some of the requires?16:13
SpamapSFYI I believe the reason HP's ec2 metadata service fails has something to do with conntrack filling up on compute hosts or neutron-metadata-agent nodes. I forget which.16:14
clarkbmordred the package is the same name on centos and fedora too?16:14
mordredclarkb: checking16:14
anteayamordred: you don't need commas after 'fedora' 'ubuntu' and 'root'?16:14
mordredanteaya: nope - it's a case statement, not a parameter list ... but good eye16:14
anteayaokay16:14
mordredclarkb: it's the same on centos - I'm not sure I've got a fedora lying aroudn ...16:15
mordredoh - I can check yum in a docker image16:15
*** scheuran has quit IRC16:15
SpamapSKiall: ^^ would you concur, that the reason HP Cloud's ec2 metadata stops responding is conntrack related?16:15
fungidevananda: known issue. you can either roll back to an earlier urllib3, install requests[security] or apply the proposed gertty change to silence that warning16:15
zarofungi: hmm, is that an existing class?  i can't seem to find a config.pp16:15
*** ildikov has quit IRC16:15
fungizaro: it's not our module16:15
*** ildikov has joined #openstack-infra16:15
KiallSpamapS: I think there might be another reason behind that ..16:15
devanandafungi: great. thanks16:16
*** unicell has quit IRC16:16
mordredSpamapS: enjoy my most recent patch :)16:16
mordredclarkb: yes - also cloud-init on fedora16:16
fungizaro: aha, it's called mysql::server::config but probably the parameter you want is exposed through mysql::server https://forge.puppetlabs.com/puppetlabs/mysql#customizing-server-options16:16
*** mpavone has quit IRC16:18
SpamapSmordred: to which thing?16:18
mordredclarkb: I got the public key value via:16:18
mordrednodepool@nodepool:~$ ssh-keygen -y -f ~/.ssh/id_rsa16:18
mordredclarkb: fwiw16:18
mordredSpamapS: https://review.openstack.org/#/c/165889/16:19
SpamapSmordred: oh, so why aren't we just switching to config drive?16:19
openstackgerritMax Rydahl Andersen proposed openstack-infra/jenkins-job-builder: Add support for 'raw' to inject direct xml to job configs.  https://review.openstack.org/16015816:19
mordredSpamapS: because that would involve adding features to nodepool, and we'd STILL have to make a system-cofig change to configure cloud-init to skip ec2 metadata16:20
SpamapSmordred: skipping ec2 metadata would just speed things up a bit16:20
mordredSpamapS: whereas, the only thing we use it for on these hosts is getting a known ssh key on them16:20
SpamapSmordred: it would work .. just would have a 2 minute boot delay16:20
mordredSpamapS: so, really, that patch above solves everything :)16:20
mordredand removes one more piece of fragility16:20
*** rhsu has quit IRC16:20
SpamapSmordred: sure, I think it's fine. It does mean our images are now _OUR_ images. But that was always true really, nobody's using them stock right?16:21
zarofungi: from what i've gathered defined('gerrit::mysql') check that the class has been defined. while defined(Class['gerrit::mysql']) checks whether it has been declared.16:21
zarohttps://docs.puppetlabs.com/references/latest/function.html#defined16:21
*** dboik_ has quit IRC16:21
SpamapSBuilding in SSH keys is now things worked on EC2 for a lot of people too before cloud-init.16:21
mordredSpamapS: that's right16:21
SpamapSBecause fetching bits from ec2 metadata was poorly understood.16:21
*** pblaho__ has joined #openstack-infra16:22
*** amotoki has quit IRC16:22
*** dboik has joined #openstack-infra16:22
*** ddieterly has quit IRC16:22
mordredyup16:22
clarkbSpamapS I use them stock16:22
clarkbbut I have the benefit of having my key baked in :)16:22
mordredclarkb: jeblair is busy, but says he's ++ on 86ing cloud-init ... fungi - opinion?16:22
mordredclarkb: :)16:22
*** ddieterly has joined #openstack-infra16:22
mordredclarkb: honestly, right now, I care more about us and less about people who might reuse this - we can do better later16:22
clarkbmordred ya16:22
*** ihrachyshka has quit IRC16:22
jeblairya, sorry :(16:23
fungiyep, wfm16:23
mordredgreat. I'm going to ninja approve it16:23
larsksHello all.  I'd like to add a project to stackforge that will run a commit check using tox, but *not* pep8 (because it's not python).  What's the correct way to set this up?16:23
fungimordred: what's with the require => File['/root/.ssh'] there?16:23
SpamapSlarsks: IIRC pep8 is in python-jobs16:24
mordredfungi: THANK YOU16:24
SpamapSlarsks: so don't list python-jobs16:24
larsksSpamapS: Right.  But I'm not entirely clear how to add a *new* job.16:24
*** radez is now known as radez_g0n316:25
*** pblaho_ has quit IRC16:25
openstackgerritMonty Taylor proposed openstack-infra/system-config: Get rid of cloud-init on single-use-slaves  https://review.openstack.org/16588916:26
mordredfungi, clarkb: ^^16:26
*** emagana has joined #openstack-infra16:26
*** teran_ has quit IRC16:26
SpamapSlarsks: maybe look at how devstack.yaml is done in project-config/jenkins/jobs16:27
larsksSpamapS: I'll take a look...thanks.16:27
clarkbmordred lgtm16:27
openstackgerritMerged openstack-infra/system-config: Get rid of cloud-init on single-use-slaves  https://review.openstack.org/16588916:27
mordredok. now to kick off a new image ... I'm going to start with one in hp16:28
mordredjust to see16:28
SpamapSdo you have to put the private key in the secret place for infra root only?16:28
mordredSpamapS: that is the public key for the already-existing private key for the nodepool user16:28
SpamapSI assume there's puppet code that just references variables that aren't set in the normal hiera?16:28
* SpamapS is using words he's not entirely comfortable with16:28
SpamapSmordred: OH16:28
mordredSpamapS: all of the puppet code that references things in the secret hiera is in the hiera() calls16:29
SpamapSbut that is still true yes?16:29
mordredyes16:29
SpamapSok16:29
* SpamapS stops sweating16:29
*** ghostpl_ has quit IRC16:29
mordredSpamapS: if you look in manifest/site.pp, you'll see:16:29
mordred    nodepool_ssh_private_key => hiera('jenkins_ssh_private_key_contents', 'XXX'),16:29
mordredclarkb: heh. ^^ you were half-right before16:30
mordredit is the jenkins KEY that gets used16:30
mordred:)16:30
ttxfungi: is there a way to remove the stuck jobs from zuul check queue display ?16:30
anteayattx by uploading a new commit message16:30
anteayawe discussed that16:30
fungittx: yeah, new patchset to them. are they targeted for the freeze?16:30
fungior just unsightly?16:30
anteayabut figured gettin gmore workers was a higher priority than kicking them back in and having them use resources16:30
mordredclarkb: root@nodepool:~# nodepool image-update hpcloud-b5 nodepool-trusty16:31
mordredis what I'm doing right now16:31
ttxfungi: not necessarily, it's just that people tend to scream "OMG please tag K3 Monday check queue is BORK"16:31
ttxeven if it isn't16:31
anteayait is very slow16:31
anteayawhich we are working on but getting new workers16:31
anteayabut k-3 isn't a surprise16:31
mordredclarkb: I lied: root@nodepool:~# nodepool image-update hpcloud-b5 devstack-trusty16:32
anteayaand some ptls were willing to merge stuff on the weekend to avoid problems16:32
anteayamikal mentioned it at last week's nova meeting16:32
*** esker has quit IRC16:32
*** dmorita has joined #openstack-infra16:32
*** Bsony has joined #openstack-infra16:32
clarkbmordred ok that should do dib build then upload, if it works dont image-update that image again, instead just image-upload it to the other providers16:32
mordredclarkb: ++16:33
mordredclarkb: nice thing - since this is the dib nodes, we don't even have to wait on a template node to be able to spin up against the metadata borking16:33
*** tsg_ has joined #openstack-infra16:33
*** tqtran has joined #openstack-infra16:34
*** sarob has quit IRC16:34
openstackgerritLars Kellogg-Stedman proposed openstack-infra/project-config: Add "heat-coe-templates" project  https://review.openstack.org/16480616:34
*** ghostpl_ has joined #openstack-infra16:34
anteayaI'm going to be afk for a bit16:35
anteayaback later16:36
pleia2good morning16:36
*** _nadya_ has joined #openstack-infra16:37
jeblairmordred: "nodepool-trusty" or "devstack-trusty" ?16:37
mordredjeblair: the second16:37
*** yfried has quit IRC16:37
jeblaircool, that'll be better16:37
mordredjeblair: turns out, we don't have any images called nodepool-trusty16:37
mordredjeblair: luckily - nodepool refused to operate :)16:38
nibalizerzaro: shrug16:39
greghaynesclarkb: did you see I got https://review.openstack.org/#/c/165682/ worked out16:39
mordredclarkb: STAB STAB STAB16:39
nibalizerrequires never hurt anybody16:39
greghaynesclarkb: for doing monkeypatching of fakes16:39
greghaynesended up working nicely16:39
*** ayoung has quit IRC16:39
jeblairdeleting another round of jenkins04 nodes16:39
mordredclarkb: in our dib images, we don't have an ubuntu user, apparently16:39
zaronibalizer: cool, i'll take that :)16:40
mordredSpamapS: is that expected in images built from dib on the ubuntu element?16:40
clarkbmordred lol16:40
jeblairmordred: i thought that was a desired feature of dib images?16:40
*** sandywalsh has joined #openstack-infra16:40
mordredjeblair: yes, I believe so - it just makes this puppet mildly harder16:40
clarkbgreghaynes I noted you pushed a lot of code while I watched day9 play grim fandango then I fell asleep16:41
fungimordred: well, only until we're all-dib there ;)16:41
SpamapSmordred: yes16:41
greghaynesmordred: wierd, I thought we got that for free with ubuntu cloudimg16:41
fungithen it makes the puppet much, much easier16:41
SpamapSmordred: cloud-init makes the user16:41
mordredfungi: yes. :)16:41
greghaynesoh!16:41
SpamapSmordred: should be true on all OS's16:41
greghayneslolol16:41
mordredSpamapS: hahahahaahahah16:41
mordredSpamapS: orly16:41
mordred?16:41
clarkbSpamapS wait16:41
SpamapSsince cloud-init 0.716:41
mordredSpamapS: so - does the "you can't log in with the root user" come from cloud-init too?16:41
fungiWAT16:41
clarkbthis isnt a distro sillyness?16:41
clarkbits cloud init?16:41
clarkbwhat are they smoking16:41
mordredSpamapS: like, if I remove cloud-init - can I just put the key in /root/ ?16:42
mordredand have it work?16:42
SpamapSmordred: No I think they just smash that into sshd's config16:42
fungimordred: i believe so. i patch that out in my cloudinit config16:42
SpamapSduring image build16:42
clarkbwow16:42
fungiSpamapS: it has an option for it at least16:42
clarkbI dont16:42
clarkbeven16:42
mordredok. follow up patch coming16:42
SpamapSor did they add that too?16:42
clarkbwat16:42
mordredclarkb: see - this is why we're getting out of the cloud-init business16:42
SpamapSWouldn't surprise me if they put all cloud image intelligence into cloudinit16:42
mordredSTOP TRYING TO BE CLEVER16:42
greghaynesWhy dont youall bake in a user you know the username of?16:43
mordredwe're installing sshd_config files with puppet already aren't we?16:43
greghaynesI love suggesting this16:43
SpamapSmordred: I believe this makes the cloud images more consumable in different environments.16:43
mordredgreghaynes: it's called "root"16:43
greghaynesbut everyone hates doing that for some reason16:43
jeblairgreghaynes: we could call it... root16:43
mordredit's amazing16:43
mordredit's a user that has root access16:43
fungiSpamapS: rackspace installs a /etc/cloud/cloud.cfg.d/10_rackspace.cfg which turns off passwordauthentication and permitrootlogin for example16:43
*** jordanP has joined #openstack-infra16:43
greghayneshah, fine16:43
SpamapSmordred: don't hate on something people do actually like. You're just made because it doesn't work well on hpcloud. ;)16:43
*** unicell has joined #openstack-infra16:43
greghaynesthat works too :)16:43
SpamapSor rax16:43
SpamapS;)16:43
mordredSpamapS: or, well, for any of our use cases16:43
mordredSpamapS: but I heard you16:43
mordredhear16:43
SpamapSEC2 is the 800lb. gorilla, and most of what cloud-init does is at the request of EC2. The rest is at the request of harlowja_away16:44
*** BobBall is now known as BobBall_AWOL16:44
jeblairwhy would someone who isn't ec2... nevermind, i'm falling into the same trap16:44
openstackgerritDouglas Mendizábal proposed openstack-infra/project-config: Add Python 2.6 jobs to Castellan  https://review.openstack.org/16591316:44
clarkbyes its amazing that every install comes with a user preinstalled to do exactly this16:44
jordanPhi guys. I am in charge of a third party CI. I am using jenkins + gerrit trigger (not zuuul). I have a formating issue with how to report back the results. Can someone help me ? Do you know if someone is using successfully gerrit trigger ?16:44
greghaynesclarkb: hah about day9, I think I was actually netsplit from openstackbot when I was pushing that and assume no one was seeing me sneak in all those patches16:45
clarkbthen for some reason everyone has decided to not use that user16:45
SpamapSI'll end my cloud-init sermon with this: The dream is that one root FS can be used on any cloud, for any user, with userdata being the only thing that customizes the image.16:46
fungiclarkb: because being able to log in as root is bad. because it's a predictable name which eases brute-forcing. so replace it with an equally-guessable name16:46
SpamapSThe reality is nobody cares about that except distro vendors. :)16:46
cineramaanyone here have some historical knowledge about openstackid? was messing with it the other day to get logins going on zanata16:46
openstackgerritMonty Taylor proposed openstack-infra/system-config: Just install the key into the root user  https://review.openstack.org/16591416:46
clarkbSpamapS: using root as the default user does nothing to impede that goal16:46
mordredfungi, SpamapS, clarkb, jeblair: ^^16:46
fungiSpamapS: actually i care a lot about that. the problem is that the methods for getting the userdata to the instances is, well, not straightforward16:47
SpamapSclarkb: using root as the default user removes all chance at audit logging via sudo.16:47
SpamapSIt's a sane general default.16:47
clarkbmordred: sorry that my initial comment sent us down this path16:47
mordredclarkb: it's ok16:47
mordredSpamapS: I hear that - but in my experience so far - userdata via cloud init is WAY less useful than userdata via ssh post boot16:47
clarkbSpamapS: enabling sudo removes all chance at audit logging via sudo16:47
mordredall one needs is one very simple method to get an ssh key onto the host16:47
lucasagomeshi, if you have some time please take a look at https://review.openstack.org/#/c/163344/ it's adding a gate check for openstack/ironic-python-agent (non-voting) to test PXE drivers using the IPA ramdisk16:47
SpamapSmordred: becuase your userdata delivery methods suck. :)16:47
jeblairmordred, clarkb: so if this runs on a snapshot image, won't we still have an ubuntu user?16:47
SpamapSclarkb: eh?16:48
clarkbjeblair: yes, but hrm gah16:48
clarkbSpamapS: once you have root via sudo its game over regardless16:48
mordredjeblair: but if we install the key onto the root user16:48
mordredjeblair: then the nodepool user cycle should still find it16:48
mordredregardless of whetehr the ubuntu user exists16:48
SpamapSclarkb: the Ubuntu security team's thinking has always been that the system should be setup to ship auth/security logs elsewhere. So you should, in theory, at least get a 'sudo rm /etc/syslog/....' before it goes dark16:48
clarkbmordred: except that the initial cloud init would have broken our ability to ssh as root16:48
jeblairmordred: right, because we try root first16:48
mordredclarkb: but we install an sshd_config16:49
mordredwith puppet16:49
mordredso we will ahve fixed it16:49
clarkbmordred: gotcha16:49
*** hdd has joined #openstack-infra16:49
mordredclarkb: let me verify that we do install an sshd_config16:49
jeblairwe better :)16:49
SpamapSclarkb: so that's not exactly "game over" , that is "first battle goes to the person who gained access to the passwordless sudo user".. but you at least have a breadcrumb to that moment.16:49
mordredI know we do on our long-lived hosts16:50
*** thingee has joined #openstack-infra16:50
SpamapSFor the purposes of single use nodes, yes.. just root in baby.16:50
clarkbmordred: ya I think you are correct, the sshd config is applied universally16:50
openstackgerrityolanda.robla proposed openstack-infra/puppet-zuul: Add fetch_url and poll parameters to zuul.conf  https://review.openstack.org/16259116:50
jeblairSpamapS: (as an aside, if you want the breadcrumb with root logins, just have ssh log the key id of incoming users)16:50
mordredclarkb: I cannot find where we do it16:51
*** sandywalsh has quit IRC16:51
clarkbmordred: let me find it16:51
SpamapSjeblair: the theory is that you want regular interactive users to be trained to never shell as root, so they're used to sudo'ing.16:51
SpamapSjeblair: It's all theory.16:52
mordredSpamapS: sure. and I appluad that theory16:52
SpamapSI don't want it to turn into a security theory debate, just trying to explain why I think it's fine to force people through the sudo funnel.16:52
mordredI just think that a second shared full-root user like the ubuntu user is actually a bad idea16:52
*** dustins_ has joined #openstack-infra16:52
clarkbmordred: include ssh in template.pp I think16:52
clarkbmordred: openstack_project::template16:52
mordredawesome16:52
*** yamahata has joined #openstack-infra16:52
SpamapSmordred: well in theory 90% of what you want to do as a sysadmin is fine as a regular user. top, ps, ls, netstat, etc.16:53
clarkbSpamapS: maybe cloud init should have created a cloud-init user if that was the goal?16:53
mordredclarkb: oh - actually - we're going to need ...16:53
mordredblast16:53
clarkbSpamapS: using distro specific users provides zero benefit and a lot of work16:53
SpamapSclarkb: it did, but each distro decided it should be a marketing opportunity. :)16:53
jeblairmordred: i say "we better" install an sshd_config, because that was literally change 1 in gerrit ;)16:53
SpamapSI agree 100% that having the user be the distro name was short sighted.16:53
jeblairhttps://review.openstack.org/116:53
funginice!16:54
SpamapSeven worse was Fedora making it 'ec2-user'16:54
*** baoli has quit IRC16:54
mordredjeblair: we do have that with PermitRootLogin off16:54
mordredand16:54
mordredMatch host puppetmaster.openstack.org16:54
mordred    PermitRootLogin without-password16:54
clarkbmordred: ha!16:54
mordred:)16:54
mordredone sec ...16:54
fungiso, yeah, we'll need some tweaking16:54
*** baoli has joined #openstack-infra16:55
mordreddoes anybody know if a per-key ssh option will override that?16:55
clarkbhow hard would it be to just config drive instead?16:55
mordredharder16:55
clarkbmordred: we can't just set the image metadata in nodepool config?16:55
*** dustins has quit IRC16:55
*** ildikov has quit IRC16:55
clarkbmordred: iirc that was all we needed to do right (and one reason we added image metadata)16:55
mordredclarkb: it will be at LEAST as much work as this16:55
mordredbecause we'll ahve to configure the nodes to mount config-drive16:56
clarkbmordred: yes but thats a one line update to nodepool.yaml16:56
mordredI'm going to keep working on this16:56
mordredbecause cloud-init can suck it16:56
clarkbthen cloud init does the rest?16:56
mordredno it doesn't16:56
*** Ryan_Lane has joined #openstack-infra16:56
mordredcloud-init does not mount config-drive16:56
mordredbecause BONGHITS16:56
*** skolekonov has quit IRC16:56
*** gyee has quit IRC16:56
clarkbI thought it did16:57
SpamapSbut Ubuntu does?16:57
mordredyeah - me too16:57
mordredit dose not16:57
SpamapShrm16:57
jeblairmordred: silly question time -- are you focused on dib or snapshot images?16:57
mordredI promise16:57
mordredjeblair: both - I'm just doing this in puppet so it should hit both of them16:57
jeblairmordred: so nodepool does not ssh into anything other than jenkins on the dib path16:57
pleia2AJaeger: if you're still about, could use some confirmation about where we need zanata client installed, I suspect it's just our translations slave but I want to be sure16:57
mordredjeblair: oh - ok - so I can make this simpler then16:58
pleia2AJaeger: translations/proposals - it auto-submits changes to gerrit, like 75%+ translations pulled from transifex16:58
*** hyakuhei has joined #openstack-infra16:59
*** ihrachyshka has joined #openstack-infra17:00
fungiokay, jenkins04 just achieved quiescence, so stopping the jenkins service now and cleaning it up, then i'll restart nodepoold before i start jenkins again17:00
jeblairmordred: and on the snapshot path -- it only does that when creating a snapshot, before puppet is run (that ssh connection is so that it can run puppet)17:00
clarkbfungi: let me know when you do that and I will restart the logstash client too17:00
fungiclarkb: you bet17:01
*** Swami has joined #openstack-infra17:01
mordredjeblair: really? ok, then I'm confused as to why the metadata server is giving us grief17:01
*** arxcruz has quit IRC17:01
openstackgerritKhai Do proposed openstack-infra/puppet-gerrit: Make mysql clients assume utf8 character encoding  https://review.openstack.org/16426417:02
clarkbmordred: because cloud init is still enabled17:02
jeblairmordred: well, i mean cloud-init is run17:02
clarkbhow does disabling cloud init affect ssh host key generation?17:02
fungimordred: i believe it's because cloud init blanks the ssh host key and wants to recreate it so that it's unique, but goes sideways when it can't talk to the metadata server17:02
clarkbwill nodepool stop working because too many hosts with the same key?17:02
*** derekh has quit IRC17:02
fungiwould be good to verify though17:02
jeblairfungi: that seems reasonable17:03
mordredso, actually then, all we need to do is remove cloud-init17:03
mordredbecause we already have the key baked in that we care about17:03
jeblairmordred: correct17:03
mordredand it literally does nothing17:03
mordredawesome17:03
mordredthat's the easiest patch ever17:03
fungithat sounds likely anyway. likely enough to test17:03
*** unicell has quit IRC17:03
*** unicell1 has joined #openstack-infra17:03
*** lucasagomes has quit IRC17:03
SpamapSHonestly, if you're not going to replace the key that is in the fetchable image .. telnet ftw? ;)17:04
openstackgerritMonty Taylor proposed openstack-infra/system-config: Just remove cloud-init  https://review.openstack.org/16591417:04
mordredSpamapS: what?17:04
mordredSpamapS: that makes no sense at all17:04
mordredSpamapS: the key is a public key17:04
SpamapSthe key that is not being replaced is the private host key17:04
mordredoh - the one that provides no security at all because of how clouds work?17:05
mordredyeah17:05
mordredimportant to replace that17:05
fungiSpamapS: don't most sshd initscripts create the key at start if it's missing?17:05
SpamapSExactly> :)17:05
openstackgerrityolanda.robla proposed openstack-infra/project-config: Add stackforge/puppet-nscld  https://review.openstack.org/16592217:05
mordredsince I have no secure path to verify it17:05
jeblairSpamapS is right -- we will have identical private keys (which can be retrieved by a job that issues 'cat /etc/ssh/id_rsa...')17:05
SpamapSfungi: yes17:05
fungiso we could just delete the host keys in our images17:05
jeblairi am not sure what someone would do with that, but still...17:05
*** pcaruana has quit IRC17:05
fungias part of the image build process17:05
jeblairfungi: that sounds like it would work.17:05
SpamapSwith that one can mitm and decrypt passively sniffed packets with a full ssh session.17:06
mordredfungi: want me to put that into the puppet change above?17:06
*** sputnik13 has joined #openstack-infra17:06
SpamapSergo: telnet ftw :)17:06
SpamapSI'm half serious.17:06
fungimordred: yeah, that can't hurt. at least we'll find out where it works and where we need to add a boot-time shim to call ssh-keygen to do it17:06
jeblairyeah, this is probably not important for most of our single-use nodes today, but eventually, we will want to run important things on single-use nodes, so, we should probably fix it17:07
SpamapSmordred: there's a secure way to verify host keys, but it involves passing one in that you generate, and then throwing away that one and generating a new one.17:07
*** jistr has quit IRC17:07
mordredjeblair: you want to fix it with this patch or as a follow on17:07
SpamapSand that assumes nobody has rooted the box before you get to it to replace the one you passed in.17:07
jeblairmordred: this one17:07
mordredjeblair: k on it17:07
greghaynesSpamapS: I think the issue is still anyone can make a job to cat it right?17:07
*** amitgandhinz has quit IRC17:08
SpamapSgreghaynes: they'd be catting their own host's private key. It would be useless after that node was destroyed.17:08
clarkbSpamapS: and mostly useless in the interim since they already have access to that node17:08
greghaynesSpamapS: No, because you could still use it to mitm since itll verify fine17:08
fungiclarkb: i'm restarting nodepoold now17:08
SpamapSgreghaynes: you generate a small (1024bit) RSA key, pass it in via userdata, and let the box install that. Once you SSH in with that one, you rm that key, and regenerate the private host key, noting the public key.17:09
greghaynesSpamapS: you have to also revoke it and then its complicated17:09
*** harlowja_away is now known as harlowja_17:09
SpamapSclarkb: right17:09
SpamapSgreghaynes: You wouldn't put it in your normal known_hosts file. :)17:09
fungiokay, nodepoold is running again17:09
harlowja_SpamapS lol (just saw cloud-init comment)17:09
fungiand i've marked all the jenkins04 nodes delete17:09
SpamapSgreghaynes: that initial SSH would be in  known_hosts_for_this_one_time_only ..17:09
fungiand deleted them from the jenkins config17:10
greghaynesah, yea, that works17:10
SpamapSso the window for compromise relies on somebody intercepting that private key between the time you fed it to nova and the time you SSH in.17:10
fungijenkins service is coming back online on jenkins04 now17:10
openstackgerritMonty Taylor proposed openstack-infra/project-config: Remove ssh host keys during image build  https://review.openstack.org/16592417:10
mordredjeblair, fungi: ^^17:10
clarkbfungi: ok restarting the gearman client now17:10
SpamapSthis prevents mitm w/ arbitrary key. it does not prevent #allthethings.17:10
mordredclarkb, SpamapS: ^^ you too17:11
jeblairmordred: both lgtm17:11
greghaynesnice17:11
SpamapSmordred: +1'd17:11
*** sandywalsh has joined #openstack-infra17:12
clarkbmordred: see comment17:12
clarkbmordred: I don't think the dib element needs sudo but pretty sure snapshot does17:12
*** timcline has quit IRC17:12
mordred++17:12
*** nilasae has quit IRC17:13
cineramapleia2: i meant to ask you if we should try to move to the later wildfly17:13
openstackgerritMonty Taylor proposed openstack-infra/project-config: Remove ssh host keys during image build  https://review.openstack.org/16592417:13
mordredclarkb: nice catch17:13
clarkbmordred: I have hit that enough times myself :)17:13
*** Hal2 has joined #openstack-infra17:14
clarkbmordred: also reading cloud init docs and config drive docs you are correct17:14
*** Hal1 has quit IRC17:14
clarkbmordred: it appears you must mount the config drive yourself17:14
clarkbwhich makes me wonder why cloud init even exists17:14
mordredyes17:14
pleia2cinerama: we'll need to do some testing, not all of the wildfly version work just yet17:14
mordredclarkb: every. single. use. of cloud init is broken in some little tiny way17:14
mordredclarkb: drives me batty17:14
openstackgerritMerged openstack-infra/system-config: Just remove cloud-init  https://review.openstack.org/16591417:14
openstackgerritMerged openstack-infra/project-config: Remove ssh host keys during image build  https://review.openstack.org/16592417:15
harlowja_no mount yourself, lol17:15
SpamapSmordred: harlowja_ suggests that configdrive should be mounted by clout-init if Ubuntu's cloud image has it enabled.17:15
harlowja_liesss17:15
pleia2cinerama: we had been stuck on a particular one because of compatibility problems, but with the latest zanata release now is a good time to check17:15
harlowja_^ ya, not a lie what SpamapS  says, lol, lie that u need to mount it yourself :-P17:15
cineramapleia2: i mean as long as we haven't rolled out yet it might be worth a shot17:15
clarkbI have checked jenkins thread counts on our masters, no master appears to currently link threads17:15
mordredharlowja_: sure. except that  it does not work17:15
clarkbs/link/leak/17:15
mordredharlowja_: empirical testing has shown me that it does not17:15
pleia2cinerama: also fyi - like gerrit, we'll have a zanata-dev server as well once we go into production, so we'll have a space to work on upgrade testing in the future :)17:15
harlowja_mordred hmmm, works on yahoo cloud for a long time17:15
mordredso, I don't really know or care about the theory - in teh existing images on the public clouds I touch, it does not work17:16
cineramapleia2: party time in upgrade city17:16
mordredI'm sure it coudl be configured to17:16
clarkbbut 04 did have latest ssh-slaves plugin and still leaked so that may not be the fix, but 02, 05, and 07 all have that plugin version to and haven't leaked since so it is possible that this helps somewhat17:16
harlowja_maybe public clouds disabled it17:16
mordredbut why it would ever be configured NOT to is insane17:16
pleia2cinerama: (as much as I *love* our hacky vagrant/cloud testing right now ;))17:16
*** tonytan4ever has quit IRC17:16
*** dimtruck is now known as zz_dimtruck17:16
mordredand once again, a good idea being made useless by vendors17:16
*** timcline has joined #openstack-infra17:16
SpamapS./cloud.cfg.d/90_dpkg.cfg:datasource_list: [ NoCloud, ConfigDrive, OpenNebula, Azure, AltCloud, OVF, MAAS, GCE, OpenStack, CloudSigma, Ec2, CloudStack, None ]17:16
harlowja_mordred do u need a hug17:16
SpamapSmordred: ^^17:16
clarkbharlowja_: I would accept working cloud17:17
harlowja_lol17:17
*** claudiub has quit IRC17:17
harlowja_just a hug, jeez17:17
mordredI'm rebuilding devstack-trusty on hcloud-b5 again17:17
SpamapSmordred: thats from my instance on HPCloud East btw.17:17
harlowja_stop being so greedy, lol17:17
*** Bsony has quit IRC17:17
jeblairi do not expect working cloud so will cut my losses and accept a hug17:17
mordredSpamapS: have you tried booting an instance in hpcloud with config drive enabled and then logging in to see if it's mounted?17:17
mordredjeblair: ++17:17
harlowja_so then u may need to look @  /var//log/cloud-init.log17:17
SpamapSmordred: I ran the 'nova boot' command for that about 30s ago. ;)17:18
mordredharlowja_: that's hard to do if cloud-init has an error and you can't log in :)17:18
mordredSpamapS: and config drive is mounted?17:18
SpamapSmordred: it's not booted yet17:18
harlowja_mordred agreed17:18
mordredoh17:18
*** Hal2 has quit IRC17:18
mordredSpamapS: if it works for you, you and I are going to sit in a dark room and figure out why17:18
mordredbecause it's never worked for me :)17:18
harlowja_always worked for me, in fact all yahoo vms on openstack are using it, lol17:19
harlowja_but i can help trouble shoot it (since i'm one of the creators, lol)17:19
*** otter768 has joined #openstack-infra17:19
mordredheh17:19
*** Bsony has joined #openstack-infra17:19
harlowja_http://bazaar.launchpad.net/~cloud-init-dev/cloud-init/trunk/view/head:/cloudinit/sources/DataSourceConfigDrive.py#L73 (the magical mounting code)17:19
clarkbharlowja_: what version is that first in?17:20
fungiharlowja_: i wonder if ubuntu 12.04 cloud-init is too old17:20
harlowja_hmmm17:20
harlowja_there should be a version of it in that17:20
fungi0.6.317:20
* SpamapS drums fingers while instance schedules17:20
mordredSpamapS: good luck with that17:20
harlowja_let me download 0.6.3; forgot what was in it17:20
SpamapSat this point nova boot hasn't even come back17:21
SpamapSahh, only 90s to respond17:21
*** ashleighfarnham has joined #openstack-infra17:21
*** patrickeast has joined #openstack-infra17:21
*** xyang1 has joined #openstack-infra17:21
fungiSpamapS: welcome to hpcloud! 90s is a good response time from what i've seen17:22
harlowja_so 0.6.3 is pretty damn old17:22
harlowja_it may/may not work correctly, i don't think it has support for what nova is writing into it (but i'm not 100% sure)17:23
harlowja_it has the old config-drive v1 code17:23
fungiharlowja_: ubuntu 14.04 (the other ubuntu we run) has 0.7.5 at least17:23
harlowja_yup17:23
*** otter768 has quit IRC17:23
SpamapSCloud-init v. 0.7.5 finished at Thu, 19 Mar 2015 17:22:21 +0000. Datasource DataSourceConfigDriveNet [net,ver=2][source=/dev/sr0].  Up 50.63 seconds17:24
SpamapSmordred: ^17:24
harlowja_0.6.3 was before my huge refactor17:24
harlowja_0.7.0 include josh super sauce17:24
harlowja_0.7.0+17:24
harlowja_(which is why there is no 0.6.4, lol)17:24
* SpamapS watches harlowja_ flex17:24
clarkbharlowja_: ok so its likely will not work for us17:24
*** bhuvan has quit IRC17:24
harlowja_SpamapS oh ya17:25
harlowja_lol17:25
harlowja_i've seen people ask for it to be backported; but i don't think thats how canonical works17:25
SpamapSmordred: 2015-03-19 17:21:40,798 - util.py[DEBUG]: Running command ['mount', '-o', 'ro,sync', '/dev/sr0', '/tmp/tmpVPaVgX'] with allowed return codes [0] (shell=False, capture=True)17:25
harlowja_amazing :-P17:25
fungiJayF: and centos 6 has cloud-init 0.7.5 as well (or 0.7.4 in epel)17:25
fungier, harlowja_ ^17:26
fungisorry JayF17:26
harlowja_ya17:26
JayFIt's fine, It's good for me to know too17:26
JayFprobably means I should bump our downstream patched versino of cloud-init as well, heh17:26
SpamapSmordred: so what you may not like, is that it just mounts it _for its purposes_17:26
SpamapSmordred: but it leaves everything it found in /var/lib/cloud/instance17:26
mordredSpamapS: HEADDESK17:27
mordredSpamapS: STAB STAB STAB17:27
harlowja_lol17:27
SpamapSso you have the entirety of the data there.17:27
*** mwagner_lap has quit IRC17:27
harlowja_http://cloudinit.readthedocs.org/en/latest/topics/dir_layout.html17:27
openstackgerritValeriy Ponomaryov proposed openstack-infra/project-config: Add tempest + postgres job for manila  https://review.openstack.org/16593517:27
harlowja_and more....17:27
*** achanda has joined #openstack-infra17:28
SpamapSmordred: pretty sure you just want to hate. Perhaps you should go to Belgium and help them hate the Flemmish instead?17:28
fungiokay, so cloud-init mounts it, copies the data to a known stable location in the root filesystem and then unmounts it again. that's not so bad17:28
openstackgerritValeriy Ponomaryov proposed openstack-infra/project-config: Add tempest + postgres job for manila  https://review.openstack.org/16593517:28
mordredSpamapS: but I don't have it where config-drive17:28
mordredputs it17:28
mordredanyway17:28
SpamapSyou do17:28
harlowja_mordred hug, lol17:28
SpamapSits still there on /dev/sr017:28
clarkbSpamapS: that was a trusty node right?17:28
*** achanda has quit IRC17:28
SpamapSclarkb: aye17:28
clarkbSpamapS: we will need to check precise as well17:28
*** ominakov_ has quit IRC17:28
SpamapSDescription:Ubuntu 14.04.1 LTS17:28
*** xyang1 has quit IRC17:28
harlowja_http://bazaar.launchpad.net/~cloud-init-dev/cloud-init/trunk/view/head:/cloudinit/sources/DataSourceConfigDrive.py#L228 (not so magical find rountine)17:28
fungimaybe once we're able to stop supporting precise we can revisit17:28
clarkbsince precise is older cloud init and we need precise17:28
*** achanda has joined #openstack-infra17:28
SpamapSnova boot --config-drive 1 --key-name default --image 9d25fe2d-cf31-4b05-8c58-f238ec78e633 test-cdrive --flavor 10017:28
clarkbfungi: ya17:29
SpamapSin useast17:29
*** xyang1 has joined #openstack-infra17:29
clarkbSpamapS: how did you get that to work without network info?17:29
SpamapSclarkb: magic?17:29
*** wuhg has quit IRC17:29
clarkbSpamapS: unrelated its just my biggest annoyance booting nodes in hpcloud17:29
SpamapSI have a network already setup of course.17:29
SpamapSwhich is like o_O when you have to set it up17:30
*** e0ne has quit IRC17:30
*** dtantsur is now known as dtantsur|afk17:30
clarkbSpamapS: yes but even then you still have to provide a network to boot on17:30
SpamapSbut once I made it, I only have one, so hpcloud attaches it17:30
clarkbSpamapS: then you have to attach floating ip to get into the node17:30
*** ominakov has joined #openstack-infra17:30
SpamapSclarkb: No it will attach all the networks you have access to17:30
clarkboh ou have only one17:30
SpamapSor did they screw that up now?17:30
clarkbSpamapS: it errors for us an our many networks if you don't supply a specific network17:30
SpamapSis there a reason you have many networks?17:30
*** mestery_ has joined #openstack-infra17:31
clarkbSpamapS: yes, the router/networks can only handle about 100 nodes a piece17:31
SpamapSlurvely17:31
fungiSpamapS: neutron router instances don't scale well to our nova instance count17:31
fungialso even if they did, they're provisioned as a /2417:31
SpamapSDVR should solve that17:31
*** andreykurilin_ has joined #openstack-infra17:31
SpamapSDVR won't solve that17:31
fungiso we couldn't put more than ~253 nodes on one17:31
clarkbfungi: ya we can make bigger networks it just wasn't required since we were told ~100 nodes per router17:31
SpamapSbut you can make it a /8 if you want17:31
fungioh, if we have control to push that lower than /24 then cool17:32
SpamapSI don't know if they rolled out DVR in the public cloud tho17:32
clarkbSpamapS: why would DVR solve the scaling issue?17:32
fungithought they'd imposed a hard limit there but i must have misremembered17:32
clarkbSpamapS: aiui DVR is more similar to nova net multihost17:32
*** _nadya_ has quit IRC17:32
clarkbSpamapS: which is more about redundancy than performance17:32
clarkb(though you get some performance improvment because you are partitioned)17:32
fungijenkins04 is now looking good and happy17:33
*** mestery has quit IRC17:33
*** _nadya_ has joined #openstack-infra17:33
*** coolsvap is now known as coolsvap|afk17:33
SpamapSclarkb: IIRC, east/west gets faster, north/south still has to go through l3 agent.17:33
clarkbugh people need to stop using those terms17:33
SpamapSbecause you don't like them or because they're wrong?17:34
clarkbSpamapS: because the are non descriptive and tell me little17:34
clarkbnetworking has nothing to do with compass direction17:34
SpamapSclarkb: it has to do with the most common network layout diagram17:34
fungieast is east of west right? though i guess east is also west of west. damn you spherical coordinate system17:34
SpamapSnorth == out, south == in, east/west == amongst peers17:35
*** packet has joined #openstack-infra17:35
clarkbSpamapS: and for our use case that won't help anyting since we re prodominently pushing packets off network17:35
fungithat's such a marvellous example of cargo culting terminology. almost as great as the origin of "cloud"17:35
clarkbjenkins <-> slave <-> mirrors17:35
SpamapSfungi: In this particular diagram east of east thar be dragons.17:36
jrollI need a gif of 1s and 0s flying in random directions17:36
fungiSpamapS: ooh! hplcoud is deployed on a flat earth!17:36
SpamapSclarkb: yeah, agree.17:36
*** ildikov has joined #openstack-infra17:36
*** markus_z has quit IRC17:36
SpamapSfungi: explains a lot17:36
funginow i have proof for my spherical fallacy conspiracy17:36
*** ominakov_ has joined #openstack-infra17:36
*** ominakov has quit IRC17:36
*** claudiub has joined #openstack-infra17:37
*** amitgandhinz has joined #openstack-infra17:37
*** andreykurilin__ has joined #openstack-infra17:37
*** andreykurilin_ has quit IRC17:37
*** xyang1 has quit IRC17:37
greghaynesThe real question is how the east/west map into toroid networks17:37
SpamapSgreghaynes: mostly with a hammer to your head and some shrooms17:38
clarkbSpamapS: also why not describe it as l2 vs l3 communication?17:38
greghaynesclarkb: its not17:39
clarkbin any case I should go back to poking at jenkinses17:39
greghaynesits about moving up the tree17:39
SpamapSclarkb: because east/west can describe L3 segment hops too.17:39
*** garyh has quit IRC17:39
*** marcusvrn has quit IRC17:39
SpamapSclarkb: in the DVR case, if you're just sending to another 10.x range DVR has that route and doesn't need l3 agent, it just sends.17:39
clarkbSpamapS: what about the term "east/west" describes any of that?17:39
*** mestery_ is now known as mestery17:39
clarkbanyways I am just grumping, I find the terms to be silly as they do not describe what is going on in a meaningful way17:40
*** ominakov_ has quit IRC17:40
clarkbfungi: jenkins01 was restarted too right?17:40
*** bhuvan has joined #openstack-infra17:41
fungiclarkb: yep17:41
SpamapSclarkb: east/west is generally used to mean "doesn't go through the thing that connects north/south" .. In a big network, you'll have L2 segments behind L3 routers linked together as peers and then north of those the uplinks to the internet/WAN.17:41
SpamapSclarkb: I don't know a better short hand for this, but I'm open to suggestions.17:42
clarkbfungi: https://jenkins01.openstack.org/monitoring?part=graph&graph=threadCount it also leaked threads but isn't running the latest ssh-slaves plugin17:42
*** garyh has joined #openstack-infra17:42
*** amitgandhinz has quit IRC17:42
*** amitgandhinz has joined #openstack-infra17:43
*** ivar-lazzaro has joined #openstack-infra17:43
clarkbfungi: it did not leak as badly as 04 which is probably why you were able to more gracefully restart it17:43
*** yamamoto has quit IRC17:44
fungiclarkb: well, i caught 01 by manually looking for misbehaving servers. in the wake of 04 which mordred caught once it had fallen over to the point where it wasn't even responding to ssh any longer17:44
fungiso i have a feeling 01 would have gotten there itself in due time17:45
clarkbso, we can update ssh-slaves plugin everywhere simply because it appears to help but not fix the problem (05, 02 and 07 haven't leaked yet) but not sure what causes the leaks so that may just be wishful thinking17:45
*** ivar-lazzaro has quit IRC17:45
clarkbfungi: ya, the leak does not seem to be cleaned up17:45
clarkbfungi: I tested that on 05 by leaving it in shutdown mode for a day a few weeks back and the thread count did not fall17:46
fungifun17:46
clarkbmordred: how is that image build going?17:46
*** andreykurilin__ has quit IRC17:46
clarkbzaro: ^ any other ideas on the thread leak?17:46
clarkbzaro: that patch to the ssh-slaves plugin looked so promising17:46
*** tonytan4ever has joined #openstack-infra17:47
*** ivar-lazzaro has joined #openstack-infra17:47
*** crc32 has joined #openstack-infra17:47
*** hyakuhei has quit IRC17:49
mordredclarkb: reading logs17:50
*** ayoung has joined #openstack-infra17:50
mordredException: DIB failed creating /opt/nodepool_dib/devstack-trusty-1426785436.qcow217:51
*** tonytan4ever has quit IRC17:51
*** dustins_ has quit IRC17:52
*** zz_dimtruck is now known as dimtruck17:52
*** Somay has quit IRC17:53
*** ghostpl_ has quit IRC17:53
*** Somay has joined #openstack-infra17:53
mordredclarkb: except it seems to have succeeded17:54
clarkbmordred: that doesn't end up in the image.log for me to read right? because it was a command not the daemon17:55
*** SumitNaiksatam has joined #openstack-infra17:55
*** HeOS has quit IRC17:56
mordredyah17:56
mordred2015-03-19 17:41:15,623 INFO nodepool.DiskImageUpdater: Image devstack-trusty-1426786672.template.openstack.org in hpcloud-b5 is ready17:56
*** tonytan4ever has joined #openstack-infra17:57
mordredclarkb: http://paste.openstack.org/show/193643/17:58
sdaguefyi, the ubuntu ssl package updates seem to have just surfaced in the mirrors17:58
clarkbmordred: so it uploaded the previous image I think17:59
clarkbmordred: I would do image-build not image-update now17:59
clarkbmordred: but look further back for where dib failed17:59
*** reed has quit IRC17:59
mordredclarkb: look at the top of that18:00
greghaynesmordred: that looks like dib cleanup at the top18:00
mordredclarkb: oh - wait - no, not the rm -rf18:00
mordred2015-03-19 17:37:51,310 INFO nodepool.image.build.devstack-trusty: qemu-img: error while compressing sector 13184128: No space left on device18:00
mordredthere it is18:00
clarkbmordred: you need to go back, this is one problem with dib, something will fail about 1000 log lines earlier before you get the exit one18:00
clarkbmordred: :(18:00
greghaynesthar ya go18:00
*** jlanoux has quit IRC18:00
mordred/dev/xvde2       72G   65G  3.2G  96% /opt18:01
mordredgreghaynes: I havea  bunch of devstack-fedora21-dib-1426774571.d directories18:02
mordredthose are all leftovers from failed buidls, right?18:02
clarkbmordred: those are fallout from f21 image builds failing Ithink18:02
clarkbdib doesn't cleanup those dirs after failure18:02
*** bhunter71 has quit IRC18:02
*** boris-42 has quit IRC18:02
greghaynesin what locatoin?18:02
clarkbbut they are small18:02
clarkbgreghaynes: in the image output dir18:02
clarkbmordred: I think adding the f21 image pushed our little disk over the limit18:03
fungisdague: thanks for the heads up. i poured over the list of vulnerabilities today and it's at worst remote denial of service for some ssl-enabled services. i'm okay with letting them get applied through our usual automatic channels18:03
greghaynesah, youre making unique image names18:03
*** ZZelle_ has joined #openstack-infra18:03
clarkbgreghaynes: yes18:03
greghaynesyes, those arent cleaned18:03
SpamapSpython needs a mode where it ignores any .pyc's that don't have their supporting .py18:03
*** _nadya_ has quit IRC18:03
*** enikanorov has joined #openstack-infra18:03
greghaynesI dont see it because I tend to reuse the same image name18:03
greghaynesI think its just manifests18:03
clarkbgreghaynes: yup18:03
SpamapSThe amount of time I've spent this week chasing ghosts then running 'find . -name '*.pyc' -print0 | xargs -0 rm' is stupid18:03
clarkbits not a significant amount of disk space18:04
clarkbmordred: we can probably get rid of the precise-dib and trusty-dib images18:04
greghayneshuh, looks like for some reason the dpkg element depends on manifests18:05
mordredclarkb: kk18:05
greghaynesthat should probably just be removed18:05
clarkbmordred: I will nodepool image-delete the two older ones of that image18:05
clarkbmordred: that should give you enough to get a new image for now18:05
clarkbmordred: also you didn't sudo -H -u nodepool so you will need to fix the dib cache and all that18:06
*** enikanorov__ has quit IRC18:06
SpamapSgreghaynes: probably just a bug that got missed.18:06
mordredI will now sudo -H -u nodepool18:06
clarkbmordred: first you need to fix it18:06
*** gyee has joined #openstack-infra18:06
clarkbmordred: I need to remember what the fix is18:06
mordredclarkb: its chown -R nodepool:nodepool /opt/dib_cache now?18:06
mordredno?18:06
clarkbmordred: ya that looks right18:07
mordreddoing that18:07
clarkbmordred: also chown the -new or maybe rm that image in /opt/nodepool_dib18:07
clarkbmordred: I think that is your failed image18:07
SpamapSgreghaynes: 4c467964 .. probably just need to drop a file in environment.d for manifests, and then make dpkg inspect whether it is set or not.18:08
greghaynesSpamapS: yep18:08
SpamapSlooks like source-repositories also deps on it but could use that method too18:08
* SpamapS was never really fully onboard with the manifests plan but wanted to get out of their way because it seemed important to the people working on it.18:09
clarkbmordred: you have 16GB of disk, should be enough to build that image18:09
greghaynesYea, its hard to tell a user with a clear use case no18:09
clarkbmordred: I ran sudo -H -u nodepool nodepool dib-image-delete devstack-*-dib essentially18:09
*** sdake_ has quit IRC18:09
*** sdake has joined #openstack-infra18:11
*** coolsvap|afk has quit IRC18:11
*** crc32 has quit IRC18:11
fungiwe've had a steady (and steep) climb in building nodes corresponding to clearing up tons of delete nodes. not sure if this is something we've triggered or something improving in hpcloud18:12
clarkbfungi: I don't think we have any of the fixes in yet other than having unstuck some jenkins18:13
fungihrm, or things getting worse in hpcloud? we have 125 nodes building for >1 hour18:14
fungilooks like maybe this is still the replenishment from the nodepool restart18:14
jeblairclarkb, fungi: unsticking jenkins actually helped a bit -- i believe when jenkins is very slow to respond, it slows down the state conversions of servers enough to have an effect18:15
mordredclarkb: rebuilding18:15
fungijeblair: ahh, yep that would make sense18:15
jeblairclarkb, fungi: also, if it's slow enough to error out, that can actually kill a build that otherwise worked18:15
*** dustins has joined #openstack-infra18:15
jeblairclarkb, fungi: so, for example, once jenkins01 was shut down, there was an immediate uptick of about 50 ready nodes18:15
clarkbjeblair: gotcha18:16
fungiit does look like the current building hill is peaking, so maybe we'll see lots more used nodes here shortly18:16
*** hodos has joined #openstack-infra18:16
dimsanteaya: still see comments from that id we banned this AM - https://review.openstack.org/#/c/165947/18:17
*** bhunter71 has joined #openstack-infra18:17
openstackgerritJoe Gordon proposed openstack-infra/project-config: Remove check-tempest-dsvm-f20  https://review.openstack.org/16553218:17
clarkbdims: its cached, you have to wait for that to get invalidated18:17
dimsclarkb: ack thanks18:17
*** ddieterly has quit IRC18:17
fungii don't recall if anyone got around to disabling that account yet18:17
clarkbfungi: I think jeblair did it18:18
fungik18:18
clarkbbut the cache affects disabled accounts too18:18
fungiyep18:18
*** ddieterly has joined #openstack-infra18:19
*** cdent has quit IRC18:19
*** johnthetubaguy is now known as zz_johnthetubagu18:19
asselinanyone know how to get the apikey forthe jenkin's user credintials-id?18:20
clarkbasselin: login as that user, go to settings page click button to get api key18:21
clarkbasselin: I think you have to do that at least once but from that point on the key is available on disk in $JENKINSHOME/users/username/config.xml or something like that18:21
asselinclarkb, yeah...in that file it's ecrypted...but then there's no credentails-id18:22
mordred2015-03-19 18:21:08,826 INFO nodepool.image.build.devstack-trusty: Downloading pandas-0.15.2.tar.gz (4.6MB)18:22
*** jordanP has quit IRC18:22
mordredI just think that's a funny line18:22
clarkbasselin: not sure then18:23
clarkbasselin: but I think you need to request credentials before they have n id18:23
clarkbasselin: so try getting the api key if you haven't yet18:23
*** Somay has quit IRC18:23
asselinclarkb, yeah I crated a new user, got the api key, but the're no credentials-id....seems they moved stuff around in the latest version of jenkins....I will poke a bit more18:24
anteayadims: thank you18:24
anteayaclarkb: did we want to clear that cache?18:25
SpamapSmordred: It's been years since I downloaded pandas18:25
clarkbanteaya: it clears on its own18:25
anteayahow long does it take?18:25
sdagueanteaya: have you banned "Evgeny Antyshev" yet?18:25
clarkbanteaya: I am unsure18:25
sdaguehttps://review.openstack.org/#/c/165953/ - they are commenting on *every* nova patch atm18:25
anteayasdague: yes, but with open connections we have to either clear the cache or wait for it to expire18:25
clarkbldap groups has a cache of 1 hour, web sessions has a cache of 12 hours18:26
sdagueany idea who they work for?18:26
clarkbI don't think this will be in ldap groups18:26
anteayawe disalbed them more than an hour ago18:26
anteayasdague: parallels.com?18:27
*** ghostpl_ has joined #openstack-infra18:27
clarkbI don't think its a web session cache either since comments should come via ssh?18:27
anteayawe disabled them 3 hours ago18:27
*** AJaeger_ has joined #openstack-infra18:28
*** reed has joined #openstack-infra18:29
anteayaI'm not seeing that they created a new account18:29
AJaeger_pleia2: We need the zanata client installed on systems that run the post translation jobs and the translation proposal18:30
*** spzala has quit IRC18:30
asselinclarkb, I got it thanks18:30
anteayaand nodes in building state just went up over 45018:30
clarkbshow-caches makes me think it is the accounts cache but I can't find docs on the timeout for that18:30
AJaeger_fungi: You just answered my question on https://review.openstack.org/165661 - but I'm still confused18:30
*** emagana has quit IRC18:31
anteayaclarkb: can we clear the cache?18:31
AJaeger_Wouldn't a normal merge have already updated the content and the release job do it again?18:31
clarkbanteaya: we can, is this super urgent and letting the cache invalidate itself isn't cutting it?18:31
clarkb(I don't know how often this user is commenting)18:31
anteayasdague said it is every nova patch18:32
pleia2AJaeger_: AJaeger_ so translations proposal is the one that queries transifex/zanata to look for 75%+ translations and proposes them to gerrit, what are the post translations jobs?18:32
fungiAJaeger_: updating the _version_ if embedded in the document18:32
*** emagana has joined #openstack-infra18:32
fungiAJaeger_: as in release version number of the repo18:32
sdagueclarkb: it's just really misleading as we are now getting "This change was unable to be automatically merged with the current state of the repository. Please rebase your change and upload a new patchset."18:32
AJaeger_fungi: Ah, so we would have an update after the merge, than one with new version after a tag - and another one with the next change.18:33
sdaguefrom their poorly configured zuul18:33
AJaeger_So, if merges are rare, it's worth doing it both times - or what is the reason here?18:33
clarkbthe flush-caches command is running now18:34
anteayaclarkb: thank you18:34
AJaeger_pleia2: http://git.openstack.org/cgit/openstack-infra/project-config/tree/zuul/layout.yaml#n385 and http://git.openstack.org/cgit/openstack-infra/project-config/tree/zuul/layout.yaml#n392 - the upstream jobs18:34
fungiAJaeger_: it totally depends on the merge and release cadence of a given project. since i'm not intimately familiar with this one i simply proposed it do both. the extra run when the repo is tagged is not a lot of overhead18:34
AJaeger_pleia2: so, this is the push to transifex/zanata18:35
AJaeger_pleia2: while the proposal is the other way round18:35
AJaeger_fungi: Ok, understood. Thanks for the explanation18:35
clarkbmordred: can the following files in /opt/nodepool_dib be deleted: devstack-fedora21-dib-1426774571.qcow2-new devstack-trusty-1426785436.qcow2-new devstack-trusty-dib-1426774571.qcow2-new ?18:35
pleia2AJaeger_: ah, gotcha, I'll dig into which slaves these are18:35
mordred /opt/nodepool_dib/devstack-trusty-1426788873.qcow2-new is the current in-flight  one18:36
mordredclarkb: ^^18:36
clarkbsdague: anteaya dims caches are flushed, if you see comments after now then I would worry18:36
anteayaclarkb: thank you18:36
sdagueclarkb: thanks18:36
openstackgerritMerged openstack-infra/system-config: Revert "Temporarily stop adding devstack-f20 nodes in rax"  https://review.openstack.org/16562118:37
thingeeclarkb: hi, Phillip who you have helped in the past http://lists.openstack.org/pipermail/openstack-dev/2015-February/057585.html is claiming that his CI is waiting on some changes from you. I'm just verifying if you have any idea what he's talking about so I can take into consideration with Cinder's CI deadline for drivers.18:37
jeblairfungi: oh, did rax fix the problem ^ ?18:37
clarkbthingee: uh18:37
fungii'll keep an eye on ^ and make sure the new images there are working with ianw's patch in place18:38
fungijeblair: in theory, yes18:38
clarkbthingee: if there is a change up to propose the job to run on one node then I guess they would be waiting on us18:38
clarkbthingee: otherwise no I don't think there is anything preventing ^18:38
thingeeclarkb: ok, is that up for review? Sorry would check myself but context switching a bit at the moment.18:38
fungijeblair: patch to install_puppet.sh at https://review.openstack.org/16568118:38
clarkbthingee: my suggestion was to work on making the simple thing go before worrying about multi node and all that18:38
clarkbthingee: I don't think the simple thing has been proposed but I haven't checked recently18:39
thingeeclarkb: ok, thank you18:39
fungijeblair: hopefully we can clean that back up as soon as we stop using f2018:39
clarkbthingee: I would ask them18:39
jeblairfungi: oh ok18:39
*** ominakov has joined #openstack-infra18:39
fungiwhich i assume is not too far away given the testing going on with f21 currently18:40
clarkbthingee: see also http://lists.openstack.org/pipermail/openstack-dev/2015-February/057588.html18:41
mordredclarkb: I believe it worked18:41
mordred2015-03-19 18:40:28,630 INFO nodepool.DiskImageUpdater: Image devstack-trusty-1426790224.template.openstack.org in hpcloud-b5 is ready18:41
clarkbthingee: jeblair has a note on the simplest and quickest way to move forward18:41
mordred2015-03-19 18:37:04,903 INFO nodepool.DiskImageUpdater: Uploading dib image id: 11617 from /opt/nodepool_dib/devstack-trusty-1426788873.qcow2 for devstack-trusty-1426790224.template.openstack.org in hpcloud-b518:41
clarkbthingee: that is what I think has not happened18:41
*** hashar has joined #openstack-infra18:41
clarkbmordred: yay now we wait for one of those to boot successfully18:42
*** hdd has quit IRC18:42
mordredso - hpcloud-b5 is region to watch18:42
clarkbmordred: and if so you want to do `sudo -H -u nodepool nodepool image-upload hpcloud-bX devstack-trusty`18:42
*** dimtruck is now known as zz_dimtruck18:42
jeblairthingee: i don't see a pending project config change from someone named Philip18:42
mordredyup18:43
dimsclarkb: thanks!18:43
jeblairmordred, clarkb, fungi: fyi up to 625 instances of eoferror, so it's still been happening (so hopefully the new cloudinitless image will still help current situation)18:44
*** yamamoto has joined #openstack-infra18:44
fungifingers crossed18:44
mordredjeblair: it's good that it's still happening, so that we can see if the patch helped18:44
clarkbmordred: I have cleaned up those -new images that are not the one you just built18:45
clarkbmordred: that gives us a total of 22GB free disk after I also removed the older precise-dib, trusty-dib and centos7-dib images18:45
clarkbmordred: but basically we need more disk18:45
clarkbmordred: or fewer images18:46
* fungi is trying to make traction on the fewer images idea18:46
mordredclarkb: fewer images18:46
*** claudiub has quit IRC18:46
clarkbor smaller images18:46
clarkbor all of the above :)18:46
fungispeaking of fewer images, nova py27 job on devstack-trusty node using my experimental job which prepped the db and installed the distro packages needed http://logs.openstack.org/44/164444/1/experimental/experimental-nova-dsvm-python27/f4d9841/console.html18:47
fungilog confirms migration tests ran18:47
clarkbnice18:47
clarkbunfortunately since we use snapshots for all bare-* nodes ths doesn't quite help us :)18:48
fungisome eyeballs on that for obvious issues would be appreciated once things cool down18:48
clarkbat least not with the ENODISK problem18:48
*** tkelsey has quit IRC18:48
*** yfried has joined #openstack-infra18:48
*** yamamoto has quit IRC18:49
fungiwell, if we switched the tox builder to always run the database prep builders, we could drop mysql and postgres setup from our image build scripts and switch bare-.* to dib sooner18:49
clarkbthe back button doesn't work on jenkins' jira18:49
clarkbfungi: yup18:50
*** SotK has quit IRC18:50
fungiwhich is still a reasonable stepping-stone to where i want to get with bindep used for defining the additional packages to cache on the devstack images and install at the start of jobs18:50
clarkb"If this is a CloudBees customer we have two hotfixes that seem to work around the deadlock with the side-effect of degrading UI performance"18:50
fungiclarkb: open core!18:51
* greghaynes is curious if anyone is up for explaining how bindep gets us a single image vs many (that is the goal, right?)18:51
clarkbI am not going to login to respond to that but figured I should point out that this goes on on their tracker18:51
clarkbgreghaynes: single image per distro because every job can describe its system deps18:51
greghaynesoh! so the bindep will be at runtime18:51
clarkbyes18:52
greghaynesand then we can remove those deps out from buildtime18:52
greghaynesmake sense18:52
*** spzala has joined #openstack-infra18:52
fungigreghaynes: it _eventually_ enables us to start putting lists of required distro packages in individual repos, allowing us to self-test whether they're sufficient too18:52
clarkbwith lots of caching in the image build still18:52
greghaynesyea, that makes a ton of sense all of a sudden ;)18:52
fungiright, we'd iterate over all branches of all repos in gerrit and pre-cache the union of all their bindep files18:52
*** radez_g0n3 is now known as radez18:52
fungiconveniently, we already clone those repos onto the images anyway18:53
greghaynesIs there any next to dos on that?18:53
fungigreghaynes: you can start by reviewing my changes to bindep and testing it out if you haven't already18:53
reedfungi, isn't there a backup-as-a-service for the machines managed by infra? (re: the question on the community list about askbot)18:54
greghaynessounds good18:54
*** yolanda has quit IRC18:54
fungireed: there is, but what evgeny's talking about is dumping copies of critical files and database contents as a snapshot onto the local filesystem18:54
fungireed: then we can back that up to a remote server like we do for other critical systems, but the local dump is convenient for other reasons18:55
clarkbI am not finding any bugs on the jenkins tracker that matcht the behavior we see18:56
fungigreghaynes: i also have a bindep list mostly assembled that effectively turns our current devstack-.* workers into bare-.* workers https://review.openstack.org/164447 though that also needs some rigorous testing18:56
clarkbI will try to file one, pretty sure I have an account I can use18:56
openstackgerritAdam Gandelman proposed openstack/requirements: Relax pysendfile requirement  https://review.openstack.org/16596918:56
fungigreghaynes: if nothing else, it's a good example of a complex-ish list18:56
greghaynesawesome, ill try it out18:57
reedfungi, ok18:57
*** SotK has joined #openstack-infra18:58
clarkbhrm doesn't look like the heap stack error dumps that I collected the first time around have stack traces18:58
reedfungi, I think Marton overlooked those18:58
clarkbwell I will file what I have and give it a go18:58
fungireed: they were non-obvious, but thankfully also something which can be implemented in parallel with testing the rest since they're non-service-impacting to add18:58
mordredclarkb: we still don't have a GREAT way of answering "what image did this instance boot from" do we?18:59
clarkbmordred: we do not18:59
clarkbmordred: what you can do is nova show the instance19:00
clarkbmordred: then take the image id from there and cross check against nodepoo19:00
mordrednod19:00
clarkbmordred: so its doable, but you have to query nova not just nodepool19:00
jeblairclarkb, mordred: i did an image-list, looked at the age of the most recent image, then found instances in build whose ages were less than that19:00
jeblairclarkb, mordred: that got me, among a few others, 137042519:00
jeblairwhich i'm watching19:01
jeblairand it's in error state.19:01
fungimordred: or if you can ssh into it, you can see the image logged in /etc/nodepool but if the instance is erroring then not so much19:01
jeblair| fault                       | {"message": "No valid host was found. ", "code": 500, "created": "2015-03-19T18:56:46Z"} |19:01
*** timcline has quit IRC19:01
jeblairnext is 137042619:01
jeblair| image                              | devstack-trusty-1426790224.template.openstack.org (026f353e-0365-4769-a5a8-910e57d454a8) |19:02
jeblairmordred: is that right ^ ?19:02
mordredyes. that's the one we want19:02
jeblairno floatig ip yet19:02
jeblairb5 has a task queue 85 deep, could be a few mins19:03
openstackgerritDavid Shrewsbury proposed openstack-infra/shade: Add design and skeleton for an object interface  https://review.openstack.org/16450619:06
jeblairhrm.  we should invert what the provider manager does -- so that the calls happen in the worker threads, but the provider manager serializes and paces them.19:07
jeblairthat way when a deletion call takes 30 seconds, everything isn't waiting on it, but you still don't issue more than the target rate...19:07
*** pelix has left #openstack-infra19:07
*** Hal1 has joined #openstack-infra19:08
jeblairi wonder if openstack calculates rate based on when the request is received or completed?19:08
*** claudiub has joined #openstack-infra19:08
*** Hal1 has quit IRC19:08
fungii wonder where that rate limiting is implemented19:08
*** sushilkm has joined #openstack-infra19:08
funginova? keystone?19:08
*** Hal1 has joined #openstack-infra19:09
*** marun has quit IRC19:09
sushilkmis there a way to rerun the tests in between ... because out of multiple jenkins job the smalles job has failed .... and bigger jobs would continue running and come back later saying for a recheck19:10
anteayasushilkm: why have the jobs failed?19:10
anteayasushilkm: there may be a problem with the patch19:10
anteayain which case the jobs are working as expected19:10
sushilkmno problem is not with patch while verifying it worked, now there are same tests running for merge19:11
sushilkmand smallest test has failed19:11
sushilkmits a known bug19:11
anteayawhich patch?19:11
sushilkmwhich comes up sometimes19:11
sushilkmits regarding a patch in trove19:11
anteayawell if your patch is triggering a known bug your patch may be making the bug worse19:11
sushilkmhttps://jenkins02.openstack.org/job/gate-trove-python27/2302/console19:12
*** Hal2 has joined #openstack-infra19:12
*** Hal1 has quit IRC19:13
*** zz_dimtruck is now known as dimtruck19:13
anteayayeah patchset 2 failed 5 times, patchset 5 failed 3 times19:14
anteayaI think you need to consider the possiblity your patch may be increasing the tendency of a bug to appear19:14
clarkbhttps://issues.jenkins-ci.org/browse/JENKINS-2751419:15
fungiclarkb: now you're famous19:15
sushilkmok anteaya .. i wud have a look19:16
sushilkmthanks for the suggestion19:16
clarkbfungi: filing a bug is all it took to be famous? I should've done this ealier (I really thought that change I linked in the bug was the fix :( )19:16
fungiyou're in the who's who of jenkins jira bugs19:17
pleia2hehe19:17
fungimaybe cloudbees will offer you an open-core hotfix19:18
anteayaha ha ha19:18
*** marun has joined #openstack-infra19:19
*** otter768 has joined #openstack-infra19:20
openstackgerritKhai Do proposed openstack-infra/gerritlib: Update project docs and setup for tests  https://review.openstack.org/16597919:20
openstackgerritMatthew Treinish proposed openstack-infra/devstack-gate: DO NOT MERGE COPY ALL OF .TESTREPOSITORY to log server  https://review.openstack.org/14971919:20
*** HeOS has joined #openstack-infra19:21
openstackgerritMerged openstack-infra/project-config: Start using neutron for periodic stress job  https://review.openstack.org/16470519:22
clarkbmordred: any idea if your new image is working yet?19:24
*** bdpayne has quit IRC19:24
*** garyh has quit IRC19:24
*** otter768 has quit IRC19:25
openstackgerritJames E. Blair proposed openstack-infra/nodepool: Run tasks in worker threads  https://review.openstack.org/16598219:25
cineramatrying the new warfile in the puppet module19:25
jeblairclarkb, mordred: the test node i looked at is still in the nodepool queue19:25
fungiSlickNik: there are a couple changes in the gate for trove consistently failing python 2.7 unit tests. known issue?19:25
jeblairclarkb, mordred, fungi: which prompted me to write https://review.openstack.org/16598219:25
*** xyang1 has joined #openstack-infra19:26
pleia2cinerama: fingers crossed :)19:26
cinerama"we've secretly replaced the coffee here with folger's crystals. let's watch"19:26
jeblairclarkb, mordred, fungi: which is either going to be a _massive_ improvement to our throughput and efficiency, radically reducing the amount of servers we have sitting in build and delete19:26
*** hdd has joined #openstack-infra19:26
clarkbjeblair: looking at it now19:26
jeblairclarkb, mordred, fungi: or it could make everything blow up all at once19:26
clarkbjeblair: thankfully we have some tests now19:26
jeblairclarkb, mordred, fungi: frankly, a little hard to tell from where i'm sitting19:26
SlickNikfungi: funny you should mention that — that's what I'm looking at right now.19:26
jeblairclarkb: actually it passes tests :)19:26
cineramaoh pants, i forgot to update the template19:27
fungiSlickNik: okay, cool. just making sure it was on your radar19:27
clarkbjeblair: thats a start then :)19:27
*** asettle has joined #openstack-infra19:27
pleia2cinerama: I haven't done a diff on the new config file, but I expect it to be unpleasant19:27
jeblairclarkb: i think the failure mode is likely to be that we find that "start of api request" is not how rate limits are calculated19:27
jeblairclarkb: i think the next most probably failure is "nodepool runs closer to specified api rate limit which is wrong and needs to be adjusted to slow it down a bit"19:28
jeblairthat's pretty tractable19:28
clarkbjeblair: ya we should be able to handle that easily19:28
*** bhunter71 has quit IRC19:28
fungithere's a knob for that19:28
*** asettle has quit IRC19:28
fungii'm cool with that risk19:28
jeblairi need to grab food; if people like that change, i can hand-apply it on nodepool and restart when i get back19:29
*** bhunter71 has joined #openstack-infra19:29
anteayafix the world or blow it up, what's not to like19:29
clarkbjeblair: can you explain the changes to error handling?19:30
fungithe world's partway to blown up already, so not sure anyone will notice19:30
cineramapleia2: it looks like we only need the infinispan bit (though the version is different). the other bit in the release notes refers to a one-off thing needed for migration from an older version so if we start on the new version we should be good19:30
jeblairclarkb: old error handling was to capture the exception in provider manager thread, and transfer it to calling thread19:30
cineramanow for some spaghetti testing = throw it against the wall & see if it sticks19:30
clarkbjeblair: I see19:30
jeblairclarkb: (i thought that was a neat trick ;)19:30
clarkbjeblair: but now with run happening in the calling thread it can bubble straight up19:31
jeblairclarkb: new change runs actual work in calling thread, so is simpler -- task.main() will just raise the exception in the right context19:31
jeblairyep19:31
*** ddieterly has quit IRC19:31
*** ddieterl_ has joined #openstack-infra19:31
anteayafungi: good point19:32
greghaynesclarkb: it could before I think19:32
*** pc_m has quit IRC19:32
clarkbgreghaynes: ya they bubbled up before via the transfer thing jeblair described19:32
clarkbbut thats no longer necessary so we can rip it out19:32
greghaynesah, sorry, I thought you were pointing that out as an issue not a statement of fact ;)19:33
clarkbgreghaynes: ah, mostly just wrapping my head around the change19:33
clarkbjeblair: it lgtm I say go for it when you are done with lunch19:33
greghaynesI wonder if it passes my monkeypatch tests19:34
greghaynesI had to poke around a lot in providermanager for that19:34
fungiclarkb: jeblair: looks good to me too. wanting to hold off approving it until it's proven in production?19:36
greghaynestest in prod!19:36
*** tjones1 has quit IRC19:37
fungigreghaynes: well, nodepool _is_ already tested19:37
*** SumitNaiksatam has quit IRC19:39
clarkbit is afternoon already19:39
fungiyep, it's about time to rename the board ml19:40
zaroclarkb: sorry, haven't been paying attention.  scp plugin is still leaking memory?19:40
clarkbzaro: no, the ssh-slaves plugin is leaking threads see https://issues.jenkins-ci.org/browse/JENKINS-27514?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel19:40
clarkbzaro: this is the same issue from a few weeks ago, updating the ssh-slaves plugin does not fix it19:40
openstackgerritLars Kellogg-Stedman proposed openstack-infra/project-config: Add "heat-coe-templates" project  https://review.openstack.org/16480619:41
*** sushilkm has left #openstack-infra19:41
*** garyh has joined #openstack-infra19:41
*** _nadya_ has joined #openstack-infra19:42
*** _nadya_ has quit IRC19:43
clarkbSlickNik: looks like trove is still using probuscis19:43
clarkbSlickNik: looks like something database related though?19:45
SlickNikclarkb: yes for its legacy fake tests. We've moved all of the new unit tests to run under testr.19:45
SlickNikIt's one of the new testr unit tests that is causing the issue.19:45
clarkbSlickNik: its failing in the old tests too19:45
SlickNikSpecifically: FAIL - mgmt.test_models.TestMgmtInstanceDeleted.test_show_deleted_mgmt_instances19:46
clarkboh wait those tracebacks from run_tests are normal?19:46
clarkbI should just stop looking19:46
*** dprince has quit IRC19:47
SlickNikclarkb: Yeah it sucks that the old tests puke stacktraces for the negative tests that are supposed to result in exceptions. There's a bug to fix that in the works.19:47
*** sdake_ has joined #openstack-infra19:48
clarkbSlickNik: you should be able to just wrap in expected exception context managers or asserts19:48
Shrewsfungi: clarkb: When are the infra meetings held?19:50
anteayatuesdays 190019:50
Shrewsthx19:50
anteayain #openstack-meeting19:50
zaroclarkb: so is it happening only on some servers and not others?19:50
anteayathank you for asking, be nice to see you there19:50
SlickNikclarkb: yes we do. There was a change to the way that we patch eventlet for some of the fake tests that caused this to happen. I haven't dug deep enough into it yet to figure out why that was the case.19:50
Shrewsanteaya: do you have a posted agenda?19:50
anteayayes19:50
* anteaya fetches19:50
clarkbzaro: so far I think it has hit 01, 07, 02, 05, 0419:50
EmilienMcan I get a chance to get a +A on https://review.openstack.org/#/c/164451/ ? - it's about "zuul: add check-grenade-dsvm-neutron to tempest checks"19:51
anteayaShrews: https://wiki.openstack.org/wiki/Meetings/InfraTeamMeeting19:51
Shrewsanteaya: thanks again19:51
*** sdake has quit IRC19:51
anteayawelcome19:51
anteayaEmilienM: not from me today, anything that isn't required for ff gets put off to tomorrow for me19:52
anteayaEmilienM: we are having difficulty getting resources for patches in check19:52
zaroclarkb: ooh, 5 out of 9?  i thought it might be something different between the machines, like the underlying ssh library?19:52
anteayadon't want to add to that19:52
fungiShrews: the agenda is usually in a mild state of limbo between things which were discussed in the previous meeting which need to be removed and things which have been added to discuss in the next meeting, but add whatever you want to talk about19:52
clarkbzaro: 5 out of 719:52
EmilienManteaya: ack19:52
anteayaEmilienM: thanks19:53
*** sdake has joined #openstack-infra19:53
*** [HeOS] has joined #openstack-infra19:53
*** sdake_ has quit IRC19:55
*** kgiusti has left #openstack-infra19:56
mesteryHi infra folks! Looks like this patch (https://review.openstack.org/155470) has been stuck in the gate for 20 hours or so, any ideas what may be happening? /cc otherwiseguy19:56
*** HeOS has quit IRC19:56
zaroclarkb: gearman only offlines the node correct? not actually disconnects them.19:57
mesterythe check-tempest-neutron-dsvm-dvr job appears to be just hung there19:57
clarkbzaro: correct, then nodepool deletes them19:57
clarkbmestery: that job was running on a jenkins master that fell over, you can push a new patchset to unstick it19:57
mesteryarmax: The stuck patch is yours, FYI19:57
mesteryclarkb: Ack, armax can you do that?19:57
mesteryclarkb: Thanks!19:57
clarkba small number of changes were affected so rather than restart the world we decided that patch authors that are affected could address their specific changes19:58
mesteryclarkb: Makes sense to me19:58
*** eharney has quit IRC20:00
anteayamestery armax changing the commit message should be enough for zuul to consider that an event20:00
*** teran has joined #openstack-infra20:01
mesterythanks anteaya20:02
*** timcline has joined #openstack-infra20:02
anteayamestery: you're welcome20:03
clarkbmordred: http://paste.openstack.org/show/193676/ still shows up in the console log so I don't think removing that package completely cleans out cloud init20:03
clarkbmordred: but it doesn't appear to look for metadata so at least that bit seems to work20:04
anteayaclarkb: can it find the keys?20:04
*** mmedvede has joined #openstack-infra20:04
clarkbanteaya: nodepool hasn't tried to do that yet20:04
*** kgiusti has joined #openstack-infra20:04
anteayaah20:04
clarkbwhich is what jeblair's change from before should address20:04
mordredclarkb: does it have an ip address yet?20:05
*** amuller has joined #openstack-infra20:06
anteayaah having the requests performed in the calling thread20:06
clarkbmordred: only on the 10 net20:06
*** sambetts has quit IRC20:06
mordredsigh20:06
*** timcline has quit IRC20:06
mordredclarkb: I really want to know why it's trying to do something with cloud-init20:06
clarkbmordred: I think thats an init script20:06
*** sambetts has joined #openstack-infra20:06
clarkbmordred: its lilely that init script isn't removed by the ensure => absent20:07
zaroclarkb: looks like there's a known issue with memory leak if too many slaves.  https://issues.jenkins-ci.org/browse/JENKINS-2062020:07
*** shardy has quit IRC20:07
*** AJaeger_ has quit IRC20:07
zarohow many slaves do we have setup for max?20:07
mordredclarkb: probably should have done => purged - does puppet support that?20:07
clarkbzaro: its about 200 per max20:07
clarkbmordred: I do not know20:07
clarkbzaro: but its not really a memory leak. The threads leak20:08
clarkbzaro: the threads are relatively light on memory20:08
clarkbzaro: but we only have like 100MB for threds to live in20:08
mordredclarkb: yp - that's in an init script20:08
clarkbso once we hit that we lose20:08
*** e0ne has joined #openstack-infra20:09
*** amuller has left #openstack-infra20:10
*** dprince has joined #openstack-infra20:10
clarkbmordred: 8b8f8f1b-e1bf-48f4-aae6-3b0e91d13978 is the node I looked at fwiw20:10
clarkbmordred: yes you can use purged20:11
clarkbmordred: https://docs.puppetlabs.com/references/latest/type.html#package-attribute-ensure20:11
openstackgerritMonty Taylor proposed openstack-infra/system-config: Purge cloud-init  https://review.openstack.org/16599620:11
mordredclarkb: I agree ^^ :)20:11
*** kgiusti has left #openstack-infra20:12
clarkbconsole log shows it started ssh too but without mention of generating new host keys20:13
clarkblunch now, back in a bit20:13
anteayaenjoy food20:13
pleia2eek lunch, it's late!20:14
*** e0ne has quit IRC20:14
anteayahappy food to you too20:14
pleia2thanks :)20:15
fungimordred: also http://admintweets.com/regenerate-ssh-host-keys-on-boot-on-ubuntu/ (looking now to confirm)20:15
mordredfungi: sigh20:16
mordredfungi: so I guess I probably want to do that then20:16
fungiit can't hurt, but it's got me curious because i thought most platforms took care of that20:16
*** mrda-away is now known as mrda20:17
fungii'm testing with a held bare-trusty worker now20:18
*** Hal2 has quit IRC20:19
*** Hal1 has joined #openstack-infra20:19
*** emagana has quit IRC20:20
fungimordred: yeah, at least for the bare-trusty worker i tested with in rax, it comes back up with sshd running but erroring on connect20:20
mordredfungi: k. I've got a patch coming20:20
fungiSpamapS: theory disproven :/20:20
openstackgerritMonty Taylor proposed openstack-infra/project-config: Regenerate ssh host key on boot  https://review.openstack.org/16600020:21
mordredfungi: ^^ we can do that better - with a proper upstart script that goes after openssh server rather than in rc.local20:22
mordredfungi: but we have an rc.local we're installing anyway20:22
anteayamordred: nice round number on that patch20:22
mordredwoohoo!20:22
anteayamordred: extra line in https://review.openstack.org/#/c/166000/1/nodepool/scripts/prepare_node.sh20:23
* greghaynes points at his patch 10000020:23
anteayagreghaynes: I was just thinking of that20:23
greghaynes:)20:23
anteayaI remember the race for that number20:24
anteayaI had forgotten it was you that won the lottery20:24
openstackgerritMonty Taylor proposed openstack-infra/project-config: Regenerate ssh host key on boot  https://review.openstack.org/16600020:24
greghaynesAlso, seems like 99-unbound is becoming the 'do everything at the end' script rather than 99-unbound20:24
mordredyeah20:24
mordredthat whole stack needs some serious refactoring20:24
*** _nadya_ has joined #openstack-infra20:24
*** _nadya_ has quit IRC20:24
fungimordred: testing that rc.local patch with another held trusty machine20:24
mordredfungi: woo20:25
fungishould know in a second when it finishes booting20:25
*** emagana has joined #openstack-infra20:25
fungino dice. i'll try to step through what's wrong there20:25
*** Swami has quit IRC20:26
*** salv-orlando has quit IRC20:27
*** tkelsey has joined #openstack-infra20:27
*** tjones1 has joined #openstack-infra20:27
fungimordred: i think your conditional is wrong20:27
zaroclarkb: have you read this? https://wiki.jenkins-ci.org/display/JENKINS/Remoting+issue20:27
fungi`type dpkg-reconfigure >/dev/null 2>&1 ! test -f /etc/ssh/ssh_host_rsa_keg` (checking for a nonexistent file) returns 1 for me there20:27
jeblairfungi, mordred: just to confirm -- you found a node with mordred's change, but it didn't work?20:28
fungijeblair: no, i manually tried adding mordred's rc.local change to regenerate host keys to a held trusty node and then deleted its host keys and rebooted it20:28
fungi"functional test"20:28
*** pritesh has joined #openstack-infra20:29
jeblairfungi: oh ok.  so you're pretty sure it wouldn't work.  the node i was looking at failed the ssh test20:29
jeblair2015-03-19 19:40:47,007 ERROR nodepool.NodeLauncher: EOFError launching node id: 1370426 in provider: hpcloud-b5 error:20:29
jeblairfungi: which i think may coroborrate your experiment20:29
mordredyah20:29
jeblairokay, so while mordred iterates, i'm going to apply nodepool patch and restart20:29
mordredwow20:29
greghaynesmordred: what is the type dpkg-reconfigure trying to do? just see if dpkg-reconfigure exists?20:29
mordredjeblair: fungi did I raelly do keg?20:30
jeblairoh noes!  my patch failed pep8!20:30
greghaynesmordred: why not just go for it and || true, if it fails then it doesnt exist ;)20:30
clarkbzaro: I had not but the problem isn't connection issues20:30
anteayajeblair: no20:30
mordredgreghaynes: could do that too20:30
*** marcusvrn1 has quit IRC20:30
clarkbzaro: the problem is that jenkins leaks the threads when trying to disconnect nodes20:30
fungimordred: no, that was an intentional typo on my part to simulate a missing host key20:30
greghaynes(I think the type line is wrong and thats easier than my thinking of how to fix it)20:30
clarkbzaro: but the nodes go away and all that20:30
fungimordred: as in test that conditional yourself, i think it doesn't do what you think it does20:31
*** tkelsey has quit IRC20:31
mordredfungi: I did - but I left out the &&20:31
mordredcopypasta problem20:31
fungiaha, yep. thought that might be what you were going for20:31
greghaynesah, that would explain my confusion for what youre trying to do ;)20:31
*** wenlock has quit IRC20:32
openstackgerritJames E. Blair proposed openstack-infra/nodepool: Run tasks in worker threads  https://review.openstack.org/16598220:32
fungimordred: inserting && it definitely does what i think we want20:32
Shrewsmordred: your shade refactor proposal makes me yearn for python class templates20:32
mordredShrews: ++20:32
clarkbShrews: ABC kinda does that20:32
fungii'll give that another round of functional testing20:32
zaroclarkb: go away because we don't see it the reference in the gerrit ui?20:32
clarkbzaro: ya20:33
clarkbzaro: we will only have a couple hundred nodes but maybe 8k threads20:33
openstackgerritMonty Taylor proposed openstack-infra/project-config: Regenerate ssh host key on boot  https://review.openstack.org/16600020:33
mordredfungi: this time with &&20:33
jeblairclarkb, fungi, mordred: i'm going to restart nodepool with my change now20:33
fungijeblair: awesome20:33
mordredclarkb: yeah - I used abc ... but it's still not exactly what we want20:33
mordredclarkb: because C++ class tempaltes are AWESOME20:33
zarowondering if maybe the thread for the connection is still around, just not reported on jenkins ui?20:33
clarkbmordred: you don't need them in python...20:34
*** dims has quit IRC20:34
mordredclarkb: yeah you do - because you also need typing in python - because it's the biggest problem with python programming20:34
jeblairit is doing things very very quickly20:34
mordredjeblair: woot!20:34
clarkbjeblair: yay20:34
fungimordred: one other question. why the duplicate check for test -f /etc/ssh/ssh_host_rsa_key20:34
mordredjeblair: is it doing good things very very quickly?20:34
mordredfungi: because my fingers move too fast20:35
clarkbmordred: 3.5 is adding that or is it 3.6? but I think many pythonistas are cranky about it20:35
jeblairmordred: it's getting 500 unknown errors from hpcloud faster than ever before20:35
fungimordred: i'm going to test without the redundant inner one20:35
dprinceHas there been a Zuul change that would effect Fedora 20 TripleO nodes today?20:35
openstackgerritMonty Taylor proposed openstack-infra/project-config: Regenerate ssh host key on boot  https://review.openstack.org/16600020:35
jeblairTimeout: HTTPSConnectionPool(host='region-b.geo-1.compute.hpcloudsvc.com', port=443): Read timed out.20:35
dprinceI've not seen Fedora 20 CI jobs running for several hours...20:35
anteayamordred: it isn't blowing up, the choices were doing things quickly or blowing up20:35
jeblairthat is new20:35
mordredjeblair: ooh. that is new20:36
asselindoes infra host a build copy of scp.hpi anywhere?20:36
clarkbasselin: on tarballs.openstack.org20:36
mordreddprince: well, we've been having one of those days20:37
jeblairmordred: f20 is different20:38
dprincemordred: sorry to hear that. I figured I'd ask because I'm perplexed by this.20:38
clarkbf20 broke because rax did something weird in the base image or something20:38
clarkbfwiw I am not sure that install_puppet.sh is where we should put the workaround for these rpobems20:38
clarkbshould go in prepare_node.sh20:39
greghaynesWould that effect tripleo f20 though?20:39
*** ssam2 has quit IRC20:39
mordredwhich workaroudn is in install_puppet?20:39
mordredclarkb: ??20:39
clarkbmordred: the f20 make grub work workaround20:39
mordredah. gotcha20:39
mordredyeah20:39
mordredI agree20:39
clarkbgreghaynes: oh probably not20:39
*** dkranz has quit IRC20:39
greghaynesYea, that would be wierd if it did20:39
clarkbgreghaynes: unless f20 is broken across the board and tripleo has same bug as rax?20:39
mordredclarkb: is it possible taht the f20 make grub work workaround broke the tripleo cloud?20:39
dprinceFWIW, I see Fedora 20 nodes getting created in our cloud. But nothing is running actualy CI jobs for a few hours now20:39
clarkbmordred: maybe20:39
jeblairclarkb, fungi, mordred: i've lowered the rate limit to see if that affects the read time outs20:40
mordredjeblair: cool20:40
fungijeblair: that's a good sign though, i think20:40
clarkbI wonder if those nodes got lost when 04 fell over20:40
clarkbI can look at nodepool to check20:40
clarkbdprince: all of the nodes are either building or deleting20:40
clarkblet me try manually deleting the deleting nodes20:41
mordredfungi: any luck with that latest patch?20:41
fungimordred: looks like it worked20:41
anteayayay20:41
dprinceclarkb: okay, I saw some ACTIVE instances from a Nova prospective20:41
fungimordred: yep, new host keys on boot now20:41
*** yfried is now known as yfried|afk20:42
*** timcline has joined #openstack-infra20:42
mordredfungi: woot. then I'm going to push in those two patches and try rebuilding b5 again20:42
dprinceclarkb: perhaps they are stuck in one of the prepare scripts?20:42
*** timcline has quit IRC20:42
fungimordred: doing one last boot without deleting the keys just to make sure it's also not regenerating them when they already exist20:42
clarkbdprince: maybe, hopefully this foreground delete will shed some light on what may be happening20:42
mordredfungi: excellent point20:42
*** timcline has joined #openstack-infra20:42
fungimordred: also fine20:42
openstackgerritMerged openstack-infra/system-config: Purge cloud-init  https://review.openstack.org/16599620:43
fungimordred: good to go, at least for trusty20:43
*** radez is now known as radez_g0n320:43
openstackgerritMerged openstack-infra/project-config: Regenerate ssh host key on boot  https://review.openstack.org/16600020:43
dprincesudo rm -f /etc/ssh/ssh_host_* ?20:44
dprinceAre we sure ^^^ that doesn't hose everything?20:44
* dprince isn't sure what is deployed... just checking history20:44
mordreddprince: well, we are fixing that right now ...20:44
dprincemordred: okay, I'll leave you to it and check back in a bit20:44
clarkband it wasn't deployed to tripleo20:44
mordreddprince: although that's very new - so it's unlikely to be in your tripleo nodes20:45
clarkband it should work just fine on fedora because supposedly fedora does the right thing unlike ubuntu20:45
mordredrebuilding b520:46
jeblairClientException: Unknown Error (HTTP 502)20:46
jeblairalso seeing a bunch of those20:46
jeblairthe read timeouts have subsided though20:47
mordredjeblair: UnknownError are my favorite20:47
cineramapleia2: okay i think we got it going with the new zanata20:47
openstackgerritKhai Do proposed openstack-infra/gerritlib: Update project docs and setup for tests  https://review.openstack.org/16597920:47
mordredcinerama: woo20:47
cineramapleia2: oh wait i need more coffee :)20:47
openstackgerritKhai Do proposed openstack-infra/gerritlib: Update project docs  https://review.openstack.org/16597920:48
cineramai reverted one of my manifests but it also reverted the version change :)20:50
cineramabut the new template doesn't fail? :)20:50
*** jaypipes has quit IRC20:50
mordrednibalizer: 2015-03-19 20:50:42,147 INFO nodepool.image.build.devstack-trusty: Warning: Setting templatedir is deprecated. See http://links.puppetlabs.com/env-settings-deprecations20:50
mordrednibalizer: apparently puppet really wants us to know that20:51
*** ibiris is now known as ibiris_away20:51
anteayattx is waiting on https://review.openstack.org/#/c/150267/ to cut nova, just thought I would let you know20:51
anteayait is in check and almost finished running tests20:52
clarkbException: Timeout waiting for server 22243675-f398-4779-9650-d3e7da24b782 deletion in tripleo-test-cloud-rh120:52
mordredclarkb: wow. they're having the delete server problem too?20:52
clarkbmordred: I guess? I am trying a different node to see if it is consistent20:52
mordredShrews: btw - I figure it'll take us a few passes at one of the resources to get the pattern right20:53
nibalizerclarkb: wut20:53
openstackgerritKhai Do proposed stackforge/python-jenkins: Add info on how to contribute.  https://review.openstack.org/16600620:53
mordredShrews: I'm starting to think we actually don't want ResourceRequest after all20:53
nibalizermordred: it DOES20:53
nibalizerclarkb: IT NEEDS YOU TO KNOW20:53
nibalizerer mordred20:53
mordredShrews: since you coudl also "get" a resource and have that resource be still in a building state20:53
nibalizerclarkb: so what did I break?20:53
clarkbnibalizer: I do not know what you are referring to20:53
mordredShrews: so I think I may have been wrong about the ResourceRequest class20:53
*** emagana has quit IRC20:53
Shrewsmordred: I'm working through modeling a scenario now to see what this would actually look like20:54
mordred++20:54
mordredI started going down that path and then got distracted20:54
cineramatime for more wackypants java errors20:54
clarkbcinerama: o/20:54
*** emagana has joined #openstack-infra20:54
Shrewsmordred: i'll post something up that's depedent on your review, when i have it20:55
cineramathis has really been old home week for languages i haven't done in a while20:55
openstackgerritKhai Do proposed openstack-infra/jenkins-job-builder: Add standard openstack info on how to contribute.  https://review.openstack.org/16600820:55
*** Hal1 has quit IRC20:56
cineramai'm going to have to do perl soon aren't i?20:56
fungicinerama: algol20:56
cineramafungi: i have an algol book somewhere around here20:57
*** ominakov has quit IRC20:57
fungidon't we all?20:57
*** Sukhdev has joined #openstack-infra20:57
mordredclarkb: blast - today has been so fun I forgot to turn on the day of sportsball20:58
clarkbmordred: any upsets yet? I haven't really followed it either20:58
cineramai can see it from here actually. it's next to 'code complete' and 'common lisp'20:58
Shrewsmordred: it has been exciting20:58
Shrewsnotre dame really tried to lose20:58
clarkbmaybe I should go downstairs and turn on the tv20:58
*** bdpayne has joined #openstack-infra20:59
Sukhdevclarkb: need a quick favor - hope you can help20:59
mordredclarkb: yes - iowa state lost20:59
clarkbSukhdev: maybe20:59
anteayaSukhdev: just ask20:59
fungicinerama: you need "fortran programming: a spiral approach" i can't recommend it enough20:59
mordredclarkb: to uab20:59
mordredclarkb: and Georgia State beat Baylor20:59
SukhdevIf I look at these instructions, http://docs.openstack.org/infra/manual/drivers.html#end-of-release21:00
mordredI had both of those in the 8 - so that's not great for the old bracket21:00
mordredbut go GA State21:00
SukhdevIt says to delete a release select the release and delete it21:00
clarkbthe eastern wa game is one I want to se21:00
cineramafungi: sweet cover but i'm disappointed it's not spiral bound21:00
Sukhdevbut, when I go here - https://review.openstack.org/#/admin/projects/stackforge/networking-arista,branches21:00
Sukhdevthere is no way to select the release -21:01
*** asselin_ has joined #openstack-infra21:01
Sukhdeveg. I want to delete proposed/kilo21:01
mordredclarkb: I have georgetown there- but I'll be rooting for ewa21:01
anteayaSukhdev: have you tagged your release?21:01
Sukhdevnot yet21:01
clarkbmordred: at some point the big sky is going to win a tournament game21:01
anteayayou have to tag before you delete21:01
clarkbmordred: and this is a good shot for them to do it21:01
anteayaSukhdev: This step should be performed by the OpenStack Release Manager after the release is tagged.21:01
fungiSukhdev: see where that section starts with "This step should be performed by the OpenStack Release Manager..."21:02
*** yfried|afk is now known as yfried21:02
jeblairclarkb, fungi, mordred: i'm continuing to tweak tunables to see if i can get this to perform better21:02
mordredjeblair: cool21:02
fungiSukhdev: you'll need to get a release manager or infra admin to delete branches for you. it's not a gerrit permission we can safely grant21:02
mordredclarkb: yeah - I don' tbelieve in georgetown at all21:02
*** patrickeast_ has joined #openstack-infra21:02
clarkbfungi: I think we can grant it to stackforge projecs?21:02
fungiclarkb: didn't we try?21:03
*** tsg_ has quit IRC21:03
clarkbfungi: I am trying to remember if that was possible, they have to be branch owners right?21:03
Sukhdevfungi: Ah ha - that is the part I was missing - thanks for clarification21:03
anteayahow do we prevent them from deleteing prior to tagging21:03
anteayathat will be a fun one to try to fix21:03
fungiclarkb: i think they had to be project owners. zaro: do you recall?21:03
*** dannywil_ has quit IRC21:03
*** patrickeast has quit IRC21:03
*** patrickeast_ is now known as patrickeast21:03
clarkbjeblair: anything we can do to help?21:03
clarkbfungi: ya thats what it was21:03
clarkbfungi: which imprated a bunch of other prems that werent great21:03
fungianteaya: we don't prevent anything. we just load the gun and they can tell us if they want it aimed at their foot21:04
clarkbok second node in rh1 failed to delete due to a timeout as well21:04
clarkbso all the clouds are broken21:04
anteayafungi: yes21:04
*** dannywilson has joined #openstack-infra21:04
anteayapicturing infra become a waiting room with bleeding feet next release21:04
*** dangers is now known as dangers_away21:04
*** ivar-laz_ has joined #openstack-infra21:04
mordredclarkb: have I mentioned that I don't think deleting things should be hard?21:04
Sukhdevfungi anteaya: Actually, I first called it proposed/kilo, and then later decided, I should really call it kilo - so, end up creating two branches - will tag the kilo one, but, wanted to get rid of proposed/kilo -21:04
anteayaSukhdev: glad you are thinking about your own feet21:05
*** asselin_ has quit IRC21:05
*** rfolco has quit IRC21:05
*** ivar-laz_ has quit IRC21:05
fungiSukhdev: if you're ready for that branch to be deleted, i can do so. but having a branch named "kilo" instead of "stable/kilo" is going to cause integration testing problems21:06
jeblairClientException: Unknown Error (HTTP 503)21:06
jeblairwow!21:06
openstackgerritMerged openstack-infra/jenkins-job-builder: Add missing '  https://review.openstack.org/16578621:06
jeblairso many unknown errors!21:06
clarkbBadRequest: Error. Unable to associate floating ip (HTTP 400) (Request-ID: req-ae2cf0db-28bc-4081-b09b-79a60e5495f2) is the rh1 issue I think21:06
*** ihrachyshka has quit IRC21:06
*** ivar-laz_ has joined #openstack-infra21:06
jeblair500, 502, 503... what, no 501?21:06
mordredclarkb: also - texas just lost - so that's 3 losses for the B1221:06
mordredjeblair: I'm sure there's a 501 coming ...21:06
anteayayou are missing the full series21:06
anteaya501 is the collector's item apparently21:06
clarkbdprince: so if you want ot look into something I think ^ is the thing21:06
Sukhdevfungi: Thanks for pointing this out - in that case, can you please do me a favor and delete them both - I will create stable/kilo and tag that one.21:06
*** ivar-laz_ has quit IRC21:07
*** bhuvan has quit IRC21:07
*** ivar-laz_ has joined #openstack-infra21:07
Sukhdevfungi: Thank you - that did it…appreciate it...21:08
fungiSukhdev: done. i've deleted them both, they each had the same commit sha as the tip of master, 467aa2a8c0f5cc1f52eeef07e36fb6894f8641c321:08
*** ivar-lazzaro has quit IRC21:08
*** masayukig has quit IRC21:08
*** masayukig has joined #openstack-infra21:08
dprinceclarkb: are you getting 501 errors from Nova API?21:09
clarkbdprince: no that is a 400 error21:09
*** bswartz has quit IRC21:10
mordredhpcloud-b5 image built - uploading21:10
mordred2015-03-19 21:09:40,919 INFO nodepool.DiskImageBuilderThread: DIB image 118717 with file devstack-trusty is built21:10
dprinceclarkb: I'll check the logs, thanks21:11
*** andreykurilin_ has joined #openstack-infra21:12
*** yfried is now known as yfried|afk21:12
*** Somay has joined #openstack-infra21:12
clarkbwow we upload logs for every instance booted by tempest?21:13
mtreinishclarkb: well the qemu logs, we added that a while ago21:15
*** ldnunes has quit IRC21:15
*** ivar-laz_ has quit IRC21:15
clarkbdprince: fwiw I can keystone catalog but operations like nova list and neutron floatingip-list hang21:16
*** ominakov has joined #openstack-infra21:16
*** yamamoto has joined #openstack-infra21:16
*** radez_g0n3 is now known as radez21:17
Sukhdevfungi: Is there a way to delete the tags?21:17
fungiSukhdev: not really, no. tag deletion propagates to push mirror but not via pull mechanisms like fetch all or remote update21:19
fungiSukhdev: so lots of places will have the old tags cached after deletion21:19
clarkbdprince: ERROR (ConnectionRefused): Unable to establish connection to http://10.1.8.37:5000/v2.0/tokens the plot thickens21:19
clarkbdprince: seems like a broken proxy or something21:19
jeblairclarkb, fungi, mordred: i'm continuing to slow the rate and increase timeouts on hpcloud, but i'm also continuing to see problems21:19
Sukhdevfungi: cool - thanks21:19
*** andreykurilin_ has quit IRC21:20
*** dboik_ has joined #openstack-infra21:20
mordredjeblair, clarkb, fungi, pleia2: hpcloud seems to be having some systemic issues, fwiw21:20
*** andreykurilin_ has joined #openstack-infra21:20
*** dboik_ has quit IRC21:20
*** otter768 has joined #openstack-infra21:21
*** yamamoto has quit IRC21:21
*** davideagnello has joined #openstack-infra21:21
*** dboik_ has joined #openstack-infra21:21
clarkbmordred: yes, ec2 meta service is unwork :)21:21
jeblairi hope we didn't cause them.  afaik, we never exceeded our rate limit on the calling side21:21
mordredclarkb: nope- bigger21:21
mordredclarkb: an incident has started21:22
anteayano21:22
anteayanot an incident21:22
mordredclarkb: not just "metadata service is yuck"21:22
clarkbmordred: jeblair I would not be surprised if this has been part of the larger underlying issue we have had with hpcloud over the last week or two21:22
mordredclarkb: ++21:22
*** ominakov has quit IRC21:22
*** dboik has quit IRC21:23
jeblairwe're currently at around one api call every 0.8 seconds (across all 5 providers)21:23
*** davideagnello has quit IRC21:24
jeblairconsidering that my 'nova show' command i've been trying periodically for the past 10 minutes still is returning 502 every time i try, i'm starting to think something has fallen over21:24
*** ivar-lazzaro has joined #openstack-infra21:24
*** dkliban is now known as dkliban_afk21:24
*** davideagnello has joined #openstack-infra21:25
*** xyang1 has quit IRC21:25
*** otter768 has quit IRC21:26
jeblairand hrm, we've got basically nothing from hpcloud in use21:27
*** salv-orlando has joined #openstack-infra21:27
*** achanda has quit IRC21:28
anteayaoi21:28
clarkbmaybe we should take this as an opportunity to patch apache against openssl problems >_>21:30
*** andreykurilin_ has quit IRC21:30
*** andreykurilin_ has joined #openstack-infra21:30
jogoI see the Evgeny Antyshev user is still spamming21:30
clarkbjogo: link?21:31
jogoclarkb: https://review.openstack.org/#/c/165767/21:31
jogohmm21:31
jogoone moment21:31
jogoclarkb: ahh yup https://review.openstack.org/#/c/165767/21:32
anteayathat happened just a few minutes ago21:32
anteayasame email21:32
*** dkranz has joined #openstack-infra21:33
anteayais there more that one user with the same full name and email address in the gerrit db?21:33
clarkbno, and I have confirmed that user is marked inactive21:33
clarkband I did flush the caches21:33
clarkbso maybe if they have a connection open it stays open?21:34
anteayaI get one account id for that email21:34
clarkbanteaya: yes, 1343121:34
anteayaany idea how we id which connection is theirs and kill it?21:34
anteayaclarkb: confirmed21:34
clarkbanteaya: no, not sure21:34
clarkbits possible we could find PTR records that belonged to parallels.com and shoot those tcp connections?21:35
anteayaI'm for that21:35
anteayahow can I help?21:35
*** carl_baldwin has quit IRC21:35
*** andreykurilin_ has quit IRC21:35
clarkbanteaya: well I am not really for it :)21:35
anteayaokay21:35
anteayawhat are you for?21:35
clarkbits imprecise and even then the tools for it are not great21:35
*** mattfarina has quit IRC21:35
jogoanyone try contacting that email address?21:36
clarkbso first question, has anyone sent email to that address?21:36
anteayayes21:36
*** andreykurilin_ has joined #openstack-infra21:36
anteayajogo: dims did this morning and cc'd me, no reply21:36
*** ChuckC has quit IRC21:36
*** achanda has joined #openstack-infra21:37
anteaya11:15am eastern time21:37
clarkboh sweet gerrit show-connections will show you uids too21:38
anteayaawesome21:38
clarkbnow to figure out if there is a gerrit command to kill a connection21:39
*** alexpilotti has joined #openstack-infra21:39
*** dims has joined #openstack-infra21:39
clarkbzaro: do you know if gerrit kill can be fed a session id?21:39
jogogood news is I only count about 40 comments from him today21:40
jogoso this is bad but not terribad21:40
*** salv-orlando has quit IRC21:40
anteayajogo: well we needed to know if flushing caches solved the problem21:40
jogoanteaya: right21:40
anteayaand it didn't, so thanks for letting us know21:40
clarkbhttps://groups.google.com/forum/#!topic/repo-discuss/_xWxgvEryVg21:41
anteaya201321:41
jogoanteaya: do you have a email for CloudByte CI ?21:41
jeblairso, last-ditch option is restart gerrit21:42
jogohmm never mind may not be related21:42
anteayaopenstack-dev@cloudbyte.com21:42
*** salv-orlando has joined #openstack-infra21:42
*** garyh has quit IRC21:43
jogoanteaya: it looks like he was testing open patches in the sandbox and there are a lot of cloudbyte ones21:43
anteayajogo: ah21:43
clarkbjeblair: just looking at the graphs for nodes it doesn't appear that rax has fallen over due to your chnage21:43
jogowas hoping he was working on a CI system that had a contact ... :/21:43
clarkbjeblair: seems likely that your change is a good one considering that21:44
jeblairclarkb: i agree21:44
*** dboik_ has quit IRC21:44
jeblairclarkb: well, i don't know if it has caused hpcloud to fall over :/21:44
anteayagerman company going by parallels.com21:44
*** rlandy has quit IRC21:44
asselinjogo,  I have some e-mails from cloudbyte21:44
clarkbjeblair: ya I guess that it may still have caused that :)21:44
*** dboik has joined #openstack-infra21:45
*** timcline has quit IRC21:45
asselinjogo, let me konw if you want more names e-mail addresses21:45
*** peristeri has quit IRC21:46
anteayaI'm more concerned we don't have a way of closing an open connection for a disabled account21:46
anteayaI'm surprised this never came up before21:46
*** dboik_ has joined #openstack-infra21:46
*** dhritishikhar has quit IRC21:48
*** eharney has joined #openstack-infra21:48
*** mtanino has quit IRC21:48
*** mriedem is now known as mriedem_away21:49
clarkbanteaya: ya not sure what gerrit is doing internally there21:49
*** tsg has joined #openstack-infra21:49
clarkbanteaya: its possible that the session timeout of 12 hours is what will clean it up21:49
bhunter71darn.   I see a 'gerrit close-connection' command in gerrit 2.11.21:49
mordredthere's a 2.11 already?21:50
*** dboik has quit IRC21:50
*** eharney has quit IRC21:50
jogoasselin: that was a red herring, thanks anyway21:50
anteayayes I'm looking and so far your unanswered question from 2013 is the best lead21:50
bhunter71modred: perhaps not, google just took me to 2.11-rc121:51
bknudsonone thing kind of funny about depends-on is that I *could* use the same change-id in 2 projects... e.g., same changeid in keystone as in devstack.21:51
*** dboik_ has quit IRC21:51
bknudsonand I would prefer to since it makes the other one easy to find.21:51
cineramapleia2: this time for real :)21:52
*** achanda has quit IRC21:53
clarkbsince things are sort of derpy right now anyone want to review https://review.openstack.org/#/c/165542/ to fix html rendering for console.html files?21:53
pleia2cinerama: how's it going? :)21:53
mordred2015-03-19 21:28:40,286 INFO nodepool.DiskImageUpdater: Image devstack-trusty-1426799380.template.openstack.org in hpcloud-b5 is ready21:54
openstackgerritStephanie Miller proposed openstack-infra/puppet-zanata: Initial commit of Zanata-specific sections  https://review.openstack.org/14794721:54
cineramapleia2: actually works now. i've just pushed the template changes21:54
pleia2cinerama: nice! I'll test21:54
*** andreykurilin_ has quit IRC21:55
cineramapleia2: there were a few more changes in the standalone template that weren't advertised in the release notes21:55
anteayaclarkb: yes the kill command takes a task id, but I don't see any returned values from show-connections that could be construed as a task id21:55
*** andreykurilin_ has joined #openstack-infra21:55
anteayaclarkb: could we try username though or ip?21:55
pleia2cinerama: I'm not surprised21:55
zaroclarkb: it looks like you kill with an ID.21:55
zarofrom the docs anyways21:55
clarkbzaro: ya but session ids are not task ids from what I can tell21:55
cineramapleia2: i added the infinispan stuff they mentioned, which curiously is not in the prebaked 3.6.0 standalone xml21:56
anteayaclarkb: can we try?21:56
pleia2cinerama: yeah, I don't even know what that is21:56
anteayaclarkb: worst thing that can happen is it doesn't work, yeah?21:56
clarkbanteaya: no worst thing is we kill a real task that is doing something else with the same id number21:56
anteayaclarkb: oh21:56
anteayaclarkb: yeah okay let's not that21:57
anteayasigh21:57
zaroclarkb: you could probably just do suexec to do anything you want21:57
clarkbzaro: well we need to disconnect an inactivated account, I don't think suexec will do that21:58
*** dustins has quit IRC21:58
*** yfried|afk has quit IRC21:58
anteayazaro: account 13431 was disabled about 6 hours ago21:58
anteayaand we flushed the caches21:58
*** yfried|afk has joined #openstack-infra21:58
anteayabut it is still spamming patches with comments from a misconfigured ci21:59
*** hdd has quit IRC21:59
anteayamost recent was about 30 minutes ago21:59
*** oomichi has joined #openstack-infra21:59
anteayanot a big deal but we would like the ability to be able to stop disabled accounts from posting patches after they are disabled21:59
jeblairclarkb, mordred, fungi: hpcloud is effectively offline for us22:00
anteayaposting comments to patches, sorry22:00
anteayajeblair: :(22:00
jeblairno new nodes in the past couple of hours22:00
*** eharney has joined #openstack-infra22:00
clarkbsdague: comment on one of your os-loganalyze test cleanups, I think we can make it even simpler22:00
zaroclarkb, anteaya : you then you probably can do with close-connection command?22:00
mordredjeblair: my node upload just wokred22:01
clarkbzaro: that doesn't seem to exist in our gerrit22:01
anteayazaro: do tell us about the close-connection command, does it have some docs?22:01
clarkbmordred: image upload? probably beause nova != glance?22:01
zarooops must be new, https://gerrit-review.googlesource.com/Documentation/cmd-close-connection.html22:01
mordredclarkb: it had been failing for the last $time22:01
mordredclarkb: becaue the glance apis are broken too22:01
*** achanda has joined #openstack-infra22:01
jeblairmordred: my 'nova show' still fails22:01
clarkbmordred: fun22:01
anteayahttps://gerrit-review.googlesource.com/Documentation/cmd-close-connection.html22:01
mordredjeblair: sigh22:02
zaroclarkb, anteaya maybe i can cherry pick that into our gerit?22:02
anteayazaro: 2.11-rc122:02
anteayazaro: oh please22:02
anteayayes please22:02
jheskethMorning22:02
clarkbwell that won't fix anything immediately22:02
clarkbjhesketh: good morning22:02
anteayamorning jhesketh22:02
anteayaclarkb: true22:02
anteayaclarkb: do we have any options for what will?22:02
*** dannywilson has quit IRC22:03
*** virmitio has joined #openstack-infra22:03
clarkbbtw the heat functional test takes almost 2 hours22:03
anteayawonderful22:04
anteayagood thing ttx went to bed22:04
*** hashar has quit IRC22:04
jeblairwow, so much for the 1 hour budget22:04
mordredjeblair: yah - nova command also no worky for me22:05
clarkbjhesketh: basically hpcloud and rh1 tripleo cloud are both borked22:06
clarkbjhesketh: so its been a fun day :) https://review.openstack.org/#/c/165542/ may interest you22:06
*** adalbas has quit IRC22:07
zaroanteaya, clarkb : looks promising, no conflicts.  now just need to test it.22:07
jheskethclarkb: ouch, sounds fun.. Let me know if I can help22:07
jheskethWill take a look, thanks22:07
*** dannywilson has joined #openstack-infra22:07
jogojust got a few more spams from the parallels guy ... :(22:07
anteayazaro: k, I see that command has a wait option, so if the wait option is not invoked it kills an open command immediately?22:08
anteayazaro: that is the functionality I'm looking for22:08
anteayajogo: dang, so far I don't know of a good way to kill that connection22:09
anteayajogo: right now the best option I have heard is wait on the expiration of some other kind of cache22:09
clarkbat this point I say we ride it out to the 12 hour session invalidation22:09
*** jamielennox|away is now known as jamielennox22:09
clarkbif its still trouble then we restart gerrit22:10
anteayazaro: open connection, not open command22:10
anteayaclarkb: if the only option is to restart gerrit then yes22:10
*** dboik has joined #openstack-infra22:10
*** e0ne has joined #openstack-infra22:11
jogosounds like this is a feature we want for future gerrit versions22:11
*** dboik_ has joined #openstack-infra22:11
anteayajogo: yes22:11
anteayaso a session is invalid 12 hours after it begins?22:12
anteayaclarkb: do we have any ability to search that account in open-connections and see when the session began?22:12
clarkbanteaya: 12 hours is when the cache should check if the account is still valid22:12
anteayagerrit username: eantyshev22:12
clarkbanteaya: we do, but I am not sure its from when the session began22:12
anteayahmmm22:12
clarkbits from whenver the cache was last updated for that entry most likely22:13
clarkbat worst it should be no later than 12 hours after we marked the account inactive22:13
zaroanteaya: yeah, this one right? https://gerrit-review.googlesource.com/#/c/63100/22:13
anteayafrom the docs it should show-connections should show a start time22:13
anteayaoh22:13
anteayazaro: that looks like the one, let me look at the patch22:14
*** dboik has quit IRC22:15
*** packet has quit IRC22:15
*** timcline has joined #openstack-infra22:16
anteayalooks like you can feed it a list22:16
anteayaand looks like the wait option is if you want the command to wait for the connection to close before exiting22:16
*** VijayTripathi has joined #openstack-infra22:16
anteayaI had read that wrong22:16
anteayayou feed it a session id22:17
*** yamamoto has joined #openstack-infra22:17
anteayaclarkb: show-connections gives you session ids?22:17
anteayazaro: yes please, I would like taht functionality22:18
*** aysyd has quit IRC22:18
*** e0ne has quit IRC22:18
*** AJaeger has quit IRC22:18
*** dims has quit IRC22:19
YorikSarIs there any place where Jenkins job run times are collected and stats for specific jobs can be viewed?22:20
*** aysyd has joined #openstack-infra22:20
anteayahttp://status.openstack.org/zuul/22:20
*** timcline has quit IRC22:20
anteayastats? what kind of stats?22:21
cineramavagrant upgrade, time to put the kettle on22:21
anteayawe have graphs at the bottom22:21
*** ghostpl_ has quit IRC22:21
*** yamamoto has quit IRC22:21
fungiYorikSar: graphite.openstack.org22:22
YorikSaranteaya: I want to see if some specific jobs became faster or changes pace in any way.22:22
anteayaah yes as fungi says http://graphite.openstack.org/22:22
*** ashleighfarnham has quit IRC22:22
*** dprince has quit IRC22:23
fungiYorikSar: also we have a new service tracking individual tests by interpreting the subunit data and extracting failure rates and timing for them, though it's currently only exposed as a read-only mysql query socket22:23
YorikSarfungi: Oh, cool. Will try to find it there...22:23
fungibut that may be more granular than you need22:23
*** ChuckC has joined #openstack-infra22:24
openstackgerritMerged openstack-dev/pbr: Don't avoid egg-info quite so often  https://review.openstack.org/16551122:24
YorikSarfungi: Wow... MySQL socket exposed to Internet?..22:24
YorikSarfungi: Sounds dangerous22:24
fungiYorikSar: yep. we're brave like that22:25
*** xyang1 has joined #openstack-infra22:25
mtreinishYorikSar: the docs for the db service are here: http://docs.openstack.org/developer/subunit2sql/22:25
fungitrust me we talked through all the security implications therein ;)22:25
YorikSarfungi: OpenStack Infra - bunch of cool brave people :)22:26
mtreinishfungi: heh, didn't we just say something like meh we'll throw it away if there's an issue :)22:26
mordredYorikSar: it's ... aggregated test data - if you hack it - we will have lost aggregated test run timing data - it's probably ok :)22:26
mordredyah22:26
fungimtreinish: yep!22:26
*** bknudson has quit IRC22:27
YorikSarfungi: Oh, I don't need to trust you. It's you who needs to trust you ;)22:27
mordredYorikSar: well, I stopped doing thatyears ago22:27
fungii try not to trust me22:27
YorikSarmordred: I'd prefer to mine some *coins on your MySQL cluster, why kill it? ;)22:27
*** xyang1 has quit IRC22:28
*** xyang1 has joined #openstack-infra22:28
fungiYorikSar: just cut us in on the take22:28
clarkbanteaya: yes session ids are returned22:29
*** mtanino has joined #openstack-infra22:29
*** amitgandhinz has quit IRC22:29
cineramawe should just create infracoin22:29
zaroanteaya, clarkb : https://review.openstack.org/16602522:29
fungicinerama: for all you know, we already have22:29
zaroanteaya: you need to be an admin to close connections though.22:29
mordredcinerama: your patch cannot land until it has mined a coin22:30
anteayaclarkb: great22:30
zaroanteaya, clarkb : tested and worked like a charm.22:30
clarkbjhesketh: fwiw I think we should move forward are using swift logs only for not dsvm jobs, and also add swift logs to the dsvm jobs but still copy to disk22:30
cineramafungi: nice digital currency you've got there...it'd be a real shame if someone busted it up22:30
anteayazaro: understood, but not at least an admin can close a connection, right now they can't22:30
clarkbjhesketh: then the last task will be sorting out the help footer22:30
anteayazaro: awesome thank you22:30
*** AJaeger has joined #openstack-infra22:30
*** AJaeger has quit IRC22:30
*** AJaeger has joined #openstack-infra22:30
YorikSarOh, it looks (from the first Google page) that noone thought about mining coins inside MySQL cluster.22:30
clarkbjhesketh: but everything continues to work with swift so keep pushing on it imo22:30
anteayazaro: what is your position on getting this into gerrit on saturday?22:31
*** xyang1 has quit IRC22:31
fungiwe have this new digital currency whereby service providers donate virtual machine resources and in return they get tested software they can use22:31
mtreinishYorikSar: it would probably just be easier to push up a self replicating patch to do that inside of gate jobs... :)22:31
jheskethclarkb: sounds good to me. I need to get onto the footer stuff, but will push some more jobs to flick over soon22:31
zaroanteaya: ohh, wait up that was for 2.9 wasn't it?  let me give it a try for 2.8 and see22:32
YorikSarmtreinish: Naaah, too easy :)22:32
anteayazaro: I'm fine with either but if we can have it for 2.8 on saturday I'd love to try22:32
anteayazaro: do you still have a 2.8 up for testing?22:32
clarkbhrm is this saturday that saturday?22:33
*** patrickeast has quit IRC22:33
* clarkb may have made a scheduling fubar22:33
zaroyep this is that22:33
mordredthis saturday is trusty22:33
anteayaclarkb: this saturday is upgrade gerrit os to trusty yes22:33
*** mjturek1 has quit IRC22:34
anteayaclarkb: starting at 150022:34
clarkbderp, I am supposed to go learn to not kill babies on saturday22:34
anteayaclarkb: you were on errands22:34
anteayaclarkb: well that is important too22:34
openstackgerritZane Bitter proposed openstack-infra/elastic-recheck: Add fingerprint for bug 1433555  https://review.openstack.org/16602622:34
openstackbug 1433555 in heat "functional tests are taking too long and cause false failures" [Critical,In progress] https://launchpad.net/bugs/1433555 - Assigned to Angus Salkeld (asalkeld)22:34
mordredclarkb: here's what you need to know: "don't kill them"22:34
anteayaclarkb: considering you have impending babies, I'd go with that as the higher priority22:34
anteayaclarkb: we have almost everyone else I think22:34
pleia2yeah, we've got a full house22:34
mordredI might be drunk22:34
clarkbmordred: did duke already lose?22:35
clarkb>_>22:35
* mordred throws an agitated emu at clarkb22:35
anteayahow can you tell an emu is agitated22:35
anteayaI've looked at one22:35
anteayahow do you know?22:35
*** boris-42 has joined #openstack-infra22:36
mordredanteaya: more rapid hollow thumping noises22:36
anteayaah22:36
anteayathis one made zero thumping noises22:36
mordredI believe emu's win the prize for one of the weirdest noises22:36
*** VijayTripathi has quit IRC22:36
zaroclarkb: if it's that cpr thing, i fogot it all like the next day22:37
cineramakookaburras sound like random jungle noises from every movie ever22:37
clarkbzaro: its like a full day of everything22:37
zaroyeah, forgot it all22:37
anteayamordred: really22:37
openstackgerritThanh Ha proposed openstack-infra/jenkins-job-builder: Add FindBugs support to Publishers  https://review.openstack.org/16602822:37
anteayamordred: for me that is won by partriges22:38
anteayaclarkb: the biggest thing you learn is who to call and when22:38
anteayaclarkb: so you don't have to think about it when you have to call22:38
*** gordc has quit IRC22:38
cineramacpr...there's some mnemonic...Airway Be Closing?22:38
fungiairways always be closing22:39
jogohttps://review.openstack.org/#/c/165629/322:39
jogoThis change depends on a change that failed to merge.22:39
jogonot sure what is failing there22:39
anteayahttps://review.openstack.org/#/c/135768/ this isn't approved22:40
jogois it because it is on top of a patch that has a depends-on in it22:40
jogoanteaya: right, not trying to merge anything per se, just have the tests run22:40
pleia2cinerama: when you're testing, is your local.pp still the one I pastebined a while back?22:40
mordredjhesketh: oh - do you have powers to do magical things to people's accounts?22:41
anteayajogo: oh depends-on and dependencies shouldn't prevent check jobs running on patches that need check jobs to run22:41
jeblairjogo: oh, you got that in the check pipeline?22:41
cineramapleia2: lemme get what i currently have for you...i split out the mysql on request22:41
jogojeblair: yup22:41
*** erlon has quit IRC22:41
pleia2cinerama: right, thank you22:41
jeblairjogo: cool, we've been waiting for that to appear again since we turned on extra debugging22:41
mordredjhesketh: because Ng made a rackspace account, but they put him in the LON region which apparently has nothing to do with the rest of rackspace22:41
*** patrickeast has joined #openstack-infra22:41
jogojeblair: o_O so your saying it may not be me doing something silly22:41
mordredjeblair: I don't know if you caught that this morning before you crisis-moded - but apparently rackspace LON does not work like DFW, IAD and ORD22:42
mordredjeblair: and it is not possible to use python-glanceclient with it22:42
jeblairjogo: yeah, is a bug somewhere i think; you can recheck for now, and i'll go see if we have logs that tell us what happened22:42
openstackgerritThanh Ha proposed openstack-infra/jenkins-job-builder: Add FindBugs support to Publishers  https://review.openstack.org/16602822:42
jeblairmordred: i guess we shouldn't ask for quota there then? :)22:42
fungimordred: or presumably syd which is where i split half of my personal systems for global redundancy purposes22:42
*** VijayTripathi has joined #openstack-infra22:42
fungisyd and iad seem to work very similarly from what i've seen anyway22:43
jogojeblair: thanks22:43
cineramapleia2: something like http://paste.openstack.org/show/193725/22:43
pleia2cinerama: thanks22:43
pleia2simple, I like it :)22:44
cineramapleia2: the mysql module does funky things at the moment if you specify a blank password for the zanata db user22:44
YorikSarOk, I'm lost... I'm trying to find job run time somewhere in "stats.timers.zuul.pipeline.check.job.check-tempest-dsvm-neutron-full.SUCCESS.*", but I don't understand those meters.22:44
*** radez is now known as radez_g0n322:44
pleia2cinerama: yeah, I saw your comment in the review a couple days back, nice find22:45
fungii think i got the foundation-board{,-confidential} ml split done correctly. after the list admin gets a chance to confirm, the foundation-board ml goes public with an (initially) empty archive22:45
cineramapleia2: i mention it again because i don't want it to eat your afternoon :)22:45
pleia2cinerama: much appreciated :)22:45
anteayafungi: splitting mailing lists, was it fun?22:45
fungianteaya: as fun as splitting firewood, but minus the blisters22:46
anteayaoh I like splitting firewood22:46
anteayaand yay for no blisters22:46
fungii'm okay with splitting firewood except when you get a really dense hardwood like locust where the blade just bounces right back at you22:47
anteayaah yes22:47
anteayanever swung at locust22:47
anteayabut ironwood bounces22:47
mordredfungi: I find the best way to split wood is to go to your neighborhood artisinal log store22:48
mordredfungi: you can get some lovely heirloom organic logs22:48
zaroanteaya: not so easy to get close-connection onto gerrit 2.822:48
anteayawith your tight pants22:48
anteayazaro: not to worry22:49
jheskethmordred: sadly no sorry. I'd be talking to support, likely like you are22:49
zaroanteaya: build errors, will continue to look at it.22:49
fungimordred: manhattan never ceases to amaze me22:49
anteayazaro: 2.9 is not far away22:49
jheskethAnd yeah, lon is a little special22:49
mordredjhesketh: yah - that's already in the works22:49
anteayazaro: don't hurt yourself, we can get by for another month or so22:49
mordredjhesketh: I just figured I never know what you have passwords to22:49
anteayazaro: glad 2.9 was an easy cherry pick22:49
anteayazaro: thank you22:49
zaroanteaya: ok, i won't bother then.22:49
mordredfungi: :)22:49
clarkbmordred: what are you londoning?22:49
anteayazaro: yeah, don't bother22:49
clarkblast I looked into it I needed a new account entirel22:50
jheskethmordred: yep fair question. I'm also happy to help bat for things as needed too22:50
openstackgerritThanh Ha proposed openstack-infra/jenkins-job-builder: Add support to CFP in builders sections  https://review.openstack.org/16450422:50
mordredjhesketh: thanks! I'll let you know if we run in to any roadblocks - as soon as we get the account working, Ng can finish hacking on the systemd stuff for the nodepool-dib things for rax22:51
fungiclarkb: i think the idea was not to london anything22:51
fungiclarkb: but Ng got inadvertently londoned and can't break out22:51
*** Somay has quit IRC22:51
anteayaI've heard that about that place22:52
anteayasomething about a tower22:52
clarkbNg can't use our fine artisinal MURICAN VMs?22:52
funginot even the organic ones22:52
jeblairclarkb: lol!22:52
*** asettle has joined #openstack-infra22:53
jheskethmordred: sounds good22:53
*** sputnik13 has quit IRC22:54
mordredclarkb: the MURICAN VMs have the ssh port on the opposite side22:54
*** mmedvede has quit IRC22:54
*** gordc has joined #openstack-infra22:54
mordredclarkb: the ohio state vcu game is pretty good22:54
fungimordred: and you don't "boot" murican vms, you trunk them22:55
clarkbfungi: ++22:55
*** sputnik13 has joined #openstack-infra22:55
* jeblair groans22:55
mordred++22:55
clarkbthat was good I laughed22:55
mordredfungi wins today22:55
jeblairthat was good i groaned22:55
* fungi fires off a groaner22:55
openstackgerritKhai Do proposed stackforge/python-jenkins: Add info on how to contribute  https://review.openstack.org/16600622:56
clarkblooks like the current top of stack heat test is giong to cuase a reset22:56
clarkbI am tempted to just shoot that test in the head early22:56
jeblairso what happened to our one hour budget anyway?22:56
clarkbjeblair: I have no idea22:56
anteayaclarkb: no22:56
anteayarargh22:56
openstackgerritMin Pae proposed openstack-infra/system-config: adding openstack-cue to eavesdrop  https://review.openstack.org/16603322:56
anteayattx needs the nova patch next in line, which has completed successfully22:57
openstackgerritKhai Do proposed openstack-infra/jenkins-job-builder: Add standard openstack info on how to contribute  https://review.openstack.org/16600822:57
jeblairsdague: was there a conscious decision to abandon the 1-hour budget?22:57
*** tiswanso has joined #openstack-infra22:57
anteayaboo22:57
clarkbhttps://review.openstack.org/#/c/154325/ bumped it22:57
*** Sukhdev has quit IRC22:57
clarkbI reviewed it and didn't catchthe affect it would have22:58
clarkbmerely pointed out how to make the timeout more better :/22:58
jeblairwell, i mean we just approved a 1.5 hour trove change yesterday too22:58
*** dannywilson has quit IRC22:58
*** gordc has quit IRC22:58
*** arxcruz has joined #openstack-infra22:59
*** hodos has quit IRC22:59
clarkbgreghaynes: btw one good thing that did happen today is we deployed your nodepool fix22:59
clarkbI did not get around to landing the nodepool image build logging change though and would rather avoid that right now simply because hair on fire22:59
*** bswartz has joined #openstack-infra23:00
openstackgerritMin Pae proposed openstack-infra/project-config: Add openstack-cue to gerritbot  https://review.openstack.org/15801123:00
*** vipul is now known as vipuls23:00
clarkbanteaya: to make that test failure even more awesome it failed due to an internal timeout23:01
fungijeblair: yep, though that trove job was already at 1.5 hours and we bumped it up an extra 10 minutes or something23:01
*** ddieterl_ has quit IRC23:01
clarkbanteaya: took longer than 1200 seconds23:01
anteayaclarkb: wooooo23:01
anteayalove that23:01
clarkbwhich I think is what the bug above that zaneb's change points out is about23:01
jeblairstevebaker: hi, what would it take to get the heat functional job down to about an hour?23:02
*** oomichi has quit IRC23:02
jeblairstevebaker: is there something that could be more efficient, or more paralellized, or can we split it into two jobs?23:02
fungilooks like they're trying to address it via https://review.openstack.org/16603023:03
harlowja_refrain refrain!23:03
harlowja_lol23:03
jeblairfungi, stevebaker: oh cool :)23:04
mordredclarkb: overtime23:04
clarkbmordred: what channel do I need to turn on?23:04
*** bhunter71 has quit IRC23:04
clarkbfungi: that change hasn't run that job yet but should tell us soonish I guess23:04
mordredclarkb: tnt23:04
fungiindeed23:04
clarkbhrm no more dprince23:04
*** andreykurilin_ has quit IRC23:05
openstackgerritDoug Wiegley proposed openstack-infra/project-config: For neutron and neutron-lbaas, skip more wasted jobs  https://review.openstack.org/16603523:05
*** pritesh has quit IRC23:05
clarkbI would sggest just turning off the rh1 region but there is no fallback so meh23:05
*** dimtruck is now known as zz_dimtruck23:05
ianwclarkb: np; the config-validation jobs are also ready to go i think23:05
fungiharlowja_: I OK kook i rigor w assessment of ofiii23:05
harlowja_lol23:05
*** asettle has quit IRC23:05
harlowja_what u say about my mom23:05
harlowja_how dare u23:05
fungii tried to run it through rot-13 but to no avail23:05
*** asettle has joined #openstack-infra23:06
clarkbfungi: I prefer rot2623:06
clarkbeasier to read23:06
fungiclarkb: yeah, i think that e-mail was drunken keyboard mashing run through rot-2623:06
*** arxcruz has quit IRC23:06
fungiclarkb: did my change to turn off tripleo-hp1 finally merge?23:07
fungiahh, yep, earlier today looks like\23:07
mordredfungi: yup23:07
*** jamespage has quit IRC23:08
*** gnuoy has quit IRC23:08
mordredfungi: turns out nodepool alien-list with it still in there was UNHAPPY23:08
*** gnuoy has joined #openstack-infra23:08
*** jamespage has joined #openstack-infra23:08
fungimordred: yep, that was one of the reasons i submitted that change23:09
mordredfungi: I also submitted a nodepool patch to not be unhappy on that command if a cloud is unhappy23:09
mordredfungi: but it's been a day - so I have not followed up on it23:09
fungioh, great idea23:09
clarkbadd a test please :)23:10
anteayayes test please23:10
*** bookwar has left #openstack-infra23:10
clarkbwe actually have some decent testing going on now23:10
anteayayay testing23:10
clarkboh this game is at the rose garden23:11
mordredclarkb: yah. you have rounds there in town23:11
fungii've been forcing myself to add tests for each of the new bindep features i committed too, since it unsurprisingly started out with a decent testing framework and i didn't want to regress it23:11
anteayanice work23:12
mordredclarkb: so the real question - is harvard going to beat unc?23:12
anteayathat heat test is still running23:12
jeblairokay, so i've confirmed that a gerrit query for a change issued immediately after a patchset is created may lack depends-on information23:13
anteayastarted at 21:0823:13
clarkbjeblair huh23:13
mordredOH23:13
anteayafinished23:13
mordredjeblair: aroo?23:13
jeblairso the error that jogo (and sdague before him) reported about changes in check being ejected because they depend on a change that failed to merge is because later on someone comments on a change and zuul updates information for it, and the second time sees the dependency that was not originally there23:13
anteayaBuild timed out (after 125 minutes). Marking the build as failed.23:14
mordredjeblair: that must have been fascinting to track23:14
*** tiswanso_ has joined #openstack-infra23:15
*** tiswanso has quit IRC23:15
jeblairmordred, clarkb: it may happen when a patch series is being updated simultaneously23:15
clarkbjeblair does it see the old patch of the dep or no dep at all?23:16
fungijeblair: so you were able to diagnose that with the additional logging then?23:16
jeblairclarkb: the dependsOn section is entirely missing from the returned data23:16
jeblairfungi: yeah, logging all of the traffic to/from gerrit23:17
*** aysyd has quit IRC23:17
fungihuh, so gerrit's omitting that from the event sometimes?23:18
jeblairyeah, i suspect it either hasn't received the other commit, or processed it, or something.23:18
fungithat seems like it's going to be weird to work around23:18
jeblairi can think of a thorough-and-slow way and a good-enough-and-fast way...23:19
jeblairthe thorough way would be to query for the parent commit sha, and if it is not a merged change, keep waiting for a depends on to show up.23:19
jeblairthe good-enough way would be to sleep for a few seconds :)23:20
anteayaha ha ha23:20
clarkbthis is thr gerrit internal depends on ya?23:20
fungithat's how i interpreted it23:21
funginot our commit message header23:21
jeblaircorrect, only talking about git-depends here23:21
fungiso, yeah, the sleep doesn't seem all that bad23:21
*** ghostpl_ has joined #openstack-infra23:21
*** otter768 has joined #openstack-infra23:22
*** tjones1 has quit IRC23:22
fungiit's already a rare enough occurrence23:22
fungii've only seen it reported a few times in the past couple weeks23:22
fungiso if that gets rid of most of the incidents, then it'll be rare enough we just don't care23:23
fungii wouldn't be surprised if it only happens when gerrit's really busy23:23
*** stevemar has quit IRC23:23
openstackgerritDoug Wiegley proposed openstack-infra/project-config: Promote lbaasv2 api tempest tests to voting for neutron and neutron-lbaas  https://review.openstack.org/16603823:24
*** kutija has quit IRC23:24
*** _nadya_ has joined #openstack-infra23:25
*** ghostpl_ has quit IRC23:26
*** otter768 has quit IRC23:27
ianwdo we have issues with symlinks in repos?23:27
*** chlong has joined #openstack-infra23:27
fungiianw: packaging them is problematic23:27
ianwparticularly i'm wondering if symlinking top-level HACKING.rst into the docs directory is ok23:27
fungiianw: you can already tell sphinx to include files from the top-level directory. we have at least some packages which do that for their readme23:29
ianwahh, yeah, that's a better path to take23:29
fungii don't know an example off the top of my head, but someone else might23:29
*** bdpayne has quit IRC23:30
*** _nadya_ has quit IRC23:30
*** camunoz has quit IRC23:32
mordredjeblair: ah - I totally thought you were talking about Depends-On footers23:33
*** stevemar has joined #openstack-infra23:33
mordredianw: we have people doing symlinks23:33
openstackgerritMerged openstack-infra/jenkins-job-builder: Add 'Job Log Logger' wrapper  https://review.openstack.org/16433823:34
mordredopenstack-infra/infra-specs:doc/source is a23:34
mordredgreat place to look23:34
*** MarkAtwood has quit IRC23:34
openstackgerritMerged openstack-infra/jenkins-job-builder: Remove nonexistent servername and cleanup build publisher  https://review.openstack.org/15609123:34
*** garyh has joined #openstack-infra23:34
fungimordred: is it mostly symlinking directories and/or python files which gets hairy for packaging?23:35
*** dannywilson has joined #openstack-infra23:35
mordredfungi: I think it's mostly the python files23:36
fungiianw: though poking around, you can apparently do:23:38
fungi.. include:: ../../HACKING.rst23:38
*** tonytan4ever has quit IRC23:38
fungiwhich will cause it to just slurp that content into your doc/source/hacking.rst or whatever23:39
*** tjones1 has joined #openstack-infra23:39
*** stevemar has quit IRC23:40
mtreinishfungi: yeah that's what I think everyone does. Some projects used to use symlinks but there was some reason we stopped23:40
mtreinishI can't remember exactly why though23:40
fungiwhich is apparently slightly more friendly to our comrades on filesystems which lack symlinks23:40
mtreinishfungi: bah, I think we should just insert arbitrary symlinks into all repos23:41
clarkbsymlinks break distutils or setuptools or both23:41
*** garyh has quit IRC23:42
mordredfungi, jeblair, clarkb: we may be about to get a request to turn off HP for a bit to help them recover23:43
*** tjones1 has quit IRC23:44
fungiwow!23:44
*** camunoz has joined #openstack-infra23:44
greghayneshah23:45
clarkbmordred it is effectively off23:45
mordredyup. they would like us to stop hitting their API for a bit23:45
mordredclarkb: no it's not23:45
*** radez_g0n3 is now known as radez23:45
mordredit's continually hitting their api asking for things23:45
clarkbwell ya but its not actually doing anything23:46
fungihpcloud is effectively off for us, we're not effectively off for hpcloud23:46
mordredright. what fungi said23:46
ianwclarkb: many things can disturb the distutils and setuptools gods.  i bet someone didn't make their sacrifice and now they've smote hp cloud23:46
mordredthey're having an API layer disaster at the moment23:46
fungihpcloud may _also_ be effectively off because of us. who knows23:46
*** ajmiller has quit IRC23:46
anteayaianw: I haven't seen a smoting in centuries23:46
openstackgerritMonty Taylor proposed openstack-infra/system-config: Turn off HP Public Cloud  https://review.openstack.org/16604323:48
mordredfungi, clarkb, jeblair, pleia2, jhesketh: ^^23:48
anteayareviewed23:49
pleia2sad trombone23:49
mordredjeblair: do you have local changes applied?23:49
*** derekh has joined #openstack-infra23:50
*** mfink_ has quit IRC23:51
openstackgerritMerged openstack-infra/system-config: Turn off HP Public Cloud  https://review.openstack.org/16604323:52
clarkbmight also be worth pointing out that they don't need us to do that for them...23:52
clarkbbut maybe firewalling off nodepool.o.o isn't a nice thing to do?23:52
*** derekh has quit IRC23:52
SpamapSwell on the bright side if there's no more changes to make hpcloud work, that's more time we can spend making the new cloud ;)23:53
anteayawell when I deal with third party ci, I always consider it polite if they turn themselves off23:53
anteayarather than forcing me to shut them down, as I am slow to turn them back on23:53
clarkbanteaya: sure, happy to do it (and have done it many times before)23:53
anteayayup23:53
clarkbfor all the clouds too23:53
anteayaso hoping they sort this out23:54
mordredjeblair: ok - puppet is disabled on nodepool I believe because you have local changes23:54
openstackgerritMatthew Treinish proposed openstack/requirements: Bump tempest-lib min version  https://review.openstack.org/16604423:54
*** mtanino has quit IRC23:54
clarkbmordred: jeblair probably the change to nodepool to do the requests at the rate limit?23:54
mordredclarkb, jeblair: I applied the quota changes to nodepool.yaml by hand23:55
mordredclarkb: it's been a while - is there a better way to get nodepool to pick up nodepool.yaml changes?23:55
clarkbmordred: no, next time through the run loop it will get them23:56
fungimordred: it automagically sees them23:56
mordredclarkb: awesome23:56
mordredthanks23:56
clarkbmordred: maybe we should be looking at spreading our across east and west? can we ask about using west again?23:58
*** garyh has joined #openstack-infra23:59
mordredclarkb: I will ask about that post-crisis23:59
clarkbs/our/out/23:59

Generated by irclog2html.py 2.14.0 by Marius Gedminas - find it at mg.pov.lt!