18:05:19 #startmeeting sahara 18:05:20 Meeting started Thu Apr 10 18:05:19 2014 UTC and is due to finish in 60 minutes. The chair is SergeyLukjanov. Information about MeetBot at http://wiki.debian.org/MeetBot. 18:05:21 Useful Commands: #action #agreed #help #info #idea #link #topic #startvote. 18:05:24 The meeting name has been set to 'sahara' 18:05:27 #ling https://wiki.openstack.org/wiki/Meetings/SaharaAgenda 18:05:34 hey folks 18:05:49 traditionally, the first topic is.... 18:05:51 #topic News / updates 18:05:55 folks, please 18:06:25 I was mostly finalizing the docs 18:06:30 I've started the merging process for the dashboard into horizon. I hope to have all the CRs up no later than tomorrow. 18:06:40 sahara is going to get air time during #devnation next week (assuming i start & finish the presentation). also, it'll get air time during hadoop summit in san jose. 18:07:07 I am testing all things related with Sahara 18:07:12 mattf: cool! 18:07:53 aignatov, i may snag your todo demo for #devnation. i can't seem to get my sigmask in place to make something. 18:08:25 I was working a few on docs and tested all related heat stuff in sahara, also updated sahara's resource in heat 18:08:30 re conference - has anyone submitted to hadoop world in nyc? 18:08:53 mattf: you mean demo about "Top todoers"? 18:09:18 crobertsrh, awesome, could you please announce new CRs to #openstack-sahara? 18:09:37 feel free to show it for masses :) 18:09:40 aignatov, quite possibly 18:09:45 #link https://review.openstack.org/#/c/86648/ 18:09:51 #link https://etherpad.openstack.org/p/sahara-icehouse-tbd 18:09:58 #link https://review.openstack.org/#/c/86614/ 18:10:10 aignatov, i'm thinking about adding a time component. who's removing their todos vs who is just accumulating them 18:10:40 #info the blocker for RC2 is HDP features / desciption update https://review.openstack.org/#/c/84797/ 18:10:49 when it'll be merged, we're reading to rc2 18:10:58 mattf: sounds like an interesting variation 18:11:05 I'll double check that all needed CRs are backported to m-p 18:11:11 dmitryme, i'll look pretty bad in it 18:11:18 and ask ttx to push the rc2 tag 18:11:24 dmitryme, could be good for public shaming 18:11:36 mattf: nice idea, I mean to get sources for one date and sources for another date and then a diff in TODOs, right? 18:11:59 SergeyLukjanov: can we put https://review.openstack.org/#/c/86641/ into rc2 as well? 18:12:01 aignatov, for speed i might just pull release tarballs 18:12:05 all, I'm pursuing the bigpetstore app as a demo, but it's not there yet. 2 parts, data generation, and csv processing. I ran the first part (with hdfs, needs mods to use swift). The 2nd part launches PigServer from inside a java app, so I'm trying to run it as a Java action but it needs a pig lib on the classpath. The other option is to extract the Pig script built in the java app and allow Oozie's pig runner to work on that. 18:12:22 probably this idea is a good on more demo's candidate for Atalanta summit 18:13:10 ideally we can have a different demo for #devnation, os summit, hadoop summit 18:13:11 dmitryme, sure, nice tip 18:13:39 mattf, aignatov, jspeidel, alazarev, please, review/approve https://review.openstack.org/#/c/86641/ 18:13:56 SergeyLukjanov, i've been super distracted, but at some point i'd like to formalize the criteria for breaching feature freeze or rc 18:14:38 mattf, it was already defined - hadoop2 updates and docs 18:14:59 o/ 18:15:25 SergeyLukjanov, my sampling of merges suggests we're not sticking to that. your notion of the criteria must not match my notion. 18:15:27 yep, seems to be hadoop 2 related changes are already in the icehouse 18:15:34 only docs left 18:15:46 mattf, and open issue, that are blocking some important features 18:15:50 #info https://launchpad.net/sahara/+milestone/icehouse-rc2 18:16:16 important is subjective 18:16:39 mattf, note that master is Juno already, not everything merged to master backported to m-p 18:17:14 SergeyLukjanov, /me nods 18:17:50 i'd like to discuss at some point, but after next week... 18:18:18 release will be on the next week :) 18:18:26 * mattf nods 18:18:32 mattf, we'll have much more for this process next release 18:18:43 * mattf breaks neck nodding 18:19:19 mattf, this time we have last two weeks before FF renaming savanna to sahara instead of doing things that we're doing now 18:19:49 so, that's why I've approved more backports than I wnat to see during the rc perion 18:19:50 SergeyLukjanov, i'm just raising a warning flag that we should find a way to formalize the criteria 18:19:53 period* 18:19:58 and clearly it's too late for this release 18:20:18 mattf, there is no way to formalize it ;) take a look on other projects 18:20:35 well, at least socialize it then 18:20:50 but...we should move on 18:21:13 all of the stuff was added to etherpad and milestone that were announced and discussed several times in irc and ml 18:21:41 oh, I find useful link 18:21:44 #link https://github.com/openstack/sahara/compare/2014.1.rc1...milestone-proposed 18:21:50 i really don't want to entirely derail the meeting w/ details 18:22:57 so, for rc2, we're waiting for https://review.openstack.org/#/c/86641/ https://review.openstack.org/#/c/84797/ and https://review.openstack.org/#/c/83036/ 18:23:06 any objections? 18:23:11 and additions? 18:23:30 oh, hext topic is already here 18:23:30 #topic Icehouse release status 18:23:37 next* 18:24:28 hehe, probably we already discussed Icehouse release status :) 18:24:39 #info both action items re aggregating list of needed CRs completed 18:24:54 #info done: aignatov to compost list of issues that should be fixed in Icehouse 18:25:00 #info done: elmiko to compose list of docs that should be done in Icehouse 18:25:08 aignatov, elmiko, thanks 18:25:18 #link https://etherpad.openstack.org/p/icehouse-fixes-after-rc 18:25:24 mattf, socialization, you've requested :) 18:25:29 will someone explain the functional impact of https://bugs.launchpad.net/sahara/+bug/1304995 ? 18:26:04 sreshetnyak, ^^ 18:26:05 performance, hadoop can work more faster if it has native libraries 18:26:07 what will people be able to see before and after the "fix"? 18:26:32 mattf, IIRC it's a significant perf. improvement for hadoop 2 users 18:26:38 aignatov, i get that in theory, but has anyone demonstrated it? what kind of performance increase are you looking at...? 18:27:10 SergeyLukjanov, if you have some data on that please pls pls toss it in the bug so we can work from the same facts 18:27:13 mattf: yes, try to create cluster w/o those libs and w/ it 18:27:34 mattf: mapreduce speed up, I think 18:27:44 generally i'm +1 for anything potentially perf related, but i'm more interested right now in making sure we communicate what the impact of changes is 18:27:45 more info - http://hadoop.apache.org/docs/r2.3.0/hadoop-project-dist/hadoop-common/NativeLibraries.html 18:28:19 in the first case you'll see multiple warnings about absence of those libs and all hadoop related actions will work more slowly in comparison with case w/ libs 18:28:55 AFAIK hadoop expect that these native libs will be available, it's just weird that they are not included 18:28:58 hadoop 1.2 contained native libs in del and rpm packages so we didn't worried about that 18:29:05 correction: only 32-bit ones included 18:29:07 del -> deb 18:29:54 imho removing a warning isn't important enough 18:30:45 i get that there's a discrepancy between what may be expected and what's provided, but if that doesn't change the user experience significantly why add it now instead of juno? 18:31:18 mattf: we actually have a backup plan in case images with native libs start to fail 18:31:59 dmitryme, you're way ahead of me. so what happens when those libs don't load on fedora because they were built for ubuntu etc? 18:32:19 * mattf grants ^^ is a theoretical issue atm 18:32:33 we will simply rebuild them without native libs and place new images instead of old ones on our download site 18:32:53 without changing images' names 18:33:00 so we will not have to change the docs 18:33:01 * mattf eeks a bit 18:33:14 * mattf expects tosky just shed a tear 18:33:28 eh :( 18:33:46 tosky: why? 18:33:56 well, at least the assumption is that "it works as before" 18:34:10 so not the top issue - I have other issues at the moment 18:34:34 should I ask for info on a bug now, or should I wait for the "open discussion" ? 18:34:46 (potentially rc bug if confirmed, I would say) 18:34:53 tosky, now 18:34:54 tosky, i don't think there's a bug for what to do if native libs fail 18:35:30 so, this authentication bug when launching a job https://bugs.launchpad.net/sahara/+bug/1305210 was reported by me but found almost at the same time by elmiko too 18:35:54 on 2 different systems no less 18:35:55 he went into details more than me 18:35:57 aignatov, dmitryme, SergeyLukjanov, imho the bar for changes during a stabilization period should have a clear functional justification and a way to test it 18:36:35 libhdfs.so library provide C API for hdfs 18:36:38 it's probably a bunch of work to do for this native library change, so i'd rather it go in at the beginning of the next cycle so we have 5mo to stumble on the issues 18:36:59 tosky: elmiko that's looks strange because we have savanna-ci which runs Pig jobs on the near-master devstack with keystone v3 18:37:00 so i'll probably not +2 it 18:37:08 #info RC2 will be earlier friday after merging and backporting last changes 18:37:23 this is the kind of criteria that we should lay out for stabilization 18:37:32 aignatov: do we generate the request for the "/v3/tokens" endpoint? 18:37:42 * mattf attempts to step down off soapbox 18:37:56 #info only fixes for super critical issues will be accepted after RC2 (like sahara couldn't start because....) 18:37:58 aignatov: do you use the internal storage for the jobs, or swift? 18:39:28 hmm 18:39:44 mattf, it's only related to the vanilla hadoop 2, that's why we're considering i 18:39:46 it* 18:39:49 for jobs I've used internal storage 18:40:00 tosky, I think swift is used in CI 18:40:26 uhm, could you please check? Also, if I missed some details in the bug, please tell me and I will add the missing steps 18:40:33 SergeyLukjanov, that "hadoop 2" bucket is very wide. it would definitely help if someone wrote down in a bp what it meant. but i'm off my soapbox now. 18:40:59 aignatov, SergeyLukjanov, i'm still curious if we are generating the url for the "/v3/tokens" endpoint? 18:41:15 elmiko, we're generating it if it's not available 18:41:17 tosky: the stack trace indicates that the code fails in 'upload_job_files' 18:41:38 elmiko, but it's used only for trusts-related stuff 18:41:42 AFAIU this code is should common for both Hadoop 1 and Hadoop 2 18:41:55 tmckay, aignatov, can you confirm? 18:42:09 SergeyLukjanov: my concern is that it's not the proper endpoint to keystone 18:42:17 dmitryme: uploading job files is the same for both versions 18:42:24 maybe i'm misunderstanding how we use it 18:43:10 https://github.com/openstack/sahara/blob/master/sahara/utils/openstack/base.py#L73 18:43:20 aignatov, dmitryme, agree 18:44:23 SergeyLukjanov: we might have a small issue then, v3 keystone api has "/v3/auth/tokens" but no "/v3/tokens" 18:44:35 elmiko, one more thing could be is the ".sahara" suffix 18:44:49 it is only needed for data sources 18:44:57 aignatov: on the swift:// uri? 18:45:09 elmiko, we're not generating the url, we're using keystone client 18:45:34 elmiko: for uploaded job binaries you don't need to provide that suffix 18:45:43 all, actually, I just got this error 10 minutes ago :) 18:45:52 SergeyLukjanov: ok, i just saw the same bug as tosky and somewhere a GET is made on "/v3/tokens" and it returns 404 18:46:08 elmiko, probably, old keystoneclient version? 18:46:10 aignatov: ok, i had to remove it for my stuff 18:46:18 using neutron, hadoop 2.3.0, tip of devstack. But, I only used swift for a binary -- which is not accessed from the hadoop cluster 18:46:21 SergeyLukjanov: could be 18:46:26 it's pulled by sahara and then copied 18:46:40 so, I wonder if it's a problem related to neutron or the tip of devstack 18:46:57 elmiko, the -transient job @ sahara-ci tests all our code that is related to keystone api v3 18:47:21 elmiko, it creates transient cluster, that means it creates trusts and etc. 18:47:43 SergeyLukjanov: ok, it was just odd that tosky and i saw this on 2 radically different setups 18:48:02 SergeyLukjanov: but transient cluster job doesn't run edp actions 18:48:12 SergeyLukjanov: he was using devstack and i was using rdo/icehouse 18:48:15 aignatov, yup, that's correct 18:48:23 elmiko, I should poke at this some, too. seems to be widespread 18:48:45 SergeyLukjanov: transient cluster, do you mean the cluster is created on-the-fly ("Launch on a new cluster")? 18:48:53 tmckay: cool, i'm trying to get my stack working again to see if it's still happening. i ended up having to back off v3 auth to get it working 18:48:58 tosky, yup 18:49:06 SergeyLukjanov: I ran the job on an existing cluster 18:49:15 tosky: same for me 18:49:21 bah 18:49:44 which keystone version you have in endpoints? 18:50:19 elmiko, I've got this issue right now :) I can hit "swift download blah" from the cli, with the same credentials, but not from sahara ... 18:50:53 SergeyLukjanov: which file should I check exactly? 18:51:00 SergeyLukjanov: i'm not sure, i think it was v3 but i had to bring down that stack 18:51:12 tmckay: CLI works, but Sahara does not, with the same credentials, right? 18:51:17 yes 18:51:48 tmckay: can you show OS_AUTH_URL you use for CLI? 18:52:07 let's continue discussing it after the meeting 18:52:12 oki 18:52:25 and consider it as a potential issue for rc3 18:52:25 SergeyLukjanov: right, we can do it in Sahara channel 18:52:48 in fact the v2 keystone api version is still the main one 18:53:07 and there is still no good deprecation plan for it 18:53:18 #topic Open discussion 18:53:19 didn't they switch it already? Someone was even talking about killing the tests in Juno timeframe 18:53:56 tosky, they was stopped by tc due to the attempt to depricate v2 api while not all other projects support v3 :) 18:54:13 6 mins left to discuss something 18:54:41 has anyone filed a bp for massaging itests to look like tempest tests? 18:54:47 i have been using sahara-image-elements to create centos+hdp2 images, i continue to see ambari-server setup failuers during boot. 18:55:12 elmiko, no hwx folks around, doh 18:55:21 oh well, i tried 18:55:28 Hey people, I wanted to remind you that we have https://ask.openstack.org/en/questions/ 18:55:36 elmiko: I'm afraid no one except HW knows how HDP 2 plugin works 18:55:44 ylobankov_, please, ensure "[22:54:42] has anyone filed a bp for massaging itests to look like tempest tests?" is done 18:55:48 where people post questions for Sahara (and Savanna) from time to time 18:56:07 aignatov: i've made it work, but only if i give the instance access to the internet. i'll ask them next time we talk. 18:56:08 ylobankov_, please subscribe me when it's done 18:56:58 ok 18:57:21 elmiko: if installer does not find JRE or HDP packages on the image, it starts downloading them from Hortonworks mirror 18:57:34 mattf: ok 18:57:57 dmitryme: yea, according to the hwx guys they were installed 18:58:26 elmiko: maybe different version of packages? 18:59:46 let's move to the #openstack-sahara channel 18:59:53 thanks folks! 18:59:56 thanks all! 18:59:59 #endmeeting