Wikimedia Release Engineering Team/Checkin archive/20180108
Appearance
2018-01-08
[edit]Vacations/Important dates
[edit]- Jan 15 (Mon): Martin Luther King Day (All US Staff)
- Jan 22/23: Dev Summit
- Jan 24: Tech Management F2F
- Jan 25/26: WMF All Hands
- Jan 29-31: Team offsite
- Feb 19 (Mon): President's Day (All US Staff)
- Mar 30 (Fri): WMF Holiday
Team Business
[edit]Rotating positions and absences
[edit]Maniphest query for deployment blocker tasks: https://phabricator.wikimedia.org/maniphest/?project=PHID-PROJ-fmcvjrkfvvzz3gxavs3a&statuses=open%28%29&group=none&order=newest#R
Jan 1 and Jan 8
[edit]Jan 15 and Jan 22
[edit]- Train: Tyler
- wmf.17
- No train: Mukunda
- SoS:
- Out
- Jan 15 (Mon): Martin Luther King Day (All US Staff)
Actions from last meeting
[edit]Scrum of Scrums
[edit]- Greg to copy to etherpad after meeting: https://etherpad.wikimedia.org/p/Scrum-of-Scrums
This week
[edit]Release Engineering
[edit]- Blocking
- Blocked
- Updates
Last week
[edit]- Blocking
- Blocked
- Teams that did not port Selenium tests from Ruby to Node.js (T139740): No team (1), Editing (1), Fundraising Tech (1), Global Collaboration (4), Multimedia (1), Reading Web (3), WMDE Engineering (5)
- Updates
- REMINDER: No deploys this week and next (18th and 25th). Take a break! [wiki][email]
- We’re (Chad) going to spend some time this week cleaning up log-spam.
- Our Q3 goals posted
- Completed the security upgrade of RuboCop and Rubyzip
- Our Q2 “SSD” (Program 6) goals are now all complete [wiki][email]
- Antoine is migrating the CI docker image build process to using docker-pkg (from Giuseppe)
- Follow-up continues on the ORES post-mortem follow-ups
- REMINDER: No deploys this week and next (18th and 25th). Take a break! [wiki][email]
Puppet SWAT
[edit]- list of patches you want to submit to Puppet SWAT
Logspam \ Last week's train updates
[edit]* Almost all logspam eliminated: https://i.imgur.com/1iOT9S1.png
Other Team Business
[edit]
- Create a -releng-feed channel and move bots there?
- https://phabricator.wikimedia.org/T181582
- already just deleted the selenium irc notifications, no one was watching them
Q3 goal/project check-in
[edit]- All of it in table form: https://www.mediawiki.org/wiki/Wikimedia_Release_Engineering_Team/Goals/201718Q3
Quarterly Goals
[edit]Program 1: Outcome 5: Milestone 1: Develop and migrate to a JavaScript-based browser testing stack
[edit]- Due: End of this quarter
- What: Specific improvements to the now canonical framework, see: task T182421, notably:
- Upgrade webdriverIO to version 4.9
- Investigate replacing nodemw with mwbot
- Video recording for Selenium tests in Node.js
- Task: task T182421
* no major updates this past week: Z's been sick over the holdiays, sadly
Program 1: Outcome 5: Objective 1: Maintain existing shared Continuous Integration infrastructure
[edit]- Goals
- Draft requirements for a Kubernetes based solution for CI - task T183513
- Migrate MediaWiki PHPUnit tests to Shipyard (docker-based CI) (~40% of Nodepool usage) - task T183512
- most npm jobs are migrated. Gotta handle the /deploy repositories that have a node_modules commited in
- PHPUnit not started yet
- Unify production and CI docker image build process - task T177276
- npm from the Debian package (1.4.x) does not support running behind a proxy task T183569. Prevents building npm based Docker container on contint1001.
Program 3: Outcome 1: Objective 2: Identify and find stewards for high-priority/high use code segment orphans
[edit]- Due: End of quarter
- task T174091
Nothing new to report
Program 3: Outcome 2: Objective 2: Define and implement a process to regularly address technical debt across the Foundation
[edit]- Due: End of quarter
- task T174095
Nothing new to report
Program 3: Outcome 2: Objective 3: Promote and surface important technical debt topics at large gatherings of Wikimedia developers (e.g., DevSummit and Hackathon(s))
[edit]- Due: End of next quarter
- task T174096
Investigating what can be done during this coming dev summit/all-hands
Program 6: Outcome 2: Objective 2: Set up a continuous integration and deployment pipeline
[edit]- Due: End of this quarter
- Keyword: SSD
- phab project: https://phabricator.wikimedia.org/project/view/2453/
- Goal:
- Verify basic functionality of 'production' deployment and image (initially targeting mathoid):
- Functional PoC within integration in the deployment-pipeline
- Deploy to isolated k8s
- Verify basic functionality of 'production' deployment and image (initially targeting mathoid):
- had a sad meeting last week, just lack of attendance
- dan filed tasks
- I (tyler) filed tasks this morning
- Child tasks of https://phabricator.wikimedia.org/T183165
Quaterly non-goal "Work"
[edit]Program 1: Outcome 1: Objective 1: Scap (Tech Debt Sprint FY201718-Q2)
[edit]- Need to get the scap CI situation figured out because of repeated breakage of scap in beta. Two competing needs:
- Need a place to test scap changes
- Other teams use scap in beta to test their own deployments
Program 1: Outcome 5: Objective 1: Maintain existing shared Continuous Integration infrastructure
[edit]Program 1: Outcome 6: Milestone 1: Maintain Gerrit
[edit]- We need to find out when we're doing the zuul/gerrit upgrade
- Deb package needs to be rebuild. It is in bad shape https://phabricator.wikimedia.org/T158243
Program 1: Outcome 6: Milestone 2: Maintain Phabricator
[edit]- Streamline logspam workflows by adding some integration with phabricator
- Store git-lfs (and other phab uploads) in swift: task T182085
Program 1: Outcome 5: Objective 1: MW Nightlies server
[edit]- We're building non-security nightlies! (not quite stable enough to announce to public yet)
- \o/
- Working with security team to figure out best way to manage patches with minimal overhead
- any changes to weekly branch cut process coming soon?
Other work
[edit]- Working on a Hive query to find out how many unique users are hitting WikipediaMobileFirefoxOS in a month
select count(distinct ip) from webrequest where year = 2017 and month = 12 and uri_host = 'meta.wikimedia.org' and uri_path = '/WikipediaMobileFirefoxOS/index.html'; ^^^ Seems like it over-counts, given how frequently mobile IPs can change ^^^
- More apache docroot cleanup
Grooming
[edit]Team Kanban Board Review and Triage
[edit]- closed and touched in the 7 days
- No update for 4 weeks
- No update for 3 weeks
- No update for 2 weeks
- No update for 1 week
- All Open
- Review To Triage column of #releng
Once / month-ish review of backlog(s)
[edit]- releng Review To Triage column of #releng
- releng-kanban Review unassigned in kanban
- releng-kanban Review 'backlog' colum of -kanban
- releng-next - Review for things we need to put on our kanban backlog
- releng-backlog - oh my, the huge backlog of things...