Page MenuHomePhabricator

Cloud-ServicesUmbrella
ActivePublic

Details

Description

Cloud-Services is the umbrella project for tasks related to the products managed by the Wikimedia Cloud Services team. A general overview of services offered can be found at wikitech:Help:Cloud Services Introduction

The team itself has a separate project at cloud-services-team which includes among other things a Kanban board (cloud-services-team (Kanban)). That board is for the team themselves to manage, but can be referred to by people interested in the team's active projects.

Recent Activity

Mon, Apr 18

bd808 changed the subtype of T306391: Allow Toolforge scheduled jobs to have a maximum runtime from "Task" to "Feature Request".
Mon, Apr 18, 8:52 PM · Cloud-Services, Kubernetes
bd808 added a parent task for T306391: Allow Toolforge scheduled jobs to have a maximum runtime: T285944: Toolforge: beta phase for the new jobs framework.
Mon, Apr 18, 8:52 PM · Cloud-Services, Kubernetes
MusikAnimal created T306391: Allow Toolforge scheduled jobs to have a maximum runtime.
Mon, Apr 18, 8:47 PM · Cloud-Services, Kubernetes

Sat, Apr 9

JJMC89 added a parent task for T305780: toolforge-jobs – wikihistory needs a container with both php7 and mono: T285944: Toolforge: beta phase for the new jobs framework.
Sat, Apr 9, 4:00 PM · Toolforge
Wurgl created T305780: toolforge-jobs – wikihistory needs a container with both php7 and mono.
Sat, Apr 9, 2:06 PM · Toolforge

Thu, Apr 7

aborrero added a subtask for T165531: rack/setup/install labvirt101[5-8]: T305631: cloudvirt1016: sudden reboot.
Thu, Apr 7, 12:46 PM · cloud-services-team (Kanban), Patch-For-Review, ops-eqiad, Cloud-Services, SRE

Wed, Apr 6

bd808 added a parent task for T305592: Underscore in job name gives non-helpful error in toolforge-jobs: T285944: Toolforge: beta phase for the new jobs framework.
Wed, Apr 6, 9:57 PM · cloud-services-team (Kanban), Toolforge
Joutbis created T305592: Underscore in job name gives non-helpful error in toolforge-jobs.
Wed, Apr 6, 9:40 PM · cloud-services-team (Kanban), Toolforge

Sat, Apr 2

Umherirrender removed a project from T103552: Support connections from bastion to other hosts: Patch-For-Review.
Sat, Apr 2, 8:33 PM · Cloud-Services

Wed, Mar 30

dcaro moved T127717: Move Cloud VPS auth.logs to central logging from Inbox to Watching on the cloud-services-team (Kanban) board.
Wed, Mar 30, 3:15 PM · User-dcaro, Sustainability (Incident Followup), cloud-services-team (Kanban), Cloud-Services
dcaro moved T127717: Move Cloud VPS auth.logs to central logging from Needs discussion to Inbox on the cloud-services-team (Kanban) board.
Wed, Mar 30, 3:15 PM · User-dcaro, Sustainability (Incident Followup), cloud-services-team (Kanban), Cloud-Services
Ladsgroup placed T305064: Make linktarget table visible on cloud wiki replicas up for grabs.
Wed, Mar 30, 1:51 PM · Data-Engineering, Data-Services, User-Ladsgroup
Ladsgroup created T305064: Make linktarget table visible on cloud wiki replicas.
Wed, Mar 30, 1:51 PM · Data-Engineering, Data-Services, User-Ladsgroup

Mon, Mar 28

bd808 added a comment to T304869: StrikerBot doing deprecated action=oathvalidate&totp= queries.

Seems this may come from 2 different User-Agents:

Mon, Mar 28, 4:52 PM · Striker, Horizon, cloud-services-team (Kanban)
gerritbot added a project to T304869: StrikerBot doing deprecated action=oathvalidate&totp= queries: Patch-For-Review.
Mon, Mar 28, 4:44 PM · Striker, Horizon, cloud-services-team (Kanban)
gerritbot added a comment to T304869: StrikerBot doing deprecated action=oathvalidate&totp= queries.

Change 774401 had a related patch set uploaded (by Reedy; author: Reedy):

[operations/puppet@production] Keystone: Update deprecated action=oathvalidate calls

https://gerrit.wikimedia.org/r/774401

Mon, Mar 28, 4:44 PM · Striker, Horizon, cloud-services-team (Kanban)
Reedy edited projects for T304869: StrikerBot doing deprecated action=oathvalidate&totp= queries, added: Cloud-Services; removed Horizon, Striker.
Mon, Mar 28, 4:40 PM · Striker, Horizon, cloud-services-team (Kanban)
elukey created T304872: Volume stuck for ml-sandbox.machine-learning.eqiad1.wikimedia.cloud.
Mon, Mar 28, 4:36 PM · User-dcaro, Cloud-VPS, Machine-Learning-Team
dcaro moved T304096: move cloudcephmon1003.eqiad.wmnet from rack B2 to rack C8 from To refine to Refined on the User-dcaro board.
Mon, Mar 28, 9:59 AM · User-dcaro, Cloud-Services, DC-Ops
ArielGlenn added a comment to T57503: Mirror more Kiwix downloads directories.

@ArielGlenn Sorry, I meant T286588

Mon, Mar 28, 9:06 AM · Cloud-Services, affects-Kiwix-and-openZIM, SRE, Datasets-General-or-Unknown
Kelson added a comment to T57503: Mirror more Kiwix downloads directories.

@ArielGlenn Sorry, I meant T286588

Mon, Mar 28, 9:02 AM · Cloud-Services, affects-Kiwix-and-openZIM, SRE, Datasets-General-or-Unknown
ArielGlenn added a comment to T57503: Mirror more Kiwix downloads directories.

@ArielGlenn Hi, I already come back to you! T299993 is hidden to me and I have no visibility on it. Is that already implemented. If "no", in which timeline are we moving in?

Mon, Mar 28, 9:00 AM · Cloud-Services, affects-Kiwix-and-openZIM, SRE, Datasets-General-or-Unknown
Kelson added a comment to T57503: Mirror more Kiwix downloads directories.

@ArielGlenn Hi, I already come back to you! T299993 is hidden to me and I have no visibility on it. Is that already implemented. If "no", in which timeline are we moving in?

Mon, Mar 28, 8:55 AM · Cloud-Services, affects-Kiwix-and-openZIM, SRE, Datasets-General-or-Unknown

Mar 22 2022

valerio.bozzolan created T304416: Route problems from some gateways of Italy to WMCloud and Toolforge.
Mar 22 2022, 11:11 AM · SRE, Cloud-VPS, netops, Infrastructure-Foundations

Mar 18 2022

dcaro added a comment to T127717: Move Cloud VPS auth.logs to central logging.

Hi @Southparkfan, so if I understand it correctly, the next step would be to figure out:

Mar 18 2022, 10:32 AM · User-dcaro, Sustainability (Incident Followup), cloud-services-team (Kanban), Cloud-Services

Mar 17 2022

dcaro updated subscribers of T304096: move cloudcephmon1003.eqiad.wmnet from rack B2 to rack C8.
Mar 17 2022, 5:51 PM · User-dcaro, Cloud-Services, DC-Ops
dcaro added a project to T304096: move cloudcephmon1003.eqiad.wmnet from rack B2 to rack C8: User-dcaro.
Mar 17 2022, 5:45 PM · User-dcaro, Cloud-Services, DC-Ops
dcaro added a project to T304096: move cloudcephmon1003.eqiad.wmnet from rack B2 to rack C8: Cloud-Services.
Mar 17 2022, 5:45 PM · User-dcaro, Cloud-Services, DC-Ops

Mar 16 2022

Epantaleo added a comment to T303922: etytree tool fault message.

Hello Andrew, I don't think I can remove or recreate affected files. However, any earlier backup would work as I haven't changed any of the files since first installation. Does this help?
Thanks a lot

Mar 16 2022, 5:07 PM · Cloud-VPS, VPS-Projects, Cloud-Services-Origin-User, Cloud-Services-Worktype-Unplanned
Andrew added a comment to T303922: etytree tool fault message.

I suspect that this VM was damaged during a migration between hypervisors (part of bug T281276). This has happened to a very small number of VMs and I'm unclear on the cause.

Mar 16 2022, 2:35 PM · Cloud-VPS, VPS-Projects, Cloud-Services-Origin-User, Cloud-Services-Worktype-Unplanned
Aklapper added a project to T303922: etytree tool fault message: Cloud-Services.

Assuming this is about Cloud-Services

Mar 16 2022, 10:11 AM · Cloud-VPS, VPS-Projects, Cloud-Services-Origin-User, Cloud-Services-Worktype-Unplanned

Mar 4 2022

nskaggs added a parent task for T303058: hw troubleshooting: move cloudcephmon1003.eqiad.wmnet from rack B2 to rack C8: T297083: [ceph] Getting rack level HA.
Mar 4 2022, 3:33 PM · SRE, cloud-services-team (Hardware), ops-eqiad, DC-Ops
nskaggs created T303058: hw troubleshooting: move cloudcephmon1003.eqiad.wmnet from rack B2 to rack C8.
Mar 4 2022, 3:32 PM · SRE, cloud-services-team (Hardware), ops-eqiad, DC-Ops

Feb 22 2022

dpifke created T302323: Errors listing or launching instances in WMCS horizon.
Feb 22 2022, 5:34 PM · Cloud-Services-Origin-User, Cloud-Services-Worktype-Unplanned, User-dcaro, cloud-services-team (Kanban), Horizon

Feb 16 2022

dcaro added a project to T127717: Move Cloud VPS auth.logs to central logging: User-dcaro.
Feb 16 2022, 4:47 PM · User-dcaro, Sustainability (Incident Followup), cloud-services-team (Kanban), Cloud-Services
Aklapper removed a project from T129936: Ensure that Terms of Use document restrictions on third-party web interactions: Community-Tech-Tool-Labs.
Feb 16 2022, 1:08 PM · WMF-Legal, Cloud-Services
aborrero closed T237971: Cron <www-data@cloudweb2001-dev> /usr/local/bin/mwscript extensions/TorBlock/maintenance/loadExitNodes.php --wiki=labswiki --force > /dev/null as Resolved.
Feb 16 2022, 12:23 PM · cloud-services-team (Kanban), Cloud-Services

Feb 14 2022

Majavah moved T127717: Move Cloud VPS auth.logs to central logging from Inbox to Needs discussion on the cloud-services-team (Kanban) board.
Feb 14 2022, 7:00 PM · User-dcaro, Sustainability (Incident Followup), cloud-services-team (Kanban), Cloud-Services

Feb 13 2022

Southparkfan added a comment to T127717: Move Cloud VPS auth.logs to central logging.

Now that the patch above has been merged, we can start thinking about applying the syslog client configuration by default on Cloud VPS instances. The central syslog server should be in the cloudinfra project. There are a few challenges to tackle, though:

Feb 13 2022, 8:38 PM · User-dcaro, Sustainability (Incident Followup), cloud-services-team (Kanban), Cloud-Services

Feb 11 2022

Dzahn added a comment to T165885: Create a cron to clean clientbucket every day or hour.

@Paladox Since it's now possible to opt-in to this and then get a timer (see T165885#6585808 if you still want to try it), is this ticket resolved for you?

Feb 11 2022, 7:22 PM · Patch-For-Review, Infrastructure-Foundations, cloud-services-team (Kanban), Cloud-Services, SRE, Puppet
Dzahn added a parent task for T165885: Create a cron to clean clientbucket every day or hour: T273673: replace all puppet crons with systemd timers.
Feb 11 2022, 7:19 PM · Patch-For-Review, Infrastructure-Foundations, cloud-services-team (Kanban), Cloud-Services, SRE, Puppet

Feb 10 2022

Majavah closed T152866: Make clush safer as Declined.

Clush has been replaced with Cumin.

Feb 10 2022, 9:59 AM · Cloud-Services, Toolforge

Feb 4 2022

Majavah closed T60865: [Epic] Toolserver.org tools that have not been migrated, a subtask of T60788: Toolserver migration to Tools (tracking), as Resolved.
Feb 4 2022, 7:59 PM · User-bd808, Cloud-Services, Tracking-Neverending, Toolforge

Jan 27 2022

ArielGlenn added a comment to T57503: Mirror more Kiwix downloads directories.

I don't know details myself but the relevant task is T286588

Jan 27 2022, 10:16 AM · Cloud-Services, affects-Kiwix-and-openZIM, SRE, Datasets-General-or-Unknown
Kelson added a comment to T57503: Mirror more Kiwix downloads directories.

@ArielGlenn Thank you for putting WMCS in the loop. In which timeline this refresh should happen? I guess nothing will be done as far as this is not done.

Jan 27 2022, 10:07 AM · Cloud-Services, affects-Kiwix-and-openZIM, SRE, Datasets-General-or-Unknown

Jan 25 2022

Kelson added a comment to T57503: Mirror more Kiwix downloads directories.

@ArielGlenn Thank you for your feedback. I have created an other task here https://phabricator.wikimedia.org/T299993

Jan 25 2022, 7:12 AM · Cloud-Services, affects-Kiwix-and-openZIM, SRE, Datasets-General-or-Unknown
Kelson moved T57503: Mirror more Kiwix downloads directories from TRIAGE to TOP on the affects-Kiwix-and-openZIM board.
Jan 25 2022, 7:11 AM · Cloud-Services, affects-Kiwix-and-openZIM, SRE, Datasets-General-or-Unknown

Jan 20 2022

Maintenance_bot removed a project from T127717: Move Cloud VPS auth.logs to central logging: Patch-For-Review.
Jan 20 2022, 7:11 PM · User-dcaro, Sustainability (Incident Followup), cloud-services-team (Kanban), Cloud-Services
gerritbot added a comment to T127717: Move Cloud VPS auth.logs to central logging.

Change 682259 merged by Andrew Bogott:

[operations/puppet@production] Add WMCS specific cloud role for syslog server

https://gerrit.wikimedia.org/r/682259

Jan 20 2022, 6:36 PM · User-dcaro, Sustainability (Incident Followup), cloud-services-team (Kanban), Cloud-Services

Jan 19 2022

jbond edited projects for T299519: systemd job "Sync keys for Keystone fernet tokens to ${thishost}" potentially broken, added: Cloud-Services; removed SRE.
Jan 19 2022, 2:25 PM · cloud-services-team (Kanban), Cloud-VPS