Phabricator Link
|
Wiki Link
|
Status
|
Priority
|
Author
|
Assignee
|
Projects
|
Subtasks
|
Parent Tasks
|
T105780
|
T105780: Create a doc explaining the SLA between services and the monitoring tool
|
open
|
Medium (orange)
|
mobrovac (Marko Obrovac)
|
|
|
|
|
T106346
|
T106346: setup an alertable threshold for Cassandra heap dumps
|
open
|
Medium (orange)
|
Eevans (Eric Evans)
|
|
|
|
|
T107108
|
T107108: Flow notification links on mobile point to desktop
|
open
|
Medium (orange)
|
Jdlrobson (Jon Robson)
|
|
|
|
|
T109950
|
T109950: Repair "svn.wikimedia.org/doc/" redirect for doc.wikimedia.org
|
open
|
Low (yellow)
|
Southparkfan (Ferran Tufan)
|
Dereckson (Sébastien Santoro)
|
|
|
|
T110240
|
T110240: [Discussion] Consider validating JSON schemas when running x-ample tests?
|
open
|
Medium (orange)
|
GWicke (Gabriel Wicke)
|
|
|
|
|
T119612
|
T119612: Consider a serialization that supports random access for storage in the DB for Wikidata
|
open
|
Lowest (sky)
|
JanZerebecki (Jan Zerebecki)
|
|
|
|
|
T120085
|
T120085: RFC: Serve Main Page of Wikimedia wikis from a consistent URL
|
open
|
Medium (orange)
|
Krinkle (Timo Tijhof)
|
|
|
|
|
T121240
|
T121240: Network isolation for production and semi-production services
|
open
|
Medium (orange)
|
GWicke (Gabriel Wicke)
|
|
|
|
|
T122867
|
T122867: Evaluate the feasibility of cache invalidation for the action API
|
open
|
Medium (orange)
|
Tgr (Gergő Tisza)
|
|
|
|
|
T125085
|
T125085: Split the API MediaWiki appserver pool into two external/internal pools
|
open
|
Medium (orange)
|
faidon (Faidon Liambotis)
|
|
|
|
|
T129682
|
T129682: Look into solutions for replaying traffic to testing environment(s)
|
open
|
Medium (orange)
|
GWicke (Gabriel Wicke)
|
|
|
|
|
T133001
|
T133001: Decom legacy ex-parsoidcache cxserver, citoid, and restbase service hostnames
|
open
|
Medium (orange)
|
BBlack (Brandon Black)
|
|
|
|
|
T133178
|
T133178: RESTBase support for www.wikimedia.org missing
|
open
|
Medium (orange)
|
Krinkle (Timo Tijhof)
|
|
|
|
|
T133523
|
T133523: Decide how to improve parsercache replication, sharding and HA
|
open
|
Medium (orange)
|
jcrespo (Jaime Crespo)
|
Marostegui (Manuel Aróstegui)
|
|
|
|
T134551
|
T134551: Create functional cluster checks for all services (and have them page!)
|
open
|
Medium (orange)
|
Joe (Giuseppe Lavagetto)
|
|
|
|
|
T134809
|
T134809: Apache <=> mariadb SSL/TLS for cross-datacenter writes
|
stalled
|
Medium (orange)
|
aaron (Aaron Schulz)
|
|
|
|
|
T136603
|
T136603: Update limit.sh to support systemd-based cgroup management
|
open
|
Medium (orange)
|
Joe (Giuseppe Lavagetto)
|
|
|
|
|
T138093
|
T138093: Investigate query parameter normalization for MW/services
|
open
|
Medium (orange)
|
BBlack (Brandon Black)
|
|
|
|
|
T140813
|
T140813: Protect sensitive user-related information with a UserData / auth / session service
|
open
|
Medium (orange)
|
GWicke (Gabriel Wicke)
|
|
|
|
|
T141897
|
T141897: Review new service 'pre-deployment to production' checklist
|
open
|
Medium (orange)
|
greg (Greg Grossmeier)
|
|
|
|
|
T149847
|
T149847: RFC: Use content hash based image / thumb URLs
|
open
|
Medium (orange)
|
GWicke (Gabriel Wicke)
|
|
|
|
|
T150673
|
T150673: Thumb API: Varnish / CDN questions
|
open
|
Medium (orange)
|
GWicke (Gabriel Wicke)
|
|
|
|
|
T162818
|
T162818: nodejs / restbase services (mobileapps, aqs, recommendation-api, etc?) fail persistently after short windows of DNS unavailability
|
open
|
High (red)
|
BBlack (Brandon Black)
|
|
|
|
|
T167972
|
T167972: Respect host header in RESTBase, and redirect /rest_v1 to /rest_v1/
|
open
|
Medium (orange)
|
GWicke (Gabriel Wicke)
|
|
|
|
|
T176335
|
T176335: logs sent to logstash are lost when the elasticsearch cirrus cluster is unavailable
|
open
|
Medium (orange)
|
Gehel (Guillaume Lederrey)
|
|
|
|
|
T178839
|
T178839: New upstream jvm-tools
|
open
|
Medium (orange)
|
Eevans (Eric Evans)
|
|
|
|
|
T180051
|
T180051: Reduce the number of fields declared in elasticsearch by logstash
|
open
|
Low (yellow)
|
Gehel (Guillaume Lederrey)
|
|
|
|
|
T184534
|
T184534: Cached page previews not shown when refreshed
|
open
|
High (red)
|
Niedzielski (Stephen Niedzielski)
|
|
|
|
|
T185216
|
T185216: Handle HBA controllers in get-raid-status-hpssacli
|
open
|
Medium (orange)
|
fgiunchedi (Filippo Giunchedi)
|
|
|
|
|
T193664
|
T193664: Knock down puppet 4 deprecation warnings
|
open
|
Medium (orange)
|
herron (Keith Herron)
|
|
|
|
|
T193766
|
T193766: Ship host syslogs to ELK
|
open
|
Medium (orange)
|
herron (Keith Herron)
|
|
|
|
|
T197173
|
T197173: Ship MX logs to ELK
|
open
|
Medium (orange)
|
herron (Keith Herron)
|
|
|
|
|
T198787
|
T198787: Revisit default settings for c-foreach-restart
|
open
|
Medium (orange)
|
MoritzMuehlenhoff (Moritz Mühlenhoff)
|
hnowlan (Hugh Nowlan)
|
|
|
|
T198901
|
T198901: Migrate production services to kubernetes using the pipeline
|
open
|
Medium (orange)
|
thcipriani (Tyler Cipriani)
|
|
|
|
|
T199008
|
T199008: sql enwik gives a poor error message when db doesn't exist
|
open
|
Low (yellow)
|
Reedy (Sam Reed)
|
|
|
|
|
T199431
|
T199431: Consider the possibility of separating ChangeProp and JobQueue on Kafka level
|
open
|
Medium (orange)
|
Pchelolo
|
|
|
|
|
T201409
|
T201409: Harmonise the identification of requests across our stack
|
open
|
Medium (orange)
|
mobrovac (Marko Obrovac)
|
|
|
|
|
T204024
|
T204024: Store WikibaseQualityConstraint check data in persistent storage instead of in the cache
|
stalled
|
Medium (orange)
|
Addshore (Adam_WMDE)
|
|
|
|
|
T204032
|
T204032: Support meta tag refresh redirects in citoid to support elsevier's linking hub
|
open
|
Medium (orange)
|
Mvolz (Marielle Volz)
|
|
|
|
|
T205361
|
T205361: Make an HTML dump of the output of the CodeReview extension on MediaWiki.org
|
open
|
High (red)
|
Jdforrester-WMF (James D. Forrester)
|
Legoktm (Legoktm)
|
|
|
|
T210704
|
T210704: Migrate node-based services in production to node10
|
open
|
Medium (orange)
|
MoritzMuehlenhoff (Moritz Mühlenhoff)
|
|
|
|
|
T212129
|
T212129: Move MainStash out of Redis to a simpler multi-dc aware solution
|
open
|
Medium (orange)
|
Joe (Giuseppe Lavagetto)
|
aaron (Aaron Schulz)
|
|
|
|
T214054
|
T214054: Stress test ORES/kubernetes (above 4.5k scores/second)
|
stalled
|
Low (yellow)
|
Halfak (Aaron Halfaker, EpochFail, halfak)
|
|
|
|
|
T218217
|
T218217: Make services swagger specs standard compliant
|
open
|
Medium (orange)
|
Pchelolo
|
|
|
|
|
T221976
|
T221976: Have Varnish set the `X-Request-Id` header for incoming external requests
|
open
|
Medium (orange)
|
mobrovac (Marko Obrovac)
|
|
|
|
|
T224041
|
T224041: Kask functional testing with Cassandra via the Deployment Pipeline
|
open
|
Medium (orange)
|
thcipriani (Tyler Cipriani)
|
|
|
|
|
T226840
|
T226840: Consistent HTTP 503 Error on some urls for some logged-in users (CentralAuth Set-Cookie storm)
|
open
|
Medium (orange)
|
Krinkle (Timo Tijhof)
|
|
|
|
|
T230245
|
T230245: Make SwiftFileBackend::doStoreInternal defer the opening of file handles to stay in the concurrency limit
|
open
|
Medium (orange)
|
Dzahn (Daniel Zahn)
|
aaron (Aaron Schulz)
|
|
|
|
T230861
|
T230861: PHP 7.2 is very slow on an allocation-intensive benchmark
|
open
|
Medium (orange)
|
tstarling (Tim Starling)
|
|
|
|
|
T233047
|
T233047: Apache mod_status aggregator
|
open
|
Medium (orange)
|
ori (Ori Livneh)
|
|
|
|
|
T236048
|
T236048: Move debugging symbols and tools to a new class
|
open
|
Medium (orange)
|
jijiki (effie mouzeli)
|
jijiki (effie mouzeli)
|
|
|
|
T238285
|
T238285: Pages whose title ends with semicolon (;) are intermittently inaccessible (likely due to ATS)
|
open
|
Medium (orange)
|
DannyS712
|
|
|
|
|
T238803
|
T238803: Retire fixcopyright.wikimedia.org
|
open
|
Medium (orange)
|
CCicalese_WMF (Cindy Cicalese)
|
CCicalese_WMF (Cindy Cicalese)
|
|
|
|
T239713
|
T239713: Citoid is logging all request / response headers as separate fields
|
open
|
Medium (orange)
|
fgiunchedi (Filippo Giunchedi)
|
|
|
|
|
T240685
|
T240685: MediaWiki Prometheus support
|
open
|
High (red)
|
colewhite (cwhite)
|
colewhite (cwhite)
|
|
|
|
T242855
|
T242855: Undeploy graphoid
|
open
|
Medium (orange)
|
akosiaris (Alexandros Kosiaris)
|
|
|
|
|
T245374
|
T245374: WMFTimeoutException on non-existent files
|
open
|
Medium (orange)
|
TheSandDoctor
|
|
|
|
|
T250205
|
T250205: Reduce rate of purges emitted by MediaWiki
|
open
|
Medium (orange)
|
CDanis (Chris Danis)
|
|
|
|
|
T250231
|
T250231: purgeParserCache.php: Cannot purge this kind of parser cache
|
open
|
Medium (orange)
|
RLazarus (Reuven Lazarus)
|
|
|
|
|
T256641
|
T256641: Delay spinner showing for graphs for 1s
|
open
|
Medium (orange)
|
Jseddon (n/a)
|
|
|
|
|
T260330
|
T260330: RFC: PHP microservice for containerized shell execution
|
open
|
Medium (orange)
|
tstarling (Tim Starling)
|
tstarling (Tim Starling)
|
|
|
|
T260943
|
T260943: Don't set cookies for api.wikimedia.org at the caching layer
|
open
|
Medium (orange)
|
eprodromou (Evan Prodromou)
|
|
|
|
|
T262691
|
T262691: High numbers of HTTP 429 errors
|
open
|
High (red)
|
LGoto (Lani Goto (WMF))
|
|
|
|
|
T263220
|
T263220: Limit concurrency of DPL queries
|
open
|
High (red)
|
Urbanecm
|
|
|
|
|
T263437
|
T263437: Allow easier ICU transitions in MediaWiki (change how sortkey collation is managed in the categorylinks table)
|
open
|
High (red)
|
Joe (Giuseppe Lavagetto)
|
tstarling (Tim Starling)
|
|
|
|
T263545
|
T263545: Decide on logging in k8s for ShellBox
|
open
|
Medium (orange)
|
Pchelolo
|
|
|
|
|
T263727
|
T263727: Separate mediawiki latency metrics by endpoint
|
open
|
High (red)
|
Joe (Giuseppe Lavagetto)
|
|
|
|
|
T267581
|
T267581: Phasing out "redis_sessions" MediaWiki cluster and away from the memcached cluster
|
open
|
Medium (orange)
|
jijiki (effie mouzeli)
|
|
|
|
|
T268427
|
T268427: Make Shellbox actually do streaming
|
open
|
Low (yellow)
|
tstarling (Tim Starling)
|
tstarling (Tim Starling)
|
|
|
|
T271822
|
T271822: Add support for scraping php applications to the kubernetes prometheus scraper
|
open
|
Medium (orange)
|
Joe (Giuseppe Lavagetto)
|
|
|
|
|
T273334
|
T273334: Re-imaged mw app servers can end up with missing l10n cache for old versions of MW needed for rollback
|
open
|
High (red)
|
Reedy (Sam Reed)
|
|
|
|
|
T273738
|
T273738: Get platform engineering team green light for Cloud NAT to wikis change
|
open
|
Medium (orange)
|
aborrero (arturo)
|
|
|
|
|
T277183
|
T277183: Phase out nutcracker for connecting to redis
|
open
|
Medium (orange)
|
Joe (Giuseppe Lavagetto)
|
Joe (Giuseppe Lavagetto)
|
|
|
|
T278516
|
T278516: Decide on future of public services@ mailing list (which has no maintainers)
|
open
|
Low (yellow)
|
Aklapper (Andre Klapper (WMF))
|
|
|
|
|
T279146
|
T279146: Remove mediawiki Request loops from production
|
open
|
Medium (orange)
|
jijiki (effie mouzeli)
|
|
|
|
|
T279664
|
T279664: Decide on details of progressive Multi-DC roll out
|
open
|
Medium (orange)
|
Krinkle (Timo Tijhof)
|
Krinkle (Timo Tijhof)
|
|
|
|
T280586
|
T280586: Move "redis_sessions" to "redis_misc" cluster
|
open
|
Medium (orange)
|
jijiki (effie mouzeli)
|
|
|
|
|
T280628
|
T280628: Securely connect Wikimedia Enterprise Infrastructure with WMF Kafka Streams
|
open
|
Needs Triage (violet)
|
AnnaMikla
|
RBrounley_WMF (Ryan)
|
|
|
|
T282887
|
T282887: Avoid accepting Kafka messages with whacky timestamps
|
open
|
Medium (orange)
|
Ottomata (Andrew Otto)
|
|
|
|
|
T284981
|
T284981: SELECT query arriving to wikidatawiki db codfw hosts causing pile ups during schema change
|
open
|
Medium (orange)
|
Marostegui (Manuel Aróstegui)
|
|
|
|
|
T293530
|
T293530: Improve slow read query handling
|
open
|
Medium (orange)
|
Legoktm (Legoktm)
|
|
|
|
|
T294372
|
T294372: Q2:(Need By: TBD) rack/setup/install restbase103[123].eqiad.wmnet
|
open
|
Medium (orange)
|
RobH (Rob Halsell)
|
Jclark-ctr (John Clark)
|
|
|
|
T298805
|
T298805: Import Debian package of Cassandra 3.11.11 as 'dev' version
|
open
|
Needs Triage (violet)
|
Eevans (Eric Evans)
|
|
|
|
|
T50835
|
T50835: Separate Cache-Control header for proxy and client
|
open
|
Medium (orange)
|
tstarling (Tim Starling)
|
|
|
|
|
T66214
|
T66214: Define an official thumb API
|
open
|
Medium (orange)
|
GWicke (Gabriel Wicke)
|
|
|
|
|
T83729
|
T83729: Fix monitoring of poolcounter service
|
open
|
Medium (orange)
|
greg (Greg Grossmeier)
|
|
|
|
|
T91820
|
T91820: Create HTTP verb and sticky cookie DC routing in VCL
|
open
|
Medium (orange)
|
aaron (Aaron Schulz)
|
|
|
|
|
T92471
|
T92471: enable authenticated access to Cassandra JMX
|
open
|
Medium (orange)
|
Eevans (Eric Evans)
|
|
|
|
|
T93049
|
T93049: Same MassMessage is being sent more than once
|
open
|
High (red)
|
Nemo_bis (Nemo)
|
|
|
|
|
T98165
|
T98165: Figure out an etcd deploy strategy that includes multi DC failure scenarios.
|
open
|
Medium (orange)
|
Joe (Giuseppe Lavagetto)
|
|
|
|
|