| Phabricator Link | Wiki Link | Status | Priority | Author | Assignee | Projects | Subtasks | Parent Tasks |
| T102015 | T102015: put new restbase servers in service | resolved | Medium (orange) | fgiunchedi (Filippo Giunchedi) | fgiunchedi (Filippo Giunchedi) | | | |
| T102557 | T102557: investigate new restbase machine disks timeouts | resolved | Medium (orange) | fgiunchedi (Filippo Giunchedi) | Cmjohnson (cmjohnson) | | | |
| T102575 | T102575: document graphite failover/backfill procedures | resolved | Medium (orange) | fgiunchedi (Filippo Giunchedi) | fgiunchedi (Filippo Giunchedi) | | | |
| T111382 | T111382: codfw 3x spares for cassandra encryption testing | resolved | Medium (orange) | fgiunchedi (Filippo Giunchedi) | RobH (Rob Halsell) | | | |
| T113733 | T113733: column family cassandra metrics size | resolved | Medium (orange) | fgiunchedi (Filippo Giunchedi) | fgiunchedi (Filippo Giunchedi) | | | |
| T113939 | T113939: assess impact of many cassandra seed nodes with multi instance | resolved | High (red) | fgiunchedi (Filippo Giunchedi) | fgiunchedi (Filippo Giunchedi) | | | |
| T125791 | T125791: swiftrepl replication pass for thumbnails eqiad -> codfw | resolved | Medium (orange) | fgiunchedi (Filippo Giunchedi) | aaron (Aaron Schulz) | | | |
| T126253 | T126253: additional graphite machines request, 1x per DC | resolved | Medium (orange) | fgiunchedi (Filippo Giunchedi) | RobH (Rob Halsell) | | | |
| T128107 | T128107: install restbase1010-restbase1015 | resolved | High (red) | fgiunchedi (Filippo Giunchedi) | Cmjohnson (cmjohnson) | | | |
| T128590 | T128590: Cassandra uses default ip address for outbound packets while bootstrapping | declined | Low (yellow) | fgiunchedi (Filippo Giunchedi) | | | | |
| T134889 | T134889: put additional graphite machines in service | resolved | High (red) | fgiunchedi (Filippo Giunchedi) | fgiunchedi (Filippo Giunchedi) | | | |
| T135385 | T135385: investigate carbon-c-relay stalls/drops towards graphite2002 | declined | High (red) | fgiunchedi (Filippo Giunchedi) | fgiunchedi (Filippo Giunchedi) | | | |
| T141524 | T141524: eventbus should send statsd in batches | declined | Medium (orange) | fgiunchedi (Filippo Giunchedi) | Ottomata (Andrew Otto) | | | |
| T141541 | T141541: Certs from cassandra-ca-manager should have the FQDN in cert's CN | progress | Low (yellow) | fgiunchedi (Filippo Giunchedi) | hnowlan (Hugh Nowlan) | | | |
| T144479 | T144479: Ensure thumbor container access is preserved by mw filebackend setzoneaccess | resolved | High (red) | fgiunchedi (Filippo Giunchedi) | Gilles (Gilles Dubuc) | | | |
| T150560 | T150560: More verbose messages from service-checker-swagger | resolved | Medium (orange) | fgiunchedi (Filippo Giunchedi) | Joe (Giuseppe Lavagetto) | | | |
| T152364 | T152364: db1047 out of disk space, eventlogging_sync spam | resolved | Needs Triage (violet) | fgiunchedi (Filippo Giunchedi) | Marostegui (Manuel Aróstegui) | | | |
| T181964 | T181964: Clean metrics for restbase erroneus legacy tables from cassandra 3 cluster | resolved | High (red) | fgiunchedi (Filippo Giunchedi) | fgiunchedi (Filippo Giunchedi) | | | |
| T185089 | T185089: Alert UNKNOWN for restbase cassandra graphite alerts | resolved | Medium (orange) | fgiunchedi (Filippo Giunchedi) | Eevans (Eric Evans) | | | |
| T185216 | T185216: Handle HBA controllers in get-raid-status-hpssacli | open | Medium (orange) | fgiunchedi (Filippo Giunchedi) | | | | |
| T203169 | T203169: Logstash hardware expansion | resolved | Medium (orange) | fgiunchedi (Filippo Giunchedi) | fgiunchedi (Filippo Giunchedi) | | | |
| T204245 | T204245: Run MediaWiki media originals active/active | open | Medium (orange) | fgiunchedi (Filippo Giunchedi) | fgiunchedi (Filippo Giunchedi) | | | |
| T205849 | T205849: Begin the implementation of Q1's Logging Infrastructure design (2018-19 Q2 Goal) | resolved | Medium (orange) | fgiunchedi (Filippo Giunchedi) | fgiunchedi (Filippo Giunchedi) | | | |
| T205850 | T205850: Procure and provision Logging pipeline hardware in multiple datacenters | resolved | Medium (orange) | fgiunchedi (Filippo Giunchedi) | herron (Keith Herron) | | | |
| T205851 | T205851: Migrate >=90% of existing Logstash traffic to the logging pipeline | resolved | Medium (orange) | fgiunchedi (Filippo Giunchedi) | fgiunchedi (Filippo Giunchedi) | | | |
| T205852 | T205852: Onboard at least 10 new non-sensitive log producers to the logging pipeline | resolved | Medium (orange) | fgiunchedi (Filippo Giunchedi) | fgiunchedi (Filippo Giunchedi) | | | |
| T205855 | T205855: Investigate approaches to ingest sensitive log producers | resolved | Medium (orange) | fgiunchedi (Filippo Giunchedi) | fgiunchedi (Filippo Giunchedi) | | | |
| T205873 | T205873: Investigate Kafka main cluster usage for logging pipeline | resolved | High (red) | fgiunchedi (Filippo Giunchedi) | fgiunchedi (Filippo Giunchedi) | | | |
| T206454 | T206454: Setup Kafka cluster, producers and consumers for logging pipeline | resolved | Medium (orange) | fgiunchedi (Filippo Giunchedi) | herron (Keith Herron) | | | |
| T206633 | T206633: Setup rsyslog to be able to produce logs to Kafka | resolved | Medium (orange) | fgiunchedi (Filippo Giunchedi) | fgiunchedi (Filippo Giunchedi) | | | |
| T208047 | T208047: Deduplicate LoadBalancer.php "Transaction spent X second(s) in writes, exceeding the limit of Y." logs | invalid | Needs Triage (violet) | fgiunchedi (Filippo Giunchedi) | | | | |
| T211124 | T211124: Move mediawiki to new logging infrastructure | resolved | Medium (orange) | fgiunchedi (Filippo Giunchedi) | fgiunchedi (Filippo Giunchedi) | | | |
| T211125 | T211125: Move service-runner to new logging infrastructure | resolved | Medium (orange) | fgiunchedi (Filippo Giunchedi) | Pchelolo | | | |
| T211765 | T211765: 504 from /api/rest_v1/page/random/summary | resolved | High (red) | fgiunchedi (Filippo Giunchedi) | Pchelolo | | | |
| T211859 | T211859: cronspam from elasticsearch-curator on stretch | resolved | Medium (orange) | fgiunchedi (Filippo Giunchedi) | herron (Keith Herron) | | | |
| T212303 | T212303: kartotherian sends javascript instead of statsd metric name | resolved | High (red) | fgiunchedi (Filippo Giunchedi) | MSantos (MSantos) | | | |
| T214166 | T214166: Improve cassandra JBOD integration post-reimage | resolved | Needs Triage (violet) | fgiunchedi (Filippo Giunchedi) | MoritzMuehlenhoff (Moritz Mühlenhoff) | | | |
| T220081 | T220081: Allow swift https access from analytics to prod | resolved | Medium (orange) | fgiunchedi (Filippo Giunchedi) | ayounsi (Arzhel Younsi) | | | |
| T220709 | T220709: Upgrade statsd_exporter to 0.9 | resolved | Medium (orange) | fgiunchedi (Filippo Giunchedi) | akosiaris (Alexandros Kosiaris) | | | |
| T222960 | T222960: Fix restbase1017's physical rack | resolved | Medium (orange) | fgiunchedi (Filippo Giunchedi) | Eevans (Eric Evans) | | | |
| T226986 | T226986: Client side error logging production launch | resolved | Needs Triage (violet) | fgiunchedi (Filippo Giunchedi) | | | | |
| T238083 | T238083: Citoid logs fields explosion | resolved | Medium (orange) | fgiunchedi (Filippo Giunchedi) | mobrovac (Marko Obrovac) | | | |
| T238344 | T238344: MediaWiki Math invalid JSON in logs on Restbase server error | resolved | Medium (orange) | fgiunchedi (Filippo Giunchedi) | mobrovac (Marko Obrovac) | | | |
| T239090 | T239090: Restbase logging indexing conflict on 'res' and 'body' logging fields | resolved | Medium (orange) | fgiunchedi (Filippo Giunchedi) | Pchelolo | | | |
| T239713 | T239713: Citoid is logging all request / response headers as separate fields | open | Medium (orange) | fgiunchedi (Filippo Giunchedi) | | | | |
| T269676 | T269676: Mediawiki logging indexing conflict on 'status' for 'authevents' | resolved | Medium (orange) | fgiunchedi (Filippo Giunchedi) | colewhite (cwhite) | | | |
| T269680 | T269680: MediaWiki logging indexing conflict on 'session' for 'session-ip' channel | resolved | Medium (orange) | fgiunchedi (Filippo Giunchedi) | colewhite (cwhite) | | | |
| T275752 | T275752: Jobrunner timeouts on cross-DC file uploads because of HTTP/2 | resolved | High (red) | fgiunchedi (Filippo Giunchedi) | Legoktm (Legoktm) | | | |
| T85451 | T85451: scale graphite deployment (tracking) | invalid | High (red) | fgiunchedi (Filippo Giunchedi) | fgiunchedi (Filippo Giunchedi) | | | |
| T85907 | T85907: acquire graphite hardware in codfw and eqiad | resolved | High (red) | fgiunchedi (Filippo Giunchedi) | fgiunchedi (Filippo Giunchedi) | | | |
| T85908 | T85908: replicate metric traffic in eqiad and codfw | resolved | High (red) | fgiunchedi (Filippo Giunchedi) | fgiunchedi (Filippo Giunchedi) | | | |
| T85909 | T85909: migrate graphite to new hardware | resolved | High (red) | fgiunchedi (Filippo Giunchedi) | fgiunchedi (Filippo Giunchedi) | | | |
| T86316 | T86316: graphite clustering plan | declined | Medium (orange) | fgiunchedi (Filippo Giunchedi) | fgiunchedi (Filippo Giunchedi) | | | |
| T89461 | T89461: trebuchet puppet provider broken on systems without upstart | resolved | Medium (orange) | fgiunchedi (Filippo Giunchedi) | GWicke (Gabriel Wicke) | | | |
| T89636 | T89636: setup internal LVS for restbase eqiad servers | resolved | High (red) | fgiunchedi (Filippo Giunchedi) | fgiunchedi (Filippo Giunchedi) | | | |
| T89639 | T89639: restbase1006 faulty disk controller | resolved | High (red) | fgiunchedi (Filippo Giunchedi) | fgiunchedi (Filippo Giunchedi) | | | |
| T89857 | T89857: scale statsd reporting/aggregation (plan) | invalid | Medium (orange) | fgiunchedi (Filippo Giunchedi) | fgiunchedi (Filippo Giunchedi) | | | |
| T90111 | T90111: replace txstatsd | resolved | High (red) | fgiunchedi (Filippo Giunchedi) | fgiunchedi (Filippo Giunchedi) | | | |
| T90591 | T90591: backfill metrics from tungsten to graphite1001 | resolved | Medium (orange) | fgiunchedi (Filippo Giunchedi) | fgiunchedi (Filippo Giunchedi) | | | |
| T97024 | T97024: some cassandra metrics sent with invalid values | resolved | Medium (orange) | fgiunchedi (Filippo Giunchedi) | fgiunchedi (Filippo Giunchedi) | | | |