Phabricator Link
|
Wiki Link
|
Status
|
Priority
|
Author
|
Assignee
|
Projects
|
Subtasks
|
Parent Tasks
|
T157088
|
T157088: [EPIC] Develop a JobQueue backend based on EventBus
|
resolved
|
Medium (orange)
|
|
|
|
|
|
T172479
|
T172479: Collect error logs from jobchron/jobrunner services in Logstash
|
declined
|
Low (yellow)
|
|
|
|
|
|
T172480
|
T172480: Add a jobrunner server to the Scap canary pool
|
open
|
Low (yellow)
|
|
|
|
|
|
T175146
|
T175146: [RfC] Move RunJobs.php to the mediawiki (core) repository
|
stalled
|
Medium (orange)
|
|
|
|
|
|
T192085
|
T192085: PHP Fatal in AffectedPagesFinder::getChangedAspects
|
resolved
|
High (red)
|
|
|
|
|
|
T198220
|
T198220: Stop and remove old job runners
|
resolved
|
Medium (orange)
|
|
|
|
|
|
T199432
|
T199432: Consider disabling automatic topic creation in main-kafka
|
declined
|
Medium (orange)
|
|
|
|
|
|
T202107
|
T202107: Job queue should not overload the DB servers when there is replication lag
|
resolved
|
Medium (orange)
|
|
|
|
|
|
T204031
|
T204031: Deploy regular running of wikidata constraint checks using the job queue
|
resolved
|
Medium (orange)
|
|
|
|
|
|
T204183
|
T204183: EventBus error "Unable to deliver all events: (curl error: 28) Timeout was reached"
|
invalid
|
High (red)
|
|
|
|
|
|
T206186
|
T206186: Create LimitedConcurrencyKafkaConsumer
|
open
|
Lowest (sky)
|
|
|
|
|
|
T207329
|
T207329: Clear watchlist on enwiki only removes 50 items at a time
|
resolved
|
Medium (orange)
|
|
|
|
|
|
T207809
|
T207809: PHP error "CdnPurgeJob never inserted."
|
resolved
|
Medium (orange)
|
|
|
|
|
|
T208922
|
T208922: PHP Fatal Error: Class undefined: JobExecutor (jobrunners try to run labswiki jobs)
|
resolved
|
High (red)
|
|
|
|
|
|
T210451
|
T210451: Kafka eqiad.mediawiki.page-delete topic is empty
|
resolved
|
High (red)
|
|
|
|
|
|
T210910
|
T210910: Jobs for otrs-wiki are slower than expected to process
|
invalid
|
Needs Triage (violet)
|
|
|
|
|
|
T212335
|
T212335: EventBus or CirrusSearch: DomainException from line 353 of /srv/mediawiki/php-1.33.0-wmf.9/vendor/firebase/php-jwt/src/JWT.php: Unknown JSON error: 5
|
resolved
|
High (red)
|
|
|
|
|
|
T213506
|
T213506: Grafana alerting broken after upgrade to 5.0.0
|
resolved
|
High (red)
|
|
|
|
|
|
T217683
|
T217683: Delete the mediawiki/services/cp-jobqueue repo
|
resolved
|
Needs Triage (violet)
|
|
|
|
|
|
T218170
|
T218170: Finish removal of ukwikimedia wiki
|
resolved
|
Low (yellow)
|
|
|
|
|
|
T218692
|
T218692: read only on mediawiki generates "LoadBalancer.php: Cannot access the database: Unknown error"
|
resolved
|
Medium (orange)
|
|
|
|
|
|
T218812
|
T218812: RFC: Provide the ability to have time-delayed or time-offset jobs in the job queue
|
open
|
Needs Triage (violet)
|
|
|
|
|
|
T219159
|
T219159: Partition htmlCacheUpdate job topic
|
resolved
|
Needs Triage (violet)
|
|
|
|
|
|
T220127
|
T220127: Reuse JobExecutor within a JobRunner
|
declined
|
Medium (orange)
|
|
|
|
|
|
T220399
|
T220399: Migrate cpjobqueue to kubernetes
|
resolved
|
Medium (orange)
|
|
|
|
|
|
T221224
|
T221224: showJobs.php maintenance script useless and misleading in production
|
open
|
Low (yellow)
|
|
|
|
|
|
T221368
|
T221368: cdnPurge and other jobs fail completely to execute
|
resolved
|
Unbreak Now! (pink)
|
|
|
|
|
|
T224425
|
T224425: MW Job consumers sometimes pause for several minutes
|
resolved
|
High (red)
|
|
|
|
|
|
T225097
|
T225097: "JobExecutor not loaded" error for BounceHandlerJob on wikitech.wikimedia.org
|
open
|
Low (yellow)
|
|
|
|
|
|
T226109
|
T226109: Write incident report for jobs not being executed on 1.34.0-wmf.10
|
resolved
|
Medium (orange)
|
|
|
|
|
|
T228525
|
T228525: If JobQueueEventBus fails to send a job exception is left uncaught
|
invalid
|
Needs Triage (violet)
|
|
|
|
|
|
T228911
|
T228911: DBTransactionError "transaction round 'LinksUpdate::doUpdate' still running" (via RunSingleJob.php)
|
declined
|
Medium (orange)
|
|
|
|
|
|
T230730
|
T230730: Better way to pause writes on elasticsearch
|
open
|
Medium (orange)
|
|
|
|
|
|
T231194
|
T231194: Increase concurrency of the cirrusCheckerJob
|
resolved
|
Medium (orange)
|
|
|
|
|
|
T232362
|
T232362: Massmessages not going through, log looks fine
|
resolved
|
Medium (orange)
|
|
|
|
|
|
T232379
|
T232379: MassMessage problems - multiple deliveries and missing deliveries
|
resolved
|
Low (yellow)
|
|
|
|
|
|
T232392
|
T232392: EventBus extension must not send batches that are too large
|
resolved
|
High (red)
|
|
|
|
|
|
T233584
|
T233584: Re-adjust cirrusSearchLinksUpdate vs cirrusSearchLinksUpdatePrioritized concurrency
|
resolved
|
High (red)
|
|
|
|
|
|
T234226
|
T234226: MassMessage not delivering messages
|
resolved
|
High (red)
|
|
|
|
|
|
T235358
|
T235358: Could not enqueue jobs: "Unable to deliver all events: 500: Internal Server Error"
|
duplicate
|
High (red)
|
|
|
|
|
|
T238296
|
T238296: job queue insert rate metrics gone from Grafana
|
resolved
|
Needs Triage (violet)
|
|
|
|
|
|
T239394
|
T239394: Can't rename the translatable page mw:Global_templates/Draft_spec
|
resolved
|
Medium (orange)
|
|
|
|
|
|
T240518
|
T240518: Some jobs are not being processed / are processed slowly
|
resolved
|
Medium (orange)
|
|
|
|
|
|
T241448
|
T241448: Job queue broken on Beta Cluster
|
resolved
|
Unbreak Now! (pink)
|
|
|
|
|
|
T241905
|
T241905: Investigate JobQueue outage from 2020-01-04 22:00 UTC
|
declined
|
Medium (orange)
|
|
|
|
|
|
T242722
|
T242722: Document the danger of the release timestamp feature in code and on-wiki
|
open
|
Medium (orange)
|
|
|
|
|
|
T242726
|
T242726: Improve the Kafka job queue's handling of unknown new jobs
|
open
|
Medium (orange)
|
|
|
|
|
|
T243096
|
T243096: Jobrunner monitoring still calles /rpc/runJobs.php
|
resolved
|
Medium (orange)
|
|
|
|
|
|
T244387
|
T244387: Change-Prop consumer group must respect service name
|
resolved
|
Needs Triage (violet)
|
|
|
|
|
|
T244758
|
T244758: ChangeProp old job warning should take job delay into account
|
resolved
|
Needs Triage (violet)
|
|
|
|
|
|
T244770
|
T244770: Enable RunSingleJobHandler endpoint on Job Runner Cluster
|
resolved
|
Needs Triage (violet)
|
|
|
|
|
|
T244826
|
T244826: Create rest endpoint for executing jobs instead of /rpc/RunSingleJob
|
resolved
|
Needs Triage (violet)
|
|
|
|
|
|
T244828
|
T244828: Investigate alternatives to MEDIAWIKI_JOB_RUNNER global
|
resolved
|
Needs Triage (violet)
|
|
|
|
|
|
T245803
|
T245803: Make changeprop chart depend on Kafka-dev for minikube
|
resolved
|
Needs Triage (violet)
|
|
|
|
|
|
T246156
|
T246156: Convert JobRunner into a service class
|
resolved
|
Needs Triage (violet)
|
|
|
|
|
|
T246371
|
T246371: Move job traffic from rpc/RunSingleJob to REST endpoint
|
open
|
Low (yellow)
|
|
|
|
|
|
T246389
|
T246389: Enable MW REST API on job runners and video scalers (for the new rest.php job executor)
|
open
|
Medium (orange)
|
|
|
|
|
|
T247114
|
T247114: Find a way to set elevated timeouts for job running
|
open
|
Medium (orange)
|
|
|
|
|
|
T247622
|
T247622: RunSingleJob.php timeout too low at 180 seconds
|
resolved
|
High (red)
|
|
|
|
|
|
T248602
|
T248602: Lots of "EventBus: Unable to deliver all events: 504: Gateway Timeout"
|
duplicate
|
Medium (orange)
|
|
|
|
|
|
T249745
|
T249745: Could not enqueue jobs: "Unable to deliver all events: 503: Service Unavailable"
|
open
|
High (red)
|
|
|
|
|
|
T255684
|
T255684: Fix aggregation of change-prop exec time metric
|
open
|
Medium (orange)
|
|
|
|
|
|
T257583
|
T257583: Run EventBus tests in MediaWiki core CI
|
open
|
Needs Triage (violet)
|
|
|
|
|
|
T260062
|
T260062: Jobqueue warnings for old processMediaModeration jobs should be turned off
|
open
|
Needs Triage (violet)
|
|
|
|
|
|
T260274
|
T260274: JobQueueError: Could not enqueue jobs from stream [stream]
|
duplicate
|
High (red)
|
|
|
|
|
|
T261691
|
T261691: Investigate change-prop commits when messages stop coming into the topic
|
open
|
High (red)
|
|
|
|
|
|
T263301
|
T263301: Old image unexpectedly overwritten by a revision several years later (after Internal server error)
|
open
|
High (red)
|
|
|
|
|
|
T267520
|
T267520: Translation pages are updated after a long delay on Wikimedia sites
|
resolved
|
High (red)
|
|
|
|
|
|
T275432
|
T275432: MassMessage not delivering
|
open
|
Medium (orange)
|
|
|
|
|
|
T278382
|
T278382: Clean up CirrusSearch job retries
|
open
|
Medium (orange)
|
|
|
|
|
|
T278906
|
T278906: Change cpjobqueue "processing" time metrics from pre-aggregated quantile to native Prometheus histogram bucket
|
open
|
Medium (orange)
|
|
|
|
|
|
T299302
|
T299302: Linter jobs are running slowly
|
resolved
|
High (red)
|
|
|
|
|
|
T93049
|
T93049: Same MassMessage is being sent more than once
|
open
|
High (red)
|
|
|
|
|
|
T96113
|
T96113: Set up dedicated local-dc runners for 'enqueue' router jobs
|
invalid
|
Needs Triage (violet)
|
|
|
|
|
|