This site is no longer actively maintained. It exists for historical purposes as an example of Phabricator integration and Lua scripting.

Tasks By Project


Project: Data-Engineering

38 Phabricator task(s).

| Phabricator Link | Wiki Link | Status | Priority | Author | Assignee | Projects | Subtasks | Parent Tasks |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| T120242 | T120242: Consistent MediaWiki state change events \| MediaWiki events as source of truth | open | Medium (orange) | | | | | |
| T216492 | T216492: Page-links-change stream doesn't capture duplicated links | open | Medium (orange) | | | | | |
| T216504 | T216504: page-links-change stream is assigning template propagation events to the wrong edits | open | Medium (orange) | | | | | |
| T217271 | T217271: Some event data (like the one that comes from mediawiki events such us revision create) should not get sanitized | open | Medium (orange) | | | | | |
| T233004 | T233004: Schema changes for `cu_changes` and `cu_log` table | stalled | Medium (orange) | | | | | |
| T238230 | T238230: Decommission EventLogging backend components by migrating to MEP | open | Medium (orange) | | | | | |
| T239591 | T239591: Update mediawiki-history to use new Multi-Content-Revision tables | open | High (red) | | | | | |
| T239630 | T239630: Mediaviewer views should be reworked to be an eventlogging event | open | Medium (orange) | | | | | |
| T240387 | T240387: MW REST API Historical Data Endpoint Needs | open | High (red) | | | | | |
| T249755 | T249755: Cassandra3 migration for Analytics AQS | open | High (red) | | | | | |
| T255141 | T255141: Upgrade the Cassandra AQS cluster to Cassandra 3.11 | duplicate | High (red) | | | | | |
| T257572 | T257572: Set up a testing environment for the AQS Cassandra 3 migration | open | Needs Triage (violet) | | | | | |
| T259163 | T259163: Migrate legacy metawiki schemas to Event Platform | open | High (red) | | | | | |
| T259712 | T259712: Allow disabling/enabling configured streams via wgEventStreams config | open | High (red) | | | | | |
| T271429 | T271429: Replace Oozie with better workflow scheduler | open | High (red) | | | | | |
| T282012 | T282012: WikipediaPortal Event Platform Migration | open | Medium (orange) | | | | | |
| T282033 | T282033: Airflow collaborations | open | High (red) | | | | | |
| T282035 | T282035: Catalog, Categorize, and Templetize existing scheduled workflows | open | Medium (orange) | | | | | |
| T282131 | T282131: Determine which remaining legacy EventLogging schemas need to be migrated or decommissioned | open | High (red) | | | | | |
| T282887 | T282887: Avoid accepting Kafka messages with whacky timestamps | open | Medium (orange) | | | | | |
| T284566 | T284566: Replace Airflow's HDFS client (snakebite) with pyarrow | open | Medium (orange) | | | | | |
| T285692 | T285692: Write a job entirely in Airflow with spark and/or sparkSQL | open | High (red) | | | | | |
| T288247 | T288247: SPIKE - Will Hadoop 3 container support help us for Airflow deployment pipelines? | open | Needs Triage (violet) | | | | | |
| T288271 | T288271: Make it possible to use anaconda + stacked conda envs for Airflow executors | declined | High (red) | | | | | |
| T289161 | T289161: Update cassandra oozie jobs to load cassandra3 using Spark job | resolved | High (red) | | | | | |
| T290068 | T290068: Check AQS with cassandra (serving + data) | resolved | High (red) | | | | | |
| T290303 | T290303: Migrate WikibaseTermboxInteraction EventLogging Schema to new EventPlatform thingy | open | High (red) | | | | | |
| T291120 | T291120: MediaWiki Events as a Source of Truth - Problem Statement | open | High (red) | | | | | |
| T291469 | T291469: Repair and reload all cassandra-2 data tables but the 2 big ones | resolved | High (red) | | | | | |
| T291470 | T291470: Repair and reload cassandra2 mediarequest_per_file data table | resolved | High (red) | | | | | |
| T291472 | T291472: Snapshot and Reload cassandra2 pageview_per_article data table from all 12 instances | resolved | High (red) | | | | | |
| T291620 | T291620: Better observability/visualization for MediaWiki jobs | open | Low (yellow) | | | | | |
| T294024 | T294024: [Airflow] Automate sync'ing archiva packages to HDFS | open | High (red) | | | | | |
| T294026 | T294026: [Airflow] Create repository for Airflow DAGs | open | High (red) | | | | | |
| T297460 | T297460: Send cassandra3 (new hosts) logs to logstash | open | High (red) | | | | | |
| T297803 | T297803: Switch over the Cassandra AQS cluster to the new hosts | open | High (red) | | | | | |
| T298516 | T298516: Investigate high levels of garbage collection on new AQS nodes | open | High (red) | | | | | |
| T298805 | T298805: Import Debian package of Cassandra 3.11.11 as 'dev' version | open | Needs Triage (violet) | | | | | |