
Wikimedia Release Engineering Team/DataDataData Sync Up/2019-04-12

From mediawiki.org

2019-04-12


Last time



Today's Agenda

  • Is the goal "Instrument Quibble for data collection" (TEC3 (Pipeline): Outcome 1 / Output 1.2) dependent on Data^3?
    • Noticed that the update said there is "no place to store data".


Outline plan for Analytics


JR: Analytics might have thoughts on how to get easy access to a large pool of data, with querying that is flexible to changing needs

What data we have currently or are planning to collect

  • Schema
  • Data samples

How we might want to query that data

  • Our data is highly structured (see schemas)
    • Is Hadoop or ES more appropriate for that? Would we lose structure by putting it in Hadoop?
    • How much do we have to know about the data's structure before we put it in ES?
      • Can relationships/schema be changed after data is stored?

TODOs (by next meeting)

  • Dan to draft email to Analytics
    • include dashboard mockup
    • include an example of instrumenting Quibble
  • JR to check with Mukunda about his dependency on Data^3 for the goal "Instrument Quibble for data collection" (TEC3 (Pipeline): Outcome 1 / Output 1.2)
    • invite him to this meeting if he's interested