Wikimedia Technical Documentation Team/Doc metrics/v0
This page summarizes the first set of metrics (v0) the Tech Docs team plans to test as part of the Doc metrics project.
Proposed metrics
[edit]Based on the research and analysis completed during Q1 of FY24-25, TBurmeister proposed the following for a first round (v0) of technical documentation metrics:
Doc characteristic | Relevant | Accurate | Usable | Findable |
Are succinct; avoid walls of text
#Succinct |
✅✅ | |||
Use consistent organization and structure
#ConsistentStructure |
✅ | ✅✅ | ||
Are readable on mobile devices, with all information visible
#MobileDevice |
✅✅ | |||
Use consistent format and style
#ConsistentFormat |
✅✅ | |||
Orient the reader within a collection; Are organized alongside related pages
#CollectionOrientation |
✅ | ✅✅ | ||
Freshness
#Freshness |
✅ | ✅✅ | ✅ | |
Are translatable / translated
#Translation |
✅ | |||
Align with real developer tasks and problems
#Developers |
✅✅ | ✅ | ||
Are connected to / findable from / automated by code
#CodeConnection |
✅ | ✅ | ✅ |
Data elements to measure
[edit]Data element | Doc characteristics | Metrics categories |
Cross-references between code and on-wiki docs | CodeConnection, CollectionOrientation | Findable, Relevant |
Page headings | Succinct, ConsistentStructure, ConsistentFormat | Usable, Findable (see Discussion) |
Page title | Succinct, ConsistentStructure, ConsistentFormat | Usable, Findable |
Page sections
|
Succinct, ConsistentStructure,
CollectionOrientation |
Usable, Findable |
Navigation
|
ConsistentStructure, CollectionOrientation, MobileDevice | Findable, Usable |
Revisions
|
Freshness; Developers | Accurate, Relevant |
Tables | Succinct (+); MobileDevice (-) | Usable (+Inclusive) |
CSS, HTML | ConsistentFormat (-); MobileDevice (?) | Usable (+Inclusive) |
Lists
Page length # of paragraphs |
Succinct | Usable |
Translate markup
Pages w/o language links |
Translation | Usable (+Inclusive) |
Code samples | Developers; CodeConnection | Relevant |
Incoming links | Developers | Relevant |
Pageviews | Developers | Relevant |
Cross-references between code and on-wiki docs
[edit]May include:
- Links from code repos to wiki pages:
- Could be within README files, other files in subdirectories, or in Github / Gitlab / Gerrit repository summaries
- Links to code repos from wiki pages:
- Could be in templates (example) or paragraphs on wiki pages (example).
- Difficult to differentiate general links to code repos from links to the code repo aligned with a specific wiki page. Pages may link to upstream repos or other random code references.
Helps us assess:
- Whether docs are connected to / findable from / automated by code (#CodeConnection)
- Whether docs and code orient the reader within a collection; Are organized alongside related pages (#CollectionOrientation)
Available data sources:
- Special:LinkSearch (includes all links in all namespaces)
- Doesn't capture short links like [[gitlab:__]].
- Example list of links to Gerrit.
- Template:Extension has a repo field (among many other fields that may be useful links to code)
- For doc.wikimedia.org, we can access referer data through the webrequest pipeline
Unavailable data:
- Referer or clickstream data is not available for technical wikis unless the referer is another wiki page
- For privacy reasons, pageview tables and the tools that use them limit referer info to coarse buckets (referer_class : Can be none (null, empty or '-'), unknown (domain extraction failed), internal (domain is a wikimedia project), external (search engine) (domain is one of google, yahoo, bing, yandex, baidu, duckduckgo), external (any other))
Page headings
[edit]Raw data element | Data aspect | Documentation characteristic(s) |
Navigation template | Is there a navigation template on the page? | Orient the reader within a collection
Organized alongside related pages |
Navigation template | How long is the navigation template? | Avoid walls of text (succinctness) |
Incoming links | Are there many incoming links from other wiki pages? | Avoid duplication
Use consistent organization and structure |
Incoming links | Are there very few incoming links from other wiki pages? | Align with real developer tasks and problems (relevance)
Use consistent organization and structure |
Incoming links | Are there any incoming links from code repos? | Align with real developer tasks and problems (relevance)
Update when code changes |
Next steps
[edit]We will pick 3-5 of these metrics to test on a sample set of documentation collections. See the project timeline and milestones for more info.