Content translation/Development Plan/Roadmap/CX01Release
Content Translation 0.01 release
See below for the detailed development plan for each of these features.
Increase language support (Labs->Beta->Prod)
- Languages with high-quality support through Apertium
- Catalan -> Spanish (reverse language pair)
- Spanish -> Portuguese
- Maltese -> Arabic (RTL)
- Languages well supported by Apertium that will require better markup alignment support
- English - Spanish
- English - Catalan
Feature Set
- Entry Points
- Red Interlanguage Link (implemented; needs integration)
- New entry point showing in Wikidata.
- Entry points shown based on: editing in other languages, Babel, browser language, Accept-Language
- Language tools
- Editing support (basic text styling)
- Existing paragraph translation - only design, not coding (improvement)
- Display disambiguation page links for link adaptation
- Increase the number of tools, e.g. MT backends, dictionary providers, references
- Translation dashboard
- continue ongoing translations, suggestions and impact (enhancement)
- Templates
- Pass through template support
- Inline templates adaptation - show placeholders instead of removing
- Category adaptation (not addition/editing)
- Infrastructure improvements (improvement)
- Improve machine translation abuse algorithms
- Latest upstream Apertium, required for supporting more language pairs
- MT infrastructure enhancement: work on leftover technical items on the MT backend
- Service provider registry
- Performance
- Test coverage
- Basic Analytics Dashboard: (enhancement)
- Collection
- Visualization
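A minimal sketch of how the Accept-Language entry point listed under Feature Set might pick candidate target languages. The function names and the `SUPPORTED_PAIRS` set are hypothetical and not taken from the CX codebase; the sketch also assumes each planned pair is one-directional in the order it is listed.

```python
from typing import List

# Hypothetical table of (source, target) pairs, assuming the planned pairs
# are one-directional in the order this page lists them.
SUPPORTED_PAIRS = {
    ("ca", "es"),
    ("es", "pt"),
    ("mt", "ar"),
    ("en", "es"),
    ("en", "ca"),
}


def parse_accept_language(header: str) -> List[str]:
    """Return the language tags of an Accept-Language header, best first."""
    weighted = []
    for position, item in enumerate(header.split(",")):
        tag, _, params = item.strip().partition(";")
        tag = tag.strip().lower()
        params = params.strip()
        quality = 1.0  # a tag without an explicit q-value counts as q=1
        if params.startswith("q="):
            try:
                quality = float(params[2:])
            except ValueError:
                quality = 0.0  # treat malformed weights as "not acceptable"
        if tag and tag != "*" and quality > 0:
            # Sort by descending quality, then by original position.
            weighted.append((-quality, position, tag))
    return [tag for _, _, tag in sorted(weighted)]


def suggest_targets(source: str, header: str) -> List[str]:
    """Suggest target languages that form a supported pair with `source`."""
    suggestions: List[str] = []
    for tag in parse_accept_language(header):
        primary = tag.split("-")[0]  # "es-es" -> "es"
        if (source, primary) in SUPPORTED_PAIRS and primary not in suggestions:
            suggestions.append(primary)
    return suggestions
```

For a reader of an English article whose browser sends `es-ES,es;q=0.9,ca;q=0.8,en;q=0.7`, `suggest_targets("en", ...)` would offer Spanish first and Catalan second.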
Production Deployment - Resources & Provisioning
Milestones
Completion Date/Milestones | Features | Sprints |
---|---|---|
August 18 - August 29 2014 | | 73 |
September 2014 | | 74-75 |
Development Plan
Feature | Details |
---|---|
Entry Points | |
Editor | |
Link Adaptation | |
Machine Translation Support (mt) | |
Dictionary Support | |
Templates Support | |
Architecture (technical feature) | |
Research and preliminary development | |
CX Deployment Plan for 0.01 Release September 2014
Deployment date: TBD (September 20-25 2014)
Project: Content Translation Framework
Release: 0.01 (second release after the MVP release)
Long-term project roadmap: Content_translation/Roadmap
Language Pairs to be supported: Catalan-Spanish, Spanish-Portuguese, Maltese-Arabic, English-Spanish, English-Catalan
Release as: Beta Feature
Overall Plan
System Architecture
See: https://www.mediawiki.org/wiki/Content_translation/Technical_Architecture
https://www.mediawiki.org/wiki/Content_translation#Workflow_and_Technical_Architecture
https://www.mediawiki.org/wiki/Content_translation
Caching Architecture
The following diagram includes the caching requirements for the CX framework.
https://www.mediawiki.org/wiki/Content_translation/Server_communications_workflow
https://commons.wikimedia.org/wiki/File:CX_ArchitectureV1.svg
Components to be provisioned for production
CX server installation and configuration: https://git.wikimedia.org/markdown/mediawiki%2Fservices%2Fcxserver.git/HEAD/README.md
See Setup (https://www.mediawiki.org/wiki/Content_translation/Setup) for detailed information about components, installation, and configuration.
- Node.js
- Dictd server (Also see: https://www.mediawiki.org/wiki/Content_translation/Dictionaries)
- Apertium
- Extension dependencies:
- BetaFeatures
- CLDR
- EventLogging
- Backend Services
- Varnish
- External APIs called by CX
- Wikidata
- Parsoid API
- Configuration Scripts
Upstart and Systemd scripts are at: https://www.mediawiki.org/wiki/Content_translation/Setup
Provisioning Plan
- Storage Requirements
To be determined from discussion with ops
- Hardware Requirements
To be determined from discussion with ops
- Bandwidth Requirements
To be determined from discussion with ops
- Performance expectations
- MT TPS (Transactions per second)
- User responsiveness
- MT Round trip
- General guidelines
- https://www.mediawiki.org/wiki/Performance_guidelines
- https://www.mediawiki.org/wiki/Performance_profiling_for_Wikimedia_code
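The MT TPS and MT round-trip expectations above have no targets yet. As a rough sketch of how such figures could be collected once targets exist, the helper below times repeated calls and reports throughput plus mean and 95th-percentile round trip. The function names are hypothetical, `call` stands in for whatever request is being measured, and the calls are issued sequentially, so the reported TPS is a single-client figure rather than server capacity.

```python
import math
import time
from typing import Callable, Dict, Sequence


def summarize_round_trips(durations: Sequence[float], wall_time: float) -> Dict[str, float]:
    """Summarize per-request durations (seconds) observed over wall_time seconds."""
    if not durations or wall_time <= 0:
        raise ValueError("need at least one duration and a positive wall time")
    ordered = sorted(durations)
    # Nearest-rank 95th percentile: smallest value covering 95% of the samples.
    rank = max(1, math.ceil(0.95 * len(ordered)))
    return {
        "requests": len(ordered),
        "tps": len(ordered) / wall_time,
        "mean_s": sum(ordered) / len(ordered),
        "p95_s": ordered[rank - 1],
    }


def measure(call: Callable[[], object], requests: int) -> Dict[str, float]:
    """Invoke `call` sequentially `requests` times and summarize the timings."""
    durations = []
    started = time.perf_counter()
    for _ in range(requests):
        t0 = time.perf_counter()
        call()  # hypothetical stand-in for one MT request
        durations.append(time.perf_counter() - t0)
    return summarize_round_trips(durations, time.perf_counter() - started)
```

Passing a function that performs one real translation request to `measure` would give a first estimate to compare against whatever targets are agreed with ops.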
Monitoring and metrics
- EventLogging activity for CX
- Number of users enabling the feature
- Performance of S:CX, backend calls?
- Check for node and varnish? Who to page?
- Graph showing requests or timings for the Wikidata API(s) we are calling
- Graph showing requests or timings for the Parsoid API(s) we are calling
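The request and timing graphs listed above need per-backend data points. A minimal sketch, assuming timing samples are available as `(unix_timestamp, backend_name, duration_seconds)` tuples (a hypothetical format, not an existing EventLogging schema), groups them into fixed time buckets that a graphing tool could plot:

```python
from collections import defaultdict
from typing import Dict, Iterable, Tuple

Sample = Tuple[float, str, float]  # (unix timestamp, backend name, duration in s)


def bucket_timings(samples: Iterable[Sample], bucket_seconds: int = 60) -> Dict[Tuple[str, int], Dict[str, float]]:
    """Group samples by backend and time bucket; report count and mean duration."""
    totals = defaultdict(lambda: [0, 0.0])  # key -> [request count, summed duration]
    for timestamp, backend, duration in samples:
        bucket_start = int(timestamp // bucket_seconds) * bucket_seconds
        entry = totals[(backend, bucket_start)]
        entry[0] += 1
        entry[1] += duration
    return {
        key: {"requests": count, "mean_s": total / count}
        for key, (count, total) in sorted(totals.items())
    }
```

Each key is a `(backend, bucket_start)` pair, so one series per external API (for example Wikidata and Parsoid) can be drawn from the same result.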
External Signoffs Required
- Faidon - Ops
- Gabriel - Infrastructure architecture
- Ori - Performance
- Chris Steipp - Security
- Greg G - Release engineering
- Mark - Ops
- Tim - Platform
LE Team responsibilities
- Kartik - Deployment, Engineer
- Niklas - Engineer, Code Reviewer
- Santhosh - Engineer, Code Reviewer
- David - Engineer, Code Reviewer
- Joel - Engineer, Code Reviewer
- Runa - Team Scrum-Ninja / testing and communications
- Pau - Feature UX reviewer, designer
- Amir - Feature signoff
- Alolita - Engineering coordination, Eng Manager