User:OrenBochman/bots
Appearance
Some bot Ideas
Rule Based Bots
[edit]- Phonologist. Use TTS code from Mbrola etc to to add IPA, Sampa, MBrola phonetical data in registered languages.
- IPA to Sampa etc. conversion.
- QA and confidence tests on against existing IPA.
- Compound word mode processing.
- String matching algorithm to map text n-grams to IPA ngrams (space,phon,phon,phone).
- production rule extraction from above (as per paper).
Mine Feedback loop
[edit]- Mine for data in wikis
#Get all he.wiktionary entries and add them to en.wiktionary + othographt
- Edit terms and store it there.
Template Labeler & Checker
[edit]- Add ID or MD5 HASH to mark template boundaries.
- Detect and Mark with categorized template mistakes.
- e.g. orphan tags/bad tidy code.