Thread history

Fragment of a discussion from Portal talk:Fa
Viewing a history listing
Jump to navigation Jump to search
Time User Activity Comment
No results

nice

ebraminiotalk18:24, 9 April 2011

Please explain your problem. I can provide a list of pages in a particular language.

Siebrand19:09, 9 April 2011

This is good, but the thing that I really need is having a tool for finding occurrence of a regex (and chars) on that pages. I know with list of my language translations I can do that with pywikipedia (I used pywikipedia in my wikipedia for a lot propuses). But this can done better and easier without that.

For example we have some hidden character in Persian (zwnj, zwj and direction control chars that is common in RTL languages). If I had a tool for finding misused occurrence of them in translation, I could easily fix them. For example this regex needed for my language (and some languages with Arabic script): s/([ازرذدژوة])\u200c/\1/g also finding non-standard chars (for my languages) with Special:Search is not possible because Special:Search only yields occurrence of words and not chars.

ebraminiotalk19:57, 9 April 2011

Using regex on over a million pages in translatewiki.net will be blocking our database for too long. We really cannot make that functionality available. If you were to provide the regexes you are looking for, we will try and find a way to help you out.

Siebrand23:49, 9 April 2011
Edited by author.
Last edit: 11:11, 11 April 2011

Okay, Thanks for your great jobs on TranslateWiki. Seems i must use Pywikipedia. If you have time, Please make list of Persian translations for me. Also SQL or XML dump of this wiki (separated for each language) or a replication on toolserver is other solutions I think.

May it is better this thread moved to Portal:Fa. Thanks again!

ebraminiotalk08:37, 11 April 2011

Feel free to move the thread. Will get back with a reply. Data dump: have to ask Niklas.

Siebrand09:28, 11 April 2011
 

See https://translatewiki.net/static/temp:

  • fa.txt contains all pages ending "/fa"
  • translatewiki.net-dump-2011-04-13.xml.bz2 contains an XML dump of translatewiki.net
Siebrand11:31, 15 April 2011