<?xml version="1.0" encoding="UTF-8"?>
<rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns="http://purl.org/rss/1.0/" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:syn="http://purl.org/rss/1.0/modules/syndication/" xmlns:admin="http://webns.net/mvcb/">
  <channel about="http://blog.gmane.org/gmane.science.linguistics.wikipedia.technical">
    <title>gmane.science.linguistics.wikipedia.technical</title>
    <link>http://blog.gmane.org/gmane.science.linguistics.wikipedia.technical</link>
    <description/>
    <syn:updatePeriod>hourly</syn:updatePeriod>
    <syn:updateFrequency>1</syn:updateFrequency>
    <syn:updateBase>1901-01-01T00:00+00:00</syn:updateBase>
    <items>
      <rdf:Seq>
        <rdf:li rdf:resource="http://comments.gmane.org/gmane.science.linguistics.wikipedia.technical/40765"/>
        <rdf:li rdf:resource="http://comments.gmane.org/gmane.science.linguistics.wikipedia.technical/40761"/>
        <rdf:li rdf:resource="http://comments.gmane.org/gmane.science.linguistics.wikipedia.technical/40752"/>
        <rdf:li rdf:resource="http://comments.gmane.org/gmane.science.linguistics.wikipedia.technical/40751"/>
        <rdf:li rdf:resource="http://comments.gmane.org/gmane.science.linguistics.wikipedia.technical/40750"/>
        <rdf:li rdf:resource="http://comments.gmane.org/gmane.science.linguistics.wikipedia.technical/40741"/>
        <rdf:li rdf:resource="http://comments.gmane.org/gmane.science.linguistics.wikipedia.technical/40735"/>
        <rdf:li rdf:resource="http://comments.gmane.org/gmane.science.linguistics.wikipedia.technical/40725"/>
        <rdf:li rdf:resource="http://comments.gmane.org/gmane.science.linguistics.wikipedia.technical/40722"/>
        <rdf:li rdf:resource="http://comments.gmane.org/gmane.science.linguistics.wikipedia.technical/40718"/>
        <rdf:li rdf:resource="http://comments.gmane.org/gmane.science.linguistics.wikipedia.technical/40712"/>
        <rdf:li rdf:resource="http://comments.gmane.org/gmane.science.linguistics.wikipedia.technical/40690"/>
        <rdf:li rdf:resource="http://comments.gmane.org/gmane.science.linguistics.wikipedia.technical/40687"/>
        <rdf:li rdf:resource="http://comments.gmane.org/gmane.science.linguistics.wikipedia.technical/40671"/>
        <rdf:li rdf:resource="http://comments.gmane.org/gmane.science.linguistics.wikipedia.technical/40666"/>
        <rdf:li rdf:resource="http://comments.gmane.org/gmane.science.linguistics.wikipedia.technical/40664"/>
        <rdf:li rdf:resource="http://comments.gmane.org/gmane.science.linguistics.wikipedia.technical/40660"/>
        <rdf:li rdf:resource="http://comments.gmane.org/gmane.science.linguistics.wikipedia.technical/40636"/>
        <rdf:li rdf:resource="http://comments.gmane.org/gmane.science.linguistics.wikipedia.technical/40619"/>
        <rdf:li rdf:resource="http://comments.gmane.org/gmane.science.linguistics.wikipedia.technical/40608"/>
      </rdf:Seq>
    </items>
    <image rdf:resource="http://gmane.org/img/gmane-25t.png"/>
    <textinput rdf:resource=""/>
  </channel>
  <image rdf:about="http://gmane.org/img/gmane-25t.png">
    <title>Gmane</title>
    <url>http://gmane.org/img/gmane-25t.png</url>
    <link>http://gmane.org</link>
  </image>
  <item rdf:about="http://comments.gmane.org/gmane.science.linguistics.wikipedia.technical/40765">
    <title>FlaggedRevs statistics?</title>
    <link>http://comments.gmane.org/gmane.science.linguistics.wikipedia.technical/40765</link>
    <description>There are nice statistics at:
http://toolserver.org/~aka/cgi-bin/reviewcnt.cgi?lang=english&amp;action=overview,&lt;http://toolserver.org/~aka/cgi-bin/reviewcnt.cgi?lang=english&amp;action=overview&gt;Is
it possible to have similar for the other projects, as well, that have the
flaggedrevs extension enabled?

Best regards,
Bence Damokos
</description>
    <dc:creator>Bence Damokos</dc:creator>
    <dc:date>2008-12-01T20:01:44</dc:date>
  </item>
  <item rdf:about="http://comments.gmane.org/gmane.science.linguistics.wikipedia.technical/40761">
    <title>terms &amp; conditions checkbox on registering</title>
    <link>http://comments.gmane.org/gmane.science.linguistics.wikipedia.technical/40761</link>
    <description>Hello guys,

we´re having a wonderful wiki based on mediawiki but our legal deparment 
said we should include some terms and conditions on the registration 
process. We tried to find an extension which provide a simple checkbox 
with a text in the registration page but we cannot find anything.

We don´t really want to touch the mediawiki files so i would prefer a 
solution in the skin files or with an extension. We tried to find an 
existing mediawiki with terms and conditions, but there does not seem to 
be one.

Here my question:
Do you know an extension which will include terms and conditions 
checkbox to the register page?
Do you know a theme that includes such a terms&amp;conditions checkbox?

Which files do I need to touch to include such a feature? where is the 
register page defined, and where is it prozessed?

thank you very much

Falko
</description>
    <dc:creator>Falko Richter</dc:creator>
    <dc:date>2008-12-01T18:33:07</dc:date>
  </item>
  <item rdf:about="http://comments.gmane.org/gmane.science.linguistics.wikipedia.technical/40752">
    <title>Stub quota</title>
    <link>http://comments.gmane.org/gmane.science.linguistics.wikipedia.technical/40752</link>
    <description>
After some more thought on the origins of stub articles and a 
better overview of the contents of the Swedish Wikipedia, it is 
clear that very few individuals are responsible for creating large 
numbers of stubs, a few years back.  Now, depending on religion 
(mergists, deletionists...), these should either be deleted, 
improved, merged or put on lists of necessary quality 
improvements. Either way, it's a lot of work and it would have 
been better to have stopped those invidiuals back then.  At least 
we want to stop such individuals today, so the same mistake isn't 
repeated while the old mess is being cleaned up.

What we want is to foster a spirit of writing better articles, 
improving the one you started, before you start the next one.

But instead of increased patrolling and speedy deletions, this 
could be implemented in the Mediawiki software.  If a user (logged 
in or IP address) tries to create a new page, their recent 
contribution history could be checked, and if any of their five 
most recentl</description>
    <dc:creator>Lars Aronsson</dc:creator>
    <dc:date>2008-12-01T09:29:37</dc:date>
  </item>
  <item rdf:about="http://comments.gmane.org/gmane.science.linguistics.wikipedia.technical/40751">
    <title>FW: [FOSDEM] Your devroom request has not been accepted</title>
    <link>http://comments.gmane.org/gmane.science.linguistics.wikipedia.technical/40751</link>
    <description>FYI.

This is too bad. It would have been a great opportunity for our project to
go forward by bringing a substantial number of developers together in the
official part of FOSDEM 2009.

On the other hand: I and a few others will definitely be going to FOSDEM
2009[1]. Is it worth exploring possibilities to have some kind of thing in
Brussels in the same weekend, should we try and organise something
completely different as soon as possible, or should we cancel the idea for
the next 6 months altogether?

Cheers! Siebrand

[1] http://www.mediawiki.org/wiki/Project:FOSDEM_2009

-----Oorspronkelijk bericht-----
Van: Pascal Bleser [mailto:loki-8Ycg9RsmSNMdnm+yROfE0A&lt; at &gt;public.gmane.org] 
Verzonden: maandag 1 december 2008 1:37
Aan: Siebrand Mazeland
CC: FOSDEM DevRooms
Onderwerp: [FOSDEM] Your devroom request has not been accepted

-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

We're sorry to give you bad news, but your developer room request for FOSDEM
2009 has not been accepted.

We realize that this must be disappo</description>
    <dc:creator>Siebrand Mazeland</dc:creator>
    <dc:date>2008-12-01T08:09:37</dc:date>
  </item>
  <item rdf:about="http://comments.gmane.org/gmane.science.linguistics.wikipedia.technical/40750">
    <title>Bugzilla Weekly Report</title>
    <link>http://comments.gmane.org/gmane.science.linguistics.wikipedia.technical/40750</link>
    <description>MediaWiki Bugzilla Report for November 24, 2008 - December 01, 2008

Status changes this week

Bugs NEW               :  93                                  
Bugs ASSIGNED          :  4                                   
Bugs REOPENED          :  10                                  
Bugs RESOLVED          :  45                                  

Total bugs still open: 3168                                

Resolutions for the week:

Bugs marked FIXED      :  31                                  
Bugs marked REMIND     :  0                                   
Bugs marked INVALID    :  5                                   
Bugs marked DUPLICATE  :  3                                   
Bugs marked WONTFIX    :  2                                   
Bugs marked WORKSFORME :  3                                   
Bugs marked LATER      :  1                                   
Bugs marked MOVED      :  0                                   

Specific Product/Component Resolutions &amp; User Metrics 

New Bugs Per Component

Ge</description>
    <dc:creator>reporter-FUOLwUHO8CAB44msSl0Y14aCUm+kVXi4&lt; at &gt;public.gmane.org</dc:creator>
    <dc:date>2008-12-01T03:00:01</dc:date>
  </item>
  <item rdf:about="http://comments.gmane.org/gmane.science.linguistics.wikipedia.technical/40741">
    <title>Non-latin characters broken in donation comments</title>
    <link>http://comments.gmane.org/gmane.science.linguistics.wikipedia.technical/40741</link>
    <description>Hello,

Names with non-Latin characters in the donation comments are broken
and outputting as question marks. Some people are understandably
unhappy that their names are not appearing next to their donations.
For example, see &lt;
http://wikimediafoundation.org/wiki/Special:ContributionHistory?offset=1228023677

(Thanks to [[ja:user:Aotake]] for pointing it out in #wikimedia.)

</description>
    <dc:creator>Jesse Plamondon-Willard</dc:creator>
    <dc:date>2008-11-30T09:52:21</dc:date>
  </item>
  <item rdf:about="http://comments.gmane.org/gmane.science.linguistics.wikipedia.technical/40735">
    <title>Donor's names are broken</title>
    <link>http://comments.gmane.org/gmane.science.linguistics.wikipedia.technical/40735</link>
    <description>Why are some names of donors broken in
http://wikimediafoundation.org/wiki/Special:ContributionHistory ?

examples:
http://wikimediafoundation.org/wiki/Special:ContributionHistory?offset=1227942255#160557
http://wikimediafoundation.org/wiki/Special:ContributionHistory?offset=1227942255#160539

I think no-latin characters are displayed not successfully.

----
   mizusumashi
</description>
    <dc:creator>mizusumashi</dc:creator>
    <dc:date>2008-11-29T07:16:48</dc:date>
  </item>
  <item rdf:about="http://comments.gmane.org/gmane.science.linguistics.wikipedia.technical/40725">
    <title>Local wikipedia server configuration</title>
    <link>http://comments.gmane.org/gmane.science.linguistics.wikipedia.technical/40725</link>
    <description>Greetings,

I am trying to setup a local box to run it as a local Wikipedia server. It
would have Ubuntu 8.04 with PHP5 and Mysql 5. I would like to replicate all
en.wikipedia with the history and conversation. This would be part of an
academic research on the quality of Wikipedia as an information source.

I need your input on the hardware configuration please.

   - RAM: 16 GB or 32 GB?
   - CPU: double quad-core (Xeon 2 Ghz) or 1 quad-core or 1 double-core?
   - I intented to have 4-5 TB of storage space.

How much space you think I need for temporary tables? Is 300 GB enough or I
should get more. The full database might be around 600 GB uncompressed.

One user only is going to run queries on this server and some queries might
span several entire tables.

Thanks in advance for all your suggestions.


bilal
</description>
    <dc:creator>Bilal Abdul Kader</dc:creator>
    <dc:date>2008-11-28T07:38:46</dc:date>
  </item>
  <item rdf:about="http://comments.gmane.org/gmane.science.linguistics.wikipedia.technical/40722">
    <title>Depth and other quality estimates</title>
    <link>http://comments.gmane.org/gmane.science.linguistics.wikipedia.technical/40722</link>
    <description>
On my favorite page, 
http://meta.wikimedia.org/wiki/List_of_Wikipedias

there is a column for "depth", which is "a rough indicator of a 
Wikipedia's quality, showing how frequently its articles are 
updated".  Tomorrow that column has been there for two full years, 
with slight modifications of its formula.

I wrote a separate page about this, 
http://meta.wikimedia.org/wiki/Depth

(Note that this is completely unrelated to
http://en.wikipedia.org/wiki/Wikipedia:Depth )

There has been a lengthy discussion on the good and evil of trying 
to estimate the quality of Wikipedia.  But I think "depth" is the 
only measurement that we can track over such a long time.

What other estimates of Wikipedia quality do we have, that can be 
applied across language versions?

Erik Zachte's Wikipedia Statistics (last updated in May 2008) 
presents a number of values that could be used to calculate a 
quality estimate: number of articles, number of articles longer 
than 0.5 kbytes or 2 kbytes (excluding some markup), mean </description>
    <dc:creator>Lars Aronsson</dc:creator>
    <dc:date>2008-11-28T02:08:49</dc:date>
  </item>
  <item rdf:about="http://comments.gmane.org/gmane.science.linguistics.wikipedia.technical/40718">
    <title>autolink revision numbers</title>
    <link>http://comments.gmane.org/gmane.science.linguistics.wikipedia.technical/40718</link>
    <description>What is the easiest way to automatically link the pattern /r(\d+)/ to a
URL like http://code.google.com/p/qubit-toolkit/source/detail?r=\1
wherever it appears in MediaWiki articles?

For example, I want an article containing:

... blah blah r1632 blah blah...

- to have r1632 automatically replaced with a link to
http://code.google.com/p/qubit-toolkit/source/detail?r=1632
</description>
    <dc:creator>Jack Bates</dc:creator>
    <dc:date>2008-11-27T19:09:32</dc:date>
  </item>
  <item rdf:about="http://comments.gmane.org/gmane.science.linguistics.wikipedia.technical/40712">
    <title>Stats on navigation through Wikipedia pages</title>
    <link>http://comments.gmane.org/gmane.science.linguistics.wikipedia.technical/40712</link>
    <description>There's a question being discussed on wikien-l at present about
whether [[en:Ireland]] should be the country, the island or a
disambig.

The arguments are both on correctness and on reader expectation. The
first is for Manual of Style wonks (as well as anyone with an
opinion), but the second is theoretically numerically ascertainable.

The question is:

* how to get numbers on how readers travel from one page to another
within Wikipedia
* without risking a privacy violation.

Examples of the latter would be if we included jumps to or from a user
page or even a WP: pseudo-pagespace page. So let's ignore those.

The first idea that springs to my mind is logging referers for article
pages, if the referer is an article page on the same Wikipedia or
[[Special:Search]].

1. Is this technically feasible given our logging structure?
2. Is there a privacy gotcha I'm ignoring?

The huge benefit from this would be seeing how readers actually use
Wikipedia, which would give us solid reasons to put given pages,
links, re</description>
    <dc:creator>David Gerard</dc:creator>
    <dc:date>2008-11-26T15:32:56</dc:date>
  </item>
  <item rdf:about="http://comments.gmane.org/gmane.science.linguistics.wikipedia.technical/40690">
    <title>programmatically  add text to edit page text area</title>
    <link>http://comments.gmane.org/gmane.science.linguistics.wikipedia.technical/40690</link>
    <description>I've been looking through the following links looking for a hook that
will allow me to insert text into the text area on the edit page. 

http://www.mediawiki.org/wiki/Extension:PageSecurity/OutputPage.php

(the EditPage.php doc is still empty)

http://www.mediawiki.org/wiki/Manual:Hooks/EditPage::showEditForm:fields

 

Am I looking in the wrong place again? 

 

Matt Alline




***************************************************************************************************
The information in this email is confidential and may be legally privileged. Access to this email by anyone other than the intended addressee is unauthorized. If you are not the intended recipient of this message, any review, disclosure, copying, distribution, retention, or any action taken or omitted to be taken in reliance on it is prohibited and may be unlawful. If you are not the intended recipient, please reply to or forward a copy of this message to the sender and delete the message, any attachments, and any copies thereof from </description>
    <dc:creator>Alline, Matt</dc:creator>
    <dc:date>2008-11-25T16:17:13</dc:date>
  </item>
  <item rdf:about="http://comments.gmane.org/gmane.science.linguistics.wikipedia.technical/40687">
    <title>wfLocalFile($arg) and file uploads</title>
    <link>http://comments.gmane.org/gmane.science.linguistics.wikipedia.technical/40687</link>
    <description>Does anyone know where to find the method wfLocalFile($arg)? I've tried
to grep for the method but I've not gotten anything but calls to the
method. I'm still new to php and mediawiki development so I may be
looking in the wrong place.

 

The overlying problem:

I am currently working on a new extension that aims to allow users to
upload a document (doc, docx, odf,...) and convert that document to
wikitext server side. I've gotten other major components of this already
worked out. I am currently hung up trying to upload the file (possibly
using the UploadForm) while using mediawiki's existing validations. I do
not want to use the current base UploadForm to store the document in the
current architecture for a few reasons. I have already tried to capture
the temp file created by php before calling processUpload but it seems
the file gets cleaned up before I can grab it.

 

I could write the the extension to handle the upload as you normally
would in php, but again, I would like to use some of the existing
va</description>
    <dc:creator>Alline, Matt</dc:creator>
    <dc:date>2008-11-25T14:42:24</dc:date>
  </item>
  <item rdf:about="http://comments.gmane.org/gmane.science.linguistics.wikipedia.technical/40671">
    <title>User agent statistics</title>
    <link>http://comments.gmane.org/gmane.science.linguistics.wikipedia.technical/40671</link>
    <description>See https://bugzilla.wikimedia.org/show_bug.cgi?id=15059 for a request.
But such additional stats seem to be *not* really intended, however.


</description>
    <dc:creator>Melancholie</dc:creator>
    <dc:date>2008-11-24T17:21:58</dc:date>
  </item>
  <item rdf:about="http://comments.gmane.org/gmane.science.linguistics.wikipedia.technical/40666">
    <title>Bugzilla Weekly Report</title>
    <link>http://comments.gmane.org/gmane.science.linguistics.wikipedia.technical/40666</link>
    <description>MediaWiki Bugzilla Report for November 17, 2008 - November 24, 2008

Status changes this week

Bugs NEW               :  86                                  
Bugs ASSIGNED          :  6                                   
Bugs REOPENED          :  14                                  
Bugs RESOLVED          :  75                                  

Total bugs still open: 3138                                

Resolutions for the week:

Bugs marked FIXED      :  47                                  
Bugs marked REMIND     :  0                                   
Bugs marked INVALID    :  8                                   
Bugs marked DUPLICATE  :  9                                   
Bugs marked WONTFIX    :  4                                   
Bugs marked WORKSFORME :  6                                   
Bugs marked LATER      :  1                                   
Bugs marked MOVED      :  0                                   

Specific Product/Component Resolutions &amp; User Metrics 

New Bugs Per Component

Un</description>
    <dc:creator>reporter-FUOLwUHO8CAB44msSl0Y14aCUm+kVXi4&lt; at &gt;public.gmane.org</dc:creator>
    <dc:date>2008-11-24T03:00:01</dc:date>
  </item>
  <item rdf:about="http://comments.gmane.org/gmane.science.linguistics.wikipedia.technical/40664">
    <title>$wgRestrictionLevels</title>
    <link>http://comments.gmane.org/gmane.science.linguistics.wikipedia.technical/40664</link>
    <description>I created a new usergroup in my mediawiki install called Staff and I
want to set up a Restriction level so i can page edit
restrictions/protection except the manual page about RestrcitionLevels
&lt;http://www.mediawiki.org/wiki/Manual:$wgRestrictionLevels&gt; doesn't
show much information about it so i was wondering how i would go about
setting it up?

-Peachey
</description>
    <dc:creator>K. Peachey</dc:creator>
    <dc:date>2008-11-24T02:43:40</dc:date>
  </item>
  <item rdf:about="http://comments.gmane.org/gmane.science.linguistics.wikipedia.technical/40660">
    <title>User agent statistics</title>
    <link>http://comments.gmane.org/gmane.science.linguistics.wikipedia.technical/40660</link>
    <description>Hi all,

as Domas' magic stuff(tm) currently gathers article traffic data very
efficient: Could this system be expanded to get a list of user agents
used to browse Wikipedia, sorted by their count?
I think this would be a very cool way to have accurate statistics
about browser usage, not only on Wikipedia.

Thanks,
Marco
</description>
    <dc:creator>Marco Schuster</dc:creator>
    <dc:date>2008-11-23T22:29:53</dc:date>
  </item>
  <item rdf:about="http://comments.gmane.org/gmane.science.linguistics.wikipedia.technical/40636">
    <title>UTF8 Normalization</title>
    <link>http://comments.gmane.org/gmane.science.linguistics.wikipedia.technical/40636</link>
    <description>Hi,

I patched UtfNormal.php to use the new intl extension's
normalization function, http://php.net/manual/en/normalizer.normalize.php

But running UtfNormalTest.php causes 200+ errors, whilst the PHP
normalization routines are error free.

So kind at a standstill wondering what the problem is. I guess WM
use utf8_normalize() which is based on ICU like intl, does than have the
same errors?

Jared
</description>
    <dc:creator>Jared Williams</dc:creator>
    <dc:date>2008-11-22T13:54:40</dc:date>
  </item>
  <item rdf:about="http://comments.gmane.org/gmane.science.linguistics.wikipedia.technical/40619">
    <title>Upload filesize limit bumped</title>
    <link>http://comments.gmane.org/gmane.science.linguistics.wikipedia.technical/40619</link>
    <description>-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

Now that we've got uploads running on our new, beefier file servers,
I've experimentally bumped the upload limit from 20 to 100 megabytes.

Files nearing the high end of that range might not actually succeed,
though, as it'll be hitting post-size limits etc.

As time goes on we'll be improving ways to upload large video files in
particular...

- -- brion
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.8 (Darwin)
Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org

iEYEARECAAYFAkknVv4ACgkQwRnhpk1wk45HaQCgzH3rAHh9cvBUQtSt6FZtO0cF
s+0AoNNBOGGA9S8vFE4AQqALJuCHVtUZ
=YKTr
-----END PGP SIGNATURE-----
</description>
    <dc:creator>Brion Vibber</dc:creator>
    <dc:date>2008-11-22T00:49:02</dc:date>
  </item>
  <item rdf:about="http://comments.gmane.org/gmane.science.linguistics.wikipedia.technical/40608">
    <title>Donating/offering domain wikipedia.ro for use</title>
    <link>http://comments.gmane.org/gmane.science.linguistics.wikipedia.technical/40608</link>
    <description>Dear all,

I am an administrator, bureaucrat and checkuser on the Romanian
Wikipedia. I have contacted the current owner of domain wikipedia.ro
asking him to consider donating the domain to Wikimedia Foundation, Inc.
(there is no local chapter in Romania). He's considering the option, but
in the meanwhile he has offered to allow us use of the domain -- in
other words, he asked for the appropriate nameserver IP addresses he
should associate with that domain (obviously, that should lead to the
content currently served at ro.wikipedia.org). Could that be arranged?
If so, please provide the respective IP addresses so I can pass them on
to him.

In a different train of thoughts, should he agree donating the domain
altogether to WMF, can WMF to take ownership, or is that against any policy?

Best regards,
Gutza
</description>
    <dc:creator>Gutza</dc:creator>
    <dc:date>2008-11-21T20:00:28</dc:date>
  </item>
  <item rdf:about="http://comments.gmane.org/gmane.science.linguistics.wikipedia.technical/40595">
    <title>Correct method of pre-processing article text?</title>
    <link>http://comments.gmane.org/gmane.science.linguistics.wikipedia.technical/40595</link>
    <description>I have an extension which parses the contents of a page to store the content
of certain embedded tags to the database, and I want the parsing to take
place after the pre-processing (comment removal, template expansion, etc.)
I also need the code to be compatible with MW1.6 as I am currently unable to
upgrade to PHP5 (hopefully soon...)

Here is the code I was using until recently (where $Text is the unmodified
article text):

 // Create new Parser object to deal with some transformations that are
 // required before saving.
  $Parser = new Parser();

 // Use the Parser object to strip out html comments, nowiki and pre tags
 // and whatever other bits shouldn't make it through when rendering (so
 // they don't affect saving).
  $ParserOptions = new ParserOptions();
  $StripState =&amp; $Parser-&gt;mStripState;
  $Parser-&gt;mOptions = $ParserOptions;
  $TidyText = $Parser-&gt;strip($Text, $StripState, true);

 // Then replace any variables, parser functions etc. so that 'hidden' tags
 // (e.g. tags that are created by cod</description>
    <dc:creator>Mark Clements (HappyDog</dc:creator>
    <dc:date>2008-11-20T15:16:20</dc:date>
  </item>
  <textinput about="http://search.gmane.org/?group=$group=gmane.science.linguistics.wikipedia.technical">
    <title>Search Engine</title>
    <description>Search the mailing list at Gmane</description>
    <name>query</name>
    <link>http://search.gmane.org/?group=$group=gmane.science.linguistics.wikipedia.technical</link>
  </textinput>
</rdf:RDF>
