Posts tagged: parsoid

Measuring the length of Wikipedia articles

There was recently a request to generate a report of featured articles on Wikipedia, sorted by length, specifically the "prose size". It's pretty straightforward to get a page's length in terms of the wikitext or even the rendered HTML output, but counting just the prose is more difficult. Here's how…

Two steps forward, one step back for mwbot-rs

I was intending to write a pretty different blog post about progress on mwbot-rs but...ugh. The main dependency of the parsoid crate, kuchiki, was archived over the weekend. In reality it's been lightly/un-maintained for a while now, so this is just reflecting reality, but it does feel like a huge…

uprightdiff 1.4.0

I just tagged the 1.4.0 release of uprightdiff, a utility to diff browser screenshots used for testing visual differences caused by changes to #MediaWiki's parser. The new version is now officially compatible with opencv4.https://www.mediawiki.org/wiki/Uprightdiff