Onur Güzel This was a triumph…

WikiLeaks Cablegate Count

Posted on November 29, 2010

Today, I wrote a small script to get the count of released WikiLeaks cables. Actually, it's extremely small. Just two lines of code.

import urllib, piksemel, re

wleaks = int(re.search(r"(\d+) / [0-9,]+", piksemel.parseString(urllib.urlopen("http://cablegate.wikileaks.org/index.html").read()).getTag("body").getTag("div").getTag("div").getTag("a").getTag("p").toString()).groups()[0])

Generally, I don't code like this. But I wanted to write it quick (5 minutes?) and keep it short. It uses piksemel to parse the page.

Enjoy!

How to install piksemel module

Posted on August 9, 2010

piksemel is a easy to use python XML parser, based on iksemel.

Not: If you're using Pardus, piksemel is preinstalled on your system.

Not: If you're using Windows, before you continue make sure you've already installed mingw32 to your system and its directory is included in PATH.

Steps for Linux users:

  • Download the package from http://cekirdek.pardus.org.tr/~bahadir/piksemel/piksemel-1.3.1.tar.gz and unpack it.
  • Run:
    python setup.py build
  • Run:
    python setup.py install

Steps for Windows users:

  • Download the package from http://cekirdek.pardus.org.tr/~bahadir/piksemel/piksemel-1.3.1.tar.gz and unpack it.
  • In piksemel-1.3.1 directory, run:
    setup.py bdist_wininst build --compiler=mingw32
  • Run the installer file under the piksemel-1.3.1/dist directory.