बातम्या

As for regexes: lxml does have some extensions that use regexes, using the EXSLT namespace (bottom of that section). Or, you can write a simple Python function to parse the field however you like.