This week, I’ve only found three questions and answers particularly interesting : Has the use of C to implement other languages constrained their designs? Is there a Linux distribution with a pure HTML interface? Problems with manually creating an ELF file.
Content
Tagged: HTML
Parsing (X)HTML in C is often seen as a difficult task. It’s true that C isn’t the easiest language to use to develop a parser. Fortunately, libxml2′s HTMLParser module come to the rescue. So, as promised, here’s a small tutorial explaining how to use libxml2′s HTMLParser to parse (X)HTML. First, you need to create a [...]
Recently, I had to build an (X)HTML parser in C. Instead of re-inventing the wheel, I looked at what already existed. There seems to be only 3 possibilities : libxml2′s HTMLparser libwww HTML Tidy After a quick look around, it seems as if libwww hasn’t been updated since 2002. As for HTML Tidy, it doesn’t [...]