Content

by Laurent Parenteau

Tagged: HTML

2010-08-24 :: Laurent Parenteau // tips & tricks
Questions/Answers : Digest 0×0004

This week, I’ve only found three questions and answers particularly interesting : Has the use of C to implement other languages constrained their designs? Is there a Linux distribution with a pure HTML interface? Problems with manually creating an ELF file.

Tagged: » »

 » Read the rest

2009-12-03 :: Laurent Parenteau // tips & tricks
Parsing (X)HTML in C – A libxml2 tutorial

Parsing (X)HTML in C is often seen as a difficult task.  It’s true that C isn’t the easiest language to use to develop a parser.  Fortunately, libxml2′s HTMLParser module come to the rescue.  So, as promised, here’s a small tutorial explaining how to use libxml2′s HTMLParser to parse (X)HTML. First, you need to create a [...]

Tagged: » » » »

 » Read the rest

2009-10-30 :: Laurent Parenteau // tips & tricks
Parsing (X)HTML in C

Recently, I had to build an (X)HTML parser in C.  Instead of re-inventing the wheel, I looked at what already existed.  There seems to be only 3 possibilities : libxml2′s HTMLparser libwww HTML Tidy After a quick look around, it seems as if libwww hasn’t been updated since 2002.  As for HTML Tidy, it doesn’t [...]

Tagged: » » »

 » Read the rest