New Feature CORE-29
1 vote

Provide an improved RDFa 1.1 Parser

Created by Rob Vesse on 7/26/2011 12:46 PM
Last Updated by Rob Vesse on 1/15/2013 4:12 PM
Logged: 0  (hrs)

 Description

The current RDFa parser has several flaws:

  1. It is highly recursive and can hit an OutOfMemory or StackOverflow exception on heavily nested HTML
  2. It is limited to HTML only
  3. Its code is complicated by the need to support both RDFa 1.0 and RDFa 1.1, which differ significantly
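
To illustrate the first flaw, here is a minimal sketch of walking an element tree with an explicit stack instead of recursion, so the depth of the markup no longer maps onto the call stack. It is not the existing parser's code. The Element class, the walk function and the build_nested helper are hypothetical names, and the inherited "about" subject is only a simplified stand-in for the full RDFa evaluation context.

```python
class Element:
    """Hypothetical, minimal element node (not a real parser's type)."""
    __slots__ = ("name", "attrs", "children")

    def __init__(self, name, attrs=None, children=None):
        self.name = name
        self.attrs = attrs if attrs is not None else {}
        self.children = children if children is not None else []


def walk(root, base_subject=None):
    """Yield (element, subject) in document order without recursion.

    The subject is the nearest 'about' value on the element or its
    ancestors. This is a simplification, assumed for illustration only.
    """
    stack = [(root, base_subject)]
    while stack:
        element, inherited = stack.pop()
        subject = element.attrs.get("about", inherited)
        yield element, subject
        # Push children in reverse so the first child is popped first.
        for child in reversed(element.children):
            stack.append((child, subject))


def build_nested(depth):
    """Build a chain of elements nested 'depth' levels below the root."""
    root = Element("div")
    current = root
    for _ in range(depth):
        child = Element("div")
        current.children.append(child)
        current = child
    return root
```

With this shape, the memory used for traversal state lives in an ordinary list on the heap, so very deep nesting grows that list rather than the call stack.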

Instead, a new abstract RDFa Core 1.1 parser will be developed, into which different host languages (HTML, XML, SVG) can be plugged as desired.  The parser should be pure RDFa 1.1 and invoke the old RDFa 1.0 parser only where the specification says it should.
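
One way the pluggable design could look is sketched below. This is a toy under stated assumptions, not the planned implementation: HostLanguage, XmlHost, HtmlHost and CoreParser are hypothetical names, the core handles only the about, property and content attributes on a flat list of elements, and the only host-specific rule shown is how the language of a literal is found (assumed here to be xml:lang for XML hosts, and xml:lang preferred over lang for HTML hosts).

```python
from abc import ABC, abstractmethod


class HostLanguage(ABC):
    """Hypothetical plug-in interface: only host-specific rules live here."""

    @abstractmethod
    def language(self, attrs, inherited):
        """Return the language in effect for an element, or 'inherited'."""


class XmlHost(HostLanguage):
    def language(self, attrs, inherited):
        return attrs.get("xml:lang", inherited)


class HtmlHost(HostLanguage):
    def language(self, attrs, inherited):
        # Assumption: xml:lang is preferred over lang when both are present.
        return attrs.get("xml:lang", attrs.get("lang", inherited))


class CoreParser:
    """Host-agnostic core; a toy that handles about/property/content only."""

    def __init__(self, host):
        self.host = host

    def parse(self, elements, base):
        """Return (subject, property, literal, language) tuples.

        'elements' is a flat list of attribute dicts; nesting and the
        rest of the RDFa processing rules are deliberately left out.
        """
        triples = []
        for attrs in elements:
            if "property" in attrs and "content" in attrs:
                subject = attrs.get("about", base)
                lang = self.host.language(attrs, None)
                triples.append(
                    (subject, attrs["property"], attrs["content"], lang)
                )
        return triples
```

The point of the split is that the core never inspects host-specific attributes itself, so supporting another host such as SVG would mean adding one more HostLanguage subclass rather than touching the core.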

    Rob Vesse (Tuesday, January 15, 2013 4:12 PM) #

My intention, if I ever get around to this, is to port the state-of-the-art, 100% compliant Java RDFa parser from the Semargl project. It is ALv2 licensed, so we can port it without any legal issues.

    Rob Vesse (Wednesday, May 23, 2012 6:55 PM) #

Moving to the backlog as this is not a high priority for me.

    Rob Vesse (Friday, February 03, 2012 5:59 PM) #

Bumped to 0.6.1 because RDFa is just annoying to parse correctly, the 1.1 spec has changed since I last coded on this, and I can't bring myself to read the damn thing again.