recp/xml

🔋 XML parser for C

This is a simple yet powerful XML parser. It builds a DOM-like data structure that makes it easy to iterate over and process XML objects. It does not allocate any memory for the XML content itself, only for tokens. It also builds the tree without recursion, which keeps construction of the DOM-like structure fast.
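
To make the zero-copy idea concrete, here is a minimal sketch that is independent of this library. A token stores only a pointer and a length into the caller's buffer, so nothing from the XML text is copied. The names `span_t` and `first_tag_name` are hypothetical and are not part of the xml API, and the scan is deliberately naive (for example, it does not handle a `<` inside a comment).

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

/* Hypothetical token: a view into the caller's XML buffer, never a copy. */
typedef struct span_t {
  const char *ptr; /* points inside the original XML text  */
  size_t      len; /* number of bytes, no terminating '\0' */
} span_t;

/* Return the name of the first element tag in `xml` as a view.
   If no element tag is found, the result has ptr == NULL and len == 0. */
static span_t
first_tag_name(const char *xml) {
  span_t      tag = { NULL, 0 };
  const char *p   = xml;

  assert(xml != NULL);

  /* find the first '<' that opens an element (skip <?...?>, <!...>, </...>) */
  while ((p = strchr(p, '<')) != NULL) {
    p++;
    if (*p != '?' && *p != '!' && *p != '/' && *p != '\0')
      break;
  }

  if (!p)
    return tag;

  /* the name runs until whitespace, '/', '>' or the end of the buffer */
  tag.ptr = p;
  while (*p != '\0' && *p != '>' && *p != '/' && *p != ' '
         && *p != '\t' && *p != '\n' && *p != '\r')
    p++;
  tag.len = (size_t)(p - tag.ptr);

  return tag;
}
```

Because the returned span is not null-terminated, callers have to carry `len` along with `ptr` whenever they compare or copy it.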

Documentation

Almost all functions (inline versions) and parameters are documented inside related headers.
Complete documentation: TODO.

Features

  • header-only or optional compiled library
  • option to store members and arrays in reverse or normal order
  • option to separate the XML tag prefix
  • doesn't allocate memory for keys and values, only for tokens
  • creates a DOM-like data structure that is easy to iterate through
  • simple API
  • provides utility functions to print XML and to get int32, int64, float, double...
  • very small library
  • unique way to parse XML (check the Object Map section)
  • helpers to get string nodes and primitive values (int, float, bool) for both attributes and values

Object Map

Here is a VERY UNIQUE and VERY COOL and VERY EASY and VERY FAST way to parse known XML: objmap.

void
callback_1(xml_t * __restrict xml, void * __restrict obj) {
  printf("entered callback_1\n");
}

xml = xml_parse(/* XML string */, true, true);

xml_objmap_t objmap[] = {
    {
      .key = "key1",
      .foundFunc = {
        .func  = callback_1,
        .param = "callback 1 param" 
      }
    },
    {
      .key = "key2",
      .foundFunc = {
        .func = callback_1
      }
    }
};

/* or you can use the macro helpers, which are more readable if you don't need more details: */
xml_objmap_t objmap[] = {
    XML_OBJMAP_FN("key 1", func1, param1),
    XML_OBJMAP_FN("key 2", func2, param2),
    XML_OBJMAP_FN("key 3", func3, param3),
    /* ... */
};

xml_objmap_call(xml, objmap, ARRAY_LEN(objmap), NULL);

/* or use this to iterate objmap manually */
xml_objmap(xml, objmap, ARRAY_LEN(objmap));

This way you don't have to compare keys in a loop; just map each key to a function or to userdata. You don't have to use functions: you can also map an XML object to userdata, which may be a goto label (to use computed gotos) or something else.
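
The mapping idea can be sketched without the library. The block below is an illustration only: `node_t`, `map_entry_t`, `map_call` and `count_hit` are hypothetical stand-ins, not the library's `xml_t`, `xml_objmap_t` or `xml_objmap_call`, and how the library matches keys internally is not described here. The sketch uses a plain linear, length-aware comparison hidden inside one helper, so the calling code only declares the key-to-function table.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

/* Hypothetical stand-in for a parsed node; NOT the library's xml_t. */
typedef struct node_t {
  const char    *tag;     /* not null-terminated in general */
  size_t         tagsize; /* number of bytes in tag         */
  struct node_t *next;    /* next sibling                   */
} node_t;

typedef void (*found_fn)(const node_t *node, void *param);

/* Hypothetical map entry; NOT the library's xml_objmap_t. */
typedef struct map_entry_t {
  const char *key;   /* null-terminated key to match            */
  found_fn    func;  /* called when a node with this tag exists */
  void       *param; /* user data forwarded to func             */
} map_entry_t;

/* Walk the sibling list once and call the mapped function for every node
   whose tag equals an entry's key. Returns the number of calls made. */
static size_t
map_call(const node_t *node, const map_entry_t *map, size_t count) {
  size_t i, calls = 0;

  assert(map != NULL || count == 0);

  for (; node; node = node->next) {
    for (i = 0; i < count; i++) {
      if (strlen(map[i].key) == node->tagsize
          && memcmp(map[i].key, node->tag, node->tagsize) == 0) {
        if (map[i].func) {
          map[i].func(node, map[i].param);
          calls++;
        }
        break; /* first matching entry wins */
      }
    }
  }

  return calls;
}

/* Example callback: counts how many times it was invoked. */
static void
count_hit(const node_t *node, void *param) {
  (void)node;
  (*(int *)param)++;
}
```

Passing a counter, a struct to fill in, or any other pointer as `param` is what lets one callback serve several keys.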

Important Note for tags and values

  • xml doesn't copy keys and values; it only gives you pointers to them. So when comparing keys or copying values, you must use tagsize or valsize. Or you can use the built-in inline functions.
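
A minimal sketch of what length-aware handling looks like in plain C follows. The helpers `key_eq` and `val_dup` are hypothetical and only illustrate the rule above; in real code the pointer and size would come from the node's tag/tagsize or value/valsize, or you would call the library's own inline helpers instead.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdlib.h>
#include <string.h>

/* Compare a non-null-terminated key of `size` bytes with a C string.
   strcmp() would read past the key, so both length and bytes are checked. */
static bool
key_eq(const char *key, size_t size, const char *str) {
  assert(key != NULL && str != NULL);
  return strlen(str) == size && memcmp(key, str, size) == 0;
}

/* Copy `size` bytes of a value into a new null-terminated string.
   The caller owns the result and must free() it. Returns NULL on failure. */
static char *
val_dup(const char *val, size_t size) {
  char *copy;

  assert(val != NULL);

  copy = malloc(size + 1);
  if (!copy)
    return NULL;

  memcpy(copy, val, size);
  copy[size] = '\0';
  return copy;
}
```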

TODOs

  • provide header-only library and optionally compiled version
  • provide option to preserve array order (currently array order is reversed because it is easier to parse this way; this may change, so please follow new commits or releases)
  • provide option to separate tag prefixes
  • windows build
  • documentation
  • handle or ignore comments?
  • cmake?
  • tests
  • extra optimizations
  • usage in detail
  • Unicode support (UTF-8)
  • null object
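
The note about reversed array order can be illustrated with a small sketch. It assumes, without confirmation from the source, that items are linked by inserting each new one at the head of a singly linked list, which is cheap but yields reverse order. `item_t`, `push_front` and `reverse` are hypothetical names and not part of the library.

```c
#include <assert.h>
#include <stddef.h>

typedef struct item_t {
  int            id;
  struct item_t *next;
} item_t;

/* Head insertion: O(1) and needs no tail pointer, but the list ends up
   in the reverse of the order in which items were seen. */
static item_t *
push_front(item_t *head, item_t *item) {
  assert(item != NULL);
  item->next = head;
  return item;
}

/* Reverse a singly linked list in place to restore the original order. */
static item_t *
reverse(item_t *head) {
  item_t *prev = NULL, *next;

  while (head) {
    next       = head->next;
    head->next = prev;
    prev       = head;
    head       = next;
  }

  return prev;
}
```

Preserving order therefore costs either one extra pass like `reverse` or a tail pointer kept during parsing.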

Build

Unix (Autotools)

sh autogen.sh
./configure
make
[sudo] make install

You can grab the library from the .libs folder after the build has finished.

Windows (MSBuild)

Windows-related build and project files are located in the win folder; make sure you are inside it before building. Code Analysis is enabled, so the build may take a while.

cd win
.\build.bat

CMake

todo.

Header-only or Compiled Library

Functions with the xmlc_ prefix are the compiled versions, which are called from the library. To use them you must include the xml/call/xml.h header.

To use the header-only library you must include the xml/xml.h header. Functions with the xml_ prefix are forced to be inlined. When you use these, you don't have to compile the library.
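
The general pattern behind this split can be sketched in a single file. This is an illustration under assumptions, not the library's actual source: the `demo_` and `democ_` prefixes and the function itself are hypothetical and merely mirror the xml_ / xmlc_ naming.

```c
#include <assert.h>
#include <stddef.h>

/* What would live in a header: the inline implementation.
   Every translation unit that includes it gets its own copy. */
static inline size_t
demo_count_lt(const char *text) {
  size_t n = 0;

  assert(text != NULL);

  for (; *text; text++)
    n += (*text == '<');

  return n;
}

/* What would live in a compiled .c file: an exported wrapper with a
   different prefix, so callers can link against it instead of inlining. */
size_t democ_count_lt(const char *text);

size_t
democ_count_lt(const char *text) {
  return demo_count_lt(text);
}
```

Both entry points run the same code; the choice is between inlining at every call site and calling into a shared library.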

todo.

Example usage

You can inspect xml_print() to view usage in more detail. The example will be updated later to give more detail.

#include <xml/xml.h>
#include <xml/print.h>

int main(int argc, const char * argv[]) {
  xml_doc_t *doc;
  xml_t     *root;
  
  doc  = xml_parse(/* XML string */, true, true);
  root = doc->root;

  xml_print_human(stderr, root);

  xml_free(doc);

  return 0;
}

const xml_doc_t *xmlDoc;
const xml_t     *xml;

xmlDoc = xml_parse(/* XML string */, true, true);
xml    = xmlDoc->root->value;

/* already defined in util.h */
XML_INLINE
bool
xml_tag_eq(const xml_t * __restrict obj, const char * __restrict str);

/* iterate over the children of the root element */
while (xml) {
    if (xml_tag_eq(xml, "tag 1")) {
      int aNumber;

      aNumber = xml_int32(xml, 0);

      /* ... */
    } else if (xml_tag_eq(xml, "tag 2")) {
      const char *nonNullTerminatedString;
      const char *nullTerminatedString;

      /* just a pointer into the XML text, not null-terminated */
      nonNullTerminatedString = xml_string(xml);

      /* null-terminated copy (strdup), needs to be freed */
      nullTerminatedString    = xml_string_dup(xml);

      /* ... */
    } else if (xml_tag_eq(xml, "tag 3")) {
      const xml_t *aChild;

      aChild = xml->value;
      while (aChild) {
          /* handle child node */
          aChild = aChild->next;
      }
    }

    xml = xml->next;
}
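
Since values are not null-terminated, a numeric getter has to work on a pointer plus a length. The sketch below shows one way to do that in plain C. `span_int32` is a hypothetical helper, not the library's `xml_int32`, and treating the last argument as a fallback value is an assumption made for illustration.

```c
#include <assert.h>
#include <errno.h>
#include <stdint.h>
#include <stdlib.h>
#include <string.h>

/* Parse `size` bytes at `val` as a base-10 int32.
   Returns `defval` if the text is empty, too long, not a number,
   has trailing characters, or is out of range. */
static int32_t
span_int32(const char *val, size_t size, int32_t defval) {
  char  buf[16]; /* enough for "-2147483648" plus terminator */
  char *end;
  long  n;

  assert(val != NULL || size == 0);

  if (size == 0 || size >= sizeof(buf))
    return defval;

  /* strtol() needs a terminator, so parse a bounded stack copy */
  memcpy(buf, val, size);
  buf[size] = '\0';

  errno = 0;
  n     = strtol(buf, &end, 10);
  if (end == buf || *end != '\0' || errno == ERANGE
      || n < INT32_MIN || n > INT32_MAX)
    return defval;

  return (int32_t)n;
}
```

The small stack buffer avoids a heap allocation per value, which fits the parser's goal of not allocating memory for keys and values.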

License

MIT. Check the LICENSE file.