Overview | How to Run | Capabilities | Form Representation | Program Structure
This is a small project aiming to simplify the process of transferring forms from Integrify to Orbeon. Because this process regularly involves much clicking, copying, pasting, and reconfiguration of generic parameters, orbify reduces this waste of time and allows Workflow Engineers to focus on translating the more-complicated logic/layout features between the platforms.
Below I detail what exactly this tool can do for you, and how to use it, along with some development notes. Note, the program is specialized in a few places for FTD Solutions, but can be used and extended by others.
Orbify is designed to require minimal effort to use. In order to run this script, you must
- Install Python 3.10.x or greater.
- Get the python module pyperclip.
pip install pyperclip
- Clone this repository.
git clone git@github.com:MatousAc/orbify.git
- Navigate into the orbify repository on the command line.
- Copy the JSON you want translated to your clipboard. You can just open Integrify's View JSON modal and then hit
Ctrl + A, Ctrl + C
to select and copy all the JSON on the page. This selects a bit more than just the JSON in the modal, but Orbify should ignore anything that comes before the JSON. - Run
orbify.py
in this repository.
python orbify.py
- Enter the form's name (sadly, this isn't stored in the JSON) and hit
Enter
. Orbify then uses the JSON on your clipboard to create XML. - The XForms source code will then be copied to your clipboard and the control names for all data fields will be printed out in the terminal. Paste this source code into Orbify's "Edit Source" modal and click
Apply
.
Just about all pieces of the layout are translated. The only significant difference is that in integrify there are only columns inside containers, while Orbeon's grids have uniform columns and rows. This can cause some minor horizontal misalignments which can be adjusted by hand.
Shown below, the layout elements in every row serve generally the same functionality.
Integrify | Orbeon |
---|---|
section | section |
container | grid |
column | <fr:c/> |
field | field |
Orbify translates the following fields from Integrify (left) to Orbeon (right) without known bugs:
Integrify | Orbeon |
---|---|
short text | text input |
long text | text area |
search box | text input or static select with service and action (FTD API) |
hyperlink | text input with link regex |
rich text | formatted text input |
number | number input or currency input if currency detected |
email address | email input |
contact search | static select with service and action using FTD API |
calendar | date input |
radio button | radio input |
checkbox | multi-select checkboxes |
select list | static dropdown or static select with service and action (FTD API) |
(multi) file attachment | file attachment (multi is not available in Orbeon PE 2019 |
password | secret input |
signature | handwritten signature |
form text | explanatory text |
horizontal line | explanatory text containing <hr/> tag |
image | blank space |
blank space | just a placeholder in the grid. no field associated |
There are special cases:
- When a radio select or checkbox input has only the options "yes" and "no" (or true/false, case insensitive), it is translated into a boolean input, known as YesNo input in Orbeon. This is done because the intent of the input is clear, yet misconfigured.
- Images get a placeholder. because the data for them is not included in the JSON and downloading and uploading them to Orbeon has not yet been implemented. However, FTD Logo Images are common enough in FTD forms to hardcode and include.
- Phone numbers are detected based on field labels ("Cell Phone", "Phone #", "Work Phone", etc . . .) and are translated into text inputs with special phone-number-like constraints and rigid formatting. These allow for up-to 13 digit phone numbers (international format).
- Integrify Grid translation to Orbeon is not currently supported. Grids/table inputs are replaced with blank spaces in Orbeon.
Below are attributes on fields and/or containers and their respective names in Integrify and Orbeon. Some are supported, while others are not.
Integrify | Orbeon | Supported? |
---|---|---|
required | required | |
readonly | readonly | |
hidden | visibility/relevancy | |
show | visibility/relevancy | X |
digits-after-decimal | digitsAfterDecimal | |
Allowed File Types | Supported Mime Types | X |
This section simply explains how forms are represented in both of the two systems.
Integrify uses a pure JSON representation to keep track of its forms. There isn't much complexity to their setup. Below, it is hierarchically broken down with the json representation italicized and the logical meaning emboldened:
- an Integrify form is just JSON. At the top level, it is a list of objects
- each object is a section, the building block of a form
- each section's "contents list" contains container objects
- if there are two objects in the column list, then this container has two columns. if there's one object, then there is just a single column in this container (common sense)
- inside each container's columns are item objects. each item at this level corresponds to a field in the form
- fields are objects that contain the field's attributes. these attributes include things like
- QuestionType (ShortText, FileAttatchment)
- Label (Job Title, Acceptance Letter)
- Class (for CSS)
- show
- isdirty
- id
- validation {required, minMessage, . . .}
Note that there are structures (such as sections, and columns) and attributes (Label, show, validation) that we need to retain, and others we won't really care to try to translate.
Also, some fields for Orbeon are calculated (any control name can be easily based on "Label" or element count).
Notice that Integrify's JSON notation does not contain the name of the form itself, so this is collected at runtime.
Orbeon uses XForms to represent its forms. XForms is an XML-style representation that leverages what is called the "MVC approach".
The model describes the form data and sets up constraints, submission-types, API calls, and form resources.
The view describes what controls appear in the form, how they are grouped together, and what data they are bound to.
The very first tag is the xh:html tag, containing various imports from w3, Orbeon, and Saxon. This first tag is just about identical in each form and is handled using a simple string in Python.
To a high degree, all of the following sections mirror the form structure. The model primarily deals with data and data-binding, while the view will deal a lot more with placement and such.
The model is contained inside the head tag alongside a title (I think there can be more than one model per document). The model contains instances which are basically groupings of different types of information about all elements in the form.
There are five instances: control names, bindings, metadata, attachments, and resources.
The first "instance" contains a self-closing tag for all the forms "control names"
<control_name/>
except form images and attachment fields include more information in their tags.
We start with the generic
<xf:instance id="fr-form-instance" xxf:exclude-result-prefixes="#all" xxf:index="id">
on the outside, and then the hierarchy is as follows:
xf:Instance > form > section_control > grid_control > field_control
Next is the xf:bind section, so I'll call this the "bind." It generically starts with
<xf:bind id="fr-form-binds" ref="instance('fr-form-instance')">
Inside this are tags that, again, mirror the form structure, except this time they only include form fields and "bind" the control names to a "name," "ref," and "type." The tag format is as follows:
<xf:bind id="control_name-bind" ref="control_name" name="control_name" type="xf:datatype"/>
where datatype
can be boolean, anyURI, date, decimal, etc. Fields such as radio buttons, checkboxes, or dropdowns are no different from text fields with regards to how they store data, so datatype isn't specified for them here. They are just text in the background. In our program, however, they will each definitely have their own representation.
The datatype is optional and not included in section or grid tags. Basic text input don't indicate a datatype, but typically include an xxf:whitespace="trim" attribute.
Constraints can also be set here. For instance, the currency datatype seems to be denoted by type="xf:decimal" constraint="xxf:fraction-digits(2)"
. Also, this is where other data attributes such as required and relevant (visible?) may go.
Next there is another instance tag this time of the form:
<xf:instance id="fr-form-metadata" xxf:readonly="true" xxf:exclude-result-prefixes="#all">
This appears to provide metadata about the form and isn't difficult to generate as it's layout is completely static. The form attachments instance tag, likewise, appears to be simple. It is invariably empty regardless of what attachments may be in the form.
<xf:instance id="fr-form-attachments" xxf:exclude-result-prefixes="#all">
<attachments/>
</xf:instance>
The final piece of the model is the resources instance. This section seems to keep static data for the form that is only "for display" including form text, and checkbox/radio options. We have the following tags:
<xf:instance xxf:readonly="true" id="fr-form-resources" xxf:exclude-result-prefixes="#all">
<resources>
<resource xml:lang="en">
Following this are tags for each piece of data in the form:
<control_name>
<label>Control Name</label>
<hint/>
</control_name>
Inputs with choices, such as radio buttons and checkboxes, have items in the resources section:
<item>
<label>Option</label>
<hint/>
<value>option</value>
</item>
After this instance, the XForms model and head tags close.
This section begins with the generic nested tags
<xh:body>
<fr:view>
<fr:body xmlns:xbl="http://www.w3.org/ns/xbl" xmlns:p="http://www.orbeon.com/oxf/pipeline" xmlns:oxf="http://www.orbeon.com/oxf/processors">
Within this there are tags for every section of the form:
<fr:section id="control_name-section" bind="control_name-bind">
and every grid in the same format. Every data field is contained within an fr:c tag. This tag places data fields within the grid. The format is:
<fr:c x="#" y="#" w="#" h="#">
It appears that XForms has borrowed a bootstrap-style grid scheme (or the other way around).
Form translation is orchestrated in orbify.py.
- First the form's name is collected and the JSON is extracted from whatever is on the clipboard
name = input("Form Name: ") src = extractJSON(pyperclip.paste()) src = json.loads(src)
- Next, the form object is created.
JSON is retrieved from the clipboard and parsed. Then the form creates section objects, with in turn create grids, and so on and so forth. Each form "block" contains objects within it that are iteratively constructed using parsed JSON objects.
form = Form(name, src)
Eventually, we haveForm
>Section
>Grid
>Place
-Field
where each level block contains multiple smaller blocks except each place contains only one field.
This might feel like overkill, but it is actually a very useful and scalable code representation for continuous development. - The second line of code
constructs a string with all the XML/XForms source code in it. From a developer's perspective, one would hope to have all the code that generates the model's resources in one place, while code that generates the view would be elsewhere. Although there are other ways to organize (possibly by field and class type, because of the hierarchical nature of XML, I chose to use the visitor design pattern.
result = gen_xhtml(form)
This is a slightly more complex programming paradigm, so I won't explain it here, but it's proved to work extremely well for this use case. - Finally,
prints out all the data field's control names and
for cn in fieldControls: print(cn)
places the source code for the form on your clipboard - ready to paste into Orbeon.pyperclip.copy(result)