Document-level Identification

Top  Previous  Next

The table below briefly describes the EIGHT main data elements which UnForm uses to identify documents at the document level. With the exception of the date/time stamp, and some character-separator rules enforced by UnForm on some of the fields, the data format for each of these text elements is user-configurable. A significant part of the administrator's implementation process is to design a document identification structure for the archive which will meet the enterprise's needs over a meaningful period of time.

 

Document Type

First segment of the primary document identifier-key.
Maximum 20 characters
Null value allowed

 

Example document types from our sample libraries:

 

demo_sales

demo_accounting

demo_purchasing

"ArStatement"
"Invoice"

"Order"
"Quote"

"ApAging"
"ApCheck"
"ArAging"
"GLDailyDetail"
"OpSalesRegister"

"PurchaseOrder"

 

Document ID

Second segment of the primary document identifier-key.
Maximum 20 characters
Null value NOT allowed. However, the null is trapped by UnForm and replaced with a unique, date-oriented sequence number.

 

The combination of Document Type and Document ID form a unique identifier-key to a document within a library.

 

Note that libraries are distinct units to each other so document identifier-keys are only unique to a library, in other words, identical document identifier-keys can exist in two distinct libraries without over-writing.

 

Categories

Category indexes provide segmented keys to help navigate to specific document collections within a library.  For example, a category index could be based on customer name and customer PO.  If orders, packing lists, and invoices were all keyed with the same customer|PO category, they would provide a convenient method of collecting these related documents into a single, browsable or searchable index.

 

There can be any number of category indexes assigned to each document, and each index can contain up to ten segments, allowing much flexibility in custom indexing.  When editing or creating categories, the system uses pipe symbols (|) to delimit segments.

 

If a given category segment will potentially contain many thousands of items, it may be desirable to divide the segment into two tiers.  For example, if a customer name is used as a segment, and there are thousands of customers, a two-tiered customer name could be designed, such as left(custname$,2) + "|" + custname$.  During browsing, the user would first locate the customer alphabetical group based on the first two characters of the name, then access just that sub-group of customers.

 

Document Title

A broad general description of the document, sometimes composed of several major data values that help distinguish the document from other similar documents.

 

Keywords

Additional document identifiers that can help narrow and limit searches to locate documents and groups of documents, improving search efficiency.  Keywords are semi-colon delimited words or phrases.  Often they are auto-generated from the content of the job submitted for UnForm processing.  When keywords are auto-generated, the generation is subject to configuration rules found in the [archive] section if the uf100d.ini file.  Keywords are indexed, to provide fast searching and browsing.

 

Links

A list of links to other documents, either in the archive system or external to it.  This list is displayed in the web browser interface when viewing the document.  The list is semi-colon delimited, with each link being one of these formats:

A URL, such as http://acme.com, or a complete link to an UnForm browser page.  If it starts with http or ftp, it can be prefixed with title= to specify a title for the browser to display.
A pipe-delimited structure that identifies the library, document type, document ID, and optionally image sub ID.  The structure is library|type|docid|subid, with the |subid portion being optional.

 

Entity ID

A security data element which can be included with a document and/or user account to filter access to specific documents or groups of documents to login user accounts which carry access authorization referencing the same entity ID.

The concept of the Entity ID is one of ownership, designed for situations where external web access to documents in a shared library needs to be restricted to the entity specified. For example, where customer XYZ can login and browse, list and view invoices for customer XYZ without ever seeing documents for other entities listed.

If documents are written to the archive with an empty entity ID field, then any user account with an empty entity ID will pass the entity test for access to a record. In an environment with empty entity ID field on documents in the archive, simply assigning a user any non-blank entity ID value can be used to restrict access to all documents in the archive.

 

Date / Time

The date/time stamp of a document is used as a secondary sort-key in the library to allow a by-date browse-method drill-down to locate documents.  The date/time value can be maintained by the UnForm rule set used to archive the document, or via a command line option.  It defaults to the initial date and time the document was added.

 

Additional dateupdated and timeupdated fields are maintained each time the document is updated.

 

Notes

Free-form text notes can be stored with a document and can be edited in the document properties box of a located document.