How it works
Document Assistant is implemented as a server-side itembase listener, which monitors user-defined Expressroom directories for new or updated source files. When triggered by these source files, Document Assistant will transform the files to site-specific XML in a process that may include one or more steps, or transforms. The ability to transform documents over the course of multiple steps allows site implementers to build powerful conversion processes, using smaller components that are easier to build, configure, manage, and reuse.
Each set of transformations is defined in a chain. These chains define the source directory and file extension the Document Assistant is "listening" for, the specific transforms to run, and the destination directory and filename extension to which to write the results.
Document Assistant can monitor multiple directories concurrently, and a single update to a file can trigger several concurrent transformations, as necessary. The result is the information rendered on pages on your Web site.
Available transforms
Here are the six primary transforms that are included with Document Assistant:
• native—Transforms from most desktop application formats, such as Microsoft Word, WordPerfect, and others to an internal Flat XML format.
• xsl—Transforms an XML document using an XSLT stylesheet.
• cmd—Executes a command line command to perform a transformation. Useful when using a third-party application that does not have a compatible API.
• custom—Performs a transformation by calling a custom built Java class.
• batik—Converts images from one format to another using the Apache Batik library.
• copy—Copies files from one location to another.
Extending Document Assistant
The Transform Java interface is available and customers can build their own transforms to meet their own business-specific needs.
What you’ll need to know
In most cases, you will have to configure the transforms for a chain. This implies having a solid understanding and comfort level with XML overall and specifically with XSLT. Java is not required unless you’re going to build a custom transform.
See it in action
Take our online demonstration for a test drive at http://www.industrialmedium.com/das/... it marries our Expressroom Document Assistant Service with the Calais Web Service from Reuters to generate and embed rich semantic metadata in your Microsoft® Word documents, returning the regenerated and enhanced document for your use.

