Tuesday, January 26, 2016

The Role of MISC in ECM


I used to loathe working on categorization projects because I knew eventually someone would say, “I don’t know, put it in a miscellaneous folder”. This meant that the overall design of the categories was flawed or that we didn’t have enough time and energy to work out every minute detail, only to have it change in a few months anyways.

Flexible categorization makes sense, but the tools are still designed to tag content with fixed values. Big Data solutions might eventually help with this if you have millions of dollars to spend on them. For small to medium sized systems, we are stuck with good old fashioned indexing and search. However, this might prove to be a better long-term solution to content mining. I still believe the better the metadata, the better search results.

The Misc folder suits a number of different purposes:

In general, this folder can be used to analyze new trends in metadata values, that is, some patterns of values will become apparent as more content goes there. Over time, the patterns will become folders/categories and there metadata values will become part of the indexing process. Likewise, categories that are almost empty will be merged with others because their index values are too restrictive.

In taxonomy, a miscellaneous folder is a black box, something that gets all that is outside the scope of the people working on it at the time. Emphasis on “at the time” here because as ways of organizing information changes, so goes the taxonomy.

In workflow, miscellaneous really means a place/bucket where all the routing mistakes are sent, or more likely where any new unanticipated content types go. This works well as it is obvious as the content builds up which queue they should be routed too. The alternative would be to ignore the outliers which would leave them for discovery projects in the future.


Monday, January 18, 2016

Drivers and Followers and ECM

Futurama (New York World's Fair)

In The Option of Urbanism, Chris Leinberger, describes how there are “drivers” and “followers” in the product types of real estate. For example, a driver would be residential home development and a follower would be a mall constructed close to where the new homes are built.

The same is true for enterprise content management. For example, a driver in healthcare applications would be an EMR, and the follower would be all the other applications, except registration and billing. EMRs were late to the game, but are now front and center. ECM systems, like Hyland’s Onbase, fill in the gaps as a follower in the long line of other applications.

Gaps occur over time as applications are moved in and out of favor, CIO’s change things up to reduce costs, requirements change, and priorities are mandated. Gaps are filled with invoice scanning solutions for paper processing. When regulations change, ECM has the flexibility to meet the requirements until the larger “driver” systems and adjust.

Hospital merger and acquisitions are “drivers” which spur multiple content and information migrations. The “followers” help clean up the information messes caused by disparate systems. Content types and metadata have to be mapped and reclassified. Who does this? ECM applications and professionals are well versed in these types of situations.


Understanding ECM’s flexibility and its major role as an IT “follower” is essential when making strategic decisions on where to deploy it. It innovates when “drivers” need it to. It fills in the gaps quickly to provide the needed bridge for unforeseen obstacles of business change. It’s easy to get excited about innovation “drivers” and much harder to involve the “follower” applications in this excitement.