Posted tagged ‘search’

Making the switch from navigating to searching with help from ICC and Regex.

September 29, 2013

Many companies are still making that transition from shared drives (who has never had a S:\ or H:\drive!?!?!?) to Document Management System (DMS) or full blown Enterprise Content Management (ECM) systems. There are many reasons for making the switch from overloaded hardware to new business demands but a key point in many of these systems is how the end user uses the system.

Lots of shared network drives are a prime example of content chaos with no naming or folder standardization and users left to create their own folders. However, some more well thought out network shares have a semi-structured foldering system with maybe  a base template of a folder structured which is copied and pasted for new projects, claims, matters or applications.

filesystem

 

Whatever the structure or lack of it on a shared file system it is generally a case of users having to browse for the content they are after. What happens if you can’t find that vital document you worked on 3 months ago? You always have the option to search but then you are presented with the dozens if not hundreds of hits the full text search brings back. I think of this type of use case as discovery – users are having to discover what they are looking for rather than being able to pinpoint it straight away. More on this topic here at a previous post: Difference between search and discovery.

With this in mind it is important for our end users to realize that any migration to a new DMS or ECM system demands a different way of working – hopefully a smarter and more efficient way of working. Although, sometimes DMS or ECM system are implemented badly and mimic the folder browsing approach which seems crazy in today’s world with the content explosion. Saying that I am sure there are cases for the old style folder browsing such as case management solutions that have adhoc document collections.

We have established the source system disadvantages, the benefits are new target system will bring and determined that we have a semi-structured foldering system which could be used to place some categorization and property values to our content in the new system. Up steps IBM Content Collector (ICC)!

I am no expert with ICC but I love it’s module design and flexibility it provides for ingesting content from a variety of sources to a repository. You don’t need to be a programming genius to achieve some great results but how do we determine index information based on folder names in document file paths? In short we are looking for patterns in a string and what better way that using Regular Expressions….groan I hear you sigh! I was never a fan of Regular Expressions mainly because it looked like hieroglyphics however after spending sometime on a number of projects and getting into the weeds I have changed my mind and realize how powerful they can be. Saying that I will likely forget everything I have learnt in a couple of months.

Below is a screenshot of how to build Regular Expressions into your ICC Task Route. I haven’t detailed the Regular Expressions used as that is a topic all on its own but will post again on typical expressions and how they can be combined with ICC Lists to provide some powerful lookups.

regex

Trouble finding those filed emails & staying on top of your inbox?

August 6, 2013

Great little tool that a colleague introduced to me back in Lotus Notes 6. I have just started to use it again to help me keep on top of my inbox and file things more effectively.

SwiftFile:

SwiftFile is an intelligent assistant for Lotus Notes that helps users organize their e-mail into folders. SwiftFile uses a text classifier to learn each user’s mail-filing habits. SwiftFile uses the model it learns to predict the three folders into which the user is most likely to place each incoming message. The predictions are presented to the user as three shortcut buttons that allow the user to quickly file each message into one of the predicted folders. When one of SwiftFile’s  predictions is correct, the effort required to file a message is reduced to a single button click. This file contains instructions for installing and using SwiftFile.

Such a simply little tool that may even help with your records management and email archiving strategy.

Get the latest version here:

http://www-01.ibm.com/support/docview.wss?uid=swg24019219

SwiftFile

The difference between Search and eDiscovery

July 21, 2013

Is there a difference between search and discovery? I hear the words search and discovery used interchangeably in meetings and I often try to highlight the differences. Below I have tried to summarize the differences in the execution and results of both. Hopefully this will help me explain it better in the future:

Execution:

  • Search – a user executes a search once
  • eDiscovery – a dynamic process that is executed over a period of time

End Results:

  • Search – a set of results that match the defined criteria
  • eDiscovery – an awareness or understanding gained from searching

The Findings:

  • Search – the results are likely and expected
  • eDiscovery – the results are unexpected but relevant to the user’s goals

Content:

  • Search – results are based on top ratings and popularity of the content
  • eDiscovery – is a process of investigation that the content leads due to its relevance

Recognition:

  • Search – formula based
  • eDiscovery  – led by user decision

From checking definitions in dictionaries and of-course Wikipedia, eDiscovery is defined as being legally driven for litigation. Although I believe eDiscovery isn’t just legally driven. Discovery can be more general such as deciding on where you go on holidays since you start out not knowing but by the end of your googling and investigating travel websites and blogs you have come to a conclusion/decision. The fact that there is an ‘e’ in front of the word discovery doesn’t solely make it legally focused, it simply means it is an electronic process.

Finally, to summarize, there maybe differences between Search and eDiscovery but in conclusion, eDiscovery relies heavily on search and can even be said to compliment it.