Control Indexing of Content in Alfresco

Alfresco indexes the properties (metadata) of every content item and, for some content types (for example Word and PDF documents), the actual text of the content as well. Full-text indexing is only possible for content types that can be converted to plain text, because Alfresco internally uses Lucene for indexing and Lucene can only read text. So when you upload a PDF file, it is internally converted to text, which Lucene then consumes to index its data. That is how indexing works, but sometimes there is a requirement not to index some of the content.

How can we control indexing for particular content?

You can control indexing by using the cm:indexControl aspect. As we know, aspects are used to add metadata to a content item, and Alfresco Share provides a list of default aspects. cm:indexControl is one of them, and it is useful for setting up how content is indexed.

With the help of the cm:indexControl aspect you can control indexing for individual content items.

The following table shows the possible combinations of settings along with the behavior for each case:

cm:isIndexed    cm:isContentIndexed    Result
True            True                   Metadata is indexed. Content is indexed.
True            False                  Metadata is indexed. Content is not indexed.
False           True                   No indexing at all.
False           False                  No indexing at all.
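
The combinations in the table can be written down as a small decision function. This is only an illustrative sketch of the rule the table describes, not Alfresco code; the function name `indexing_behavior` and the returned dictionary keys are made up for this example.

```python
def indexing_behavior(is_indexed: bool, is_content_indexed: bool) -> dict:
    """Model the table: which parts of a node end up in the index.

    Hypothetical helper for illustration only; it mirrors the table in
    this post and does not call Alfresco.
    """
    # cm:isIndexed acts as the master switch: when it is false, nothing is
    # indexed, whatever cm:isContentIndexed says.
    metadata = is_indexed
    # Content text is indexed only when both properties are true.
    content = is_indexed and is_content_indexed
    return {"metadata": metadata, "content": content}


if __name__ == "__main__":
    # Print the behavior for all four combinations from the table.
    for is_indexed in (True, False):
        for is_content_indexed in (True, False):
            print(is_indexed, is_content_indexed,
                  indexing_behavior(is_indexed, is_content_indexed))
```

The point to take away is that cm:isContentIndexed only matters when cm:isIndexed is true.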

Here are the two properties of the cm:indexControl aspect that configure indexing of the nodes to which it is applied:

  • cm:isIndexed (content + metadata): This property controls whether the node is indexed at all.
  • cm:isContentIndexed: This property controls whether the node content (binary) is indexed. Setting it to false inhibits full-text indexing of the document binary.
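
As a rough sketch of how these two properties might be set programmatically, the snippet below builds a JSON body that adds the aspect and sets both properties. It assumes the body shape of a node update in the Alfresco public REST API (an `aspectNames` list plus a `properties` map), which is found in newer Alfresco versions than the 4.1 documentation referenced in this post, so please check it against your own version. The helper name `build_index_control_update` and the sample aspect list are hypothetical, and no request is actually sent.

```python
import json

INDEX_CONTROL_ASPECT = "cm:indexControl"


def build_index_control_update(existing_aspects, is_indexed=True,
                               is_content_indexed=True):
    """Build an update body that applies cm:indexControl to a node.

    Hypothetical helper. The body shape (aspectNames + properties) is an
    assumption about the REST API and should be verified for your version.
    """
    # Keep the aspects the node already has, on the assumption that the
    # submitted list replaces the existing one rather than being merged.
    aspects = list(existing_aspects)
    if INDEX_CONTROL_ASPECT not in aspects:
        aspects.append(INDEX_CONTROL_ASPECT)
    return {
        "aspectNames": aspects,
        "properties": {
            "cm:isIndexed": is_indexed,
            "cm:isContentIndexed": is_content_indexed,
        },
    }


if __name__ == "__main__":
    # Example: keep metadata searchable but skip full-text indexing.
    body = build_index_control_update(["cm:titled", "cm:auditable"],
                                      is_indexed=True,
                                      is_content_indexed=False)
    print(json.dumps(body, indent=2))
```

The same aspect can also be applied by hand from Alfresco Share through the Manage Aspects action.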

Using this aspect you can choose to keep the nodes it is applied to out of the repository-wide index. This can prove useful in situations such as bulk loading.

Hope this gives you a good insight into indexing and how to control it for particular content.

Reference:

http://docs.alfresco.com/4.1/tasks/library-item-manage-aspects.html

http://forums.alfresco.com/forum/general/non-technical-alfresco-discussion/how-does-alfrescos-content-indexing-work-12102008


Author Spotlight

mitpatoliya

I love open source technologies and have been working with them since I stepped into the software industry. Alfresco CMS is my area of expertise. I have worked on various complex implementations involving integration of Alfresco with other technologies, and have worked extensively with JBPM workflows and Webscripts.
