Error during document indexing: An unknown error occured

Article ID: 160713

Products

Data Loss Prevention Enforce

Issue/Introduction

During document indexing, Enforce shows this status for the Index:

          Error during document indexing: An unknown error occured

The Tomcat log shows an error similar to this:

17 JAN 2012 16:07:20,760- Thread: 24 SEVERE [com.vontu.profiles.manager.InfoSourceIndexCreator] Error during document indexing
Cause:
com.vontu.cracker.jni.NativeException: Exception caught in FileTypeIdentifier::GetFileType
com.vontu.cracker.jni.NativeException: Exception caught in FileTypeIdentifier::GetFileType
 at com.vontu.cracker.jni.NativeFileTypeIdentifier.getNativeDocumentTypeFromByteArray(Native Method)
 at com.vontu.cracker.jni.NativeFileTypeIdentifier.getFileType(NativeFileTypeIdentifier.java:24)
 at com.vontu.cracker.NativeFileTypeIdentifierAdapter.getFileType(NativeFileTypeIdentifierAdapter.java:17)
 at com.vontu.cracker.TypeIdentifierDelegate.getTypeForFile(TypeIdentifierDelegate.java:34)
 at com.vontu.profiles.manager.document.CrackingProcessor.getDocumentTypeForFile(CrackingProcessor.java:41)
 at com.vontu.profiles.manager.document.CrackingProcessor.getFormatForFile(CrackingProcessor.java:35)
 at com.vontu.profiles.manager.document.DocumentSourceIndexCreator.putDocumentInIndexer(DocumentSourceIndexCreator.java:379)
 at com.vontu.profiles.manager.document.DocumentSourceIndexCreator.doIndex(DocumentSourceIndexCreator.java:227)
 at com.vontu.profiles.manager.document.DocumentSourceIndexCreator.indexInfoSourceOnManager(DocumentSourceIndexCreator.java:123)
 at com.vontu.profiles.manager.InfoSourceIndexCreator.indexListOfDataSources(InfoSourceIndexCreator.java:138)
 at com.vontu.profiles.manager.document.DocumentSourceIndexJob.index(DocumentSourceIndexJob.java:9)
 at com.vontu.profiles.manager.InfoSourceIndexJob.execute(InfoSourceIndexJob.java:46)
 at org.quartz.core.JobRunShell.run(JobRunShell.java:202)
 at org.quartz.simpl.SimpleThreadPool$WorkerThread.run(SimpleThreadPool.java:543)

Resolution

An exception in FileTypeIdentifier::GetFileType indicates that the indexer could not determine the type of a file, most likely because the file is corrupted. Review the files to be indexed and check for obviously corrupted files, paying particular attention to spanned Zip archives that are missing a segment. Once the offending file has been repaired or removed, run the indexing again.
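
Reviewing a large document source by hand can be slow, so a small script may help narrow the search. The following is a minimal sketch, not part of the product: it assumes Python 3 is available on a machine that can read the document source, and the function names `find_suspect_files` and `check_zip` are hypothetical. It flags empty files and Zip archives that fail a basic integrity check. Python's `zipfile` module does not handle multi-disk archives, so a spanned Zip will probably be reported whether or not all of its segments are present, and those still need a manual check.

```python
import os
import zipfile
import zlib


def check_zip(path):
    """Return a reason string if the archive fails a basic check, else None."""
    try:
        with zipfile.ZipFile(path) as archive:
            # testzip() returns the first member with a bad CRC, or None.
            bad_member = archive.testzip()
    except (zipfile.BadZipFile, NotImplementedError, RuntimeError,
            zlib.error, EOFError) as exc:
        return f"cannot read as Zip: {exc}"
    if bad_member is not None:
        return f"CRC check failed for member {bad_member!r}"
    return None


def find_suspect_files(root):
    """Walk root and return (path, reason) pairs for files worth a manual look."""
    suspects = []
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            path = os.path.join(dirpath, name)
            try:
                if os.path.getsize(path) == 0:
                    suspects.append((path, "empty file"))
                elif name.lower().endswith(".zip"):
                    reason = check_zip(path)
                    if reason:
                        suspects.append((path, reason))
            except OSError as exc:
                # Permission problems or files that vanish mid-scan.
                suspects.append((path, f"unreadable: {exc}"))
    return suspects
```

Calling `find_suspect_files` with the path of the document source returns a list of candidates to inspect. A file that passes these checks can still be the one the indexer rejects, so treat the result as a starting point rather than a verdict.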