Collection Directory-Key Components


Key components in the collection directory include the word list and the documents table, as described below.

Word List

The word list is a list of all words indexed by the Verity engine for a single partition. You can view the contents of the word list using a command-line utility called didump. A word list segment that was output using didump is shown below.


didump -words /z/colls/collnew/parts/00000003.did
didump - Verity, Inc.
Text Size Doc Word
A 10 3 4
a 34 5 24
abbreviations 4 1 1
about 4 1 1
acronym 5 1 2
acronyms 4 1 1
actual 4 1 1
administrator 3 1 1
advance 3 1 1
all 8 2 3
also 9 2 4
Always 4 1 1
In the word list segment shown above, you will notice that the word list contains mixed-case entries. The mixed-case entries support case-sensitive queries. You may want to look at the word list to determine why a query returned the results it did.

Documents Table

The documents table includes field definitions for all collection fields in a single partition. A number of fields are defined by default and custom fields can be defined. You can view the documents table using a command-line tool called browse. A documents table segment that was generated using browse is shown below.


50 Created FIX-date ( 4) = 12-Jan-1998 01:52:27 pm
51 Modified FIX-date ( 4) = 24-Sep-1997 02:40:26 pm
52 Size FIX-unsg ( 4) = 5381
53 DOC_OF FIX-unsg ( 4) = 0
54 DOC_SZ FIX-unsg ( 4) = 4294967295
55 DOC_FN_OF FIX-unsg ( 4) = 436
56 DOC_FN_SZ FIX-unsg ( 2) = 58
57 _CACHE_FN_OF FIX-unsg ( 4) = 2922
58 _CACHE_FN_SZ FIX-unsg ( 2) = 0
59 _ParentID_OF FIX-unsg ( 4) = 354
60 _ParentID_SZ FIX-unsg ( 2) = 46
61 Title_OF FIX-unsg ( 4) = 2481
62 Title_SZ FIX-unsg ( 2) = 15

Special Internal Fields

Field Name
Description
VdkVgwKey
The document key used by the to manage the document. If the document is accessed through the file system, the primary key is a path name. If the document is accessed through a web server, using HTTP, the primary key is a URL.
Score
The score assigned to each retrieved document, based on its relevance to the query. Valid value for the score field is an integer between 0.000 and 1.000. Because the score field is populated during search processing, this field is blank when you view the documents table using a tool like browse. This field can be viewed in a results template.





Copyright © 1998, Verity, Inc. All rights reserved.