1 |
# ----- Example 8 - Filtering PDF files ------- |
2 |
# |
3 |
# Please see the swish-e documentation for |
4 |
# information on configuration directives. |
5 |
# Documentation is included with the swish-e |
6 |
# distribution, and also can be found on-line |
7 |
# at http://swish-e.org |
8 |
# |
9 |
# |
10 |
# This example demonstrates how to use swish's |
11 |
# "filter" feature to index PDF documents. |
12 |
# |
13 |
# Filters can be used to filter PDF or MS Word documents |
14 |
# to uncompress gzipped files, or to modify content |
15 |
# before indexing. |
16 |
# |
17 |
# You will need the xpdf package installed to use |
18 |
# this filter. |
19 |
# |
20 |
# See filter-bin/_pdf2html.pl for more information. |
21 |
# |
22 |
# Please see the documentation on File Filters in |
23 |
# the SWISH-CONFIG.pod manual page. |
24 |
# |
25 |
# Note: |
26 |
# If you are filtering many documents and/or using |
27 |
# a perl script to filter, see example9.config for |
28 |
# perhaps a faster way to filter. |
29 |
# |
30 |
#--------------------------------------------------- |
31 |
|
32 |
# Include our site-wide configuration settings: |
33 |
|
34 |
IncludeConfigFile example4.config |
35 |
|
36 |
# Index the example config files and .pdf files |
37 |
# in the current directory (and sub directories) |
38 |
|
39 |
IndexDir . |
40 |
IndexOnly .config .pdf |
41 |
|
42 |
|
43 |
# Assign the pdf2text.pl filter to .pdf files |
44 |
# Please see docs on what data can be passed to the filter. |
45 |
|
46 |
FileFilter .pdf ../filter-bin/_pdf2html.pl |
47 |
|
48 |
|
49 |
# end of example |