1 |
adcroft |
1.1 |
# ----- Example 8 - Filtering PDF files ------- |
2 |
|
|
# |
3 |
|
|
# Please see the swish-e documentation for |
4 |
|
|
# information on configuration directives. |
5 |
|
|
# Documentation is included with the swish-e |
6 |
|
|
# distribution, and also can be found on-line |
7 |
|
|
# at http://swish-e.org |
8 |
|
|
# |
9 |
|
|
# |
10 |
|
|
# This example demonstrates how to use swish's |
11 |
|
|
# "filter" feature to index PDF documents. |
12 |
|
|
# |
13 |
|
|
# Filters can be used to filter PDF or MS Word documents |
14 |
|
|
# to uncompress gzipped files, or to modify content |
15 |
|
|
# before indexing. |
16 |
|
|
# |
17 |
|
|
# You will need the xpdf package installed to use |
18 |
|
|
# this filter. |
19 |
|
|
# |
20 |
|
|
# See filter-bin/_pdf2html.pl for more information. |
21 |
|
|
# |
22 |
|
|
# Please see the documentation on File Filters in |
23 |
|
|
# the SWISH-CONFIG.pod manual page. |
24 |
|
|
# |
25 |
|
|
# Note: |
26 |
|
|
# If you are filtering many documents and/or using |
27 |
|
|
# a perl script to filter, see example9.config for |
28 |
|
|
# perhaps a faster way to filter. |
29 |
|
|
# |
30 |
|
|
#--------------------------------------------------- |
31 |
|
|
|
32 |
|
|
# Include our site-wide configuration settings: |
33 |
|
|
|
34 |
|
|
IncludeConfigFile example4.config |
35 |
|
|
|
36 |
|
|
# Index the example config files and .pdf files |
37 |
|
|
# in the current directory (and sub directories) |
38 |
|
|
|
39 |
|
|
IndexDir . |
40 |
|
|
IndexOnly .config .pdf |
41 |
|
|
|
42 |
|
|
|
43 |
|
|
# Assign the pdf2text.pl filter to .pdf files |
44 |
|
|
# Please see docs on what data can be passed to the filter. |
45 |
|
|
|
46 |
|
|
FileFilter .pdf ../filter-bin/_pdf2html.pl |
47 |
|
|
|
48 |
|
|
|
49 |
|
|
# end of example |