|
04-09-2004, 07:04 AM | #1 |
Green Mole
Join Date: Apr 2004
Location: Bordeaux, France
Posts: 3
|
pstotext problem.
Hi. Again a problem trying to index pdf files.
First : the environment Debian linux running Apache 1.3.26 . PHP 4.1.2. PHP dig 1.8.0 Succesfully installed pstotext. In console mode, pstotext runs very well : The command 'pstotext file.pdf ' display the text contained in the pdf on the screen. I also tried to redirect output of pstotext in a text file successfully. phpdig config : define('PHPDIG_INDEX_PDF',true); define('PHPDIG_PARSE_PDF','/usr/bin/pstotext'); define('PHPDIG_OPTION_PDF','-cork'); verified (twice) that pstotext is in /usr/bin/ directory The trouble is the following : phpdig seems to read correctly pdf files but doensn't index them at all. help me please.
__________________
Even though he's free, can't fly with these heavy chains |
04-09-2004, 08:24 AM | #2 |
Head Mole
Join Date: May 2003
Posts: 2,539
|
Hi. Are the directories to pstotext and the pstotext file itself set to 755 permissions?
__________________
Responses are offered on a voluntary if/as time is available basis, no guarantees. Double posting or bumping threads will not get your question answered any faster. No support via PM or email, responses not guaranteed. Thank you for your comprehension. |
04-10-2004, 02:14 AM | #3 |
Green Mole
Join Date: Apr 2004
Location: Bordeaux, France
Posts: 3
|
Hi.
Thank you for answering so quiclky. The directories and pstotext file itself are set to 755 rights (rwxr-xr-x) phpdig reads the pdf files but doesn't index them.
__________________
Even though he's free, can't fly with these heavy chains |
04-10-2004, 01:44 PM | #4 |
Head Mole
Join Date: May 2003
Posts: 2,539
|
Hi. Maybe something in this thread will help.
__________________
Responses are offered on a voluntary if/as time is available basis, no guarantees. Double posting or bumping threads will not get your question answered any faster. No support via PM or email, responses not guaranteed. Thank you for your comprehension. |
04-13-2004, 12:19 AM | #5 |
Green Mole
Join Date: Apr 2004
Location: Bordeaux, France
Posts: 3
|
Hi.
Thank you for your help. I patched spider.php and robot_functions.php and it seems to be working now. Phpdig now index some of my pdf. I still have some problems when trying to index a directory containing only pdf files, but i'm still searching. Thank you again
__________________
Even though he's free, can't fly with these heavy chains |
04-13-2004, 07:32 PM | #6 |
Head Mole
Join Date: May 2003
Posts: 2,539
|
>> I still have some problems when trying to index a directory containing only pdf files, but i'm still searching.
Hi. Are there links to all these PDF files? As PhpDig follows links, it won't index a standalone directory of files. Also, it seems some PDF files just take too much memory. See this thread for more details.
__________________
Responses are offered on a voluntary if/as time is available basis, no guarantees. Double posting or bumping threads will not get your question answered any faster. No support via PM or email, responses not guaranteed. Thank you for your comprehension. |
|
|
Similar Threads | ||||
Thread | Thread Starter | Forum | Replies | Last Post |
problems with pstotext - path-problem? | jmeyerdo | External Binaries | 1 | 01-26-2006 12:30 PM |
How do you install pstotext | krugar | External Binaries | 2 | 12-08-2004 12:53 PM |
problem with pstotext | loicoco | External Binaries | 2 | 07-16-2004 03:17 AM |
pstotext issue | killer27 | External Binaries | 7 | 05-12-2004 02:28 PM |
Pstotext problem!!!!! | mario | External Binaries | 14 | 03-24-2004 04:16 AM |