|
06-19-2006, 12:32 PM | #1 |
Green Mole
Join Date: Jun 2006
Posts: 4
|
pdf indexing blocks when spidering
Hi,
First, congratulations for PhpDig ! So i've installed phpdig v.1.8.9 RC1 on my localhost. All is ok. I would like to index pdf file. I've added the 3 part of code in "read me before..." When i try to index pdf files, it blocks at "echo is_executable" line, see commentar : Is result test http an array: 1 What is result test http status: PDF Is result test an array : 1 What is result test status : PDF Use is executable is set to : 0 Index the pdf is set to : 1 Parse the pdf is set to : D:\phpdig\ext\pdftotext.exe Does parse pdf exist : It blocks here, with no result. I think the command is_executable doesn't work. Because it run on a windows server, i've tried to change the value define('USE_IS_EXECUTABLE_COMMAND','0'); in config file Could you please help me. THX Last edited by sepult; 06-19-2006 at 12:33 PM. Reason: wrong title |
06-19-2006, 01:18 PM | #2 |
Green Mole
Join Date: Jun 2006
Posts: 4
|
Config :
Server under Windows PHP Version 4.4.0 Last edited by sepult; 06-19-2006 at 01:21 PM. |
06-20-2006, 04:45 AM | #3 |
Green Mole
Join Date: Jun 2006
Posts: 4
|
I saw that pascalp had the same problem.
How did you resolve it ? |
06-29-2006, 06:24 AM | #4 |
Green Mole
Join Date: Jun 2006
Posts: 4
|
Finally, I resolved my problem.
Here is my solution. I'm using Windows server with apache 2 and can't use command "is_executable", same for "cat" or "type" command shell. So after robot functions : if ($usetool) { rename($tempfile1,$tempfile2); exec($command,$result,$retval); unlink($tempfile2); add : $f_handler = fopen($tempfile2.$ext,'r'); while (!feof($f_handler)) { $result[] .= utf8_encode(fgets($f_handler,8192)); } fclose($f_handler); unlink($tempfile2.$ext); and comment lines after : /*if (!empty($ext)) { $command = 'type '.$tempfile2.$ext; exec($command,$result,$retval); unlink($tempfile2.$ext); }*/ So for me it works fine. Bye |
|
|
Similar Threads | ||||
Thread | Thread Starter | Forum | Replies | Last Post |
Indexing PDF | dlaperle | Troubleshooting | 1 | 03-21-2007 08:00 PM |
indexing pdf | philippeguerind | External Binaries | 11 | 02-21-2004 11:50 AM |
Problem whith inclute in template SSI blocks | SavageNoName | How-to Forum | 11 | 12-31-2003 02:59 AM |
PDF indexing | lelandv | External Binaries | 15 | 12-08-2003 05:23 PM |
PDF indexing | aryan | External Binaries | 11 | 11-27-2003 08:51 AM |