PhpDig.net

Go Back   PhpDig.net > PhpDig Forums > External Binaries

Reply
 
Thread Tools
Old 06-19-2006, 12:32 PM   #1
sepult
Green Mole
 
Join Date: Jun 2006
Posts: 4
pdf indexing blocks when spidering

Hi,

First, congratulations for PhpDig !

So i've installed phpdig v.1.8.9 RC1 on my localhost. All is ok.
I would like to index pdf file.
I've added the 3 part of code in "read me before..."

When i try to index pdf files, it blocks at "echo is_executable" line, see commentar :

Is result test http an array: 1
What is result test http status: PDF

Is result test an array : 1
What is result test status : PDF
Use is executable is set to : 0
Index the pdf is set to : 1
Parse the pdf is set to : D:\phpdig\ext\pdftotext.exe
Does parse pdf exist :

It blocks here, with no result.

I think the command is_executable doesn't work. Because it run on a windows server, i've tried to change the value
define('USE_IS_EXECUTABLE_COMMAND','0'); in config file

Could you please help me.
THX

Last edited by sepult; 06-19-2006 at 12:33 PM. Reason: wrong title
sepult is offline   Reply With Quote
Old 06-19-2006, 01:18 PM   #2
sepult
Green Mole
 
Join Date: Jun 2006
Posts: 4
Config :

Server under Windows
PHP Version 4.4.0

Last edited by sepult; 06-19-2006 at 01:21 PM.
sepult is offline   Reply With Quote
Old 06-20-2006, 04:45 AM   #3
sepult
Green Mole
 
Join Date: Jun 2006
Posts: 4
I saw that pascalp had the same problem.

How did you resolve it ?
sepult is offline   Reply With Quote
Old 06-29-2006, 06:24 AM   #4
sepult
Green Mole
 
Join Date: Jun 2006
Posts: 4
Finally, I resolved my problem.
Here is my solution.

I'm using Windows server with apache 2 and can't use command "is_executable", same for "cat" or "type" command shell.

So after robot functions :
if ($usetool) {
rename($tempfile1,$tempfile2);
exec($command,$result,$retval);
unlink($tempfile2);

add :
$f_handler = fopen($tempfile2.$ext,'r');
while (!feof($f_handler)) {
$result[] .= utf8_encode(fgets($f_handler,8192));
}
fclose($f_handler);
unlink($tempfile2.$ext);

and comment lines after :

/*if (!empty($ext)) {
$command = 'type '.$tempfile2.$ext;
exec($command,$result,$retval);
unlink($tempfile2.$ext);
}*/

So for me it works fine.
Bye
sepult is offline   Reply With Quote
Reply


Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is Off
HTML code is Off
Forum Jump

Similar Threads
Thread Thread Starter Forum Replies Last Post
Indexing PDF dlaperle Troubleshooting 1 03-21-2007 08:00 PM
indexing pdf philippeguerind External Binaries 11 02-21-2004 11:50 AM
Problem whith inclute in template SSI blocks SavageNoName How-to Forum 11 12-31-2003 02:59 AM
PDF indexing lelandv External Binaries 15 12-08-2003 05:23 PM
PDF indexing aryan External Binaries 11 11-27-2003 08:51 AM


All times are GMT -8. The time now is 12:35 AM.


Powered by vBulletin® Version 3.7.3
Copyright ©2000 - 2024, Jelsoft Enterprises Ltd.
Copyright © 2001 - 2005, ThinkDing LLC. All Rights Reserved.