PhpDig.net

Go Back   PhpDig.net > PhpDig Forums > External Binaries

Reply
 
Thread Tools
Old 02-04-2004, 03:59 AM   #1
adtphpDig
Green Mole
 
Join Date: Dec 2003
Posts: 3
Check box from spider.php

I am still trying to get phpDig to crawl pdf's as they do not appear in the search listing.

Since the fonts were installed in a seperate directory I had to do the following in my config.php

define('PHPDIG_INDEX_PDF',true);
define('PHPDIG_PARSE_PDF','/usr/local/bin/pstotext');
define('PHPDIG_OPTION_PDF','-gs "gs -I/usr/ghostscript/fonts/default/Type1"');

Is the above acceptable?

Similar to some of the other postings, I see the pdf's in the list, however not in any of the search engine results. I also noticed that there is no green checkbox next to these pdf's. Does no check box or x mark mean something?

Anyway I'm using 1.6.5 with the critical change to the config.php file. Also, I've attached my spider.php file.

Thanks,
Anand
adtphpDig is offline   Reply With Quote
Old 02-05-2004, 09:44 AM   #2
Charter
Head Mole
 
Charter's Avatar
 
Join Date: May 2003
Posts: 2,539
Hi. What do you get when you run the following from shell?

/usr/local/bin/pstotext -gs "gs -I/usr/ghostscript/fonts/default/Type1" filename.pdf
__________________
Responses are offered on a voluntary if/as time is available basis, no guarantees. Double posting or bumping threads will not get your question answered any faster. No support via PM or email, responses not guaranteed. Thank you for your comprehension.
Charter is offline   Reply With Quote
Old 02-05-2004, 10:50 AM   #3
adtphpDig
Green Mole
 
Join Date: Dec 2003
Posts: 3
To answer your question. I get the text from the pdf file.

I've moved on believing that the pstotext was not install correctly and have tried pdftotext which I have actually gotten green check marks as well as a 0 result from the exec() command in the robot_functions.php.

In the result from exec() I get 0, 1, and 3. What is 1 and 3 represent?
adtphpDig is offline   Reply With Quote
Old 02-05-2004, 12:00 PM   #4
Charter
Head Mole
 
Charter's Avatar
 
Join Date: May 2003
Posts: 2,539
Hi. What happens if you copy /usr/local/bin/pstotext to your account and then change PHPDIG_PARSE_PDF to reflect the full path to the new location of pstotext, making sure that the directories and file are all 755 permissions?
__________________
Responses are offered on a voluntary if/as time is available basis, no guarantees. Double posting or bumping threads will not get your question answered any faster. No support via PM or email, responses not guaranteed. Thank you for your comprehension.
Charter is offline   Reply With Quote
Reply


Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is Off
HTML code is Off
Forum Jump

Similar Threads
Thread Thread Starter Forum Replies Last Post
Unable to connect to database : Check the connection script. marb Troubleshooting 5 04-12-2004 03:35 AM
sleep(5) in spider.php? bsw114 How-to Forum 2 03-12-2004 02:31 PM
Unable to connect to database : Check the connection script. Nosmada Troubleshooting 2 02-27-2004 09:15 AM
spider.php via bash tomas Troubleshooting 16 02-07-2004 05:23 PM
Unable to connect to database : Check the connection script. Wayne McBryde Script Installation 7 11-04-2003 12:08 PM


All times are GMT -8. The time now is 07:08 PM.


Powered by vBulletin® Version 3.7.3
Copyright ©2000 - 2024, Jelsoft Enterprises Ltd.
Copyright © 2001 - 2005, ThinkDing LLC. All Rights Reserved.