|
11-14-2006, 04:40 AM | #1 |
Green Mole
Join Date: Nov 2006
Posts: 2
|
Swedish characters in catdoc
Hi!
I'm about to develop a site with a search engine, and I've been looking at PhpDig, which seems really nice! I saw that it uses catdoc to extract text from MS Word files, so I started testing it, but it doesn't seem to work. These two lines are in my config.php:
define('PHPDIG_PARSE_MSWORD','W:\www\catdoc\catdoc.exe');
define('PHPDIG_OPTION_MSWORD','-s 8859-1');
Even so, some characters are not translated correctly. If I test with a Word file I created, "ä" becomes "d", "ö" becomes "tz" and "å" becomes "e", and in RTF files they become other weird characters. I've spent the last few hours googling it, but I can't make it work. Am I missing something? I've read about character substitution in catdoc, but I really don't know how to do it. All help is appreciated! |
11-16-2006, 02:21 AM | #2 |
Green Mole
Join Date: Nov 2006
Posts: 2
|
Ah! Stupid me, you just edit the charsets/ascii.rpl file so that "ö" gets replaced with something unique, which you can then turn back into the real character before you insert the text into the database.
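For anyone wondering what "something unique" could look like, here is a minimal sketch, not a tested PhpDig patch. It assumes each line of ascii.rpl holds a hexadecimal Unicode code followed by the replacement text (check the file shipped with your catdoc). The placeholder tokens such as `__auml__` and the function name `restore_swedish` are made up for illustration. The idea is that catdoc emits plain-ASCII tokens, and PHP swaps them back before the text is stored.

```php
<?php
// Assumed entries added to charsets/ascii.rpl (format is an assumption:
// hex Unicode code, whitespace, replacement text):
//   00E5 __aring__
//   00E4 __auml__
//   00F6 __ouml__
//   00C5 __Aring__
//   00C4 __Auml__
//   00D6 __Ouml__

// Hypothetical helper: turns the placeholder tokens back into the real
// characters. The bytes produced follow the encoding of this PHP file,
// so save it in the same encoding your database expects.
function restore_swedish($text)
{
    $map = array(
        '__aring__' => 'å',
        '__auml__'  => 'ä',
        '__ouml__'  => 'ö',
        '__Aring__' => 'Å',
        '__Auml__'  => 'Ä',
        '__Ouml__'  => 'Ö',
    );
    // strtr() with an array replaces every key in one pass and is
    // case-sensitive, so __Auml__ and __auml__ stay distinct.
    return strtr($text, $map);
}

// Example: text as catdoc might return it with the entries above.
echo restore_swedish('R__auml__ksm__ouml__rg__aring__s'), "\n";
```

The tokens only need to be strings that never occur in real documents; anything that fits that requirement would do. Where exactly to call such a function inside PhpDig depends on your version, so that part is left open here.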
|
11-21-2006, 06:47 AM | #3 |
Green Mole
Join Date: Feb 2006
Posts: 2
|
Can you explain that in Swedish... erhm... English...
Just tell me what you did. :p How do I replace it with something "unique"? |