MAKE Y0UR INTERNET OUTREACH MULTILINGUAL
Geraldine Wright
Soul Food Ministry |
My E-mail: gw_grin@yahoo.com
Soul Food Ministry web site: http://www.soulfoodministry.org
Getting documents translated
- Volunteer vs. hired translators
- Volunteers are useful for short translations: letters, 1 or 2 paragraph articles, prayer requests, flyers, brochures, etc.
- Larger articles need professional translators
- Translation is a lot of work and volunteers will probably not be willing to do a 2000-3000 or more word document for free
- If you are paying someone, you can demand reasonable turnaround time and quality work
- Finding translators for hire
- Talk to others who have had documents translated. They may have recommendations.
- Check the phone book - this will get you a translator who is physically close
- Check on the Internet - many companies advertise on the Internet and are perfectly happy to receive and deliver documents and invoices through E-mail
- Many companies can do multiple languages, so if you find one you trust they may be able to do all your translation work
- Expect to pay around $0.15 a word
- May find cheaper if you go with a foreign company
- Someone who asks $0.25-0.30 a word is probably overcharging
- If you need additional services, such as putting it into HTML or highly technical language, expect to pay more
- Make them prove themselves
- For the first translation you have them do, send a small document. Have someone who reads the language check it for accuracy
- If the document is full of errors (more than occassional typos) and they balk at fixing it, don't give them any more business
- Don't deal with companies that have sloppy business practices: can't get the invoices to you, forget about your job, estimates far off from actual job cost
- Who we use
- For Spanish - Alfatrad, translate@alfatrad.com
- For French, Portuguese, Russian - American Translators Group, ametrans@earthlink.net
- Proofing translated documents
- Since translation is done by mere humans, no document will be perfect
- There will be mispellings, missing punctuation and minor mistranslations
- Translators are usually native speakers of the target language. Therefore, you will get a translation that is grammatically correct, but the translator may not catch all the shades of meaning of the English phrases, and mistranslations can result
- Recruit a volunteer checker who is a native speaker of English and is fairly fluent in the target language. They will be able to spot typos as well as mistranslations.
- Pay your translators promptly!
- If you are known as a good customer you will likely get better service.
Getting documents into electronic format
- Make the translators give you an electronic version of the document, not just a hard copy
- Get the proper fonts onto your computer
- Western European languages are no problem; the characters can all be produced by the default fonts which are automatically installed when you load Windows
- Other languages, excluding Far Eastern (Eastern European, Middle Eastern, Russian, African, etc.)
- These need to be installed on your computer, but they come with Windows, so you don't need to buy anything
- Actual instructions will differ depending on the version of Windows you are using. Go to the HELP function on the start menu and search for something like multilanguage support. Display the instructions for installing languages, and follow them. You will need the Windows installation discs.
- Far Eastern (Chinese, Japanese, Korean, etc.)
- Depending on your version of Windows, these fonts may come on the install discs. In that case, just follow the instructions for installing them, just like above.
- If the fonts do not come on your Windows install discs, you will need to either buy or download them. To download, do an Internet search for "Japanese fonts", "Chinese fonts", etc. Two of the sources I came up with when I did a search are: http://chinese.yahoo.com/docs/info/setup.html for Chinese, and http://user.dtcc.edu/~berlin/font/japanese.htm for Japanese.
- Entering foreign characters
- Using the CHARACTER MAP to enter characters not on your keyboard
- On the Windows START menu, go to PROGRAMS, then to ACCESSORIES, then to SYSTEM TOOLS. Choose CHARACTER MAP.
- The character map shows the characters available from the selected font. Depending on your version of Windows, it may show all the characters, or only the characters for Western European languages. Win95 and 98 will only show the Western European characters. Win2000 and XP will show all the characters the chosen font can produce.
- You can put any visible character on the clipboard by double clicking it, then hitting the COPY button. The character can now be pasted into your document.
- Using ALT+Numeric keypad to enter characters not on your keyboard
- Each character of all the languages has a numerical encoding according to the international standard called Unicode. If you know the Unicode number for a character, you can enter it by pressing ALT and at the same time typing in the number using the numeric keypad. More information about the Unicode standard is available at www.unicode.org.
- Using standard US keyboard to enter characters from Spanish, Portuguese, Italian and French (and perhaps a few others)
- This technique varies slightly from computer to computer, so there may be some trial and error involved. You might want to get a geeky friend to help you.
- You will need to change the KEYBOARD LAYOUT - go to the HELP function on the START menu and search for something like "keyboard layout". Display the instructions for changing the KEYBOARD LAYOUT and follow them. You will end up in a window which lists your default language (probably US English), along with its default keyboard layout (probably something like US STANDARD). You will want to change the KEYBOARD LAYOUT to something like US-INTERNATIONAL.
- You can now get the non-English characters by a combination of key presses: ' + c for a c with a cedilla, ~ + letter for a letter with a tilde over it, ' + letter for a letter with an acute accent, ` + letter for a letter with a grave accent, ^ + letter for a letter with a carat over it, " + letter for a letter with an umlaut.
- Entering characters from other languages, excluding Far Eastern
- If you will do a lot of typing in the language, you best bet is to buy a keyboard specially made for that language.
- You will then add the new language and its keyboard layout to your system configuration. This will be similar to changing the keyboard layout, as above, but first you will need to add the language (see II B above). Then go into the window to change the KEYBOARD LAYOUT as described in the previous section, but this time add your new language to the list. Set the KEYBOARD LAYOUT for this new language to match the new keyboard you put on the system. You can set the new language to be the default, or leave the default as it was.
- You can now switch between the 2 languages by presing ALT and SHIFT at the same time. A small blue square will appear in the Windows menu bar at the right with a 2 letter code telling you which language is in effect
- For entering small amounts of text, you can trick the computer by adding a language to the KEYBOARD LAYOUT screen and setting the keyboard for that language to one of that language's keyboards, even though you don't have one. You will then need to make a map of which foreign characters are produced by pressing which letters.
- Entering characters for Far Eastern languages
- You can buy a version of Windows in Chinese, Japanese, Korean, etc. You will also need a keyboard.
- You can buy an "enabling system", which has fonts for the language and drivers for entering characters using the English keyboard and storing the files.These cost around $150 to $200. The instructions tend to be in Chinese, but it is possible to figure out how to use them. We at Soul Food Ministry use RichWin2000 for Chinese.
- Getting hard copy into electronic format
- You will need a Twain scanner and a Twain compliant optical character recognition (OCR) program. Get a scanner with 300 or 600 dpi resolution in order to increase the accuracy of the OCR.
- OCR for Western European languages
- For just one language, you can get OCR's for around $40 to $50. For non-English OCR's, you may need to go to non-English web sites to find a single language OCR for that language.
- You can get OCR's that do many languages. Depending on the number of languages and the bells and whistles, these cost from around $50 to around $150.
- Examples: WinOCR & FineReaderOCR, European version
- OCR for other languages, excluding Far Eastern
- As for the Western European languages, it is probably possible to find OCR's that do a single language for a fairly reasonable price.
- Multilanguage OCR's that can do things like Russian, Hebrew, and other non-Western European languages tend to do a large number of languages, and therefore are more expensive. They tend to cost from around $300 to around $600.
- Example: FineReaderOCR, Cyrillic version
- Far Eastern
- Expensive. A single language will run around $200. Plus, you have to have the enabling system for that language, since it has drivers the OCR program needs.
- The instructions may not be in English, but it will probably be somewhat intuitive
- At Soul Food Ministry we use PenPower for Chinese OCR
- OCR tips
- No OCR is 100% accurate. Check your file after OCR to catch mis-recognitions.
- Scan at high resolutions, 300 or 600 dpi, in order to get higher fidelity in the character recognition.
- Formats for storing files
- For Western European languages, you can get away with storing files as plain text in order to save disc space.
- For anything else, you will either want to store it as a word processor file, or as a "Unicode Text File". These files take up more disc space, but save the extra information to tell the computer what character set the document uses.
Getting documents on the Internet
- HTML. What is it? Do you need it?
- HTML is the set of commands that tells the browser how to interpret and display the content you sent it. You can edit the files with a simple word processor like WordPad, or buy an HTML editing program like HomeSite.
- Many HTML editors are available which allow you edit the content on a WYSIWYG (What You See Is What You Get) screen. The editor then generates the HTML code behind the scenes for you. FrontPage, DreamWeaver and Netscape Composer are examples of such editors. Microsoft Word can also save things as HTML files, but puts in a lot of extraneous, unneeded code, that bloats the file size
- The HTML standard is found on the web site www.w3.org
- Specifying character sets
- The character set for Western European languages is default. You don't have to do anything special
- For anything else, you or your web page editor will need to put in a special HTML command to tell the browser which character set to use when displaying the page
- Examples:
to specify the Russian character set

to specify the character set for Chinese as written in Taiwan

- Your HTML editor will allow you to set the character encoding for the document, and it will automatically generate this command for you. FrontPage allows you to set it as part of the PAGE PROPERTIES on the FILE menu. In Netscape Composer, choose SAVE AS CHARSET on the FILE menu.
- You will also need to set up your browser to see the characters so you can read the pages you create
- Make sure the correct fonts are on you computer
- You may need to tell the browser which fonts to use for specific character sets. This is done in PREFERENCES for Netscape and INTERNET OPTIONS for Internet Explorer.
- Specifying text direction - refer to the reproduced workshop slide below

- Left to right is the default. In this case, the document is in Hebrew, and we need to set the base direction for the whole document as right to left, so we use the command

The text will then start at the right edge, and the letters will displayed with the first one furthest to the right. Notice that the order as displayed in the browser is reversed from the order in the HTML command. The code
,
which codes for the character

and which you entered first, is rendered at the right edge. The second character, which is coded by
,
appears 2nd from the right, and so on. Your HTML editor may or may not be able to put this in for you; I could not get either FrontPage or Composer to put it in. You need to be prepared to insert the code by hand.
- To specify one complete paragraph as right to left in an otherwise left to right document (or vice versa), enclose the paragraph in the commands

There is a paragraph in English embedded in the otherwise Hebrew document, so we need to replace the "rtl" in this command with "ltr".
- To specify an isolated right to left word or phrase in an otherwise left to right paragraph, (or vice versa) enclose the word or phrase with the HTML commands

We have an English phrase embedded in an otherwise Hebrew paragraph, so we need to replace the "rtl" in this command with "ltr".
- Getting the text into the web page
- Typing directly into the WYSIWYG screen of the editor
- Entering plain text from a file
- Using the HTML editor to open up a text document
- Open the document in WordPad or NotePad and cut and paste into the HTML editor
- Unicode text documents
- Not all HTML editors can handle Unicode text files properly. If yours does, then you can load them into your web page editor just as if it were a simple text file. If not, you may need to employ a 2-step process to get the Unicode text file into your web editor.
- Method 1: Open the file in WordPad, and insert the minimal amount of HTML needed to get it to display on a browser. Store the document as a Unicode text file and specify its file extension as .htm. You can now display the document by opening it up with the browser, but it will be in a funny format that has some peculiarities. To get it into a more standard format, copy the text directly off the browser window and paste it into your HTML editor. You may have to set the character encoding, since the editor may not be able to automatically detect it.
- Method 2: Import the Unicode text file into a word processor, such as Microsoft Word. Save the document as a web page (.htm file). This will give you a file that will open in the browser, but will likely have lots of extra junk code that doesn't add anything to the file, just bloats it. You can edit out the excess by hand, or display the file as above, copy the file off the broswer window and paste it into your editor. You may have to set the character encoding, since the editor may not be able to automatically detect it.
- Special techniques for inserting small amount of text with foreign characters
- Using graphics - refer to the reproduced workshop slide below

- If you have a small amount of text (a word or phrase, no more than a paragraph) which uses a character set different from the rest of the document and you suspect your readers will not have fonts which allow them to see the characters, you can use a graphic.
- First, get the text into a graphic. You can use the text function of a graphics editor and save the results as a gif of jpeg. You can also create the text in a word processor, then do a PRINT SCREEN, which puts it onto the clipboard, You can then paste it into a graphics editor and save it as a gif or jpeg. In both cases, crop the graphics to include just the text, in order to make your graphics small.
- You can now place your graphic into the web page just like any other graphic.
- Look at the picture of a section of the Soul Food Ministry home page reproduced on the left side of the workshop slide. There are 5 lines of text: the 2nd from the bottom is in Russian, and the bottom is in Chinese. The Russian and Chinese text are actually graphics.
- Using Unicode numbers - refer to the reproduced workshop slide below
- You can insert the Unicode number for the specific character by using a & followed by a # followed by the number followed by a ;. Your editor may automatically generate this code for you.
- In the example, the character

is coded for by
,

is coded for by
,

is coded for by

and 
is coded for by
.
The author of this material is Geraldine Wright, the multilingual mischief maker who manages the non-English pages of the Soul Food Ministry web site.