<p>Hello,</p>
<p>I'm trying to mentally work through mechanics of internationalization of textual content (or HTML content) and I have many questions. I've read much of the wiki on the pootle, i18n, l10n, translating, etec. topics, but they are all oriented towards code, not textual content, so much is still not clear to me. </p>
<p>Let's assume that I have some great text in English, starting from plain-vanilla ASCII, but there may be some words that are going to be better if they are represented in italics or bold (for emphasis). Let's say I take care of that by using HTML mark-up, so I now have an English HTML text tha I want to internationalize for Pootle submission. </p>
<p>Most of the HTML is doing stuff behind the scenes (links, font-size, etc.) which poses no special i18n issues, but some HTML mark up has the effect of modifying the presentation of text in a way that actually does have an impact on the text's meaning.</p>
<p>(see pseudo-HTML below).</p>
<p>sentence1 is phrase1a + phrase1b + phrase 1c</p>
<p>sentence2 is phrase2a + <bold>phrase2b</bold> + phrase2c</p>
<p>sentence3 is <italics>phrase3a</italics> + phrase3b + phrase3c</p>
<p> </p>
<p>sentence1 is no real challenge, it goes straight into a .pot file as a single string.</p>
<p>But what about sentences 2 or 3? </p>
<p>Do italics and/or bold tags translate into non-latin alphabets? (especially say Nepali)</p>
<p>How do you parse this for presentation in Pootle?</p>
<p>Do you put full sentence with HTML tag in it, hoping translator mentally interprets and adjusts HTML tags as needed to preserve emphasis desired within context of sentence?</p>
<p>or do you break it into sub-phrases (broken at the HTML tag junctions in English)?</p>
<p>I can envision the .po files (pseudo versions below are ASCII only), </p>
<p>sentence1<br>Where is the library?<br>Donde esta la biblioteca?</p>
<p>sentence2<br>Let's go to the beach!<br>Vamos a la playa!</p>
<p>does the internationalized backbone html look like this?</p>
<p><ahref="sentence1"><a href="http://www.laptop.org">www.laptop.org</a></a><br><br><br><ahref="sentence2"><a href="http://www.google.com">www.google.com</a></a></p>
<p>and if so what takes it and the .po file to produce a localized version? </p>
<p>Is this done with gettext ignoring the fact that it is not code, just text? </p>
<p>Do gettext tools understand HTML?</p>
<div>I know that is a lot of questions, but I am really hoping to get some health content prepared/bundled and it would help me very much to understand the later phases of 118n and l10n so that the entire process can be done in the most efficient fashion. I more-or-less grasp the process used for code, but I'm not sure I understand how or if the same tools work for plain-text or HTML.</div>
<div> </div>
<div>Any guidance (more wiki links, whatever) would be appreciated.</div>
<div> </div>
<div>cjl</div>
<div> </div>
<div> </div>
<div><u><font color="#0000ff"><a href="http://wiki.laptop.org/go/User:Cjl">http://wiki.laptop.org/go/User:Cjl</a></font></u></div>
<div> </div>
<div><a href="mailto:cjlhomeaddress@gmail.com">cjlhomeaddress@gmail.com</a></div>
<p> </p>
<p><br> </p>