#6777 HIGH Never A: non-ascii characters break make_index.py

Zarro Boogs per Child bugtracker at laptop.org
Fri Mar 28 09:09:33 EDT 2008


#6777: non-ascii characters break make_index.py
----------------------+-----------------------------------------------------
  Reporter:  walter   |       Owner:  sj            
      Type:  defect   |      Status:  new           
  Priority:  high     |   Milestone:  Never Assigned
 Component:  library  |     Version:                
Resolution:           |    Keywords:                
  Verified:  0        |    Blocking:                
 Blockedby:           |  
----------------------+-----------------------------------------------------

Comment(by mako):

 Walter,

 Because content bundles are served statically off-disk and text from
 different content bundles (conceivably in different languages) is mixed
 into the index file, all bundles currently need to be in UTF-8. When we
 move to a dynamic content library, we can dynamically re-encode/decode
 them. Until that happens, any non-UTF-8 content bundle will simply be
 skipped.
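
 As a rough illustration of the "skip anything that is not UTF-8" rule, a
 check along the following lines would do it. This is only a sketch, not
 the actual code in make_index.py; the names is_utf8 and utf8_bundles are
 made up for the example.

```python
def is_utf8(data: bytes) -> bool:
    """Return True if the given bytes decode cleanly as UTF-8."""
    try:
        data.decode("utf-8")
    except UnicodeDecodeError:
        return False
    return True


def utf8_bundles(paths):
    """Yield only the paths whose contents are valid UTF-8.

    Hypothetical helper: files that fail the check are silently skipped,
    mirroring the behaviour described in the comment above.
    """
    for path in paths:
        with open(path, "rb") as handle:
            if is_utf8(handle.read()):
                yield path
```

 Plain ASCII files pass this check as well, since ASCII is a subset of
 UTF-8.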

 You need to find out what encoding your bundle is currently in. If it's
 Spanish, it is most likely LATIN1 (ISO-8859-1). Then you can convert it to
 UTF-8 with iconv using something like:
 {{{
 iconv -f latin1 -t utf8 -o OUTPUT_FILENAME.html FILENAME
 }}}
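
 If iconv is not at hand, roughly the same conversion can be done in
 Python. This is a minimal sketch assuming the source file really is
 LATIN1; convert_to_utf8 is a hypothetical name, not part of the library
 tools.

```python
from pathlib import Path


def convert_to_utf8(src, dst, source_encoding="latin-1"):
    """Re-encode the file at src from source_encoding to UTF-8, writing dst.

    Assumption: the caller knows the real source encoding. LATIN1 accepts
    every byte value, so a wrong guess produces garbled text, not an error.
    """
    raw = Path(src).read_bytes()
    text = raw.decode(source_encoding)
    Path(dst).write_bytes(text.encode("utf-8"))
```

 Running it on a file that is already UTF-8 would double-encode the
 accented characters, so it is worth checking the current encoding first.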

 Please try that and let me know if the problem persists.

-- 
Ticket URL: <http://dev.laptop.org/ticket/6777#comment:1>
One Laptop Per Child <http://laptop.org/>
OLPC bug tracking system


