#3443 NORM Untriag: Create bindings to use libabiword for file conversion

Zarro Boogs per Child bugtracker at laptop.org
Fri Sep 14 08:10:19 EDT 2007


#3443: Create bindings to use libabiword for file conversion
---------------------------------------+------------------------------------
  Reporter:  uwog                      |       Owner:  uwog     
      Type:  defect                    |      Status:  new      
  Priority:  normal                    |   Milestone:  Untriaged
 Component:  write-activity (abiword)  |     Version:           
Resolution:                            |    Keywords:           
  Verified:  0                         |  
---------------------------------------+------------------------------------

Comment(by tomeu):

 Extracting plain text from various formats will surely be important when
 we restore indexing of document contents and improve our fulltext search
 capabilities.

 Although we should compare the reliability of abiword filters against
 alternatives like pdftotext, odt2txt, etc, I see advantages in using
 abiword for all of them.

 Regarding the impl of python bindings for this functionality, lately I
 have thought it could be better if the datastore executed the conversion
 out of process, as abiword is a big piece of non-python code and if
 executed in-process by the DS, could bring down the whole DS process in
 case of a segmentation fault.

 So perhaps we don't need to add pyabiword bindings for content indexing,
 just bundle the abiword binary so it can have piped in docs and piped out
 plain text.

-- 
Ticket URL: <https://dev.laptop.org/ticket/3443#comment:1>
One Laptop Per Child <https://dev.laptop.org>
OLPC bug tracking system



More information about the Bugs mailing list