[OLPC-devel] Summer of Code introduction, progress update

Ivan Krstic krstic at fas.harvard.edu
Wed Jun 14 14:02:30 EDT 2006


This year, OLPC is one of the mentoring organizations in the Google
Summer of Code program. We received more than 75 applications, and had a
hard time choosing only 5 to mentor. Congratulations again to the
selected students!

This is the introductory SoC progress update, and the only one I'll be
compiling. I've asked the students to send progress updates to the
devel@ list every Sunday. I hope to see some great work from this bunch.


o   Matthew Harrison is a freshman studying computer science and
    astronomy at the University of Massachusetts Amherst. He's working
    on external data sources (EDS) for our eBook reader. This means
    things like importing books from Project Gutenberg, the
    International Children's Library (ICDL), and so on.

    Matthew reports: """I've got code to "detect" all the modules in a
    modules folder, code to download and save the PG catalog locally,
    code that strips the catalog of things that aren't book listings,
    and I've outlined the catalog search functions.  This week I'm going
    to work on finding the best/fastest way to search the catalog and
    return results.  I want it to be able to search by title, author,
    keyword, language, and etext number.  As for searching by language,
    PG appends "[Language: Foo]" to books that aren't in English on the
    next line after the book listing in the catalog, so it shouldn't be
    too hard to do language searches.

    I'm still looking into finding the best format for the books.  I'm
    still leaning towards XML or something XML-based, but depending on
    how exactly the interface is going to be implemented, some formats
    may be easier to work with than others."""


o   Eric Astor is a freshman at Swarthmore College, with an intended
    Math/Physics major. He is working on building a set of tools that we
    can use to do two things: one, to take a dump of Wikipedia page
    (such as the Simple English Wikipedia, or any other one) and produce
    an automatic ranking of the pages therein based on "importance",
    which really means graph-theoretical network centrality. This lets
    us say "what are likely the most useful pages to have if we can only
    dedicate 25MB to having encyclopedia content on the laptop?".
    Presumably, the list is then fed to human editors who make the final
    decisions, and processing this final list is the second part of
    Eric's work; his tools are to take it and produce from it a
    locally-browseable Wikipedia dump.

    Eric recently ran the Simple English Wikipedia through a
    PageRank-like algorithm with the help of an open source graph theory
    library, and reports: """As it turns out, the results from the
    PageRank run against the Simple English Wikipedia are very
    reasonably sized in OpenDocument format - so I've attached that.
    I've noticed a definite bias in the results - towards geography, and
    English-speaking countries in particular. I suspect that this is
    mainly because the group of contributors to the SEW is small and
    specialized."""

    I've posted Eric's result set here:
    http://solarsail.hcs.harvard.edu/~krstic/simplewiki-pagerank.ods


o   Eduardo Silva, from Chile, is studying networking at Inacap
    (http://www.inacap.cl) and is working on a new memory usage tool for
    developers.  He's trying to unite various sources of information
    about memory consumption, and present it in a simple, useful numeric
    and graphical form. His new tool, which we dubbed Memphis, will also
    support scripting with Python to allow for memory and CPU usage
    triggers and similar extensions.

    Eduardo reports: """The last 2 weeks I was reading and learning all
    about Python , GTK+ and Cairo.

    - The first step was to write a class in order to read procfs.
    - Write methods to get basic information for a given PID.
    - Design first GUI with Glade
    - Write methods in order to use our GTK TreeView where process
      information will be showed

    At this time, Memphis can:
    - read procfs and load data into a treeview: PID, PPID, program name
      and RSS
    - print memory status in console for a selected process in the
      treeview using just a static frequency

    Now I'm writing some classes to draw the stats in the graph for a
    selected PID."""

    The first screenshot of Memphis, still lacking the graph display,
    is here: http://solarsail.hcs.harvard.edu/~krstic/memphis-01.png


o   Arthur Wolf is working on a project that's potentially useful to
    OLPC in the long term: he's building a peer-to-peer solution for DNS
    and DHCP on mesh networks, to simplify deployment and use of citizen
    networks (free Metro Area Networks). He reports having started on
    the main P2P part, which he deems the hardest piece to do.


o   Erik Pukinskis is a graduate student of Human-Computer Interaction
    Design at the School of Informatics at Indiana University. He is
    actually a SoC student for the AbiWord project, but he's working on
    an AbiWord interface for OLPC, which means we get to lay claim to
    him for the purposes of this report. Erik reports:

    """A lot of what I've been doing so far is finding the people I need
    to be coordinating with.  Since there's a strong design component to
    my project, is I've made contact with a bunch of people who know
    some of the regions the OLPC is going to (Brazil, Nigeria, Thailand)
    and I'm interviewing some of them about what education and life is
    like for kids there.  I'm blogging about this here:

     http://www.snowedin.net/summerofcode/

    My first interview should be posted today. I've also been talking
    with Chris Blizzard to try to figure out what they have already
    decided about the UI so I can work within their expectations.  And
    I've got AbiWord building on my laptop, but I haven't started
    hacking much.  So that's basically where I'm at.  My next steps are
    to post on my interviews/user research, and what the emerging UI
    requirements look like, start sketching UI, and start playing with
    the AbiWord UI to learn how the codebase works.  I also want to get
    the SDK installed on this machine, but I am waiting for a new hard
    drive to come next week."""


o   Manu Cornet is a first year PhD student in bioinformatics at the
    École normale supérieure in France. He's done great work for Ubuntu
    during last year's Summer of Code program, and is now working on
    hacking GTK+ for our purposes. This includes minor tweaks (such as
    disabling cursor blinking for extra power-saving on the display) and
    major work on an OLPC-friendly GTK theme. He is being mentored
    by the inimitable Federico Mena Quintero of GNOME fame.

    Manu didn't send a progress report. Calls to his home phone,
    university phone, and cell phone have been unanswered. He hasn't
    been seen in days. Witnesses say he was last spotted in #olpc,
    where, we learn unofficially, his mentor chided unidentified
    "corporate bastards" for "doing his poor student's work". A full
    investigation is pending, with retribution by the corporate
    bastards being a possible cause of disappearance. We remain
    hopeful Manu is unharmed.

-- 
Ivan Krstic <krstic at fas.harvard.edu> | GPG: 0x147C722D



More information about the Devel mailing list