<div dir="ltr">ok, let's meet friday at 1500 EST on #kiwix on freenode,<br>for those who can make it, to discuss making a main page for an english 0.7 wikipedia bundle.<br><br>SJ<br><br><div class="gmail_quote">On Thu, Aug 28, 2008 at 12:20 PM, Martin Pascal <span dir="ltr"><<a href="mailto:pmartin@linterweb.com">pmartin@linterweb.com</a>></span> wrote:<br>
<blockquote class="gmail_quote" style="border-left: 1px solid rgb(204, 204, 204); margin: 0pt 0pt 0pt 0.8ex; padding-left: 1ex;">Yes Sj ,<br>
<br>
you could join #kiwix on <a href="http://irc.freenode.net" target="_blank">irc.freenode.net</a><br>
Cordialement<br>
Martin Pascal<br>
tel : 02 32 40 23 69, fax : 02 32 61 45 26<br>
gsm : 06 13 89 77 32<br>
----- Original Message ----- From: "Martin Walker" <<a href="mailto:walkerma@potsdam.edu" target="_blank">walkerma@potsdam.edu</a>><div class="Ih2E3d"><br>
To: "Samuel Klein" <<a href="mailto:sj@laptop.org" target="_blank">sj@laptop.org</a>><br>
Cc: "Madeleine Ball" <<a href="mailto:mad@printf.net" target="_blank">mad@printf.net</a>>; "Offline Wikireaders" <<a href="mailto:wikireader@lists.laptop.org" target="_blank">wikireader@lists.laptop.org</a>><br>
</div>
Sent: Thursday, August 28, 2008 6:16 PM<div class="Ih2E3d"><br>
Subject: Re: [Wikireader] english wikireaders and 0.7<br>
<br>
<br>
</div><blockquote class="gmail_quote" style="border-left: 1px solid rgb(204, 204, 204); margin: 0pt 0pt 0pt 0.8ex; padding-left: 1ex;"><div><div></div><div class="Wj3C7c">
SJ,<br>
<br>
I can manage an IRC meeting on Friday - say at 3pm EDT (1900h UTC)? If<br>
this is difficult for others, I will be around next week. We have the<br>
#wikipedia-1.0 channel ( irc://<a href="http://irc.freenode.net/#wikipedia-1.0" target="_blank">irc.freenode.net/#wikipedia-1.0</a> ) if you<br>
wish, but perhaps you have a wikireader channel that may be more<br>
appropriate?<br>
<br>
Martin<br>
<br>
<br>
Samuel Klein wrote:<br>
<blockquote class="gmail_quote" style="border-left: 1px solid rgb(204, 204, 204); margin: 0pt 0pt 0pt 0.8ex; padding-left: 1ex;">
@martin -- How about having a Friday afternoon wikireader meeting?<br>
For this week, whether or not we meet, a pressing question is :<br>
Generating the main page. For the spanish WP, Madeleine did most of<br>
the main page by hand with a bit of help. We may have to do the same<br>
here until better scripts are set up.<br>
<br>
A couple people built the main page for our spanish-language bundle<br>
more or less by hand from a portal template.<br>
<br>
Metadata :<br>
<br>
1. metadata that is currently particularly useful for us is:<br>
- a blacklist of article titles, and a blacklist of images, for the<br>
very few that we explicitly leave out despite other metadata<br>
- a whitelist of both, again to ensure inclusion.<br>
<br>
2. In a general system, I'd like to see this tagged with the name of<br>
the group associated; say olpc-peru-blacklist and olpc-peru-whitelist.<br>
<br>
@cfabian -- testing this on bee units sounds like a fun test of the<br>
metadata slimming!<br>
<br>
SJ<br>
<br>
ps - any news from the offline spanish wp project that got started a<br>
while back?<br>
<br>
<br>
On Sun, Aug 24, 2008 at 6:12 PM, Martin Walker <<a href="mailto:walkerma@potsdam.edu" target="_blank">walkerma@potsdam.edu</a><br>
<mailto:<a href="mailto:walkerma@potsdam.edu" target="_blank">walkerma@potsdam.edu</a>>> wrote:<br>
<br>
Things are looking very promising for the Version 0.7 selection -<br>
we should have a complete article list within a week or so,<br>
containing about 30,000 articles organized by a combination of<br>
quality and importance. With our basic system of compression ,<br>
using I think probably Zeno format), I believe we should be able<br>
to include 30,000 long-ish articles with thumbnails on one DVD,<br>
along with Kiwix and some index pages. I'd be interested to see<br>
how it would work with your compression system - we could get a<br>
few people to test that, I think.<br>
<br>
I know how you love metadata, SJ, and we now have loads of it<br>
(from 1.4 million articles) - so we can customize the selection<br>
for you at will using quality, wikiproject, or the four importance<br>
paramaters. Since this is for kids in specific places, we can<br>
emphasize dinosaurs or birds, exclude serial killers, or include<br>
all articles from (say) Uganda, all as requested. Let me know if<br>
this feature is useful. We don't have an equivalent ranking for<br>
images, I'm afraid - for V0.7 we just include all legal images (as<br>
thumbnails). As for a "main page", the plan is to have a set of<br>
index pages generated by bot and then corrected by a manual<br>
"reality check", but that will take another month or two.<br>
<br>
I'd really like to make sure that we make sure we work together in<br>
the coming months, because I think we can avoid a lot of duplicate<br>
work if we share our best resources, scripts, etc. Once the<br>
selection is done (~ 1st Sept), should we hold an IRC discussion<br>
on how we can best collaborate?<br>
<br>
Martin<br>
<br>
<br>
Samuel Klein wrote:<br>
<br>
There's lots of motivation to get an english wikireader, say,<br>
taking advantage of the article selection and processing of 0.7 .<br>
OLPC could include this in the upcoming G1G1 machines this<br>
winter / early next year. Other users could test wikireaders<br>
that read this zipped format on their own machines, which<br>
would flesh out the reader code.<br>
<br>
Martin -- what's the status on the 0.7 articlelist? Do you<br>
have a similar imagelist that ranks images by importance to<br>
that set of articles?<br>
How is work on a 0.7 main page? I'd love to see how large a<br>
snapshot is with our curent wikireader code (without even<br>
moving to 7z, or trimming the list).<br>
<br>
SJ<br>
<br>
<br>
<br>
<br>
<br>
</blockquote>
<br>
<br></div></div><div class="Ih2E3d">
_______________________________________________<br>
Wikireader mailing list<br>
<a href="mailto:Wikireader@lists.laptop.org" target="_blank">Wikireader@lists.laptop.org</a><br>
<a href="http://lists.laptop.org/listinfo/wikireader" target="_blank">http://lists.laptop.org/listinfo/wikireader</a> <br>
</div></blockquote>
<br>
</blockquote></div><br></div>