[Server-devel] Mirroring the olpc wiki for disconnected School Server?

Mike Dawson mikeofmanchester at gmail.com
Thu Jul 29 02:45:29 EDT 2010


Actually httrack is by default not evil to servers - and there are
plenty of options there to make it wait x seconds between pages, limit
the transfer rate, etc.  We've used it to copy a fair few websites and
it hasn't hurt anyone.

Everything else related to offline wikis lives here:

http://en.wikipedia.org/wiki/Wikipedia:Database_download

It also has an option to be evil to servers and run hundreds of
threads etc - that wouldn't be so nice...

As long as you have a machine that can run the job for however many
hours / days it takes to get the site through without putting a huge
load on the server should be fine.

If it helps I also made a Scratch gallery downloader so that you can
give it the ID of a gallery in Scratch and it will download all the
scratch projects in that one gallery.  Was used without anyone
complaining.



On 27/07/2010, Martin Langhoff <martin.langhoff at gmail.com> wrote:
> Hi Tom!
>
> Please don't hit our server with a crawler! ;-)
>
> SJ is the master of all things wiki.l.o -- he will know if it's
> possible for us to extract an HTML export of the wiki.
>
> cheers -
>
>
> m
>
> On Tue, Jul 27, 2010 at 7:57 AM, Tom Parker <tom at carrott.org> wrote:
>> We're about to go to Samoa with two school servers for two schools which
>> have no internet access. We would like, both for our own reference, and
>> for after we leave, to have a copy of the olpc wiki on the school
>> server.
>>
>> I don't find any mention of this on the olpc wiki.
>>
>> It seems like the least intrusive method of creating an offline
>> read-only html version of the wiki is to use the DumpHTML extension:
>> http://www.mediawiki.org/wiki/Extension:DumpHTML
>>
>> Is it possible to run this on the olpc wiki? In the next 3 days?
>>
>> Thanks.
>> Tom.
>>
>> _______________________________________________
>> Server-devel mailing list
>> Server-devel at lists.laptop.org
>> http://lists.laptop.org/listinfo/server-devel
>>
>
>
>
> --
>  martin.langhoff at gmail.com
>  martin at laptop.org -- School Server Architect
>  - ask interesting questions
>  - don't get distracted with shiny stuff  - working code first
>  - http://wiki.laptop.org/go/User:Martinlanghoff
> _______________________________________________
> Server-devel mailing list
> Server-devel at lists.laptop.org
> http://lists.laptop.org/listinfo/server-devel
>


More information about the Server-devel mailing list