[Server-devel] Running complete Wikipedia offline

Sameer Verma sverma at sfsu.edu
Sat Dec 15 15:46:05 EST 2012


On Wed, Dec 12, 2012 at 1:37 PM, Martin Langhoff
<martin.langhoff at gmail.com> wrote:
> On Wed, Dec 12, 2012 at 4:28 PM, Sameer Verma <sverma at sfsu.edu> wrote:
>> I've been debating the possibility of running a *complete* copy of
>> Wikipedia (txt and images) offline on the XS. At this point, the
>> targets are English (https://en.wikipedia.org) and Hindi
>> (https://hi.wikipedia.org).
>
> It would be trivial. Get the HTML-formatted dumps, serve them statically.
>

I got the XML dump for en-wiki.

> My only comment is... let us know about the on-disk space usage once
> it's unpacked (du -sh /path/to/wikipedia )

sverma@elverma-xps13:~$ du -sh /home/sverma/Downloads/enwiki-20121201-pages-articles.xml
40G	/home/sverma/Downloads/enwiki-20121201-pages-articles.xml
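
For anyone who wants to repeat this measurement on an unpacked directory
tree rather than a single file, here is a minimal Python sketch of what
"du -sh" reports. It is an approximation under stated assumptions: it sums
apparent file sizes, while du counts allocated disk blocks, so the two
figures can differ somewhat. The names tree_size_bytes and human are
hypothetical helpers made up for this illustration.

```python
import os


def tree_size_bytes(root):
    """Sum the apparent sizes of regular files under root.

    If root is itself a file, return that file's size.
    Symbolic links are skipped so nothing is counted twice.
    """
    if os.path.isfile(root):
        return os.path.getsize(root)
    total = 0
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            path = os.path.join(dirpath, name)
            if not os.path.islink(path):
                total += os.path.getsize(path)
    return total


def human(n):
    """Format a byte count with 1024-based units, roughly like du -h."""
    for unit in ("B", "K", "M", "G", "T"):
        if n < 1024 or unit == "T":
            return f"{n:.0f}{unit}" if unit == "B" else f"{n:.1f}{unit}"
        n /= 1024


if __name__ == "__main__":
    import sys

    # Usage: python treesize.py /path/to/wikipedia
    target = sys.argv[1] if len(sys.argv) > 1 else "."
    print(human(tree_size_bytes(target)), target, sep="\t")
```

The script name treesize.py in the usage comment is also just a placeholder.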

