-
Notifications
You must be signed in to change notification settings - Fork 129
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Runtime errors with ja
, zh
, and id
.
#550
Comments
Thank you! I'll take a look at fixing these runtime errors tomorrow. Will release a fix fir them asap |
hey @cyanic-selkie - both errors should be fixed now in Yeah - the |
I just reran it for The On a side note, I'd like to suggest using |
thanks - that's a real doozy. Wonder why it's only spanish?? i just ran const opts = {
input: path.join(dir, `/${lang}wiki-latest-pages-articles.xml`),
outputMode: "ndjson",
outputDir: path.join(dir, lang),
parse: function (doc) {
return doc.json()
}
}
dip(opts).then(() => {
console.log('done!')
}) will you try that, on your machine? |
good idea using |
@spencermountain I just fixed the variable declarations and it works perfectly. I'm not used to JS, so thanks! |
Hi,
Thank you for the awesome library!
I am currently using
dumpster-dip
to generate a dataset from all Wikipedia languages. It ran fine for all languages exceptja
,zh
,id
.Specifically, for
ja
andzh
I got the following error:For
id
, I got:It is also worth noting that for
es
to complete successfully, I had to set--max-old-space-size
to20000
, which seems excessive, especially since no other language requires changing the default. If I left it at default (or even set to10000
), I got the following error:The text was updated successfully, but these errors were encountered: