Skip to content

FeatureRequest: Migrate to a faster xml parser #10

@groceryheist

Description

@groceryheist

Others have reported improved performance when using expat to parse Wikimedia dumps. We are currently using ElementTree which provides a good balance between usability and speed.

There is probably potential to speed up this library by switching to a faster xml parser. Candidates include:

  • lxml
  • cElementTree
  • expat

Migrating to lxml or cElementTree might be relatively easy because they have similar APIs to ElementTree.

Metadata

Metadata

Assignees

Type

No type
No fields configured for issues without a type.

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions