Add an option to turn off respecting robots.txt #124

@mmuehlfeldRH

I appreciate that linkcheck respects robots.txt, but this makes the tool mostly useless for me.
A link checker should be able to check all links on the specified URL.

How about continuing to respect robots.txt by default, but adding a command-line option to turn off this behavior when needed?
For example: --ignore-robots-txt
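
To illustrate the behavior I have in mind, here is a minimal sketch in Python. It is not linkcheck's actual code. Only the flag name `--ignore-robots-txt` comes from this request; the program name `linkcheck-sketch`, the helpers `build_parser` and `may_check`, and the sample robots.txt rules are hypothetical.

```python
import argparse
from urllib.robotparser import RobotFileParser


def build_parser() -> argparse.ArgumentParser:
    """Hypothetical CLI: robots.txt is respected unless the flag is given."""
    parser = argparse.ArgumentParser(prog="linkcheck-sketch")
    parser.add_argument("url", help="start URL to check")
    parser.add_argument(
        "--ignore-robots-txt",
        action="store_true",  # off by default, so the current behavior is kept
        help="check links even if robots.txt disallows them",
    )
    return parser


def may_check(url: str, robots: RobotFileParser, ignore_robots: bool,
              user_agent: str = "linkcheck-sketch") -> bool:
    """Return True if the checker is allowed to fetch `url`."""
    if ignore_robots:
        return True  # the user explicitly opted out of robots.txt handling
    return robots.can_fetch(user_agent, url)


# Sample rules (made up for this sketch), parsed from memory instead of
# being downloaded from a real site.
robots = RobotFileParser()
robots.parse(["User-agent: *", "Disallow: /private/"])
```

With these sample rules, `may_check("https://example.com/private/page", robots, ignore_robots=False)` is rejected by the robots.txt check, while the same call with `ignore_robots=True` is allowed through. Keeping the flag off by default means existing users see no change.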
