Skip to content

Conversation

@joshweir
Copy link

@joshweir joshweir commented Apr 3, 2017

No description provided.

…eg. https://www.google.com, google.com, etc

fix hardcoding of tokenizer path in test_tokenize_urls test

refactor tokenizer

fix bug when url contains directories the entire url would not be a single token

refactor lib/tokenizer
# The first commit's message is:
recognize a complete url as a token, this includes various url forms eg. https://www.google.com, google.com, etc

# This is the 2nd commit message:

fix hardcoding of tokenizer path in test_tokenize_urls test

# This is the 3rd commit message:

refactor tokenizer

# This is the 4th commit message:

fix bug when url contains directories the entire url would not be a single token
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant