Skip to content

Commit d3a34fd

Browse files
authored
Merge pull request #56 from m-misiura/add_slow_tokenizer
Update transformers to transformers==4.57.1; add slow tokenizers
2 parents 06c642a + d265b3e commit d3a34fd

File tree

2 files changed

+8
-2
lines changed

2 files changed

+8
-2
lines changed

detectors/huggingface/detector.py

Lines changed: 5 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -72,7 +72,11 @@ def initialize_model(self, model_files_path):
7272
"""
7373
Load and configure the model and tokenizer.
7474
"""
75-
self.tokenizer = AutoTokenizer.from_pretrained(model_files_path, use_fast=True)
75+
try:
76+
self.tokenizer = AutoTokenizer.from_pretrained(model_files_path, use_fast=True)
77+
except (ValueError, OSError, ImportError) as e:
78+
logger.warning(f"Failed to load fast tokenizer: {e}. Falling back to slow tokenizer.")
79+
self.tokenizer = AutoTokenizer.from_pretrained(model_files_path, use_fast=False)
7680
config = AutoConfig.from_pretrained(model_files_path)
7781
logger.info(f"Model Config: {config}")
7882

Lines changed: 3 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1 +1,3 @@
1-
transformers==4.50.0
1+
transformers==4.57.1
2+
sentencepiece==0.2.1
3+
tiktoken==0.12.0

0 commit comments

Comments
 (0)