-
Notifications
You must be signed in to change notification settings - Fork 7.2k
Open
Labels
🐞 bugSomething isn't working, pull request that fix bug.Something isn't working, pull request that fix bug.🙋♀️ questionFurther information is requestedFurther information is requested
Description
Self Checks
- I have searched for existing issues search for existing issues, including closed ones.
- I confirm that I am using English to submit this report (Language Policy).
- Non-english title submitions will be closed directly ( 非英文标题的提交将会被直接关闭 ) (Language Policy).
- Please do not modify this template :) and fill in all the required fields.
Describe your problem
I encountered a 'text' error while using the pipeline to parse documents.
dataset——> files——>parse(ingestion pipeline)
18:04:33 Task has been received.
[File]:
18:04:34: File fetched.
18:04:34: Done
Start the pipeline...
[Parser_0]:
18:04:34: Start to work on a PDF.
18:04:44:
18:04:46: OCR finished (12.11s)
18:04:52: Layout analysis (6.75s)
18:04:53: Table analysis (0.62s)
18:04:53: Text merged (0.00s)
18:04:53: Structured (0.25s)
18:04:54: Done
[按标题分割_0]:
18:04:54: Start to merge hierarchically.
18:04:54: Done.
18:04:54: Done
[提取器_摘要]:
18:04:54: Start to generate.
18:04:55: Done
[提取器_关键词]:
18:04:55: Start to generate.
18:04:56: Done
[提取器_问题]:
18:04:57: Start to generate.
18:04:59: Done
[提取器_元数据]:
18:04:59: Start to generate.
18:05:01: Done
[分词器_0]:
18:05:01: Start to tokenize.
18:05:01: Finish tokenizing.
18:05:01: Start embedding inference.
18:05:01: Finish embedding.
18:05:01: Done
18:05:01 [ERROR][Exception]: 'text'
dosubot
Metadata
Metadata
Assignees
Labels
🐞 bugSomething isn't working, pull request that fix bug.Something isn't working, pull request that fix bug.🙋♀️ questionFurther information is requestedFurther information is requested