-
Notifications
You must be signed in to change notification settings - Fork 754
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[DBNET]-Bad performance on long text detection #376
Comments
Same Issue. What's the problem of the origin config? |
Solved. There are 2 problems. The first one is the original shrink and unclip methods in paper, which is not suitable for long text ( the unclipped box is thinner than ground truth), so I changed these methods by my understanding. The second one is a bug in the
0.01 is too big for long texts. The program will get two points rather than 4 points under this setting. Set this value to 0.002 is much better. |
Thanks for sharing your solution. Does the final performance look all good? |
Yes, much better. |
Actually, I'm confused about PPOCR's results. They also use dbnet and the presentation pictures in the repo is pretty good on long texts. The only different I found in the code between mmocr and ppocr is they use bigger unclip ratio. |
It's probably because PPOCR uses much more private training data... |
I wrote a blog about this issue, if anyone is interested in this issue, check it out link |
I think so. It works well in my project. |
Hello, Thank you for sharing your method. I tried in my project, it works well for long text, but I found it unclip too much on short text. Do you have same problem? How you fixed it? |
Hi, what is your r setting, you can try smaller r than 0.4 in paper, like 0.2. |
@Sanster @fatfishZhao @viviayi @gaotongxiao # it is my config |
Hi, thanks for your great job.
data:image/s3,"s3://crabby-images/bc30f/bc30f5d95aca0faec9f7af8258c01d8e896b635e" alt="image"
data:image/s3,"s3://crabby-images/0ecf4/0ecf4cbcea4c398975ca6136e8c0ab09868b7cff" alt="image"
data:image/s3,"s3://crabby-images/7e42e/7e42eea7af1d1d47bb01f89e9e9ac3440cdd3f8a" alt="image"
data:image/s3,"s3://crabby-images/42487/42487e748c57ee157100874757ae325c9a1ee746" alt="image"
I'm using R50DCN dbnet for chinese text detection. I used about 10k pictures for training based on the pretrain model.
When testing, long text cannot be detected, some examples are in the bottom.
Can you give me some explanation of this performance? How can I fix this problem?
The text was updated successfully, but these errors were encountered: