Bug #1613
Removing bad KPs which have comma and header variants also have comma
Start date:
09/06/2021
Due date:
% Done:
0%
Estimated time:
3.00 h
Description
In the C API book, we have cases where both KP and header variant has comma but the KP is bad as all the words are not matching with the header variant. Although the similarity scores are very low still they should not be present in the final annotated text file.
Example added in the screenshot below. The changes will be made in BR3_IR3_tagger.py.
"object, view-"
"object, use"
"object, i.e."
All the three KPs matched to the "objects, types and reference counts" header.
We need to fix this.
Test the changes with C-API, Whirlwind & Library Reference books
Files