Project

General

Profile

Bug #1613

Removing bad KPs which have comma and header variants also have comma

Added by Nandini Bansal about 3 years ago. Updated about 3 years ago.

Status:
Closed
Priority:
Normal
Assignee:
-
Target version:
Start date:
09/06/2021
Due date:
% Done:

0%

Estimated time:
3.00 h

Description

In the C API book, we have cases where both KP and header variant has comma but the KP is bad as all the words are not matching with the header variant. Although the similarity scores are very low still they should not be present in the final annotated text file.
Example added in the screenshot below. The changes will be made in BR3_IR3_tagger.py.

"object, view-"
"object, use"
"object, i.e."
All the three KPs matched to the "objects, types and reference counts" header.

We need to fix this.

Test the changes with C-API, Whirlwind & Library Reference books


Files

Also available in: Atom PDF