Project

General

Profile

Activity

From 08/17/2021 to 09/15/2021

09/15/2021

04:29 AM Task #1653 (Closed): Adding all the header variants generated by variation_middle_parenthesis to processed_full_header with fullness_ratio 1.0
An extension of Feature #1615,
For all remaining header variants generated by the variation_middle_parenthesis fu...
Nandini Bansal

09/13/2021

08:31 AM Bug #1644 (Closed): Testing Change: Modify the method of header variants generation using variations_in_common_section_words
For variations_in_common_section_words, we need to modify the header variants generated by this function. It makes us... Nandini Bansal
08:22 AM Task #1643 (Closed): Modification in update_doc_id_score_list from tagging_utils.py such that for doc_ids with scores same as self links are reduced by 0.05
In Task #1561, we allowed the doc_ids which have the same sim_score as the self-link. However, we now want the scores... Nandini Bansal

09/06/2021

02:24 PM Bug #1616 (Closed): Some unwanted removals of KPs starting with VBG/IN
As per Issue #1514 & #1520, we discarded some KPs which were starting with the following POS tags "IN", "VBG", "VB". ... Nandini Bansal
01:19 PM Feature #1615 (Closed): Add the str_between of variation_middle_parenthesis to processed_full_header list
In the C-API book, I have seen cases where the string extracted by the variation_middle_parenthesis is being tagged w... Nandini Bansal
12:31 PM Bug #1613 (Closed): Removing bad KPs which have comma and header variants also have comma
In the C API book, we have cases where both KP and header variant has comma but the KP is bad as all the words are no... Nandini Bansal

09/02/2021

02:08 PM Bug #1599 (Closed): Modification in variation_variable_declarations to change the return values of the function for some cases of headers
variation_variable_declarations function is responsible for generating header variants for headers which follow the f... Nandini Bansal
12:59 PM Feature #1598 (Closed): Remove return datatype from headers with empty parenthesis
The remove_function_signature function removes datatype and parenthesized string only when parenthesized string conta... Nandini Bansal

09/01/2021

01:38 PM Feature #1593 (Rejected): Eliminate certain header variants generated from variations_in_common_section_words
For cases where two-word header_var is entirely made up of NOUNs/PROPNs and a single word tmp_variant is generated an... Nandini Bansal
10:09 AM Bug #1591 (New): Change in the scheme of tagging partial KPs with hyphen
A partial KP should be tagged only when the same KP is not present in the vicinity Nandini Bansal
07:19 AM Feature #1587 (Closed): Discard and redude the scores of KPs with apostrophe when the header variant does not contain it
For this task, we need to make changes in the mapping_phrase_docid where we can identify the KP which contains apostr... Nandini Bansal

08/30/2021

04:58 AM Support #1570 (Closed): Reduce the time taken by get_candidates_for_variant function after modifications for matching a hyphenated word with a header without hyphen
For the Library Reference book, it is observed that after making the above changes of Issue #1544, the time taken by ... Nandini Bansal

08/27/2021

04:43 AM Task #1561 (Closed): Modification in tagging_utils.py such that doc_ids with sim_score equal to the kp_doc_id are not removed
We recently made changes in the code that makes sure that if KP is linked to the same document in which is tagged, th... Nandini Bansal
04:34 AM Task #1560 (Closed): Finding different POS tags that can be stripped from the beginning/ending of the keyphrase to result in better KPs
Just ANP/IN POS tags, we need to identify other cases of POS tags that contribute to unwanted words in the beginning ... Nandini Bansal

08/25/2021

05:03 PM Feature #1557 (Resolved): Ensure some documents do not show up in purple link results
Sometimes, we know that we do not want our annotation to point to sections like https://docs.python.org/3/howto/urlli... Ram Kordale

08/24/2021

09:18 AM Feature #1544 (Closed): Changes to match a token/word with hyphen with a header variant which does not have a hyphen but is exactly same
This is a new feature we would like to add to our pipeline. We have observed a case where a token in the Whirlwind te... Nandini Bansal
06:57 AM Task #1543 (Closed): Refactor the save_candidates function from BR3_IR3_tagger.py
In this task, we need to refactor a small portion of code from the "save_candidates" function from BR3_IR3_tagger.py.... Nandini Bansal

08/20/2021

08:04 AM Task #1521 (Rejected): Changes to ensure that singular and plural forms of key phrases are also checked in 20K CW in extract_single_uncommon_words method
Nandini Bansal
08:03 AM Task #1521 (Rejected): Changes to ensure that singular and plural forms of key phrases are also checked in 20K CW in extract_single_uncommon_words method
In *extract_single_uncommon_words* from *BR3_IR3_tagger.py*, all the single word keyphrases pass through the 20K comm... Nandini Bansal
08:04 AM Task #1522 (Closed): Changes to ensure that singular and plural forms of key phrases are also checked in 20K CW in extract_single_uncommon_words method
In *extract_single_uncommon_words* from *BR3_IR3_tagger.py*, all the single word keyphrases pass through the 20K comm... Nandini Bansal
05:49 AM Bug #1520 (Closed): Instead of removing entire key phrase starting with IN pos tag for all cases, we can process the keyphrase and discard just the word
This is an experiment that can be conducted to identify the right course of action for this task. While the KPs start... Nandini Bansal

08/19/2021

12:59 PM Bug #1515 (Closed): Bug in addition of P's in KP tagging
In the whirlwind book, a token "range(5)" was tagged as BR2P. As a result of which there is a very ugly tagging of th... Nandini Bansal
12:49 PM Bug #1514 (Resolved): Removing key phrases which are starting with IN pos tag or a preposition
There are some cases of key phrases such as:
1. "within regular expressions" linked to "regular expressions"
2....
Nandini Bansal
06:50 AM Bug #1506: Analyse & Discard single-word KPs with "ADV" POS tag
URL: https://edutestdev-240612.appspot.com/document/python-whirlwind-tour/m?documentURL=10054%2Fds9aug1528%2FWhirlwin... Nandini Bansal
06:49 AM Bug #1507: Bad looking key phrases following the pattern: "word1, word2" while the header variant is "word1 word2"
In Python Whirlwind Tour.txt, we are seeing some tagged KPs such as "python, code" linked to the "How to run Python C... Nandini Bansal

08/18/2021

03:02 PM Bug #1507 (Closed): Bad looking key phrases following the pattern: "word1, word2" while the header variant is "word1 word2"
In Python Whirlwind Tour.txt, we are seeing some tagged KPs such as "python, code" linked to the "How to run Python C... Nandini Bansal
02:53 PM Bug #1506 (Closed): Analyse & Discard single-word KPs with "ADV" POS tag
In annotated "Python Whirlwind Tour.txt", we have seen some terrible looking tagged keyphrases such as: generally, fu... Nandini Bansal
02:30 PM Task #1505 (Rejected): Sample task
Sample task Ram Kordale
 

Also available in: Atom