Project

General

Profile

Activity

From 09/17/2021 to 10/16/2021

10/14/2021

01:41 PM Bug #1755 (Resolved): In partial_header_match, increase penalty for some KPs where start and end words are same as header variant
There are cases where word count of KP > word count of header variant and the uncommon word in the KP is VERB. The PO... Nandini Bansal
01:37 PM Bug #1754 (Closed): In partial_header_match, increase penalty for some KPs where start and end words are same as header variant
For some cases, the word count of KPs and header variant is the same but they have uncommon middle words which make t... Nandini Bansal
01:27 PM Bug #1753 (Closed): In partial_header_match, skip penalty for some KPs where start and end words are same as header variant
There were observed some cases where the KP is a proper subset of the header variant but it was penalized because the... Nandini Bansal
01:20 PM Bug #1752 (Resolved): In partial_header_match, reduce penalty for some KPs where start and end words are same as header variant
In partial_header_match, we have a filter after the generation of candidates where we penalize the KPs because they s... Nandini Bansal
08:17 AM Bug #1751 (Resolved): ADR includes rows with "New Added Docs" but the additions look wrong
Nandini Bansal
06:56 AM Bug #1751 (In Progress): ADR includes rows with "New Added Docs" but the additions look wrong
Nandini Bansal
06:56 AM Bug #1751 (Closed): ADR includes rows with "New Added Docs" but the additions look wrong
Need to verify the code for generation of ADR to understand why there are rows showing "New Docs Added" when the anno... Nandini Bansal

10/13/2021

01:21 PM Bug #1742 (Resolved): Preventing removal of KPs that are getting removed due to new/latest changes
Rohit Choudhary
12:28 PM Bug #1742 (In Progress): Preventing removal of KPs that are getting removed due to new/latest changes
Rohit Choudhary
06:02 AM Bug #1742 (Resolved): Preventing removal of KPs that are getting removed due to new/latest changes
There are cases of KPs which are getting removed (most prominently in Library Reference) due to recent changes. It in... Nandini Bansal
01:20 PM Bug #1745 (Resolved): Skip all the KPs starting with words within CW 300
Rohit Choudhary
12:27 PM Bug #1745 (In Progress): Skip all the KPs starting with words within CW 300
Rohit Choudhary
07:10 AM Bug #1745 (Closed): Skip all the KPs starting with words within CW 300
Upon making changes in the generate_candidates function, we saw that a lot of KPs were getting tagged with the starti... Nandini Bansal
01:02 PM Task #1653 (Resolved): Adding all the header variants generated by variation_middle_parenthesis to processed_full_header with fullness_ratio 1.0
Rohit Choudhary
01:02 PM Task #1653 (In Progress): Adding all the header variants generated by variation_middle_parenthesis to processed_full_header with fullness_ratio 1.0
Rohit Choudhary
12:56 PM Task #1643 (Resolved): Modification in update_doc_id_score_list from tagging_utils.py such that for doc_ids with scores same as self links are reduced by 0.05
Rohit Choudhary
12:55 PM Task #1643 (In Progress): Modification in update_doc_id_score_list from tagging_utils.py such that for doc_ids with scores same as self links are reduced by 0.05
Rohit Choudhary
12:37 PM Bug #1688 (In Progress): KeyError in kp_variant_log
Rohit Choudhary
12:35 PM Bug #1507 (Resolved): Bad looking key phrases following the pattern: "word1, word2" while the header variant is "word1 word2"
Anonymous
12:35 PM Task #1522 (Resolved): Changes to ensure that singular and plural forms of key phrases are also checked in 20K CW in extract_single_uncommon_words method
Anonymous
12:34 PM Task #1522 (In Progress): Changes to ensure that singular and plural forms of key phrases are also checked in 20K CW in extract_single_uncommon_words method
Anonymous
12:34 PM Bug #1616 (Resolved): Some unwanted removals of KPs starting with VBG/IN
Rohit Choudhary
12:34 PM Task #1560 (Closed): Finding different POS tags that can be stripped from the beginning/ending of the keyphrase to result in better KPs
Anonymous
12:33 PM Task #1560 (Resolved): Finding different POS tags that can be stripped from the beginning/ending of the keyphrase to result in better KPs
Anonymous
12:33 PM Task #1560 (In Progress): Finding different POS tags that can be stripped from the beginning/ending of the keyphrase to result in better KPs
Anonymous
12:34 PM Task #1543 (Resolved): Refactor the save_candidates function from BR3_IR3_tagger.py
Anonymous
12:34 PM Task #1543 (In Progress): Refactor the save_candidates function from BR3_IR3_tagger.py
Anonymous
12:34 PM Feature #1615 (Resolved): Add the str_between of variation_middle_parenthesis to processed_full_header list
Rohit Choudhary
12:33 PM Feature #1544 (Resolved): Changes to match a token/word with hyphen with a header variant which does not have a hyphen but is exactly same
Anonymous
12:31 PM Task #1561 (Resolved): Modification in tagging_utils.py such that doc_ids with sim_score equal to the kp_doc_id are not removed
Anonymous
12:31 PM Task #1561 (In Progress): Modification in tagging_utils.py such that doc_ids with sim_score equal to the kp_doc_id are not removed
Anonymous
12:31 PM Feature #1598 (Resolved): Remove return datatype from headers with empty parenthesis
Rohit Choudhary
12:31 PM Task #1657 (In Progress): Modifications in remove_function_signature method to handle some new cases
Rohit Choudhary
12:30 PM Bug #1712 (Resolved): Discarding bad KPs due to uncommon word at the beginning or end
Rohit Choudhary
12:29 PM Support #1570 (Resolved): Reduce the time taken by get_candidates_for_variant function after modifications for matching a hyphenated word with a header without hyphen
Anonymous
12:29 PM Bug #1746 (In Progress): Checking the vector similarity and wordnet similarity of "options" and "optional" to unlink them
Rohit Choudhary
07:17 AM Bug #1746 (Resolved): Checking the vector similarity and wordnet similarity of "options" and "optional" to unlink them
After the addition of the "CC" POS tag in the list of POS tags in the generate_candidates function, we saw that "opti... Nandini Bansal
12:27 PM Bug #1599 (Resolved): Modification in variation_variable_declarations to change the return values of the function for some cases of headers
Anonymous
12:27 PM Bug #1613 (Resolved): Removing bad KPs which have comma and header variants also have comma
Anonymous
12:25 PM Bug #1644 (Resolved): Testing Change: Modify the method of header variants generation using variations_in_common_section_words
Anonymous
12:24 PM Task #1663 (In Progress): Adding new header variants from pos_nouns function with fullness_ratio 1.0
Anonymous
12:24 PM Bug #1710 (Resolved): Skip tokens that are URLs and file paths from tagging
Anonymous
12:23 PM Bug #1743 (In Progress): Checking singular and plural forms of the tmp_var from variations_in_common_section_words in common words list
Anonymous
06:53 AM Bug #1743 (Resolved): Checking singular and plural forms of the tmp_var from variations_in_common_section_words in common words list
An experimentation approach to further extend this task. Earlier we were only checking the tmp_var in the 4K CW list.... Nandini Bansal
12:23 PM Bug #1744 (In Progress): Calculating the fullness_ratio of the header variants to decide a threshold for removal of header variants
Anonymous
07:00 AM Bug #1744 (Resolved): Calculating the fullness_ratio of the header variants to decide a threshold for removal of header variants
Using the original headers, calculate the fullness_ratio of the header variants wrt the original headers to see if we... Nandini Bansal
12:21 PM Task #1726 (In Progress): Handling cases of bad header variants like "representation"
Anonymous
06:12 AM Task #1726: Handling cases of bad header variants like "representation"
Estimate time increased as we are stuck with some cases that are difficult to manage Nandini Bansal
10:26 AM Bug #1747 (In Progress): Add condition for "callbacks" in the lemmatization wrapper
"callback" & "callbacks" are not being lemmatised to the same root word allowing close by tagging of the KPs in the "... Nandini Bansal
08:24 AM Task #1505 (Rejected): Sample task
Sample ticket Ram Kordale
06:41 AM Bug #1670 (Resolved): Fix the bug in saving has_noun dictionary values
Nandini Bansal
06:41 AM Feature #1708 (Resolved): For KPs located very closely, pick the one which is most similar & add a wrapper for lemmatisation to handle some exception cases
Nandini Bansal
06:41 AM Bug #1515 (Resolved): Bug in addition of P's in KP tagging
Nandini Bansal
06:40 AM Bug #1669 (In Progress): Getting rid of all dependency on Colab for the current code path
Nandini Bansal

10/11/2021

05:42 AM Task #1726 (Resolved): Handling cases of bad header variants like "representation"
In BR3_IR3_tagger.py, we have a function called *variations_in_common_section_words* that strips all the common words... Nandini Bansal

10/06/2021

11:34 AM Bug #1712 (Closed): Discarding bad KPs due to uncommon word at the beginning or end
https://edutestdev-240612.appspot.com/document/python-3-tutorial-pl-2021-10-04-105027.800623-spl/python?documentURL=1... Nandini Bansal
10:54 AM Task #1711 (New): Establish similarity between 'a b' and 'c-or-a b' (also 'c b' and 'c-or-a b')
"positional arguments" can be a purple or bold purple link in https://edutestdev-240612.appspot.com/document/python-... Ram Kordale
10:25 AM Bug #1710 (Closed): Skip tokens that are URLs and file paths from tagging
In recreate function, before calling the match_token_and_KP() function, check whether the current token is a URL or f... Nandini Bansal
07:12 AM Bug #1709 (New): Find similarity between docstring and 'Documentation Strings'
In page https://edutestdev-240612.appspot.com/document/python-3-tutorial-pl-2021-10-04-105027.800623-spl/python?docum... Ram Kordale
05:20 AM Feature #1708 (Closed): For KPs located very closely, pick the one which is most similar & add a wrapper for lemmatisation to handle some exception cases
Looking at the screenshot attached below, we can see "iteration", "iterable" & "iterator" are tagged very closely. Th... Nandini Bansal

09/30/2021

08:26 AM Bug #1688 (In Progress): KeyError in kp_variant_log
kp_variant_log is a dictionary that keeps track of the variant with which the KP was linked/tagged. The key of the di... Nandini Bansal

09/20/2021

04:50 AM Bug #1670 (Closed): Fix the bug in saving has_noun dictionary values
I found an inconsistency in the way values are being stored in the has_noun function in two different places in BR3_I... Nandini Bansal

09/19/2021

12:25 PM Bug #1669 (Resolved): Getting rid of all dependency on Colab for the current code path
Ram Kordale

09/17/2021

08:59 AM Task #1663 (Resolved): Adding new header variants from pos_nouns function with fullness_ratio 1.0
The main goal behind this task is to add some meaningful header variants with fullness_ratio 1.0 if a large portion o... Nandini Bansal
 

Also available in: Atom