Activity - RK-A - Redmine

Activity

From 09/30/2021 to 10/29/2021

10/29/2021

01:29 PM Feature #1807 (In Progress): Add a functionality in the ADR to log the KPs where sim_scores are changing: Rohit Choudhary
01:28 PM Feature #1805 (In Progress): Remove KPs starting with numbers in words: Anonymous

10/28/2021

11:01 AM Feature #1817 (In Progress): Update the similar document score string for KPs matching with different header variants: Nandini Bansal
11:01 AM Feature #1817 (In Progress): Update the similar document score string for KPs matching with different header variants: For KPs, it could be possible that more than one subsection headers are similar to a KP, we should make such changes ... Nandini Bansal
06:56 AM Feature #1810 (Resolved): Implement the penalty algorithm for KPs matching with "word1.word2" header variants in update_similarity_with_context function: Anonymous
06:55 AM Feature #1811 (Resolved): For matching "word1" in with the KP, make use of token_processing function: Anonymous
06:55 AM Feature #1809 (Resolved): Checking the context of the KP matched with "word1.word2" header variant: Anonymous
06:53 AM Feature #1815 (In Progress): Implement a new context algorithm for the KPs matching with <word1.word2> subsection headers: Anonymous
05:50 AM Feature #1815 (In Progress): Implement a new context algorithm for the KPs matching with <word1.word2> subsection headers: In continuation of #1810
We need to implement a new context matching algorithm for the KPs matching with <word1.wo... Nandini Bansal
06:32 AM Feature #1766 (Resolved): Refactoring and updating the logic of update_tags function to make universal use in mainKP and subKP tagging: Nandini Bansal
06:32 AM Bug #1765 (Resolved): Updating the P of tagged KPs where subKP and mainKP are separated by symbols: Nandini Bansal
05:56 AM Bug #1744 (Resolved): Calculating the fullness_ratio of the header variants to decide a threshold for removal of header variants: Anonymous
05:56 AM Bug #1743 (Resolved): Checking singular and plural forms of the tmp_var from variations_in_common_section_words in common words list: Anonymous
05:55 AM Task #1726 (Resolved): Handling cases of bad header variants like "representation": Anonymous

10/27/2021

10:47 AM Feature #1813 (In Progress): Analysis for improvising the algorithms: Nandini Bansal
10:44 AM Feature #1813 (In Progress): Analysis for improvising the algorithms: All the cases where the KP and header variant "word1.word2" were matching have been logged and saved in a text file. ... Nandini Bansal
10:46 AM Feature #1812 (In Progress): Analysis for improvising the algorithms: Nandini Bansal
10:44 AM Feature #1812 (Rejected): Analysis for improvising the algorithms: All the cases where the KP and header variant "word1.word2" were matching have been logged and saved in a text file. ... Nandini Bansal
10:46 AM Feature #1811 (In Progress): For matching "word1" in with the KP, make use of token_processing function: Nandini Bansal
10:42 AM Feature #1811 (Resolved): For matching "word1" in with the KP, make use of token_processing function: To check whether "word1" of header variant "word2.word1" is matching to the KP which is getting tagged or not, we sho... Nandini Bansal
10:46 AM Feature #1810 (In Progress): Implement the penalty algorithm for KPs matching with "word1.word2" header variants in update_similarity_with_context function: Nandini Bansal
10:36 AM Feature #1810 (Resolved): Implement the penalty algorithm for KPs matching with "word1.word2" header variants in update_similarity_with_context function: After all the difficulties faced during the implementation of the above penalty in the BR3_IR3_tagger.py in the *gene... Nandini Bansal
10:46 AM Feature #1809 (In Progress): Checking the context of the KP matched with "word1.word2" header variant: Nandini Bansal
10:30 AM Feature #1809 (Resolved): Checking the context of the KP matched with "word1.word2" header variant: We have generally observed if the KP is matching with the "word1.word2" header variant with KP being equivalent to "w... Nandini Bansal

10/26/2021

12:03 PM Feature #1807 (Resolved): Add a functionality in the ADR to log the KPs where sim_scores are changing: As of now, if the KPs only have sim_score changes, the ADR does not log them but that shouldn't be the case. We need ... Nandini Bansal
06:07 AM Feature #1805 (Resolved): Remove KPs starting with numbers in words: In the C-API book, I have seen some cases where the KPs are wrongly starting with numbers in words like "one position... Nandini Bansal
06:07 AM Bug #1746 (Resolved): Checking the vector similarity and wordnet similarity of "options" and "optional" to unlink them: Rohit Choudhary
06:06 AM Task #1657 (Resolved): Modifications in remove_function_signature method to handle some new cases: Rohit Choudhary
06:05 AM Bug #1767 (Resolved): Add processed subsection headers in the final processed_subsections dictionary irrespective of skip_header value: Rohit Choudhary

10/25/2021

10:55 AM Bug #1752 (In Progress): In partial_header_match, reduce penalty for some KPs where start and end words are same as header variant: Rohit Choudhary

10/21/2021

10:23 AM Bug #1777 (In Progress): Correct processing of some headers to generate appropriate header variants: Rohit Choudhary
10:18 AM Bug #1777 (Resolved): Correct processing of some headers to generate appropriate header variants: It was observed in the C-API book that header variants are being generated inappropriately for some headers like "int... Nandini Bansal

10/20/2021

03:08 PM Task #1663 (Resolved): Adding new header variants from pos_nouns function with fullness_ratio 1.0: Anonymous
08:54 AM Bug #1751 (Closed): ADR includes rows with "New Added Docs" but the additions look wrong: Nandini Bansal
08:54 AM Bug #1745 (Closed): Skip all the KPs starting with words within CW 300: Nandini Bansal
08:53 AM Bug #1742 (Resolved): Preventing removal of KPs that are getting removed due to new/latest changes: Nandini Bansal
08:53 AM Bug #1742 (In Progress): Preventing removal of KPs that are getting removed due to new/latest changes: Nandini Bansal
08:52 AM Bug #1712 (Closed): Discarding bad KPs due to uncommon word at the beginning or end: Nandini Bansal
08:52 AM Bug #1710 (Closed): Skip tokens that are URLs and file paths from tagging: Nandini Bansal
08:50 AM Feature #1708 (Closed): For KPs located very closely, pick the one which is most similar & add a wrapper for lemmatisation to handle some exception cases: Nandini Bansal
08:50 AM Bug #1670 (Closed): Fix the bug in saving has_noun dictionary values: Nandini Bansal
08:50 AM Task #1653 (Closed): Adding all the header variants generated by variation_middle_parenthesis to processed_full_header with fullness_ratio 1.0: Nandini Bansal
08:50 AM Bug #1644 (Closed): Testing Change: Modify the method of header variants generation using variations_in_common_section_words: Nandini Bansal
08:41 AM Task #1643 (Closed): Modification in update_doc_id_score_list from tagging_utils.py such that for doc_ids with scores same as self links are reduced by 0.05: Nandini Bansal
08:41 AM Bug #1616 (Closed): Some unwanted removals of KPs starting with VBG/IN: Nandini Bansal
08:41 AM Feature #1615 (Closed): Add the str_between of variation_middle_parenthesis to processed_full_header list: Nandini Bansal
08:40 AM Bug #1613 (Closed): Removing bad KPs which have comma and header variants also have comma: Nandini Bansal
08:40 AM Bug #1599 (Closed): Modification in variation_variable_declarations to change the return values of the function for some cases of headers: Nandini Bansal
08:40 AM Feature #1598 (Closed): Remove return datatype from headers with empty parenthesis: Nandini Bansal
08:38 AM Feature #1587 (Closed): Discard and redude the scores of KPs with apostrophe when the header variant does not contain it: Nandini Bansal
07:07 AM Feature #1587 (Resolved): Discard and redude the scores of KPs with apostrophe when the header variant does not contain it: Rohit Choudhary
08:38 AM Support #1570 (Closed): Reduce the time taken by get_candidates_for_variant function after modifications for matching a hyphenated word with a header without hyphen: Nandini Bansal
08:37 AM Task #1561 (Closed): Modification in tagging_utils.py such that doc_ids with sim_score equal to the kp_doc_id are not removed: Nandini Bansal
08:37 AM Feature #1544 (Closed): Changes to match a token/word with hyphen with a header variant which does not have a hyphen but is exactly same: Nandini Bansal
08:37 AM Task #1543 (Closed): Refactor the save_candidates function from BR3_IR3_tagger.py: Nandini Bansal
08:36 AM Task #1522 (Closed): Changes to ensure that singular and plural forms of key phrases are also checked in 20K CW in extract_single_uncommon_words method: Nandini Bansal
08:36 AM Bug #1520 (Closed): Instead of removing entire key phrase starting with IN pos tag for all cases, we can process the keyphrase and discard just the word: Nandini Bansal
07:10 AM Bug #1520 (Resolved): Instead of removing entire key phrase starting with IN pos tag for all cases, we can process the keyphrase and discard just the word: Rohit Choudhary
08:36 AM Bug #1515 (Closed): Bug in addition of P's in KP tagging: Nandini Bansal
08:35 AM Bug #1507 (Closed): Bad looking key phrases following the pattern: "word1, word2" while the header variant is "word1 word2": Nandini Bansal
08:34 AM Bug #1506 (Closed): Analyse & Discard single-word KPs with "ADV" POS tag: Nandini Bansal
07:01 AM Bug #1506 (Resolved): Analyse & Discard single-word KPs with "ADV" POS tag: Rohit Choudhary
07:14 AM Bug #1514 (Resolved): Removing key phrases which are starting with IN pos tag or a preposition: Rohit Choudhary
06:01 AM Bug #1767 (In Progress): Add processed subsection headers in the final processed_subsections dictionary irrespective of skip_header value: Rohit Choudhary
05:03 AM Bug #1767 (Resolved): Add processed subsection headers in the final processed_subsections dictionary irrespective of skip_header value: As investigated by Rohit, the subsection headers with skip_header = True is not being added to the processed_subsecti... Nandini Bansal

10/19/2021

12:48 PM Feature #1766 (In Progress): Refactoring and updating the logic of update_tags function to make universal use in mainKP and subKP tagging: Nandini Bansal
12:48 PM Feature #1766 (Resolved): Refactoring and updating the logic of update_tags function to make universal use in mainKP and subKP tagging: We will have to update the update_tags function of the tagging_utils.py to make sure we can re-use the function for t... Nandini Bansal
12:42 PM Bug #1765 (In Progress): Updating the P of tagged KPs where subKP and mainKP are separated by symbols: Nandini Bansal
12:42 PM Bug #1765 (Resolved): Updating the P of tagged KPs where subKP and mainKP are separated by symbols: It was noticed that for single word subKPs and mainKPs separated by symbols, there were some discrepancies in the way... Nandini Bansal

10/14/2021

01:41 PM Bug #1755 (Resolved): In partial_header_match, increase penalty for some KPs where start and end words are same as header variant: There are cases where word count of KP > word count of header variant and the uncommon word in the KP is VERB. The PO... Nandini Bansal
01:37 PM Bug #1754 (Closed): In partial_header_match, increase penalty for some KPs where start and end words are same as header variant: For some cases, the word count of KPs and header variant is the same but they have uncommon middle words which make t... Nandini Bansal
01:27 PM Bug #1753 (Closed): In partial_header_match, skip penalty for some KPs where start and end words are same as header variant: There were observed some cases where the KP is a proper subset of the header variant but it was penalized because the... Nandini Bansal
01:20 PM Bug #1752 (Resolved): In partial_header_match, reduce penalty for some KPs where start and end words are same as header variant: In partial_header_match, we have a filter after the generation of candidates where we penalize the KPs because they s... Nandini Bansal
08:17 AM Bug #1751 (Resolved): ADR includes rows with "New Added Docs" but the additions look wrong: Nandini Bansal
06:56 AM Bug #1751 (In Progress): ADR includes rows with "New Added Docs" but the additions look wrong: Nandini Bansal
06:56 AM Bug #1751 (Closed): ADR includes rows with "New Added Docs" but the additions look wrong: Need to verify the code for generation of ADR to understand why there are rows showing "New Docs Added" when the anno... Nandini Bansal

10/13/2021

01:21 PM Bug #1742 (Resolved): Preventing removal of KPs that are getting removed due to new/latest changes: Rohit Choudhary
12:28 PM Bug #1742 (In Progress): Preventing removal of KPs that are getting removed due to new/latest changes: Rohit Choudhary
06:02 AM Bug #1742 (Resolved): Preventing removal of KPs that are getting removed due to new/latest changes: There are cases of KPs which are getting removed (most prominently in Library Reference) due to recent changes. It in... Nandini Bansal
01:20 PM Bug #1745 (Resolved): Skip all the KPs starting with words within CW 300: Rohit Choudhary
12:27 PM Bug #1745 (In Progress): Skip all the KPs starting with words within CW 300: Rohit Choudhary
07:10 AM Bug #1745 (Closed): Skip all the KPs starting with words within CW 300: Upon making changes in the generate_candidates function, we saw that a lot of KPs were getting tagged with the starti... Nandini Bansal
01:02 PM Task #1653 (Resolved): Adding all the header variants generated by variation_middle_parenthesis to processed_full_header with fullness_ratio 1.0: Rohit Choudhary
01:02 PM Task #1653 (In Progress): Adding all the header variants generated by variation_middle_parenthesis to processed_full_header with fullness_ratio 1.0: Rohit Choudhary
12:56 PM Task #1643 (Resolved): Modification in update_doc_id_score_list from tagging_utils.py such that for doc_ids with scores same as self links are reduced by 0.05: Rohit Choudhary
12:55 PM Task #1643 (In Progress): Modification in update_doc_id_score_list from tagging_utils.py such that for doc_ids with scores same as self links are reduced by 0.05: Rohit Choudhary
12:37 PM Bug #1688 (In Progress): KeyError in kp_variant_log: Rohit Choudhary
12:35 PM Bug #1507 (Resolved): Bad looking key phrases following the pattern: "word1, word2" while the header variant is "word1 word2": Anonymous
12:35 PM Task #1522 (Resolved): Changes to ensure that singular and plural forms of key phrases are also checked in 20K CW in extract_single_uncommon_words method: Anonymous
12:34 PM Task #1522 (In Progress): Changes to ensure that singular and plural forms of key phrases are also checked in 20K CW in extract_single_uncommon_words method: Anonymous
12:34 PM Bug #1616 (Resolved): Some unwanted removals of KPs starting with VBG/IN: Rohit Choudhary
12:34 PM Task #1560 (Closed): Finding different POS tags that can be stripped from the beginning/ending of the keyphrase to result in better KPs: Anonymous
12:33 PM Task #1560 (Resolved): Finding different POS tags that can be stripped from the beginning/ending of the keyphrase to result in better KPs: Anonymous
12:33 PM Task #1560 (In Progress): Finding different POS tags that can be stripped from the beginning/ending of the keyphrase to result in better KPs: Anonymous
12:34 PM Task #1543 (Resolved): Refactor the save_candidates function from BR3_IR3_tagger.py: Anonymous
12:34 PM Task #1543 (In Progress): Refactor the save_candidates function from BR3_IR3_tagger.py: Anonymous
12:34 PM Feature #1615 (Resolved): Add the str_between of variation_middle_parenthesis to processed_full_header list: Rohit Choudhary
12:33 PM Feature #1544 (Resolved): Changes to match a token/word with hyphen with a header variant which does not have a hyphen but is exactly same: Anonymous
12:31 PM Task #1561 (Resolved): Modification in tagging_utils.py such that doc_ids with sim_score equal to the kp_doc_id are not removed: Anonymous
12:31 PM Task #1561 (In Progress): Modification in tagging_utils.py such that doc_ids with sim_score equal to the kp_doc_id are not removed: Anonymous
12:31 PM Feature #1598 (Resolved): Remove return datatype from headers with empty parenthesis: Rohit Choudhary
12:31 PM Task #1657 (In Progress): Modifications in remove_function_signature method to handle some new cases: Rohit Choudhary
12:30 PM Bug #1712 (Resolved): Discarding bad KPs due to uncommon word at the beginning or end: Rohit Choudhary
12:29 PM Support #1570 (Resolved): Reduce the time taken by get_candidates_for_variant function after modifications for matching a hyphenated word with a header without hyphen: Anonymous
12:29 PM Bug #1746 (In Progress): Checking the vector similarity and wordnet similarity of "options" and "optional" to unlink them: Rohit Choudhary
07:17 AM Bug #1746 (Resolved): Checking the vector similarity and wordnet similarity of "options" and "optional" to unlink them: After the addition of the "CC" POS tag in the list of POS tags in the generate_candidates function, we saw that "opti... Nandini Bansal
12:27 PM Bug #1599 (Resolved): Modification in variation_variable_declarations to change the return values of the function for some cases of headers: Anonymous
12:27 PM Bug #1613 (Resolved): Removing bad KPs which have comma and header variants also have comma: Anonymous
12:25 PM Bug #1644 (Resolved): Testing Change: Modify the method of header variants generation using variations_in_common_section_words: Anonymous
12:24 PM Task #1663 (In Progress): Adding new header variants from pos_nouns function with fullness_ratio 1.0: Anonymous
12:24 PM Bug #1710 (Resolved): Skip tokens that are URLs and file paths from tagging: Anonymous
12:23 PM Bug #1743 (In Progress): Checking singular and plural forms of the tmp_var from variations_in_common_section_words in common words list: Anonymous
06:53 AM Bug #1743 (Resolved): Checking singular and plural forms of the tmp_var from variations_in_common_section_words in common words list: An experimentation approach to further extend this task. Earlier we were only checking the tmp_var in the 4K CW list.... Nandini Bansal
12:23 PM Bug #1744 (In Progress): Calculating the fullness_ratio of the header variants to decide a threshold for removal of header variants: Anonymous
07:00 AM Bug #1744 (Resolved): Calculating the fullness_ratio of the header variants to decide a threshold for removal of header variants: Using the original headers, calculate the fullness_ratio of the header variants wrt the original headers to see if we... Nandini Bansal
12:21 PM Task #1726 (In Progress): Handling cases of bad header variants like "representation": Anonymous
06:12 AM Task #1726: Handling cases of bad header variants like "representation": Estimate time increased as we are stuck with some cases that are difficult to manage Nandini Bansal
10:26 AM Bug #1747 (In Progress): Add condition for "callbacks" in the lemmatization wrapper: "callback" & "callbacks" are not being lemmatised to the same root word allowing close by tagging of the KPs in the "... Nandini Bansal
08:24 AM Task #1505 (Rejected): Sample task: Sample ticket Ram Kordale
06:41 AM Bug #1670 (Resolved): Fix the bug in saving has_noun dictionary values: Nandini Bansal
06:41 AM Feature #1708 (Resolved): For KPs located very closely, pick the one which is most similar & add a wrapper for lemmatisation to handle some exception cases: Nandini Bansal
06:41 AM Bug #1515 (Resolved): Bug in addition of P's in KP tagging: Nandini Bansal
06:40 AM Bug #1669 (In Progress): Getting rid of all dependency on Colab for the current code path: Nandini Bansal

10/11/2021

05:42 AM Task #1726 (Resolved): Handling cases of bad header variants like "representation": In BR3_IR3_tagger.py, we have a function called *variations_in_common_section_words* that strips all the common words... Nandini Bansal

10/06/2021

11:34 AM Bug #1712 (Closed): Discarding bad KPs due to uncommon word at the beginning or end: https://edutestdev-240612.appspot.com/document/python-3-tutorial-pl-2021-10-04-105027.800623-spl/python?documentURL=1... Nandini Bansal
10:54 AM Task #1711 (New): Establish similarity between 'a b' and 'c-or-a b' (also 'c b' and 'c-or-a b'): "positional arguments" can be a purple or bold purple link in https://edutestdev-240612.appspot.com/document/python-... Ram Kordale
10:25 AM Bug #1710 (Closed): Skip tokens that are URLs and file paths from tagging: In recreate function, before calling the match_token_and_KP() function, check whether the current token is a URL or f... Nandini Bansal
07:12 AM Bug #1709 (New): Find similarity between docstring and 'Documentation Strings': In page https://edutestdev-240612.appspot.com/document/python-3-tutorial-pl-2021-10-04-105027.800623-spl/python?docum... Ram Kordale
05:20 AM Feature #1708 (Closed): For KPs located very closely, pick the one which is most similar & add a wrapper for lemmatisation to handle some exception cases: Looking at the screenshot attached below, we can see "iteration", "iterable" & "iterator" are tagged very closely. Th... Nandini Bansal

09/30/2021

08:26 AM Bug #1688 (In Progress): KeyError in kp_variant_log: kp_variant_log is a dictionary that keeps track of the variant with which the KP was linked/tagged. The key of the di... Nandini Bansal

Also available in: Atom