Activity
From 09/17/2021 to 10/16/2021
10/14/2021
- 01:41 PM Bug #1755 (Resolved): In partial_header_match, increase penalty for some KPs where start and end words are same as header variant
- There are cases where the word count of the KP exceeds the word count of the header variant and the uncommon word in the KP is a VERB. The PO...
- 01:37 PM Bug #1754 (Closed): In partial_header_match, increase penalty for some KPs where start and end words are same as header variant
- In some cases, the word counts of the KP and the header variant are the same, but they have uncommon middle words which make t...
- 01:27 PM Bug #1753 (Closed): In partial_header_match, skip penalty for some KPs where start and end words are same as header variant
- Some cases were observed where the KP is a proper subset of the header variant but it was penalized because the...
- 01:20 PM Bug #1752 (Resolved): In partial_header_match, reduce penalty for some KPs where start and end words are same as header variant
- In partial_header_match, we have a filter after the generation of candidates where we penalize the KPs because they s...
- 08:17 AM Bug #1751 (Resolved): ADR includes rows with "New Added Docs" but the additions look wrong
- 06:56 AM Bug #1751 (In Progress): ADR includes rows with "New Added Docs" but the additions look wrong
- 06:56 AM Bug #1751 (Closed): ADR includes rows with "New Added Docs" but the additions look wrong
- Need to verify the ADR generation code to understand why there are rows showing "New Docs Added" when the anno...
10/13/2021
- 01:21 PM Bug #1742 (Resolved): Preventing removal of KPs that are getting removed due to new/latest changes
- 12:28 PM Bug #1742 (In Progress): Preventing removal of KPs that are getting removed due to new/latest changes
- 06:02 AM Bug #1742 (Resolved): Preventing removal of KPs that are getting removed due to new/latest changes
- There are cases of KPs which are getting removed (most prominently in Library Reference) due to recent changes. It in...
- 01:20 PM Bug #1745 (Resolved): Skip all the KPs starting with words within CW 300
- 12:27 PM Bug #1745 (In Progress): Skip all the KPs starting with words within CW 300
- 07:10 AM Bug #1745 (Closed): Skip all the KPs starting with words within CW 300
- Upon making changes in the generate_candidates function, we saw that a lot of KPs were getting tagged with the starti...
- 01:02 PM Task #1653 (Resolved): Adding all the header variants generated by variation_middle_parenthesis to processed_full_header with fullness_ratio 1.0
- 01:02 PM Task #1653 (In Progress): Adding all the header variants generated by variation_middle_parenthesis to processed_full_header with fullness_ratio 1.0
- 12:56 PM Task #1643 (Resolved): Modification in update_doc_id_score_list from tagging_utils.py such that for doc_ids with scores same as self links are reduced by 0.05
- 12:55 PM Task #1643 (In Progress): Modification in update_doc_id_score_list from tagging_utils.py such that for doc_ids with scores same as self links are reduced by 0.05
- 12:37 PM Bug #1688 (In Progress): KeyError in kp_variant_log
- 12:35 PM Bug #1507 (Resolved): Bad looking key phrases following the pattern: "word1, word2" while the header variant is "word1 word2"
- 12:35 PM Task #1522 (Resolved): Changes to ensure that singular and plural forms of key phrases are also checked in 20K CW in extract_single_uncommon_words method
- 12:34 PM Task #1522 (In Progress): Changes to ensure that singular and plural forms of key phrases are also checked in 20K CW in extract_single_uncommon_words method
- 12:34 PM Bug #1616 (Resolved): Some unwanted removals of KPs starting with VBG/IN
- 12:34 PM Task #1560 (Closed): Finding different POS tags that can be stripped from the beginning/ending of the keyphrase to result in better KPs
- 12:33 PM Task #1560 (Resolved): Finding different POS tags that can be stripped from the beginning/ending of the keyphrase to result in better KPs
- 12:33 PM Task #1560 (In Progress): Finding different POS tags that can be stripped from the beginning/ending of the keyphrase to result in better KPs
- 12:34 PM Task #1543 (Resolved): Refactor the save_candidates function from BR3_IR3_tagger.py
- 12:34 PM Task #1543 (In Progress): Refactor the save_candidates function from BR3_IR3_tagger.py
- 12:34 PM Feature #1615 (Resolved): Add the str_between of variation_middle_parenthesis to processed_full_header list
- 12:33 PM Feature #1544 (Resolved): Changes to match a token/word with hyphen with a header variant which does not have a hyphen but is exactly same
- 12:31 PM Task #1561 (Resolved): Modification in tagging_utils.py such that doc_ids with sim_score equal to the kp_doc_id are not removed
- 12:31 PM Task #1561 (In Progress): Modification in tagging_utils.py such that doc_ids with sim_score equal to the kp_doc_id are not removed
- 12:31 PM Feature #1598 (Resolved): Remove return datatype from headers with empty parenthesis
- 12:31 PM Task #1657 (In Progress): Modifications in remove_function_signature method to handle some new cases
- 12:30 PM Bug #1712 (Resolved): Discarding bad KPs due to uncommon word at the beginning or end
- 12:29 PM Support #1570 (Resolved): Reduce the time taken by get_candidates_for_variant function after modifications for matching a hyphenated word with a header without hyphen
- 12:29 PM Bug #1746 (In Progress): Checking the vector similarity and wordnet similarity of "options" and "optional" to unlink them
- 07:17 AM Bug #1746 (Resolved): Checking the vector similarity and wordnet similarity of "options" and "optional" to unlink them
- After the addition of the "CC" POS tag to the list of POS tags in the generate_candidates function, we saw that "opti...
- 12:27 PM Bug #1599 (Resolved): Modification in variation_variable_declarations to change the return values of the function for some cases of headers
- 12:27 PM Bug #1613 (Resolved): Removing bad KPs which have comma and header variants also have comma
- 12:25 PM Bug #1644 (Resolved): Testing Change: Modify the method of header variants generation using variations_in_common_section_words
- 12:24 PM Task #1663 (In Progress): Adding new header variants from pos_nouns function with fullness_ratio 1.0
- 12:24 PM Bug #1710 (Resolved): Skip tokens that are URLs and file paths from tagging
- 12:23 PM Bug #1743 (In Progress): Checking singular and plural forms of the tmp_var from variations_in_common_section_words in common words list
- 06:53 AM Bug #1743 (Resolved): Checking singular and plural forms of the tmp_var from variations_in_common_section_words in common words list
- An experimental approach to extend this task further. Earlier we were only checking the tmp_var in the 4K CW list....
- 12:23 PM Bug #1744 (In Progress): Calculating the fullness_ratio of the header variants to decide a threshold for removal of header variants
- 07:00 AM Bug #1744 (Resolved): Calculating the fullness_ratio of the header variants to decide a threshold for removal of header variants
- Using the original headers, calculate the fullness_ratio of the header variants with respect to the original headers to see if we...
- 12:21 PM Task #1726 (In Progress): Handling cases of bad header variants like "representation"
- 06:12 AM Task #1726: Handling cases of bad header variants like "representation"
- Estimated time increased because some cases are proving difficult to handle
- 10:26 AM Bug #1747 (In Progress): Add condition for "callbacks" in the lemmatization wrapper
- "callback" and "callbacks" are not being lemmatised to the same root word, which allows the KPs to be tagged close together in the "...
- 08:24 AM Task #1505 (Rejected): Sample task
- Sample ticket
- 06:41 AM Bug #1670 (Resolved): Fix the bug in saving has_noun dictionary values
- 06:41 AM Feature #1708 (Resolved): For KPs located very closely, pick the one which is most similar & add a wrapper for lemmatisation to handle some exception cases
- 06:41 AM Bug #1515 (Resolved): Bug in addition of P's in KP tagging
- 06:40 AM Bug #1669 (In Progress): Getting rid of all dependency on Colab for the current code path
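
Note on Bug #1747 and Feature #1708: a lemmatisation wrapper with an exception table might look roughly like the sketch below. This is only an illustration under stated assumptions. The names LEMMA_EXCEPTIONS, naive_lemmatize and lemmatize_with_exceptions are hypothetical and do not come from the project code, and the stand-in base lemmatizer simply returns the word unchanged to mimic the reported failure, where "callback" and "callbacks" end up with different roots.

```python
from typing import Callable, Dict, Optional

# Hypothetical exception table. The only pair named in the ticket is
# "callback" / "callbacks"; any further entries would be assumptions.
LEMMA_EXCEPTIONS: Dict[str, str] = {"callbacks": "callback"}


def naive_lemmatize(word: str) -> str:
    """Stand-in for the real lemmatizer (assumption): returns the word as-is."""
    return word


def lemmatize_with_exceptions(
    word: str,
    base: Callable[[str], str] = naive_lemmatize,
    exceptions: Optional[Dict[str, str]] = None,
) -> str:
    """Check the exception table first, then fall back to the base lemmatizer."""
    table = LEMMA_EXCEPTIONS if exceptions is None else exceptions
    key = word.lower()
    if key in table:
        return table[key]
    return base(key)


def same_root(a: str, b: str) -> bool:
    """True when both words lemmatise to the same root through the wrapper."""
    return lemmatize_with_exceptions(a) == lemmatize_with_exceptions(b)
```

With the exception entry in place, same_root("callback", "callbacks") holds even though the stand-in base lemmatizer alone would keep the two words apart. Passing the base lemmatizer as a parameter keeps the wrapper independent of whichever library the project actually uses.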
10/11/2021
- 05:42 AM Task #1726 (Resolved): Handling cases of bad header variants like "representation"
- In BR3_IR3_tagger.py, we have a function called *variations_in_common_section_words* that strips all the common words...
10/06/2021
- 11:34 AM Bug #1712 (Closed): Discarding bad KPs due to uncommon word at the beginning or end
- https://edutestdev-240612.appspot.com/document/python-3-tutorial-pl-2021-10-04-105027.800623-spl/python?documentURL=1...
- 10:54 AM Task #1711 (New): Establish similarity between 'a b' and 'c-or-a b' (also 'c b' and 'c-or-a b')
- "positional arguments" can be a purple or bold purple link in https://edutestdev-240612.appspot.com/document/python-...
- 10:25 AM Bug #1710 (Closed): Skip tokens that are URLs and file paths from tagging
- In the recreate function, before calling the match_token_and_KP() function, check whether the current token is a URL or f...
- 07:12 AM Bug #1709 (New): Find similarity between docstring and 'Documentation Strings'
- On page https://edutestdev-240612.appspot.com/document/python-3-tutorial-pl-2021-10-04-105027.800623-spl/python?docum...
- 05:20 AM Feature #1708 (Closed): For KPs located very closely, pick the one which is most similar & add a wrapper for lemmatisation to handle some exception cases
- Looking at the screenshot attached below, we can see "iteration", "iterable" & "iterator" are tagged very closely. Th...
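
Note on Bug #1710: the check for URL and file-path tokens described above could be sketched as follows. This is a minimal illustration, not the project's implementation. The helper names is_url, is_file_path, is_url_or_path and taggable_tokens are hypothetical, and the regular expressions are assumptions about what counts as a path. In the real code the check would run inside the recreate function before match_token_and_KP() is called.

```python
import re
from typing import List
from urllib.parse import urlparse

# Assumed path shapes: a drive letter, "/", "./", "../" or "~/" prefix
# followed by at least one more character ...
_PATH_PREFIX_RE = re.compile(r"^(?:[A-Za-z]:[\\/]|\.{0,2}/|~/)\S")
# ... or a relative path with a separator that ends in a short file extension.
_PATH_WITH_EXT_RE = re.compile(r"^[\w.-]+(?:[\\/][\w.-]+)+\.\w{1,5}$")


def is_url(token: str) -> bool:
    """True for tokens with a web scheme and host, or a leading 'www.'."""
    if token.startswith("www."):
        return True
    try:
        parsed = urlparse(token)
    except ValueError:
        # urlparse rejects some malformed inputs; treat those as non-URLs.
        return False
    return parsed.scheme in {"http", "https", "ftp"} and bool(parsed.netloc)


def is_file_path(token: str) -> bool:
    """True for tokens that look like absolute or relative file paths."""
    return bool(_PATH_PREFIX_RE.match(token) or _PATH_WITH_EXT_RE.match(token))


def is_url_or_path(token: str) -> bool:
    return is_url(token) or is_file_path(token)


def taggable_tokens(tokens: List[str]) -> List[str]:
    """Drop URL and file-path tokens before any KP matching is attempted."""
    return [t for t in tokens if not is_url_or_path(t)]
```

Requiring a file extension for prefix-less relative paths is a deliberate choice in this sketch, so that ordinary slash-joined words such as "and/or" are still passed on to tagging.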
09/30/2021
- 08:26 AM Bug #1688 (In Progress): KeyError in kp_variant_log
- kp_variant_log is a dictionary that keeps track of the variant with which the KP was linked/tagged. The key of the di...
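
Note on Bug #1688: the ticket text is truncated, so the actual key of kp_variant_log and the cause of the KeyError are not known here. Purely as an assumption, the sketch below keys the dictionary by a normalised KP string and reads it with dict.get, so that a missing entry yields None instead of raising. The helpers normalise_kp, record_variant and lookup_variant are hypothetical names.

```python
from typing import Dict, Optional


def normalise_kp(kp: str) -> str:
    """Assumed key normalisation: trim whitespace and lowercase the KP."""
    return kp.strip().lower()


def record_variant(kp_variant_log: Dict[str, str], kp: str, variant: str) -> None:
    """Remember which header variant the KP was linked/tagged with."""
    kp_variant_log[normalise_kp(kp)] = variant


def lookup_variant(kp_variant_log: Dict[str, str], kp: str) -> Optional[str]:
    """Return the logged variant, or None when the KP was never logged."""
    return kp_variant_log.get(normalise_kp(kp))
```

Writing and reading through the same normalisation function is one way to avoid lookups that fail only because the key was spelled slightly differently at the two call sites.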
09/20/2021
- 04:50 AM Bug #1670 (Closed): Fix the bug in saving has_noun dictionary values
- I found an inconsistency in the way values are being stored in the has_noun function in two different places in BR3_I...
09/19/2021
09/17/2021
- 08:59 AM Task #1663 (Resolved): Adding new header variants from pos_nouns function with fullness_ratio 1.0
- The main goal behind this task is to add some meaningful header variants with fullness_ratio 1.0 if a large portion o...