Activity
From 08/18/2021 to 09/16/2021
09/16/2021
- 04:56 AM Task #1657 (Resolved): Modifications in remove_function_signature method to handle some new cases
- Some new cases of headers with functions have been found which are not being handled as given below:
1. time.clock_g...
09/15/2021
- 04:29 AM Task #1653 (Closed): Adding all the header variants generated by variation_middle_parenthesis to processed_full_header with fullness_ratio 1.0
- An extension of Feature #1615,
For all remaining header variants generated by the variation_middle_parenthesis fu...
09/13/2021
- 08:31 AM Bug #1644 (Closed): Testing Change: Modify the method of header variants generation using variations_in_common_section_words
- For variations_in_common_section_words, we need to modify the header variants generated by this function. It makes us...
- 08:22 AM Task #1643 (Closed): Modification in update_doc_id_score_list from tagging_utils.py such that scores of doc_ids equal to the self-link score are reduced by 0.05
- In Task #1561, we allowed the doc_ids which have the same sim_score as the self-link. However, we now want the scores...
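A minimal sketch of the adjustment named in Task #1643, assuming candidates arrive as (doc_id, score) pairs and that the self-link is the entry whose doc_id equals the KP's own document. The function name `penalize_self_score_ties` and the data shape are hypothetical; this is not the actual update_doc_id_score_list code.

```python
import math


def penalize_self_score_ties(doc_id_scores, self_doc_id, penalty=0.05):
    """Lower the score of every doc_id whose score ties with the self-link.

    doc_id_scores: list of (doc_id, score) pairs (assumed shape).
    self_doc_id:   doc_id of the document the KP is tagged in.
    """
    # Score of the self-link, if it is present in the list at all.
    self_score = next((s for d, s in doc_id_scores if d == self_doc_id), None)
    if self_score is None:
        return list(doc_id_scores)

    adjusted = []
    for doc_id, score in doc_id_scores:
        # Compare floats with a tolerance rather than with ==.
        if doc_id != self_doc_id and math.isclose(score, self_score):
            score = round(score - penalty, 4)
        adjusted.append((doc_id, score))
    return adjusted
```

Under these assumptions the self-link keeps its score while tied doc_ids drop just below it, so they stay in the list but rank lower.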
09/06/2021
- 02:24 PM Bug #1616 (Closed): Some unwanted removals of KPs starting with VBG/IN
- As per Issue #1514 & #1520, we discarded some KPs which were starting with the following POS tags "IN", "VBG", "VB". ...
- 01:19 PM Feature #1615 (Closed): Add the str_between of variation_middle_parenthesis to processed_full_header list
- In the C-API book, I have seen cases where the string extracted by the variation_middle_parenthesis is being tagged w...
- 12:31 PM Bug #1613 (Closed): Removing bad KPs which have comma and header variants also have comma
- In the C API book, we have cases where both the KP and the header variant have a comma but the KP is bad as all the words are no...
09/02/2021
- 02:08 PM Bug #1599 (Closed): Modification in variation_variable_declarations to change the return values of the function for some cases of headers
- variation_variable_declarations function is responsible for generating header variants for headers which follow the f...
- 12:59 PM Feature #1598 (Closed): Remove return datatype from headers with empty parenthesis
- The remove_function_signature function removes datatype and parenthesized string only when parenthesized string conta...
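One possible reading of Feature #1598, sketched with a regular expression: when a header ends in an empty pair of parentheses, keep only the function name. The header shapes used here (for example "PyObject* PyErr_NoMemory()") and the helper name `drop_return_type_for_empty_parens` are assumptions for illustration; the real remove_function_signature may behave differently.

```python
import re

# Optional leading datatype (anything ending in whitespace), optional pointer
# stars, then an identifier followed by empty parentheses.
_EMPTY_CALL = re.compile(r"^(?:.*\s)?\**(?P<name>[A-Za-z_]\w*)\s*\(\s*\)\s*$")


def drop_return_type_for_empty_parens(header):
    """Return just the function name for headers like 'int main()'.

    Headers that do not end in empty parentheses are returned unchanged.
    """
    match = _EMPTY_CALL.match(header.strip())
    return match.group("name") if match else header
```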
09/01/2021
- 01:38 PM Feature #1593 (Rejected): Eliminate certain header variants generated from variations_in_common_section_words
- For cases where two-word header_var is entirely made up of NOUNs/PROPNs and a single word tmp_variant is generated an...
- 10:09 AM Bug #1591 (New): Change in the scheme of tagging partial KPs with hyphen
- A partial KP should be tagged only when the same KP is not present in the vicinity
- 07:19 AM Feature #1587 (Closed): Discard and reduce the scores of KPs with apostrophe when the header variant does not contain it
- For this task, we need to make changes in the mapping_phrase_docid where we can identify the KP which contains apostr...
08/30/2021
- 04:58 AM Support #1570 (Closed): Reduce the time taken by get_candidates_for_variant function after modifications for matching a hyphenated word with a header without hyphen
- For the Library Reference book, it is observed that after making the above changes of Issue #1544, the time taken by ...
08/27/2021
- 04:43 AM Task #1561 (Closed): Modification in tagging_utils.py such that doc_ids with sim_score equal to the kp_doc_id are not removed
- We recently made changes in the code that makes sure that if KP is linked to the same document in which it is tagged, th...
- 04:34 AM Task #1560 (Closed): Finding different POS tags that can be stripped from the beginning/ending of the keyphrase to result in better KPs
- Just ANP/IN POS tags, we need to identify other cases of POS tags that contribute to unwanted words in the beginning ...
08/25/2021
- 05:03 PM Feature #1557 (Resolved): Ensure some documents do not show up in purple link results
- Sometimes, we know that we do not want our annotation to point to sections like https://docs.python.org/3/howto/urlli...
08/24/2021
- 09:18 AM Feature #1544 (Closed): Changes to match a token/word with hyphen with a header variant which does not have a hyphen but is exactly same
- This is a new feature we would like to add to our pipeline. We have observed a case where a token in the Whirlwind te...
- 06:57 AM Task #1543 (Closed): Refactor the save_candidates function from BR3_IR3_tagger.py
- In this task, we need to refactor a small portion of code from the "save_candidates" function from BR3_IR3_tagger.py....
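The matching idea in Feature #1544 above can be sketched as follows. The sketch assumes that "exactly same" means the header variant equals the token once its hyphens are either replaced by spaces or removed, and that comparison is case-insensitive. The helper name `matches_ignoring_hyphen` is hypothetical and is not part of get_candidates_for_variant.

```python
def matches_ignoring_hyphen(token, header_variant):
    """True if header_variant equals token, allowing for the token's hyphens.

    A hyphen in the token may correspond to a space or to nothing in the
    header variant (both are assumptions about the intended behaviour).
    """
    t = token.lower()
    h = header_variant.lower()
    candidates = {t, t.replace("-", " "), t.replace("-", "")}
    return h in candidates
```

Building the candidate forms once per token, instead of normalising every header variant on each comparison, is one way to keep the extra matching cheap.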
08/20/2021
- 08:04 AM Task #1521 (Rejected): Changes to ensure that singular and plural forms of key phrases are also checked in 20K CW in extract_single_uncommon_words method
- 08:03 AM Task #1521 (Rejected): Changes to ensure that singular and plural forms of key phrases are also checked in 20K CW in extract_single_uncommon_words method
- In *extract_single_uncommon_words* from *BR3_IR3_tagger.py*, all the single word keyphrases pass through the 20K comm...
- 08:04 AM Task #1522 (Closed): Changes to ensure that singular and plural forms of key phrases are also checked in 20K CW in extract_single_uncommon_words method
- In *extract_single_uncommon_words* from *BR3_IR3_tagger.py*, all the single word keyphrases pass through the 20K comm...
- 05:49 AM Bug #1520 (Closed): Instead of removing entire key phrase starting with IN pos tag for all cases, we can process the keyphrase and discard just the word
- This is an experiment that can be conducted to identify the right course of action for this task. While the KPs start...
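A minimal sketch of the alternative proposed in Bug #1520: drop only the leading word tagged "IN" and keep the rest of the key phrase. It assumes the KP is already available as (word, POS tag) pairs from whatever tagger the pipeline uses; the helper name `strip_leading_pos` is hypothetical.

```python
def strip_leading_pos(tagged_kp, drop_tags=("IN",)):
    """Remove leading words whose POS tag is in drop_tags.

    tagged_kp: sequence of (word, tag) pairs (assumed shape).
    At least one word is always kept, so the KP never becomes empty.
    """
    tokens = list(tagged_kp)
    while len(tokens) > 1 and tokens[0][1] in drop_tags:
        tokens.pop(0)
    return " ".join(word for word, _ in tokens)
```

For example, `strip_leading_pos([("within", "IN"), ("regular", "JJ"), ("expressions", "NNS")])` yields "regular expressions", the header the KP in Bug #1514 was linked to.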
08/19/2021
- 12:59 PM Bug #1515 (Closed): Bug in addition of P's in KP tagging
- In the whirlwind book, a token "range(5)" was tagged as BR2P. As a result of which there is a very ugly tagging of th...
- 12:49 PM Bug #1514 (Resolved): Removing key phrases which are starting with IN pos tag or a preposition
- There are some cases of key phrases such as:
1. "within regular expressions" linked to "regular expressions"
2....
- 06:50 AM Bug #1506: Analyse & Discard single-word KPs with "ADV" POS tag
- URL: https://edutestdev-240612.appspot.com/document/python-whirlwind-tour/m?documentURL=10054%2Fds9aug1528%2FWhirlwin...
- 06:49 AM Bug #1507: Bad looking key phrases following the pattern: "word1, word2" while the header variant is "word1 word2"
- In Python Whirlwind Tour.txt, we are seeing some tagged KPs such as "python, code" linked to the "How to run Python C...
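The pattern in Bug #1507 ("word1, word2" tagged against the header variant "word1 word2") could be detected with a check like the one below. This is only a sketch: the helper name `is_comma_split_variant` is hypothetical, comparison is assumed to be case-insensitive, and what happens to a flagged KP (discarding it or lowering its score) is left to the caller.

```python
def is_comma_split_variant(kp, header_variant):
    """True if kp contains a comma, header_variant does not, and the two
    are otherwise made of the same words in the same order."""
    if "," not in kp or "," in header_variant:
        return False
    kp_words = kp.replace(",", " ").lower().split()
    return kp_words == header_variant.lower().split()
```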
08/18/2021
- 03:02 PM Bug #1507 (Closed): Bad looking key phrases following the pattern: "word1, word2" while the header variant is "word1 word2"
- In Python Whirlwind Tour.txt, we are seeing some tagged KPs such as "python, code" linked to the "How to run Python C...
- 02:53 PM Bug #1506 (Closed): Analyse & Discard single-word KPs with "ADV" POS tag
- In annotated "Python Whirlwind Tour.txt", we have seen some terrible looking tagged keyphrases such as: generally, fu...
- 02:30 PM Task #1505 (Rejected): Sample task
- Sample task