Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Quote attributions to Character Ids #3

Open
wants to merge 27 commits into
base: master
Choose a base branch
from

Commits on Jul 6, 2016

  1. Added new function setCharacterIds

    New function setCharacterIDs contains code copy-pasted
    NikhilPr95 authored Jul 6, 2016
    Configuration menu
    Copy the full SHA
    a99b298 View commit details
    Browse the repository at this point in the history
  2. Added new function setCharacterIds

    New function setCharacterIDs contains code cut-pasted from PrintUtils.PrintTokens. This sets the characterIds of each token beforehand making the Ids easier to access as well as resulting in more accurate tokens during processing. Originally, the characterIds are set only during printing, leaving them inaccessible until that point.
    NikhilPr95 authored Jul 6, 2016
    Configuration menu
    Copy the full SHA
    ac7cbd3 View commit details
    Browse the repository at this point in the history
  3. Added extra attributes to print in PrintQuotes

    Added sentenceID as well as attributed Character Id for each quote that is printed. This makes it convenient to attribute quotes to characters even when the speakers are referred to only by pronouns such as 'he' and 'she'.
    NikhilPr95 authored Jul 6, 2016
    Configuration menu
    Copy the full SHA
    807ec7a View commit details
    Browse the repository at this point in the history
  4. Calling new function setCharacterIds

    Calling new function setCharacterIds that sets character Ids beforehand rather than during printing only.
    NikhilPr95 authored Jul 6, 2016
    Configuration menu
    Copy the full SHA
    4fd9ccf View commit details
    Browse the repository at this point in the history
  5. Update PrintUtil.java

    NikhilPr95 authored Jul 6, 2016
    Configuration menu
    Copy the full SHA
    228fe39 View commit details
    Browse the repository at this point in the history

Commits on Jul 7, 2016

  1. Configuration menu
    Copy the full SHA
    7b259e6 View commit details
    Browse the repository at this point in the history
  2. Added new function to attribute quotes.

    In each 'para' containing multiple quotes, all said quotes are spoken by the first speaker, except when an error is made by the parser itself. The code added reflects this change.
    NikhilPr95 authored Jul 7, 2016
    Configuration menu
    Copy the full SHA
    cf1ed82 View commit details
    Browse the repository at this point in the history
  3. Switched positions of functions

    switched positions of the calls of printWithLinksAndCorefAndQuotes and dumpForAnnotation, as the former uses information from the latter
    NikhilPr95 authored Jul 7, 2016
    Configuration menu
    Copy the full SHA
    a9aa15d View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    fdf6eaa View commit details
    Browse the repository at this point in the history
  5. Updated PrintUtil

    NikhilPr95 authored Jul 7, 2016
    Configuration menu
    Copy the full SHA
    1496f32 View commit details
    Browse the repository at this point in the history

Commits on Jul 28, 2016

  1. Update BookNLP.java

    NikhilPr95 authored Jul 28, 2016
    Configuration menu
    Copy the full SHA
    8efec25 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    2870f3a View commit details
    Browse the repository at this point in the history
  3. Changed quote attributed name extraction method

    Used alternate method that does not rely on book.animateEntities for extracting phrase names. This was mostly done due to the changes added in the quote attribution method that required a quote-attributed name to be valid only if it came from a phrase put into animateEntities. The  improvements I made to the quote-attribution program stood in contradiciton with this, as the name I extracted for quote-attribution did not always stay in animateEntities. I could have added that particular 'phrase' containing the name to animateEntities instead, but I decided to subvert the requirement itself as I did not want to meddle with the code for extracting phrases unnecessarily. The code I wrote would add phrases which the phrase-generating code did not deem as legitimate to add. As my program does not need this requirement anyway (I don't know if requirement is the correct word - I saw the code few weeks ago, and all I can say is that all the quote-attributed names happen to be from phrases in animateEntities - I don't remember whether the name dictates whether the phrase is added or the other way round), I just skipped it and used an alternate method that requires the printHTML option to be processed first as can be seen in my changes to BookNLP.java
    NikhilPr95 authored Jul 28, 2016
    Configuration menu
    Copy the full SHA
    0738987 View commit details
    Browse the repository at this point in the history
  4. Update NP.java

    NikhilPr95 authored Jul 28, 2016
    Configuration menu
    Copy the full SHA
    0529f49 View commit details
    Browse the repository at this point in the history
  5. Configuration menu
    Copy the full SHA
    c0f5525 View commit details
    Browse the repository at this point in the history
  6. Configuration menu
    Copy the full SHA
    260803e View commit details
    Browse the repository at this point in the history
  7. Configuration menu
    Copy the full SHA
    640d27c View commit details
    Browse the repository at this point in the history
  8. Added new conditions and new feature

    Added condition for ner being 'PERSON' as well as a new feature for the same, isPerson
    NikhilPr95 authored Jul 28, 2016
    Configuration menu
    Copy the full SHA
    307434f View commit details
    Browse the repository at this point in the history
  9. Configuration menu
    Copy the full SHA
    ec8aed8 View commit details
    Browse the repository at this point in the history
  10. Added new conditions and code to attribute quotes

    Includes checking whether the quote is in same para as last quote, and assigning the former attribution to the latter
    NikhilPr95 authored Jul 28, 2016
    Configuration menu
    Copy the full SHA
    3e33f76 View commit details
    Browse the repository at this point in the history
  11. Changed parser and added condition for tokenizing

    Now using Stanford CoreNLP (latest version) for parsing including dependency parsing (formerly done by MaltParser) including option 'depparse' for faster parsing using neural networks. This uses Universal Dependencies rather than Stanford dependencies as the CoreNLP does itself. Universal dependencies, however, create trees which sometimes have loops in them for dependency trees because of multiple found relations. This is dealt with here by choosing the best link, and tree from a graph.
    NikhilPr95 authored Jul 28, 2016
    Configuration menu
    Copy the full SHA
    aff1d99 View commit details
    Browse the repository at this point in the history
  12. New coref weights

    New weights generated as a result of adding new feature 'isPerson'
    NikhilPr95 authored Jul 28, 2016
    Configuration menu
    Copy the full SHA
    334e490 View commit details
    Browse the repository at this point in the history
  13. Updated with latest files

    NikhilPr95 authored Jul 28, 2016
    Configuration menu
    Copy the full SHA
    b2ab929 View commit details
    Browse the repository at this point in the history
  14. Configuration menu
    Copy the full SHA
    1600e84 View commit details
    Browse the repository at this point in the history
  15. Configuration menu
    Copy the full SHA
    39bf84d View commit details
    Browse the repository at this point in the history
  16. Updated for new CoreNLP

    NikhilPr95 authored Jul 28, 2016
    Configuration menu
    Copy the full SHA
    801940d View commit details
    Browse the repository at this point in the history
  17. New coref weights due to new feature

    New feature isPerson added
    NikhilPr95 authored Jul 28, 2016
    Configuration menu
    Copy the full SHA
    3c68efc View commit details
    Browse the repository at this point in the history