Quote attributions to Character Ids #3
base: master
Commits on Jul 6, 2016
-
Added new function setCharacterIds
New function setCharacterIDs contains code copy-pasted
Commit a99b298
-
Added new function setCharacterIds
New function setCharacterIds contains code moved (cut and pasted) from PrintUtils.PrintTokens. It sets the characterId of each token beforehand, which makes the Ids easier to access and results in more accurate tokens during processing. Previously, the characterIds were set only during printing, leaving them inaccessible until that point.
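To illustrate the idea of setting Ids ahead of printing, here is a minimal sketch. It is not the code from this commit: the `Token` and `Mention` classes, their fields, and the inclusive token-span convention are all assumptions made for the example, and BookNLP's own classes may differ.

```java
import java.util.ArrayList;
import java.util.List;

class CharacterIdSketch {
    /** Hypothetical token type; -1 means "no character assigned". */
    static final class Token {
        final String word;
        int characterId = -1;
        Token(String word) { this.word = word; }
    }

    /** Hypothetical resolved mention covering tokens start..end (inclusive). */
    static final class Mention {
        final int start;
        final int end;
        final int characterId;
        Mention(int start, int end, int characterId) {
            this.start = start;
            this.end = end;
            this.characterId = characterId;
        }
    }

    /** Copies each mention's character id onto the tokens it covers, before any printing. */
    static void setCharacterIds(List<Token> tokens, List<Mention> mentions) {
        for (Mention m : mentions) {
            for (int i = Math.max(0, m.start); i <= m.end && i < tokens.size(); i++) {
                tokens.get(i).characterId = m.characterId;
            }
        }
    }

    public static void main(String[] args) {
        List<Token> tokens = new ArrayList<>();
        for (String w : new String[] {"Tom", "said", "he", "left"}) {
            tokens.add(new Token(w));
        }
        List<Mention> mentions = new ArrayList<>();
        mentions.add(new Mention(0, 0, 7)); // "Tom"
        mentions.add(new Mention(2, 2, 7)); // "he", resolved to the same character
        setCharacterIds(tokens, mentions);
        for (Token t : tokens) {
            System.out.println(t.word + "\t" + t.characterId);
        }
    }
}
```

Once the Ids live on the tokens, later stages such as quote attribution can read them without going through the printing code first.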
Commit ac7cbd3
-
Added extra attributes to print in PrintQuotes
Added the sentenceID and the attributed Character Id to each quote that is printed. This makes it convenient to attribute quotes to characters even when the speakers are referred to only by pronouns such as 'he' and 'she'.
Commit 807ec7a
-
Calling new function setCharacterIds
Calls the new function setCharacterIds, which sets character Ids beforehand rather than only during printing.
Commit 4fd9ccf
-
Commit 228fe39
Commits on Jul 7, 2016
-
Commit 7b259e6
-
Added new function to attribute quotes.
The added code assumes that, in each 'para' containing multiple quotes, all of those quotes are spoken by the first speaker. The exception is when the parser itself makes an error.
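The rule can be sketched roughly as follows. This is an illustration only, not the code from this commit: the `Quote` class, the use of -1 for "no speaker", and the choice to leave a paragraph untouched when its first quote has no speaker are all assumptions. Quotes are assumed to arrive in document order.

```java
import java.util.ArrayList;
import java.util.List;

class ParagraphAttributionSketch {
    /** Hypothetical quote record; not BookNLP's own quote class. */
    static final class Quote {
        final int paragraphId;
        int characterId; // -1 means no speaker was attributed
        Quote(int paragraphId, int characterId) {
            this.paragraphId = paragraphId;
            this.characterId = characterId;
        }
    }

    /** Gives every quote in a paragraph the speaker of that paragraph's first quote. */
    static void propagateFirstSpeaker(List<Quote> quotes) {
        boolean seenAny = false;
        int currentParagraph = 0;
        int firstSpeaker = -1;
        for (Quote q : quotes) {
            if (!seenAny || q.paragraphId != currentParagraph) {
                // First quote of a new paragraph: remember its speaker.
                seenAny = true;
                currentParagraph = q.paragraphId;
                firstSpeaker = q.characterId;
            } else if (firstSpeaker != -1) {
                // Later quote in the same paragraph: reuse the first speaker.
                q.characterId = firstSpeaker;
            }
        }
    }

    public static void main(String[] args) {
        List<Quote> quotes = new ArrayList<>();
        quotes.add(new Quote(0, 3));
        quotes.add(new Quote(0, 5));
        quotes.add(new Quote(1, 4));
        propagateFirstSpeaker(quotes);
        for (Quote q : quotes) {
            System.out.println(q.paragraphId + "\t" + q.characterId);
        }
    }
}
```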
Commit cf1ed82
-
Switched positions of functions
Switched the positions of the calls to printWithLinksAndCorefAndQuotes and dumpForAnnotation, as the former uses information from the latter.
Commit a9aa15d
-
Commit fdf6eaa
-
Commit 1496f32
Commits on Jul 28, 2016
-
Commit 8efec25
-
Commit 2870f3a
-
Changed quote attributed name extraction method
Used an alternate method that does not rely on book.animateEntities for extracting phrase names. This was done mostly because of the changes added in the quote attribution method, which required a quote-attributed name to be valid only if it came from a phrase put into animateEntities. The improvements I made to the quote-attribution program stood in contradiction with this, as the name I extracted for quote attribution did not always stay in animateEntities. I could have added the particular 'phrase' containing the name to animateEntities instead, but I decided to sidestep the requirement itself, as I did not want to meddle with the code for extracting phrases unnecessarily. The code I wrote would add phrases that the phrase-generating code did not deem legitimate to add. (I don't know if 'requirement' is the correct word. I saw the code a few weeks ago, and all I can say is that all the quote-attributed names happen to be from phrases in animateEntities. I don't remember whether the name dictates whether the phrase is added or the other way round.) As my program does not need this requirement anyway, I skipped it and used an alternate method that requires the printHTML option to be processed first, as can be seen in my changes to BookNLP.java.
Commit 0738987
-
Commit 0529f49
-
Commit c0f5525
-
Commit 260803e
-
Commit 640d27c
-
Added new conditions and new feature
Added a condition for ner being 'PERSON', as well as a new feature for the same, isPerson.
Commit 307434f
-
Commit ec8aed8
-
Added new conditions and code to attribute quotes
Includes checking whether a quote is in the same para as the last quote and, if so, assigning the last quote's attribution to it.
Commit 3e33f76
-
Changed parser and added condition for tokenizing
Now using Stanford CoreNLP (latest version) for parsing, including dependency parsing (formerly done by MaltParser), with the 'depparse' option for faster parsing using neural networks. This uses Universal Dependencies rather than Stanford dependencies, as CoreNLP itself does. Universal Dependencies, however, sometimes produce dependency graphs with loops in them, because multiple relations are found. This is dealt with here by choosing the best link and extracting a tree from the graph.
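One way to reduce such a graph to a tree is sketched below. It is not the code from this commit, and it does not call CoreNLP: the `Edge` class, the numeric `score` used to rank links, and the greedy strategy are all assumptions for illustration. The sketch keeps at most one head per token, takes candidate links from best to worst, and skips any link that would close a loop. Tokens left without a head attach to an artificial root (-1). Being greedy, it is not guaranteed to find the best-scoring tree overall.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Comparator;
import java.util.List;

class DependencyTreeSketch {
    static final int ROOT = -1;

    /** Hypothetical candidate link: head -> dependent with a preference score. */
    static final class Edge {
        final int head;
        final int dependent;
        final double score;
        Edge(int head, int dependent, double score) {
            this.head = head;
            this.dependent = dependent;
            this.score = score;
        }
    }

    /** Returns heads[i] = chosen head of token i, or ROOT if none was kept. */
    static int[] toTree(int tokenCount, List<Edge> candidates) {
        int[] heads = new int[tokenCount];
        Arrays.fill(heads, ROOT);
        boolean[] assigned = new boolean[tokenCount];

        List<Edge> sorted = new ArrayList<>(candidates);
        sorted.sort(Comparator.comparingDouble((Edge e) -> e.score).reversed());

        for (Edge e : sorted) {
            if (assigned[e.dependent]) continue;                     // already has a better head
            if (createsCycle(heads, e.head, e.dependent)) continue;  // would close a loop
            heads[e.dependent] = e.head;
            assigned[e.dependent] = true;
        }
        return heads;
    }

    /** True if 'dependent' is 'head' itself or one of its current ancestors. */
    private static boolean createsCycle(int[] heads, int head, int dependent) {
        for (int cur = head; cur != ROOT; cur = heads[cur]) {
            if (cur == dependent) return true;
        }
        return false;
    }

    public static void main(String[] args) {
        List<Edge> edges = Arrays.asList(
            new Edge(0, 1, 0.9),
            new Edge(1, 0, 0.8),    // would form a loop with 0 -> 1
            new Edge(1, 2, 0.7),
            new Edge(0, 2, 0.6),    // second head for token 2
            new Edge(ROOT, 0, 0.5));
        System.out.println(Arrays.toString(toTree(3, edges)));
    }
}
```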
Commit aff1d99
-
New weights generated as a result of adding new feature 'isPerson'
Commit 334e490
-
Commit b2ab929
-
Commit 1600e84
-
Commit 39bf84d
-
Commit 801940d
-
Commit 3c68efc