-
Notifications
You must be signed in to change notification settings - Fork 0
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
VN missing pages, continued #76
Comments
https://sanskrit-lexicon.uni-koeln.de/pwgindex.html alleges to be a scanned edition of PWG, Compare the volume 1 VN material from this cdsl source to They are very different. How to account for this difference? If we want to improve the VN coding in cdsl pwg.txt, which of the two sources should be used? Although AB does not divulge the exact source of his pdfs, perhaps he could retrieve some information from the title pages that would explain the difference. |
@maltenth has to respond about the scans at CDSL! And it is my lookout to reach to the best possible original sources (either scans or physical books) that enhances my collection, as a conitinuous process. So far as I am concerned, the text in the pwgheader (of older date) is exactly what is in the print volumes; and I just had proofed the same (and at times split some matter into separate lines) and posted earlier. Jim could probably start with converting the PWG-VN data in Thomas's original format to current CDSL format. |
This was 2002 or 2004, I received them on a CD from Germany in Moscow. |
This post indicates some of my PWG "sources".
If the Vol.1 does not contain any VN matter, how would Jim (and/or Thomas) explain the presence of the typed matter from those pages in the pwgheader file (that was received from Thomas)? |
Unfortunately, communication by me with Thomas has become unpredictable this year. |
Just now, I compared They are identical. From this I infer that the material in pwgheader/PWG.V.1.VN.pages.pdf is simply absent from the pwgindex images. Why absent -- no way to know. I have revised pwgindex program to include
|
Based on recent review (see revised readme_cdsl_vn.txt),
The VNTXT file has 599 lines 'to convert'. 131 of these are, following the printed text, without headwords. The first examples:
AB: Have you already determined these missing headwords? |
No, Jim; you may refer to my related post. |
If I am to do it, I might wish to re-look at the whole content for a possible 'revision'!! |
It may be noted that the form |
I think I have now properly changed these PWGVN lines, to the format as in the pwkvn pages. There are couple of places (the lines having |
|
...do... denotes that the VN line belongs to the same HW as above! |
And you may note that the |
Note: |
yes; in fact it should be referring to Probably these should be checked again all over for the homonyms and accent marks (which I had missed at some places), after you prepare the file. |
The "actual" reason I had in mind is not about the VN part in the Cologne-scan on Sp.1677-8 (which is present in PWG7 as well), but that many entries in the Bavarian copy do not "appear" anywhere else, incl. the CDSL text. |
transcodingThe transcoding to slp1 (from vntext_0_deva.txt to vntxt_0.txt) required a few edits of vntext_0_deva. See change_vntxt_0_deva.txt. |
vntxt_1.txtCorrect pwg-devanagari accents that were missed in vntxt_0.txt. |
From an examination of these 21 headwords with current PWG display:
I see no problem (and some minor benefit) in KEEPING lines 553-573, since this material corresponds to the It is mysterious that the Bavarian edition (per scan above)
BTW: it is good that you have not only filled in headwords, but also added page-references for corrections. |
long live @Andhrabharati |
[from Jim's file: pwgissues/issue76/readme.txt]
It is quite surprising to see that Jim has copied MW's transcoder files to "handle" the PWG transcoding, and had to "heavily edit" the same for the purpose!! Probably (a) MW is fully overshadowing Jim's thoughts, or (b) Jim is also now entering into "dotage" as Thomas, who himself said thus in response to one of my points earlier. Jim has a separate "transcoder file-set" for the PWG family from the very initial days (which he had updated for the devanagari accent, upon some prolonged debating with me); and the same should've been used here. Otherwise, it leads to unnecessary contamination of MW-style and PWG-style of accents, as can be seen from the below snippets from the PWG print and Jim's current revision-- [from Jim's file: change_vntxt_0_deva.txt] |
[from AB's file: PWGVN_1-6_reformatted_(dng).txt]
[from Jim's file: change_vntxt_0_deva.txt]
Incidentally, I had discussed about this very item with @drdhaval2785 in private mail exactly 3 years back! |
It's because Jim is following the slp1 from Peter Schraf, who had duly made a note of this particular point in "his study/survey" (before coming up with slp1)-- but for some reason, did not even "try" to propose any solution! So it is not just slp1 alone that doesn't handle this, but also (any and) every existing Roman transliteration scheme! If Jim is "willing" to "update" the CDSL transcoding rules (as he had done in quite many cases till now), I shall post my proposal to handle the same (with which the invertibility condition also gets satisfied). Probably, Jim might wish to get Peter Schraf's opinion also about that proposal (before taking any action on it). |
In fact, I would consider it to be exactly opposite that the CDSL scan is THE mystery case! As I had already indicated earlier, both the Bavarian Library scan (1868) and the Japanese reprint (1976) tally exactly with each other, so does any physical book that I had seen in various Indian libraries (or in market now for sale). Now I have found a scan copy digitised by Google [from the Sapienza University of Rome (Biblioteca di Studi Orientali)] in August 2013, which has both the "proper ending page" of Bavarian copy followed by the "extraneous page" of the CDSL scan (after a blank page). This is somewhat similar to what we had seen earlier in one of the MW99 scans having two of MD errata pages, about which some discussion has took place, and finally it was concluded that it was an error in binding that particular copy and those two pages were NOT brought into the MW annexure data. It is surprising that the CDSL scan copy has the "original" ending page (as in all the three above scan copies) MISSING and is left only with the dubious "extraneous" page. |
No, The reason I used the mw transcoders was that I had available I'm constructing deva1_slp1.xml now. |
vntxt_1_rev.txtThe inverse transcoder file deva1_slp1.xml now created. I should have done that in the first place. Jim thinks that vntxt_1_rev.txt is ready for further use.
I'm curious what such an update would look like. Let's see the proposed transcoder file. |
First things first! Against Jim's two posts 1 and 2 just above this, I would like to re-iterate from AB's post:
Here are the transcoders I have with me (as recd from Jim)-- [MW-version, which has no "deva1_slp1.xml" indeed] [pw-version, which DOES have the "deva1_slp1.xml"] And he had clearly said those days that the deva1 <> slp1 files were specifically made for the pw-family!! He had also indicated how to check the intertibility using the "to & fro transcoders" one after the other. I can as well show (point) him where he has posted these transcoders (for me) earlier, if he is still not convinced that these were already existing before! |
As I has mentioned in my mail to Dhaval (in the above post), a need to transcoding the vowel-marker (mAtrA) characters arises not only in case of grammar books, as in or in reference works, as in [Monier Williams dictionary] or in posters, as at [Marcis's post](#37 (comment)) Of course, for most of such works that go to actual publishing, other 'professional means' would be resorted to (and not these Roman transcoding schemes) for the intended text matter appropriately!! [... post continues further below ...] |
@Andhrabharati I can hardly imagine a case other than textbook for having the need to seperate the vowel representation. |
…ior version. The current transcodepwg/pwgtranscoder2/deva1_slp1.xml is slightly better. Use it
deva1 comparison@Andhrabharati After your comment, I was able to find a deva1_slp1.xml from 2023. Some details on comparison to the 2023 version are in readme_deva1.txt. |
agree @Andhrabharati |
question on
|
Also, I think there are two more at the beginning of the list
|
@funderburkjim I contintue to upload my new scans of Sanskrit dictionaries, do not know if better than what you have or not https://vk.com/samskrtamru?w=wall-88831040_22648 |
vn missing installedWork files are here. vntxt_4.txt contains the new entries. These entries have been inserted into pwg.txt, and csl-orig updated. I think the goals of this issue have been satisfied. Next step for me: changes to pwg.txt that were noticed during this missing VN work. Will detail these proposed changes after AB review of this vn work . |
Continue the discussion of VN (additions and improvements) for PWG, that was begun in #39.
The text was updated successfully, but these errors were encountered: