Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

HarivaMSa links via MBH Calcutta #49

Open
funderburkjim opened this issue Jan 28, 2022 · 61 comments
Open

HarivaMSa links via MBH Calcutta #49

funderburkjim opened this issue Jan 28, 2022 · 61 comments

Comments

@funderburkjim
Copy link
Contributor

Since the Mahabharata Calcutta edition contains the HarivaMSa as a 19th parvan, it seems likely that PWG links to HarivaMsa can be resolved in a manner similar to that used for references to the first 18 parva (#48).

The last page of mbhcalc for 18th parvan is 443.
Page 444 is a blank page.

The pages of mbhcalc devoted to Harivamsha are from
internal page number 445
image

thru internal page number 1007
image

The

@Andhrabharati
Copy link

Andhrabharati commented Jan 28, 2022

I would suggest using the separately printed Harivamsa volume, instead of this continuation in Vol. 4 of MBh.
[Guess no need to give the reason (which indeed does exist)!]

sanskrit-lexicon/COLOGNE#371 (comment)

@funderburkjim
Copy link
Contributor Author

Reality check

11263 matches in 11255 lines for "<ls>HARIV" in buffer: pwg.txt and another 350 or so HARIV improperly marked.

One instance is <ls>HARIV. 708.</ls> occurring under headword akfSASva.

If verse 1 occurs on page 445 and there are about 30 verses per page, then verse 708 should occur about 708/30 = 24 pages after 445, or on page 469. And in fact verse 708 does occur on page 469: (478 is the external page number in volume 4 pdf).
mbh_calc_4 478.pdf

and we see that our word akfSASva अकृशाश्व is found in the line for verse 708:

image

Based on this example, we can conclude that the verse references in mbhcalc for parvan 19 are consistent with the
pwg verse references with HARIV.

So, mbhcalc can serve as link target for source HARIV in pwg.

@funderburkjim
Copy link
Contributor Author

@Andhrabharati Yes, please give your reason for preferring another edition. At this stage, it would be easier to use mbh-calcutta, so would want switching to another print version to be worth the trouble.

Also, have you checked the verse-numbering compatibility with pwg?

@funderburkjim
Copy link
Contributor Author

The Other Haribansa

Have downloaded and looked at the other version suggested above.
To my eye, this 'haribansa' pdf looks virtually identical to the 19th parvan in mbhcalc. The biggest difference noticed
is the internal page number. For comparison, here is the page from haribansa containing the verse 708. It can be
compared to the pdf mbh_calc_4 478.pdf above.

haribansa 36.pdf

@gasyoun
Copy link
Member

gasyoun commented Jan 28, 2022

[Guess no need to give the reason (which indeed does exist)!]

Guessed wrongly.

To my eye, this 'haribansa' pdf looks virtually identical to the 19th parvan in mbhcalc.

If such is the case, why bother @Andhrabharati ?

@funderburkjim
Copy link
Contributor Author

Since the two pdfs are so similar, I can base instructions for @KateRusse on the haribansa version preferred by @Andhrabharati, and will thus proceed. Instructions will be developed soon.

@Andhrabharati
Copy link

Andhrabharati commented Jan 29, 2022

Whatever my reason is, the PWG refers to the 4th Vol. of the Calc. ed. MBh. itself!

¤Hariv. = ¤Harivam̃śa im 4ten Bande des ¤MBh. Aus diesem Werke haben wir die Nomina propria mit Benutzung des Index in der ¤Langlois'-schen Uebersetzung (¤Gild. Bibl. 122) aufgenommen.

And the pwk mentions thus-
¤Hariv. = ¤Harivam̃śa. Mit einer Zahl die ältere Calc. Ausg. gemeint, mit drei Zahlen die neuere lithographirte.
[The Lithographed ed. is nothing but the Bomb. ed.]

@gasyoun
Copy link
Member

gasyoun commented Jan 29, 2022

[The Lithographed ed. is nothing but the Bomb. ed.]

Wow, what a research. Yes, please advise @KateRusse.

@funderburkjim
Copy link
Contributor Author

@Andhrabharati Is your position now that we should use images from the 4th volume of MBHcalc?

@Andhrabharati
Copy link

No, I was just mentioning what PWG said.

I still go with my original suggestion, which spans all across the Sanskrit Literature.

@funderburkjim
Copy link
Contributor Author

OK. But I am still curious why you prefer the separate Haribansa pdfs.

One difference I can see is that the size of each pdf page of Haribans is about 1MB, whereas the size of each pdf page in 4th volume of mbhcalc is about half that (0.5MB).

@gasyoun
Copy link
Member

gasyoun commented Jan 29, 2022

Haribans is about 1MB

Size is not an issue and does not speak about the quality of the scan as well. It's only the level of compression.

@funderburkjim
Copy link
Contributor Author

index instructions: get pdf

@gasyoun @KateRusse Here are instructions for creating an index of the pages in Harivansa.
This follows the model of the Index file created by @Andhrabharati for the Mahabharata calcutta edition.

Get pdf, option 1

Get Haribansa download at https://opacplus.bsb-muenchen.de/Vta2/bsb10219661/bsb:BV001652965?page=11

This reference gives screenshots that will be helpful in actually getting the pdf downloaded. The size is about 600MB.

Then you can view the pdf locally with a browser or your favorite pdf viewer.

Get pdf, option 2

You can view the pdf pages one at a time in the browser.

The individual pages have been uploaded to a repository: https://github.com/sanskrit-lexicon-scans/hariv.
The pdfs of each page are in the pdfpages directory. If you click on one, it will be displayed.
For example, page 1 comes up at url https://github.com/sanskrit-lexicon-scans/hariv/blob/main/pdfpages/hariv_001.pdf.

If neither of above work for you, we'll find another way in comments in this issue.

@funderburkjim
Copy link
Contributor Author

Index instructions: the index file

The main task is to create a table of information, with one line in the table for each page of the pdf,
from page 1 to page 563. The format of this table will be the similar to the format used for the Mahabharata calcutta edition; here is mbhcalindex. The difference is that there is
no need for the Vol. (Volume) and Parva columns in Haribansa index.
So the hariv_index file you create will have columns

  • Page This is the page number.
  • Start The number of the first verse appearing on the page
  • End The number of the last verse appearing on the page
  • count the number of verses on the page.
  • comment Normally there is nothing unusual about the page, so this can be left blank.

There are a few subtleties in deciding what the Start, End, and Count fields should be.
Once you get started, we can discuss questions as they arise.

Your hariv_index file can be created as a text file or a spreadsheet file.
If a text file, then separate the fields either with a tab character or with a colon character.

Example of page 3

page 3 pdf
page = 3
start = 49
end = 77
count = 29

@funderburkjim
Copy link
Contributor Author

Not every line is a verse

In the page 3 example, the body of the page (i.e., excluding the top line containing the page number)
actually has 30 lines. But the 6th line (the one ending ॥ १ ॥) is not counted as a verse. That's why
'Count = 3' for page 3 index.

Question for @Andhrabharati : What is such a non-verse line?

Possible 'verse gaps'

In mbhcalc, there were several pages where there was a 'gap' in the verse numbering.
If you notice such a gap in a Harivansa page, please note this in a comment.

@KateRusse When you've done the index for the first few pages, upload your hariv_index file so I can review.

@Andhrabharati
Copy link

Question for @Andhrabharati : What is such a non-verse line?

They are called 'colophones' and considered unanimously by all literati, to be not a 'part' of the main text.

@Andhrabharati
Copy link

Andhrabharati commented Jan 30, 2022

Like to see how long KateRusse would take, to finish the task.

(I had done it in just about two hours.)

@gasyoun
Copy link
Member

gasyoun commented Jan 30, 2022

(I had done it in just about two hours.)

What do you mean done? She can do harder tasks, if this one is done, no need to redo, as there are no other tasks, requiring these skills @Andhrabharati

@funderburkjim
Copy link
Contributor Author

funderburkjim commented Jan 31, 2022

@KateRusse I hope you will undertake this indexing. Please let us know your intention in this regard.

@KateRusse
Copy link

KateRusse commented Jan 31, 2022

@KateRusse I hope you will undertake this indexing. Please let us know your intention in this regard.

Is there anything left to do? I can continue this work

@funderburkjim
Copy link
Contributor Author

@KateRusse I do not have an index for Harivansa. So construction of that index remains to be done.

@KateRusse
Copy link

KateRusse commented Jan 31, 2022

I have done an index for the first 20 pages. If everything is alright, I can continue.
Harivansha-1.txt

@gasyoun
Copy link
Member

gasyoun commented Jan 31, 2022

I have done an index for the first 20 pages

Perfect. I've sent you a piece of software for recording of how you do it, thanks.

@funderburkjim
Copy link
Contributor Author

@KateRusse I spot-checked several of the first 20 lines, and everything looks fine!
Ok to proceed.

@KateRusse
Copy link

Here is an index of 150 pages.

Harivansha-1.txt

@sanskrit-lexicon sanskrit-lexicon deleted a comment from KateRusse Feb 5, 2022
@gasyoun
Copy link
Member

gasyoun commented Feb 5, 2022

59 1697 1725 29 After the verse 1713 the line is not a verse, the given numeration goes wrong from this place.

and

144 4195 4224 30 One more mistake in the given numeration

@Andhrabharati agree?

funderburkjim added a commit to sanskrit-lexicon-scans/hariv that referenced this issue Feb 6, 2022
@Andhrabharati
Copy link

Andhrabharati commented Feb 7, 2022

@KateRusse
Just to give you an example, your file has v. 1756 as the starting verse in p. 61; thus when Jim makes the linking active, the entry words "jahnu" & "nIla" (both in SLP1) in PWG link to the p. 61 where the verse 1756 containing jahnu (or nIla) cannot be seen at all (it actually being in the prev. page). So, this should be marked as the ending verse of p. 60. Hope, you understand the necessity now.

It is the responsibility of us, the humans, to give correct data to the computer programs to work correctly; they just act as per the data provided to them.
[of course the AI is a different field altogether, and none here are into it, I guess!]
----------------------
BTW, @funderburkjim, I've just seen that the jahnu entry in PWG has NO link to the "Mbh. 1,3722. fgg.", but the following verses 12,1717. 13,202. 13,7680 are properly linked.

So you still need to work on some more MBh. links; I'm sure you would look for other types of such pending combinations with just this clue.

@KateRusse
Copy link

@KateRusse Just to give you an example, your file has v. 1756 as the starting verse in p. 61; thus when Jim makes the linking active, the entry words "jahnu" & "nIla" (both in SLP1) in PWG link to the p. 61 where the verse 1756 containing jahnu (or nIla) cannot be seen at all (it actually being in the prev. page). So, this should be marked as the ending verse of p. 60. Hope, you understand the necessity now.

Please look through my new file, this mistake is already corrected there.

@Andhrabharati
Copy link

Andhrabharati commented Feb 7, 2022

Yes, seen it already.

I was typing my above message, while you had updated your file and posted.

So my message actually is addressing to your Harivansha-1 file, not the revised Harivansha-2 file.

@KateRusse
Copy link

I have corrected it one more time: Harivansha-2.txt

@gasyoun
Copy link
Member

gasyoun commented Feb 7, 2022

[Of course, I don't have a least doubt that there would be ANYONE matching me in speed or understanding things.]

Yes, we can't beat you.

There is no shortage for the pdf-linkable targets across the CDSL dictionaries; so more the 'skilled people', faster would be the work done.

Yes, for years to come.

Speaking of this, Jim might probably consider making a count of ls citations by "work name", like he has made a comparative list of verb (dhAtu) occurrences across the dictionaries, and every work occurring more than 5000 times (or may even be 3000) could be considered a worthy pdf-linkable target.

There was already such a list and the biggest link targets soon will be closed.

[of course the AI is a different field altogether, and none here are into it, I guess!]

Wrong guessing again - into AI since 1999.

@Andhrabharati
Copy link

Speaking of this, Jim might probably consider making a count of ls citations by "work name", like he has made a comparative list of verb (dhAtu) occurrences across the dictionaries, and every work occurring more than 5000 times (or may even be 3000) could be considered a worthy pdf-linkable target.

There was already such a list and the biggest link targets soon will be closed.

@gasyoun
could you get me the link to this list, so that I may help identifying the 'sources' to link?

funderburkjim added a commit to sanskrit-lexicon/csl-websanlexicon that referenced this issue Feb 8, 2022
funderburkjim added a commit to sanskrit-lexicon/csl-apidev that referenced this issue Feb 8, 2022
funderburkjim added a commit to sanskrit-lexicon/csl-orig that referenced this issue Feb 8, 2022
funderburkjim added a commit to sanskrit-lexicon/csl-pywork that referenced this issue Feb 8, 2022
@funderburkjim
Copy link
Contributor Author

Improvements made to PWG HARIV links

This work done in pwg_ls2/hariv folder.
Before the improvements, 9639 well-formed links to Harivamsa Calcutta edition were present in PWG.
At the end of the changes, 15595 such well-formed links were present.

Also, 26 links were identified as abnormal (see file change_abnormal.txt).
The changes made to markup appear in files change_01.txt and change_02.txt.

@funderburkjim
Copy link
Contributor Author

display program revision

The display program component (basicadjust.php) has been adjusted to provide active links to Harivamsa (see revisions to csl-websanlexicon and csl-apidev above).

The link target is currently https://sanskrit-lexicon-scans.github.io/hariv/.

These links are present for PWG, PW (with literary source abbreviation HARIV. and
for MW (with abbreviation Hariv.).

@funderburkjim
Copy link
Contributor Author

@Andhrabharati
Copy link

Andhrabharati commented Feb 9, 2022

Here is the "resolved" Hariv. abnormal cases file, for perusal-
PWG Hariv. abnormal cases.txt

My file has <1331. 5185. 10995> at the BAsvant entry, which would be properly resolved as a link.

On as second thought, the S. xxx citations could be linked as https://sanskrit-lexicon-scans.github.io/hariv/?xxx -- is this possible to do?

@gasyoun
Copy link
Member

gasyoun commented Feb 9, 2022

dsadasdsasad

gen. MBh. xv, 463 [C] inf. cyavitum), Mn. vii, 98 ; MBh. iii ; both are still missing @funderburkjim

@KateRusse
Copy link

400 pages Harivansha-2.txt

@Andhrabharati
Copy link

@KateRusse
Did you observe that 110xx block of verses is repeated at two places-- pp. 345-8 and pp. 376-9?
You need to mark them somehow, so that Jim would pay attention to it; otherwise the program may give wrong result, or even hang-up

@KateRusse
Copy link

KateRusse commented Feb 11, 2022

@KateRusse Did you observe that 110xx block of verses is repeated at two places-- pp. 345-8 and pp. 376-9? You need to mark them somehow, so that Jim would pay attention to it; otherwise the program may give wrong result, or even hang-up

Ok, I have marked them with asterixes (*). I have done till the end:
Harivansha-2.txt

@Andhrabharati
Copy link

Well done, @KateRusse.

You need to star mark pp. 409-13 also, where the verse numbers are 'reduced' by 11000, thus the range getting repeated.

@gasyoun
Copy link
Member

gasyoun commented Feb 12, 2022

I have done till the end

I give you my thanks. It took 12 days and after @Andhrabharati and @funderburkjim validates the data, the task can be called accomplished, thanks again Kate.

@KateRusse
Copy link

Well done, @KateRusse.

You need to star mark pp. 409-13 also, where the verse numbers are 'reduced' by 11000, thus the range getting repeated.

Ok, I have marked them also. Harivansha-2.txt

@Andhrabharati
Copy link

Jim,

pl. look at these '*' marked lines and do the needful, before you 'use' the full file.

funderburkjim added a commit to sanskrit-lexicon-scans/hariv that referenced this issue Feb 14, 2022
@funderburkjim
Copy link
Contributor Author

funderburkjim commented Feb 14, 2022

installed.

@KateRusse Your completed index to Harivansa now installed. THANK YOU.

Regarding the items marked with asterisks, I made changes so that the verses would
display the correct page, despite the misprint of verse numbers in the scanned image.
For details, see changes made to hariv_index.txt in https://github.com/sanskrit-lexicon-scans/hariv/tree/main/python.

For instance, without the change, verse 10150 would have shown page 345; but
with the change, it shows page 347, which is believed to be correct:
https://sanskrit-lexicon-scans.github.io/hariv/?10150.

@Andhrabharati
Copy link

Andhrabharati commented Feb 15, 2022

How about adding a note on those pdf pages reg. the wrong numberings (so that any user would be alerted while looking at those links)?

@gasyoun
Copy link
Member

gasyoun commented Feb 15, 2022

@KateRusse Your completed index to Harivansa now installed. THANK YOU.

Hurray!

despite the misprint of verse numbers in the scanned image.

And on the site they are documented where? https://github.com/sanskrit-lexicon-scans/hariv/tree/main/python I believe, but it will not be linked to from nowhere, right?

How about adding a note on those pdf pages reg. the wrong numberings (so that any user would be alerted while looking at those links)?

Makes sense.

@funderburkjim
Copy link
Contributor Author

@KateRusse Did you see the request at sanskrit-lexicon/PWK#83 (comment) ?

@gasyoun
Copy link
Member

gasyoun commented Mar 4, 2022

@KateRusse Did you see the request at sanskrit-lexicon/PWK#83 (comment) ?

Yap, she was ill, but will get soon.

@Andhrabharati
Copy link

I think this issue is closable now.

What do you say, @funderburkjim ?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

4 participants