BHĀGAVATAPURĀṆA link target #81

Open
funderburkjim opened this issue Nov 19, 2024 · 35 comments

Comments

@funderburkjim
Contributor

funderburkjim commented Nov 19, 2024

Suggest we make a link target for BHĀGAVATAPURĀṆA
18861 matches in 18837 lines for "<ls>BHĀG. P\. " in buffer: pwg.txt
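
The two figures above (total matches versus matching lines) can be reproduced outside the editor with a short script. This is a minimal sketch, assuming pwg.txt is UTF-8 text; the helper name `count_matches` is made up for illustration.

```python
import re

# Same pattern as the editor search above: the literal "<ls>BHĀG. P. " prefix.
# The first dot is left unescaped, exactly as in the original search string.
PATTERN = re.compile(r"<ls>BHĀG. P\. ")

def count_matches(lines):
    """Return (total matches, number of lines with at least one match)."""
    total = 0
    matching_lines = 0
    for line in lines:
        n = len(PATTERN.findall(line))
        if n:
            total += n
            matching_lines += 1
    return total, matching_lines

# Small in-memory sample; the first line is the example quoted in this issue,
# the second is an invented line with two references.
sample = [
    "{#suhfdvinazwi#} <ls>BHĀG. P. 3,1,21.</ls>",
    "<ls>BHĀG. P. 1,2,3.</ls> <ls>BHĀG. P. 4,5,6.</ls>",
    "no reference here",
]
print(count_matches(sample))
```

For the real file, one would pass an open file handle, e.g. `count_matches(open("pwg.txt", encoding="utf-8"))`.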

pdf source suggestion

The archive.org download looks to be of good quality.
Produced by MAHARISHI UNIVERSITY OF MANAGEMENT VEDIC LITERATURE COLLECTION.

Individual pages are NOT images, since the file is only 3.7MB for 1260 pages.

A couple of random checks suggest that the numbering of स्कन्ध, अध्याय, श्लोक agrees with that of PWG.
For example, under the PWG headword विनष्टि one finds {#suhfdvinazwi#} <ls>BHĀG. P. 3,1,21.</ls>
and सुहृद्विनष्टिं is found on page 111.

link target ui suggestion

Two possibilities seem relevant

I suspect either one could be used as a model for Bhagavata.

repository name

A repository in sanskrit-lexicon-scans will contain the pdfs and user-interface. Should the name of this repo be "bhagavata"? (The name will appear in links, e.g. sanskrit-lexicon-scans.github.io/bhagavata/?1,2,3)

repository name revised to bhagp_bur

Sample links:
https://sanskrit-lexicon-scans.github.io/bhagp_bur/app1/

https://sanskrit-lexicon-scans.github.io/bhagp_bur/app1/?2,4,6

https://sanskrit-lexicon-scans.github.io/bhagp_bur/app1/index.html?2,4,6

A link to the repository contents (for revisions, etc.) is
https://github.com/sanskrit-lexicon-scans/bhagp_bur
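
The sample links above carry the skandha, adhyaya and verse as a comma-separated query string. A minimal sketch of building and reading such links, assuming that format holds for all references; the function names `verse_link` and `parse_query` are illustrative, not part of the app.

```python
# Base URL taken from the sample links in this issue.
BASE_URL = "https://sanskrit-lexicon-scans.github.io/bhagp_bur/app1/"

def verse_link(skandha, adhyaya, verse, base_url=BASE_URL):
    """Build a link of the form <base>?skandha,adhyaya,verse."""
    return f"{base_url}?{skandha},{adhyaya},{verse}"

def parse_query(url):
    """Recover (skandha, adhyaya, verse) from the part after '?'."""
    query = url.partition("?")[2]
    return tuple(int(part) for part in query.split(","))

link = verse_link(2, 4, 6)
print(link)
print(parse_query(link))
```

`parse_query` also works on the `index.html?2,4,6` variant, since it only looks at the text after the question mark.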


Devanagari oddity

When one selects and copies some Devanagari text from the pdf, and then pastes into a text editor, the pasted text is NOT Devanagari. E.g. try
copying स्कन्धः from page 109 of the pdf; when pasted into Emacs text one sees SkN/" .
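
A quick way to tell whether text copied from a pdf is real Devanagari is to check the Unicode block of its characters. A minimal sketch; the helper name `devanagari_ratio` is made up for illustration.

```python
def devanagari_ratio(text):
    """Fraction of non-space characters in the Devanagari block (U+0900..U+097F)."""
    chars = [c for c in text if not c.isspace()]
    if not chars:
        return 0.0
    hits = sum(1 for c in chars if "\u0900" <= c <= "\u097f")
    return hits / len(chars)

print(devanagari_ratio("स्कन्धः"))  # text typed as Devanagari
print(devanagari_ratio('SkN/"'))    # what the copy-paste from the pdf yields
```

A ratio near zero for a page that visibly shows Devanagari suggests the pdf text layer cannot be used directly for search or extraction.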

@funderburkjim
Contributor Author

Three questions before I begin implementation:

  1. archive org source ok?
  2. which UI should be developed?
  3. 'bhagavata' ok for repository name?

Request others to provide feedback.

@Andhrabharati

Though there aren't many variants in the BHĀG. P. texts (in vogue), I'd just like to mention that PWG cited Burnouf's ed. as the primary source for the first 9 skandhas (~16k citations, including some from its French Introduction), and used the Bomb. ed. [Ganapat Krishnaji, Gujarati Press] for the last 3 skandhas [10-12] (~4k citations).

@funderburkjim
Contributor Author

burnouf bhagavata notes

  • volume 1 skandhas 1-3 (google books)

  • volume 2 skandhas 4-6 (archive org -- the link mentioned above)

  • volume 3 skandhas 7-9 (google books)

  • volume 4 skandha 10, first part (archive.org)

  • no source found for "Bomb. ed. [Ganapat Krishnaji, Gujarati Press"


Description of pwg and pw listings for "BHĀG. P." literary source:

PWG:
BHĀGAVATAPURĀṆA, nach Anführungen im VP. und ŚKDR. Von ऊ 
an sind die 9 von BURNOUF edirten Skandha (Gild. Bibl. 125) 
ausgebeutet worden.

Google translate:
According to quotes in VP. and ŚKDR. From ऊ 
onwards, the 9 Skandha (Gild. Bibl. 125) edited by BURNOUF have 
been exploited.

PW:
BHĀGAVATAPURĀṆA. Die 9 ersten Skandha nach BURNOUF's. 
Ausgabe, wenn nicht ausdrücklich. ed. Bomb. hinzugefügt wird; 
die letzten Skandha nach der ed. Bomb. In Klammern 
eingeschlossene Zahlen verweisen auf ed. Bomb.

Google translate:
The first 9 Skandha according to BURNOUF's. edition, unless 
expressly added. ed. Bomb.; the last Skandha according to the ed. 
Bomb. Numbers enclosed in parentheses refer to ed. Bomb.

@funderburkjim
Contributor Author

The Burnouf material mentioned above contains French translation and comments. There is no Sanskrit text! I think this disqualifies Burnouf for cdsl link target.

@funderburkjim
Contributor Author

funderburkjim commented Nov 21, 2024

I did further checking of the MUM version mentioned above.
For each of the 12 skandhas, I chose five or so PWG references from the first adhyaya, and compared the corresponding verse in MUM (so about 60 comparisons altogether).

All comparisons looked plausible.

The MUM version looks to me like a good cdsl link target.

@Andhrabharati Do you see any problems with using this MUM version?

@Andhrabharati

Andhrabharati commented Nov 21, 2024

I've no issues if the MUM version is decided "to be used", because it is "easily available"; I was just pointing to what the "actual sources" used in PWG and pwk were.

Surprised to see this post--

The Burnouf material mentioned above contains French translation and comments. There is no Sanskrit text! I think this disqualifies Burnouf for cdsl link target.

This clearly shows that Jim could not get to see the actual Burnouf edition volumes, and in a way has questioned the very intent of PWG & pwk that they had looked at the Burnouf ed.!!
[Indeed, it is somewhat difficult to reach out to these volumes, while the French translation volumes could be easily "got".]

Here are the 3 volumes of Burnouf with Sanskrit text--

[six images]

and against the post

no source found for "Bomb. ed. [Ganapat Krishnaji, Gujarati Press"

these may be looked at--
[Vol.1] https://www.digitale-sammlungen.de/en/view/bsb10211024?page=7
[Vol.2] https://www.digitale-sammlungen.de/en/view/bsb10625867?page=5

@funderburkjim
Contributor Author

in a way has questioned the very intent

Not at all; I did not find a reference to the Devanagari edition, so did not know it exists.
Now your pictures prove the existence of the Devanagari edition!
Will you provide links for the Devanagari editions?

@Andhrabharati

Andhrabharati commented Nov 21, 2024

@Andhrabharati

Surprised to see Jim still awake & active, so late at night at his place!

@Andhrabharati

And here is the index file for Burnouf's Bh.P.--
Burnouf BhP index.txt

@funderburkjim
Contributor Author

funderburkjim commented Nov 21, 2024

The gallica ... Burnouf downloads in the comment above

ARE IN FRENCH! (for volumes 1 and 2 -- Volume 3 is as desired - per next comment)

I tried your google drive link, but did not have permissions. Have requested permissions.

@Andhrabharati

[three images]

Yes, there seems to be some issue in downloading these now!

Granted permission.

@funderburkjim
Contributor Author

Got the files from Google Drive. All three contain the Devanagari text. Hurray!
So I can begin working with your index.

@Andhrabharati

Good; you may also try the latest links as given above once.

@funderburkjim
Contributor Author

funderburkjim commented Nov 21, 2024

I did try the links above, and they are all as expected (i.e. they have the Sanskrit).

@funderburkjim
Contributor Author

funderburkjim commented Nov 21, 2024

As for the Bombay edition, I have rotated the pages before extracting.
Agree?

We'll need an index also for Bombay edition.

My plan: For each of Burnouf and Bombay

  • make a repo in sanskrit-lexicon-scans
  • make a simple UI (hosted at Github) like one of those in the first comment of this issue.
    • I'm leaning towards the Gorresio UI variant, which would be better as stand-alone 'reader'. Any thoughts on this? (@drdhaval2785 , @gasyoun any preference?)

Then the display logic at cdsl would choose (at least for PWG) the burnouf display for skandhas 1-9, and the Bombay display for skandhas 10-12.
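
The selection rule described above could look roughly like the following. This is only a sketch under stated assumptions: the Burnouf base URL is the one from the sample links in this issue, while the Bombay base URL and its repository name `bhagp_bom` are hypothetical placeholders (no such repository is named in this thread), and `link_target` is an illustrative name rather than existing cdsl code.

```python
# From the sample links in this issue.
BURNOUF_BASE = "https://sanskrit-lexicon-scans.github.io/bhagp_bur/app1/"
# HYPOTHETICAL placeholder: the Bombay repository name is not decided in this thread.
BOMBAY_BASE = "https://sanskrit-lexicon-scans.github.io/bhagp_bom/app1/"

def link_target(skandha, adhyaya, verse):
    """Pick the Burnouf display for skandhas 1-9 and the Bombay display for 10-12."""
    if 1 <= skandha <= 9:
        base = BURNOUF_BASE
    elif 10 <= skandha <= 12:
        base = BOMBAY_BASE
    else:
        raise ValueError(f"skandha out of range: {skandha}")
    return f"{base}?{skandha},{adhyaya},{verse}"

print(link_target(3, 1, 21))
print(link_target(10, 1, 1))
```

Raising an error for a skandha outside 1-12 makes bad references visible instead of silently producing a dead link.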

@Andhrabharati

Andhrabharati commented Nov 22, 2024

We'll need an index also for Bombay edition.

Yes, and for the MBh. (scan links posted at github by AB) and R. (scan links not yet posted!) as well.

  • I'm leaning towards the Gorresio UI variant, which would be better as stand-alone 'reader'. Any thoughts on this?

Probably, you could make this THE std. style for all the "linked works" (already existing and possible future ones).

@funderburkjim
Contributor Author

Yes, and for the M.Bh. (scan links posted at github by AB) and R. (scan links not yet posted!) as well.

When this Bhagavata link target is finished, please remind me of what you are referring to here.

BTW: I am also investigating a bugbounty security issue.

@Andhrabharati

Andhrabharati commented Nov 23, 2024

You have nothing to do about this, Jim!

I was just mentioning that those two also need to be indexed.

This may be read with my earlier posts post-1 and post-2 at the other issue of PWG.

@funderburkjim
Contributor Author

Github app initialized.

Sample urls:
https://sanskrit-lexicon-scans.github.io/bhagp_bur/app1/

https://sanskrit-lexicon-scans.github.io/bhagp_bur/app1/?2,4,6

https://sanskrit-lexicon-scans.github.io/bhagp_bur/app1/index.html?2,4,6

Is this beautiful? Any warts to remove ?

The next step is to generate links for PWG et al.

@drdhaval2785

Looks beautiful to me.

@gasyoun
Member

gasyoun commented Nov 28, 2024

Is this beautiful? Any warts to remove ?

Hardly usable because of the scrolling.


@funderburkjim
Contributor Author

@gasyoun Yes, I noticed that also. How can the 'scrolling' wart be removed? I suspect this is an html-css issue.
Do you have anyone who can readily fix it?

@funderburkjim
Contributor Author

link standardization for pwg

5000+ lines were changed in pwg, to improve markup for 'Bhāg. P.' links.
The changes can be seen in files change_0a.txt, change_1.txt and change_2.txt in issue81 directory.

A comparison of the links with those of the index for the Burnouf version of skandhas 1-9 was made, and it uncovered a few errors, most of which were solved. See readme_check_bur.txt for the identification of these.
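
A comparison of this kind can be sketched as a range check of each reference against an index. The index layout here is an assumption: a mapping from (skandha, adhyaya) to the number of verses in that adhyaya, which is not necessarily how the actual Burnouf index file is organized. The verse counts in `toy_index` are invented for illustration and are not real data.

```python
def check_reference(index, skandha, adhyaya, verse):
    """Return None if the reference fits the index, else a short problem note."""
    last_verse = index.get((skandha, adhyaya))
    if last_verse is None:
        return f"adhyaya {skandha},{adhyaya} not found"
    if not 1 <= verse <= last_verse:
        return f"{verse} not found"
    return None

# Toy index with INVENTED verse counts, for illustration only.
toy_index = {(1, 1): 10, (1, 2): 8}

for ref in [(1, 1, 3), (1, 2, 9), (2, 1, 1)]:
    print(ref, check_reference(toy_index, *ref))
```

A check like this only catches references that fall outside the index; a reference can be in range and still point to the wrong verse, so it does not replace reading the cited passage.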

The work started with 22156 references, and now there are 30564.

Some summaries might be of interest.

  • (30461) bhagp_standard_2.txt has all the standard 'Bhāg. P.' records
  • (103) bhagp_nonstandard_2.txt has other literary source items mentioning 'Bhāg. P.'.
  • bhagp_verse.txt has all the verses from the standard records, with their frequency. The (0,0,0) item is <ls>Bhāg. P.</ls> . Altogether, 10509 different verses were referenced in PWG.
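
A frequency table like the one in bhagp_verse.txt can be sketched with a regular expression and a counter. This is an illustration only, not the script actually used: it assumes references have the simple form `<ls>BHĀG. P. s,a,v.</ls>` (as in the examples quoted in this issue) and maps a bare `<ls>Bhāg. P.</ls>` to (0,0,0). Other reference forms that may occur in pwg.txt are ignored.

```python
import re
from collections import Counter

# Matches "<ls>BHĀG. P. 3,1,21.</ls>" and the bare "<ls>Bhāg. P.</ls>".
# IGNORECASE covers both the 'BHĀG.' and 'Bhāg.' spellings.
REF = re.compile(r"<ls>BHĀG\. P\.(?: (\d+),(\d+),(\d+)\.?)?</ls>", re.IGNORECASE)

def verse_counts(lines):
    """Count how often each (skandha, adhyaya, verse) is referenced."""
    counts = Counter()
    for line in lines:
        for m in REF.finditer(line):
            if m.group(1) is None:
                key = (0, 0, 0)  # reference without a verse number
            else:
                key = tuple(int(g) for g in m.groups())
            counts[key] += 1
    return counts

# Invented sample lines using references that appear in this issue.
sample = [
    "<ls>BHĀG. P. 3,1,21.</ls> and again <ls>BHĀG. P. 3,1,21.</ls>",
    "<ls>Bhāg. P.</ls>",
    "<ls>BHĀG. P. 4,31,22.</ls>",
]
for verse, n in sorted(verse_counts(sample).items()):
    print(verse, n)
```

The number of distinct verses is then simply `len(verse_counts(lines))`.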

@Andhrabharati

Andhrabharati commented Dec 6, 2024

Here are the resolutions for the 20 "Unsolved Problems" (mentioned in readme_check_bur.txt)--
readme_check_bur--- Unsolved Problems.txt

[Jim needs to mark the print_changes, where required in these places, accordingly]

@funderburkjim
Contributor Author

@Andhrabharati please check the revision

106569	sarj	995565	<ls>BHĀG. P. 4,31,32.</ls>  32 not found	
AB: it is at 4,11,32

At 4,11,32 there is udvijate, but entry is about 'sarj'. page 7-0802 udvi
I do see उपसृष्ट (which is from सर्ज् ) at 4,11,32 but was expecting to see
something like उद्विसृष्ट .

funderburkjim added a commit to sanskrit-lexicon/csl-orig that referenced this issue Dec 6, 2024
funderburkjim added a commit to sanskrit-lexicon/csl-corrections that referenced this issue Dec 6, 2024
@funderburkjim
Contributor Author

Implemented the 'unsolved problems'.

Note: 4,11,32 was implemented, also.

This finishes the work of standardizing the PWG links for BHĀGAVATAPURĀṆA.
I will begin the same for PW(K) (and then SCH and MW).

@Andhrabharati

@Andhrabharati please check the revision

106569	sarj	995565	<ls>BHĀG. P. 4,31,32.</ls>  32 not found	
AB: it is at 4,11,32

My error, @funderburkjim !

The citation refers to 4,31,22, and not to 4,11,32.

funderburkjim added a commit to sanskrit-lexicon/csl-orig that referenced this issue Dec 6, 2024
funderburkjim added a commit to sanskrit-lexicon/csl-corrections that referenced this issue Dec 6, 2024
funderburkjim added a commit that referenced this issue Dec 6, 2024
@funderburkjim
Contributor Author

Revision made.

The print changes to pwg uncovered in this issue were surprising to me.
I wonder if there are any additional 'automated' ways to validate BHĀGAVATAPURĀṆA references. Seems like a difficult NLP problem.

@funderburkjim
Contributor Author

funderburkjim commented Dec 7, 2024

For standardization of PW(K) references, see sanskrit-lexicon/PWK#109.

For standardization of PWKVN references, see sanskrit-lexicon/PWK#110

For standardization of SCH references, see sanskrit-lexicon/SCH#10

For standardization of MW references, see sanskrit-lexicon/MWS#182

@funderburkjim
Contributor Author

@Andhrabharati Have you prepared an index for Bombay edition?

@Andhrabharati

No, I haven't touched it at all.

@funderburkjim
Contributor Author

Would making the Bombay index be a good task to ask Anna to do?

@Andhrabharati

I haven't seen Anna refusing to take up any task so far!

But, I presume it is a task far below her capacity!!
