-
Notifications
You must be signed in to change notification settings - Fork 7
/
identifiers.txt
110 lines (85 loc) · 3.72 KB
/
identifiers.txt
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
IDENTIFIERS
The identifiers that we associate with EEBO TCP texts all mean
something;
some are machine-actionable and some are not.
1. TCP ID
The TCP identifier identifies the TCP file in all its incarnations.
Given a
TCP ID of A69506, the original SGML version will be called A69506.sgm,
the P4 XML with header will be called A69506.headed.xml, and so forth.
Those versions of the files have not been mounted for individual
download,
though if you had downloaded them all, this ID is the way to retrieve
them
from your local repository.
Sebastian has indicated how to use the TCP ID in a URL to get his files.
(And I previously indicated how to use it in a script to download
swathes
of the TCP P5 files.)
It may also be used to access the online versions of the files on the
TCP sites at Michigan and Oxford. Given a TCP ID A69506, this
URL will access the top-level page at the Michigan site:
http://name.umdl.umich.edu/A69506.0001.001
and this URL will fetch the corresponding page on the Oxford Digital
Library site:
http://eebo.odl.ox.ac.uk/cgi/t/text/text-idx?c=eebo;cc=eebo;view=toc;idno=A69506.0001.001
Both of these pages are HTML generated in the traditional way from
indexed XML.
[One used to be able to pull down the corresponding page at the
PhiloLogic site (Chicago/Northwestern)
using just the TCP ID, but I'm not sure if that is still true. At least,
I can't immediately see
how to do it if it is.]
2. The 'VID' is an image-set identifier for ProQuest's EEBO product. If
your institution
is an EEBO member, and your VID is 94927, then the URL for image 1 of
image set
94927 is
http://gateway.proquest.com/openurl?ctx_ver=Z39.88-2003&res_id=xri:eebo&rft_id=xri:eebo:image:94926
or (in an old-fashioned shorthand that still mostly works):
http://wwwlib.umi.com/eebo/image/94927/1
Every EEBO page image may be uniquely identified thus by a combination
of
VID and sequence number ("94927/1").
3. The EEBO ID (which in some of our metadata is called a "BIBNO") is
the ProQuest ID for the bibliographic item (i.e. for the catalog record
for
the title, as opposed to the image set for the copy). Give an EEBO ID
of 12880700, this URL:
http://gateway.proquest.com/openurl?ctx_ver=Z39.88-2003&res_id=xri:eebo&rft_id=xri:eebo:citation:12880700
fetches the bibliographic record for this item from the ProQuest EEBO
site.
In many cases (but not all) these numbers are actually OCLC accession
numbers, and
therefore may be used in OCLC (e.g. in OCLC Connexion) to retrieve the
OCLC version
of the same record.
4. The STC numbers (sub-typed as Wing, Pollard&Redgrave, Evans, ESTC,
etc.) are all
forms of short-title catalog or (in the case of ESTC) full record
catalog. Most of
these are only human-actionable, unless you happen to have an electronic
copy of catalog in question. The ESTC number, however, can be used to
access the ESTC version of the bibliographic record for the item in
question.
I don't recall if this can be embedded in a URL. Try it and see.
PAGE COUNTS
Sebastian indicates that his page counts are generated by counting
PBs. This can only be an approximation, since in many books the
same page may be referenced multiple times (ie. one page may generate
multiple PB tags.)
Our headers also contain image counts (and I think Sebastian retained
those in his?).
If he did, they too are an approximate surrogate for an actual page
count, since
they count total number of page images in the image set, but (1) some of
those
may be duplicates; and (2) most but not all of them (in EEBO at least)
are
two-up images. Some are one-up (one page per image). and in a few cases
a single broadsheet page may be distributed over as many as a dozen
images.
pfs
--
Paul Schaffner Digital Library Production Service
PFSchaffner@umich.edu | http://www.umich.edu/~pfs/