-
Notifications
You must be signed in to change notification settings - Fork 3
Data preprocessing notes
JonThom edited this page Feb 6, 2022
·
3 revisions
- registered address
- in 622 cases, the
registered_address:is_address
role is played not by anode_address
type but by anorg_entity
. (468 different org_entities in total)
- in 622 cases, the
- 1139 records are duplicated in the
intermediary
andofficer
tables, and playintermediary_of:is_intermediary
andofficer_of:is_officer
.- in 7186 cases, the
is_officer
role is not present among the officers- in 7183 of these, the role player is in the intermediaries
- 1186 intermediaries play
is_officer
50487 times in total- these make up 1116 out of the 1139 (mentioned above) that appear in both the
officer
andintermediary
tables
- these make up 1116 out of the 1139 (mentioned above) that appear in both the
- 1186 intermediaries play
- the remaining 3 cases are accounted for by 1
org_entity
withid
1099616
- in 7183 of these, the role player is in the intermediaries
- all officers that play
is_intermediary
also occur in theintermediary
table, while 70 intermediaries that playis_officer
do NOT occur in theofficer
table
- in 7186 cases, the
- Edit the schema to allow
org_entity
to playregistered_address:is_address
- Edit the schema to allow
intermediary
to playofficer_of:is_officer
, andofficer
to playintermediary_of:is_intermediary
- Delete 3 rows from
officer_of
table where role player is anorg_entity
(rather than modifying schema)