Skip to content

brainuser5705/interaktiv-landernamen

Repository files navigation

Interactive Country Names

Read the CHANGELOG for updates to the project.

The original raw data set has been uploaded onto Kaggle. On there, I also provide the sources I use and explanations about the data fields on there. Here is some additional information not described on Kaggle:

  • The list of countries/territories in the data set were scraped from this Wikipedia entry and no additional countries/territories were added afterwards.
  • The main German source (Auswaertiges Amt - German Federal Foreign Office) did not provide official names, captials and/or grammatical genders for all countries and territories. The grammatical genders were confirmed with Wikitionary (de and en version) for both the long and short name forms. Missing captials were taken from Wikipedia entries (de, en).
  • As Auswartiges Amt did not cover all the territorities, the missing data is sourced from Destatis
  • Here's the translation for where Auswaertiges Amt got its information:
The English, French and Spanish state designations as well as the
Personal designations and the adjectival derivations of the independent states are
UNTERM, as far as the information contained therein is taken from official language usage
correspond to the respective states.
The foreign-language information on capitals and sovereign areas is official
Taken from documents as well as monolingual reference works and atlases.
The German designations correspond to the official ones specified by the Foreign Office
Use.
  • The links to flag icons in the dataset were scraped from the same Wikipedia entry to get the country names, but they were 32*32 pixels (i think) resolution so I ended up using data taken from this Country flags SVGs Github repository.

Note about Country/Territories Standardization

As I have learned while researching for this project, it is impossible to have a single source of truth for a list of the world's countries/terriorities let alone the geographical borders between them. Political disputes of the sovereignty of nations, ownership of territories and borders means that there doesn't exist a world map that every country can agree upon. Additionally, open-source providers of such geospatial data often differ from each other e.g. their exact placement of boundary lines.

Just for personal convience, I am Natural Earth's TopoJSON of the world's countries since it was the first one I found. It actually does not fully align with the names data set as it has vectors for countries not present in the other. But both data sets cover all the countries with official ISO 3166-1 codes.

Other interesting references

Here's a list of interesting sources/references I encountered but did not use during development:

Technical references

Things learned while researching

  • Seat of government differs from capital in that it's where the government buildings are
  • Countries often have two flags, the offical one and their coat of arms
  • endonym - native; exonym - non-native
  • de jure - by law; vs de facto
  • unincorporated territories - parts of the consistution doesn't apply