Skip to content
katjabercic edited this page Aug 2, 2016 · 7 revisions

FI-Impact R analysis

Data sources

  • Features
    The file features.csv describes which columns are used in the analysis and how. See documentation.
  • Accelerator information
    Files accelerators shorthand.csv and all indicators-Table 1.csv contain information about accelerators.
  • Project information
    Files fi-impact-export.txt and Final FIWARE Usage Assessment Scorecard.csv contain information about projects.
  • Additional funding Sub-grantees which obtained additional funding are listed with their unique identifiers in additional-funding.txt, one per line.

Connectivity data sources

The list of project partners Partners-Table 1.csv was compiled semi-automatically from project spreadsheets (these were in turn provided by the IDC). Each line represents a participation of a partner in a FI-PPP project. Columns breakdown:

  • Partner: in the IDC spreadsheets partners appear with different names. We attempted to match the different names to one entity, identified by the name in this column.
  • Name used: name of partner as used in the spreadsheet in a particular instance.
  • Project: FI-PPP project in which the partner participated.
  • Phase: Phase of the FI-PPP project.

The FI-PPP projects are listed in Projects-Table 1.csv. Each line contains a numeric id, project label, project phase and where applicable, accelerator shorthand. This file is used as a list of nodes for the connectivity graph in partners [Nodes].csv.

Files

  • Main file: run.R
    This is the file that needs to be run to obtain computations.
  • Functions: utility.R.
  • Data imports: import.R, partners.R
    These scripts import data from the csv files.
  • Connectivity computations: partners.R
    Outputs edge list for the connectivity graph and csv file with different degrees of connectivity.
  • Binary accelerator property computations: binary.R
    Outputs CDF diagrams and practice score csv files.
  • Correlations computations: correlations.R
    Outputs heatmaps of correlations with corresponding csv files with values and sample sizes.
  • Boxplots for categories: boxplot.R
    Outputs boxplots for binary and categorical indicators and boxplots for comparing accelerators.
  • Score overview tables and histograms: overview.R
    Outputs 5-bin histograms for scores (overall and per accelerator) and corresponding csv files.
  • T-test computations: ttest.R
    Outputs t-tests for binary and numeric indicators.
Clone this wiki locally