R package: tsfeaturex

Description

Calculate many features (over 50) of a time series. Click here to view the full feature list

Dependencies

tidyverse (https://www.tidyverse.org/)

Imports

stats (https://www.R-project.org/)
psych (https://CRAN.R-project.org/package=psych)
e1071 (https://CRAN.R-project.org/package=e1071)
entropy (https://CRAN.R-project.org/package=entropy)
Langevin (https://CRAN.R-project.org/package=Langevin)
Hmisc (https://CRAN.R-project.org/package=Hmisc)
forecast (https://CRAN.R-project.org/package=forecast)
zoo (https://CRAN.R-project.org/package=zoo)
viridis (https://CRAN.R-project.org/package=viridis)

Acknowledgements

Special Thanks

Inspiration for automatic feature extraction: https://github.com/blue-yonder/tsfresh
Dr. Nilam Ram for code on probability of acute change
Github user 'stas-g' for peak-finding code: https://github.com/stas-g/findPeaks

Funding

Nelson Roque was supported by National Institute on Aging Grant T32 AG049676 to The Pennsylvania State University.

Roadmap

Push to CRAN (June 2019)
Extracting numerical features from text data (Q2 2019)
More features (Fast Fourier Transform (FFT), Time-Series Components (Seasonality, trend, random), Friedrich coefficients (Q3 2019)
Extracting numerical features from image data (Q4 2019)

Statement of Need

In today's digital world, data collection and storage costs are quite low. Humans are collectively outputting 2.5 quintillion bytes of data every day; by 2020, each person will generate ~ 1.7 MB every second [@ibmstats]. At this scale, intensive longitudinal data about humans' behavior facilitates new discovery about the patterning of thought and action and potentially better prediction and optimization of health and well-being. In raw, form the 2.5 quintillion bytes of raw data generated daily are difficult to interpret -- noisy time-series. Extraction of features from the time-series, however, allows:

Researchers to reduce the dimensionality of their time-series data (e.g., reducing millions of time-stamped observations to, for example, summary feature vector of length 100);
Summary characterizations of time-series data that may be used as predictors, correlates, or outcomes in study of between-person differences; and
Improved and detailed description of human behavior streams (e.g., characterizing a behavioral time series in terms of its features; the mean is 'X', the range is 'Y', the peaks are at 'T12' and 'T30').

Short data streams are easily summarized using basic features (e.g., mean, standard deviation, IQR). However, as the time-series get longer, numerous other features may be needed and/or can be accessed. Study of intraindividual variability has outlined the wide variety of time-series features that can be used to characterize between-person differences and within-person change - with features such as probability of acute change (PAC) or mean square of successive differences (MSSD) providing useful information about individuals' cognitive, emotional, and behavioral dynamics.

Using `tsfeaturex`

Changelog

Click here to view the change log

Installation:

devtools::install_github("nelsonroque/tsfeaturex")

Usage:

# load library
library(tsfeaturex)

# for reproducibility of this example
set.seed(516)

# create test data
dat <- data.frame(expand.grid(day=c(1:7),id=c(1:100)))
dat$y <- rnorm(nrow(dat),5,1.5)
dat$y[1:3] <- NA # introduce NAs to check

# run function
out.list <- extract_features(df=dat,group_var="id",value_var="y",features="all")

# convert list to data.frame (MapReduce)
final.df <- features_to_df(out.list, data.format="wide", group_var = "id")

# get feature correlations
cor.df <- feature_correlations(final.df, data.format="wide", id_var = "id")

# view results
View(final.df)

Report a bug

Click here to file an issue on Github or feel free to reach out directly

Request a New Feature

Click here to request a new feature on Github or feel free to reach out directly

Name		Name	Last commit message	Last commit date
Latest commit History 100 Commits
.github/ISSUE_TEMPLATE		.github/ISSUE_TEMPLATE
R		R
docs		docs
man		man
paper		paper
tests		tests
.Rbuildignore		.Rbuildignore
.gitattributes		.gitattributes
.gitignore		.gitignore
DESCRIPTION		DESCRIPTION
LICENSE		LICENSE
NAMESPACE		NAMESPACE
README.md		README.md
tsfeaturex.Rproj		tsfeaturex.Rproj

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

R package: tsfeaturex

Description

Dependencies

Imports

Acknowledgements

Special Thanks

Funding

Roadmap

Statement of Need

Using `tsfeaturex`

Changelog

Installation:

Usage:

Report a bug

Request a New Feature

About

Releases 2

Packages

Contributors 2

Languages

License

nelsonroque/tsfeaturex

Folders and files

Latest commit

History

Repository files navigation

R package: tsfeaturex

Description

Dependencies

Imports

Acknowledgements

Special Thanks

Funding

Roadmap

Statement of Need

Using tsfeaturex

Changelog

Installation:

Usage:

Report a bug

Request a New Feature

About

Resources

License

Stars

Watchers

Forks

Releases 2

Packages 0

Contributors 2

Languages

Using `tsfeaturex`

Packages