cuThomasBatch Functions to perform the Thomas Algorithm on tridiagonal systems in batches with CUDA. Code was used in the following paper: Efficient Interleaved Batch Matrix Solvers for CUDA (preprint http://arxiv.org/abs/1909.04539)