Source of input files for regex-redux? #268
Replies: 1 comment 1 reply
-
The input files used for the regex-redux problem (and also the knucleotide problem too) are generated by using one of the programs from the fasta problem with an argument of the desired size. The fasta programs generate their output through a combination of repeating a small sequence many times and generating sequences using a pseudorandom number generator. Most modern Unix-like systems include Wget or cURL and Python 3 so on these systems you can generate this file pretty easily with the Python #1 fasta program using the following commands in a terminal:
or
The Python code may take around half a minute to generate a file of this size. If you want to use larger file sizes, you may want to use one of the faster programs listed at https://programming-language-benchmarks.vercel.app/problem/fasta (or also https://benchmarksgame-team.pages.debian.net/benchmarksgame/performance/fasta.html) but you may need to make sure you have a suitable compiler/interpreter installed first. |
Beta Was this translation helpful? Give feedback.
-
Where is the file 2500000_in for regex-redux coming from? Couldn't find it. I don't have and want a full test setup locally, so I assume it is magically generated, isn't it?
Beta Was this translation helpful? Give feedback.
All reactions