-
Notifications
You must be signed in to change notification settings - Fork 123
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
failed to run linear regression workload #134
Comments
Hi @akasaki, I updated your comment to have backticks ``` around the plaintext and I'm looking into your issue now. |
Hi @akasaki just an update to let you know that I'm still working on this. At a very high level summary there's basically two things going on. One is that there's some issues with your config file. The second, however, is a much bigger issue on the Spark-Bench side. Long story short, there's no linear regression workload available for you to actually run, yet. Based on your ticket I revised #83 and am close to a PR for it. You can see my progress here: https://github.com/ecurtin/spark-bench/tree/linear-regression-workload although there is additional progress that hasn't been pushed up yet. |
@ecurtin I see. Thank you for taking time. I was trying to run logistic regression and failed (Typo: I said 'linear regression' in the title). I thought the dataset generated by linear regression data generator can also be used for logistic regression, but it seems like the datasets for these two workloads are totally different. From my understanding, the logistic regression data generator hasn't been ported to this new version. Am I right? |
That's correct. It's a long, convoluted story but the result is that at the current moment we have the linear regression generator and the logistic regression workload, which is nuts, I know. The linear regression workload is first on the list but the logistic regression generator will be coming at some point! |
Spark-Bench Version: spark-bench_2.1.1_0.2.2
Spark Version on Your Cluster: 2.1.1.2.6.1.0
Scala Version on Your Spark Cluster: 2.11.8
date_gen_lr.conf
lr.conf
Relevant Stack Trace (If Applicable)
Description of Problem, Any Other Info
I was trying to run linear regression workload, but it failed. Basically, I use Data Generator - Linear Regressionr to create 2 small files, one as input file and one as test file (indicated in the lr.conf). Anyone can help me with fixing the error? Thanks a lot.
The error message is attached below:
The text was updated successfully, but these errors were encountered: