-
Notifications
You must be signed in to change notification settings - Fork 123
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Adding delay to some jobs #140
Comments
Hmm, to my knowledge there is not. Spark-Bench sits at the level of a Spark Application, so it has no direct way to access and manipulate the internal jobs described in the picture. One thing that may or may not help depending on your use case is a SparkListener or tracer. These can be used in conjunction with Spark-Bench but they target deeper internal stuff. One of my colleagues built a configurable Spark tracer: https://github.com/SparkTC/spark-tracing and there are many others out there. #113 is from a user who tied in a tracer with Spark-Bench. Wish I could be of more help! |
Hi @Aalnafessah I saw you commented on #139 but I think your comment was addressing this issue. If you want to add delay to the csv load and write functions, those are here in the Spark-Bench codebase. If you're trying to add delay between tasks such as The only way I could see do this is by checking out Spark, inserting your delay statements to the KMeans class, and then re-compiling and running Spark-Bench against your re-compiled version of Spark by changing the |
Thanks @ecurtin.
Do you mean that i have to reinstall Apache Spark then recompile SparkBench ? |
I mean that you would need to edit the code inside of Spark itself and recompile Spark. You can use Spark-Bench as-is and just change the |
I got it. I will try your solution then I will let you know about the update. |
Cool, I'm gonna close this issue, feel free to reopen if you have more questions! |
@Aalnafessah hi~ Did you succeed in adding delay for some spark-jobs? And how? |
Spark-Bench Version
spark-bench-launch-2.1.1_0.2.2-RELEASE
Spark Version on Your Cluster
Spark.2.2.1
Scala Version on Your Spark Cluster
Scala version 2.11.8
Spark Cluster: Spark Standalone ( 1 master and 2 slaves).
Your Exact Configuration File (with system details anonymized)
Description of Problem, Any Other Info
I have run the SparkBench using KMeans Workload. Using the Spark dashboard i got the following jobs:
I am studying the Spark performance. I would like to add some delay inside some jobs ( e.g. adding delay inside job1 5 second). Is there any way to add this delay.
The text was updated successfully, but these errors were encountered: