Sample program for demo of spark program with cucumber framework
Employee dataframe with data - +-----------+-------------+-------+------+ |employee_id|employee_name|dept_id|salary| +-----------+-------------+-------+------+ |101 |"Rohit P" |10 |1000 | |102 |"Pooja P" |10 |1000 | |103 |"Rutu M" |10 |400 | |104 |"Rushi M" |20 |4000 | |105 |"Prithvi D" |20 |6000 | |106 |"Rajani D" |30 |10000 | |107 |"Shrikant D" |30 |5000 | |108 |"Rahul S" |30 |3000 | +-----------+-------------+-------+------+
Calculated average salary per department dataframe - +-------+----------------+ |dept_id|avg_sal_per_dept| +-------+----------------+ |30 |6000.0 | |20 |5000.0 | |10 |800.0 | +-------+----------------+
Expected average salary per department dataframe - +-------+----------------+ |dept_id|avg_sal_per_dept| +-------+----------------+ |10 |800.0 | |20 |5000.0 | |30 |6000.0 | +-------+----------------+