Skip to content

Implementation of PAS: Performance Aware Job Scheduling on Hadoop YARN and Spark

Notifications You must be signed in to change notification settings

yathin017/Performance_Aware_Job_Scheduling-Big_Data

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 

Repository files navigation

Code

There are totally three algorithm in code/algorithm folder, including PAS, AHP and SJF. And we also change the source code of hadoop yarn resource manger. To use our code. Firstly, you should export the hadoop-yarn-server-resourcemanager to the jar and replace it on the yarn. Then, you can use our code by running the socket_server.py, because the communication between hadoop and python is via socket.

Data

Prediction

Here is data of job completion time under different Resource Utilization Ratio (RUR), which including wordcount, sparkpi and SVM.

Experiment

Here is data of six groups of workloads, including the submission sequence, makespan, turnaround time and RUR.

About

Implementation of PAS: Performance Aware Job Scheduling on Hadoop YARN and Spark

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages