Grid-Federation: Decentralized Resource Brokering and Resource Allocation
Introduction
The Grid resource brokering ( or superscheduling ) problem is defined as: " scheduling jobs across the grid resources such as computational clusters, parallel supercomputers, desktop machines that belong to different administrative domains". Brokering in computational grids is facilitated by specialized application schedulers such as Nimrod-G, Condor-G, Work-Flow Engines. Brokering activity involves (i) querying grid resource information services (GRIS) for locating resources that match the job requirements; (ii) coordinating and negotiating Service Level Agreements; and (iii) job scheduling. The grid resources are managed by their local resource management systems such as Condor, Portable Batch System, Sun Grid Engine and Alchemi. These systems manage job queues, initiate and monitor their execution.
Existing approaches to resource brokering in a Grid environment are non-coordinated and are based on centralized information services. In this case, application schedulers such as Nimrod-G, Condor-G perform scheduling related activities independent of the other schedulers in the system. They directly submit their applications to the underlying resources without taking into account the current load, priorities, utilization scenarios of other application level schedulers. Clearly, this can lead to over-utilization or bottleneck of some valuable resources while leaving others largely underutilized. Furthermore, these brokering systems do not have a co-ordination (or cooperative) mechanism, hence this exacerbates the load sharing and utilization problems of distributed resources because of the sub-optimal schedules that are likely to occur.
To overcome this, we propose federating these distributed brokers as part of one decentralized grid system. The resulting grid system is referred to as Grid-Federation. Our Grid-Federation system is defined as a large scale decentralized resource sharing system that consists of a coordinated federation of distributed computational resources. The key features of our proposed Grid-Federation includes: (i) a market-based grid scheduling technique; (ii) decentralization via a shared federation directory that gives site autonomy and scalability; (iii) ability to provide admission control facility at each site in the federation; (iv) incentives for resources owners to share their resources as part of the federation; and (v) access to a larger pool of resources for all users.
Project Team Members
- Rajkumar Buyya and Aaron Harwood (Project Advisors)
- Rajiv Ranjan (PHD Student) mailto: rranjan AT csse dot unimelb dot edu dot au
Active Members:
- Carlos Alexandre Queiroz (MEDC Student) mailto: caxqueiroz AT gmail dot com
- Rajiv Ranjan, Rajkumar Buyya, and Aaron Harwood A Model for Cooperative Federation of Distributed Clusters, In Proceedings of the 14th IEEE International Symposium on High Performance Distributed Computing (HPDC-14), July 24-27, 2005, Research Triangle Park, North Carolina, USA.
- Rajiv Ranjan, Rajkumar Buyya, and Aaron Harwood A Case for Cooperative and Incentive Based Coupling of Distributed Clusters, In Proceedings of the 7th IEEE International Conference on Cluster Computing (Cluster 2005), IEEE Computer Society Press, September 27 - 30, 2005, Boston, Massachusetts, USA.
- Rajiv Ranjan, Aaron Harwood, and Rajkumar Buyya A SLA-Based Cooperative Superscheduling Scheme for Computational Grids , In Proceedings of the 8th IEEE International Conference on Cluster Computing (Cluster 2006), IEEE Computer Society Press, September 27 - 30, 2006, Barcelona, Spain.
- Rajiv Ranjan, Aaron Harwood, and Rajkumar Buyya A Case for Cooperative and Incentive Based Federation of Distributed Clusters, (Submitted to the Future Generation Computer Systems Journal (FGCS))
- Rajiv Ranjan, Aaron Harwood, and Rajkumar Buyya A SLA-Based Cooperative Superscheduling Algorithms for Computational Grids , (Submitted to the ACM Transactions on Autonomous and Adaptive Systems (TAAS)).
- Rajiv Ranjan, Rajkumar Buyya, and Aaron Harwood A Case for Decentralized Grid Resource Indexing, (Submitted to IEEE TCSC Newsletter).
- Rajiv Ranjan, Aaron Harwood, and Rajkumar Buyya A Study on Peer-to-Peer Based Discovery of Grid Resource Information, (Technical Report, GRIDS-TR-2006-17).
- Decentralized Superscheduling and Resource Allocation ( Research Directory )
- Peer-to-Peer based Indexing of Grid Resources ( Research Directory )
Developer
Publications
Distributed Brokering
Decentralized Grid Resource Indexing
Useful Links
Stay Informed. Register Now!
Department of Computer Science and Software Engineering
The University of Melbourne, Australia