{"id":336,"date":"2015-03-30T00:31:27","date_gmt":"2015-03-29T23:31:27","guid":{"rendered":"http:\/\/graal.ens-lyon.fr\/~diet\/?page_id=336"},"modified":"2015-04-27T21:04:25","modified_gmt":"2015-04-27T20:04:25","slug":"lrms","status":"publish","type":"page","link":"https:\/\/graal.ens-lyon.fr\/diet\/?page_id=336","title":{"rendered":"LRMS (Batch Scheduler)"},"content":{"rendered":"<h1 id=\"parent-fieldname-title\" class=\"documentFirstHeading\">LRMS DIET Management<\/h1>\n<div id=\"content-core\">\n<div class=\"newsImageContainer\"><a id=\"parent-fieldname-image\" href=\"http:\/\/graal.ens-lyon.fr\/diet\/features\/lrms\/image\/image_view_fullscreen\"> <\/a><\/p>\n<div id=\"attachment_337\" style=\"width: 689px\" class=\"wp-caption alignright\"><a href=\"http:\/\/graal.ens-lyon.fr\/~diet\/wp-content\/uploads\/2015\/03\/drawgantt.png\"><img loading=\"lazy\" decoding=\"async\" aria-describedby=\"caption-attachment-337\" class=\"size-full wp-image-337\" src=\"http:\/\/graal.ens-lyon.fr\/~diet\/wp-content\/uploads\/2015\/03\/drawgantt.png\" alt=\"Draw Gantt example\" width=\"679\" height=\"189\" \/><\/a><p id=\"caption-attachment-337\" class=\"wp-caption-text\">Draw Gantt example<\/p><\/div>\n<\/div>\n<div id=\"parent-fieldname-text\" class=\"plain\">\n<h2>Overview of LRMS (Local Resource Management System)<\/h2>\n<p style=\"text-align: justify;\">Parallel resources are generally accessible through reservation systems, also called batch systems. To run a job, a client submits a script containing the command line that launches the job, using the submission command of the batch system together with the chosen options. Options need not be given on the submission line; they can also appear in the script, using the appropriate syntax. Several batch systems exist. 
Among them are <span class=\"link-external\"><a class=\"externalLink\" href=\"http:\/\/www-03.ibm.com\/servers\/eserver\/clusters\/software\/loadleveler.html\" target=\"_blank\">LoadLeveler<\/a><\/span> on IBM resources, <span class=\"link-external\"><a class=\"externalLink\" href=\"http:\/\/www.openpbs.org\/\" target=\"_blank\">OpenPBS<\/a><\/span> and <span class=\"link-external\"><a class=\"externalLink\" href=\"http:\/\/www.clusterresources.com\/pages\/products\/torque-resource-manager.php\" target=\"_blank\">Torque<\/a><\/span>, which both derive from the well-known PBS system, and <span class=\"link-external\"><a class=\"externalLink\" href=\"http:\/\/oar.imag.fr\/\" target=\"_blank\">OAR<\/a><\/span>, developed by <span class=\"link-external\"><a class=\"externalLink\" href=\"http:\/\/www.imag.fr\/\" target=\"_blank\">IMAG<\/a><\/span> in Grenoble and used in the <span class=\"link-external\"><a class=\"externalLink\" href=\"http:\/\/www.grid5000.org\/\" target=\"_blank\">Grid&rsquo;5000<\/a><\/span> research grid project. 
Most submitted jobs are parallel jobs written against the <span class=\"link-external\"><a class=\"externalLink\" href=\"http:\/\/www-unix.mcs.anl.gov\/mpi\/\" target=\"_blank\">MPI<\/a><\/span> standard, using an implementation such as <span class=\"link-external\"><a class=\"externalLink\" href=\"http:\/\/www-unix.mcs.anl.gov\/mpi\/mpich1\" target=\"_blank\">MPICH<\/a><\/span> or <span class=\"link-external\"><a class=\"externalLink\" href=\"http:\/\/www.lam-mpi.org\/\" target=\"_blank\">LAM<\/a><\/span>.<\/p>\n<p style=\"text-align: justify;\">To use a batch system correctly, a client must provide several pieces of information, either on the submission line or in the script: the number of machines to allocate, how long they will be used, and the number of MPI processes to start (most applications still use MPI-1.2 implementations, in which the number of processes is fixed when the job is launched).<\/p>\n<h2>Problem statement<\/h2>\n<p style=\"text-align: justify;\">The Grid will only be used if its resources are easy for clients to access. Grid middleware is a good way to offer such transparent access, but few middleware systems can submit to batch systems transparently for the user, i.e., in the same way as for sequential jobs, where the client provides only the service and its input parameters. Information such as the walltime and the number of machines and processes must then be determined by the middleware during its scheduling phase, when it chooses the parallel resources to submit to.<\/p>\n<p>&nbsp;<\/p>\n<h2>Interfacing DIET with LRMS<\/h2>\n<p style=\"text-align: justify;\">We first considered Elagi, a library for submitting jobs remotely to batch systems. In our case, however, a DIET server (SeD) is deployed on each computing resource, so it runs locally alongside the batch system. 
Most of the capabilities of Elagi are therefore unused.<\/p>\n<p>We are currently extending the DIET server API so that any parallel application can be called from a SeD without additional information from the client.<\/p>\n<p>&nbsp;<\/p>\n<h2>Performance prediction and scheduling algorithms<\/h2>\n<p style=\"text-align: justify;\"><span class=\"link-external\"><a class=\"externalLink\" href=\"http:\/\/simgrid.gforge.inria.fr\/doc\/contrib.html\" target=\"_blank\">Simbatch<\/a><\/span> is a module of the grid simulator <span class=\"link-external\"><a class=\"externalLink\" href=\"http:\/\/simgrid.gforge.inria.fr\/\" target=\"_blank\">SimGrid<\/a><\/span>. It was designed and developed to model batch systems, so that distributed scheduling algorithms can be tested realistically, and to provide good performance prediction functions that can be embedded in the SeD.<\/p>\n<\/div>\n<\/div>\n","protected":false},"excerpt":{"rendered":"<p>LRMS DIET Management Overview of LRMS (Local Resource Management System) Parallel resources are generally accessible through reservation systems, also called batch systems. To run a job, a client submits a script containing the command line that launches the job, using the submission command of the batch system together with the chosen options. 
Options are not necessarily given in the submission [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":0,"parent":79,"menu_order":5,"comment_status":"closed","ping_status":"open","template":"","meta":{"footnotes":""},"class_list":["post-336","page","type-page","status-publish","hentry"],"_links":{"self":[{"href":"https:\/\/graal.ens-lyon.fr\/diet\/index.php?rest_route=\/wp\/v2\/pages\/336","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/graal.ens-lyon.fr\/diet\/index.php?rest_route=\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/graal.ens-lyon.fr\/diet\/index.php?rest_route=\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/graal.ens-lyon.fr\/diet\/index.php?rest_route=\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/graal.ens-lyon.fr\/diet\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=336"}],"version-history":[{"count":3,"href":"https:\/\/graal.ens-lyon.fr\/diet\/index.php?rest_route=\/wp\/v2\/pages\/336\/revisions"}],"predecessor-version":[{"id":380,"href":"https:\/\/graal.ens-lyon.fr\/diet\/index.php?rest_route=\/wp\/v2\/pages\/336\/revisions\/380"}],"up":[{"embeddable":true,"href":"https:\/\/graal.ens-lyon.fr\/diet\/index.php?rest_route=\/wp\/v2\/pages\/79"}],"wp:attachment":[{"href":"https:\/\/graal.ens-lyon.fr\/diet\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=336"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}