bwgrid A computing grid for Baden-Württemberg

Sven Hermann
Steinbuch Centre for Computing (SCC)
KIT, University of the State of Baden-Wuerttemberg and National Research Center of the Helmholtz Association
www.kit.edu

Outline
- bwgrid partners
- bwgrid: the infrastructure
  - infrastructure
  - hardware at the sites
- What is (not) grid on bwgrid?
- bwgrid from a user's view
  - Webpage
  - User groups
  - bwgrid-vo
  - future: User portal (Ulm)
- bwgrid as a project
  - course of the project
  - BSCW ("Basic Support for Cooperative Work")
  - successes & publicity
  - upcoming tasks 2011/2012
- bwgrid technical features as grid
  - Jobs, CPUh & efficiencies
  - grid usage & cross-site computing
- Summary

Hermann, BFG Workshop, 28.04.2011

bwgrid Partners

bwgrid infrastructure (1/2)
- 9 sites with ca. 1680 nodes of 8 cores each
- In total 13450 cores
- 510 TB storage accessible via grid middleware (Lustre system)
- Current middleware for all clusters: Globus 4.0.8

bwgrid infrastructure (2/2)
- The German federal government paid for the hardware
- MWK Baden-Württemberg paid for the personnel resources
- Grid: transparent aggregation of computing units in Baden-Württemberg

Hardware at the sites

Item | Freiburg, Heidelberg, Karlsruhe, Mannheim, Tübingen | Stuttgart | Ulm | Esslingen
Number of nodes (IBM Bladeserver HS21; in Esslingen: Appro gb222x) | 140 | 434 | 280 | 180
Number of blade chassis (IBM BladeCenter H; in Esslingen: Appro 5U) | 10 | 31 | 20 | 18
CPU cores per node | 8 | 8 | 8 | 8
Main memory per node [GB] | 16 | 16 | 16 | 24
Local disk storage per node [GB] | 120 | 0 | 120 | 0
Number of InfiniBand switches (Voltaire Grid Director ISR 2012) | 1 | 2 | 1 | 1
Number of InfiniBand ports | 168 | 576 | 288 | 192
Number of front-end and back-end servers (IBM xserver x3650) | 2 | 2 | 2 | 2
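The per-site figures in the hardware table can be summed to get a rough feel for the aggregate size. The sketch below is only an illustration: it assumes that the first column means 140 nodes at each of the five named sites (the slide does not say so explicitly), and the names `SiteGroup`, `GROUPS` and `totals` are made up for this example. The table lists eight site names while the infrastructure slide speaks of 9 sites, so these sums need not match the rounded headline figures.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SiteGroup:
    name: str
    sites: int             # number of sites sharing this configuration
    nodes_per_site: int
    cores_per_node: int
    mem_gb_per_node: int


# Values copied from the hardware table.
# ASSUMPTION: the first column is read as "140 nodes at each of 5 sites".
GROUPS = [
    SiteGroup("Freiburg/Heidelberg/Karlsruhe/Mannheim/Tübingen", 5, 140, 8, 16),
    SiteGroup("Stuttgart", 1, 434, 8, 16),
    SiteGroup("Ulm", 1, 280, 8, 16),
    SiteGroup("Esslingen", 1, 180, 8, 24),
]


def totals(groups):
    """Return (nodes, cores, main memory in GB) summed over all groups."""
    nodes = sum(g.sites * g.nodes_per_site for g in groups)
    cores = sum(g.sites * g.nodes_per_site * g.cores_per_node for g in groups)
    mem_gb = sum(g.sites * g.nodes_per_site * g.mem_gb_per_node for g in groups)
    return nodes, cores, mem_gb


if __name__ == "__main__":
    nodes, cores, mem_gb = totals(GROUPS)
    print(f"nodes={nodes} cores={cores} memory={mem_gb} GB")
```

Keeping the per-site data in one list makes it easy to re-run the sums if a site's configuration is read differently.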

What is Grid on bwgrid?
- Combination of computer resources from multiple administrative domains to reach a common goal
- A distributed system with non-interactive workloads that involve a large number of files
- More loosely coupled, heterogeneous, and geographically dispersed than cluster computing
- A single grid is commonly used for a variety of different purposes
- Often constructed with the aid of general-purpose grid software libraries known as middleware
- Parallel computing locally at each site
- Very similar architecture at all sites (OS, software modules)

What is not Grid on bwgrid?
- No meta-scheduling (like gLite WMS), but cross-site computing is possible
- No "super virtual computer" composed of many networked, loosely coupled computers acting together to perform very large tasks, but large clusters per site
- No cross-site parallel computing

bwgrid Webpage: www.bw-grid.de (powered by Uni Konstanz)

Users in bwgrid
Discipline (# of projects):
- Astrophysics (5)
- Biology (21)
- Chemistry (27)
- Economic science (9)
- Informatics (7)
- Mathematics (1)
- Physics (17)
- Political science (2)
- Social science (2)
Examples: neuroscience, geophysics, quantum chemical analysis

User authorization (bwgrid-vo)
- Registration (once): the user registers at the bwgrid VOMRS host
- User machine: the user creates a proxy with grid-proxy-init
- The user connects to a bwgrid site using the proxy to submit a job there
- Computing resource: GT4 container incl. gridmap file
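The slide only states that the GT4 container on each computing resource includes a gridmap file. As a rough illustration of what such a file does, the sketch below parses the conventional Globus grid-mapfile layout (a quoted certificate subject DN followed by one or more comma-separated local account names) and looks up the local account for a user. The DN, the account name and the function names are hypothetical; how bwgrid generated its gridmap files from the VOMRS registrations is not described in the slides.

```python
import shlex


def parse_gridmap(text):
    """Map certificate subject DNs to lists of local account names."""
    mapping = {}
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and comments
        parts = shlex.split(line)  # honours the quotes around the DN
        if len(parts) != 2:
            raise ValueError(f"unexpected gridmap line: {raw!r}")
        dn, accounts = parts
        mapping[dn] = accounts.split(",")
    return mapping


def local_account(mapping, dn):
    """Return the first local account mapped to dn, or None if unmapped."""
    accounts = mapping.get(dn)
    return accounts[0] if accounts else None


# Hypothetical sample entry, for illustration only.
SAMPLE = '''
# DN -> local account
"/C=DE/O=Example/OU=Example Site/CN=Jane Doe" bw_jdoe
'''

if __name__ == "__main__":
    gridmap = parse_gridmap(SAMPLE)
    print(local_account(gridmap, "/C=DE/O=Example/OU=Example Site/CN=Jane Doe"))
```

A user whose DN is not in the file gets `None` here, which corresponds to a site refusing the job because the user is not authorized.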

bwgrid CPUh usage, H2 2010
- bwgrid VO: 76%
- Local users: 22%
- Other: 2%

bwgrid User Portal (1/3)
From the user's point of view: a meta submit system
- login
- file browser

bwgrid User Portal (2/3)
- GridProxyManager / MyProxy portlet
- Job monitoring portlet

bwgrid User Portal (3/3)
... and most important: portlets for applications
- Gatlet (Bash scripts)
- Math
- Chem
- CAE
- Med

Course of the project, Jan 2010 to Apr 2011
- Regular bi-weekly video conference (Thursday)
- On 1.1.2010 the project leadership of bwgrid was handed over from HLRS to KIT
- End of April 2010: the outstanding technical report was delivered
- Mid May 2010: HS Esslingen joined with 180 nodes
- March 2011: successful F2F meeting bwgrid @KIT

bwgrid BSCW (powered by HLRS)

Successes and publicity
- bwgrid poster for D-Grid AHM 2010 in Dresden
- Monitoring with "webmds" was working for all sites (but there is currently an outage): http://webmds.lrz-muenchen.de:8080/webmds/xslfiles/csm/
- bwgrid site Lustre upgrades: since Jan 2010 more stable than before
- bwgrid-wide initiative for unification of all bwgrid clusters successfully finished in Sept 2010
- Since then users have had a standard environment at the different sites
- Continuous improvement ongoing (e.g. software modules with common versions)

Upcoming tasks 2011/2012 (1/2)
- Careful coordination of the update to Globus 5
- Further middleware? E.g. UNICORE and/or gLite
- Improved transparency for users
- Workshops & training for users: acquisition, access, promotion
- BW-wide user support (1st line) with link to the NGI-DE Support Portal (https://helpdesk.ngi-de.eu)

Upcoming tasks 2011/2012 (2/2)
- Simplified login to the grid (e.g. with Shibboleth?)
- More elaborate accounting for more detailed statistics: which user group computes what in bwgrid?
- Overall cluster scheduling: load balancing (MOAB or alternative?)
- Transfer of user data among sites?
- Hedge against outages: maintenance contracts for several hardware components needed! New contracts (e.g. InfiniBand switches, front-end servers)

[Figure: Used MCPUh/month (whole bwgrid), July to December; y-axis 0 to 10 MCPUh; MAX 9.2 MCPUh; line marking 80% efficiency]

[Figure: Used MCPUh/month (whole bwgrid), July to December; y-axis 0 to 10 MCPUh; MAX 9.2 MCPUh]
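The relation between monthly capacity, used CPU hours and the 80% line can be sketched as follows. This is a minimal illustration under two assumptions that the slides do not confirm: that "efficiency" means used CPUh divided by the maximum available CPUh, and that the maximum is simply cores times hours in the month. The slides do not state how the MAX of 9.2 MCPUh was derived, and the function names are made up for this example.

```python
def capacity_mcpuh(cores: int, days: int) -> float:
    """Theoretical maximum in millions of CPU hours for a month of `days` days."""
    return cores * 24 * days / 1_000_000


def efficiency(used_mcpuh: float, max_mcpuh: float) -> float:
    """Fraction of the available CPU hours that was actually used."""
    if max_mcpuh <= 0:
        raise ValueError("max_mcpuh must be positive")
    return used_mcpuh / max_mcpuh


if __name__ == "__main__":
    # Headline core count from the infrastructure slide, 30-day month.
    print(f"capacity for 13450 cores: {capacity_mcpuh(13450, 30):.3f} MCPUh")
    # Using the MAX value shown in the chart: 80% of 9.2 MCPUh.
    print(f"80% line at: {0.8 * 9.2:.2f} MCPUh")
    print(f"efficiency at 7.36 of 9.2 MCPUh: {efficiency(7.36, 9.2):.0%}")
```

With the headline 13450 cores the naive capacity comes out above the chart's MAX value, so the chart presumably rests on a different core count or availability assumption.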

[Figure: Percentage of CPUh used by different sites (per site), Sept to Feb; y-axis 0% to 100%; series: Average, Ma/Hd, Frei, Ess, Tue, Ka, Stutt, Ulm]

[Figure: Site Tübingen: CPUh, Sept 2010 to Feb 2011; y-axis 0 to 1,000,000 CPUh; MAX line; stacked by origin: unknown, fremdvo, Ulm, Stuttgart, MA/HD, Konstanz, Karlsruhe, Hohenheim, Freiburg, Esslingen, Tuebingen, 000-LOKAL]

[Figure: Site Karlsruhe: CPUh, Sept 2010 to Feb 2011; y-axis 0 to 1,000,000 CPUh; MAX line; stacked by origin: unknown, fremdvo, Ulm, Tuebingen, Stuttgart, MA/HD, Konstanz, Hohenheim, Freiburg, Esslingen, 000-LOKAL, Karlsruhe]

[Figure: Site Esslingen: CPUh, Sept 2010 to Feb 2011; y-axis 0 to 1,000,000 CPUh; MAX line; stacked by origin: unknown, fremdvo, Ulm, Tuebingen, Stuttgart, MA/HD, Konstanz, Karlsruhe, Hohenheim, Freiburg, Esslingen, 000-LOKAL]

[Figure: Correlation of #Jobs and CPUh (per site); x-axis: #Jobs/month (log scale, 100 to 1,000,000); y-axis: CPUh/month (0 to 3,000,000); series: Ess, Frei, KA, Ma/Hd, Stutt, Tüb, Ulm]

Summary
bwgrid ...
- is a strong collaboration of 9 specialized sites
- makes 13500 CPU cores and 500 TB of storage accessible to BW users
- has grown together over the years
- has reached production quality (e.g. high efficiency)
- is unique in Germany
- lets communities profit from strong inter-site collaboration and inter-site usability
- is preparing for upcoming tasks

Thank you! Questions?

[Figure: Correlation of #Jobs and efficiency (per site); x-axis: #Jobs/month (log scale, 100 to 1,000,000); y-axis: efficiency/month (0 to 1.2); series: Ess, Frei, KA, Ma/Hd, Stutt, Tüb, Ulm]

[Figure: Half-year performance of bwgrid per site (H2 2010); y-axis 0% to 100%; sites: KA, Tueb, Ulm, Ma-Hei, Ess, Stu, Frei]

[Figure: Site Freiburg: CPUh, Sept 2010 to Feb 2011; y-axis 0 to 1,000,000 CPUh; stacked by origin: unknown, fremdvo, Ulm, Tuebingen, Stuttgart, MA/HD, Konstanz, Karlsruhe, Hohenheim, Esslingen, Freiburg, 000-LOKAL]

[Figure: Site Stuttgart: CPUh, Sept 2010 to Feb 2011; y-axis 0 to 2,500,000 CPUh; stacked by origin: unknown, fremdvo, Ulm, Tuebingen, MA/HD, Konstanz, Karlsruhe, Hohenheim, Freiburg, Esslingen, Stuttgart, 000-LOKAL]

[Figure: Site Ma/Hd: CPUh, Sept 2010 to Feb 2011; y-axis -100,000 to 1,700,000 CPUh; stacked by origin: unknown, fremdvo, Ulm, Tuebingen, Stuttgart, Konstanz, Karlsruhe, Hohenheim, Freiburg, Esslingen, MA/HD, 000-LOKAL]