The Virtual Organizations (VO) Group coordinates and supports the portfolio of the “at-large” Science community VOs in OSG except for the three major stakeholders (ATLAS, CMS, and LIGO) which are directly supported by the OSG Executive Team.
At various times through the year, science communities were provided assistance in planning their use of OSG. Direct input was gathered from nearly 20 at-large VOs and reported to the OSG Council on behalf of ALICE, CDF, CIGI, CompBioGrid, D0, DES, DOSAR, Fermilab VO, GEANT4, GPN, GRASE, GROW, GUGrid, IceCube, JDEM, Mariachi, NanoHUB, NYSGrid, and SBGrid. These input provided a roadmap of intended use to enable OSG to better support the goals of the science stakeholders; these roadmaps covered: scope of use; VO mission; average and peak grid utilization quantifiers; resource provisioning; and plans, needs, milestones.
From time to time a VO may need special assistance to effectively use OSG. In such cases, OSG management establishes ad hoc task forces coordinated by the VO Group to provide support for the science community. In the last year, joint task forces were organized with ALICE, D0, Geant4, NanoHUB, and SBGrid. With mutual sharing of expertise between OSG and the VOs, each task force addressed specialized matters to enable measurably higher productivity for each VO. Care was taken to try to maximize long-term residual impact and sustainability, beyond the near term goals of each effort.
We continued our efforts to strengthen the effective use of OSG by VOs. D0 increased to 60-80% efficiency at 80-120000 hours/day and this contributed to new levels of D0 Montecarlo production, reaching a new peak of 13 million events per week; and in effect, D0 enhanced its harnessing capacity on OSG, transforming it into ‘a publication per week.’ CDF undertook readiness exercises to prepare for Linux 5 and Kerberos CA transitions. The Fermilab VO with its wide array of more than 12 individual sub-VOs, continued efficient operations; and refocused on OSG-TeraGrid gateway, proving its concept at limited scale with D0 as a cross-grid consumer.
We conducted a range of activities to jump-start VOs that are new users of OSG or looking to increase their leverage of OSG. ALICE was enabled to start operations using its scale of resources at NERSC and LLNL. NanoHUB was enabled to pull-up from nominal utilization to 500 hours/day with multiple streams of nanotechnology applications. The SBGrid resource infrastructure was enabled and its molecular biology science applications started up across OSG. The GEANT4’s EGEE-based biannual regression-testing production runs were expanded onto the OSG, assisting in its toolkit’s quality releases for BaBar, MINOS, ATLAS, CMS, and LHCb. And we continue to actively address plans for scaling up operations of additional communities including IceCube, CompBioGrid, and GPN.
Pre-release validation by Science stakeholders was completed for OSG Release 1.2 in partnership with the OSG Integration team. After validation on ITB 1.1, official green flags for OSG 1.2 were given by 13 VOs including ATLAS, CMS, LIGO, and STAR. As part of ITB 0.9.2 earlier, a smaller-scale cycle was organized for the incremental Release1.0.1. By working closely with VOs in planning of schedules for timely release of software, we achieved validation within 4 weeks. Balancing schedules with thoroughness of testing remains a tradeoff and we will continue to address and improve these processes in the coming year.
Weekly VO Forum teleconferences were organized for regular one-on-one interaction between representatives of at-large VOs and staff members of OSG. We increased emphasis on assessing viability, and merger or closure, of a VO if this can better serve its own community’s success-roadmap and value proposition. As a result: two VOs, GUGrid and Mariachi, were closed due to lack of growth in activity, based on their own self-assessment; two new registrations were encouraged to merge into existing VOs, Minerva into Fermilab VO, and CALICE into ILC; four new registrations were vetted and encouraged to form official VOs - GlueX, GROW, JDEM, and NEBioGrid.
Efficient and effective use of the OSG continues to be a driving force for this team. Building and maintaining productivity with sustained growth is important for success of the diverse set of Science communities in OSG. In the coming year, the VO Group will continue to work toward this objective.
3.6Engagement
A priority of Open Science Grid is helping science communities benefit from the OSG infrastructure by working closely with these communities over periods of several months. The Engagement activity brings the power of the OSG infrastructure to scientists and educators beyond high-energy physics and uses the experiences gained from working with new communities to drive requirements for the natural evolution of OSG. To meet these goals, engagement helps in: providing an understanding of how to use the distributed infrastructure; adapting applications to run effectively on OSG sites; engaging the deployment of community owned distributed infrastructures; working with the OSG Facility to ensure the needs of the new community are met; providing common tools and services in support of the engagement communities; and working directly with and in support of the new end users with the goal to have them transition to be full contributing members of the OSG. These goals and methods continue to evolve from the foundation created in previous years.
Use of the OSG by the Engagement supported scientists continues to grow. Figure shows the diversity and level of activity among Engagement users for the previous year, and Figure shows the distribution by OSG facility of the more than 6 million CPU hours consumed by Engagement users, which is more than double the previous year’s activity.
Figure : Engage user activity for one year
Figure : CPU hours by facility for Engage Users
The Engagement team continues to work with active production users including:
-
Steffen Bass (+3 staff), theoretical physics, Duke University;
-
Anton Betten, mathematics, Colorado State;
-
Jinbo Xu (+1 staff), protein structure prediction, Toyota Technological Institute;
-
Eun Jung Choi, biochemistry, UNC-CH;
-
Vishagan Ratnaswamy, mechanical engineering, New Jersey Institute of Technology;
-
Abishek Patrap (+2 staff), systems biology, Institute for Systems Biology;
-
Damian Alvarez Paggi, molecular simulation, Universidad de Buenos Aires;
-
Eric Delwart, metagenomics, UCSF;
-
Tai Boon Tan, molecular simulation, SUNY Buffalo.
New engagements this year have led to work and collaboration with more than 10 research teams, including:
-
Peter Rose and Andreas Prlic, RCSB PDB at UCSD;
-
Bo Xin and Ian Shipsey, LSST, Purdue;
-
Ridwan Sakidja, materials science, UW Madison;
-
Dan Katz and student Allan Espinosa, computer science, U.Chicago;
-
Lene Jung Kjaer, Cooperative Wildlife Research Laboratory, SIU;
-
Zi-Wei Lin, Physics, Eastern Carolina University;
-
Jarrett Byrnes, UCSB - Marine Science Institute;
In addition, the Engagement effort has helped LIGO evolve their site selection methods for running their application on OSG. And the Engagement effort has also led to meaningful and growing use of OSG by the RENCI Science Portal, a separately funded TeraGrid Science Gateway project.
For the coming year, we have seeded relationships a number of research teams including a number of 2009 awardees of the US/European Digging Into Data Challenge, the University of Maryland Institute for Genome Sciences, the National Biomedical Computational Resource project (NBCR), and significant integration and enhancements support the work of Dr. Steffen Bass (Duke Physics) and his newly awarded 4 year NSF CDI award.
Share with your friends: |