The Monterey Data Conference explores the role of foundational AI models in the future of science

October 22, 2024 – In science, data is essential. Scientific methods rely on accurate, well-documented data to draw conclusions, test hypotheses, build models, and explain the natural world. Data is not just a by-product of research; it is a critical component that ensures results are reproducible and reliable – and the integrity of the scientific process depends on how data is collected, managed and stored throughout its life cycle.

In late August, 180 experts and leaders from across the computing landscape gathered at the Monterey Data Conference (MDC) on the beautiful coast of California to discuss that lifecycle, highlighting and discussing the latest advances and challenges of continuous in the analysis of scientific data and informatics.

Founded by Lawrence Berkeley National Laboratory (Berkeley Lab) staff in 2019, the MDC is an opportunity for experts from industry, academia, and US Department of Energy (DOE) user laboratories and facilities to come together for learning and collaboration.

The theme of this year’s meeting was “Foundations”, exploring the ways in which recent computational advances such as foundational AI models, emerging hardware and integrated research infrastructure (IRI) will be fundamental to the future of intelligence discovery. data driven. In particular, the meeting addressed ways in which the HPC community can collaborate to develop and build the next generation of computing tools and infrastructure.

“MDC is an exceptional venue for fostering collaboration across disciplines and organizations, and brings together a diverse group of experts and early career researchers to enable meaningful discussions that push the boundaries of scientific discovery,” said Ana Kupresanin, Director of the Division of of Scientific Data at Berkeley. Lab, who presented at the conference. “This year’s focus on foundational models and infrastructure really highlights the key role that data plays in shaping the future of science.”

Sessions included presentations on a variety of interrelated topics: among others, AI and foundational models for science; next-generation data infrastructure, including DOE’s new High Performance Data Facility; disruptive technologies, including quantum computing and other possible future technologies.

In addition to sharing information through talks and presentations, this year’s conference featured a variety of interactive events, from panels where experts addressed questions from the audience, to a poster session for early career seekers, to just having a lot of time for informal networks. These events encourage collaboration across the board, says Kupresanin, but they also provide support and inclusion for the next generation:

“This year’s poster session for PhDs and early career researchers shows that MDC is nurturing the next generation of data scientists with opportunities to present their work and engage with more senior colleagues,” she said. “This conference provides opportunities for technical exchange as well as mentoring.”

Ultimately, the MDC presents a unique opportunity for data scientists at all levels to gather and learn from each other in service of a larger mission, said NERSC Division Director Sudip Dosanjh:

“We founded the MDC so scientists, computer scientists, and technology providers could come together to discuss advances in data analysis, AI, and complex workflows for DOE open science; we know that forming interdisciplinary teams is critical to attacking these problems at scale. These topics are critical to DOE’s mission science, and there are several DOE initiatives underway in this area,” Dosanjh said. “I’ve been pleased to see tremendous interest in MDC from the community – this year we were at maximum capacity. .We have had a great team working at MDC and I want to thank everyone who helped.”

About NERSC and Berkeley Lab

The National Energy Research Scientific Computing Center (NERSC) is a US Department of Energy Office of Science User Facility that serves as the primary high-performance computing center for Office of Science-sponsored scientific research. Located at Lawrence Berkeley National Laboratory, NERSC serves nearly 10,000 scientists at national laboratories and universities researching a wide range of problems in climate, fusion energy, materials science, physics, chemistry, computational biology, and other disciplines. Berkeley Lab is a DOE national laboratory located in Berkeley, California. It conducts unclassified scientific research and is managed by the University of California for the US Department of Energy.


Source: Elizabeth Ball, NERSC

Leave a Comment