A Comprehensive Campus-based Approach to Address the Opportunities and Challenges Posed by Data Intensive Research
Event Type
Time: Tuesday, July 30, 6:30pm - 8:30pm
Location: Crystal Foyer and Crystal B
Description: In 2007, Jim Gray distinguished data intensive science as a fourth paradigm of scientific exploration. This declaration was made with a sense of both optimism and caution: optimism in anticipation of the novel breakthroughs and insights that these new sources and volumes of data would likely bring, and caution because, as Gray described at the time, “the tools for capturing data both at the mega-scale and at the milli-scale are just dreadful.” Over a decade later, we can confirm that the enthusiasm and apprehension expressed by Gray and others were warranted. Not only have the vast and diverse quantities of new data expanded the horizons of scientific discovery, but these data have also challenged our ability to sustain research cyberinfrastructure at universities in new and unexpected ways. These challenges are not strictly technical; they also involve social, cultural, legal, and financial components that can impede, and sometimes completely inhibit, our ability to conduct meaningful research. In this paper we describe a comprehensive framework used by the University of Minnesota (UMN) to better address the current and emerging challenges and opportunities brought by data intensive research, along with the process that our university used to develop this framework. Elements of the framework and of the process used to develop it could be applied at other research institutions. While approaches may differ slightly, addressing these challenges within our respective universities is critical, and perhaps a prerequisite to building and sustaining partnerships among providers of advanced research computing.