This is a recording of Global CENTRA Webinar on 28 February 2018 (at 8-9pm EST). The webinar series is hosted by CENTRA Project (globalcentra.org, supported by US NSF ACI Award 1550126), headquartered at the **ACIS Lab, University of Florida.
Topic: Beyond Databases: Rethinking New Approaches to Virtual Collaboration and Data-sharing
Speaker: Dr. Tho Nguyen, Sr. Research Program Officer @Computer Science, University of Virginia
00:00 Introducing Dr. Tho Nguyen, by Dr. Fang-Pang Lin (CENTRA Steering Committee and National Center for High-performance Computing, Taiwan)
02:41 A quick work on when and how the project began
04:45 Presentation begins
06:28 The state of databases
11:50 General issues in data sharing
16:58 The Cyberinfrastructure (CI) Approach
17:17 The SCICADA platform: System Overview
25:27 IAM -Identity and Access Management
28:49 Upload/search SDD
(33:07 ~ 33:52 Suggest to skip; minor interruption for a technical tip)
33:53 Globally Federated File System (GFFS)
37:28 GFFS core concepts
44:01 Some things to consider for the wrapper
51:10 Project status
53:00 Next steps
55:20 End of presentation
55:30 Q & A begins (reference: genesis2.virginia.edu/wiki/Main/HomePage)
1:01:58 End of the webinar
Abstract:
Scientists have always recognized the importance of sharing data to maximize research impact and establish new collaborations. However, past efforts to develop public data repositories have fallen short of fulfilling their intended role due to not being able to address users’ needs. Toward enabling researchers to share data and collaborate on data analytics effectively we propose SCICADA, a Secured Cyber-Infrastructure for Collaborations and Advancing Data Accessibility. SCICADA is developed based on the recognition that: (1) Data owners are prohibited by the lack of control over their dataset once it’s openly available through a public repository (e.g., no accountability for attribution, access control, and misuse); and (2) Researchers using data from public repositories are challenged by the lack of information on data provenance, data quality, and support from the primary data collectors. SCICADA is a secured peer-to-peer collaboration and data-sharing platform that comprises an index of simplified data descriptors to enable elastic search and a data-collaboration utility called Globally Federated File System (GFFS, an XSEDE product). Through these services, SCICADA is portable and capable of supporting different modes of data collaborations – well beyond what is possible with public repositories. In this webinar, we describe the effort of integrating SCICADA’s core components (identity management, elastic search, and GFFS). We also discuss key issues such as security, performance, and scalability. If time permits, we may also discuss our strategy for deployment and community building.
**Advanced Computing and Information Systems Laboratory (ACIS Lab): acis.ufl.edu/
Follow us on Facebook: facebook.com/GlobalCENTRA and facebook.com/acis.lab
Twitter @GlobalCENTRA