Scientific Data Management on AWS with Open Source Quilt Data Packages

via aws.amazon.com => original post link

End-to-end scientific data management on the cloud is an area of increasing interest, as life sciences companies aim to increase the utility of their data. Research, development, and manufacturing teams need a holistic set of controls for the scientific data lifecycle. These controls include data integrity, metadata tracking, lineage, permissions, and security, while simultaneously allowing for the increase in agility and performance benefits of the cloud.

In this blog, we explore the concept of data packages for managing scientific data, both for wet-lab and computational data. Data packages are logical collections that can include any number of objects, metadata annotations, charts, and explanatory documentation in reusable and linkable units. We show how packaging data can preserve data integrity, lineage, and metadata without relying on complex naming conventions and folder hierarchies. Because data packages are linkable, they serve as points of integration and data exchange between applications, further increasing agility.