Sarah C. Williams
Life Sciences Data Services Librarian
Apr 26, 2012
BioProject is a NCBI resource. Yesterday the OpenHelix blog posted - Video Tip of the Week: BioProject, it's where to start finding data. The video is 5 minutes.
From the blog post, "Huge projects like 1000 Genomes, ENCODE and others and reduced sequencing costs produce enough data that finding it is difficult. BioProject grew out of a need to better organize these large projects’ datasets and metadata and replaces NCBI’s Genome Project resource. These projects produce data which is then deposited in several repositories. BioProject 'provides an organizational framework to access metadata about research projects and the data from those projects which is deposited, or planned for deposition, into archival databases.'"
The blog post also highlights the article "BioProject and BioSample databases at NCBI: Facilitating capture and organization of metadata" published this year in Nucleic Acids Research. From that abstract, "As the volume and complexity of data sets archived at NCBI grow rapidly, so does the need to gather and organize the associated metadata. Although metadata has been collected for some archival databases, previously, there was no centralized approach at NCBI for collecting this information and using it across databases. The BioProject database was recently established to facilitate organization and classification of project data submitted to NCBI, EBI and DDBJ databases. It captures descriptive information about research projects that result in high volume submissions to archival databases, ties together related data across multiple archives and serves as a central portal by which to inform users of data availability."
See more information about the University Library's OpenHelix subscription.