The CropStore Curation Pipeline
The CropStoreDB Curation Pipeline supports the systematic management of data relating to crop plants. Data arrive from multiple sources and are typically very heterogeneous in quality and completeness. It is therefore important that any data curator triage the data and formalise the progressive stages in moving raw data towards entry into the CropStore relational schema. This is a non-trivial task, and one that is typically underestimated and undervalued in research systems.
Suggested folder structure
We have found the following local folder structure helps in managing the workflow from raw to database-loaded data.
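As a rough illustration only, a staged layout of this kind could be created with a few lines of Python. The stage names below (`01_raw`, `02_triaged`, `03_templated`, `04_loaded`) and the function `create_stage_folders` are hypothetical placeholders, not the folder names used by the CropStore authors; substitute whatever names your own workflow uses.

```python
from pathlib import Path

# Hypothetical stage names, chosen only for illustration.
# They are NOT the folder names recommended by CropStore.
STAGES = ["01_raw", "02_triaged", "03_templated", "04_loaded"]


def create_stage_folders(root, stages=STAGES):
    """Create one sub-folder per curation stage under `root`.

    Existing folders are left untouched, so the function is safe to re-run.
    Returns the list of stage folder paths in workflow order.
    """
    root = Path(root)
    created = []
    for name in stages:
        folder = root / name
        folder.mkdir(parents=True, exist_ok=True)
        created.append(folder)
    return created
```

Numbering the folders keeps them sorted in workflow order in a file browser, so a dataset's position in the folder tree reflects how far it has progressed from raw to database-loaded.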
A key step in formalising data entry into the database, and in training data providers, is completing the Input Template Workbooks.
For a more sophisticated online data entry system, see the Brassica Information Portal.