Repeat regions are important sequences within a chromosome. They play roles in regulation, genome structure, crossovers and genetic plasticity. Repeats are also usually difficult to assemble from short reads. It is therefore necessary either to find these repeats in a genomic sequence or to detect them de novo from short-read data.
Custom repeat databases were created for the selected crops, using either the VLPB genome annotation pipeline or the partner's proprietary genome annotation pipeline. In the latter case, specifications of the repeat database were provided to the executing partner. After this project, VLPB partners were able to build custom repeat databases for their crops of interest, provided the required input data were available.