Function symcurve::fasta::split_seq_by_n

source ·
pub fn split_seq_by_n(record: Record) -> Vec<RecordPiece>
Expand description

Given a record, split the sequence by runs of Ns.

Returns a vector of records, each with a sequence that does not contain any Ns. The description of each record is set to the start-end position of the sequence, the positions being 1-based.

Input:

>chr42
ATGCATGC
NNNNATGC
A

Output:

>chr42 1-8
ATGCATGC
>chr42 13-17
ATGCA