>1
taatgtttgtgctggtTTTTGTGGCATCGGGCGAGAATagcgcgtggtgtgaaagactgtTTTTTTGATCGTTTTCACAAAAatggaagtccacagtcttgacag
>2
gacaaaaacgcgtaacAAAAGTGTCTATAATCACGGCAgaaaagtccacattgaTTATTTGCACGGCGTCACACTTtgctatgccatagcatttttatccataag
>3
acaaatcccaataacttaattattgggatttgttatatataactttataaattcctaaaattacacaaagttaatAACTGTGAGCATGGTCATATTTttatcaat
Split the file on newlines and look for a > character at the start of a line to determine the name. If each record's sequence data is on a single line (no line breaks inside the sequence), we can simply store that line as the sequence, like:
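// Verbatim string (@"...") so the backslash in the path is not treated as an escape.
using (var reader = new StreamReader(@"C:\myfile.fasta"))
{
    string line;
    // ReadLine returns null at end of file; blank lines are skipped, not treated as the end.
    while ((line = reader.ReadLine()) != null)
    {
        if (line.Length == 0) continue;
        if (line.StartsWith(">"))
            StoreProteinName(line);   // header line: record name
        else
            StoreSequence(line);      // sequence line for the current record
    }
}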
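If a sequence is wrapped over several lines, the per-line approach above would need to join those lines back together. The following is a minimal sketch of one way to do that, not the original answer's implementation: the class name FastaSketch and the method ReadFasta are hypothetical names introduced here for illustration, and the sketch assumes record names are unique (a repeated name overwrites the earlier record).

```csharp
using System;
using System.Collections.Generic;
using System.IO;
using System.Text;

public static class FastaSketch
{
    // Hypothetical helper: reads FASTA records from any TextReader and
    // returns a map of record name -> full sequence.
    public static Dictionary<string, string> ReadFasta(TextReader reader)
    {
        var records = new Dictionary<string, string>();
        string name = null;                 // name of the record being read
        var sequence = new StringBuilder(); // sequence lines joined so far
        string line;

        while ((line = reader.ReadLine()) != null)
        {
            line = line.Trim();
            if (line.Length == 0) continue; // ignore blank lines

            if (line[0] == '>')
            {
                // A new header closes the previous record, if there was one.
                if (name != null) records[name] = sequence.ToString();
                name = line.Substring(1).Trim();
                sequence.Clear();
            }
            else if (name != null)
            {
                // Sequence data; lines before the first header are ignored.
                sequence.Append(line);
            }
        }

        // Store the last record, which has no following header to close it.
        if (name != null) records[name] = sequence.ToString();
        return records;
    }
}
```

It could be called with the same file as above, for example: using (var reader = new StreamReader(@"C:\myfile.fasta")) { var records = FastaSketch.ReadFasta(reader); }. Taking a TextReader rather than a path keeps the sketch usable with a StringReader for small in-memory inputs.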
Reference:
http://stackoverflow.com/questions/3097051/best-way-to-read-a-fasta-file-in-c-sharp