FASTA (Pearson), NBRF/PIR, EMBL/Swiss-Prot, GDE, CLUSTAL, GCG/MSF, GCG9/RSF.
The program tries to "guess" which format is being used and whether the sequences are nucleic acid (DNA/RNA) or amino acid (protein). The format is recognised from the first characters in the file. This is crude, but it works most of the time, and it is difficult to do reliably any other way.
Format | First non-blank word or character in the file |
FASTA | > |
NBRF | >P1; or >D1; |
EMBL/SWISS | ID |
GDE protein | % |
GDE nucleotide | # |
CLUSTAL | CLUSTAL (blocked multiple alignments) |
GCG/MSF | PILEUP or !!AA_MULTIPLE_ALIGNMENT or !!NA_MULTIPLE_ALIGNMENT, or MSF on the first line and '..' at the end of the line |
GCG9/RSF | !!RICH_SEQUENCE |
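The lookup in the table above can be sketched in a few lines of Python. This is only an illustration of the idea, not the program's actual code: the function name `guess_format`, the case-sensitive matching, and the order of the tests are assumptions made here.

```python
def guess_format(text):
    """Guess a sequence file format from the first non-blank line.

    Hypothetical helper: it mirrors the table above, nothing more.
    Returns a format name, or None if nothing matches.
    """
    lines = [ln.strip() for ln in text.splitlines() if ln.strip()]
    if not lines:
        return None
    first = lines[0]
    word = first.split()[0]

    # NBRF must be tested before FASTA because both start with '>'.
    if first.startswith(">P1;") or first.startswith(">D1;"):
        return "NBRF"
    if first.startswith(">"):
        return "FASTA"
    if word == "ID":
        return "EMBL/SWISS"
    if first.startswith("%"):
        return "GDE protein"
    if first.startswith("#"):
        return "GDE nucleotide"
    if word == "CLUSTAL":
        return "CLUSTAL"
    if word == "!!RICH_SEQUENCE":
        return "GCG9/RSF"
    if word in ("PILEUP", "!!AA_MULTIPLE_ALIGNMENT", "!!NA_MULTIPLE_ALIGNMENT"):
        return "GCG/MSF"
    # Older MSF files: 'MSF' somewhere on the first line, '..' at its end.
    if "MSF" in first and first.endswith(".."):
        return "GCG/MSF"
    return None


if __name__ == "__main__":
    print(guess_format(">AVE-ACPs\nTSATTVLSAR"))
    print(guess_format(">P1; AVE-ACPs\nAVES1\nTSATTVLSAR*"))
```

Because the decision rests on a handful of leading characters, a file with stray text before the first record will not be recognised, which is the crudeness the text above admits to.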
Example
FASTA FORMAT:
>AVE-ACPs
TSATTVLSARLTALSPTQQQSLLLDLVRAHTMAVLNDDGNERTASDAGPS
ASFAHLGFDSVMGVELRNRLSKATGLRLPVTLIFDHTTPAAVAARLRTAA
LGHLDED
>AVE-ACP1
PTPPAELHKTLAHQTSADQRAALLELVRDHVAAVLRHADPKAIAPDQSFR
ALGFDSLTAVEFRNLLIKATGLRLPVSLVFDHPTPAKLAVHLQNQLRGTA
AES
>AVE-ACP2
ADNGAQLHARLAGQTHEQQHTTLLALVRSHIATVLGHTTPDTIPPDRAFR
DLGFDSLTAVELRNRLSRTTGLRLPTTLAFDHPNPTTLTHHLHTQLQPQP
DNA
>AVE-ACP3
TTPSTPLRDVLVGKSPQERDEELLRLVRTHAAAVLGHATPEVIVPNKAFK
ELGFDSLAAIQLRNRLLADVDLPLPATLIFDYPTPMALCQFLRAAIVGAD
TGT
>AVE-ACP4
QTESTNLRQLLMGRSRSEQEEELLSLVRIHSAAVLGRDDSEAIPPGRLFR
DLGFDSLAAVELRNHLAAQTELALPTTLVFDYPSPTKLAQFLLSEIAEFQ
PDN
>AVE-ACP5
QPIATSLRERLARLTSSKQNQVLLGLIRTGICTVLGLRNPEGIEDQRAFR
DLGFDSLTSAQFSKELAKETGLPLPPSLVFDYPTPQECAAHLRTQLVDLD
DEE
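As a rough illustration of how records like the FASTA example above can be read and then classified as nucleic acid or protein, here is a minimal Python sketch. The function names `parse_fasta` and `looks_like_nucleotide` are hypothetical, and the rule used for the guess (at least 85% of residues drawn from A, C, G, T, U or N) is an assumption chosen for illustration; this text does not say how the program itself decides.

```python
def parse_fasta(text):
    """Return a list of (name, sequence) pairs from FASTA-formatted text."""
    records = []
    name, chunks = None, []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # A new header closes the previous record, if there was one.
            if name is not None:
                records.append((name, "".join(chunks)))
            name, chunks = line[1:].strip(), []
        elif name is not None:
            # Drop internal whitespace so wrapped or space-grouped lines join cleanly.
            chunks.append("".join(line.split()))
    if name is not None:
        records.append((name, "".join(chunks)))
    return records


def looks_like_nucleotide(seq, threshold=0.85):
    """Heuristic guess: True if most residues are A, C, G, T, U or N.

    The alphabet and the 0.85 threshold are illustrative assumptions.
    """
    residues = [c for c in seq.upper() if c.isalpha()]
    if not residues:
        return False
    hits = sum(c in "ACGTUN" for c in residues)
    return hits / len(residues) >= threshold


if __name__ == "__main__":
    sample = (
        ">AVE-ACP1\n"
        "PTPPAELHKTLAHQTSADQRAALLELVRDHVAAVLRHADPKAIAPDQSFR\n"
        "ALGFDSLTAVEFRNLLIKATGLRLPVSLVFDHPTPAKLAVHLQNQLRGTA\n"
        "AES\n"
        ">S.avermitilis\n"
        "GCAAGTCGAACGATGAAGCCCTTCGGGGTGGATTAGTGGCGAACGGGTGA\n"
    )
    for name, seq in parse_fasta(sample):
        kind = "nucleic acid" if looks_like_nucleotide(seq) else "protein"
        print(name, len(seq), kind)
```

A composition test of this kind can misjudge short or unusual sequences, so treat the result as a guess, in the same spirit as the format detection.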
NBRF/PIR FORMAT:
>P1; AVE-ACPs
AVES1
TSATTVLSAR LTALSPTQQQ SLLLDLVRAH TMAVLNDDGN ERTASDAGPS
ASFAHLGFDS VMGVELRNRL SKATGLRLPV TLIFDHTTPA AVAARLRTAA
LGHLDED*
>P1; AVE-ACP1
AVES1
PTPPAELHKT LAHQTSADQR AALLELVRDH VAAVLRHADP KAIAPDQSFR
ALGFDSLTAV EFRNLLIKAT GLRLPVSLVF DHPTPAKLAV HLQNQLRGTA
AES*
>P1; AVE-ACP2
AVES1
ADNGAQLHAR LAGQTHEQQH TTLLALVRSH IATVLGHTTP DTIPPDRAFR
DLGFDSLTAV ELRNRLSRTT GLRLPTTLAF DHPNPTTLTH HLHTQLQPQP
DNA*
>P1; AVE-ACP3
AVES2
TTPSTPLRDV LVGKSPQERD EELLRLVRTH AAAVLGHATP EVIVPNKAFK
ELGFDSLAAI QLRNRLLADV DLPLPATLIF DYPTPMALCQ FLRAAIVGAD
TGT*
>P1; AVE-ACP4
AVES2
QTESTNLRQL LMGRSRSEQE EELLSLVRIH SAAVLGRDDS EAIPPGRLFR
DLGFDSLAAV ELRNHLAAQT ELALPTTLVF DYPSPTKLAQ FLLSEIAEFQ
PDN*
>P1; AVE-ACP5
AVES2
QPIATSLRER LARLTSSKQN QVLLGLIRTG ICTVLGLRNP EGIEDQRAFR
DLGFDSLTSA QFSKELAKET GLPLPPSLVF DYPTPQECAA HLRTQLVDLD
DEE*
DNA sample (16S rDNA)
>S.avermitilis
GCAAGTCGAACGATGAAGCCCTTCGGGGTGGATTAGTGGCGAACGGGTGA
GTAACACGTGGGCAATCTGCCCTGCACTCTGGGACAAGCCCTGGAAACGG
GGTCTAATACCGGATAATACTCTCGCAGGCATCTGTGAGGGTTAAAAGCT
CCGGCGGTGCAGGATGAGCCCGCGGCCTATCAGCTTGTTGGTGAGGTAGT
GGCTCACCAAGGCGACGACGGGTAGCCGGCCTGAGAGGGCGACCGGCCAC
ACTGGGACTGAGACACGGCCCAGACTCCTACGGGAGGCAGCAGTGGGGAA
TATTGCACAATGGGCGAAAGCCTGATGCAGCGACGCCGCGTGAGGGATGA
CGGCCTTCGGGTTGTAAACCTCTTTCAGCAGGGAAGAAGCGAAAGTGACG
GTACCTGCAGAAGAAGCGCCGGCTAACTACGTGCCAGCAGCCGCGGTAAT
ACGTAGGGCGCAAGCGTTGTCCGGAATTATTGGGCGTAAAGAGCTCGTAG
GCGGCTTGTCACGTCGGGTGTGAAAGCCCGGGGCTTAACCCCGGGTCTGC
ATTCGATACGGGCTAGCTAGAGTGTGGTAGGGGAGATCGGAATTCCTGGT
GTAGCGGTGAAATGCGCAGATATCAGGAGGAACACCGGTGGCGAAGGCGG
ATCTCTGGGCCATTACTGACGCTGAGGAGCGAAAGCGTGGGGAGCGAACA
GGATTAGATACCCTGGTAGTCCACGCCGTAAACGGTGGGAACTAGGTGTT
GGCGACATTCCACGTCGTCGGTGCCGCAGCTAACGCATTAAGTTCCCCGC
CTGGGGAGTACGGCCGCAAGGCTAAAACTCAAAGGAATTGACGGGGGCCC
GCACAAGCAGCGGAGCATGTGGCTTAATTCGACGCAACGCGAAGAACCTT
ACCAAGGCTTGACATACACCGGAAAGCATTAGAGATAGTGCCCCCCTTGT
GGTCGGTGTACAGGTGGTGCATGGCTGTCGTCAGCTCGTGTCGTGAGATG
TTGGGTTAAGTCCCGCAACGAGCGCAACCCTTGTTCTGTGTTGCCAGCAT
GCCCTTCGGGGTGATGGGGACTCACAGGAGACCGCCGGGGTCAACTCGGA
GGAAGGTGGGGACGACGTCAAGTCATCATGCCCCTTATGTCTTGGGCTGC ACACGTGCTACAATGGCCGATACAATGAGCTGCGATACCGCAAGGTGGAG CGAATCTCAAAAAGTCGGTCTCAGTTCGGATTGGGGTCTGCAACTCGACC CCATGAAGTTGGAGTTGCTAGTAATCGCAGATCAGCATTGCTGCGGTGAA TACGTTCCCGGGCCTTGTACACACCGCCCGTCACGTCACGAAAGTCGGTA ACACCC >S.ambofaciens GCAAGTCGAACGATGAACCACTTCGGTGGGGATTAGTGGCGAACGGGTGA GTAACACGTGGGCAATCTGCCCTGCACTCTGGGACAAGCCCTGGAAACGG GGTCTAATACCGGATACTGATCCGCTTGGGCATCCAGGCGGTTCGAAAGC TCCGGCGGTGCAGGATGAGCCCGCGGCCTATCAGCTTGTTGGTGAGGTAG TGGCTCACCAAGGCGACGACGGGTAGCCGGCCTGAGAGGGCGACCGGCCA CACTGGGACTGAGACACGGCCCAGACTCCTACGGGAGGCAGCAGTGGGGA ATATTGCACAATGGGCGAAAGCCTGATGCAGCGACGCCGCGTGAGGGATG ACGGCCTTCGGGTTGTAAACCTCTTTCAGCAGGGAAGAAGCGAAAGTGAC GGTACCTGCAGAAGAAGCGCCGGCTAACTACGTGCCAGCAGCCGCGGTAA TACGTAGGGCGCAAGCGTTGTCCGGAATTATTGGGCGTAAAGAGCTCGTA GGCGGCTTGTCACGTCGGTTGTGAAAGCCCGGGGCTTAACCCCGGGTCTG CAGTCGATACGGGCAGGCTAGAGTTCGGTAGGGGAGATCGGAATTCCTGG TGTAGCGGTGAAATGCGCAGATATCAGGAGGAACACCGGTGGCGAAGGCG GATCTCTGGGCCGATACTGACGCTGAGGAGCGAAAGCGTGGGGAGCGAAC AGGATTAGATACCCTGGTAGTCCACGCCGTAAACGGTGGGCACTAGGTGT GGGCAACATTCCACGTTGTCCGTGCCGCAGCTAACGCATTAAGTGCCCCG CCTGGGGAGTACGGCCGCAAGGCTAAAACTCAAAGGAATTGACGGGGGCC CGCACAAGCGGCGGAGCATGTGGCTTAATTCGACGCAACGCGAAGAACCT TACCAAGGCTTGACATACACCGGAAAGCATTAGAGATAGTGCCCCCCTTG TGGTCGGTGTACAGGTGGTGCATGGCTGTCGTCAGCTCGTGTCGTGAGAT GTTGGGTTAAGTCCCGCAACGAGCGCAACCCTTGTCCCGTGTTGCCAGCA AGCCCTTCGGGGTGTTGGGGACTCACGGGAGACCGCCGGGGTCAACTCGG AGGAAGGTGGGGACGACGTCAAGTCATCATGCCCCTTATGTCTTGGGCTG CACACGTGCTACAATGGCCGGTACAATGAGCTGCGATACCGCGAGGTGGA GCGAATCTCAAAAAGCCGGTCTCAGTTCGGATTGGGGTCTGCAACTCGAC CCCATGAAGTCGGAGTCGCTAGTAATCGCAGATCAGCATTGCTGCGGTGA ATACGTTCCCGGGCCTTGTACACACCGCCCGTCACGTCACGAAAGTCGGT AACACCC >S.coelicolorA3_2 GCAAGTCGAACGATGAACCGCTTTCGGGCGGGGATTAGTGGCGAACGGGT GAGTAACACGTGGGCAATCTGCCCTGCACTCTGGGACAAGCCCTGGAAAC GGGGTCTAATACCGGATATGACTGTCCATCGCATGGTGGATGGTGTAAAG CTCCGGCGGTGCAGGATGAGCCCGCGGCCTATCAGCTTGTTGGTGAGGTA GTGGCTCACCAAGGCGACGACGGGTAGCCGGCCTGAGAGGGCGACCGGCC ACACTGGGACTGAGACACGGCCCAGACTCCTACGGGAGGCAGCAGTGGGG 
AATATTGCACAATGGGCGAAAGCCTGATGCAGCGACGCCGCGTGAGGGAT GACGGCCTTCGGGTTGTAAACCTCTTTCAGCAGGGAAGAAGCGAAAGTGA CGGTACCTGCAGAAGAAGCGCCGGCTAACTACGTGCCAGCAGCCGCGGTA ATACGTAGGGCGCAAGCGTTGTCCGGAATTATTGGGCGTAAAGAGCTCGT AGGCGGCTTGTCACGTCGGTTGTGAAAGCCCGGGGCTTAACCCCGGGTCT GCAGTCGATACGGGCAGGCTAGAGTTCGGTAGGGGAGATCGGAATTCCTG GTGTAGCGGTGAAATGCGCAGATATCAGGAGGAACACCGGTGGCGAAGGC GGATCTCTGGGCCGATACTGACGCTGAGGAGCGAAAGCGTGGGGAGCGAA CAGGATTAGATACCCTGGTAGTCCACGCCGTAAACGGTGGGCACTAGGTG TGGGCAACATTCCACGTTGTCCGTGCCGCAGCTAACGCATTAAGTGCCCC GCCTGGGGAGTACGGCCGCAAGGCTAAAACTCAAAGGAATTGACGGGGGC CCGCACAAGCGGCGGAGCATGTGGCTTAATTCGACGCAACGCGAAGAACC TTACCAAGGCTTGACATACACCGGAAACGTCTGGAGACAGGCGCCCCCTT GTGGTCGGTGTACAGGTGGTGCATGGCTGTCGTCAGCTCGTGTCGTGAGA TGTTGGGTTAAGTCCCGCAACGAGCGCAACCCTTGTCCCGTGTTGCCAGC AGGCCCTTGTGGTGCTGGGGACTCACGGGAGACCGCCGGGGTCAACTCGG AGGAAGGTGGGGACGACGTCAAGTCATCATGCCCCTTATGTCTTGGGCTG CACACGTGCTACAATGGCCGGTACAATGAGCTGCGATACCGNGAGGTGGA GCGAATCTCAAAAAGCCGGTCTCAGTTCGGATTGGGGTCTGCAACTCGAC CCCATGAAGTCGGAGTCGCTAGTAATCGCAGATCAGCATTGCTGCGGTGA ATACGTTCCCGGGCCTTGTACACACCGCCCGTCACGTCACGAAAGTCGGT AACACCC >S.felleus GCAAGTCGAACGATGAACCGCTTTCGGGCGGGGATTAGTGGCGAACGGGT GAGTAACACGTGGGCAATCTGCCCTGCACTCTGGGACAAGCCCTGGAAAC GGGGTCTAATACCGGATATGACTGTCCATCGCATGGTGGATGGTGTAAAG CTCCGGCGGTGCAGGATGAGCCCGCGGCCTATCAGCTTGTTGGTGAGGTA GTGGCTCACCAAGGCGACGACGGGTAGCCGGCCTGAGAGGGCGACCGGCC ACACTGGGACTGAGACACGGCCCAGACTCCTACGGGAGGCAGCAGTGGGG AATATTGCACAATGGGCGAAAGCCTGATGCAGCGACGCCGCGTGAGGGAT GACGGCCTTCGGGTTGTAAACCTCTTTCAGCAGGGAAGAAGCGAAAGTGA CGGTACCTGCAGAAGAAGCGCCGGCTAACTACGTGCCAGCAGCCGCGGTA ATACGTAGGGCGCAAGCGTTGTCCGGAATTATTGGGCGTAAAGAGCTCGT AGGCGGCTTGTCACGTCGGTTGTGAAAGCCCGGGGCTTAACCCCGGGTCT GCAGTCGATACGGGCAGGCTAGAGTTCGGTAGGGGAGATCGGAATTCCTG GTGTAGCGGTGAAATGCGCAGATATCAGGAGGAACACCGGTGGCGAAGGC GGATCTCTGGGCCGATACTGACGCTGAGGAGCGAAAGCGTGGGGAGCGAA CAGGATTAGATACCCTGGTAGTCCACGCCGTAAACGGTGGGCACTAGGTG TGGGCAACATTCCACGTTGTCCGTGCCGCAGCTAACGCATTAAGTGCCCC GCCTGGGGAGTACGGCCGCAAGGCTAAAACTCAAAGGAATTGACGGGGGC 
CCGCACAAGCGGCGGAGCATGTGGCTTAATTCGACGCAACGCGAAGAACC TTACCAAGGCTTGACATACACCGGAAACGTCTGGAGACAGGCGCCCCCTT GTGGTCGGTGTACAGGTGGTGCATGGCTGTCGTCAGCTCGTGTCGTGAGA TGTTGGGTTAAGTCCCGCAACGAGCGCAACCCTTGTCCCGTGTTGCCAGC AGGCCCTTGTGGTGCTGGGGACTCACGGGAGACCGCCGGGGTCAACTCGG AGGAAGGTGGGGACGACGTCAAGTCATCATGCCCCTTATGTCTTGGGCTG CACACGTGCTACAATGGCCGGTACAATGAGCTGCGATACCGNGAGGTGGA GCGAATCTCAAAAAGCCGGTCTCAGTTCGGATTGGGGTCTGCAACTCGAC CCCATGAAGTCGGAGTCGCTAGTAATCGCAGATCAGCATTGCTGCGGTGA ATACGTTCCCGGGCCTTGTACACACCGCCCGTCACGTCACGAAAGTCGGT AACACCC >S.griseus GCAAGTCGAACGATGAAGCCTTTCGGGGTGGATTAGTGGCGAACGGGTGA GTAACACGTGGGCAATCTGCCCTTCACTCTGGGACAAGCCCTGGAAACGG GGTCTAATACCGGATAACACTCTGTCCCGCATGGGACGGGGTTAAAAGCT CCGGCGGTGAAGGATGAGCCCGCGGCCTATCAGCTTGTTGGTGGGGTAAT GGCCTACCAAGGCGACGACGGGTAGCCGGCCTGAGAGGGCGACCGGCCAC ACTGGGACTGAGACACGGCCCAGACTCCTACGGGAGGCAGCAGTGGGGAA TATTGCACAATGGGCGAAAGCCTGATGCAGCGACGCCGCGTGAGGGATGA CGGCCTTCGGGTTGTAAACCTCTTTCAGCAGGGAAGAAGCGAGAGTGACG GTACCTGCAGAAGAAGCGCCGGCTAACTACGTGCCAGCAGCCGCGGTAAT ACGTAGGGCGCAAGCGTTGTCCGGAATTATTGGGCGTAAAGAGCTCGTAG GCGGCTTGTCACGTCGGATGTGAAAGCCCGGGGCTTAACCCCGGGTCTGC ATTCGATACGGGCTAGCTAGAGTGTGGTAGGGGAGATCGGAATTCCTGGT GTAGCGGTGAAATGCGCAGATATCAGGAGGAACACCGGTGGCGAAGGCGG ATCTCTGGGCCATTACTGACGCTGAGGAGCGAAAGCGTGGGGAGCGAACA GGATTAGATACCCTGGTAGTCCACGCCGTAAACGTTGGGAACTAGGTGTT GGCGACATTCCACGTCGTCGGTGCCGCAGCTAACGCATTAAGTTCCCCGC CTGGGGAGTACGGCCGCAAGGCTAAAACTCAAAGGAATTGACGGGGGCCC GCACAAGCAGCGGAGCATGTGGCTTAATTCGACGCAACGCGAAGAACCTT ACCAAGGCTTGACATATACCGGAAAGCATCAGAGATGGTGCCCCCCTTGT GGTCGGTATACAGGTGGTGCATGGCTGTCGTCAGCTCGTGTCGTGAGATG TTGGGTTAAGTCCCGCAACGAGCGCAACCCTTGTTCTGTGTTGCCAGCAT GCCTTCGGGGTGATGGGGACTCACAGGAGACTGCCGGGGTCAACTCGGAG GAAGGTGGGGACGACGTCAAGTCATCATGCCCCTTATGTCTTGGGCTGCA CACGTGCTACAATGGCCGGTACAATGAGCTGCGATGCGCGAGGCGGAGCG AATCTCAAAAAGCCGGTCTCAGTTCGGATTGGGGTCTGCAACTCGACCCC ATGAAGTCGGAGTTGCTAGTAATCGCAGATCAGCATTGCTGCGGTGAATA CGTTCCCGGGCCTTGTACACACCGCCCGTCACGTCACGAAAGTCGGTAAC ACCC >S.hygroscopicus GCAAGTCGAACGATGAACCACTTCGGTGGGGATTAGTGGCGAACGGGTGA 
GTAACACGTGGGCAATCTGCCCTTCACTCTGGGACAAGCCCTGGAAACGG GGTCTAATACCGGATACCACTCTCGCAGGCATCTGTGAGGGTTGAAAGCT CCGGCGGTGAAGGATGAGCCCGCGGCCTATCAGCTTGTTGGTGAGGTAAT GGCTCACCAAGGCGACGACGGGTAGCCGGCCTGAGAGGGCGACCGGCCAC ACTGGGACTGAGACACGGCCCAGACTCCTACGGGAGGCAGCAGTGGGGAA TATTGCACAATGGGCGAAAGCCTGATGCAGCGACGCCGCGTGAGGGATGA CGGCCTTCGGGTTGTAAACCTCTTTCAGCAGGGAAGAAGCGAAAGTGACG GTACCTGCAGAAGAAGCGCCGGCTAACTACGTGCCAGCAGCCGCGGTAAT ACGTAGGGCGCAAGCGTTGTCCGGAATTATTGGGCGTAAAGAGCTCGTAG GCGGCTTGTCACGTCGGGTGTGAAAGCCCGGGGCTTAACCCCGGGTCTGC ATTCGATACGGGCTAGCTAGAGTGTGGTAGGGGAGATCGGAATTCCTGGT GTAGCGGTGAAATGCGCAGATATCAGGAGGAACACCGGTGGCGAAGGCGG ATCTCTGGGCCATTACTGACGCTGAGGAGCGAAAGCGTGGGGAGCGAACA GGATTAGATACCCTGGTAGTCCACGCCGTAAACGGTGGGAACTAGGTGTT GGCGACATTCCACGTCGTCGGTGCCGCAGCTAACGCATTAAGTTCCCCGC CTGGGGAGTACGGCCGCAAGGCTAAAACTCAAAGGAATTGACGGGGGCCC GCACAAGCAGCGGAGCATGTGGCTTAATTCGACGCAACGCGAAGAACCTT ACCAAGGCTTGACATACACCGGAAAACCCTGGAGACAGGGTCCCCCTTGT GGTCGGTGTACAGGTGGTGCATGGCTGTCGTCAGCTCGTGTCGTGAGATG TTGGGTTAAGTCCCGCAACGAGCGCAACCCTTGTCCTGTGTTGCCAGCAT GCCTTCGGGGTGATGGGGACTCACAGGAGACCGCCGGGGTCAACTCGGAG GAAGGTGGGGACGACGTCAAGTCATCATGCCCCTTATGTCTTGGGCTGCA CACGTGCTACAATGGCCGGTACAATGAGCTGCGATACCGTGAGGTGGAGC GAATCTCAAAAAGCCGGTCTCAGTTCGGATTGGGGTCTGCAACTCGACCC CATGAAGTCGGAGTTGCTAGTAATCGCAGATCAGCATTGCTGCGGTGAAT ACGTTCCCGGGCCTTGTACACACCGCCCGTCACGTCACGAAAGTTGGTAA CACCC >S.lavendulae GCAANTCGAACGATGAAGCCCTTCGGGGTGGATTAGTGGCGAACGGGTGA GTAACACGTGGGCAATCTGCCCTTCACTCTGGGACAAGCCCTGGAAACGG GGTCTAATACCGGATAATACTCCTGCCTGCATGGGCGGGGGTTAAAAGCT CCGGCGGTGAAGGATGAGCCCGCGGCCTATCAGCTTGTTGGTGGGGTAAT GGCCCACCAAGGCGACGACGGGTAGCCGGCCTGAGAGGGCGACCGGCCAC ACTGGGACTGAGACACGGCCCAGACTCCTACGGGAGGCAGCAGTGGGGAA TATTGCACAATGGGCGAAAGCCTGATGCAGCGACGCCGCGTGAGGGATGA CGGCCTTCGGGTTGTAAACCTCTTTCAGCAGGGAAGAAGCGAAAGTGACG GTACCTGCAGAAGAAGCGCCGGCTAACTACGTGCCAGCAGCCGCGGTAAT ACGTAGGGCGCAAGCGTTGTCCGGAATTATTGGGCGTAAAGAGCTCGTAG GCGGCTTGTCACGTCGGATGTGAAAGCCCGAGGCTTAACCTCGGGTCTGC ATTCGATACGGGCTAGCTAGAGTGTGGTAGGGGAGATCGGAATTCCTGGT 
GTAGCGGTGAAATGCGCAGATATCAGGAGGAACACCGGTGGCGAAGGCGG ATCTCTGGGCCATTACTGACGCTGAGGAGCGAAAGCGTGGGGAGCGAACA GGATTAGATACCCTGGTAGTCCACGCCGTAAACGTTGGGAACTAGGTGTT GGCGACATTCCACGTCGTCGGTGCCGCAGCTAACGCATTAAGTTCCCCGC CTGGGGAGTACGGCCGCAAGGCTAAAACTCANAGGAATTGACGGGGGCCC GCACAAGCGGCGGAGCATGTGGCTTAATTCGACGCAACGNGAAGAACCTT ACCAAGGCTTGACATATACCGGAAAGNATTAGAGATAGTNCCCCCCTTGT GGTCGGTATACAGGTGGTGCATGGCTGTCGTCAGCTCGTGTCGTGAGATG TTGGGTTAAGTCCCGCAACGAGCGCAACCCTTGTCCTGTGTTGCCAGCAN GCCCTTCGGGGTGATGGGGACTCACAGGAGACCGCCGGGGTCAACTCGGA GGAAGGTGGGGACGACGTCAAGTCATCATGCCCCTTATGTCTTGGGCTGC ACACGTGCTACAATGGCCGGTACAATGAGCTGCGATACCGTGAGGTGGNG CGNNTCTCAAAAAGCCGGTNTCAGTTCGGATTGGGGTCTGCAACTCGACC CCATGAAGTCGGAGTCGCTAGTAATCGCAGATCAGCATTGCTGCGGTGAA TACGTTCCCGGGCCTTGTACACACCGCCCGTCACGTCACGAAAGTCGGTA ACACCC >S.nodosus GCAAGTCGAACGATGAAGCCCTTCGGGGTGGATTAGTGGCGAACGGGTGA GTAACACGTGGGCAATCTGCCCTGCACTCTGGGACAAGCCCTGGAAACGG GGTCTAATACCGGATACGAGCCGGGGAGGCATCTCCCTGGTTGGAAAGCT CCGGCGGTGCAGGATGAGCCCGCGCCCTATCAGCTTGTTGGTGAGGTAAC GGCTCACCAAGGCGACGACGGGTAGCCGGCCTGAGAGGGCGACCGGCCAC ACTGGGACTGAGACACGGCCCAGACTCCTACGGGAGGCAGCAGTGGGGAA TATTGCACAATGGGCGAAAGCCTGATGCAGCGACGCCGCGTGAGGGATGA CGGCCTTCGGGTTGTAAACCTCTTTCAGCAGGGAAGAAGCGAGAGTGACG GTACCTGCAGAAGAAGCGCCGGCTAACTACGTGCCAGCAGCCGCGGTAAT ACGTAGGGCGCGAGCGTTGTCCGGAATTATTGGGCGTAAAGAGCTCGTAG GCGGCTTGTCGCGTCGCGTTGTGAAAGCCCGGGGCTTAACCCCGGGTCTG CAGTCGATACGGGCAGGCTAGAGTTCGGTAGGGGAGATCGGAATTCCTGG TGTAGCGGTGAAATGCGCAGATATCAGGAGGAACACCGGTGGCGAAGGCG GATCTCTGGGCCGATACTGACGCTGAGGAGCGAAAGCGTGGGGAGCGAAC AGGATTAGATACCCTGGTAGTCCACGCCGTAAACGGTGGGCACTAGGTGT GGGCAACATTCCACGTTGTCCGTGCCGCAGCTAACGCATTAAGTGCCCCG CCTGGGGAGTACGGCCGCAAGGCTAAAACTCAAAGGAATTGACGGGGGCC CGCACAAGCGGCGGAGCATGTGGCTTAATTCGACGCAACGCGAAGAACCT TACCAAGGCTTGACATACACCGGAAAGCATTAGAGATAGTGCCCCCCTTG TGGTCGGTGTACAGGTGGTGCATGGCTGTCGTCAGCTCGTGTCGTGAGAT GTTGGGTTAAGTCCCGCAACGAGCGCAACCCTTGTCCCGTGTTGCCAGCA GGCCCTTGTGGTGCTGGGGACTCACGGGAGACCGCCGGGGTCAACTCGGA GGAAGGTGGGGACGACGTCAAGTCATCATGCCCCTTATGTCTTGGGCTGC 
ACACGTGCTACAATGGCCGGTACAATGAGCTGCGATACCGTGAGGTGGAG CGAATCTCAAAAAGCCGGTCTCAGTTCGGATTGGGGTCTGCAACTCGACC CCATGAAGTCGGAGTCGCTAGTAATCGCAGATCAGCATTGCTGCGGTGAA TACGTTCCCGGGCCTTGTACACACCGCCCGTCACGTCACGAAAGTCGGTA ACACCC >S.lividans GCAAGTCGAACGATGAACCACTTCGGTGGGGATTAGTGGCGAACGGGTGA GTAACACGTGGGCAATCTGCCCTTCACTCTGGGACAAGCCCTGGAAACGG GGTCTAATACCGGATACTGACCCTCGCAGGCATCTGCGAGGTTCGAAAGC TCCGGCGGTGAAGGATGAGCCCGCGGCCTATCAGCTTGTTGGTGAGGTAA TGGCTCACCAAGGCGACGACGGGTAGCCGGCCTGAGAGGGCGACCGGTCA CACTGGGACTGAGACACGGCCCAGACTCCTACGGGAGGCAGCAGTGGGGA ATATTGCACAATGGGGCGAAAGCCTGATGCAGCGACGCCGCGTGAGGGAT GACGGCCTTCGGGTTGTAAACCTCTTTCAGCAGGGAAGAAGCGAAAGTGA CGGTACCTGCAGAAGAAGCGCCGGCTAACTACGTGCCAGCAGCCGCGGTA ATACGTAGGGCGCAAGCGTTGTCCGGAATTATTGGGCGTAAAGAGCTCGT AGGCGGCTTGTCACGTCGGTTGTGAAAGCCCGGGGCTTAACCCCGGGTCT GCAGTCGATACGGGCAGGCTAGAGTTCGGTAGGGGAGATCGGAATTCCTG GTGTAGCGGTGAAATGCGCAGATATCAGGAGGAACACCGGTGGCGAAGGC GGATCTCTGGGCCGATACTGACGCTGAGGAGCGAAAGCGTGGGGAGCGAA CAGGATTAGATACCCTGGTAGTCCACGCCGTAAACGGTGGGCACTAGGTG TGGGCAACATTCCACGTTGTCCGTGCCGCAGCTAACGCATTAAGTGCCCC GCCTGGGGAGTACGGCCGCAAGGCTAAAACTCAAAGGAATTGACGGGGGC CCGCACAAGCGGCGGAGCATGTGGCTTAATTCGACGCAACGCGAAGAACC TTACCAAGGCTTGACATACACCGGAAAGCATCAGAGATGGTGCCCCCCTT GTGGTCGGTGTACAGGTGGTGCATGGCTGTCGTCAGCTCGTGTCGTGAGA TGTTGGGTTAAGTCCCGCAACGAGCGCAACCCTTGTCCCGTGTTGCCAGC AAGCCCTTCGGGGTGTTGGGGACTCACGGGAGACCGCCGGGGTCAACTCG GAGGAAGGTGGGGACGACGTCAAGTCATCATGCCCCTTATGTCTTGGGCT GCACACGTGCTACAATGGCCGGTACAATGAGCTGCGATACCGCAAGGTGG AGCGAATCTCAAAAAGCCGGTCTCAGTTCGGATTGGGGTCTGCAACTCGA CCCCATGAAGTCGGAGTCGCTAGTAATCGCAGATCAGCATTGCTGCGGTG AATACGTTCCCGGGCCTTGTACACACCGCCCGTCACGTCACGAAAGTCGG TAACACCC >S.rimosus GCAAGTCGAACGATGAAGCCCTTCGGGGTGGATTAGTGGCGAACGGGTGA GTAACACGTGGGCAATCTGCCCTGCGCTCTGGGACAAGCCCTGGAAACGG GGTCTAATACCGGATATGACACACGACCGCATGGTCTGTGTGTGGAAAGC TCCGGCGGTGCAGGATGAGCCCGCGGCCTATCAGCTTGTTGGTTAGGTAA TGCCTACCAAGGCGAGCGAGCGGGTAGCCGGCCTGAGAGGGCGACCGGCC ACACTGGGACTGAGACACGGCCCAGACTCCTACGGGAGGCAGCAGTGGGG AATATTGCACAATGGGCGCAAGCCTGATGCAGCGACGCCGCGTGAGGGAT 
GACGGCCTTCGGGTTGTAAACCTCTTTCAGCAGGGAAGAAGCGCAAGTGA CGGTACCTGCAGAAGAAGCGCCGGCTAACTACGTGCCAGCAGCCGCGGTA ATACGTAGGGCGCAAGCGTTGTCCGGAATTATTGGGCGTAAAGAGCTCGT AGGCGGCTTGTCGCGTCGGATGTGAAAGCCCGGGGCTTAACCCCGGGTCT GCATTCGATACGGGCAGGCTAGAGTTCGGTAGGGGAGATCGGAATTCCTG GTGTAGCGGTGAAATGCGCAGATATCAGGAGGAACGCCGGTGGCGAAGGC GGATCTCTGGGCCGATACTGACGCTGAGGAGCGAAAGCGTGGGGAGCGAA CAGGATTAGATACCCTGGTAGTCCACGCCGTAAACGTTGGGAACTAGGTG TGGGCGACATTCCACGTCGTCCGTGCCGCAGCTAACGCATTAATTGCCCC GCCTGGGGAGTACGGCCGCAAGGCTAAAACTCAAAGGAATTGCACGGGGG CCCGCACAAGCGGCGGAGCATGTGGCTTAATTCGACGCAACGCGAAGAAC CTTACCAAGGCTTGACATACACCGGAAACCTCTGGAGACAGGGGCCCCCT TGTGGTCGGTGTACAGGTGGTGCATGGCTGTCGTCAGCTCGTGTCGTGAG ATGTTGGGTTAAGTCCCGCAACGAGCGCAACCCTTGGCTCTGTTGCCAGC ATGCCTTTCGGGGTGATGGGGACTCACAGGAGACCGCCGGGGTCAACTCG GAGGAAGGTGGGGACGACCTCAAGTCATCATGCCCCTTATGTCTTGGGCT GCACACGTGCTACAATGGCCGGTACAATGAGCTGCGATACCGCGAGGTGG AGCGAATCTCAAAAAGCCGGTCTCAGTTCGGATTGGGGTCTGCAACTCGA CCCCATGAAGTCGGAGTCGCTAGTAATCGCAGATCAGCATTGCTGCGGTG AATACGTTCCCGGGCCTTGTACACACCGCCCGTCACGTCACGAAAGTCGG TAACACCC >S.scabies GCAAGTCGAACGATGAACCACTTCGGTGGGGATTAGTGGCGAACGGGTGA GTAACACGTGGGCAATCTGCCCTTCACTCTGGGACAAGCCCTGGAAACGG GGTCTAATACCGGATACGACACTCTCGGGCATCCGATGAGTGTGGAAAGC TCCGGCGGTGAAGGATGAGCCCGCGGCCTATCAGCTTGTTGGTGAGGTAA CGGCTCACCAAGGCGACGACGGGTAGCCGGCCTGAGAGGGCGACCGGCCA CACTGGGACTGAGACACGGCCCAGACTCCTACGGGAGGCAGCAGTGGGGA ATATTGCACAATGGGCGAAAGCCTGATGCAGCGACGCCGCGTGAGGGATG ACGGCCTTCGGGTTGTAAACCTCTTTCAGCAGGGAAGAAGCGAAAGTGAC GGTACCTGCAGAAGAAGCGCCGGCTAACTACGTGCCAGCAGCCGCGGTAA TACGTAGGGCGCGAGCGTTGTCCGGAATTATTGGGCGTAAAGAGCTCGTA GGCGGTCTGTCGCGTCGGATGTGAAAGCCCGGGGCTTAACCCCGGGTCTG CATTCGATACGGGCAGACTAGAGTGTGGTAGGGGAGATCGGAATTCCTGG TGTAGCGGTGAAATGCGCAGATATCAGGAGGAACACCGGTGGCGAAGGCG GATCTCTGGGCCATTACTGACGCTGAGGAGCGAAAGCGTGGGGAGCGAAC AGGATTAGATACCCTGGTAGTCCACGCCGTAAACGGTGGGAACTAGGTGT TGGCGACATTCCACGTCGTCGGTGCCGCAGCTAACGCATTAAGTTCCCCG CCTGGGGAGTACGGCCGCAAGGCTAAAACTCAAAGGAATTGACGGGGGCC CGCACAAGCAGCGGAGCATGTGGCTTAATTCGACGCAACGCGAAGAACCT 
TACCAAGGCTTGACATACACCGGAAACGGCCAGAGATGGTCGCCCCCTTG TGGTCGGTGTACAGGTGGTGCATGGCTGTCGTCAGCTCGTGTCGTGAGAT GTTGGGTTAAGTCCCGCAACGAGCGCAACCCTTGTTCTGTGTTGCCAGCA TGCCCTTCGGGGTGATGGGGACTCACAGGAGACTGCCGGGGTCAACTCGG AGGAAGGTGGGGACGACGTCAAGTCATCATGCCCCTTATGTCTTGGGCTG CACACGTGCTACAATGGCAGGTACAATGAGCTGCGAAGCCGTGAGGCGGA GCGAATCTCAAAAAGCCTGTCTCAGTTCGGATTGGGGTCTGCAACTCGAC CCCATGAAGTCGGAGTTGCTAGTAATCGCAGATCAGCATTGCTGCGGTGA ATACGTTCCCGGGCCTTGTACACACCGCCCGTCACGTCACGAAAGTCGGT AACACCC >S.tendae GCAAGTCGAACGATGAACCACTTCGGTGGGGATTAGTGGCGAACGGGTGA GTAACACGTGGGCAATCTGCCCTGCACTCTGGGACAAGCCCTGGAAACGG GGTCTAATACCGGATACTGACCCTCGCAGGCATCTGCGAGGTTCGAAAGC TCCGGCGGTGCAGGATGAGCCCGCGGCCTATCAGCTTGTTGGTGAGGTAA TGGCTCACCAAGGCGACGACGGGTAGCCGGCCTGAGAGGGCGACCGGCCA CACTGGGACTGAGACACGGCCCAGACTCCTACGGGAGGCAGCAGTGGGGA ATATTGCACAATGGGCGAAAGCCTGATGCAGCGACGCCGCGTGAGGGATG ACGGCCTTCGGGTTGTAAACCTCTTTCAGCAGGGAAGAAGCGAAAGTGAC GGTACCTGCAGAAGAAGCGCCGGCTAACTACGTGCCAGCAGCCGCGGTAA TACGTAGGGCGCAAGCGTTGTCCGGAATTATTGGGCGTAAAGAGCTCGTA GGCGGCTTGTCACGTCGGTTGTGAAAGCCCGGGGCTTAACCCCGGGTCTG CAGTCGATACGGGCAGGCTAGAGTTCGGTAGGGGAGATCGGAATTCCTGG TGTAGCGGTGAAATGCGCAGATATCAGGAGGAACACCGGTGGCGAAGGCG GATCTCTGGGCCGATACTGACGCTGAGGAGCGAAAGCGTGGGGAGCGAAC AGGATTAGATACCCTGGTAGTCCACGCCGTAAACGGTGGGCACTAGGTGT GGGCAACATTCCACGTTGTCCGTGCCGCAGCTAACGCATTAAGTGCCCCG CCTGGGGAGTACGGCCGCAAGGCTAAAACTCAAAGGAATTGACGGGGGCC CGCACAAGCGGCGGAGCATGTGGCTTAATTCGACGCAACGCGAAGAACCT TACCAAGGCTTGACATACACCGGAAAGCATCAGAGATGGTGCCCCCCTTG TGGTCGGTGTACAGGTGGTGCATGGCTGTCGTCAGCTCGTGTCGTGAGAT GTTGGGTTAAGTCCCGCAACGAGCGCAACCCTTGTCCCGTGTTGCCAGCA GGCCCTTGTGGTGCTGGGGACTCACGGGAGACCGCCGGGGTCAACTCGGA GGAAGGTGGGGACGACGTCAAGTCATCATGCCCCTTATGTCTTGGGCTGC ACACGTGCTACAATGGCCGGTACAATGAGCTGCGATACCGCAAGGTGGAG CGAATCTCAAAAAGCCGGTCTCAGTTCGGATTGGGGTCTGCAACTCGACC CCATGAAGTCGGAGTCGCTAGTAATCGCAGATCAGCATTGCTGCGGTGAA TACGTTCCCGGGCCTTGTACACACCGCCCGTCACGTCACGAAAGTCGGTA ACACCC >S.virginiae GCAAGTCGAACGATGAAGCCCTTCGGNGTGGATTAGTGGGGAACGGGTGA GTANCACGTGGCNAATCTGCCCTNCACTCTGCAACAAGCCCTGGAAACGG 
GGTCTAATACCGGATACCACTCCTGCCTGCATGGGCGGGGGTTGAAAGCT CCGGCGGTGAAGGATGAGCCCGCGGCCTATCAGCTTGTTGGTGGGGTAAT GGCCCACCAAGGCGACGACGGGTAGCCGGCCTGAGAGGGCGACCGGCCAC ACTGGGACTGAGACACGGCCCAGACTCCTACGGGAGGCAGCAGTGGGGAA TATTGCACAATGGGCGAAAGCCTGATGCAGCGACGCCGCGTGAGGGATGA CGGCCTTCGGGTTGTAAACCTCTTTCAGCAGGGAAGAAGCGAAAGTGACG GTACCTGCAGAAGAAGCGCCGGCTAACTACGTGCCAGCAGCCGCGGTAAT ACGTAGGGCGCAAGCGTTGTCCGGAATTATTGGGCGTAAAGAGCTCGTAG GCGGCTTGTCACGTCGGATGTGAAAGCCCGAGGCTTAACCTCGGGTCTGC ATTCGATNCGGGCTAGCTAGAGTGTGGTAGGGGAGATCGGAATTCCTGGT GTAGCGGTGAAATGCGCAGATATCAGGAGGAACACCGGTGGCGAAGGCGG ATCTCTGGGCCATTACTGACGCTGAGGAGCGAAAGCGTGGGGAGCGAACA GGATTAGATACCCTGGTAGTCCACGCCGTAAACGTTGGGAACTAGGTGTT GGCGACATTCCACGTCGTCGGTGCCGCAGCTAACGCATTAAGTTCCCCGC CTGGGGAGTACGGCCGCAAGGCTAAAACTCANAGGAATTGACGGGGGCCC GCACAAGCAGCGGAGCATGTGGCTTAATTCGACGCAACGNGAAGAACCTT ACCAAGGCTTGACATATACCGGAAAGCATTAGAGATAGTGCCCCCCTTGT GGTCGGTATACAGGTGGTGCATGGCTGTCGTCAGCTCGTGTCGTGAGATG TTGGGTTAAGTCCCGCAACGAGCGCAACCCTTGTCCTGTGTTGCCAGCAT NCCCTTCGGGGTGATGGGGACTCACAGGAGACCGCCGGGGTCAACTCGGA GGAAGGTGGGGACGACGTCAAGTCATCATGCCCCTTATGTCTTGGGCTGC ACACGTGCTACAATGGCCGGTACAATGAGCTGCGATACCGTGAGGTGGAG CGAATCTCAAAAAGCCGGTCTCAGTTCGGATTGGGGTCTGCAACTCGACC CCATGAAGTTGGAGTTGCTAGTAATCGCAGATCAGCATTGCTGCGGTGAA TACGTTCCCGGGCCTTGTACACACCGCCCGTCACGTCACGAAAGTCGGTA ACACCC >B.subtilis GCAAGTCGAGCGGACAGATGGGAGCTTGCTCCCTGATGTTAGCGGCGGAC GGGTGAGTAACACGTGGGTAACCTGCCTGTAAGACTGGGATAACTCCGGG AAACCGGGGCTAATACCGGATGGTTGTTTGAACCGCATGGTTCAAACATA AAAGGTGGCTTCGGCTACCACTTACAGATGGACCCGCGGCGCATTAGCTA GTTGGTGAGGTAACGGCTCACCAAGGCAACGATGCGTAGCCGACCTGAGA GGGTGATCGGCCACACTGGGACTGAGACACGGCCCAGACTCCTACGGGAG GCAGCAGTAGGGAATCTTCCGCAATGGACGAAAGTCTGACGGAGCAACGC CGCGTGAGTGATGAAGGTTTTCGGATCGTAAAGCTCTGTTGTTAGGGAAG AACAAGTACCGTTCGAATAGGGCGGTACCTTGACGGTACCTAACCAGAAA GCCACGGCTAACTACGTGCCAGCAGCCGCGGTAATACGTAGGTGGCAAGC GTTNTCCGGAATTATTGGGCGTAAAGGGCTCGCAGGCGGTTTCTTAAGTC TGATGTGAAAGCCCCCGGCTCAACCGGGGAGGGTCATTGGAAACTGGGGA ACTTGAGTGCAGAAGAGGAGAGTGGAATTCCACGTGTNGCGGTGAAATGC 
GTAGAGATGTGGAGGAACACCAGTGGCGAAGGCGACTCTCTGGTCTGTAA CTGACGCTGAGGAGCGAAAGCGTGGGGAGCGAACAGGATTAGATACCCTG GTAGTCCACGCCGTAAACGATGAGTGCTAAGTGTTAGGGGGTTTCCGCCC CTTAGTGCTGCAGTAACGCATTNAGCACTCCGCCTGGGGAGTACGGTCGC AAGACTGAAACTCAAAGGAATTGACGGGGGCCGCACAAGCGGTGGAGCAT GTGGTTTAATTCGAAGCAACGCGAAGAACCTTACCAGGTCTTGACATCCT CTGACAATCCTAGAGATAGGACGTCTTCGGGGGCAGAGTGACAGGTGGTG CATGGTTGTCGTCAGCTCGTGTCGTGAGATGTTGGGTTAAGTCCCGCAAC GAGCGCAACCCTGGATCTTAGTTGCCAGCATTCAGTTGGGCACTCTAAGG TGACTGCCGGTGACAAACCGGAGGAAGGTGGGGATGACGTCAAATCATCA TGCCCCTTATGACCTGGGCTACACACGTGCTACAATGGACAGAACAAAGG GCAGCGAAACCGCGAGGTTAAGCCAATCCCACAAATCTGTTCTCAGTTCG GATCGCAGTCTGCAACTCGACTGCGTGAAGCTGGAATCGCTAGTAATCGC GGATCAGCATGCCGCGGTGAATACGTTCCCGGGCCTTGTACACACCGCCC GTCACACCACGAGAGTTTGTAACACCC >N.asteroides GCAAGTCGAGCGGTAAGGCCCTTCGGGTACACGAGCGGCGAACGGGTGAG TAACACGTGGGTGATCTGCCTCGTACTTCGGGATAAGCCTGGGAAACTGG GTCTAATACCGGATATGACCTTCGGATGCATGTCTGAGGGTGGAAAGATT TATCGGTACGAGATGGGCCCGCGGCCTATCAGCTTGTTGGTGGGGTAATG GCCTACCAAGGCGACGACGGGTAGCCGGCCTGAGAGGGCGACCGGCCACA CTGGGACTGAGACACGGCCCAGACTCCTACGGGAGGCAGCAGTGGGGAAT ATTGCACAATGGGCGAAAGCCTGATGCAGCGACGCCGCGTGAGGGATGAC GGCCTTCGGGTTGTAAACCTCTTTCGACAGGGACGAAGCGCAAGTGACGG TACCTGTAGAAGAAGCACCGGCCAACTACGTGCCAGCAGCCGCGGTAATA CGTAGGGTGCGAGCGTTGTCCGGAATTACTGGGCGTAAAGAGCTTGTAGG CGGTTCGTCGCGTCGTTCGTGAAAACTTGGGGCTCAACCCCAAGCTTGCG GGCGATACGGGCGGACTAGAGTACTTCAGGGGAGACTGGAATTCCTGGTG TAGCGGTGAAATGCGCAGATATCAGGAGGAACACCGGTGGCGAAGGCGGG TCTCTGGGAAGTAACTGACGCTGAGAAGCGAAAGCGTGGGTAGCGAACAG GATTAGATACCCTGGTAGTCCACGNCGTAAACGGTGGGTACTAGGTGTGG GTTTCCTTCCACGGGATCCGTGCCGTAGCTAACGCATTAAGTACCCNGCC TGGGGAGTACGGCCGCAAGGCTAAAACTCAAAGGAATTGACGGGGGCCNG CACAAGCGGCGGAGCATGTGGATTAATTCGATGCAACGCGAAGAACCTTA CCTGGGTTTGACATACACCGGAAACCTGCAGAGATGTAGGCCCCCTTGTG GTCGGTGTACAGGTGGTGCATGGCTGTCGTCAGCTCGTGTCGTGAGATGT TGGGTTAAGTCCCGCAACGAGCGCAACCCTTATCTTATGTTGCCAGCGCG TAATGGCGGGGACTCGTGAGAGACTGCCGGGGTCAACTCGGAGGAAGGTG GGGACGACGTCAAGTCATCATGCCCCTTATGTCCAGGGCTTCACACATGC TACAATGGCCGGTACAGAGGGCTGCGATACCGTGAGGTGGAGCGAATCCC 
TTAAAGCCGGTCTCAGTTCGGATCGGGGTCTGCAACTCGACCCCGTGAAG TTGGAGTCGCTAGTAATCGCAGATCAGCAACGCTGCGGTGAATACGTTCC CGGGCCTTGTACACACCGCCCGTCACGTCATGAAAGTCGGTAACACCC >M.tuberculosis GCAAGTCGAACGGAAAGGTCTCTTCGGAGATACTCGAGTGGCGAACGGGT GAGTAACACGTGGGTGATCTGCCCTGCACTTCGGGATAAGCCTGGGAAAC TGGGTCTAATACCGGATAGGACCACGGGATGCATGTCTTGTGGTGGAAAG CGCTTTAGCGGTGTGGGATGAGCCCGCGGCCTATCAGCTTGTTGGTGGGG TGACGGCCTACCAAGGCGACGACGGGTAGCCGGCCTGAGAGGGTGTCCGG CCACACTGGGACTGAGATACGGCCCAGACTCCTACGGGAGGCAGCAGTGG GGAATATTGCACAATGGGCGCAAGCCTGATGCAGCGACGCCGCGTGGGGG ATGACGGCCTTCGGGTTGTAAACCTCTTTCACCATCGACGAAGGTCCGGG TTCTCTCGGATTGACGGTAGGTGGAGAAGAAGCACCGGCCAACTACGTGC CAGCAGCCGCGGTAATACGTAGGGTGCGAGCGTTGTCCGGAATTACTGGG CGTAAAGAGCTCGTAGGTGGTTTGTCGCGTTGTTCGTGAAATCTCACGGC TTAACTGTGAGCGTGCGGGCGATACGGGCAGACTAGAGTACTGCAGGGGA GACTGGAATTCCTGGTGTAGCGGTGGAATGCGCAGATATCAGGAGGAACA CCGGTGGCGAAGGCGGGTCTCTGGGCAGTAACTGACGCTGAGGAGCGAAA GCGTGGGGAGCGAACAGGATTAGATACCCTGGTAGTCCACGCCGTAAACG GTGGGTACTAGGTGTGGGTTTCCTTCCTTGGGATCCGTGCCGTAGCTAAC GCATTAAGTACCCCGCCTGGGGAGTACGGCCGCAAGGCTAAAACTCAAAG GAATTGACGGGGGCCCGCACAAGCGGCGGAGCATGTGGATTAATTCGATG CAACGCGAAGAACCTTACCTGGGTTTGACATGCACAGGACGCGTCTAGAG ATAGGCGTTCCCTTGTGGCCTGTGTGCAGGTGGTGCATGGCTGTCGTCAG CTCGTGTCGTGAGATGTTGGGTTAAGTCCCGCAACGAGCGCAACCCTTGT CTCATGTTGCCAGCACGTAATGGTGGGGACTCGTGAGAGACTGCCGGGGT CAACTCGGAGGAAGGTGGGGATGACGTCAAGTCATCATGCCCCTTATGTC CAGGGCTTCACACATGCTACAATGGCCGGTACAAAGGGCTGCGATGCCGC GAGGTTAAGCGAATCCTTAAAAGCCGGTCTCAGTTCGGATCGGGGTCTGC AACTCGACCCCGTGAAGTCGGAGTCGCTAGTAATCGCAGATCAGCAACGC TGCGGTGAATACGTTCCCGGGCCTTGTACACACCGCCCGTCACGTCATGA AAGTCGGTAACACCC >M.bovis GCAAGTCGAACGGAAAGGTCTCTTCGGAGATACTCGAGTGGCGAACGGGT GAGTAACACGTGGGTGATCTGCCCTGCACTTCGGGATAAGCCTGGGAAAC TGGGTCTAATACCGGATAGGACCACGGGATGCATGTCTTGTGGTGGAAAG CGCTTTAGCGGTGTGGGATGAGCCCGCGGCCTATCAGCTTGTTGGTGGGG TGACGGCCTACCAAGGCGACGACGGGTAGCCGGCCTGAGAGGGTGTCCGG CCACACTGGGACTGAGATACGGCCCAGACTCCTACGGGAGGCAGCAGTGG GGAATATTGCACAATGGGCGCAAGCCTGATGCAGCGACGCCGCGTGGGGG ATGACGGCCTTCGGGTTGTAAACCTCTTTCACCATCGACGAAGGTCCGGG 
TTCTCTCGGATTGACGGTAGGTGGAGAAGAAGCACCGGCCAACTACGTGC CAGCAGCCGCGGTAATACGTAGGGTGCGAGCGTTGTCCGGAATTACTGGG CGTAAAGAGCTCGTAGGTGGTTTGTCGCGTTGTTCGTGAAATCTCACGGC TTAACTGTGAGCGTGCGGGCGATACGGGCAGACTAGAGTACTGCAGGGGA GACTGGAATTCCTGGTGTAGCGGTGGAATGCGCAGATATCAGGAGGAACA CCGGTGGCGAAGGCGGGTCTCTGGGCAGTAACTGACGCTGAGGAGCGAAA GCGTGGGGAGCGAACAGGATTAGATACCCTGGTAGTCCACGCCGTAAACG GTGGGTACTAGGTGTGGGTTTCCTTCCTTGGGATCCGTGCCGTAGCTAAC GCATTAAGTACCCCGCCTGGGGAGTACGGCCGCAAGGCTAAAACTCAAAG GAATTGACGGGGGCCCGCACAAGCGGCGGAGCATGTGGATTAATTCGATG CAACGCGAAGAACCTTACCTGGGTTTGACATGCACAGGACGCGTCTAGAG ATAGGCGTTCCCTTGTGGCCTGTGTGCAGGTGGTGCATGGCTGTCGTCAG CTCGTGTCGTGAGATGTTGGGTTAAGTCCCGCAACGAGCGCAACCCTTGT CTCATGTTGCCAGCACGTAATGGTGGGGACTCGTGAGAGACTGCCGGGGT CAACTCGGAGGAAGGTGGGGATGACGTCAAGTCATCATGCCCCTTATGTC CAGGGCTTCACACATGCTACAATGGCCGGTACAAAGGGCTGCGATGCCGC GAGGTTAAGCGAATCCTTAAAAGCCGGTCTCAGTTCGGATCGGGGTCTGC AACTCGACCCCGTGAAGTCGGAGTCGCTAGTAATCGCAGATCAGCAACGC TGCGGTGAATACGTTCCCGGGCCTTGTACACACCGCCCGTCACGTCATGA AAGTCGGTAACACCC