Anna Horvath Week 6

From OpenWetWare
Jump to navigationJump to search

Purpose

To determine, through a comparison of eighteen different ACE-2 sequences, which of them are likely intermediary host for SARS-CoV-2, which has the most similar ACE2 receptor to humans. This could then be used to inform future comparisons of coronaviruses by understanding the ACE-2 similarities

Methods/Results

Part 1: GenBank

  • I went to GenBank in order to find the nineteen sequences.
  • Sequences used included:
    • humans [2]
    • civet [3]
    • Chinese bats [4]
    • mice [5]
    • rats [6]
    • pigs [7]
    • ferrets [8]
    • cats [9]
    • orangutans [10]
    • grivet monkeys [11]
    • fox [12]
    • chickens [13]
    • king cobras [14]
    • pangolins [15]
    • dromedary camels [16]
    • squirrels [17]
    • mink [18]
    • Chinese softshell turtles [19]
>NP_001358344.1 angiotensin-converting enzyme 2 isoform 1 precursor [Homo sapiens]
MSSSSWLLLSLVAVTAAQSTIEEQAKTFLDKFNHEAEDLFYQSSLASWNYNTNITEENVQNMNNAGDKWS
AFLKEQSTLAQMYPLQEIQNLTVKLQLQALQQNGSSVLSEDKSKRLNTILNTMSTIYSTGKVCNPDNPQE
CLLLEPGLNEIMANSLDYNERLWAWESWRSEVGKQLRPLYEEYVVLKNEMARANHYEDYGDYWRGDYEVN
GVDGYDYSRGQLIEDVEHTFEEIKPLYEHLHAYVRAKLMNAYPSYISPIGCLPAHLLGDMWGRFWTNLYS
LTVPFGQKPNIDVTDAMVDQAWDAQRIFKEAEKFFVSVGLPNMTQGFWENSMLTDPGNVQKAVCHPTAWD
LGKGDFRILMCTKVTMDDFLTAHHEMGHIQYDMAYAAQPFLLRNGANEGFHEAVGEIMSLSAATPKHLKS
IGLLSPDFQEDNETEINFLLKQALTIVGTLPFTYMLEKWRWMVFKGEIPKDQWMKKWWEMKREIVGVVEP
VPHDETYCDPASLFHVSNDYSFIRYYTRTLYQFQFQEALCQAAKHEGPLHKCDISNSTEAGQKLFNMLRL
GKSEPWTLALENVVGAKNMNVRPLLNYFEPLFTWLKDQNKNSFVGWSTDWSPYADQSIKVRISLKSALGD
KAYEWNDNEMYLFRSSVAYAMRQYFLKVKNQMILFGEEDVRVANLKPRISFNFFVTAPKNVSDIIPRTEV
EKAIRMSRSRINDAFRLNDNSLEFLGIQPTLGPPNQPPVSIWLIVFGVVMGVIVVGIVILIFTGIRDRKK
KNKARSGENPYASIDISKGENNPGFQNTDDVQTSF
>AAX63775.1 angiotensin-converting enzyme 2 [Paguma larvata]
MSGSFWLLLSFAALTAAQSTTEELAKTFLETFNYEAQELSYQSSVASWNYNTNITDENAKNMNEAGAKWS
AYYEEQSKLAQTYPLAEIQDAKIKRQLQALQQSGSSVLSADKSQRLNTILNAMSTIYSTGKACNPNNPQE
CLLLEPGLDNIMENSKDYNERLWAWEGWRAEVGKQLRPLYEEYVALKNEMARANNYEDYGDYWRGDYEEE
WTGGYNYSRNQLIQDVEDTFEQIKPLYQHLHAYVRAKLMDTYPSRISRTGCLPAHLLGDMWGRFWTNLYP
LTVPFGQKPNIDVTDAMVNQNWDARRIFKEAEKFFVSVGLPNMTQGFWENSMLTEPGDGRKVVCHPTAWD
LGKGDFRIKMCTKVTMDDFLTAHHEMGHIQYDMAYAAQPFLLRNGANEGFHEAVGEIMSLSAATPNHLKT
IGLLSPAFSEDNETEINFLLKQALTIVGTLPFTYMLEKWRWMVFKGAIPKEQWMQKWWEMKRNIVGVVEP
VPHDETYCDPASLFHVANDYSFIRYYTRTIYQFQFQEALCQIAKHEGPLHKCDISNSTEAGKKLLEMLSL
GRSEPWTLALERVVGAKNMNVTPLLNYFEPLFTWLKEQNRNSFVGWDTDWRPYSDQSIKVRISLKSALGE
KAYEWNDNEMYLFRSSIAYAMREYFSKVKNQTIPFVEDNVWVSDLKPRISFNFFVTFSNNVSDVIPRSEV
EDAIRMSRSRINDAFRLDDNSLEFLGIEPTLSPPYRPPVTIWLIVFGVVMGAIVVGIVLLIVSGIRNRRK
NDQAGSEENPYASVDLNKGENNPGFQHADDVQTSF
>AGZ48803.1 angiotensin-converting enzyme 2 [Rhinolophus sinicus]
MSGSSWLLLSLVAVTTAQSTTEDEAKMFLDKFNTKAEDLSHQSSLASWDYNTNINDENVQKMDEAGAKWS
AFYEEQSKLAKNYSLEQIQNVTVKLQLQILQQSGSPVLSEDKSKRLNSILNAMSTIYSTGKVCKPNKPQE
CLLLEPGLDNIMGTSKDYNERLWAWEGWRAEVGKQLRPLYEEYVVLKNEMARGYHYEDYGDYWRRDYETE
ESPGPGYSRDQLMKDVERIFTEIKPLYEHLHAYVRAKLMDTYPFHISPTGCLPAHLLGDMWGRFWTNLYP
LTVPFGQKPNIDVTDEMLKQGWDADRIFKEAEKFFVSVGLPNMTEGFWNNSMLTEPGDGRKVVCHPTAWD
LGKGDFRIKMCTKVTMEDFLTAHHEMGHIQYDMAYASQPYLLRNGANEGFHEAVGEVMSLSVATPKHLKT
MGLLSPDFREDNETEINFLLKQALNIVGTLPFTYMLEKWRWMVFKGEIPKEEWMKKWWEMKRKIVGVVEP
VPHDETYCDPASLFHVANDYSFIRYYTRTIFEFQFHEALCRIAQHDGPLHKCDISNSTDAGKKLHQMLSV
GKSQAWTKTLEDIVDSRNMDVGPLLKYFEPLYTWLQEQNRKSYVGWNTDWSPYSDQSIKVRISLKSALGE
NAYEWNDNEMYLFRSSVAYAMREYFLKEKHQTILFGAENVWVSNLKPRISFNFHVTSPGNLSDIIPRPEV
EGAIRMSRSRINDAFRLDDNSLEFLGIQPTLGPPYQPPVTIWLIVFGVVMAVVVVGIVVLIITGIRDRRK
TDQARSEENPYSSVDLSKGENNPGFQNGDDVQTSF
>NP_001123985.1 angiotensin-converting enzyme 2 precursor [Mus musculus]
MSSSSWLLLSLVAVTTAQSLTEENAKTFLNNFNQEAEDLSYQSSLASWNYNTNITEENAQKMSEAAAKWS
AFYEEQSKTAQSFSLQEIQTPIIKRQLQALQQSGSSALSADKNKQLNTILNTMSTIYSTGKVCNPKNPQE
CLLLEPGLDEIMATSTDYNSRLWAWEGWRAEVGKQLRPLYEEYVVLKNEMARANNYNDYGDYWRGDYEAE
GADGYNYNRNQLIEDVERTFAEIKPLYEHLHAYVRRKLMDTYPSYISPTGCLPAHLLGDMWGRFWTNLYP
LTVPFAQKPNIDVTDAMMNQGWDAERIFQEAEKFFVSVGLPHMTQGFWANSMLTEPADGRKVVCHPTAWD
LGHGDFRIKMCTKVTMDNFLTAHHEMGHIQYDMAYARQPFLLRNGANEGFHEAVGEIMSLSAATPKHLKS
IGLLPSDFQEDSETEINFLLKQALTIVGTLPFTYMLEKWRWMVFRGEIPKEQWMKKWWEMKREIVGVVEP
LPHDETYCDPASLFHVSNDYSFIRYYTRTIYQFQFQEALCQAAKYNGSLHKCDISNSTEAGQKLLKMLSL
GNSEPWTKALENVVGARNMDVKPLLNYFQPLFDWLKEQNRNSFVGWNTEWSPYADQSIKVRISLKSALGA
NAYEWTNNEMFLFRSSVAYAMRKYFSIIKNQTVPFLEEDVRVSDLKPRVSFYFFVTSPQNVSDVIPRSEV
EDAIRMSRGRINDVFGLNDNSLEFLGIHPTLEPPYQPPVTIWLIIFGVVMALVVVGIIILIVTGIKGRKK
KNETKREENPYDSMDIGKGESNAGFQNSDDAQTSF
>AAW78017.1 angiotensin converting enzyme 2 [Rattus norvegicus]
MSSSCWLLLSLVAVATAQSLIEEKAESFLNKFNQEAEDLSYQSSLASWNYNTNITEENAQKMNEAAAKWS
AFYEEQSKIAQNFSLQEIQNATIKRQLKALQQSGSSALSPDKNKQLNTILNTMSTIYSTGKVCNSMNPQE
CFLLEPGLDEIMATSTDYNRRLWAWEGWRAEVGKQLRPLYEEYVVLKNEMARANNYEDYGDYWRGDYEAE
GVEGYNYNRNQLIEDVENTFKEIKPLYEQLHAYVRTKLMEVYPSYISPTGCLPAHLLGDMWGRFWTNLYP
LTTPFLQKPNIDVTDAMVNQSWDAERIFKEAEKFFVSVGLPQMTPGFWTNSMLTEPGDDRKVVCHPTAWD
LGHGDFRIKMCTKVTMDNFLTAHHEMGHIQYDMAYAKQPFLLRNGANEGFHEAVGEIMSLSAATPKHLKS
IGLLPSNFQEDNETEINFLLKQALTIVGTLPFTYMLEKWRWMVFQDKIPREQWTKKWWEMKREIVGVVEP
LPHDETYCDPASLFHVSNDYSFIRYYTRTIYQFQFQEALCQAAKHDGPLHKCDISNSTEAGQKLLNMLSL
GNSGPWTLALENVVGSRNMDVKPLLNYFQPLFVWLKEQNRNSTVGWSTDWSPYADQSIKVRISLKSALGK
NAYEWTDNEMYLFRSSVAYAMREYFSREKNQTVPFGEADVWVSDLKPRVSFNFFVTSPKNVSDIIPRSEV
EEAIRMSRGRINDIFGLNDNSLEFLGIYPTLKPPYEPPVTIWLIIFGVVMGTVVVGIVILIVTGIKGRKK
KNETKREENPYDSMDIGKGESNAGFQNSDDAQTSF
>NP_001116542.1 angiotensin-converting enzyme 2 precursor [Sus scrofa]
MSGSFWLLLSLIPVTAAQSTTEELAKTFLEKFNLEAEDLAYQSSLASWTINTNITDENIQKMNDARAKWS
AFYEEQSRIAKTYPLDEIQTLILKRQLQALQQSGTSGLSADKSKRLNTILNTMSTIYSSGKVLDPNNPQE
CLVLEPGLDEIMENSKDYSRRLWAWESWRAEVGKQLRPLYEEYVVLENEMARANNYEDYGDYWRGDYEVT
GTGDYDYSRNQLMEDVERTFAEIKPLYEHLHAYVRAKLMDAYPSRISPTGCLPAHLLGDMWGRFWTNLYP
LTVPFGEKPSIDVTEAMVNQSWDAIRIFEEAEKFFVSIGLPNMTQGFWNNSMLTEPGDGRKVVCHPTAWD
LGKGDFRIKMCTKVTMDDFLTAHHEMGHIQYDMAYAIQPYLLRNGANEGFHEAVGEIMSLSAATPHYLKA
LGLLPPDFYEDSETEINFLLKQALTIVGTLPFTYMLEKWRWMVFKGEIPKEQWMQKWWEMKREIVGVVEP
LPHDETYCDPACLFHVAEDYSFIRYYTRTIYQFQFHEALCRTAKHEGPLYKCDISNSTEAGQKLLQMLSL
GKSEPWTLALENIVGVKTMDVKPLLSYFEPLLTWLKAQNGNSSVGWNTDWTPYADQSIKVRISLKSALGE
DAYEWNDNEMYLFRSSIAYAMRNYFSSAKNETIPFGAVDVWVSDLKPRISFNFFVTSPANMSDIIPRSDV
EKAISMSRSRINDAFRLDDNTLEFLGIQPTLGPPDEPPVTVWLIIFGVVMGLVVVGIVVLIFTGIRDRRK
KKQASSEENPYGSMDLSKGESNSGFQNGDDIQTSF
>BAE53380.1 angiotensin I converting enzyme 2 [Mustela putorius furo]
MLGSSWLLLSLAALTAAQSTTEDLAKTFLEKFNYEAEELSYQNSLASWNYNTNITDENIQKMNIAGAKWS
AFYEEESQHAKTYPLEEIQDPIIKRQLRALQQSGSSVLSADKRERLNTILNAMSTIYSTGKACNPNNPQE
CLLLEPGLDDIMENSKDYNERLWAWEGWRSEVGKQLRPLYEEYVALKNEMARANNYEDYGDYWRGDYEEE
WADGYSYSRNQLIEDVEHTFTQIKPLYEHLHAYVRAKLMDAYPSRISPTGCLPAHLLGDMWGRFWTNLYP
LMVPFRQKPNIDVTDAMVNQSWDARRIFEEAETFFVSVGLPNMTEGFWQNSMLTEPGDNRKVVCHPTAWD
LGKRDFRIKMCTKVTMDDFLTAHHEMGHIQYDMAYAEQPFLLRNGANEGFHEAVGEIMSLSAATPNHLKN
IGLLPPDFSEDSETDINFLLKQALTIVGTLPFTYMLEKWRWMVFKGEIPKEQWMQKWWEMKRDIVGVVEP
LPHDETYCDPAALFHVANDYSFIRYYTRTIYQFQFQEALCQIAKHEGPLYKCDISNSSEAGQKLHEMLSL
GRSKPWTFALERVVGAKTMDVRPLLNYFEPLFTWLKEQNRNSFVGWNTDWSPYADQSIKVRISLKSALGE
KAYEWNDNEMYFFQSSIAYAMREYFSKVKNQTIPFVGKDVRVSDLKPRISFNFIVTSPENMSDIIPRADV
EEAIRKSRGRINDAFRLDDNSLEFLGIQPTLEPPYQPPVTIWLIVFGVVMGVVVVGIFLLIFSGIRNRRK
NNQARSEENPYASVDLSKGENNPGFQNVDDVQTSF
>AAX59005.1 angiotensin I converting enzyme 2 [Felis catus]
MSGSFWLLLSFAALTAAQSTTEELAKTFLEKFNHEAEELSYQSSLASWNYNTNITDENVQKMNEAGAKWS
AFYEEQSKLAKTYPLAEIHNTTVKRQLQALQQSGSSVLSADKSQRLNTILNAMSTIYSTGKACNPNNPQE
CLLLEPGLDDIMENSKDYNERLWAWEGWRAEVGKQLRPLYEEYVALKNEMARANNYEDYGDYWRGDYEEE
WTDGYNYSRSQLIKDVEHTFTQIKPLYQHLHAYVRAKLMDTYPSRISPTGCLPAHLLGDMWGRFWTNLYP
LTVPFGQKPNIDVTDAMVNQSWDARRIFKEAEKFFVSVGLPNMTQGFWENSMLTEPGDSRKVVCHPTAWD
LGKGDFRIKMCTKVTMDDFLTAHHEMGHIQYDMAYAVQPFLLRNGANEGFHEAVGEIMSLSAATPNHLKT
IGLLSPGFSEDSETEINFLLKQALTIVGTLPFTYMLEKWRWMVFKGEIPKEQWMQKWWEMKREIVGVVEP
VPHDETYCDPASLFHVANDYSFIRYYTRTIYQFQFQEALCRIAKHEGPLHKCDISNSSEAGKKLLQMLTL
GKSKPWTLALEHVVGEKKMNVTPLLKYFEPLFTWLKEQNRNSFVGWNTDWRPYADQSIKVRISLKSALGD
EAYEWNDNEMYLFRSSVAYAMREYFSKVKNQTIPFVEDNVWVSNLKPRISFNFFVTASKNVSDVIPRSEV
EEAIRMSRSRINDAFRLDDNSLEFLGIQPTLSPPYQPPVTIWLIVFGVVMGVVVVGIVLLIVSGIRNRRK
NNQARSEENPYASVDLSKGENNPGFQHADDVQTSF
>NP_001124604.1 angiotensin-converting enzyme 2 precursor [Pongo abelii]
MSGSSWLLLSLVAVTAAQSTIEEQAKTFLDKFNHEAEDLFYQSSLASWNYNTNITEENVQNMNNAGDKWS
AFLKEQSTLAQMYPLQEIQNLTVKLQLQALQQNGSSVLSEDKSKRLNTILNTMSTIYSTGKVCNPNNPQE
CLLLEPGLNEIMANSLDYNERLWAWESWRSEVGKQLRPLYEEYVVLKNEMARANHYEDYGDYWRGDYEVN
GVDSYDYSRGQLIEDVEHTFEEIKPLYEHLHAYVRAKLINAYPSYISPIGCLPAHLLGDMWGRFWTNLYS
LTVPFGQKPNIDVTDAMVDQAWDAQRIFKEAEKFFVSVGLPNMTQRFWENSMLTDPGNVQKVVCHPTAWD
LGKGDFRILMCTKVTMDDFLTAHHEMGHIQYDMAYAAQPFLLRNGANEGFHEAVGEIMSLSAATPKHLKS
IGLLSPDFQEDNETEINFLLKQALTIVGTLPFTYMLEKWRWMVFKGEIPKDQWMKKWWEMKREIVGVVEP
VPHDETYCDPASLFHVSNDYSFIRYYTRTLYQFQFQEALCQAAKHEGPLHKCDISNSTEAGQKLLNMLRL
GKSEPWTLALENVVGAKNMNVRPLLDYFEPLFTWLKDQNKNSFVGWSTDWSPYADQSIKVRISLKSALGN
KAYEWNDNEIYLFRSSVAYAMRKYFLEVKNQMILFGEEDVRVANLKPRISFNFFVTAPKNVSDIIPRTEV
EKAIRMSRSRINDAFRLNDNSLEFLGIQPTLGPPNQPPVSIWLIVFGVVMGVIVVGIVVLIFTGIRDRKK
KNKARNEENPYASIDISKGENNPGFQNTDDVQTSF

>AAY57872.1 angiotensin converting enzyme 2 [Chlorocebus aethiops]
MSSSSWLLLSLVAVTAAQSTIEEQAKTFLDKFNHEAEDLFYQSSLASWNYNTNITEENVQNMNNAGEKWS
AFLKEQSTLAQMYPLQAIQNLTVKLQLQALQQNGSSVLSEDKSKRLNTILNTMSTIHSTGKVCNPNNPQE
CLLLDPGLNEIMEKSLDYNERLWAWEGWRSEVGKQLRPLYEEYVVLKNEMARANHYKDYGDYWRGDYEVN
GVDGYDYNRDQLIEDVERTFEEIKPLYEHLHAYVRAKLMNAYPSYISPTGCLPAHLLGDMWGRFWTNLYS
LTVPFGQKPNIDVTDAMVNQAWNAQRIFKEAEKFFVSVGLPNMTQGFWENSMLTDPGNVQKVVCHPTAWD
LGKGDFRIIMCTKVTMDDFLTAHHEMGHIQYDMAYAAQPFLLRNGANEGFHEAVGEIMSLSAATPKHLKS
IGLLSPDFQEDNETEINFLLKQALTIVGTLPFTYMLEKWRWMVFKGEIPKDQWMKKWWEMKREIVGVVEP
VPHDETYCDPASLFHVSNDYSFIRYYTRTLYQFQFQEALCQAAKHEGPLHKCDISNSTEAGQKLLNMLKL
GKSEPWTLALENVVGAKNMSVRPLLNYFEPLFTWLKDQNKNSFVGWSTDWSPYADQSIKVRISLKSALGA
NAYKWNDNEMYLFRSSVAYAMRQYFLENKHQTILFGEEDVRVADLKPRISFNFYVTAPKNVSDIIPRTEV
EEAIRFSRSRINDAFQLNDNSLEFLGIQSTLVPPYQSPITTWLIVFGVVMAVIVAGIVVLIFTGIRDRKK
KNQARSEENPYASIDISKGENNPGFQNTDDVQTSF

Part 2: Creating a sequence alignment with Phylogeny.fr

  • I went to the website www.phylogeny.fr. Then, I clicked 'Phylogeny analysis’, and clicked on the text ‘One Click'.
  • Then, I clicked on ‘Upload your set of sequences in FASTA, EMBL, or NEXUS format’. I copied the protein sequences from Week 4 Talk Page.
  • I used Command-V to paste my sequences in the field and clicked 'Submit".
    • In order to properly align the sequences, I first pasted them into a Word document.
  • I found the numbered tabs located just beneath the text One Click Mode, and clicked on the tab labeled 3. Alignment. Prior to this, I saw the pages named Alignment results, Phylogeny results, and Tree rendering results.
  • Positions are color-coded to indicate their conservation. Blue highlighting meant high conservation (the sequences are identical or very similar), gray highlighting means lower conservation, and white highlighting means little conservation.
  • Under Outputs, I clicked on Alignment in Clustal Format.
    • This showed my sequences with the amount of conservation indicated below them. The amount of conservation corresponded to the color-coded highlights shown above.
    • Key:
      • “*” for invariant
      • “:” for highly conserved
      • “.” for weakly conserved
      • Space for not conserved
    • Below are the class' alignments
XP_0061228      ----------MLSHL---------------WILC--SLTVVVKSQDITQE-AINFLSEFN
XP_416822.      ----------MLLHF---------------WLLC--GLSAVVTPQDVTQE-AQTFLAEFN
ETE61880.1      MLMKQAPVRKPSSRSFTHPAFFDLKGNMLTWLCLTWSLVVLALAQDETK-VATKFLEQFD
AGZ48803.1      ----------MSGSS---------------WLLL--SLVAVTTAQSTTEDEAKMFLDKFN
NP_0011165      ----------MSGSF---------------WLLL--SLIPVTAAQSTTEELAKTFLEKFN
XP_0313017      ----------MSGSF---------------WLLL--SLVAVTAAQSTTEELAKTFLEEFN
QLH93383.1      ----------MSGSS---------------WLLL--SLVAVTAAQSTSDEEAKTFLEKFN
BAE53380.1      ----------MLGSS---------------WLLL--SLAALTAAQSTTEDLAKTFLEKFN
CCP86723.1      ------------------------------------------------------------
XP_0258425      ----------MSGSS---------------WLLL--SLAALTAAQST-EDLVNTFLEKFN
AAX63775.1      ----------MSGSF---------------WLLL--SFAALTAAQSTTEELAKTFLETFN
AAX59005.1      ----------MSGSF---------------WLLL--SFAALTAAQSTTEELAKTFLEKFN
NP_0011239      ----------MSSSS---------------WLLL--SLVAVTTAQSLTEENAKTFLNNFN
AAW78017.1      ----------MSSSC---------------WLLL--SLVAVATAQSLIEEKAESFLNKFN
XP_0053160      MGSCPGARGKMLGSS---------------WLLL--SFVAVTAAQSTIEELAKTFLDKFN
AAY57872.1      ----------MSSSS---------------WLLL--SLVAVTAAQSTIEEQAKTFLDKFN
NP_0013583      ----------MSSSS---------------WLLL--SLVAVTAAQSTIEEQAKTFLDKFN
NP_0011246      ----------MSGSS---------------WLLL--SLVAVTAAQSTIEEQAKTFLDKFN
   
                                                                       
XP_0061228      VQAEDLSYASSLASWNYNTNITDENAKKMNEAGAKWSVFYDEASTNASKYAIDKITNHTV
XP_416822.      VRAEDISYENSLASWNYNTNITEETARKMSEAGAKWAAFYEEASRNASRFSLANIQDAVT
ETE61880.1      ARATDLYYNASIASWNYNTNLTEENAKIMHEKDNIFSKFYGEACRNASMFNVNHITDETI
AGZ48803.1      TKAEDLSHQSSLASWDYNTNINDENVQKMDEAGAKWSAFYEEQSKLAKNYSLEQIQNVTV
NP_0011165      LEAEDLAYQSSLASWTINTNITDENIQKMNDARAKWSAFYEEQSRIAKTYPLDEIQTLIL
XP_0313017      HEAEDLSYQSSLASWNYNTNITDENVQKMNDARAKWSTFYEEKSKTAKTYPLEEIQNVTL
QLH93383.1      SEAEELSYQSSLASWNYNTNITDENVQKMNVAGAKWSTFYEEQSKIAKNYQLQNIQNDTI
BAE53380.1      YEAEELSYQNSLASWNYNTNITDENIQKMNIAGAKWSAFYEEESQHAKTYPLEEIQDPII
CCP86723.1      ------------------------------------------------------------
XP_0258425      YEAEELSYQSSLASWDYNTNISDENVQKMNNAGAKWSAFYEEQSKLAKTYPLEEIQDSTV
AAX63775.1      YEAQELSYQSSVASWNYNTNITDENAKNMNEAGAKWSAYYEEQSKLAQTYPLAEIQDAKI
AAX59005.1      HEAEELSYQSSLASWNYNTNITDENVQKMNEAGAKWSAFYEEQSKLAKTYPLAEIHNTTV
NP_0011239      QEAEDLSYQSSLASWNYNTNITEENAQKMSEAAAKWSAFYEEQSKTAQSFSLQEIQTPII
AAW78017.1      QEAEDLSYQSSLASWNYNTNITEENAQKMNEAAAKWSAFYEEQSKIAQNFSLQEIQNATI
XP_0053160      QEAEDLDYQRSLAAWNYNTNITEENTQKMNEAEAKWSAFYEEQSKLATAYPLQEIQNFTL
AAY57872.1      HEAEDLFYQSSLASWNYNTNITEENVQNMNNAGEKWSAFLKEQSTLAQMYPLQAIQNLTV
NP_0013583      HEAEDLFYQSSLASWNYNTNITEENVQNMNNAGDKWSAFLKEQSTLAQMYPLQEIQNLTV
NP_0011246      HEAEDLFYQSSLASWNYNTNITEENVQNMNNAGDKWSAFLKEQSTLAQMYPLQEIQNLTV
   
                                                                       
XP_0061228      KLQLQSLQGKGTSVLSGEKYNELNKILSTMSTFYSTGTVCKPDNPDICLPLEPGLDAIMA
XP_416822.      RLQIQSLQDRGSSVLSPEKYSRLNSVMNSMSTIYSTGVVCKATEPFDCLVLEPGLDDIMA
ETE61880.1      KLQIRLLQ-SGSTDSTKD---QLDTVLHKMSTLYS-------------------LDDIMA
AGZ48803.1      KLQLQILQQSGSPVLSEDKSKRLNSILNAMSTIYSTGKVCKPNKPQECLLLEPGLDNIMG
NP_0011165      KRQLQALQQSGTSGLSADKSKRLNTILNTMSTIYSSGKVLDPNNPQECLVLEPGLDEIME
XP_0313017      KRQLQALQQSGASALSADKSKRLTTVLSTMSTIYSSGEVCDPNNPQECLVLEPGLDDIME
QLH93383.1      KRQLQALQLSGSSALSADKNQRLNTILNTMSTIYSTGKVCNPGNPQECSLLEPGLDNIME
BAE53380.1      KRQLRALQQSGSSVLSADKRERLNTILNAMSTIYSTGKACNPNNPQECLLLEPGLDDIME
CCP86723.1      ------------------------------------------------------------
XP_0258425      KRQLRALQHSGSSVLSADKNQRLNTILNSMSTIYSTGKACNPSNPQECLLLEPGLDDIME
AAX63775.1      KRQLQALQQSGSSVLSADKSQRLNTILNAMSTIYSTGKACNPNNPQECLLLEPGLDNIME
AAX59005.1      KRQLQALQQSGSSVLSADKSQRLNTILNAMSTIYSTGKACNPNNPQECLLLEPGLDDIME
NP_0011239      KRQLQALQQSGSSALSADKNKQLNTILNTMSTIYSTGKVCNPKNPQECLLLEPGLDEIMA
AAW78017.1      KRQLKALQQSGSSALSPDKNKQLNTILNTMSTIYSTGKVCNSMNPQECFLLEPGLDEIMA
XP_0053160      KRQLQALQQSGSSALSANKREQLNTILNTMSTIYSTGKVCNPKKPQECLLLEPGLDEIMA
AAY57872.1      KLQLQALQQNGSSVLSEDKSKRLNTILNTMSTIHSTGKVCNPNNPQECLLLDPGLNEIME
NP_0013583      KLQLQALQQNGSSVLSEDKSKRLNTILNTMSTIYSTGKVCNPDNPQECLLLEPGLNEIMA
NP_0011246      KLQLQALQQNGSSVLSEDKSKRLNTILNTMSTIYSTGKVCNPNNPQECLLLEPGLNEIMA                                                                            


XP_0061228      SSTDYFERLWAWEGWRADVGKKMRELYERYVELENEAARLNKYSDYGDYWRGNYEVNDPT
XP_416822.      NSIDYHERLWAWEGWRADVGRMMRPLYEEYVELKNEAARLNNYSDYGDYWRANYETDYPE
ETE61880.1      NNWNYPERLWAWEGWRANVGKKMRPLYETYVELKNKYARLRGYADYGDYWRANYEVDLPG
AGZ48803.1      TSKDYNERLWAWEGWRAEVGKQLRPLYEEYVVLKNEMARGYHYEDYGDYWRRDYETEESP
NP_0011165      NSKDYSRRLWAWESWRAEVGKQLRPLYEEYVVLENEMARANNYEDYGDYWRGDYEVTGTG
XP_0313017      NSKDYNQRLWAWEGWRAEVGKQLRPLYEEYVVLKNEMARANNYEDYGDYWRGDYEVMWAG
QLH93383.1      SSKDYNERLWAWEGWRSEVGKQLRPLYEEYVVLKNEMARANHYEDYGDYWRGDYETEGAN
BAE53380.1      NSKDYNERLWAWEGWRSEVGKQLRPLYEEYVALKNEMARANNYEDYGDYWRGDYEEEWAD
CCP86723.1      ------------------------------------------------------------
XP_0258425      NSKDYNERLWAWEGWRSEVGKQLRPLYEEYVALKNEMARANNYEDYGDYWRGDYEEEWEN
AAX63775.1      NSKDYNERLWAWEGWRAEVGKQLRPLYEEYVALKNEMARANNYEDYGDYWRGDYEEEWTG
AAX59005.1      NSKDYNERLWAWEGWRAEVGKQLRPLYEEYVALKNEMARANNYEDYGDYWRGDYEEEWTD
NP_0011239      TSTDYNSRLWAWEGWRAEVGKQLRPLYEEYVVLKNEMARANNYNDYGDYWRGDYEAEGAD
AAW78017.1      TSTDYNRRLWAWEGWRAEVGKQLRPLYEEYVVLKNEMARANNYEDYGDYWRGDYEAEGVE
XP_0053160      NSTDYNERLWVWEGWRSEVGKQLRPLYEEYVVLKNEMARANNYEDYGDYWRGDYEAEGAD
AAY57872.1      KSLDYNERLWAWEGWRSEVGKQLRPLYEEYVVLKNEMARANHYKDYGDYWRGDYEVNGVD
NP_0013583      NSLDYNERLWAWESWRSEVGKQLRPLYEEYVVLKNEMARANHYEDYGDYWRGDYEVNGVD
NP_0011246      NSLDYNERLWAWESWRSEVGKQLRPLYEEYVVLKNEMARANHYEDYGDYWRGDYEVNGVD                                                                            


XP_0061228      EYAYSRNQLMEDVEATFEQIKPLYRELHAYVRYRLEKFYGSDHISSTGCLPAHLLGDMWG
XP_416822.      EYKYSRDQLVQDVEKTFEQIKPLYQHLHAYVRHRLEQVYGSELINPTGCLPAHLLGDMWG
ETE61880.1      KFQYQREQLITDVESTFKQ------QLHAYVRHHLYKRYGPELINPEGAIPAHLLGDMWG
AGZ48803.1      GPGYSRDQLMKDVERIFTEIKPLYEHLHAYVRAKLMDTY-PFHISPTGCLPAHLLGDMWG
NP_0011165      DYDYSRNQLMEDVERTFAEIKPLYEHLHAYVRAKLMDAY-PSRISPTGCLPAHLLGDMWG
XP_0313017      DYDYSRDQLMGDVEHTFAEIKPLYEHLHAYVRAKLMDVY-PSHISPTGCLPAHLLGDMWG
QLH93383.1      GYNYSRDHLIEDVEHIFTQIKPLYEHLHAYVRAKLMDNY-PSHISPTGCLPAHLLGDMWG
BAE53380.1      GYSYSRNQLIEDVEHTFTQIKPLYEHLHAYVRAKLMDAY-PSRISPTGCLPAHLLGDMWG
CCP86723.1      ------------------------------------------------------------
XP_0258425      GYNYSRNQLIDDVEHTFTQIMPLYQHLHAYVRTKLMDTY-PSYISPTGCLPAHLLGDMWG
AAX63775.1      GYNYSRNQLIQDVEDTFEQIKPLYQHLHAYVRAKLMDTY-PSRISRTGCLPAHLLGDMWG
AAX59005.1      GYNYSRSQLIKDVEHTFTQIKPLYQHLHAYVRAKLMDTY-PSRISPTGCLPAHLLGDMWG
NP_0011239      GYNYNRNQLIEDVERTFAEIKPLYEHLHAYVRRKLMDTY-PSYISPTGCLPAHLLGDMWG
AAW78017.1      GYNYNRNQLIEDVENTFKEIKPLYEQLHAYVRTKLMEVY-PSYISPTGCLPAHLLGDMWG
XP_0053160      GYGYNRNQLIEDVERTFAEIKPLYEHLHAYVRAKLMNTY-PSYISPTGCLPAHLLGDMWG
AAY57872.1      GYDYNRDQLIEDVERTFEEIKPLYEHLHAYVRAKLMNAY-PSYISPTGCLPAHLLGDMWG
NP_0013583      GYDYSRGQLIEDVEHTFEEIKPLYEHLHAYVRAKLMNAY-PSYISPIGCLPAHLLGDMWG
NP_0011246      SYDYSRGQLIEDVEHTFEEIKPLYEHLHAYVRAKLINAY-PSYISPIGCLPAHLLGDMWG
                                                                           
XP_0061228      RFWTNLYALTVPYPDKPNIDVTSEMVKKNWNATKIFKAAEDFFMSVGLYKMTEGFWKNSM
XP_416822.      RFWTNLYNLTVPYPEKPNIDVTSAMAQKNWDAMKIFKTAEAFFASIGLYNMTEGFWTNSM
ETE61880.1      RFWTNLYPLMVPYPNKTSIDVTSAMEKKKWTVNSIFKAAEHFFISIGLFNMTVGFWKNSM
AGZ48803.1      RFWTNLYPLTVPFGQKPNIDVTDEMLKQGWDADRIFKEAEKFFVSVGLPNMTEGFWNNSM
NP_0011165      RFWTNLYPLTVPFGEKPSIDVTEAMVNQSWDAIRIFEEAEKFFVSIGLPNMTQGFWNNSM
XP_0313017      RFWTNLYSLTVPFGQKPNIDVTEAMENQSWDAKRIFKEAEKFFVSIGLPNMTQGFWDNSM
QLH93383.1      RFWTNLYPLTVPFRQKPNIDVTDAMVNQTWDANRIFKEAEKFFVSVGLPKMTQTFWENSM
BAE53380.1      RFWTNLYPLMVPFRQKPNIDVTDAMVNQSWDARRIFEEAETFFVSVGLPNMTEGFWQNSM
CCP86723.1      ----------------------------------------------GLPNMTEGFWQNSM
XP_0258425      RFWTNLYPLTVPFGQKPNIDVTNAMVNQSWDARKIFKEAEKFFVSVGLPNMTQGFWENSM
AAX63775.1      RFWTNLYPLTVPFGQKPNIDVTDAMVNQNWDARRIFKEAEKFFVSVGLPNMTQGFWENSM
AAX59005.1      RFWTNLYPLTVPFGQKPNIDVTDAMVNQSWDARRIFKEAEKFFVSVGLPNMTQGFWENSM
NP_0011239      RFWTNLYPLTVPFAQKPNIDVTDAMMNQGWDAERIFQEAEKFFVSVGLPHMTQGFWANSM
AAW78017.1      RFWTNLYPLTTPFLQKPNIDVTDAMVNQSWDAERIFKEAEKFFVSVGLPQMTPGFWTNSM
XP_0053160      RFWTNLYSLTVPFPEKPNIDVTDAMINQNWNAVRIFKEAEKFFVSVGLPNMTQGFWENSM
AAY57872.1      RFWTNLYSLTVPFGQKPNIDVTDAMVNQAWNAQRIFKEAEKFFVSVGLPNMTQGFWENSM
NP_0013583      RFWTNLYSLTVPFGQKPNIDVTDAMVDQAWDAQRIFKEAEKFFVSVGLPNMTQGFWENSM
NP_0011246      RFWTNLYSLTVPFGQKPNIDVTDAMVDQAWDAQRIFKEAEKFFVSVGLPNMTQRFWENSM
                                                              ** :**  ** ***
XP_0061228      ITEPNDGRKVVCHPTAWDMGKKDYRIKMCTKVSMDDFLTVHHEMGHIEYDMAYSNLSYLL
XP_416822.      LTEPTDNRKVVCHPTAWDMGKNDYRIKMCTKVTMDDFLTAHHEMGHIEYDMAYSVQPFLL
ETE61880.1      LEEPKGGRKVVCHPTAWDMGKEDYRIKMCTKINMEDFLTAHHEMGHIEYDMAYANQPFLL
AGZ48803.1      LTEPGDGRKVVCHPTAWDLGKGDFRIKMCTKVTMEDFLTAHHEMGHIQYDMAYASQPYLL
NP_0011165      LTEPGDGRKVVCHPTAWDLGKGDFRIKMCTKVTMDDFLTAHHEMGHIQYDMAYAIQPYLL
XP_0313017      LTEPGDGRKVVCHPTAWDLGKGDFRIKMCTKVTMDDFLTAHHEMGHIQYDMAYAIQPFLL
QLH93383.1      LTEPGDGRKVVCHPTAWDLGKHDFRIKMCTKVTMDDFLTAHHEMGHIQYDMAYAMQPYLL
BAE53380.1      LTEPGDNRKVVCHPTAWDLGKRDFRIKMCTKVTMDDFLTAHHEMGHIQYDMAYAEQPFLL
CCP86723.1      LTEPGDNRKVVCHPTAWDLGKHDFRIKMCTKVTMDDFLTAHHEMGHIQYDMAYAAQPFLL
XP_0258425      LTEPSDSRKVVCHPTAWDLGKGDFRIKMCTKVTMDDFLTAHHEMGHIQYDMAYAAQPFLL
AAX63775.1      LTEPGDGRKVVCHPTAWDLGKGDFRIKMCTKVTMDDFLTAHHEMGHIQYDMAYAAQPFLL
AAX59005.1      LTEPGDSRKVVCHPTAWDLGKGDFRIKMCTKVTMDDFLTAHHEMGHIQYDMAYAVQPFLL
NP_0011239      LTEPADGRKVVCHPTAWDLGHGDFRIKMCTKVTMDNFLTAHHEMGHIQYDMAYARQPFLL
AAW78017.1      LTEPGDDRKVVCHPTAWDLGHGDFRIKMCTKVTMDNFLTAHHEMGHIQYDMAYAKQPFLL
XP_0053160      LTEPTDGRKVVCHPTAWDLQKGDFRIKMCTKVTMDNFLTAHHEMGHIQYDMAYAMQPYLL
AAY57872.1      LTDPGNVQKVVCHPTAWDLGKGDFRIIMCTKVTMDDFLTAHHEMGHIQYDMAYAAQPFLL
NP_0013583      LTDPGNVQKAVCHPTAWDLGKGDFRILMCTKVTMDDFLTAHHEMGHIQYDMAYAAQPFLL
NP_0011246      LTDPGNVQKVVCHPTAWDLGKGDFRILMCTKVTMDDFLTAHHEMGHIQYDMAYAAQPFLL
                : :* . .*.********: : *:** ****:.*::***.*******:*****:  .:**
XP_0061228      RSGANEGFHEAVGEIMSLSAATPKHLKSLDLLEPTFQEDNETDINFLLKQALTIVGTMPF
XP_416822.      RNGANEGFHEAVGEIMSLSAATPQHLKSLDLLEPTFQEDEETEINFLLKQALTIVGTMPF
ETE61880.1      RNGANEGFHEAVGEIMSLSAATPKYLKSLGLLEPTFQEDAETDINFLLKQALTIVGTMPF
AGZ48803.1      RNGANEGFHEAVGEVMSLSVATPKHLKTMGLLSPDFREDNETEINFLLKQALNIVGTLPF
NP_0011165      RNGANEGFHEAVGEIMSLSAATPHYLKALGLLPPDFYEDSETEINFLLKQALTIVGTLPF
XP_0313017      RNGANEGFHEAVGEIMSLSAATPHYLKALGLLPADFYEDSETEINFLLKQALTIVGTLPF
QLH93383.1      RNGANEGFHEAVGEIMSLSAATPKHLKNIGLLPPDFYEDNETEINFLLKQALTIVGTLPF
BAE53380.1      RNGANEGFHEAVGEIMSLSAATPNHLKNIGLLPPDFSEDSETDINFLLKQALTIVGTLPF
CCP86723.1      RNGANEGFHEAVGEIMSLSAATPNHLKNIGLLPPDFSEDSETDINFLLKQALTIVGTLPF
XP_0258425      RNGANEGFHEAVGEIMSLSAATPNHLKNIGLLPPSFFEDSETEINFLLKQALTIVGTLPF
AAX63775.1      RNGANEGFHEAVGEIMSLSAATPNHLKTIGLLSPAFSEDNETEINFLLKQALTIVGTLPF
AAX59005.1      RNGANEGFHEAVGEIMSLSAATPNHLKTIGLLSPGFSEDSETEINFLLKQALTIVGTLPF
NP_0011239      RNGANEGFHEAVGEIMSLSAATPKHLKSIGLLPSDFQEDSETEINFLLKQALTIVGTLPF
AAW78017.1      RNGANEGFHEAVGEIMSLSAATPKHLKSIGLLPSNFQEDNETEINFLLKQALTIVGTLPF
XP_0053160      RNGANEGFHEAVGEIMSLSASTPKHLKSIGLLPSDFREDSETEINFLLKQALTIVGTLPF
AAY57872.1      RNGANEGFHEAVGEIMSLSAATPKHLKSIGLLSPDFQEDNETEINFLLKQALTIVGTLPF
NP_0013583      RNGANEGFHEAVGEIMSLSAATPKHLKSIGLLSPDFQEDNETEINFLLKQALTIVGTLPF
NP_0011246      RNGANEGFHEAVGEIMSLSAATPKHLKSIGLLSPDFQEDNETEINFLLKQALTIVGTLPF
                *.************:****.:**::** :.** . * ** **:*********.****:**
XP_0061228      TYMLEKWRWMVFKGDIPKDEWMKKWWEMKRAIVGVVEPVPHDETYCDPAALFHVANDYSF
XP_416822.      TYMLEKWRWMVFNGEITKQEWTKRWWKMKREIVGVVEPVPHDETYCDPAALFHVANDYSF
ETE61880.1      TYMLEKWRWMVFAEQIPKDQWMKKWWEMKREIVGVVEPLPHNEEYCDPAALFHVANDYSF
AGZ48803.1      TYMLEKWRWMVFKGEIPKEEWMKKWWEMKRKIVGVVEPVPHDETYCDPASLFHVANDYSF
NP_0011165      TYMLEKWRWMVFKGEIPKEQWMQKWWEMKREIVGVVEPLPHDETYCDPACLFHVAEDYSF
XP_0313017      TYMLEKWRWMVFKGEIPKEQWMQKWWEMKREIVGVVEPLPHDETYCDPACLFHVAEDYSF
QLH93383.1      TYMLEKWRWMVFSGQIPKEQWMKKWWEMKREIVGVVEPVPHDETYCDPASLFHVANDYSF
BAE53380.1      TYMLEKWRWMVFKGEIPKEQWMQKWWEMKRDIVGVVEPLPHDETYCDPAALFHVANDYSF
CCP86723.1      TYMLEKWRWMVFKGEIPKEQWMQKWWEMKRDIVGVVEPLPHDETYCDPAALFHVANDYSF
XP_0258425      TYMLEKWRWMVFKGEIPKDQWMKTWWEMKRNIVGVVEPVPHDETYCDPASLFHVANDYSF
AAX63775.1      TYMLEKWRWMVFKGAIPKEQWMQKWWEMKRNIVGVVEPVPHDETYCDPASLFHVANDYSF
AAX59005.1      TYMLEKWRWMVFKGEIPKEQWMQKWWEMKREIVGVVEPVPHDETYCDPASLFHVANDYSF
NP_0011239      TYMLEKWRWMVFRGEIPKEQWMKKWWEMKREIVGVVEPLPHDETYCDPASLFHVSNDYSF
AAW78017.1      TYMLEKWRWMVFQDKIPREQWTKKWWEMKREIVGVVEPLPHDETYCDPASLFHVSNDYSF
XP_0053160      TYMLEKWRWMVFKGEIPKDQWMKKWWEMKREIVGVMEPVPHDETYCDPAALYHVSNDFSF
AAY57872.1      TYMLEKWRWMVFKGEIPKDQWMKKWWEMKREIVGVVEPVPHDETYCDPASLFHVSNDYSF
NP_0013583      TYMLEKWRWMVFKGEIPKDQWMKKWWEMKREIVGVVEPVPHDETYCDPASLFHVSNDYSF
NP_0011246      TYMLEKWRWMVFKGEIPKDQWMKKWWEMKREIVGVVEPVPHDETYCDPASLFHVSNDYSF
                ************   *..::* : **:*** ****:**:**:* *****.*:**::*:**
XP_0061228      IRYYTRTIYQFQFQEALCKAANHGGLLHTCDITNSMAAGQKLRDMLALGRSQPWTKALES
XP_416822.      IRYYTRTIYQFQFQEALCKAANHTGPLHKCDITNSTAAGGNLRQLLELGKSKPWTQALES
ETE61880.1      IRYYTRTIYQFQFQEALCKAAGHTKELYKCDISDSTNAGRILKDMLALGSSQPWTKALES
AGZ48803.1      IRYYTRTIFEFQFHEALCRIAQHDGPLHKCDISNSTDAGKKLHQMLSVGKSQAWTKTLED
NP_0011165      IRYYTRTIYQFQFHEALCRTAKHEGPLYKCDISNSTEAGQKLLQMLSLGKSEPWTLALEN
XP_0313017      IRYYTRTIYQFQFHEALCQIAKHEGPLYKCDISNSTEAGQKLLQMLSLGKSEPWTLALEG
QLH93383.1      IRYYTRTIYQFQFQEALCQTAKHEGPLHKCDISNSTEAGQKLLQMLSLGKSKPWTLALER
BAE53380.1      IRYYTRTIYQFQFQEALCQIAKHEGPLYKCDISNSSEAGQKLHEMLSLGRSKPWTFALER
CCP86723.1      IRYYTRTIYQFQFQEALCQIAKHEGPLYKCDISNSREAGQKLHEMLSLGRSKPWTFALER
XP_0258425      IRYYTRTIYQFQFQEALCQIAKHEGPLHKCDISNSSEAGQKLLEMLKLGKSKPWTYALEI
AAX63775.1      IRYYTRTIYQFQFQEALCQIAKHEGPLHKCDISNSTEAGKKLLEMLSLGRSEPWTLALER
AAX59005.1      IRYYTRTIYQFQFQEALCRIAKHEGPLHKCDISNSSEAGKKLLQMLTLGKSKPWTLALEH
NP_0011239      IRYYTRTIYQFQFQEALCQAAKYNGSLHKCDISNSTEAGQKLLKMLSLGNSEPWTKALEN
AAW78017.1      IRYYTRTIYQFQFQEALCQAAKHDGPLHKCDISNSTEAGQKLLNMLSLGNSGPWTLALEN
XP_0053160      IRYYTRTIYQFQFQEALCQAAKHEGPLHKCDISNSTEAGQKLLNMLRLGKSKPWTLALEN
AAY57872.1      IRYYTRTLYQFQFQEALCQAAKHEGPLHKCDISNSTEAGQKLLNMLKLGKSEPWTLALEN
NP_0013583      IRYYTRTLYQFQFQEALCQAAKHEGPLHKCDISNSTEAGQKLFNMLRLGKSEPWTLALEN
NP_0011246      IRYYTRTLYQFQFQEALCQAAKHEGPLHKCDISNSTEAGQKLLNMLRLGKSEPWTLALEN
                *******:::***:****. * :   *:.***::*  **  * .:* :* * .** :** 
XP_0061228      ITGEKKMNATPLLHYFEPLYQWLIKNNSGRAVGWNTFWSPYSGNAIKVRISLKTALGDNA
XP_416822.      ATGEKYMNATPLLHYFEPLFNWLQKNNSGRSIGWNTDWTPYSDNAIKVRISLKAALGDDA
ETE61880.1      ITGSLKMDAKPFCQYFDPLLKWLEKTNSNENVGWNVNWTPYSKDAIKVRISLKAALGDDA
AGZ48803.1      IVDSRNMDVGPLLKYFEPLYTWLQEQNRKSYVGWNTDWSPYSDQSIKVRISLKSALGENA
NP_0011165      IVGVKTMDVKPLLSYFEPLLTWLKAQNGNSSVGWNTDWTPYADQSIKVRISLKSALGEDA
XP_0313017      LVGVKTMDVKPLLNYFEPLLTWLKDQNRNSFVGWSTDWTPYTDQSIKVRISLKSALGDKA
QLH93383.1      VVGTKNMDVRPLLNYFEPLLTWLKEQNKNSFVGWNTDWSPYAAQSIKVRISLKSALGEKA
BAE53380.1      VVGAKTMDVRPLLNYFEPLFTWLKEQNRNSFVGWNTDWSPYADQSIKVRISLKSALGEKA
CCP86723.1      VVGAKTMDVRPLLNYFEPLFTWLKEQNRNSFVGWNTDWSPYADQSIKVRISLKSALGEKA
XP_0258425      VVGAKNMDVRPLLNYFEPLFTWLKEQNRNSFVGWNTDWSPYADQSIKVRISLKSALGEKA
AAX63775.1      VVGAKNMNVTPLLNYFEPLFTWLKEQNRNSFVGWDTDWRPYSDQSIKVRISLKSALGEKA
AAX59005.1      VVGEKKMNVTPLLKYFEPLFTWLKEQNRNSFVGWNTDWRPYADQSIKVRISLKSALGDEA
NP_0011239      VVGARNMDVKPLLNYFQPLFDWLKEQNRNSFVGWNTEWSPYADQSIKVRISLKSALGANA
AAW78017.1      VVGSRNMDVKPLLNYFQPLFVWLKEQNRNSTVGWSTDWSPYADQSIKVRISLKSALGKNA
XP_0053160      VVGARNMDVRPLLNYFEPLFGWLKDQNRNSFVGWNTNWSPYTDQSIKVRISLKSALGEEA
AAY57872.1      VVGAKNMSVRPLLNYFEPLFTWLKDQNKNSFVGWSTDWSPYADQSIKVRISLKSALGANA
NP_0013583      VVGAKNMNVRPLLNYFEPLFTWLKDQNKNSFVGWSTDWSPYADQSIKVRISLKSALGDKA
NP_0011246      VVGAKNMNVRPLLDYFEPLFTWLKDQNKNSFVGWSTDWSPYADQSIKVRISLKSALGNKA
                 ..   *.. *:  **:**  **   *    :**.. * **: ::********:*** .*
XP_0061228      YEWDENELYFFKSSIAYAMRKYFLEVKNQTVSFQCTDIHVWAVTQRVSFYFAVSMPGNAT
XP_416822.      YVWDASELFLFKSSIAYAMRKYFAKEKEQNVDFQVTDIHVGEETQRVSFYLTVSMPGNVS
ETE61880.1      YNWDESEMFLFKSTIAYAMQKYFLEVKNKTVPF---------------------------
AGZ48803.1      YEWNDNEMYLFRSSVAYAMREYFLKEKHQTILFGAENVWVSNLKPRISFNFHVTSPGNLS
NP_0011165      YEWNDNEMYLFRSSIAYAMRNYFSSAKNETIPFGAVDVWVSDLKPRISFNFFVTSPANMS
XP_0313017      YEWNDNEMYLFQSSLAYAMRKYFLKVQNQTILFGVEDVWVSDLKPRISFSFFVTSPKNVS
QLH93383.1      YEWNDSEMYLFRSSVAYAMREYFSKFKKQTIPFEEESVRVSDLKPRVSFIFFVTLPKNVS
BAE53380.1      YEWNDNEMYFFQSSIAYAMREYFSKVKNQTIPFVGKDVRVSDLKPRISFNFIVTSPENMS
CCP86723.1      YEWNDNEMYFFQSSIAYAMREYFSKVKKQTIPFVDKDVRVSDLKPRISFNFIVTSPENMS
XP_0258425      YEWNNNEMYLFRSSIAYAMRRYFSEVKKQTIPFVEDNVWVSDLKPRISFNFFVTSPGNVS
AAX63775.1      YEWNDNEMYLFRSSIAYAMREYFSKVKNQTIPFVEDNVWVSDLKPRISFNFFVTFSNNVS
AAX59005.1      YEWNDNEMYLFRSSVAYAMREYFSKVKNQTIPFVEDNVWVSNLKPRISFNFFVTASKNVS
NP_0011239      YEWTNNEMFLFRSSVAYAMRKYFSIIKNQTVPFLEEDVRVSDLKPRVSFYFFVTSPQNVS
AAW78017.1      YEWTDNEMYLFRSSVAYAMREYFSREKNQTVPFGEADVWVSDLKPRVSFNFFVTSPKNVS
XP_0053160      YQWNDNEMYLFRSSVAYAMRMYFSKVKNQTIPFGEKDVWVSDEKPRISFNFFVTAPQNVS
AAY57872.1      YKWNDNEMYLFRSSVAYAMRQYFLENKHQTILFGEEDVRVADLKPRISFNFYVTAPKNVS
NP_0013583      YEWNDNEMYLFRSSVAYAMRQYFLKVKNQMILFGEEDVRVANLKPRISFNFFVTAPKNVS
NP_0011246      YEWNDNEIYLFRSSVAYAMRKYFLEVKNQMILFGEEDVRVANLKPRISFNFFVTAPKNVS
                * *  .*:::*.*::****. **   : : : *                           
XP_0061228      DFIPKSEVETAIRMSRGRINEAFRLDDNTLEFEGLLPTLASPYEPPVTVWLILFGVVMGV
XP_416822.      DIVPRADVEKAIRMSRGRISEAFRLDDNTLEFDGIVPTLATPYKPPVTIWLILFGVVMSL
ETE61880.1      ------------HLSRDRINEAFKLTDQTLEFIGLLPTLAPPYESPITVWLVAFGVVIGL
AGZ48803.1      DIIPRPEVEGAIRMSRSRINDAFRLDDNSLEFLGIQPTLGPPYQPPVTIWLIVFGVVMAV
NP_0011165      DIIPRSDVEKAISMSRSRINDAFRLDDNTLEFLGIQPTLGPPDEPPVTVWLIIFGVVMGL
XP_0313017      DIIPRTEVEEAIRMSRSRINDAFRLDDNSLEFLGIQPTLGPPYEPPVTVWLIIFGIVMGL
QLH93383.1      AVIPRAEVEEAIRMSRSRINDVFRLDDNSLEFLGIQPTLEPPYQPPVTIWLIVFGVVMGV
BAE53380.1      DIIPRADVEEAIRKSRGRINDAFRLDDNSLEFLGIQPTLEPPYQPPVTIWLIVFGVVMGV
CCP86723.1      DIIPRADVEEAIRKSRGRINDAFRLDDNSLEFLGIQPTLEPPYQPPVTIWLIVFGVVMGV
XP_0258425      DIIPRTEVEKAIRMYRGRINDVFRLDDNSLEFLGIQPTLGPSYEPPVTIWLIVFGVVMGV
AAX63775.1      DVIPRSEVEDAIRMSRSRINDAFRLDDNSLEFLGIEPTLSPPYRPPVTIWLIVFGVVMGA
AAX59005.1      DVIPRSEVEEAIRMSRSRINDAFRLDDNSLEFLGIQPTLSPPYQPPVTIWLIVFGVVMGV
NP_0011239      DVIPRSEVEDAIRMSRGRINDVFGLNDNSLEFLGIHPTLEPPYQPPVTIWLIIFGVVMAL
AAW78017.1      DIIPRSEVEEAIRMSRGRINDIFGLNDNSLEFLGIYPTLKPPYEPPVTIWLIIFGVVMGT
XP_0053160      DIIPRTDVEKAIRMSRGRINGVFRLDDNSLEFLGIQPTLGPPYQPPVTIWLIVFGVVMGL
AAY57872.1      DIIPRTEVEEAIRFSRSRINDAFQLNDNSLEFLGIQSTLVPPYQSPITTWLIVFGVVMAV
NP_0013583      DIIPRTEVEKAIRMSRSRINDAFRLNDNSLEFLGIQPTLGPPNQPPVSIWLIVFGVVMGV
NP_0011246      DIIPRTEVEKAIRMSRSRINDAFRLNDNSLEFLGIQPTLGPPNQPPVSIWLIVFGVVMGV
                               *.**.  * * *::*** *: .** ..  .*:: **: **:*:. 
XP_0061228      IVVGVIVLIVTGQRDRRKRMKAGTNELVQTNAIDP------ELENGEVNPAFIKHEERQT
XP_416822.      IVIGVIVLIITGQRDKRKKARGRANEAGSNCEVNPYD------EDGRSNKGFEQSEETQT
ETE61880.1      IVIGIITLEKAGSKN---------------------------------------------
AGZ48803.1      VVVGIVVLIITGIRDRRKTDQARSEE-------NPYS--SVDLSKGENNPGFQNGDDVQT
NP_0011165      VVVGIVVLIFTGIRDRRKKKQASSEE-------NPYG--SMDLSKGESNSGFQNGDDIQT
XP_0313017      VVVGIVVLIFTGIRDRRKKKQASTEE-------NPYG--SVDLSKGENNSGFQNGDDVQT
QLH93383.1      IVVGIVVLIFTGIRDRKKKNQARSEQ-------NPYA--SVDLSKGENNPGFQNVDDVQT
BAE53380.1      VVVGIFLLIFSGIRNRRKNNQARSEE-------NPYA--SVDLSKGENNPGFQNVDDVQT
CCP86723.1      VVVGIFLLIFSGIRNRRKNNQARSEE-------NPYA--SVDLSKG--------------
XP_0258425      VVVGIVLLIFSGIRNRRKNDQARGEE-------NPYA--SVDLSKGENNPGFQNVDDAQT
AAX63775.1      IVVGIVLLIVSGIRNRRKNDQAGSEE-------NPYA--SVDLNKGENNPGFQHADDVQT
AAX59005.1      VVVGIVLLIVSGIRNRRKNNQARSEE-------NPYA--SVDLSKGENNPGFQHADDVQT
NP_0011239      VVVGIIILIVTGIKGRKKKNETKREE-------NPYD--SMDIGKGESNAGFQNSDDAQT
AAW78017.1      VVVGIVILIVTGIKGRKKKNETKREE-------NPYD--SMDIGKGESNAGFQNSDDAQT
XP_0053160      IVVGIVILIFTGIRDRRRKNQTKREE-------NPYAESSMEMGKGENNPGYQNNDDVQT
AAY57872.1      IVAGIVVLIFTGIRDRKKKNQARSEE-------NPYA--SIDISKGENNPGFQNTDDVQT
NP_0013583      IVVGIVILIFTGIRDRKKKNKARSGE-------NPYA--SIDISKGENNPGFQNTDDVQT
NP_0011246      IVVGIVVLIFTGIRDRKKKNKARNEE-------NPYA--SIDISKGENNPGFQNTDDVQT
                :* *:. *  :* ..                                             
XP_0061228      SF
XP_416822.      SF
ETE61880.1      --
AGZ48803.1      SF
NP_0011165      SF
XP_0313017      SF
QLH93383.1      SF
BAE53380.1      SF
CCP86723.1      --
XP_0258425      SF
AAX63775.1      SF
AAX59005.1      SF
NP_0011239      SF
AAW78017.1      SF
XP_0053160      SF
AAY57872.1      SF
NP_0013583      SF
NP_0011246      SF

New alignment:

AGZ48803.1      ----------MSGSS---------------WLLL--SLVAVTTAQSTTEDEAKMFLDKFN
NP_0011165      ----------MSGSF---------------WLLL--SLIPVTAAQSTTEELAKTFLEKFN
XP_0313017      ----------MSGSF---------------WLLL--SLVAVTAAQSTTEELAKTFLEEFN
QLH93383.1      ----------MSGSS---------------WLLL--SLVAVTAAQSTSDEEAKTFLEKFN
BAE53380.1      ----------MLGSS---------------WLLL--SLAALTAAQSTTEDLAKTFLEKFN
XP_0258425      ----------MSGSS---------------WLLL--SLAALTAAQST-EDLVNTFLEKFN
AAX63775.1      ----------MSGSF---------------WLLL--SFAALTAAQSTTEELAKTFLETFN
AAX59005.1      ----------MSGSF---------------WLLL--SFAALTAAQSTTEELAKTFLEKFN
NP_0011239      ----------MSSSS---------------WLLL--SLVAVTTAQSLTEENAKTFLNNFN
AAW78017.1      ----------MSSSC---------------WLLL--SLVAVATAQSLIEEKAESFLNKFN
XP_0053160      MGSCPGARGKMLGSS---------------WLLL--SFVAVTAAQSTIEELAKTFLDKFN
AAY57872.1      ----------MSSSS---------------WLLL--SLVAVTAAQSTIEEQAKTFLDKFN
NP_0013583      ----------MSSSS---------------WLLL--SLVAVTAAQSTIEEQAKTFLDKFN
NP_0011246      ----------MSGSS---------------WLLL--SLVAVTAAQSTIEEQAKTFLDKFN
ETE61880.1      MLMKQAPVRKPSSRSFTHPAFFDLKGNMLTWLCLTWSLVVLALAQDETKVATK-FLEQFD
XP_0061228      ----------MLSHL---------------WILC--SLTVVVKSQDITQEAIN-FLSEFN
XP_416822.      ----------MLLHF---------------WLLC--GLSAVVTPQDVTQE-AQTFLAEFN
                                              *:    .:  :. .*.  .   : **  *:
AGZ48803.1      TKAEDLSHQSSLASWDYNTNINDENVQKMDEAGAKWSAFYEEQSKLAKNYSLEQIQNVTV
NP_0011165      LEAEDLAYQSSLASWTINTNITDENIQKMNDARAKWSAFYEEQSRIAKTYPLDEIQTLIL
XP_0313017      HEAEDLSYQSSLASWNYNTNITDENVQKMNDARAKWSTFYEEKSKTAKTYPLEEIQNVTL
QLH93383.1      SEAEELSYQSSLASWNYNTNITDENVQKMNVAGAKWSTFYEEQSKIAKNYQLQNIQNDTI
BAE53380.1      YEAEELSYQNSLASWNYNTNITDENIQKMNIAGAKWSAFYEEESQHAKTYPLEEIQDPII
XP_0258425      YEAEELSYQSSLASWDYNTNISDENVQKMNNAGAKWSAFYEEQSKLAKTYPLEEIQDSTV
AAX63775.1      YEAQELSYQSSVASWNYNTNITDENAKNMNEAGAKWSAYYEEQSKLAQTYPLAEIQDAKI
AAX59005.1      HEAEELSYQSSLASWNYNTNITDENVQKMNEAGAKWSAFYEEQSKLAKTYPLAEIHNTTV
NP_0011239      QEAEDLSYQSSLASWNYNTNITEENAQKMSEAAAKWSAFYEEQSKTAQSFSLQEIQTPII
AAW78017.1      QEAEDLSYQSSLASWNYNTNITEENAQKMNEAAAKWSAFYEEQSKIAQNFSLQEIQNATI
XP_0053160      QEAEDLDYQRSLAAWNYNTNITEENTQKMNEAEAKWSAFYEEQSKLATAYPLQEIQNFTL
AAY57872.1      HEAEDLFYQSSLASWNYNTNITEENVQNMNNAGEKWSAFLKEQSTLAQMYPLQAIQNLTV
NP_0013583      HEAEDLFYQSSLASWNYNTNITEENVQNMNNAGDKWSAFLKEQSTLAQMYPLQEIQNLTV
NP_0011246      HEAEDLFYQSSLASWNYNTNITEENVQNMNNAGDKWSAFLKEQSTLAQMYPLQEIQNLTV
ETE61880.1      ARATDLYYNASIASWNYNTNLTEENAKIMHEKDNIFSKFYGEACRNASMFNVNHITDETI
XP_0061228      VQAEDLSYASSLASWNYNTNITDENAKKMNEAGAKWSVFYDEASTNASKYAIDKITNHTV
XP_416822.      VRAEDISYENSLASWNYNTNITEETARKMSEAGAKWAAFYEEASRNASRFSLANIQDAVT
                  * :: :  *:*:*  ***:.:*. . *      :: :  * .  *  : :  *     
AGZ48803.1      KLQLQILQQSGSPVLSEDKSKRLNSILNAMSTIYSTGKVCKPNKPQECLLLEPGLDNIMG
NP_0011165      KRQLQALQQSGTSGLSADKSKRLNTILNTMSTIYSSGKVLDPNNPQECLVLEPGLDEIME
XP_0313017      KRQLQALQQSGASALSADKSKRLTTVLSTMSTIYSSGEVCDPNNPQECLVLEPGLDDIME
QLH93383.1      KRQLQALQLSGSSALSADKNQRLNTILNTMSTIYSTGKVCNPGNPQECSLLEPGLDNIME
BAE53380.1      KRQLRALQQSGSSVLSADKRERLNTILNAMSTIYSTGKACNPNNPQECLLLEPGLDDIME
XP_0258425      KRQLRALQHSGSSVLSADKNQRLNTILNSMSTIYSTGKACNPSNPQECLLLEPGLDDIME
AAX63775.1      KRQLQALQQSGSSVLSADKSQRLNTILNAMSTIYSTGKACNPNNPQECLLLEPGLDNIME
AAX59005.1      KRQLQALQQSGSSVLSADKSQRLNTILNAMSTIYSTGKACNPNNPQECLLLEPGLDDIME
NP_0011239      KRQLQALQQSGSSALSADKNKQLNTILNTMSTIYSTGKVCNPKNPQECLLLEPGLDEIMA
AAW78017.1      KRQLKALQQSGSSALSPDKNKQLNTILNTMSTIYSTGKVCNSMNPQECFLLEPGLDEIMA
XP_0053160      KRQLQALQQSGSSALSANKREQLNTILNTMSTIYSTGKVCNPKKPQECLLLEPGLDEIMA
AAY57872.1      KLQLQALQQNGSSVLSEDKSKRLNTILNTMSTIHSTGKVCNPNNPQECLLLDPGLNEIME
NP_0013583      KLQLQALQQNGSSVLSEDKSKRLNTILNTMSTIYSTGKVCNPDNPQECLLLEPGLNEIMA
NP_0011246      KLQLQALQQNGSSVLSEDKSKRLNTILNTMSTIYSTGKVCNPNNPQECLLLEPGLNEIMA
ETE61880.1      KLQIRLLQ-SGSTDSTKD---QLDTVLHKMSTLYS-------------------LDDIMA
XP_0061228      KLQLQSLQGKGTSVLSGEKYNELNKILSTMSTFYSTGTVCKPDNPDICLPLEPGLDAIMA
XP_416822.      RLQIQSLQDRGSSVLSPEKYSRLNSVMNSMSTIYSTGVVCKATEPFDCLVLEPGLDDIMA
                . *:. **  *:.  : :    * .::  ***::*                   *: ** 
AGZ48803.1      TSKDYNERLWAWEGWRAEVGKQLRPLYEEYVVLKNEMARGYHYEDYGDYWRRDYETEESP
NP_0011165      NSKDYSRRLWAWESWRAEVGKQLRPLYEEYVVLENEMARANNYEDYGDYWRGDYEVTGTG
XP_0313017      NSKDYNQRLWAWEGWRAEVGKQLRPLYEEYVVLKNEMARANNYEDYGDYWRGDYEVMWAG
QLH93383.1      SSKDYNERLWAWEGWRSEVGKQLRPLYEEYVVLKNEMARANHYEDYGDYWRGDYETEGAN
BAE53380.1      NSKDYNERLWAWEGWRSEVGKQLRPLYEEYVALKNEMARANNYEDYGDYWRGDYEEEWAD
XP_0258425      NSKDYNERLWAWEGWRSEVGKQLRPLYEEYVALKNEMARANNYEDYGDYWRGDYEEEWEN
AAX63775.1      NSKDYNERLWAWEGWRAEVGKQLRPLYEEYVALKNEMARANNYEDYGDYWRGDYEEEWTG
AAX59005.1      NSKDYNERLWAWEGWRAEVGKQLRPLYEEYVALKNEMARANNYEDYGDYWRGDYEEEWTD
NP_0011239      TSTDYNSRLWAWEGWRAEVGKQLRPLYEEYVVLKNEMARANNYNDYGDYWRGDYEAEGAD
AAW78017.1      TSTDYNRRLWAWEGWRAEVGKQLRPLYEEYVVLKNEMARANNYEDYGDYWRGDYEAEGVE
XP_0053160      NSTDYNERLWVWEGWRSEVGKQLRPLYEEYVVLKNEMARANNYEDYGDYWRGDYEAEGAD
AAY57872.1      KSLDYNERLWAWEGWRSEVGKQLRPLYEEYVVLKNEMARANHYKDYGDYWRGDYEVNGVD
NP_0013583      NSLDYNERLWAWESWRSEVGKQLRPLYEEYVVLKNEMARANHYEDYGDYWRGDYEVNGVD
NP_0011246      NSLDYNERLWAWESWRSEVGKQLRPLYEEYVVLKNEMARANHYEDYGDYWRGDYEVNGVD
ETE61880.1      NNWNYPERLWAWEGWRANVGKKMRPLYETYVELKNKYARLRGYADYGDYWRANYEVDLPG
XP_0061228      SSTDYFERLWAWEGWRADVGKKMRELYERYVELENEAARLNKYSDYGDYWRGNYEVNDPT
XP_416822.      NSIDYHERLWAWEGWRADVGRMMRPLYEEYVELKNEAARLNNYSDYGDYWRANYETDYPE
                .. :*  ***.**.**::**. :* *** ** *:*: **   * ******* :**     
AGZ48803.1      GPGYSRDQLMKDVERIFTEIKPLYEHLHAYVRAKLMDTY-PFHISPTGCLPAHLLGDMWG
NP_0011165      DYDYSRNQLMEDVERTFAEIKPLYEHLHAYVRAKLMDAY-PSRISPTGCLPAHLLGDMWG
XP_0313017      DYDYSRDQLMGDVEHTFAEIKPLYEHLHAYVRAKLMDVY-PSHISPTGCLPAHLLGDMWG
QLH93383.1      GYNYSRDHLIEDVEHIFTQIKPLYEHLHAYVRAKLMDNY-PSHISPTGCLPAHLLGDMWG
BAE53380.1      GYSYSRNQLIEDVEHTFTQIKPLYEHLHAYVRAKLMDAY-PSRISPTGCLPAHLLGDMWG
XP_0258425      GYNYSRNQLIDDVEHTFTQIMPLYQHLHAYVRTKLMDTY-PSYISPTGCLPAHLLGDMWG
AAX63775.1      GYNYSRNQLIQDVEDTFEQIKPLYQHLHAYVRAKLMDTY-PSRISRTGCLPAHLLGDMWG
AAX59005.1      GYNYSRSQLIKDVEHTFTQIKPLYQHLHAYVRAKLMDTY-PSRISPTGCLPAHLLGDMWG
NP_0011239      GYNYNRNQLIEDVERTFAEIKPLYEHLHAYVRRKLMDTY-PSYISPTGCLPAHLLGDMWG
AAW78017.1      GYNYNRNQLIEDVENTFKEIKPLYEQLHAYVRTKLMEVY-PSYISPTGCLPAHLLGDMWG
XP_0053160      GYGYNRNQLIEDVERTFAEIKPLYEHLHAYVRAKLMNTY-PSYISPTGCLPAHLLGDMWG
AAY57872.1      GYDYNRDQLIEDVERTFEEIKPLYEHLHAYVRAKLMNAY-PSYISPTGCLPAHLLGDMWG
NP_0013583      GYDYSRGQLIEDVEHTFEEIKPLYEHLHAYVRAKLMNAY-PSYISPIGCLPAHLLGDMWG
NP_0011246      SYDYSRGQLIEDVEHTFEEIKPLYEHLHAYVRAKLINAY-PSYISPIGCLPAHLLGDMWG
ETE61880.1      KFQYQREQLITDVESTFKQ------QLHAYVRHHLYKRYGPELINPEGAIPAHLLGDMWG
XP_0061228      EYAYSRNQLMEDVEATFEQIKPLYRELHAYVRYRLEKFYGSDHISSTGCLPAHLLGDMWG
XP_416822.      EYKYSRDQLVQDVEKTFEQIKPLYQHLHAYVRHRLEQVYGSELINPTGCLPAHLLGDMWG
                   *.* :*: ***  * :       ****** .* . * .  *.  *.:**********
AGZ48803.1      RFWTNLYPLTVPFGQKPNIDVTDEMLKQGWDADRIFKEAEKFFVSVGLPNMTEGFWNNSM
NP_0011165      RFWTNLYPLTVPFGEKPSIDVTEAMVNQSWDAIRIFEEAEKFFVSIGLPNMTQGFWNNSM
XP_0313017      RFWTNLYSLTVPFGQKPNIDVTEAMENQSWDAKRIFKEAEKFFVSIGLPNMTQGFWDNSM
QLH93383.1      RFWTNLYPLTVPFRQKPNIDVTDAMVNQTWDANRIFKEAEKFFVSVGLPKMTQTFWENSM
BAE53380.1      RFWTNLYPLMVPFRQKPNIDVTDAMVNQSWDARRIFEEAETFFVSVGLPNMTEGFWQNSM
XP_0258425      RFWTNLYPLTVPFGQKPNIDVTNAMVNQSWDARKIFKEAEKFFVSVGLPNMTQGFWENSM
AAX63775.1      RFWTNLYPLTVPFGQKPNIDVTDAMVNQNWDARRIFKEAEKFFVSVGLPNMTQGFWENSM
AAX59005.1      RFWTNLYPLTVPFGQKPNIDVTDAMVNQSWDARRIFKEAEKFFVSVGLPNMTQGFWENSM
NP_0011239      RFWTNLYPLTVPFAQKPNIDVTDAMMNQGWDAERIFQEAEKFFVSVGLPHMTQGFWANSM
AAW78017.1      RFWTNLYPLTTPFLQKPNIDVTDAMVNQSWDAERIFKEAEKFFVSVGLPQMTPGFWTNSM
XP_0053160      RFWTNLYSLTVPFPEKPNIDVTDAMINQNWNAVRIFKEAEKFFVSVGLPNMTQGFWENSM
AAY57872.1      RFWTNLYSLTVPFGQKPNIDVTDAMVNQAWNAQRIFKEAEKFFVSVGLPNMTQGFWENSM
NP_0013583      RFWTNLYSLTVPFGQKPNIDVTDAMVDQAWDAQRIFKEAEKFFVSVGLPNMTQGFWENSM
NP_0011246      RFWTNLYSLTVPFGQKPNIDVTDAMVDQAWDAQRIFKEAEKFFVSVGLPNMTQRFWENSM
ETE61880.1      RFWTNLYPLMVPYPNKTSIDVTSAMEKKKWTVNSIFKAAEHFFISIGLFNMTVGFWKNSM
XP_0061228      RFWTNLYALTVPYPDKPNIDVTSEMVKKNWNATKIFKAAEDFFMSVGLYKMTEGFWKNSM
XP_416822.      RFWTNLYNLTVPYPEKPNIDVTSAMAQKNWDAMKIFKTAEAFFASIGLYNMTEGFWTNSM
                ******* * .*: :*..****. * .: * .  **: ** ** *:** :**  ** ***
AGZ48803.1      LTEPGDGRKVVCHPTAWDLGKGDFRIKMCTKVTMEDFLTAHHEMGHIQYDMAYASQPYLL
NP_0011165      LTEPGDGRKVVCHPTAWDLGKGDFRIKMCTKVTMDDFLTAHHEMGHIQYDMAYAIQPYLL
XP_0313017      LTEPGDGRKVVCHPTAWDLGKGDFRIKMCTKVTMDDFLTAHHEMGHIQYDMAYAIQPFLL
QLH93383.1      LTEPGDGRKVVCHPTAWDLGKHDFRIKMCTKVTMDDFLTAHHEMGHIQYDMAYAMQPYLL
BAE53380.1      LTEPGDNRKVVCHPTAWDLGKRDFRIKMCTKVTMDDFLTAHHEMGHIQYDMAYAEQPFLL
XP_0258425      LTEPSDSRKVVCHPTAWDLGKGDFRIKMCTKVTMDDFLTAHHEMGHIQYDMAYAAQPFLL
AAX63775.1      LTEPGDGRKVVCHPTAWDLGKGDFRIKMCTKVTMDDFLTAHHEMGHIQYDMAYAAQPFLL
AAX59005.1      LTEPGDSRKVVCHPTAWDLGKGDFRIKMCTKVTMDDFLTAHHEMGHIQYDMAYAVQPFLL
NP_0011239      LTEPADGRKVVCHPTAWDLGHGDFRIKMCTKVTMDNFLTAHHEMGHIQYDMAYARQPFLL
AAW78017.1      LTEPGDDRKVVCHPTAWDLGHGDFRIKMCTKVTMDNFLTAHHEMGHIQYDMAYAKQPFLL
XP_0053160      LTEPTDGRKVVCHPTAWDLQKGDFRIKMCTKVTMDNFLTAHHEMGHIQYDMAYAMQPYLL
AAY57872.1      LTDPGNVQKVVCHPTAWDLGKGDFRIIMCTKVTMDDFLTAHHEMGHIQYDMAYAAQPFLL
NP_0013583      LTDPGNVQKAVCHPTAWDLGKGDFRILMCTKVTMDDFLTAHHEMGHIQYDMAYAAQPFLL
NP_0011246      LTDPGNVQKVVCHPTAWDLGKGDFRILMCTKVTMDDFLTAHHEMGHIQYDMAYAAQPFLL
ETE61880.1      LEEPKGGRKVVCHPTAWDMGKEDYRIKMCTKINMEDFLTAHHEMGHIEYDMAYANQPFLL
XP_0061228      ITEPNDGRKVVCHPTAWDMGKKDYRIKMCTKVSMDDFLTVHHEMGHIEYDMAYSNLSYLL
XP_416822.      LTEPTDNRKVVCHPTAWDMGKNDYRIKMCTKVTMDDFLTAHHEMGHIEYDMAYSVQPFLL
                : :* . .*.********: : *:** ****:.*::***.*******:*****:  .:**
AGZ48803.1      RNGANEGFHEAVGEVMSLSVATPKHLKTMGLLSPDFREDNETEINFLLKQALNIVGTLPF
NP_0011165      RNGANEGFHEAVGEIMSLSAATPHYLKALGLLPPDFYEDSETEINFLLKQALTIVGTLPF
XP_0313017      RNGANEGFHEAVGEIMSLSAATPHYLKALGLLPADFYEDSETEINFLLKQALTIVGTLPF
QLH93383.1      RNGANEGFHEAVGEIMSLSAATPKHLKNIGLLPPDFYEDNETEINFLLKQALTIVGTLPF
BAE53380.1      RNGANEGFHEAVGEIMSLSAATPNHLKNIGLLPPDFSEDSETDINFLLKQALTIVGTLPF
XP_0258425      RNGANEGFHEAVGEIMSLSAATPNHLKNIGLLPPSFFEDSETEINFLLKQALTIVGTLPF
AAX63775.1      RNGANEGFHEAVGEIMSLSAATPNHLKTIGLLSPAFSEDNETEINFLLKQALTIVGTLPF
AAX59005.1      RNGANEGFHEAVGEIMSLSAATPNHLKTIGLLSPGFSEDSETEINFLLKQALTIVGTLPF
NP_0011239      RNGANEGFHEAVGEIMSLSAATPKHLKSIGLLPSDFQEDSETEINFLLKQALTIVGTLPF
AAW78017.1      RNGANEGFHEAVGEIMSLSAATPKHLKSIGLLPSNFQEDNETEINFLLKQALTIVGTLPF
XP_0053160      RNGANEGFHEAVGEIMSLSASTPKHLKSIGLLPSDFREDSETEINFLLKQALTIVGTLPF
AAY57872.1      RNGANEGFHEAVGEIMSLSAATPKHLKSIGLLSPDFQEDNETEINFLLKQALTIVGTLPF
NP_0013583      RNGANEGFHEAVGEIMSLSAATPKHLKSIGLLSPDFQEDNETEINFLLKQALTIVGTLPF
NP_0011246      RNGANEGFHEAVGEIMSLSAATPKHLKSIGLLSPDFQEDNETEINFLLKQALTIVGTLPF
ETE61880.1      RNGANEGFHEAVGEIMSLSAATPKYLKSLGLLEPTFQEDAETDINFLLKQALTIVGTMPF
XP_0061228      RSGANEGFHEAVGEIMSLSAATPKHLKSLDLLEPTFQEDNETDINFLLKQALTIVGTMPF
XP_416822.      RNGANEGFHEAVGEIMSLSAATPQHLKSLDLLEPTFQEDEETEINFLLKQALTIVGTMPF
                *.************:****.:**::** :.** . * ** **:*********.****:**
AGZ48803.1      TYMLEKWRWMVFKGEIPKEEWMKKWWEMKRKIVGVVEPVPHDETYCDPASLFHVANDYSF
NP_0011165      TYMLEKWRWMVFKGEIPKEQWMQKWWEMKREIVGVVEPLPHDETYCDPACLFHVAEDYSF
XP_0313017      TYMLEKWRWMVFKGEIPKEQWMQKWWEMKREIVGVVEPLPHDETYCDPACLFHVAEDYSF
QLH93383.1      TYMLEKWRWMVFSGQIPKEQWMKKWWEMKREIVGVVEPVPHDETYCDPASLFHVANDYSF
BAE53380.1      TYMLEKWRWMVFKGEIPKEQWMQKWWEMKRDIVGVVEPLPHDETYCDPAALFHVANDYSF
XP_0258425      TYMLEKWRWMVFKGEIPKDQWMKTWWEMKRNIVGVVEPVPHDETYCDPASLFHVANDYSF
AAX63775.1      TYMLEKWRWMVFKGAIPKEQWMQKWWEMKRNIVGVVEPVPHDETYCDPASLFHVANDYSF
AAX59005.1      TYMLEKWRWMVFKGEIPKEQWMQKWWEMKREIVGVVEPVPHDETYCDPASLFHVANDYSF
NP_0011239      TYMLEKWRWMVFRGEIPKEQWMKKWWEMKREIVGVVEPLPHDETYCDPASLFHVSNDYSF
AAW78017.1      TYMLEKWRWMVFQDKIPREQWTKKWWEMKREIVGVVEPLPHDETYCDPASLFHVSNDYSF
XP_0053160      TYMLEKWRWMVFKGEIPKDQWMKKWWEMKREIVGVMEPVPHDETYCDPAALYHVSNDFSF
AAY57872.1      TYMLEKWRWMVFKGEIPKDQWMKKWWEMKREIVGVVEPVPHDETYCDPASLFHVSNDYSF
NP_0013583      TYMLEKWRWMVFKGEIPKDQWMKKWWEMKREIVGVVEPVPHDETYCDPASLFHVSNDYSF
NP_0011246      TYMLEKWRWMVFKGEIPKDQWMKKWWEMKREIVGVVEPVPHDETYCDPASLFHVSNDYSF
ETE61880.1      TYMLEKWRWMVFAEQIPKDQWMKKWWEMKREIVGVVEPLPHNEEYCDPAALFHVANDYSF
XP_0061228      TYMLEKWRWMVFKGDIPKDEWMKKWWEMKRAIVGVVEPVPHDETYCDPAALFHVANDYSF
XP_416822.      TYMLEKWRWMVFNGEITKQEWTKRWWKMKREIVGVVEPVPHDETYCDPAALFHVANDYSF
                ************   *..::* : **:*** ****:**:**:* *****.*:**::*:**
AGZ48803.1      IRYYTRTIFEFQFHEALCRIAQHDGPLHKCDISNSTDAGKKLHQMLSVGKSQAWTKTLED
NP_0011165      IRYYTRTIYQFQFHEALCRTAKHEGPLYKCDISNSTEAGQKLLQMLSLGKSEPWTLALEN
XP_0313017      IRYYTRTIYQFQFHEALCQIAKHEGPLYKCDISNSTEAGQKLLQMLSLGKSEPWTLALEG
QLH93383.1      IRYYTRTIYQFQFQEALCQTAKHEGPLHKCDISNSTEAGQKLLQMLSLGKSKPWTLALER
BAE53380.1      IRYYTRTIYQFQFQEALCQIAKHEGPLYKCDISNSSEAGQKLHEMLSLGRSKPWTFALER
XP_0258425      IRYYTRTIYQFQFQEALCQIAKHEGPLHKCDISNSSEAGQKLLEMLKLGKSKPWTYALEI
AAX63775.1      IRYYTRTIYQFQFQEALCQIAKHEGPLHKCDISNSTEAGKKLLEMLSLGRSEPWTLALER
AAX59005.1      IRYYTRTIYQFQFQEALCRIAKHEGPLHKCDISNSSEAGKKLLQMLTLGKSKPWTLALEH
NP_0011239      IRYYTRTIYQFQFQEALCQAAKYNGSLHKCDISNSTEAGQKLLKMLSLGNSEPWTKALEN
AAW78017.1      IRYYTRTIYQFQFQEALCQAAKHDGPLHKCDISNSTEAGQKLLNMLSLGNSGPWTLALEN
XP_0053160      IRYYTRTIYQFQFQEALCQAAKHEGPLHKCDISNSTEAGQKLLNMLRLGKSKPWTLALEN
AAY57872.1      IRYYTRTLYQFQFQEALCQAAKHEGPLHKCDISNSTEAGQKLLNMLKLGKSEPWTLALEN
NP_0013583      IRYYTRTLYQFQFQEALCQAAKHEGPLHKCDISNSTEAGQKLFNMLRLGKSEPWTLALEN
NP_0011246      IRYYTRTLYQFQFQEALCQAAKHEGPLHKCDISNSTEAGQKLLNMLRLGKSEPWTLALEN
ETE61880.1      IRYYTRTIYQFQFQEALCKAAGHTKELYKCDISDSTNAGRILKDMLALGSSQPWTKALES
XP_0061228      IRYYTRTIYQFQFQEALCKAANHGGLLHTCDITNSMAAGQKLRDMLALGRSQPWTKALES
XP_416822.      IRYYTRTIYQFQFQEALCKAANHTGPLHKCDITNSTAAGGNLRQLLELGKSKPWTQALES
                *******:::***:****. * :   *:.***::*  **  * .:* :* * .** :** 
AGZ48803.1      IVDSRNMDVGPLLKYFEPLYTWLQEQNRKSYVGWNTDWSPYSDQSIKVRISLKSALGENA
NP_0011165      IVGVKTMDVKPLLSYFEPLLTWLKAQNGNSSVGWNTDWTPYADQSIKVRISLKSALGEDA
XP_0313017      LVGVKTMDVKPLLNYFEPLLTWLKDQNRNSFVGWSTDWTPYTDQSIKVRISLKSALGDKA
QLH93383.1      VVGTKNMDVRPLLNYFEPLLTWLKEQNKNSFVGWNTDWSPYAAQSIKVRISLKSALGEKA
BAE53380.1      VVGAKTMDVRPLLNYFEPLFTWLKEQNRNSFVGWNTDWSPYADQSIKVRISLKSALGEKA
XP_0258425      VVGAKNMDVRPLLNYFEPLFTWLKEQNRNSFVGWNTDWSPYADQSIKVRISLKSALGEKA
AAX63775.1      VVGAKNMNVTPLLNYFEPLFTWLKEQNRNSFVGWDTDWRPYSDQSIKVRISLKSALGEKA
AAX59005.1      VVGEKKMNVTPLLKYFEPLFTWLKEQNRNSFVGWNTDWRPYADQSIKVRISLKSALGDEA
NP_0011239      VVGARNMDVKPLLNYFQPLFDWLKEQNRNSFVGWNTEWSPYADQSIKVRISLKSALGANA
AAW78017.1      VVGSRNMDVKPLLNYFQPLFVWLKEQNRNSTVGWSTDWSPYADQSIKVRISLKSALGKNA
XP_0053160      VVGARNMDVRPLLNYFEPLFGWLKDQNRNSFVGWNTNWSPYTDQSIKVRISLKSALGEEA
AAY57872.1      VVGAKNMSVRPLLNYFEPLFTWLKDQNKNSFVGWSTDWSPYADQSIKVRISLKSALGANA
NP_0013583      VVGAKNMNVRPLLNYFEPLFTWLKDQNKNSFVGWSTDWSPYADQSIKVRISLKSALGDKA
NP_0011246      VVGAKNMNVRPLLDYFEPLFTWLKDQNKNSFVGWSTDWSPYADQSIKVRISLKSALGNKA
ETE61880.1      ITGSLKMDAKPFCQYFDPLLKWLEKTNSNENVGWNVNWTPYSKDAIKVRISLKAALGDDA
XP_0061228      ITGEKKMNATPLLHYFEPLYQWLIKNNSGRAVGWNTFWSPYSGNAIKVRISLKTALGDNA
XP_416822.      ATGEKYMNATPLLHYFEPLFNWLQKNNSGRSIGWNTDWTPYSDNAIKVRISLKAALGDDA
                 ..   *.. *:  **:**  **   *    :**.. * **: ::********:*** .*
AGZ48803.1      YEWNDNEMYLFRSSVAYAMREYFLKEKHQTILFGAENVWVSNLKPRISFNFHVTSPGNLS
NP_0011165      YEWNDNEMYLFRSSIAYAMRNYFSSAKNETIPFGAVDVWVSDLKPRISFNFFVTSPANMS
XP_0313017      YEWNDNEMYLFQSSLAYAMRKYFLKVQNQTILFGVEDVWVSDLKPRISFSFFVTSPKNVS
QLH93383.1      YEWNDSEMYLFRSSVAYAMREYFSKFKKQTIPFEEESVRVSDLKPRVSFIFFVTLPKNVS
BAE53380.1      YEWNDNEMYFFQSSIAYAMREYFSKVKNQTIPFVGKDVRVSDLKPRISFNFIVTSPENMS
XP_0258425      YEWNNNEMYLFRSSIAYAMRRYFSEVKKQTIPFVEDNVWVSDLKPRISFNFFVTSPGNVS
AAX63775.1      YEWNDNEMYLFRSSIAYAMREYFSKVKNQTIPFVEDNVWVSDLKPRISFNFFVTFSNNVS
AAX59005.1      YEWNDNEMYLFRSSVAYAMREYFSKVKNQTIPFVEDNVWVSNLKPRISFNFFVTASKNVS
NP_0011239      YEWTNNEMFLFRSSVAYAMRKYFSIIKNQTVPFLEEDVRVSDLKPRVSFYFFVTSPQNVS
AAW78017.1      YEWTDNEMYLFRSSVAYAMREYFSREKNQTVPFGEADVWVSDLKPRVSFNFFVTSPKNVS
XP_0053160      YQWNDNEMYLFRSSVAYAMRMYFSKVKNQTIPFGEKDVWVSDEKPRISFNFFVTAPQNVS
AAY57872.1      YKWNDNEMYLFRSSVAYAMRQYFLENKHQTILFGEEDVRVADLKPRISFNFYVTAPKNVS
NP_0013583      YEWNDNEMYLFRSSVAYAMRQYFLKVKNQMILFGEEDVRVANLKPRISFNFFVTAPKNVS
NP_0011246      YEWNDNEIYLFRSSVAYAMRKYFLEVKNQMILFGEEDVRVANLKPRISFNFFVTAPKNVS
ETE61880.1      YNWDESEMFLFKSTIAYAMQKYFLEVKNKTVPF---------------------------
XP_0061228      YEWDENELYFFKSSIAYAMRKYFLEVKNQTVSFQCTDIHVWAVTQRVSFYFAVSMPGNAT
XP_416822.      YVWDASELFLFKSSIAYAMRKYFAKEKEQNVDFQVTDIHVGEETQRVSFYLTVSMPGNVS
                * *  .*:::*.*::****. **   : : : *                           
AGZ48803.1      DIIPRPEVEGAIRMSRSRINDAFRLDDNSLEFLGIQPTLGPPYQPPVTIWLIVFGVVMAV
NP_0011165      DIIPRSDVEKAISMSRSRINDAFRLDDNTLEFLGIQPTLGPPDEPPVTVWLIIFGVVMGL
XP_0313017      DIIPRTEVEEAIRMSRSRINDAFRLDDNSLEFLGIQPTLGPPYEPPVTVWLIIFGIVMGL
QLH93383.1      AVIPRAEVEEAIRMSRSRINDVFRLDDNSLEFLGIQPTLEPPYQPPVTIWLIVFGVVMGV
BAE53380.1      DIIPRADVEEAIRKSRGRINDAFRLDDNSLEFLGIQPTLEPPYQPPVTIWLIVFGVVMGV
XP_0258425      DIIPRTEVEKAIRMYRGRINDVFRLDDNSLEFLGIQPTLGPSYEPPVTIWLIVFGVVMGV
AAX63775.1      DVIPRSEVEDAIRMSRSRINDAFRLDDNSLEFLGIEPTLSPPYRPPVTIWLIVFGVVMGA
AAX59005.1      DVIPRSEVEEAIRMSRSRINDAFRLDDNSLEFLGIQPTLSPPYQPPVTIWLIVFGVVMGV
NP_0011239      DVIPRSEVEDAIRMSRGRINDVFGLNDNSLEFLGIHPTLEPPYQPPVTIWLIIFGVVMAL
AAW78017.1      DIIPRSEVEEAIRMSRGRINDIFGLNDNSLEFLGIYPTLKPPYEPPVTIWLIIFGVVMGT
XP_0053160      DIIPRTDVEKAIRMSRGRINGVFRLDDNSLEFLGIQPTLGPPYQPPVTIWLIVFGVVMGL
AAY57872.1      DIIPRTEVEEAIRFSRSRINDAFQLNDNSLEFLGIQSTLVPPYQSPITTWLIVFGVVMAV
NP_0013583      DIIPRTEVEKAIRMSRSRINDAFRLNDNSLEFLGIQPTLGPPNQPPVSIWLIVFGVVMGV
NP_0011246      DIIPRTEVEKAIRMSRSRINDAFRLNDNSLEFLGIQPTLGPPNQPPVSIWLIVFGVVMGV
ETE61880.1      ------------HLSRDRINEAFKLTDQTLEFIGLLPTLAPPYESPITVWLVAFGVVIGL
XP_0061228      DFIPKSEVETAIRMSRGRINEAFRLDDNTLEFEGLLPTLASPYEPPVTVWLILFGVVMGV
XP_416822.      DIVPRADVEKAIRMSRGRISEAFRLDDNTLEFDGIVPTLATPYKPPVTIWLILFGVVMSL
                               *.**.  * * *::*** *: .** ..  .*:: **: **:*:. 
AGZ48803.1      VVVGIVVLIITGIRDRRKTDQARSEE-------NPYS--SVDLSKGENNPGFQNGDDVQT
NP_0011165      VVVGIVVLIFTGIRDRRKKKQASSEE-------NPYG--SMDLSKGESNSGFQNGDDIQT
XP_0313017      VVVGIVVLIFTGIRDRRKKKQASTEE-------NPYG--SVDLSKGENNSGFQNGDDVQT
QLH93383.1      IVVGIVVLIFTGIRDRKKKNQARSEQ-------NPYA--SVDLSKGENNPGFQNVDDVQT
BAE53380.1      VVVGIFLLIFSGIRNRRKNNQARSEE-------NPYA--SVDLSKGENNPGFQNVDDVQT
XP_0258425      VVVGIVLLIFSGIRNRRKNDQARGEE-------NPYA--SVDLSKGENNPGFQNVDDAQT
AAX63775.1      IVVGIVLLIVSGIRNRRKNDQAGSEE-------NPYA--SVDLNKGENNPGFQHADDVQT
AAX59005.1      VVVGIVLLIVSGIRNRRKNNQARSEE-------NPYA--SVDLSKGENNPGFQHADDVQT
NP_0011239      VVVGIIILIVTGIKGRKKKNETKREE-------NPYD--SMDIGKGESNAGFQNSDDAQT
AAW78017.1      VVVGIVILIVTGIKGRKKKNETKREE-------NPYD--SMDIGKGESNAGFQNSDDAQT
XP_0053160      IVVGIVILIFTGIRDRRRKNQTKREE-------NPYAESSMEMGKGENNPGYQNNDDVQT
AAY57872.1      IVAGIVVLIFTGIRDRKKKNQARSEE-------NPYA--SIDISKGENNPGFQNTDDVQT
NP_0013583      IVVGIVILIFTGIRDRKKKNKARSGE-------NPYA--SIDISKGENNPGFQNTDDVQT
NP_0011246      IVVGIVVLIFTGIRDRKKKNKARNEE-------NPYA--SIDISKGENNPGFQNTDDVQT
ETE61880.1      IVIGIITLEKAGSKN---------------------------------------------
XP_0061228      IVVGVIVLIVTGQRDRRKRMKAGTNELVQTNAIDP------ELENGEVNPAFIKHEERQT
XP_416822.      IVIGVIVLIITGQRDKRKKARGRANEAGSNCEVNPYD------EDGRSNKGFEQSEETQT
                :* *:. *  :* ..                                             
AGZ48803.1      SF
NP_0011165      SF
XP_0313017      SF
QLH93383.1      SF
BAE53380.1      SF
XP_0258425      SF
AAX63775.1      SF
AAX59005.1      SF
NP_0011239      SF
AAW78017.1      SF
XP_0053160      SF
AAY57872.1      SF
NP_0013583      SF
NP_0011246      SF
ETE61880.1      --
XP_0061228      SF
XP_416822.      SF

Part 3: Creating a phylogenetic tree with Phylogeny.fr

  • Continued working with the www.phylogeny.fr website.
  • Went back to 6. Tree rendering for the phylogenetic tree of the sequences.
    • Horizontal lines represent individual evolutionary lines.
    • Vertical lines represent mutation events. the vertical length has no biological meaning.
    • The left-most split is called the root of the tree, which represents a hypothesis about the most recent common ancestor (MRCA) of the sequences within your tree.
    • The length of each branch represents the percentage change in the amino acid sequence occurring along that branch, relative to the scale bar
      • The scale bar was 0.5 (50%).
  • I saved the image to a file and uploaded it to the wiki.

  • Original image included the American mink. After further analysis of this partial sequence, this sequence was removed.
  • The new phylogenetic tree is visible below:

Part 4: Structural Analysis and Critical Residues Table

  • My research partner, Aiden Burnett, performed a structural analysis of the ACE2 receptor.
  • Aiden Burnett also created a table detailing the differences between the critical amino acid residues.
  • The methods by which he did these can both be found on his user page.

Part 5: Sequence Percent Similarity Table

  • Navigated to LALIGN, a platform used by researchers to find a percent identity for sequences.
    • Allows for the comparison of two different sequences.
  • Entered human ACE2 sequence in area called 1st Query sequence.
  • Entered one of the other seventeen sequences into the ;2nd Query sequence'.
  • Clicked Run LALIGN.
  • Noted the percent similarity between the sequences.
    • Recorded this value in a table.
  • Repeated procedure for other sixteen sequences, comparing each to the human ACE2 sequence.
  • Created a table showing the percent similarity of each organism to humans.

Part 6: Final Presentation

  • Uploaded presentation created based on the information presented above.

Presentation Slides (PDF)

Scientific Conclusion

  • Known human orthologues, including monkeys and orangutans, showed close similarities to the human ACE2 sequences. Foxes and cats also showed similarities, while turtles and king cobras did not show to be similar. Of the five critical amino acids that correspond to the RBD of SARS-CoV-2 on the ACE2 receptor, many were relatively conserved across species, with most organisms having between 2-3 of the 5 amino acids altered. This could help study the lineage of SARS-CoV-2 and identify which animals could act as intermediary hosts for future strains of SARS viruses.

Acknowledgments

  • I consulted with my partner Aiden Burnett in class, over text, as well as over the phone several times to discuss the creation of our presentation.
  • I contacted my TA, Annika Dinulos, to ask about the formatting of a presentation.
  • I copied and modified procedures from the Week 6 assignment page.
  • I referred back to procedures used on my Week 4 and Week 5 pages.
  • I used the Wan et. al - Receptor Recognition by the Novel Coronavirus from Wuhan paper for reference.
  • I obtained sequences from GenBank.
  • I built the phylogenetic tree and created sequence alignments using Phylogeny.fr.
  • I used LALIGN to compare sequence percent similarities.
  • I uploaded images using the Wiki Upload page.
  • I copied and modified wiki syntax on formatting a photo from the Media Wiki Help Page.
  • Except for what is noted above, this individual journal entry was completed by me and not copied from another source.

Anna Horvath (talk) 21:25, 14 October 2020 (PDT)

References

  1. Andersen, K., Rambaut, A., Lipkin, W., Holmes, E., & Garry, R. (2020). The proximal origin of SARS-CoV-2. Nature Medicine, 26(4), 450-452. doi: 10.1038/s41591-020-0820-9
  2. Angiotensin-converting enzyme 2 isoform 1 precursor [Homo sapiens] - Protein - NCBI. (2020). Retrieved 8 October 2020, from https://www.ncbi.nlm.nih.gov/protein/NP_001358344.1?report=fasta
  3. Angiotensin-converting enzyme 2 [Paguma larvata] - Protein - NCBI. (2020). Retrieved 8 October 2020, from https://www.ncbi.nlm.nih.gov/protein/AAX63775.1?report=fasta
  4. Angiotensin-converting enzyme 2 [Rhinolophus sinicus] - Protein - NCBI. (2020). Retrieved 8 October 2020, from https://www.ncbi.nlm.nih.gov/protein/AGZ48803.1?report=fasta
  5. Angiotensin-converting enzyme 2 precursor [Mus musculus] - Protein - NCBI. (2020). Retrieved 8 October 2020, from https://www.ncbi.nlm.nih.gov/protein/NP_001123985.1?report=fasta
  6. angiotensin converting enzyme 2 [Rattus norvegicus] - Protein - NCBI. (2020). Retrieved 8 October 2020, from https://www.ncbi.nlm.nih.gov/protein/AAW78017.1?report=fasta
  7. Angiotensin-converting enzyme 2 precursor [Sus scrofa] - Protein - NCBI. (2020). Retrieved 8 October 2020, from https://www.ncbi.nlm.nih.gov/protein/NP_001116542.1?report=fasta
  8. Angiotensin I converting enzyme 2 [Mustela putorius furo] - Protein - NCBI. (2020). Retrieved 8 October 2020, from https://www.ncbi.nlm.nih.gov/protein/BAE53380.1?report=fasta
  9. Angiotensin I converting enzyme 2 [Felis catus] - Protein - NCBI. (2020). Retrieved 8 October 2020, from https://www.ncbi.nlm.nih.gov/protein/AAX59005.1?report=fasta
  10. Angiotensin-converting enzyme 2 precursor [Pongo abelii] - Protein - NCBI. (2020). Retrieved 8 October 2020, from https://www.ncbi.nlm.nih.gov/protein/NP_001124604.1?report=fasta
  11. Angiotensin converting enzyme 2 [Chlorocebus aethiops] - Protein - NCBI. (2020). Retrieved 8 October 2020, from https://www.ncbi.nlm.nih.gov/protein/AAY57872.1?report=fasta
  12. Angiotensin-converting enzyme 2 [Vulpes vulpes] - Protein - NCBI. (2020). Retrieved 8 October 2020, from https://www.ncbi.nlm.nih.gov/protein/XP_025842513.1?report=fasta
  13. Angiotensin-converting enzyme 2 [Gallus gallus] - Protein - NCBI. (2020). Retrieved 8 October 2020, from https://www.ncbi.nlm.nih.gov/protein/XP_416822.2?report=fasta
  14. Angiotensin-converting enzyme 2 [Ophiophagus hannah] - Protein - NCBI. (2020). Retrieved 8 October 2020, from https://www.ncbi.nlm.nih.gov/protein/ETE61880.1?report=fasta
  15. Angiotensin I converting enzyme 2 [Manis pentadactyla] - Protein - NCBI. (n.d.). Retrieved October 08, 2020, from https://www.ncbi.nlm.nih.gov/protein/QLH93383.1?report=fasta
  16. Angiotensin-converting enzyme 2 [Camelus dromedarius] - Protein - NCBI. (n.d.). Retrieved October 08, 2020, from https://www.ncbi.nlm.nih.gov/protein/XP_031301717.1
  17. Angiotensin-converting enzyme 2, partial [Neovison vison] - Protein - NCBI. (2020). Retrieved 8 October 2020, from https://www.ncbi.nlm.nih.gov/protein/CCP86723.1?report=fasta
  18. Angiotensin-converting enzyme 2 [Ictidomys tridecemlineatus] - Protein - NCBI. (2020). Retrieved 8 October 2020, from https://www.ncbi.nlm.nih.gov/protein/XP_005316051.3?report=fasta
  19. Angiotensin-converting enzyme 2 [Pelodiscus sinensis] - Protein - NCBI. (2020). Retrieved 8 October 2020, from https://www.ncbi.nlm.nih.gov/protein/XP_006122891.1?report=fasta
  20. Deng, J., Jin, Y., Liu, Y., Sun, J., Hao, L., & Bai, J. et al. (2020). Serological survey of SARS‐CoV‐2 for experimental, domestic, companion and wild animals excludes intermediate hosts of 35 different species of animals. Transboundary And Emerging Diseases, 67(4), 1745-1749. doi: 10.1111/tbed.13577
  21. LALIGN Server. (2020). Retrieved 14 October 2020, from https://embnet.vital-it.ch/software/LALIGN_form.html
  22. OpenWetWare - Anna Horvath Week 4. (2020). Retrieved 14 October 2020, from https://openwetware.org/wiki/Anna_Horvath_Week_4
  23. OpenWetWare - Anna Horvath Week 5. (2020). Retrieved 14 October 2020, from https://openwetware.org/wiki/Anna_Horvath_Week_5
  24. OpenWetWare - BIOL368/F20:Week 6. (2020). Retrieved 14 October 2020, from https://openwetware.org/wiki/BIOL368/F20:Week_6
  25. Phylogeny.fr: "One Click" Mode. (2020). Retrieved 8 October 2020, from http://www.phylogeny.fr/simple_phylogeny.cgi?workflow_id=b9c0813cbbe9695d63cf7e31da5f026d&tab_index=1
  26. Wan, Y., Shang, J., Graham, R., Baric, R., & Li, F. (2020). Receptor Recognition by the Novel Coronavirus from Wuhan: an Analysis Based on Decade-Long Structural Studies of SARS Coronavirus. Journal Of Virology, 94(7). doi: 10.1128/jvi.00127-20
  27. Yuan, S., Jiang, S., & Li, Z. (2020). Analysis of Possible Intermediate Hosts of the New Coronavirus SARS-CoV-2. Frontiers In Veterinary Science, 7. doi: 10.3389/fvets.2020.00379
  28. Zhao, J., Cui, W., & Tian, B. (2020). The Potential Intermediate Hosts for SARS-CoV-2. Frontiers In Microbiology, 11. doi: 10.3389/fmicb.2020.580137

Template

Anna Horvath Template

User Pages

Assignments

Journal Pages

Class Journal Pages