Anna Horvath Week 6
From OpenWetWare
Purpose
To determine, by comparing eighteen ACE2 sequences, which species are likely intermediate hosts for SARS-CoV-2 and which has the ACE2 receptor most similar to that of humans. The results could then inform future comparisons of coronaviruses by clarifying how ACE2 varies among species.
Methods/Results
Part 1: GenBank
- I went to GenBank to find the eighteen sequences.
- Sequences used included:
- humans [2]
- civets [3]
- Chinese bats [4]
- mice [5]
- rats [6]
- pigs [7]
- ferrets [8]
- cats [9]
- orangutans [10]
- grivet monkeys [11]
- foxes [12]
- chickens [13]
- king cobras [14]
- pangolins [15]
- dromedary camels [16]
- squirrels [17]
- mink [18]
- Chinese softshell turtles [19]
>NP_001358344.1 angiotensin-converting enzyme 2 isoform 1 precursor [Homo sapiens] MSSSSWLLLSLVAVTAAQSTIEEQAKTFLDKFNHEAEDLFYQSSLASWNYNTNITEENVQNMNNAGDKWS AFLKEQSTLAQMYPLQEIQNLTVKLQLQALQQNGSSVLSEDKSKRLNTILNTMSTIYSTGKVCNPDNPQE CLLLEPGLNEIMANSLDYNERLWAWESWRSEVGKQLRPLYEEYVVLKNEMARANHYEDYGDYWRGDYEVN GVDGYDYSRGQLIEDVEHTFEEIKPLYEHLHAYVRAKLMNAYPSYISPIGCLPAHLLGDMWGRFWTNLYS LTVPFGQKPNIDVTDAMVDQAWDAQRIFKEAEKFFVSVGLPNMTQGFWENSMLTDPGNVQKAVCHPTAWD LGKGDFRILMCTKVTMDDFLTAHHEMGHIQYDMAYAAQPFLLRNGANEGFHEAVGEIMSLSAATPKHLKS IGLLSPDFQEDNETEINFLLKQALTIVGTLPFTYMLEKWRWMVFKGEIPKDQWMKKWWEMKREIVGVVEP VPHDETYCDPASLFHVSNDYSFIRYYTRTLYQFQFQEALCQAAKHEGPLHKCDISNSTEAGQKLFNMLRL GKSEPWTLALENVVGAKNMNVRPLLNYFEPLFTWLKDQNKNSFVGWSTDWSPYADQSIKVRISLKSALGD KAYEWNDNEMYLFRSSVAYAMRQYFLKVKNQMILFGEEDVRVANLKPRISFNFFVTAPKNVSDIIPRTEV EKAIRMSRSRINDAFRLNDNSLEFLGIQPTLGPPNQPPVSIWLIVFGVVMGVIVVGIVILIFTGIRDRKK KNKARSGENPYASIDISKGENNPGFQNTDDVQTSF
>AAX63775.1 angiotensin-converting enzyme 2 [Paguma larvata] MSGSFWLLLSFAALTAAQSTTEELAKTFLETFNYEAQELSYQSSVASWNYNTNITDENAKNMNEAGAKWS AYYEEQSKLAQTYPLAEIQDAKIKRQLQALQQSGSSVLSADKSQRLNTILNAMSTIYSTGKACNPNNPQE CLLLEPGLDNIMENSKDYNERLWAWEGWRAEVGKQLRPLYEEYVALKNEMARANNYEDYGDYWRGDYEEE WTGGYNYSRNQLIQDVEDTFEQIKPLYQHLHAYVRAKLMDTYPSRISRTGCLPAHLLGDMWGRFWTNLYP LTVPFGQKPNIDVTDAMVNQNWDARRIFKEAEKFFVSVGLPNMTQGFWENSMLTEPGDGRKVVCHPTAWD LGKGDFRIKMCTKVTMDDFLTAHHEMGHIQYDMAYAAQPFLLRNGANEGFHEAVGEIMSLSAATPNHLKT IGLLSPAFSEDNETEINFLLKQALTIVGTLPFTYMLEKWRWMVFKGAIPKEQWMQKWWEMKRNIVGVVEP VPHDETYCDPASLFHVANDYSFIRYYTRTIYQFQFQEALCQIAKHEGPLHKCDISNSTEAGKKLLEMLSL GRSEPWTLALERVVGAKNMNVTPLLNYFEPLFTWLKEQNRNSFVGWDTDWRPYSDQSIKVRISLKSALGE KAYEWNDNEMYLFRSSIAYAMREYFSKVKNQTIPFVEDNVWVSDLKPRISFNFFVTFSNNVSDVIPRSEV EDAIRMSRSRINDAFRLDDNSLEFLGIEPTLSPPYRPPVTIWLIVFGVVMGAIVVGIVLLIVSGIRNRRK NDQAGSEENPYASVDLNKGENNPGFQHADDVQTSF
>AGZ48803.1 angiotensin-converting enzyme 2 [Rhinolophus sinicus] MSGSSWLLLSLVAVTTAQSTTEDEAKMFLDKFNTKAEDLSHQSSLASWDYNTNINDENVQKMDEAGAKWS AFYEEQSKLAKNYSLEQIQNVTVKLQLQILQQSGSPVLSEDKSKRLNSILNAMSTIYSTGKVCKPNKPQE CLLLEPGLDNIMGTSKDYNERLWAWEGWRAEVGKQLRPLYEEYVVLKNEMARGYHYEDYGDYWRRDYETE ESPGPGYSRDQLMKDVERIFTEIKPLYEHLHAYVRAKLMDTYPFHISPTGCLPAHLLGDMWGRFWTNLYP LTVPFGQKPNIDVTDEMLKQGWDADRIFKEAEKFFVSVGLPNMTEGFWNNSMLTEPGDGRKVVCHPTAWD LGKGDFRIKMCTKVTMEDFLTAHHEMGHIQYDMAYASQPYLLRNGANEGFHEAVGEVMSLSVATPKHLKT MGLLSPDFREDNETEINFLLKQALNIVGTLPFTYMLEKWRWMVFKGEIPKEEWMKKWWEMKRKIVGVVEP VPHDETYCDPASLFHVANDYSFIRYYTRTIFEFQFHEALCRIAQHDGPLHKCDISNSTDAGKKLHQMLSV GKSQAWTKTLEDIVDSRNMDVGPLLKYFEPLYTWLQEQNRKSYVGWNTDWSPYSDQSIKVRISLKSALGE NAYEWNDNEMYLFRSSVAYAMREYFLKEKHQTILFGAENVWVSNLKPRISFNFHVTSPGNLSDIIPRPEV EGAIRMSRSRINDAFRLDDNSLEFLGIQPTLGPPYQPPVTIWLIVFGVVMAVVVVGIVVLIITGIRDRRK TDQARSEENPYSSVDLSKGENNPGFQNGDDVQTSF
>NP_001123985.1 angiotensin-converting enzyme 2 precursor [Mus musculus] MSSSSWLLLSLVAVTTAQSLTEENAKTFLNNFNQEAEDLSYQSSLASWNYNTNITEENAQKMSEAAAKWS AFYEEQSKTAQSFSLQEIQTPIIKRQLQALQQSGSSALSADKNKQLNTILNTMSTIYSTGKVCNPKNPQE CLLLEPGLDEIMATSTDYNSRLWAWEGWRAEVGKQLRPLYEEYVVLKNEMARANNYNDYGDYWRGDYEAE GADGYNYNRNQLIEDVERTFAEIKPLYEHLHAYVRRKLMDTYPSYISPTGCLPAHLLGDMWGRFWTNLYP LTVPFAQKPNIDVTDAMMNQGWDAERIFQEAEKFFVSVGLPHMTQGFWANSMLTEPADGRKVVCHPTAWD LGHGDFRIKMCTKVTMDNFLTAHHEMGHIQYDMAYARQPFLLRNGANEGFHEAVGEIMSLSAATPKHLKS IGLLPSDFQEDSETEINFLLKQALTIVGTLPFTYMLEKWRWMVFRGEIPKEQWMKKWWEMKREIVGVVEP LPHDETYCDPASLFHVSNDYSFIRYYTRTIYQFQFQEALCQAAKYNGSLHKCDISNSTEAGQKLLKMLSL GNSEPWTKALENVVGARNMDVKPLLNYFQPLFDWLKEQNRNSFVGWNTEWSPYADQSIKVRISLKSALGA NAYEWTNNEMFLFRSSVAYAMRKYFSIIKNQTVPFLEEDVRVSDLKPRVSFYFFVTSPQNVSDVIPRSEV EDAIRMSRGRINDVFGLNDNSLEFLGIHPTLEPPYQPPVTIWLIIFGVVMALVVVGIIILIVTGIKGRKK KNETKREENPYDSMDIGKGESNAGFQNSDDAQTSF
>AAW78017.1 angiotensin converting enzyme 2 [Rattus norvegicus] MSSSCWLLLSLVAVATAQSLIEEKAESFLNKFNQEAEDLSYQSSLASWNYNTNITEENAQKMNEAAAKWS AFYEEQSKIAQNFSLQEIQNATIKRQLKALQQSGSSALSPDKNKQLNTILNTMSTIYSTGKVCNSMNPQE CFLLEPGLDEIMATSTDYNRRLWAWEGWRAEVGKQLRPLYEEYVVLKNEMARANNYEDYGDYWRGDYEAE GVEGYNYNRNQLIEDVENTFKEIKPLYEQLHAYVRTKLMEVYPSYISPTGCLPAHLLGDMWGRFWTNLYP LTTPFLQKPNIDVTDAMVNQSWDAERIFKEAEKFFVSVGLPQMTPGFWTNSMLTEPGDDRKVVCHPTAWD LGHGDFRIKMCTKVTMDNFLTAHHEMGHIQYDMAYAKQPFLLRNGANEGFHEAVGEIMSLSAATPKHLKS IGLLPSNFQEDNETEINFLLKQALTIVGTLPFTYMLEKWRWMVFQDKIPREQWTKKWWEMKREIVGVVEP LPHDETYCDPASLFHVSNDYSFIRYYTRTIYQFQFQEALCQAAKHDGPLHKCDISNSTEAGQKLLNMLSL GNSGPWTLALENVVGSRNMDVKPLLNYFQPLFVWLKEQNRNSTVGWSTDWSPYADQSIKVRISLKSALGK NAYEWTDNEMYLFRSSVAYAMREYFSREKNQTVPFGEADVWVSDLKPRVSFNFFVTSPKNVSDIIPRSEV EEAIRMSRGRINDIFGLNDNSLEFLGIYPTLKPPYEPPVTIWLIIFGVVMGTVVVGIVILIVTGIKGRKK KNETKREENPYDSMDIGKGESNAGFQNSDDAQTSF
>NP_001116542.1 angiotensin-converting enzyme 2 precursor [Sus scrofa] MSGSFWLLLSLIPVTAAQSTTEELAKTFLEKFNLEAEDLAYQSSLASWTINTNITDENIQKMNDARAKWS AFYEEQSRIAKTYPLDEIQTLILKRQLQALQQSGTSGLSADKSKRLNTILNTMSTIYSSGKVLDPNNPQE CLVLEPGLDEIMENSKDYSRRLWAWESWRAEVGKQLRPLYEEYVVLENEMARANNYEDYGDYWRGDYEVT GTGDYDYSRNQLMEDVERTFAEIKPLYEHLHAYVRAKLMDAYPSRISPTGCLPAHLLGDMWGRFWTNLYP LTVPFGEKPSIDVTEAMVNQSWDAIRIFEEAEKFFVSIGLPNMTQGFWNNSMLTEPGDGRKVVCHPTAWD LGKGDFRIKMCTKVTMDDFLTAHHEMGHIQYDMAYAIQPYLLRNGANEGFHEAVGEIMSLSAATPHYLKA LGLLPPDFYEDSETEINFLLKQALTIVGTLPFTYMLEKWRWMVFKGEIPKEQWMQKWWEMKREIVGVVEP LPHDETYCDPACLFHVAEDYSFIRYYTRTIYQFQFHEALCRTAKHEGPLYKCDISNSTEAGQKLLQMLSL GKSEPWTLALENIVGVKTMDVKPLLSYFEPLLTWLKAQNGNSSVGWNTDWTPYADQSIKVRISLKSALGE DAYEWNDNEMYLFRSSIAYAMRNYFSSAKNETIPFGAVDVWVSDLKPRISFNFFVTSPANMSDIIPRSDV EKAISMSRSRINDAFRLDDNTLEFLGIQPTLGPPDEPPVTVWLIIFGVVMGLVVVGIVVLIFTGIRDRRK KKQASSEENPYGSMDLSKGESNSGFQNGDDIQTSF
>BAE53380.1 angiotensin I converting enzyme 2 [Mustela putorius furo] MLGSSWLLLSLAALTAAQSTTEDLAKTFLEKFNYEAEELSYQNSLASWNYNTNITDENIQKMNIAGAKWS AFYEEESQHAKTYPLEEIQDPIIKRQLRALQQSGSSVLSADKRERLNTILNAMSTIYSTGKACNPNNPQE CLLLEPGLDDIMENSKDYNERLWAWEGWRSEVGKQLRPLYEEYVALKNEMARANNYEDYGDYWRGDYEEE WADGYSYSRNQLIEDVEHTFTQIKPLYEHLHAYVRAKLMDAYPSRISPTGCLPAHLLGDMWGRFWTNLYP LMVPFRQKPNIDVTDAMVNQSWDARRIFEEAETFFVSVGLPNMTEGFWQNSMLTEPGDNRKVVCHPTAWD LGKRDFRIKMCTKVTMDDFLTAHHEMGHIQYDMAYAEQPFLLRNGANEGFHEAVGEIMSLSAATPNHLKN IGLLPPDFSEDSETDINFLLKQALTIVGTLPFTYMLEKWRWMVFKGEIPKEQWMQKWWEMKRDIVGVVEP LPHDETYCDPAALFHVANDYSFIRYYTRTIYQFQFQEALCQIAKHEGPLYKCDISNSSEAGQKLHEMLSL GRSKPWTFALERVVGAKTMDVRPLLNYFEPLFTWLKEQNRNSFVGWNTDWSPYADQSIKVRISLKSALGE KAYEWNDNEMYFFQSSIAYAMREYFSKVKNQTIPFVGKDVRVSDLKPRISFNFIVTSPENMSDIIPRADV EEAIRKSRGRINDAFRLDDNSLEFLGIQPTLEPPYQPPVTIWLIVFGVVMGVVVVGIFLLIFSGIRNRRK NNQARSEENPYASVDLSKGENNPGFQNVDDVQTSF
>AAX59005.1 angiotensin I converting enzyme 2 [Felis catus] MSGSFWLLLSFAALTAAQSTTEELAKTFLEKFNHEAEELSYQSSLASWNYNTNITDENVQKMNEAGAKWS AFYEEQSKLAKTYPLAEIHNTTVKRQLQALQQSGSSVLSADKSQRLNTILNAMSTIYSTGKACNPNNPQE CLLLEPGLDDIMENSKDYNERLWAWEGWRAEVGKQLRPLYEEYVALKNEMARANNYEDYGDYWRGDYEEE WTDGYNYSRSQLIKDVEHTFTQIKPLYQHLHAYVRAKLMDTYPSRISPTGCLPAHLLGDMWGRFWTNLYP LTVPFGQKPNIDVTDAMVNQSWDARRIFKEAEKFFVSVGLPNMTQGFWENSMLTEPGDSRKVVCHPTAWD LGKGDFRIKMCTKVTMDDFLTAHHEMGHIQYDMAYAVQPFLLRNGANEGFHEAVGEIMSLSAATPNHLKT IGLLSPGFSEDSETEINFLLKQALTIVGTLPFTYMLEKWRWMVFKGEIPKEQWMQKWWEMKREIVGVVEP VPHDETYCDPASLFHVANDYSFIRYYTRTIYQFQFQEALCRIAKHEGPLHKCDISNSSEAGKKLLQMLTL GKSKPWTLALEHVVGEKKMNVTPLLKYFEPLFTWLKEQNRNSFVGWNTDWRPYADQSIKVRISLKSALGD EAYEWNDNEMYLFRSSVAYAMREYFSKVKNQTIPFVEDNVWVSNLKPRISFNFFVTASKNVSDVIPRSEV EEAIRMSRSRINDAFRLDDNSLEFLGIQPTLSPPYQPPVTIWLIVFGVVMGVVVVGIVLLIVSGIRNRRK NNQARSEENPYASVDLSKGENNPGFQHADDVQTSF
>NP_001124604.1 angiotensin-converting enzyme 2 precursor [Pongo abelii] MSGSSWLLLSLVAVTAAQSTIEEQAKTFLDKFNHEAEDLFYQSSLASWNYNTNITEENVQNMNNAGDKWS AFLKEQSTLAQMYPLQEIQNLTVKLQLQALQQNGSSVLSEDKSKRLNTILNTMSTIYSTGKVCNPNNPQE CLLLEPGLNEIMANSLDYNERLWAWESWRSEVGKQLRPLYEEYVVLKNEMARANHYEDYGDYWRGDYEVN GVDSYDYSRGQLIEDVEHTFEEIKPLYEHLHAYVRAKLINAYPSYISPIGCLPAHLLGDMWGRFWTNLYS LTVPFGQKPNIDVTDAMVDQAWDAQRIFKEAEKFFVSVGLPNMTQRFWENSMLTDPGNVQKVVCHPTAWD LGKGDFRILMCTKVTMDDFLTAHHEMGHIQYDMAYAAQPFLLRNGANEGFHEAVGEIMSLSAATPKHLKS IGLLSPDFQEDNETEINFLLKQALTIVGTLPFTYMLEKWRWMVFKGEIPKDQWMKKWWEMKREIVGVVEP VPHDETYCDPASLFHVSNDYSFIRYYTRTLYQFQFQEALCQAAKHEGPLHKCDISNSTEAGQKLLNMLRL GKSEPWTLALENVVGAKNMNVRPLLDYFEPLFTWLKDQNKNSFVGWSTDWSPYADQSIKVRISLKSALGN KAYEWNDNEIYLFRSSVAYAMRKYFLEVKNQMILFGEEDVRVANLKPRISFNFFVTAPKNVSDIIPRTEV EKAIRMSRSRINDAFRLNDNSLEFLGIQPTLGPPNQPPVSIWLIVFGVVMGVIVVGIVVLIFTGIRDRKK KNKARNEENPYASIDISKGENNPGFQNTDDVQTSF
>AAY57872.1 angiotensin converting enzyme 2 [Chlorocebus aethiops] MSSSSWLLLSLVAVTAAQSTIEEQAKTFLDKFNHEAEDLFYQSSLASWNYNTNITEENVQNMNNAGEKWS AFLKEQSTLAQMYPLQAIQNLTVKLQLQALQQNGSSVLSEDKSKRLNTILNTMSTIHSTGKVCNPNNPQE CLLLDPGLNEIMEKSLDYNERLWAWEGWRSEVGKQLRPLYEEYVVLKNEMARANHYKDYGDYWRGDYEVN GVDGYDYNRDQLIEDVERTFEEIKPLYEHLHAYVRAKLMNAYPSYISPTGCLPAHLLGDMWGRFWTNLYS LTVPFGQKPNIDVTDAMVNQAWNAQRIFKEAEKFFVSVGLPNMTQGFWENSMLTDPGNVQKVVCHPTAWD LGKGDFRIIMCTKVTMDDFLTAHHEMGHIQYDMAYAAQPFLLRNGANEGFHEAVGEIMSLSAATPKHLKS IGLLSPDFQEDNETEINFLLKQALTIVGTLPFTYMLEKWRWMVFKGEIPKDQWMKKWWEMKREIVGVVEP VPHDETYCDPASLFHVSNDYSFIRYYTRTLYQFQFQEALCQAAKHEGPLHKCDISNSTEAGQKLLNMLKL GKSEPWTLALENVVGAKNMSVRPLLNYFEPLFTWLKDQNKNSFVGWSTDWSPYADQSIKVRISLKSALGA NAYKWNDNEMYLFRSSVAYAMRQYFLENKHQTILFGEEDVRVADLKPRISFNFYVTAPKNVSDIIPRTEV EEAIRFSRSRINDAFQLNDNSLEFLGIQSTLVPPYQSPITTWLIVFGVVMAVIVAGIVVLIFTGIRDRKK KNQARSEENPYASIDISKGENNPGFQNTDDVQTSF
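The records above keep each header and its sequence on a single line, with the sequence broken into space-separated chunks. As a minimal sketch (not part of the original protocol), text in that layout could be split into accession, organism, and sequence with plain Python. The function name `parse_flattened_fasta` and the toy records are hypothetical, and the code assumes every header ends with a bracketed organism name such as `[Homo sapiens]`.

```python
import re


def parse_flattened_fasta(text):
    """Parse FASTA records whose header and sequence share one line.

    Assumption: each header ends with a bracketed organism name, and
    everything after the closing bracket is sequence.
    Returns {accession: (organism, sequence)}.
    """
    records = {}
    # Every record starts with '>', so the first split piece is empty.
    for chunk in text.split(">")[1:]:
        header, _, residues = chunk.partition("]")
        accession = header.split()[0]
        organism = header[header.index("[") + 1:].strip()
        # Remove the spaces and line breaks that separate sequence chunks.
        records[accession] = (organism, re.sub(r"\s+", "", residues))
    return records


# Toy records (placeholders, not real GenBank entries).
toy_text = (
    ">ACC1.1 toy protein [Species one] MSSS WLLL "
    ">ACC2.1 toy protein [Species two] MSGS\nFWLL"
)
toy_records = parse_flattened_fasta(toy_text)
```

Keying the result by accession makes it easy to match a record to the truncated identifiers that appear in the alignment output.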
Part 2: Creating a sequence alignment with Phylogeny.fr
- I went to the website www.phylogeny.fr, clicked 'Phylogeny analysis', and then clicked 'One Click'.
- Then, I clicked on 'Upload your set of sequences in FASTA, EMBL, or NEXUS format'. I copied the protein sequences from the Week 4 Talk Page.
- I used Command-V to paste my sequences into the field and clicked 'Submit'.
- In order to properly align the sequences, I first pasted them into a Word document.
- I found the numbered tabs located just beneath the text 'One Click Mode' and clicked on the tab labeled '3. Alignment'. Before this, I saw the pages named 'Alignment results', 'Phylogeny results', and 'Tree rendering results'.
- Positions are color-coded to indicate their conservation. Blue highlighting means high conservation (the sequences are identical or very similar at that position), gray highlighting means lower conservation, and white highlighting means little conservation.
- Under Outputs, I clicked on Alignment in Clustal Format.
- This showed my sequences with the amount of conservation indicated below them. The amount of conservation corresponded to the color-coded highlights shown above.
- Key:
- “*” for invariant
- “:” for highly conserved
- “.” for weakly conserved
- Space for not conserved
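The key above can also be sketched in code. The example below assigns one symbol to each alignment column. The strong and weak residue groups are an assumption: they are the groups I understand Clustal W/X to use, and they should be checked against the Clustal documentation before relying on the output. The names `conservation_symbol` and `conservation_line` are hypothetical.

```python
# Assumed Clustal residue groups (verify against the Clustal documentation).
STRONG_GROUPS = ["STA", "NEQK", "NHQK", "NDEQ", "QHRK",
                 "MILV", "MILF", "HY", "FYW"]
WEAK_GROUPS = ["CSA", "ATV", "SAG", "STNK", "STPA", "SGND",
               "SNDEQK", "NDEQHK", "NEQHRK", "FVLIM", "HFY"]


def conservation_symbol(column):
    """Return '*', ':', '.', or ' ' for one alignment column."""
    residues = set(column)
    if "-" in residues:
        return " "  # a gap in any row counts as not conserved
    if len(residues) == 1:
        return "*"  # invariant
    if any(residues <= set(group) for group in STRONG_GROUPS):
        return ":"  # highly conserved
    if any(residues <= set(group) for group in WEAK_GROUPS):
        return "."  # weakly conserved
    return " "      # not conserved


def conservation_line(rows):
    """Build the symbol line for equal-length aligned rows."""
    return "".join(conservation_symbol(col) for col in zip(*rows))


# Toy rows (placeholders): invariant, strong, weak, gap, unconserved.
toy_rows = ["MSCAA", "MTS-W", "MAAAG"]
toy_symbols = conservation_line(toy_rows)
```

For the toy rows, the five columns come out as invariant, highly conserved, weakly conserved, not conserved (gap), and not conserved.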
- Below are the class's alignments:
XP_0061228 ----------MLSHL---------------WILC--SLTVVVKSQDITQE-AINFLSEFN XP_416822. ----------MLLHF---------------WLLC--GLSAVVTPQDVTQE-AQTFLAEFN ETE61880.1 MLMKQAPVRKPSSRSFTHPAFFDLKGNMLTWLCLTWSLVVLALAQDETK-VATKFLEQFD AGZ48803.1 ----------MSGSS---------------WLLL--SLVAVTTAQSTTEDEAKMFLDKFN NP_0011165 ----------MSGSF---------------WLLL--SLIPVTAAQSTTEELAKTFLEKFN XP_0313017 ----------MSGSF---------------WLLL--SLVAVTAAQSTTEELAKTFLEEFN QLH93383.1 ----------MSGSS---------------WLLL--SLVAVTAAQSTSDEEAKTFLEKFN BAE53380.1 ----------MLGSS---------------WLLL--SLAALTAAQSTTEDLAKTFLEKFN CCP86723.1 ------------------------------------------------------------ XP_0258425 ----------MSGSS---------------WLLL--SLAALTAAQST-EDLVNTFLEKFN AAX63775.1 ----------MSGSF---------------WLLL--SFAALTAAQSTTEELAKTFLETFN AAX59005.1 ----------MSGSF---------------WLLL--SFAALTAAQSTTEELAKTFLEKFN NP_0011239 ----------MSSSS---------------WLLL--SLVAVTTAQSLTEENAKTFLNNFN AAW78017.1 ----------MSSSC---------------WLLL--SLVAVATAQSLIEEKAESFLNKFN XP_0053160 MGSCPGARGKMLGSS---------------WLLL--SFVAVTAAQSTIEELAKTFLDKFN AAY57872.1 ----------MSSSS---------------WLLL--SLVAVTAAQSTIEEQAKTFLDKFN NP_0013583 ----------MSSSS---------------WLLL--SLVAVTAAQSTIEEQAKTFLDKFN NP_0011246 ----------MSGSS---------------WLLL--SLVAVTAAQSTIEEQAKTFLDKFN
XP_0061228 VQAEDLSYASSLASWNYNTNITDENAKKMNEAGAKWSVFYDEASTNASKYAIDKITNHTV XP_416822. VRAEDISYENSLASWNYNTNITEETARKMSEAGAKWAAFYEEASRNASRFSLANIQDAVT ETE61880.1 ARATDLYYNASIASWNYNTNLTEENAKIMHEKDNIFSKFYGEACRNASMFNVNHITDETI AGZ48803.1 TKAEDLSHQSSLASWDYNTNINDENVQKMDEAGAKWSAFYEEQSKLAKNYSLEQIQNVTV NP_0011165 LEAEDLAYQSSLASWTINTNITDENIQKMNDARAKWSAFYEEQSRIAKTYPLDEIQTLIL XP_0313017 HEAEDLSYQSSLASWNYNTNITDENVQKMNDARAKWSTFYEEKSKTAKTYPLEEIQNVTL QLH93383.1 SEAEELSYQSSLASWNYNTNITDENVQKMNVAGAKWSTFYEEQSKIAKNYQLQNIQNDTI BAE53380.1 YEAEELSYQNSLASWNYNTNITDENIQKMNIAGAKWSAFYEEESQHAKTYPLEEIQDPII CCP86723.1 ------------------------------------------------------------ XP_0258425 YEAEELSYQSSLASWDYNTNISDENVQKMNNAGAKWSAFYEEQSKLAKTYPLEEIQDSTV AAX63775.1 YEAQELSYQSSVASWNYNTNITDENAKNMNEAGAKWSAYYEEQSKLAQTYPLAEIQDAKI AAX59005.1 HEAEELSYQSSLASWNYNTNITDENVQKMNEAGAKWSAFYEEQSKLAKTYPLAEIHNTTV NP_0011239 QEAEDLSYQSSLASWNYNTNITEENAQKMSEAAAKWSAFYEEQSKTAQSFSLQEIQTPII AAW78017.1 QEAEDLSYQSSLASWNYNTNITEENAQKMNEAAAKWSAFYEEQSKIAQNFSLQEIQNATI XP_0053160 QEAEDLDYQRSLAAWNYNTNITEENTQKMNEAEAKWSAFYEEQSKLATAYPLQEIQNFTL AAY57872.1 HEAEDLFYQSSLASWNYNTNITEENVQNMNNAGEKWSAFLKEQSTLAQMYPLQAIQNLTV NP_0013583 HEAEDLFYQSSLASWNYNTNITEENVQNMNNAGDKWSAFLKEQSTLAQMYPLQEIQNLTV NP_0011246 HEAEDLFYQSSLASWNYNTNITEENVQNMNNAGDKWSAFLKEQSTLAQMYPLQEIQNLTV
XP_0061228 KLQLQSLQGKGTSVLSGEKYNELNKILSTMSTFYSTGTVCKPDNPDICLPLEPGLDAIMA XP_416822. RLQIQSLQDRGSSVLSPEKYSRLNSVMNSMSTIYSTGVVCKATEPFDCLVLEPGLDDIMA ETE61880.1 KLQIRLLQ-SGSTDSTKD---QLDTVLHKMSTLYS-------------------LDDIMA AGZ48803.1 KLQLQILQQSGSPVLSEDKSKRLNSILNAMSTIYSTGKVCKPNKPQECLLLEPGLDNIMG NP_0011165 KRQLQALQQSGTSGLSADKSKRLNTILNTMSTIYSSGKVLDPNNPQECLVLEPGLDEIME XP_0313017 KRQLQALQQSGASALSADKSKRLTTVLSTMSTIYSSGEVCDPNNPQECLVLEPGLDDIME QLH93383.1 KRQLQALQLSGSSALSADKNQRLNTILNTMSTIYSTGKVCNPGNPQECSLLEPGLDNIME BAE53380.1 KRQLRALQQSGSSVLSADKRERLNTILNAMSTIYSTGKACNPNNPQECLLLEPGLDDIME CCP86723.1 ------------------------------------------------------------ XP_0258425 KRQLRALQHSGSSVLSADKNQRLNTILNSMSTIYSTGKACNPSNPQECLLLEPGLDDIME AAX63775.1 KRQLQALQQSGSSVLSADKSQRLNTILNAMSTIYSTGKACNPNNPQECLLLEPGLDNIME AAX59005.1 KRQLQALQQSGSSVLSADKSQRLNTILNAMSTIYSTGKACNPNNPQECLLLEPGLDDIME NP_0011239 KRQLQALQQSGSSALSADKNKQLNTILNTMSTIYSTGKVCNPKNPQECLLLEPGLDEIMA AAW78017.1 KRQLKALQQSGSSALSPDKNKQLNTILNTMSTIYSTGKVCNSMNPQECFLLEPGLDEIMA XP_0053160 KRQLQALQQSGSSALSANKREQLNTILNTMSTIYSTGKVCNPKKPQECLLLEPGLDEIMA AAY57872.1 KLQLQALQQNGSSVLSEDKSKRLNTILNTMSTIHSTGKVCNPNNPQECLLLDPGLNEIME NP_0013583 KLQLQALQQNGSSVLSEDKSKRLNTILNTMSTIYSTGKVCNPDNPQECLLLEPGLNEIMA NP_0011246 KLQLQALQQNGSSVLSEDKSKRLNTILNTMSTIYSTGKVCNPNNPQECLLLEPGLNEIMA
XP_0061228 SSTDYFERLWAWEGWRADVGKKMRELYERYVELENEAARLNKYSDYGDYWRGNYEVNDPT XP_416822. NSIDYHERLWAWEGWRADVGRMMRPLYEEYVELKNEAARLNNYSDYGDYWRANYETDYPE ETE61880.1 NNWNYPERLWAWEGWRANVGKKMRPLYETYVELKNKYARLRGYADYGDYWRANYEVDLPG AGZ48803.1 TSKDYNERLWAWEGWRAEVGKQLRPLYEEYVVLKNEMARGYHYEDYGDYWRRDYETEESP NP_0011165 NSKDYSRRLWAWESWRAEVGKQLRPLYEEYVVLENEMARANNYEDYGDYWRGDYEVTGTG XP_0313017 NSKDYNQRLWAWEGWRAEVGKQLRPLYEEYVVLKNEMARANNYEDYGDYWRGDYEVMWAG QLH93383.1 SSKDYNERLWAWEGWRSEVGKQLRPLYEEYVVLKNEMARANHYEDYGDYWRGDYETEGAN BAE53380.1 NSKDYNERLWAWEGWRSEVGKQLRPLYEEYVALKNEMARANNYEDYGDYWRGDYEEEWAD CCP86723.1 ------------------------------------------------------------ XP_0258425 NSKDYNERLWAWEGWRSEVGKQLRPLYEEYVALKNEMARANNYEDYGDYWRGDYEEEWEN AAX63775.1 NSKDYNERLWAWEGWRAEVGKQLRPLYEEYVALKNEMARANNYEDYGDYWRGDYEEEWTG AAX59005.1 NSKDYNERLWAWEGWRAEVGKQLRPLYEEYVALKNEMARANNYEDYGDYWRGDYEEEWTD NP_0011239 TSTDYNSRLWAWEGWRAEVGKQLRPLYEEYVVLKNEMARANNYNDYGDYWRGDYEAEGAD AAW78017.1 TSTDYNRRLWAWEGWRAEVGKQLRPLYEEYVVLKNEMARANNYEDYGDYWRGDYEAEGVE XP_0053160 NSTDYNERLWVWEGWRSEVGKQLRPLYEEYVVLKNEMARANNYEDYGDYWRGDYEAEGAD AAY57872.1 KSLDYNERLWAWEGWRSEVGKQLRPLYEEYVVLKNEMARANHYKDYGDYWRGDYEVNGVD NP_0013583 NSLDYNERLWAWESWRSEVGKQLRPLYEEYVVLKNEMARANHYEDYGDYWRGDYEVNGVD NP_0011246 NSLDYNERLWAWESWRSEVGKQLRPLYEEYVVLKNEMARANHYEDYGDYWRGDYEVNGVD
XP_0061228 EYAYSRNQLMEDVEATFEQIKPLYRELHAYVRYRLEKFYGSDHISSTGCLPAHLLGDMWG XP_416822. EYKYSRDQLVQDVEKTFEQIKPLYQHLHAYVRHRLEQVYGSELINPTGCLPAHLLGDMWG ETE61880.1 KFQYQREQLITDVESTFKQ------QLHAYVRHHLYKRYGPELINPEGAIPAHLLGDMWG AGZ48803.1 GPGYSRDQLMKDVERIFTEIKPLYEHLHAYVRAKLMDTY-PFHISPTGCLPAHLLGDMWG NP_0011165 DYDYSRNQLMEDVERTFAEIKPLYEHLHAYVRAKLMDAY-PSRISPTGCLPAHLLGDMWG XP_0313017 DYDYSRDQLMGDVEHTFAEIKPLYEHLHAYVRAKLMDVY-PSHISPTGCLPAHLLGDMWG QLH93383.1 GYNYSRDHLIEDVEHIFTQIKPLYEHLHAYVRAKLMDNY-PSHISPTGCLPAHLLGDMWG BAE53380.1 GYSYSRNQLIEDVEHTFTQIKPLYEHLHAYVRAKLMDAY-PSRISPTGCLPAHLLGDMWG CCP86723.1 ------------------------------------------------------------ XP_0258425 GYNYSRNQLIDDVEHTFTQIMPLYQHLHAYVRTKLMDTY-PSYISPTGCLPAHLLGDMWG AAX63775.1 GYNYSRNQLIQDVEDTFEQIKPLYQHLHAYVRAKLMDTY-PSRISRTGCLPAHLLGDMWG AAX59005.1 GYNYSRSQLIKDVEHTFTQIKPLYQHLHAYVRAKLMDTY-PSRISPTGCLPAHLLGDMWG NP_0011239 GYNYNRNQLIEDVERTFAEIKPLYEHLHAYVRRKLMDTY-PSYISPTGCLPAHLLGDMWG AAW78017.1 GYNYNRNQLIEDVENTFKEIKPLYEQLHAYVRTKLMEVY-PSYISPTGCLPAHLLGDMWG XP_0053160 GYGYNRNQLIEDVERTFAEIKPLYEHLHAYVRAKLMNTY-PSYISPTGCLPAHLLGDMWG AAY57872.1 GYDYNRDQLIEDVERTFEEIKPLYEHLHAYVRAKLMNAY-PSYISPTGCLPAHLLGDMWG NP_0013583 GYDYSRGQLIEDVEHTFEEIKPLYEHLHAYVRAKLMNAY-PSYISPIGCLPAHLLGDMWG NP_0011246 SYDYSRGQLIEDVEHTFEEIKPLYEHLHAYVRAKLINAY-PSYISPIGCLPAHLLGDMWG
XP_0061228 RFWTNLYALTVPYPDKPNIDVTSEMVKKNWNATKIFKAAEDFFMSVGLYKMTEGFWKNSM XP_416822. RFWTNLYNLTVPYPEKPNIDVTSAMAQKNWDAMKIFKTAEAFFASIGLYNMTEGFWTNSM ETE61880.1 RFWTNLYPLMVPYPNKTSIDVTSAMEKKKWTVNSIFKAAEHFFISIGLFNMTVGFWKNSM AGZ48803.1 RFWTNLYPLTVPFGQKPNIDVTDEMLKQGWDADRIFKEAEKFFVSVGLPNMTEGFWNNSM NP_0011165 RFWTNLYPLTVPFGEKPSIDVTEAMVNQSWDAIRIFEEAEKFFVSIGLPNMTQGFWNNSM XP_0313017 RFWTNLYSLTVPFGQKPNIDVTEAMENQSWDAKRIFKEAEKFFVSIGLPNMTQGFWDNSM QLH93383.1 RFWTNLYPLTVPFRQKPNIDVTDAMVNQTWDANRIFKEAEKFFVSVGLPKMTQTFWENSM BAE53380.1 RFWTNLYPLMVPFRQKPNIDVTDAMVNQSWDARRIFEEAETFFVSVGLPNMTEGFWQNSM CCP86723.1 ----------------------------------------------GLPNMTEGFWQNSM XP_0258425 RFWTNLYPLTVPFGQKPNIDVTNAMVNQSWDARKIFKEAEKFFVSVGLPNMTQGFWENSM AAX63775.1 RFWTNLYPLTVPFGQKPNIDVTDAMVNQNWDARRIFKEAEKFFVSVGLPNMTQGFWENSM AAX59005.1 RFWTNLYPLTVPFGQKPNIDVTDAMVNQSWDARRIFKEAEKFFVSVGLPNMTQGFWENSM NP_0011239 RFWTNLYPLTVPFAQKPNIDVTDAMMNQGWDAERIFQEAEKFFVSVGLPHMTQGFWANSM AAW78017.1 RFWTNLYPLTTPFLQKPNIDVTDAMVNQSWDAERIFKEAEKFFVSVGLPQMTPGFWTNSM XP_0053160 RFWTNLYSLTVPFPEKPNIDVTDAMINQNWNAVRIFKEAEKFFVSVGLPNMTQGFWENSM AAY57872.1 RFWTNLYSLTVPFGQKPNIDVTDAMVNQAWNAQRIFKEAEKFFVSVGLPNMTQGFWENSM NP_0013583 RFWTNLYSLTVPFGQKPNIDVTDAMVDQAWDAQRIFKEAEKFFVSVGLPNMTQGFWENSM NP_0011246 RFWTNLYSLTVPFGQKPNIDVTDAMVDQAWDAQRIFKEAEKFFVSVGLPNMTQRFWENSM ** :** ** ***
XP_0061228 ITEPNDGRKVVCHPTAWDMGKKDYRIKMCTKVSMDDFLTVHHEMGHIEYDMAYSNLSYLL XP_416822. LTEPTDNRKVVCHPTAWDMGKNDYRIKMCTKVTMDDFLTAHHEMGHIEYDMAYSVQPFLL ETE61880.1 LEEPKGGRKVVCHPTAWDMGKEDYRIKMCTKINMEDFLTAHHEMGHIEYDMAYANQPFLL AGZ48803.1 LTEPGDGRKVVCHPTAWDLGKGDFRIKMCTKVTMEDFLTAHHEMGHIQYDMAYASQPYLL NP_0011165 LTEPGDGRKVVCHPTAWDLGKGDFRIKMCTKVTMDDFLTAHHEMGHIQYDMAYAIQPYLL XP_0313017 LTEPGDGRKVVCHPTAWDLGKGDFRIKMCTKVTMDDFLTAHHEMGHIQYDMAYAIQPFLL QLH93383.1 LTEPGDGRKVVCHPTAWDLGKHDFRIKMCTKVTMDDFLTAHHEMGHIQYDMAYAMQPYLL BAE53380.1 LTEPGDNRKVVCHPTAWDLGKRDFRIKMCTKVTMDDFLTAHHEMGHIQYDMAYAEQPFLL CCP86723.1 LTEPGDNRKVVCHPTAWDLGKHDFRIKMCTKVTMDDFLTAHHEMGHIQYDMAYAAQPFLL XP_0258425 LTEPSDSRKVVCHPTAWDLGKGDFRIKMCTKVTMDDFLTAHHEMGHIQYDMAYAAQPFLL AAX63775.1 LTEPGDGRKVVCHPTAWDLGKGDFRIKMCTKVTMDDFLTAHHEMGHIQYDMAYAAQPFLL AAX59005.1 LTEPGDSRKVVCHPTAWDLGKGDFRIKMCTKVTMDDFLTAHHEMGHIQYDMAYAVQPFLL NP_0011239 LTEPADGRKVVCHPTAWDLGHGDFRIKMCTKVTMDNFLTAHHEMGHIQYDMAYARQPFLL AAW78017.1 LTEPGDDRKVVCHPTAWDLGHGDFRIKMCTKVTMDNFLTAHHEMGHIQYDMAYAKQPFLL XP_0053160 LTEPTDGRKVVCHPTAWDLQKGDFRIKMCTKVTMDNFLTAHHEMGHIQYDMAYAMQPYLL AAY57872.1 LTDPGNVQKVVCHPTAWDLGKGDFRIIMCTKVTMDDFLTAHHEMGHIQYDMAYAAQPFLL NP_0013583 LTDPGNVQKAVCHPTAWDLGKGDFRILMCTKVTMDDFLTAHHEMGHIQYDMAYAAQPFLL NP_0011246 LTDPGNVQKVVCHPTAWDLGKGDFRILMCTKVTMDDFLTAHHEMGHIQYDMAYAAQPFLL : :* . .*.********: : *:** ****:.*::***.*******:*****: .:**
XP_0061228 RSGANEGFHEAVGEIMSLSAATPKHLKSLDLLEPTFQEDNETDINFLLKQALTIVGTMPF XP_416822. RNGANEGFHEAVGEIMSLSAATPQHLKSLDLLEPTFQEDEETEINFLLKQALTIVGTMPF ETE61880.1 RNGANEGFHEAVGEIMSLSAATPKYLKSLGLLEPTFQEDAETDINFLLKQALTIVGTMPF AGZ48803.1 RNGANEGFHEAVGEVMSLSVATPKHLKTMGLLSPDFREDNETEINFLLKQALNIVGTLPF NP_0011165 RNGANEGFHEAVGEIMSLSAATPHYLKALGLLPPDFYEDSETEINFLLKQALTIVGTLPF XP_0313017 RNGANEGFHEAVGEIMSLSAATPHYLKALGLLPADFYEDSETEINFLLKQALTIVGTLPF QLH93383.1 RNGANEGFHEAVGEIMSLSAATPKHLKNIGLLPPDFYEDNETEINFLLKQALTIVGTLPF BAE53380.1 RNGANEGFHEAVGEIMSLSAATPNHLKNIGLLPPDFSEDSETDINFLLKQALTIVGTLPF CCP86723.1 RNGANEGFHEAVGEIMSLSAATPNHLKNIGLLPPDFSEDSETDINFLLKQALTIVGTLPF XP_0258425 RNGANEGFHEAVGEIMSLSAATPNHLKNIGLLPPSFFEDSETEINFLLKQALTIVGTLPF AAX63775.1 RNGANEGFHEAVGEIMSLSAATPNHLKTIGLLSPAFSEDNETEINFLLKQALTIVGTLPF AAX59005.1 RNGANEGFHEAVGEIMSLSAATPNHLKTIGLLSPGFSEDSETEINFLLKQALTIVGTLPF NP_0011239 RNGANEGFHEAVGEIMSLSAATPKHLKSIGLLPSDFQEDSETEINFLLKQALTIVGTLPF AAW78017.1 RNGANEGFHEAVGEIMSLSAATPKHLKSIGLLPSNFQEDNETEINFLLKQALTIVGTLPF XP_0053160 RNGANEGFHEAVGEIMSLSASTPKHLKSIGLLPSDFREDSETEINFLLKQALTIVGTLPF AAY57872.1 RNGANEGFHEAVGEIMSLSAATPKHLKSIGLLSPDFQEDNETEINFLLKQALTIVGTLPF NP_0013583 RNGANEGFHEAVGEIMSLSAATPKHLKSIGLLSPDFQEDNETEINFLLKQALTIVGTLPF NP_0011246 RNGANEGFHEAVGEIMSLSAATPKHLKSIGLLSPDFQEDNETEINFLLKQALTIVGTLPF *.************:****.:**::** :.** . * ** **:*********.****:**
XP_0061228 TYMLEKWRWMVFKGDIPKDEWMKKWWEMKRAIVGVVEPVPHDETYCDPAALFHVANDYSF XP_416822. TYMLEKWRWMVFNGEITKQEWTKRWWKMKREIVGVVEPVPHDETYCDPAALFHVANDYSF ETE61880.1 TYMLEKWRWMVFAEQIPKDQWMKKWWEMKREIVGVVEPLPHNEEYCDPAALFHVANDYSF AGZ48803.1 TYMLEKWRWMVFKGEIPKEEWMKKWWEMKRKIVGVVEPVPHDETYCDPASLFHVANDYSF NP_0011165 TYMLEKWRWMVFKGEIPKEQWMQKWWEMKREIVGVVEPLPHDETYCDPACLFHVAEDYSF XP_0313017 TYMLEKWRWMVFKGEIPKEQWMQKWWEMKREIVGVVEPLPHDETYCDPACLFHVAEDYSF QLH93383.1 TYMLEKWRWMVFSGQIPKEQWMKKWWEMKREIVGVVEPVPHDETYCDPASLFHVANDYSF BAE53380.1 TYMLEKWRWMVFKGEIPKEQWMQKWWEMKRDIVGVVEPLPHDETYCDPAALFHVANDYSF CCP86723.1 TYMLEKWRWMVFKGEIPKEQWMQKWWEMKRDIVGVVEPLPHDETYCDPAALFHVANDYSF XP_0258425 TYMLEKWRWMVFKGEIPKDQWMKTWWEMKRNIVGVVEPVPHDETYCDPASLFHVANDYSF AAX63775.1 TYMLEKWRWMVFKGAIPKEQWMQKWWEMKRNIVGVVEPVPHDETYCDPASLFHVANDYSF AAX59005.1 TYMLEKWRWMVFKGEIPKEQWMQKWWEMKREIVGVVEPVPHDETYCDPASLFHVANDYSF NP_0011239 TYMLEKWRWMVFRGEIPKEQWMKKWWEMKREIVGVVEPLPHDETYCDPASLFHVSNDYSF AAW78017.1 TYMLEKWRWMVFQDKIPREQWTKKWWEMKREIVGVVEPLPHDETYCDPASLFHVSNDYSF XP_0053160 TYMLEKWRWMVFKGEIPKDQWMKKWWEMKREIVGVMEPVPHDETYCDPAALYHVSNDFSF AAY57872.1 TYMLEKWRWMVFKGEIPKDQWMKKWWEMKREIVGVVEPVPHDETYCDPASLFHVSNDYSF NP_0013583 TYMLEKWRWMVFKGEIPKDQWMKKWWEMKREIVGVVEPVPHDETYCDPASLFHVSNDYSF NP_0011246 TYMLEKWRWMVFKGEIPKDQWMKKWWEMKREIVGVVEPVPHDETYCDPASLFHVSNDYSF ************ *..::* : **:*** ****:**:**:* *****.*:**::*:**
XP_0061228 IRYYTRTIYQFQFQEALCKAANHGGLLHTCDITNSMAAGQKLRDMLALGRSQPWTKALES XP_416822. IRYYTRTIYQFQFQEALCKAANHTGPLHKCDITNSTAAGGNLRQLLELGKSKPWTQALES ETE61880.1 IRYYTRTIYQFQFQEALCKAAGHTKELYKCDISDSTNAGRILKDMLALGSSQPWTKALES AGZ48803.1 IRYYTRTIFEFQFHEALCRIAQHDGPLHKCDISNSTDAGKKLHQMLSVGKSQAWTKTLED NP_0011165 IRYYTRTIYQFQFHEALCRTAKHEGPLYKCDISNSTEAGQKLLQMLSLGKSEPWTLALEN XP_0313017 IRYYTRTIYQFQFHEALCQIAKHEGPLYKCDISNSTEAGQKLLQMLSLGKSEPWTLALEG QLH93383.1 IRYYTRTIYQFQFQEALCQTAKHEGPLHKCDISNSTEAGQKLLQMLSLGKSKPWTLALER BAE53380.1 IRYYTRTIYQFQFQEALCQIAKHEGPLYKCDISNSSEAGQKLHEMLSLGRSKPWTFALER CCP86723.1 IRYYTRTIYQFQFQEALCQIAKHEGPLYKCDISNSREAGQKLHEMLSLGRSKPWTFALER XP_0258425 IRYYTRTIYQFQFQEALCQIAKHEGPLHKCDISNSSEAGQKLLEMLKLGKSKPWTYALEI AAX63775.1 IRYYTRTIYQFQFQEALCQIAKHEGPLHKCDISNSTEAGKKLLEMLSLGRSEPWTLALER AAX59005.1 IRYYTRTIYQFQFQEALCRIAKHEGPLHKCDISNSSEAGKKLLQMLTLGKSKPWTLALEH NP_0011239 IRYYTRTIYQFQFQEALCQAAKYNGSLHKCDISNSTEAGQKLLKMLSLGNSEPWTKALEN AAW78017.1 IRYYTRTIYQFQFQEALCQAAKHDGPLHKCDISNSTEAGQKLLNMLSLGNSGPWTLALEN XP_0053160 IRYYTRTIYQFQFQEALCQAAKHEGPLHKCDISNSTEAGQKLLNMLRLGKSKPWTLALEN AAY57872.1 IRYYTRTLYQFQFQEALCQAAKHEGPLHKCDISNSTEAGQKLLNMLKLGKSEPWTLALEN NP_0013583 IRYYTRTLYQFQFQEALCQAAKHEGPLHKCDISNSTEAGQKLFNMLRLGKSEPWTLALEN NP_0011246 IRYYTRTLYQFQFQEALCQAAKHEGPLHKCDISNSTEAGQKLLNMLRLGKSEPWTLALEN *******:::***:****. * : *:.***::* ** * .:* :* * .** :**
XP_0061228 ITGEKKMNATPLLHYFEPLYQWLIKNNSGRAVGWNTFWSPYSGNAIKVRISLKTALGDNA XP_416822. ATGEKYMNATPLLHYFEPLFNWLQKNNSGRSIGWNTDWTPYSDNAIKVRISLKAALGDDA ETE61880.1 ITGSLKMDAKPFCQYFDPLLKWLEKTNSNENVGWNVNWTPYSKDAIKVRISLKAALGDDA AGZ48803.1 IVDSRNMDVGPLLKYFEPLYTWLQEQNRKSYVGWNTDWSPYSDQSIKVRISLKSALGENA NP_0011165 IVGVKTMDVKPLLSYFEPLLTWLKAQNGNSSVGWNTDWTPYADQSIKVRISLKSALGEDA XP_0313017 LVGVKTMDVKPLLNYFEPLLTWLKDQNRNSFVGWSTDWTPYTDQSIKVRISLKSALGDKA QLH93383.1 VVGTKNMDVRPLLNYFEPLLTWLKEQNKNSFVGWNTDWSPYAAQSIKVRISLKSALGEKA BAE53380.1 VVGAKTMDVRPLLNYFEPLFTWLKEQNRNSFVGWNTDWSPYADQSIKVRISLKSALGEKA CCP86723.1 VVGAKTMDVRPLLNYFEPLFTWLKEQNRNSFVGWNTDWSPYADQSIKVRISLKSALGEKA XP_0258425 VVGAKNMDVRPLLNYFEPLFTWLKEQNRNSFVGWNTDWSPYADQSIKVRISLKSALGEKA AAX63775.1 VVGAKNMNVTPLLNYFEPLFTWLKEQNRNSFVGWDTDWRPYSDQSIKVRISLKSALGEKA AAX59005.1 VVGEKKMNVTPLLKYFEPLFTWLKEQNRNSFVGWNTDWRPYADQSIKVRISLKSALGDEA NP_0011239 VVGARNMDVKPLLNYFQPLFDWLKEQNRNSFVGWNTEWSPYADQSIKVRISLKSALGANA AAW78017.1 VVGSRNMDVKPLLNYFQPLFVWLKEQNRNSTVGWSTDWSPYADQSIKVRISLKSALGKNA XP_0053160 VVGARNMDVRPLLNYFEPLFGWLKDQNRNSFVGWNTNWSPYTDQSIKVRISLKSALGEEA AAY57872.1 VVGAKNMSVRPLLNYFEPLFTWLKDQNKNSFVGWSTDWSPYADQSIKVRISLKSALGANA NP_0013583 VVGAKNMNVRPLLNYFEPLFTWLKDQNKNSFVGWSTDWSPYADQSIKVRISLKSALGDKA NP_0011246 VVGAKNMNVRPLLDYFEPLFTWLKDQNKNSFVGWSTDWSPYADQSIKVRISLKSALGNKA .. *.. *: **:** ** * :**.. * **: ::********:*** .*
XP_0061228 YEWDENELYFFKSSIAYAMRKYFLEVKNQTVSFQCTDIHVWAVTQRVSFYFAVSMPGNAT XP_416822. YVWDASELFLFKSSIAYAMRKYFAKEKEQNVDFQVTDIHVGEETQRVSFYLTVSMPGNVS ETE61880.1 YNWDESEMFLFKSTIAYAMQKYFLEVKNKTVPF--------------------------- AGZ48803.1 YEWNDNEMYLFRSSVAYAMREYFLKEKHQTILFGAENVWVSNLKPRISFNFHVTSPGNLS NP_0011165 YEWNDNEMYLFRSSIAYAMRNYFSSAKNETIPFGAVDVWVSDLKPRISFNFFVTSPANMS XP_0313017 YEWNDNEMYLFQSSLAYAMRKYFLKVQNQTILFGVEDVWVSDLKPRISFSFFVTSPKNVS QLH93383.1 YEWNDSEMYLFRSSVAYAMREYFSKFKKQTIPFEEESVRVSDLKPRVSFIFFVTLPKNVS BAE53380.1 YEWNDNEMYFFQSSIAYAMREYFSKVKNQTIPFVGKDVRVSDLKPRISFNFIVTSPENMS CCP86723.1 YEWNDNEMYFFQSSIAYAMREYFSKVKKQTIPFVDKDVRVSDLKPRISFNFIVTSPENMS XP_0258425 YEWNNNEMYLFRSSIAYAMRRYFSEVKKQTIPFVEDNVWVSDLKPRISFNFFVTSPGNVS AAX63775.1 YEWNDNEMYLFRSSIAYAMREYFSKVKNQTIPFVEDNVWVSDLKPRISFNFFVTFSNNVS AAX59005.1 YEWNDNEMYLFRSSVAYAMREYFSKVKNQTIPFVEDNVWVSNLKPRISFNFFVTASKNVS NP_0011239 YEWTNNEMFLFRSSVAYAMRKYFSIIKNQTVPFLEEDVRVSDLKPRVSFYFFVTSPQNVS AAW78017.1 YEWTDNEMYLFRSSVAYAMREYFSREKNQTVPFGEADVWVSDLKPRVSFNFFVTSPKNVS XP_0053160 YQWNDNEMYLFRSSVAYAMRMYFSKVKNQTIPFGEKDVWVSDEKPRISFNFFVTAPQNVS AAY57872.1 YKWNDNEMYLFRSSVAYAMRQYFLENKHQTILFGEEDVRVADLKPRISFNFYVTAPKNVS NP_0013583 YEWNDNEMYLFRSSVAYAMRQYFLKVKNQMILFGEEDVRVANLKPRISFNFFVTAPKNVS NP_0011246 YEWNDNEIYLFRSSVAYAMRKYFLEVKNQMILFGEEDVRVANLKPRISFNFFVTAPKNVS * * .*:::*.*::****. ** : : : *
XP_0061228 DFIPKSEVETAIRMSRGRINEAFRLDDNTLEFEGLLPTLASPYEPPVTVWLILFGVVMGV XP_416822. DIVPRADVEKAIRMSRGRISEAFRLDDNTLEFDGIVPTLATPYKPPVTIWLILFGVVMSL ETE61880.1 ------------HLSRDRINEAFKLTDQTLEFIGLLPTLAPPYESPITVWLVAFGVVIGL AGZ48803.1 DIIPRPEVEGAIRMSRSRINDAFRLDDNSLEFLGIQPTLGPPYQPPVTIWLIVFGVVMAV NP_0011165 DIIPRSDVEKAISMSRSRINDAFRLDDNTLEFLGIQPTLGPPDEPPVTVWLIIFGVVMGL XP_0313017 DIIPRTEVEEAIRMSRSRINDAFRLDDNSLEFLGIQPTLGPPYEPPVTVWLIIFGIVMGL QLH93383.1 AVIPRAEVEEAIRMSRSRINDVFRLDDNSLEFLGIQPTLEPPYQPPVTIWLIVFGVVMGV BAE53380.1 DIIPRADVEEAIRKSRGRINDAFRLDDNSLEFLGIQPTLEPPYQPPVTIWLIVFGVVMGV CCP86723.1 DIIPRADVEEAIRKSRGRINDAFRLDDNSLEFLGIQPTLEPPYQPPVTIWLIVFGVVMGV XP_0258425 DIIPRTEVEKAIRMYRGRINDVFRLDDNSLEFLGIQPTLGPSYEPPVTIWLIVFGVVMGV AAX63775.1 DVIPRSEVEDAIRMSRSRINDAFRLDDNSLEFLGIEPTLSPPYRPPVTIWLIVFGVVMGA AAX59005.1 DVIPRSEVEEAIRMSRSRINDAFRLDDNSLEFLGIQPTLSPPYQPPVTIWLIVFGVVMGV NP_0011239 DVIPRSEVEDAIRMSRGRINDVFGLNDNSLEFLGIHPTLEPPYQPPVTIWLIIFGVVMAL AAW78017.1 DIIPRSEVEEAIRMSRGRINDIFGLNDNSLEFLGIYPTLKPPYEPPVTIWLIIFGVVMGT XP_0053160 DIIPRTDVEKAIRMSRGRINGVFRLDDNSLEFLGIQPTLGPPYQPPVTIWLIVFGVVMGL AAY57872.1 DIIPRTEVEEAIRFSRSRINDAFQLNDNSLEFLGIQSTLVPPYQSPITTWLIVFGVVMAV NP_0013583 DIIPRTEVEKAIRMSRSRINDAFRLNDNSLEFLGIQPTLGPPNQPPVSIWLIVFGVVMGV NP_0011246 DIIPRTEVEKAIRMSRSRINDAFRLNDNSLEFLGIQPTLGPPNQPPVSIWLIVFGVVMGV *.**. * * *::*** *: .** .. .*:: **: **:*:.
XP_0061228 IVVGVIVLIVTGQRDRRKRMKAGTNELVQTNAIDP------ELENGEVNPAFIKHEERQT XP_416822. IVIGVIVLIITGQRDKRKKARGRANEAGSNCEVNPYD------EDGRSNKGFEQSEETQT ETE61880.1 IVIGIITLEKAGSKN--------------------------------------------- AGZ48803.1 VVVGIVVLIITGIRDRRKTDQARSEE-------NPYS--SVDLSKGENNPGFQNGDDVQT NP_0011165 VVVGIVVLIFTGIRDRRKKKQASSEE-------NPYG--SMDLSKGESNSGFQNGDDIQT XP_0313017 VVVGIVVLIFTGIRDRRKKKQASTEE-------NPYG--SVDLSKGENNSGFQNGDDVQT QLH93383.1 IVVGIVVLIFTGIRDRKKKNQARSEQ-------NPYA--SVDLSKGENNPGFQNVDDVQT BAE53380.1 VVVGIFLLIFSGIRNRRKNNQARSEE-------NPYA--SVDLSKGENNPGFQNVDDVQT CCP86723.1 VVVGIFLLIFSGIRNRRKNNQARSEE-------NPYA--SVDLSKG-------------- XP_0258425 VVVGIVLLIFSGIRNRRKNDQARGEE-------NPYA--SVDLSKGENNPGFQNVDDAQT AAX63775.1 IVVGIVLLIVSGIRNRRKNDQAGSEE-------NPYA--SVDLNKGENNPGFQHADDVQT AAX59005.1 VVVGIVLLIVSGIRNRRKNNQARSEE-------NPYA--SVDLSKGENNPGFQHADDVQT NP_0011239 VVVGIIILIVTGIKGRKKKNETKREE-------NPYD--SMDIGKGESNAGFQNSDDAQT AAW78017.1 VVVGIVILIVTGIKGRKKKNETKREE-------NPYD--SMDIGKGESNAGFQNSDDAQT XP_0053160 IVVGIVILIFTGIRDRRRKNQTKREE-------NPYAESSMEMGKGENNPGYQNNDDVQT AAY57872.1 IVAGIVVLIFTGIRDRKKKNQARSEE-------NPYA--SIDISKGENNPGFQNTDDVQT NP_0013583 IVVGIVILIFTGIRDRKKKNKARSGE-------NPYA--SIDISKGENNPGFQNTDDVQT NP_0011246 IVVGIVVLIFTGIRDRKKKNKARNEE-------NPYA--SIDISKGENNPGFQNTDDVQT :* *:. * :* ..
XP_0061228 SF XP_416822. SF ETE61880.1 -- AGZ48803.1 SF NP_0011165 SF XP_0313017 SF QLH93383.1 SF BAE53380.1 SF CCP86723.1 -- XP_0258425 SF AAX63775.1 SF AAX59005.1 SF NP_0011239 SF AAW78017.1 SF XP_0053160 SF AAY57872.1 SF NP_0013583 SF NP_0011246 SF
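One way to quantify which sequence is "most similar" to the human ACE2 in an alignment like the one above is percent identity against the human row. The sketch below is an illustration, not the method used by Phylogeny.fr. The names `percent_identity` and `rank_by_identity` and the toy rows are hypothetical, and skipping columns where either row has a gap is one convention among several.

```python
def percent_identity(ref, other):
    """Percent of ungapped aligned column pairs with identical residues."""
    if len(ref) != len(other):
        raise ValueError("aligned rows must have equal length")
    # Assumption: columns with a gap in either row are left out.
    pairs = [(a, b) for a, b in zip(ref, other) if a != "-" and b != "-"]
    if not pairs:
        return 0.0
    matches = sum(a == b for a, b in pairs)
    return 100.0 * matches / len(pairs)


def rank_by_identity(alignment, reference_id):
    """Sort all other rows by identity to the reference, highest first."""
    ref = alignment[reference_id]
    scores = {
        name: percent_identity(ref, row)
        for name, row in alignment.items()
        if name != reference_id
    }
    return sorted(scores.items(), key=lambda item: item[1], reverse=True)


# Toy aligned rows (placeholders, not taken from the class alignment).
toy_alignment = {
    "human": "MSSSW-LL",
    "speciesA": "MSGSWALL",
    "speciesB": "MLLHF-WI",
}
toy_ranking = rank_by_identity(toy_alignment, "human")
```

With real data, the dictionary would map each accession to its full aligned row, built by joining that accession's fragments from every alignment block.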
New alignment:
AGZ48803.1 ----------MSGSS---------------WLLL--SLVAVTTAQSTTEDEAKMFLDKFN NP_0011165 ----------MSGSF---------------WLLL--SLIPVTAAQSTTEELAKTFLEKFN XP_0313017 ----------MSGSF---------------WLLL--SLVAVTAAQSTTEELAKTFLEEFN QLH93383.1 ----------MSGSS---------------WLLL--SLVAVTAAQSTSDEEAKTFLEKFN BAE53380.1 ----------MLGSS---------------WLLL--SLAALTAAQSTTEDLAKTFLEKFN XP_0258425 ----------MSGSS---------------WLLL--SLAALTAAQST-EDLVNTFLEKFN AAX63775.1 ----------MSGSF---------------WLLL--SFAALTAAQSTTEELAKTFLETFN AAX59005.1 ----------MSGSF---------------WLLL--SFAALTAAQSTTEELAKTFLEKFN NP_0011239 ----------MSSSS---------------WLLL--SLVAVTTAQSLTEENAKTFLNNFN AAW78017.1 ----------MSSSC---------------WLLL--SLVAVATAQSLIEEKAESFLNKFN XP_0053160 MGSCPGARGKMLGSS---------------WLLL--SFVAVTAAQSTIEELAKTFLDKFN AAY57872.1 ----------MSSSS---------------WLLL--SLVAVTAAQSTIEEQAKTFLDKFN NP_0013583 ----------MSSSS---------------WLLL--SLVAVTAAQSTIEEQAKTFLDKFN NP_0011246 ----------MSGSS---------------WLLL--SLVAVTAAQSTIEEQAKTFLDKFN ETE61880.1 MLMKQAPVRKPSSRSFTHPAFFDLKGNMLTWLCLTWSLVVLALAQDETKVATK-FLEQFD XP_0061228 ----------MLSHL---------------WILC--SLTVVVKSQDITQEAIN-FLSEFN XP_416822. ----------MLLHF---------------WLLC--GLSAVVTPQDVTQE-AQTFLAEFN *: .: :. .*. . : ** *:
AGZ48803.1 TKAEDLSHQSSLASWDYNTNINDENVQKMDEAGAKWSAFYEEQSKLAKNYSLEQIQNVTV NP_0011165 LEAEDLAYQSSLASWTINTNITDENIQKMNDARAKWSAFYEEQSRIAKTYPLDEIQTLIL XP_0313017 HEAEDLSYQSSLASWNYNTNITDENVQKMNDARAKWSTFYEEKSKTAKTYPLEEIQNVTL QLH93383.1 SEAEELSYQSSLASWNYNTNITDENVQKMNVAGAKWSTFYEEQSKIAKNYQLQNIQNDTI BAE53380.1 YEAEELSYQNSLASWNYNTNITDENIQKMNIAGAKWSAFYEEESQHAKTYPLEEIQDPII XP_0258425 YEAEELSYQSSLASWDYNTNISDENVQKMNNAGAKWSAFYEEQSKLAKTYPLEEIQDSTV AAX63775.1 YEAQELSYQSSVASWNYNTNITDENAKNMNEAGAKWSAYYEEQSKLAQTYPLAEIQDAKI AAX59005.1 HEAEELSYQSSLASWNYNTNITDENVQKMNEAGAKWSAFYEEQSKLAKTYPLAEIHNTTV NP_0011239 QEAEDLSYQSSLASWNYNTNITEENAQKMSEAAAKWSAFYEEQSKTAQSFSLQEIQTPII AAW78017.1 QEAEDLSYQSSLASWNYNTNITEENAQKMNEAAAKWSAFYEEQSKIAQNFSLQEIQNATI XP_0053160 QEAEDLDYQRSLAAWNYNTNITEENTQKMNEAEAKWSAFYEEQSKLATAYPLQEIQNFTL AAY57872.1 HEAEDLFYQSSLASWNYNTNITEENVQNMNNAGEKWSAFLKEQSTLAQMYPLQAIQNLTV NP_0013583 HEAEDLFYQSSLASWNYNTNITEENVQNMNNAGDKWSAFLKEQSTLAQMYPLQEIQNLTV NP_0011246 HEAEDLFYQSSLASWNYNTNITEENVQNMNNAGDKWSAFLKEQSTLAQMYPLQEIQNLTV ETE61880.1 ARATDLYYNASIASWNYNTNLTEENAKIMHEKDNIFSKFYGEACRNASMFNVNHITDETI XP_0061228 VQAEDLSYASSLASWNYNTNITDENAKKMNEAGAKWSVFYDEASTNASKYAIDKITNHTV XP_416822. VRAEDISYENSLASWNYNTNITEETARKMSEAGAKWAAFYEEASRNASRFSLANIQDAVT * :: : *:*:* ***:.:*. . * :: : * . * : : *
AGZ48803.1 KLQLQILQQSGSPVLSEDKSKRLNSILNAMSTIYSTGKVCKPNKPQECLLLEPGLDNIMG NP_0011165 KRQLQALQQSGTSGLSADKSKRLNTILNTMSTIYSSGKVLDPNNPQECLVLEPGLDEIME XP_0313017 KRQLQALQQSGASALSADKSKRLTTVLSTMSTIYSSGEVCDPNNPQECLVLEPGLDDIME QLH93383.1 KRQLQALQLSGSSALSADKNQRLNTILNTMSTIYSTGKVCNPGNPQECSLLEPGLDNIME BAE53380.1 KRQLRALQQSGSSVLSADKRERLNTILNAMSTIYSTGKACNPNNPQECLLLEPGLDDIME XP_0258425 KRQLRALQHSGSSVLSADKNQRLNTILNSMSTIYSTGKACNPSNPQECLLLEPGLDDIME AAX63775.1 KRQLQALQQSGSSVLSADKSQRLNTILNAMSTIYSTGKACNPNNPQECLLLEPGLDNIME AAX59005.1 KRQLQALQQSGSSVLSADKSQRLNTILNAMSTIYSTGKACNPNNPQECLLLEPGLDDIME NP_0011239 KRQLQALQQSGSSALSADKNKQLNTILNTMSTIYSTGKVCNPKNPQECLLLEPGLDEIMA AAW78017.1 KRQLKALQQSGSSALSPDKNKQLNTILNTMSTIYSTGKVCNSMNPQECFLLEPGLDEIMA XP_0053160 KRQLQALQQSGSSALSANKREQLNTILNTMSTIYSTGKVCNPKKPQECLLLEPGLDEIMA AAY57872.1 KLQLQALQQNGSSVLSEDKSKRLNTILNTMSTIHSTGKVCNPNNPQECLLLDPGLNEIME NP_0013583 KLQLQALQQNGSSVLSEDKSKRLNTILNTMSTIYSTGKVCNPDNPQECLLLEPGLNEIMA NP_0011246 KLQLQALQQNGSSVLSEDKSKRLNTILNTMSTIYSTGKVCNPNNPQECLLLEPGLNEIMA ETE61880.1 KLQIRLLQ-SGSTDSTKD---QLDTVLHKMSTLYS-------------------LDDIMA XP_0061228 KLQLQSLQGKGTSVLSGEKYNELNKILSTMSTFYSTGTVCKPDNPDICLPLEPGLDAIMA XP_416822. RLQIQSLQDRGSSVLSPEKYSRLNSVMNSMSTIYSTGVVCKATEPFDCLVLEPGLDDIMA . *:. ** *:. : : * .:: ***::* *: **
AGZ48803.1 TSKDYNERLWAWEGWRAEVGKQLRPLYEEYVVLKNEMARGYHYEDYGDYWRRDYETEESP NP_0011165 NSKDYSRRLWAWESWRAEVGKQLRPLYEEYVVLENEMARANNYEDYGDYWRGDYEVTGTG XP_0313017 NSKDYNQRLWAWEGWRAEVGKQLRPLYEEYVVLKNEMARANNYEDYGDYWRGDYEVMWAG QLH93383.1 SSKDYNERLWAWEGWRSEVGKQLRPLYEEYVVLKNEMARANHYEDYGDYWRGDYETEGAN BAE53380.1 NSKDYNERLWAWEGWRSEVGKQLRPLYEEYVALKNEMARANNYEDYGDYWRGDYEEEWAD XP_0258425 NSKDYNERLWAWEGWRSEVGKQLRPLYEEYVALKNEMARANNYEDYGDYWRGDYEEEWEN AAX63775.1 NSKDYNERLWAWEGWRAEVGKQLRPLYEEYVALKNEMARANNYEDYGDYWRGDYEEEWTG AAX59005.1 NSKDYNERLWAWEGWRAEVGKQLRPLYEEYVALKNEMARANNYEDYGDYWRGDYEEEWTD NP_0011239 TSTDYNSRLWAWEGWRAEVGKQLRPLYEEYVVLKNEMARANNYNDYGDYWRGDYEAEGAD AAW78017.1 TSTDYNRRLWAWEGWRAEVGKQLRPLYEEYVVLKNEMARANNYEDYGDYWRGDYEAEGVE XP_0053160 NSTDYNERLWVWEGWRSEVGKQLRPLYEEYVVLKNEMARANNYEDYGDYWRGDYEAEGAD AAY57872.1 KSLDYNERLWAWEGWRSEVGKQLRPLYEEYVVLKNEMARANHYKDYGDYWRGDYEVNGVD NP_0013583 NSLDYNERLWAWESWRSEVGKQLRPLYEEYVVLKNEMARANHYEDYGDYWRGDYEVNGVD NP_0011246 NSLDYNERLWAWESWRSEVGKQLRPLYEEYVVLKNEMARANHYEDYGDYWRGDYEVNGVD ETE61880.1 NNWNYPERLWAWEGWRANVGKKMRPLYETYVELKNKYARLRGYADYGDYWRANYEVDLPG XP_0061228 SSTDYFERLWAWEGWRADVGKKMRELYERYVELENEAARLNKYSDYGDYWRGNYEVNDPT XP_416822. NSIDYHERLWAWEGWRADVGRMMRPLYEEYVELKNEAARLNNYSDYGDYWRANYETDYPE .. :* ***.**.**::**. :* *** ** *:*: ** * ******* :**
AGZ48803.1 GPGYSRDQLMKDVERIFTEIKPLYEHLHAYVRAKLMDTY-PFHISPTGCLPAHLLGDMWG NP_0011165 DYDYSRNQLMEDVERTFAEIKPLYEHLHAYVRAKLMDAY-PSRISPTGCLPAHLLGDMWG XP_0313017 DYDYSRDQLMGDVEHTFAEIKPLYEHLHAYVRAKLMDVY-PSHISPTGCLPAHLLGDMWG QLH93383.1 GYNYSRDHLIEDVEHIFTQIKPLYEHLHAYVRAKLMDNY-PSHISPTGCLPAHLLGDMWG BAE53380.1 GYSYSRNQLIEDVEHTFTQIKPLYEHLHAYVRAKLMDAY-PSRISPTGCLPAHLLGDMWG XP_0258425 GYNYSRNQLIDDVEHTFTQIMPLYQHLHAYVRTKLMDTY-PSYISPTGCLPAHLLGDMWG AAX63775.1 GYNYSRNQLIQDVEDTFEQIKPLYQHLHAYVRAKLMDTY-PSRISRTGCLPAHLLGDMWG AAX59005.1 GYNYSRSQLIKDVEHTFTQIKPLYQHLHAYVRAKLMDTY-PSRISPTGCLPAHLLGDMWG NP_0011239 GYNYNRNQLIEDVERTFAEIKPLYEHLHAYVRRKLMDTY-PSYISPTGCLPAHLLGDMWG AAW78017.1 GYNYNRNQLIEDVENTFKEIKPLYEQLHAYVRTKLMEVY-PSYISPTGCLPAHLLGDMWG XP_0053160 GYGYNRNQLIEDVERTFAEIKPLYEHLHAYVRAKLMNTY-PSYISPTGCLPAHLLGDMWG AAY57872.1 GYDYNRDQLIEDVERTFEEIKPLYEHLHAYVRAKLMNAY-PSYISPTGCLPAHLLGDMWG NP_0013583 GYDYSRGQLIEDVEHTFEEIKPLYEHLHAYVRAKLMNAY-PSYISPIGCLPAHLLGDMWG NP_0011246 SYDYSRGQLIEDVEHTFEEIKPLYEHLHAYVRAKLINAY-PSYISPIGCLPAHLLGDMWG ETE61880.1 KFQYQREQLITDVESTFKQ------QLHAYVRHHLYKRYGPELINPEGAIPAHLLGDMWG XP_0061228 EYAYSRNQLMEDVEATFEQIKPLYRELHAYVRYRLEKFYGSDHISSTGCLPAHLLGDMWG XP_416822. EYKYSRDQLVQDVEKTFEQIKPLYQHLHAYVRHRLEQVYGSELINPTGCLPAHLLGDMWG *.* :*: *** * : ****** .* . * . *. *.:**********
AGZ48803.1 RFWTNLYPLTVPFGQKPNIDVTDEMLKQGWDADRIFKEAEKFFVSVGLPNMTEGFWNNSM NP_0011165 RFWTNLYPLTVPFGEKPSIDVTEAMVNQSWDAIRIFEEAEKFFVSIGLPNMTQGFWNNSM XP_0313017 RFWTNLYSLTVPFGQKPNIDVTEAMENQSWDAKRIFKEAEKFFVSIGLPNMTQGFWDNSM QLH93383.1 RFWTNLYPLTVPFRQKPNIDVTDAMVNQTWDANRIFKEAEKFFVSVGLPKMTQTFWENSM BAE53380.1 RFWTNLYPLMVPFRQKPNIDVTDAMVNQSWDARRIFEEAETFFVSVGLPNMTEGFWQNSM XP_0258425 RFWTNLYPLTVPFGQKPNIDVTNAMVNQSWDARKIFKEAEKFFVSVGLPNMTQGFWENSM AAX63775.1 RFWTNLYPLTVPFGQKPNIDVTDAMVNQNWDARRIFKEAEKFFVSVGLPNMTQGFWENSM AAX59005.1 RFWTNLYPLTVPFGQKPNIDVTDAMVNQSWDARRIFKEAEKFFVSVGLPNMTQGFWENSM NP_0011239 RFWTNLYPLTVPFAQKPNIDVTDAMMNQGWDAERIFQEAEKFFVSVGLPHMTQGFWANSM AAW78017.1 RFWTNLYPLTTPFLQKPNIDVTDAMVNQSWDAERIFKEAEKFFVSVGLPQMTPGFWTNSM XP_0053160 RFWTNLYSLTVPFPEKPNIDVTDAMINQNWNAVRIFKEAEKFFVSVGLPNMTQGFWENSM AAY57872.1 RFWTNLYSLTVPFGQKPNIDVTDAMVNQAWNAQRIFKEAEKFFVSVGLPNMTQGFWENSM NP_0013583 RFWTNLYSLTVPFGQKPNIDVTDAMVDQAWDAQRIFKEAEKFFVSVGLPNMTQGFWENSM NP_0011246 RFWTNLYSLTVPFGQKPNIDVTDAMVDQAWDAQRIFKEAEKFFVSVGLPNMTQRFWENSM ETE61880.1 RFWTNLYPLMVPYPNKTSIDVTSAMEKKKWTVNSIFKAAEHFFISIGLFNMTVGFWKNSM XP_0061228 RFWTNLYALTVPYPDKPNIDVTSEMVKKNWNATKIFKAAEDFFMSVGLYKMTEGFWKNSM XP_416822. RFWTNLYNLTVPYPEKPNIDVTSAMAQKNWDAMKIFKTAEAFFASIGLYNMTEGFWTNSM ******* * .*: :*..****. * .: * . **: ** ** *:** :** ** ***
AGZ48803.1 LTEPGDGRKVVCHPTAWDLGKGDFRIKMCTKVTMEDFLTAHHEMGHIQYDMAYASQPYLL NP_0011165 LTEPGDGRKVVCHPTAWDLGKGDFRIKMCTKVTMDDFLTAHHEMGHIQYDMAYAIQPYLL XP_0313017 LTEPGDGRKVVCHPTAWDLGKGDFRIKMCTKVTMDDFLTAHHEMGHIQYDMAYAIQPFLL QLH93383.1 LTEPGDGRKVVCHPTAWDLGKHDFRIKMCTKVTMDDFLTAHHEMGHIQYDMAYAMQPYLL BAE53380.1 LTEPGDNRKVVCHPTAWDLGKRDFRIKMCTKVTMDDFLTAHHEMGHIQYDMAYAEQPFLL XP_0258425 LTEPSDSRKVVCHPTAWDLGKGDFRIKMCTKVTMDDFLTAHHEMGHIQYDMAYAAQPFLL AAX63775.1 LTEPGDGRKVVCHPTAWDLGKGDFRIKMCTKVTMDDFLTAHHEMGHIQYDMAYAAQPFLL AAX59005.1 LTEPGDSRKVVCHPTAWDLGKGDFRIKMCTKVTMDDFLTAHHEMGHIQYDMAYAVQPFLL NP_0011239 LTEPADGRKVVCHPTAWDLGHGDFRIKMCTKVTMDNFLTAHHEMGHIQYDMAYARQPFLL AAW78017.1 LTEPGDDRKVVCHPTAWDLGHGDFRIKMCTKVTMDNFLTAHHEMGHIQYDMAYAKQPFLL XP_0053160 LTEPTDGRKVVCHPTAWDLQKGDFRIKMCTKVTMDNFLTAHHEMGHIQYDMAYAMQPYLL AAY57872.1 LTDPGNVQKVVCHPTAWDLGKGDFRIIMCTKVTMDDFLTAHHEMGHIQYDMAYAAQPFLL NP_0013583 LTDPGNVQKAVCHPTAWDLGKGDFRILMCTKVTMDDFLTAHHEMGHIQYDMAYAAQPFLL NP_0011246 LTDPGNVQKVVCHPTAWDLGKGDFRILMCTKVTMDDFLTAHHEMGHIQYDMAYAAQPFLL ETE61880.1 LEEPKGGRKVVCHPTAWDMGKEDYRIKMCTKINMEDFLTAHHEMGHIEYDMAYANQPFLL XP_0061228 ITEPNDGRKVVCHPTAWDMGKKDYRIKMCTKVSMDDFLTVHHEMGHIEYDMAYSNLSYLL XP_416822. LTEPTDNRKVVCHPTAWDMGKNDYRIKMCTKVTMDDFLTAHHEMGHIEYDMAYSVQPFLL : :* . .*.********: : *:** ****:.*::***.*******:*****: .:**
AGZ48803.1 RNGANEGFHEAVGEVMSLSVATPKHLKTMGLLSPDFREDNETEINFLLKQALNIVGTLPF NP_0011165 RNGANEGFHEAVGEIMSLSAATPHYLKALGLLPPDFYEDSETEINFLLKQALTIVGTLPF XP_0313017 RNGANEGFHEAVGEIMSLSAATPHYLKALGLLPADFYEDSETEINFLLKQALTIVGTLPF QLH93383.1 RNGANEGFHEAVGEIMSLSAATPKHLKNIGLLPPDFYEDNETEINFLLKQALTIVGTLPF BAE53380.1 RNGANEGFHEAVGEIMSLSAATPNHLKNIGLLPPDFSEDSETDINFLLKQALTIVGTLPF XP_0258425 RNGANEGFHEAVGEIMSLSAATPNHLKNIGLLPPSFFEDSETEINFLLKQALTIVGTLPF AAX63775.1 RNGANEGFHEAVGEIMSLSAATPNHLKTIGLLSPAFSEDNETEINFLLKQALTIVGTLPF AAX59005.1 RNGANEGFHEAVGEIMSLSAATPNHLKTIGLLSPGFSEDSETEINFLLKQALTIVGTLPF NP_0011239 RNGANEGFHEAVGEIMSLSAATPKHLKSIGLLPSDFQEDSETEINFLLKQALTIVGTLPF AAW78017.1 RNGANEGFHEAVGEIMSLSAATPKHLKSIGLLPSNFQEDNETEINFLLKQALTIVGTLPF XP_0053160 RNGANEGFHEAVGEIMSLSASTPKHLKSIGLLPSDFREDSETEINFLLKQALTIVGTLPF AAY57872.1 RNGANEGFHEAVGEIMSLSAATPKHLKSIGLLSPDFQEDNETEINFLLKQALTIVGTLPF NP_0013583 RNGANEGFHEAVGEIMSLSAATPKHLKSIGLLSPDFQEDNETEINFLLKQALTIVGTLPF NP_0011246 RNGANEGFHEAVGEIMSLSAATPKHLKSIGLLSPDFQEDNETEINFLLKQALTIVGTLPF ETE61880.1 RNGANEGFHEAVGEIMSLSAATPKYLKSLGLLEPTFQEDAETDINFLLKQALTIVGTMPF XP_0061228 RSGANEGFHEAVGEIMSLSAATPKHLKSLDLLEPTFQEDNETDINFLLKQALTIVGTMPF XP_416822. RNGANEGFHEAVGEIMSLSAATPQHLKSLDLLEPTFQEDEETEINFLLKQALTIVGTMPF *.************:****.:**::** :.** . * ** **:*********.****:**
AGZ48803.1 TYMLEKWRWMVFKGEIPKEEWMKKWWEMKRKIVGVVEPVPHDETYCDPASLFHVANDYSF NP_0011165 TYMLEKWRWMVFKGEIPKEQWMQKWWEMKREIVGVVEPLPHDETYCDPACLFHVAEDYSF XP_0313017 TYMLEKWRWMVFKGEIPKEQWMQKWWEMKREIVGVVEPLPHDETYCDPACLFHVAEDYSF QLH93383.1 TYMLEKWRWMVFSGQIPKEQWMKKWWEMKREIVGVVEPVPHDETYCDPASLFHVANDYSF BAE53380.1 TYMLEKWRWMVFKGEIPKEQWMQKWWEMKRDIVGVVEPLPHDETYCDPAALFHVANDYSF XP_0258425 TYMLEKWRWMVFKGEIPKDQWMKTWWEMKRNIVGVVEPVPHDETYCDPASLFHVANDYSF AAX63775.1 TYMLEKWRWMVFKGAIPKEQWMQKWWEMKRNIVGVVEPVPHDETYCDPASLFHVANDYSF AAX59005.1 TYMLEKWRWMVFKGEIPKEQWMQKWWEMKREIVGVVEPVPHDETYCDPASLFHVANDYSF NP_0011239 TYMLEKWRWMVFRGEIPKEQWMKKWWEMKREIVGVVEPLPHDETYCDPASLFHVSNDYSF AAW78017.1 TYMLEKWRWMVFQDKIPREQWTKKWWEMKREIVGVVEPLPHDETYCDPASLFHVSNDYSF XP_0053160 TYMLEKWRWMVFKGEIPKDQWMKKWWEMKREIVGVMEPVPHDETYCDPAALYHVSNDFSF AAY57872.1 TYMLEKWRWMVFKGEIPKDQWMKKWWEMKREIVGVVEPVPHDETYCDPASLFHVSNDYSF NP_0013583 TYMLEKWRWMVFKGEIPKDQWMKKWWEMKREIVGVVEPVPHDETYCDPASLFHVSNDYSF NP_0011246 TYMLEKWRWMVFKGEIPKDQWMKKWWEMKREIVGVVEPVPHDETYCDPASLFHVSNDYSF ETE61880.1 TYMLEKWRWMVFAEQIPKDQWMKKWWEMKREIVGVVEPLPHNEEYCDPAALFHVANDYSF XP_0061228 TYMLEKWRWMVFKGDIPKDEWMKKWWEMKRAIVGVVEPVPHDETYCDPAALFHVANDYSF XP_416822. TYMLEKWRWMVFNGEITKQEWTKRWWKMKREIVGVVEPVPHDETYCDPAALFHVANDYSF ************ *..::* : **:*** ****:**:**:* *****.*:**::*:**
AGZ48803.1 IRYYTRTIFEFQFHEALCRIAQHDGPLHKCDISNSTDAGKKLHQMLSVGKSQAWTKTLED NP_0011165 IRYYTRTIYQFQFHEALCRTAKHEGPLYKCDISNSTEAGQKLLQMLSLGKSEPWTLALEN XP_0313017 IRYYTRTIYQFQFHEALCQIAKHEGPLYKCDISNSTEAGQKLLQMLSLGKSEPWTLALEG QLH93383.1 IRYYTRTIYQFQFQEALCQTAKHEGPLHKCDISNSTEAGQKLLQMLSLGKSKPWTLALER BAE53380.1 IRYYTRTIYQFQFQEALCQIAKHEGPLYKCDISNSSEAGQKLHEMLSLGRSKPWTFALER XP_0258425 IRYYTRTIYQFQFQEALCQIAKHEGPLHKCDISNSSEAGQKLLEMLKLGKSKPWTYALEI AAX63775.1 IRYYTRTIYQFQFQEALCQIAKHEGPLHKCDISNSTEAGKKLLEMLSLGRSEPWTLALER AAX59005.1 IRYYTRTIYQFQFQEALCRIAKHEGPLHKCDISNSSEAGKKLLQMLTLGKSKPWTLALEH NP_0011239 IRYYTRTIYQFQFQEALCQAAKYNGSLHKCDISNSTEAGQKLLKMLSLGNSEPWTKALEN AAW78017.1 IRYYTRTIYQFQFQEALCQAAKHDGPLHKCDISNSTEAGQKLLNMLSLGNSGPWTLALEN XP_0053160 IRYYTRTIYQFQFQEALCQAAKHEGPLHKCDISNSTEAGQKLLNMLRLGKSKPWTLALEN AAY57872.1 IRYYTRTLYQFQFQEALCQAAKHEGPLHKCDISNSTEAGQKLLNMLKLGKSEPWTLALEN NP_0013583 IRYYTRTLYQFQFQEALCQAAKHEGPLHKCDISNSTEAGQKLFNMLRLGKSEPWTLALEN NP_0011246 IRYYTRTLYQFQFQEALCQAAKHEGPLHKCDISNSTEAGQKLLNMLRLGKSEPWTLALEN ETE61880.1 IRYYTRTIYQFQFQEALCKAAGHTKELYKCDISDSTNAGRILKDMLALGSSQPWTKALES XP_0061228 IRYYTRTIYQFQFQEALCKAANHGGLLHTCDITNSMAAGQKLRDMLALGRSQPWTKALES XP_416822. IRYYTRTIYQFQFQEALCKAANHTGPLHKCDITNSTAAGGNLRQLLELGKSKPWTQALES *******:::***:****. * : *:.***::* ** * .:* :* * .** :**
AGZ48803.1 IVDSRNMDVGPLLKYFEPLYTWLQEQNRKSYVGWNTDWSPYSDQSIKVRISLKSALGENA NP_0011165 IVGVKTMDVKPLLSYFEPLLTWLKAQNGNSSVGWNTDWTPYADQSIKVRISLKSALGEDA XP_0313017 LVGVKTMDVKPLLNYFEPLLTWLKDQNRNSFVGWSTDWTPYTDQSIKVRISLKSALGDKA QLH93383.1 VVGTKNMDVRPLLNYFEPLLTWLKEQNKNSFVGWNTDWSPYAAQSIKVRISLKSALGEKA BAE53380.1 VVGAKTMDVRPLLNYFEPLFTWLKEQNRNSFVGWNTDWSPYADQSIKVRISLKSALGEKA XP_0258425 VVGAKNMDVRPLLNYFEPLFTWLKEQNRNSFVGWNTDWSPYADQSIKVRISLKSALGEKA AAX63775.1 VVGAKNMNVTPLLNYFEPLFTWLKEQNRNSFVGWDTDWRPYSDQSIKVRISLKSALGEKA AAX59005.1 VVGEKKMNVTPLLKYFEPLFTWLKEQNRNSFVGWNTDWRPYADQSIKVRISLKSALGDEA NP_0011239 VVGARNMDVKPLLNYFQPLFDWLKEQNRNSFVGWNTEWSPYADQSIKVRISLKSALGANA AAW78017.1 VVGSRNMDVKPLLNYFQPLFVWLKEQNRNSTVGWSTDWSPYADQSIKVRISLKSALGKNA XP_0053160 VVGARNMDVRPLLNYFEPLFGWLKDQNRNSFVGWNTNWSPYTDQSIKVRISLKSALGEEA AAY57872.1 VVGAKNMSVRPLLNYFEPLFTWLKDQNKNSFVGWSTDWSPYADQSIKVRISLKSALGANA NP_0013583 VVGAKNMNVRPLLNYFEPLFTWLKDQNKNSFVGWSTDWSPYADQSIKVRISLKSALGDKA NP_0011246 VVGAKNMNVRPLLDYFEPLFTWLKDQNKNSFVGWSTDWSPYADQSIKVRISLKSALGNKA ETE61880.1 ITGSLKMDAKPFCQYFDPLLKWLEKTNSNENVGWNVNWTPYSKDAIKVRISLKAALGDDA XP_0061228 ITGEKKMNATPLLHYFEPLYQWLIKNNSGRAVGWNTFWSPYSGNAIKVRISLKTALGDNA XP_416822. ATGEKYMNATPLLHYFEPLFNWLQKNNSGRSIGWNTDWTPYSDNAIKVRISLKAALGDDA .. *.. *: **:** ** * :**.. * **: ::********:*** .*
AGZ48803.1 YEWNDNEMYLFRSSVAYAMREYFLKEKHQTILFGAENVWVSNLKPRISFNFHVTSPGNLS NP_0011165 YEWNDNEMYLFRSSIAYAMRNYFSSAKNETIPFGAVDVWVSDLKPRISFNFFVTSPANMS XP_0313017 YEWNDNEMYLFQSSLAYAMRKYFLKVQNQTILFGVEDVWVSDLKPRISFSFFVTSPKNVS QLH93383.1 YEWNDSEMYLFRSSVAYAMREYFSKFKKQTIPFEEESVRVSDLKPRVSFIFFVTLPKNVS BAE53380.1 YEWNDNEMYFFQSSIAYAMREYFSKVKNQTIPFVGKDVRVSDLKPRISFNFIVTSPENMS XP_0258425 YEWNNNEMYLFRSSIAYAMRRYFSEVKKQTIPFVEDNVWVSDLKPRISFNFFVTSPGNVS AAX63775.1 YEWNDNEMYLFRSSIAYAMREYFSKVKNQTIPFVEDNVWVSDLKPRISFNFFVTFSNNVS AAX59005.1 YEWNDNEMYLFRSSVAYAMREYFSKVKNQTIPFVEDNVWVSNLKPRISFNFFVTASKNVS NP_0011239 YEWTNNEMFLFRSSVAYAMRKYFSIIKNQTVPFLEEDVRVSDLKPRVSFYFFVTSPQNVS AAW78017.1 YEWTDNEMYLFRSSVAYAMREYFSREKNQTVPFGEADVWVSDLKPRVSFNFFVTSPKNVS XP_0053160 YQWNDNEMYLFRSSVAYAMRMYFSKVKNQTIPFGEKDVWVSDEKPRISFNFFVTAPQNVS AAY57872.1 YKWNDNEMYLFRSSVAYAMRQYFLENKHQTILFGEEDVRVADLKPRISFNFYVTAPKNVS NP_0013583 YEWNDNEMYLFRSSVAYAMRQYFLKVKNQMILFGEEDVRVANLKPRISFNFFVTAPKNVS NP_0011246 YEWNDNEIYLFRSSVAYAMRKYFLEVKNQMILFGEEDVRVANLKPRISFNFFVTAPKNVS ETE61880.1 YNWDESEMFLFKSTIAYAMQKYFLEVKNKTVPF--------------------------- XP_0061228 YEWDENELYFFKSSIAYAMRKYFLEVKNQTVSFQCTDIHVWAVTQRVSFYFAVSMPGNAT XP_416822. YVWDASELFLFKSSIAYAMRKYFAKEKEQNVDFQVTDIHVGEETQRVSFYLTVSMPGNVS * * .*:::*.*::****. ** : : : *
AGZ48803.1 DIIPRPEVEGAIRMSRSRINDAFRLDDNSLEFLGIQPTLGPPYQPPVTIWLIVFGVVMAV NP_0011165 DIIPRSDVEKAISMSRSRINDAFRLDDNTLEFLGIQPTLGPPDEPPVTVWLIIFGVVMGL XP_0313017 DIIPRTEVEEAIRMSRSRINDAFRLDDNSLEFLGIQPTLGPPYEPPVTVWLIIFGIVMGL QLH93383.1 AVIPRAEVEEAIRMSRSRINDVFRLDDNSLEFLGIQPTLEPPYQPPVTIWLIVFGVVMGV BAE53380.1 DIIPRADVEEAIRKSRGRINDAFRLDDNSLEFLGIQPTLEPPYQPPVTIWLIVFGVVMGV XP_0258425 DIIPRTEVEKAIRMYRGRINDVFRLDDNSLEFLGIQPTLGPSYEPPVTIWLIVFGVVMGV AAX63775.1 DVIPRSEVEDAIRMSRSRINDAFRLDDNSLEFLGIEPTLSPPYRPPVTIWLIVFGVVMGA AAX59005.1 DVIPRSEVEEAIRMSRSRINDAFRLDDNSLEFLGIQPTLSPPYQPPVTIWLIVFGVVMGV NP_0011239 DVIPRSEVEDAIRMSRGRINDVFGLNDNSLEFLGIHPTLEPPYQPPVTIWLIIFGVVMAL AAW78017.1 DIIPRSEVEEAIRMSRGRINDIFGLNDNSLEFLGIYPTLKPPYEPPVTIWLIIFGVVMGT XP_0053160 DIIPRTDVEKAIRMSRGRINGVFRLDDNSLEFLGIQPTLGPPYQPPVTIWLIVFGVVMGL AAY57872.1 DIIPRTEVEEAIRFSRSRINDAFQLNDNSLEFLGIQSTLVPPYQSPITTWLIVFGVVMAV NP_0013583 DIIPRTEVEKAIRMSRSRINDAFRLNDNSLEFLGIQPTLGPPNQPPVSIWLIVFGVVMGV NP_0011246 DIIPRTEVEKAIRMSRSRINDAFRLNDNSLEFLGIQPTLGPPNQPPVSIWLIVFGVVMGV ETE61880.1 ------------HLSRDRINEAFKLTDQTLEFIGLLPTLAPPYESPITVWLVAFGVVIGL XP_0061228 DFIPKSEVETAIRMSRGRINEAFRLDDNTLEFEGLLPTLASPYEPPVTVWLILFGVVMGV XP_416822. DIVPRADVEKAIRMSRGRISEAFRLDDNTLEFDGIVPTLATPYKPPVTIWLILFGVVMSL *.**. * * *::*** *: .** .. .*:: **: **:*:.
AGZ48803.1 VVVGIVVLIITGIRDRRKTDQARSEE-------NPYS--SVDLSKGENNPGFQNGDDVQT NP_0011165 VVVGIVVLIFTGIRDRRKKKQASSEE-------NPYG--SMDLSKGESNSGFQNGDDIQT XP_0313017 VVVGIVVLIFTGIRDRRKKKQASTEE-------NPYG--SVDLSKGENNSGFQNGDDVQT QLH93383.1 IVVGIVVLIFTGIRDRKKKNQARSEQ-------NPYA--SVDLSKGENNPGFQNVDDVQT BAE53380.1 VVVGIFLLIFSGIRNRRKNNQARSEE-------NPYA--SVDLSKGENNPGFQNVDDVQT XP_0258425 VVVGIVLLIFSGIRNRRKNDQARGEE-------NPYA--SVDLSKGENNPGFQNVDDAQT AAX63775.1 IVVGIVLLIVSGIRNRRKNDQAGSEE-------NPYA--SVDLNKGENNPGFQHADDVQT AAX59005.1 VVVGIVLLIVSGIRNRRKNNQARSEE-------NPYA--SVDLSKGENNPGFQHADDVQT NP_0011239 VVVGIIILIVTGIKGRKKKNETKREE-------NPYD--SMDIGKGESNAGFQNSDDAQT AAW78017.1 VVVGIVILIVTGIKGRKKKNETKREE-------NPYD--SMDIGKGESNAGFQNSDDAQT XP_0053160 IVVGIVILIFTGIRDRRRKNQTKREE-------NPYAESSMEMGKGENNPGYQNNDDVQT AAY57872.1 IVAGIVVLIFTGIRDRKKKNQARSEE-------NPYA--SIDISKGENNPGFQNTDDVQT NP_0013583 IVVGIVILIFTGIRDRKKKNKARSGE-------NPYA--SIDISKGENNPGFQNTDDVQT NP_0011246 IVVGIVVLIFTGIRDRKKKNKARNEE-------NPYA--SIDISKGENNPGFQNTDDVQT ETE61880.1 IVIGIITLEKAGSKN--------------------------------------------- XP_0061228 IVVGVIVLIVTGQRDRRKRMKAGTNELVQTNAIDP------ELENGEVNPAFIKHEERQT XP_416822. IVIGVIVLIITGQRDKRKKARGRANEAGSNCEVNPYD------EDGRSNKGFEQSEETQT :* *:. * :* ..
AGZ48803.1 SF NP_0011165 SF XP_0313017 SF QLH93383.1 SF BAE53380.1 SF XP_0258425 SF AAX63775.1 SF AAX59005.1 SF NP_0011239 SF AAW78017.1 SF XP_0053160 SF AAY57872.1 SF NP_0013583 SF NP_0011246 SF ETE61880.1 -- XP_0061228 SF XP_416822. SF
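The `*` marks in the consensus row of each alignment block flag columns where every sequence carries the same residue. The sketch below shows that check on three short excerpts copied from the first block (human NP_0013583, civet AAX63775.1, chicken XP_416822.). The function names are hypothetical and this is only an illustration of the idea, not the code Phylogeny.fr runs; the `:` and `.` marks for similar residues are not reproduced.

```python
def conserved_columns(aligned):
    """Return one flag per column: True when every sequence has the same residue."""
    if len({len(seq) for seq in aligned}) != 1:
        raise ValueError("aligned sequences must all have the same length")
    flags = []
    for column in zip(*aligned):
        first = column[0]
        # A column containing a gap ('-') is never counted as conserved.
        flags.append(first != "-" and all(res == first for res in column))
    return flags


def conservation_line(aligned):
    """Render the flags as a '*' / ' ' string, similar to the '*' marks above."""
    return "".join("*" if ok else " " for ok in conserved_columns(aligned))


# Excerpts (30 columns each) copied from the first alignment block.
human = "WLLL--SLVAVTAAQSTIEEQAKTFLDKFN"
civet = "WLLL--SFAALTAAQSTTEELAKTFLETFN"
chicken = "WLLC--GLSAVVTPQDVTQE-AQTFLAEFN"

if __name__ == "__main__":
    for seq in (human, civet, chicken):
        print(seq)
    print(conservation_line([human, civet, chicken]))
```

Because only three of the seventeen sequences are compared here, more columns will be flagged than in the full alignment.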
Part 3: Creating a phylogenetic tree with Phylogeny.fr
- Continued working with the www.phylogeny.fr website.
- Returned to step 6, Tree Rendering, to view the phylogenetic tree of the sequences.
- Horizontal lines represent individual evolutionary lineages.
- Vertical lines represent divergence (branching) events. Their length has no biological meaning.
- The left-most split is called the root of the tree, which represents a hypothesis about the most recent common ancestor (MRCA) of the sequences within the tree.
- The length of each horizontal branch represents the percentage change in the amino acid sequence occurring along that branch, relative to the scale bar.
- The scale bar was 0.5 (50%).
- I saved the image to a file and uploaded it to the wiki.
[Image: phylogenetic tree of the ACE2 sequences]
- The original image included the American mink. Because the mink entry is only a partial sequence, it was removed after further analysis.
- The new phylogenetic tree is visible below:
[Image: revised phylogenetic tree without the American mink sequence]
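As a rough worked example of reading branch lengths against the scale bar, the sketch below converts a branch length into an approximate number of changed amino acid positions. It assumes the interpretation given above (branch length as the fraction of positions that change) and uses the ungapped length of the human ACE2 sequence, 805 residues, as the number of sites. Both are simplifying assumptions, and the function name is hypothetical.

```python
def expected_changes(branch_length, n_sites):
    """Approximate number of changed positions along a branch.

    branch_length: fraction of positions that change (e.g. 0.5 for 50%).
    n_sites: number of positions considered.
    """
    if branch_length < 0 or n_sites < 0:
        raise ValueError("branch_length and n_sites must be non-negative")
    return branch_length * n_sites


HUMAN_ACE2_LENGTH = 805  # residues in NP_001358344.1 (assumed site count)
SCALE_BAR = 0.5          # scale bar value reported for this tree

if __name__ == "__main__":
    # A branch exactly as long as the scale bar
    print(expected_changes(SCALE_BAR, HUMAN_ACE2_LENGTH))  # 402.5
```

A branch one fifth as long as the scale bar would correspond to 0.1 on the same scale, or roughly 80 positions under these assumptions.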
Part 4: Structural Analysis and Critical Residues Table
- My research partner, Aiden Burnett, performed a structural analysis of the ACE2 receptor.
- Aiden Burnett also created a table detailing the differences between the critical amino acid residues.
- The methods for both the structural analysis and the table can be found on his user page.
Part 5: Sequence Percent Similarity Table
- Navigated to LALIGN, a tool researchers use to find the percent identity between sequences.
- LALIGN compares two sequences at a time.
- Entered the human ACE2 sequence in the field labeled '1st Query sequence'.
- Entered one of the other seventeen sequences in the field labeled '2nd Query sequence'.
- Clicked Run LALIGN.
- Noted the percent similarity between the sequences.
- Recorded this value in a table.
- Repeated the procedure for the other sixteen sequences, comparing each to the human ACE2 sequence.
- Created a table showing the percent similarity of each organism to humans.
[Image: table of the percent similarity of each organism's ACE2 sequence to the human sequence]
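The arithmetic behind a percent identity value can be sketched as follows. This is not LALIGN's algorithm: LALIGN builds its own local alignments, and its denominator may differ. The sketch assumes the two sequences are already aligned (gaps written as `-`), skips gapped columns, and uses hypothetical function names. The example sequences are excerpts copied from the first alignment block.

```python
def percent_identity(aligned_a, aligned_b):
    """Percent of ungapped aligned columns where both residues match."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    compared = 0
    identical = 0
    for a, b in zip(aligned_a, aligned_b):
        if a == "-" or b == "-":
            continue  # skip columns with a gap in either sequence
        compared += 1
        if a == b:
            identical += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * identical / compared


def identity_table(reference, others):
    """Compare every sequence to the reference; most similar first."""
    rows = [(name, percent_identity(reference, seq)) for name, seq in others.items()]
    return sorted(rows, key=lambda row: row[1], reverse=True)


if __name__ == "__main__":
    human = "WLLL--SLVAVTAAQSTIEEQAKTFLDKFN"
    others = {
        "civet": "WLLL--SFAALTAAQSTTEELAKTFLETFN",
        "chicken": "WLLC--GLSAVVTPQDVTQE-AQTFLAEFN",
    }
    for name, pct in identity_table(human, others):
        print(f"{name}: {pct:.1f}%")
```

Values from short excerpts like these will not match the full-length LALIGN results recorded in the table.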
Part 6: Final Presentation
- Uploaded presentation created based on the information presented above.
Scientific Conclusion
- Close primate relatives of humans, including grivet monkeys and orangutans, had ACE2 sequences closely similar to the human ACE2 sequence. Foxes and cats also showed similarities, while turtles and king cobras were not similar. Of the five critical ACE2 amino acids that interact with the RBD of SARS-CoV-2, many were relatively conserved across species, with most organisms having 2-3 of the 5 amino acids altered. These results could help in studying the lineage of SARS-CoV-2 and in identifying which animals could act as intermediary hosts for future strains of SARS viruses.
Acknowledgments
- I consulted with my partner Aiden Burnett in class, over text, as well as over the phone several times to discuss the creation of our presentation.
- I contacted my TA, Annika Dinulos, to ask about the formatting of a presentation.
- I copied and modified procedures from the Week 6 assignment page.
- I referred back to procedures used on my Week 4 and Week 5 pages.
- I used the Wan et al. (2020) paper, "Receptor Recognition by the Novel Coronavirus from Wuhan", for reference.
- I obtained sequences from GenBank.
- I built the phylogenetic tree and created sequence alignments using Phylogeny.fr.
- I used LALIGN to compare sequence percent similarities.
- I uploaded images using the Wiki Upload page.
- I copied and modified wiki syntax for formatting a photo from the MediaWiki Help page.
- Except for what is noted above, this individual journal entry was completed by me and not copied from another source.
Anna Horvath (talk) 21:25, 14 October 2020 (PDT)
References
- Andersen, K., Rambaut, A., Lipkin, W., Holmes, E., & Garry, R. (2020). The proximal origin of SARS-CoV-2. Nature Medicine, 26(4), 450-452. doi: 10.1038/s41591-020-0820-9
- Angiotensin-converting enzyme 2 isoform 1 precursor [Homo sapiens] - Protein - NCBI. (2020). Retrieved 8 October 2020, from https://www.ncbi.nlm.nih.gov/protein/NP_001358344.1?report=fasta
- Angiotensin-converting enzyme 2 [Paguma larvata] - Protein - NCBI. (2020). Retrieved 8 October 2020, from https://www.ncbi.nlm.nih.gov/protein/AAX63775.1?report=fasta
- Angiotensin-converting enzyme 2 [Rhinolophus sinicus] - Protein - NCBI. (2020). Retrieved 8 October 2020, from https://www.ncbi.nlm.nih.gov/protein/AGZ48803.1?report=fasta
- Angiotensin-converting enzyme 2 precursor [Mus musculus] - Protein - NCBI. (2020). Retrieved 8 October 2020, from https://www.ncbi.nlm.nih.gov/protein/NP_001123985.1?report=fasta
- Angiotensin converting enzyme 2 [Rattus norvegicus] - Protein - NCBI. (2020). Retrieved 8 October 2020, from https://www.ncbi.nlm.nih.gov/protein/AAW78017.1?report=fasta
- Angiotensin-converting enzyme 2 precursor [Sus scrofa] - Protein - NCBI. (2020). Retrieved 8 October 2020, from https://www.ncbi.nlm.nih.gov/protein/NP_001116542.1?report=fasta
- Angiotensin I converting enzyme 2 [Mustela putorius furo] - Protein - NCBI. (2020). Retrieved 8 October 2020, from https://www.ncbi.nlm.nih.gov/protein/BAE53380.1?report=fasta
- Angiotensin I converting enzyme 2 [Felis catus] - Protein - NCBI. (2020). Retrieved 8 October 2020, from https://www.ncbi.nlm.nih.gov/protein/AAX59005.1?report=fasta
- Angiotensin-converting enzyme 2 precursor [Pongo abelii] - Protein - NCBI. (2020). Retrieved 8 October 2020, from https://www.ncbi.nlm.nih.gov/protein/NP_001124604.1?report=fasta
- Angiotensin converting enzyme 2 [Chlorocebus aethiops] - Protein - NCBI. (2020). Retrieved 8 October 2020, from https://www.ncbi.nlm.nih.gov/protein/AAY57872.1?report=fasta
- Angiotensin-converting enzyme 2 [Vulpes vulpes] - Protein - NCBI. (2020). Retrieved 8 October 2020, from https://www.ncbi.nlm.nih.gov/protein/XP_025842513.1?report=fasta
- Angiotensin-converting enzyme 2 [Gallus gallus] - Protein - NCBI. (2020). Retrieved 8 October 2020, from https://www.ncbi.nlm.nih.gov/protein/XP_416822.2?report=fasta
- Angiotensin-converting enzyme 2 [Ophiophagus hannah] - Protein - NCBI. (2020). Retrieved 8 October 2020, from https://www.ncbi.nlm.nih.gov/protein/ETE61880.1?report=fasta
- Angiotensin I converting enzyme 2 [Manis pentadactyla] - Protein - NCBI. (2020). Retrieved 8 October 2020, from https://www.ncbi.nlm.nih.gov/protein/QLH93383.1?report=fasta
- Angiotensin-converting enzyme 2 [Camelus dromedarius] - Protein - NCBI. (2020). Retrieved 8 October 2020, from https://www.ncbi.nlm.nih.gov/protein/XP_031301717.1
- Angiotensin-converting enzyme 2, partial [Neovison vison] - Protein - NCBI. (2020). Retrieved 8 October 2020, from https://www.ncbi.nlm.nih.gov/protein/CCP86723.1?report=fasta
- Angiotensin-converting enzyme 2 [Ictidomys tridecemlineatus] - Protein - NCBI. (2020). Retrieved 8 October 2020, from https://www.ncbi.nlm.nih.gov/protein/XP_005316051.3?report=fasta
- Angiotensin-converting enzyme 2 [Pelodiscus sinensis] - Protein - NCBI. (2020). Retrieved 8 October 2020, from https://www.ncbi.nlm.nih.gov/protein/XP_006122891.1?report=fasta
- Deng, J., Jin, Y., Liu, Y., Sun, J., Hao, L., & Bai, J. et al. (2020). Serological survey of SARS‐CoV‐2 for experimental, domestic, companion and wild animals excludes intermediate hosts of 35 different species of animals. Transboundary And Emerging Diseases, 67(4), 1745-1749. doi: 10.1111/tbed.13577
- LALIGN Server. (2020). Retrieved 14 October 2020, from https://embnet.vital-it.ch/software/LALIGN_form.html
- OpenWetWare - Anna Horvath Week 4. (2020). Retrieved 14 October 2020, from https://openwetware.org/wiki/Anna_Horvath_Week_4
- OpenWetWare - Anna Horvath Week 5. (2020). Retrieved 14 October 2020, from https://openwetware.org/wiki/Anna_Horvath_Week_5
- OpenWetWare - BIOL368/F20:Week 6. (2020). Retrieved 14 October 2020, from https://openwetware.org/wiki/BIOL368/F20:Week_6
- Phylogeny.fr: "One Click" Mode. (2020). Retrieved 8 October 2020, from http://www.phylogeny.fr/simple_phylogeny.cgi?workflow_id=b9c0813cbbe9695d63cf7e31da5f026d&tab_index=1
- Wan, Y., Shang, J., Graham, R., Baric, R., & Li, F. (2020). Receptor Recognition by the Novel Coronavirus from Wuhan: an Analysis Based on Decade-Long Structural Studies of SARS Coronavirus. Journal Of Virology, 94(7). doi: 10.1128/jvi.00127-20
- Yuan, S., Jiang, S., & Li, Z. (2020). Analysis of Possible Intermediate Hosts of the New Coronavirus SARS-CoV-2. Frontiers In Veterinary Science, 7. doi: 10.3389/fvets.2020.00379
- Zhao, J., Cui, W., & Tian, B. (2020). The Potential Intermediate Hosts for SARS-CoV-2. Frontiers In Microbiology, 11. doi: 10.3389/fmicb.2020.580137