Nathan R Beshai Week 6
Nathan R. Beshai User Page
Nathan R. Beshai Template Page
Course assignments
Individual journal assignments
- Nathan R Beshai Week 2
- Nathan R Beshai Week 3
- Nathan R Beshai Week 4
- Nathan R Beshai Week 5
- Nathan R Beshai Week 6
- Nathan R Beshai Week 7
- Nathan R Beshai Week 8
- Nathan R Beshai Week 9
- Nathan R Beshai Week 10
- Nathan R Beshai Week 11
- The D614G Research Group Week 12
- The D614G Research Group Week 14
Class Journals
- Class Journal 1
- Class Journal 2
- Class Journal 3
- Class Journal 4
- Class Journal 5
- Class Journal 6
- Class Journal 7
- Class Journal 8
- Class Journal 9
- Class Journal 10
- Class Journal 11
- Class Journal 12
- Class Journal 14
Link to Brightspace and LMU's Homepage
Purpose
To see how closely related species' ACE2 sequences that are more compatible with the 2019-nCoV spike protein are, to see how well conserved the necessary human ACE2 amino acid residues are across the species' sequence, and to see if any of the human 5 amino acid residues that bind to the 2019-nCoV spike protein are necessary for the ACE2 function and structure.
Methods and results
1. Searched and collected the ACE-2 protein sequences from UniProt and if they were not available collected the sequences from the NCBI database.
1.Went to UniProt and copied the ACE-2 sequence for the Pig (unreviewed).
>tr|A0A220QT48|A0A220QT48_PIG Angiotensin-converting enzyme OS=Sus scrofa domesticus OX=9825 GN=ACE2 PE=2 SV=1 MSGSFWLLLSLIPVTAAQSTTEELAKTFLEKFNLEAEDLAYQSSLASWNYNTNITDENIQ KMNDARAKWSAFYEEQSRIAKTYPLDEIQTLILKRQLQALQQSGTSGLSADKSKRLNTIL NTMSTIYSSGKVLDPNNPQECLVLEPGLDEIMENSKDYSRRLWAWESWRAEVGKQLRPLY EEYVVLENEMARANNYEDYGDYWRGDYEVTGTGDYDYSRNQLMEDVERTFAEIKPLYEHL HAYVRAKLMDAYPSRISPTGCLPAHLLGDMWGRFWTNLYPLTVPFGEKPSIDVTEAMVNQ SWDAIRIFEEAEKFFVSIGLPNMTQGFWNNSMLTEPGDGRKVVCHPTAWDLGKGDFRIKM CTKVTMDDFLTAHHEMGHIQYDMAYAIQPYLLRNGANEGFHEAVGEIMSLSAATPHYLKA LGLLPPDFYEDSETEINFLLKQALTIVGTLPFTYMLEKWRWMVFKGEIPKEQWMQKWWEM KREIVGVVEPLPHDETYCDPACLFHVAEDYSFIRYYTRTIYQFQFHEALCRTAKHEGPLY KCDISNSTEAGQKLLQMLSLGKSEPWTLALENIVGVKTMDVKPLLSYFEPLLTWLKAQNG NSSVGWNTDWTPYADQSIKVRISLKSALGKEAYEWNDNEMYLFRSSIAYAMRNYFSSAKN ETIPFGAEDVWVSDLKPRISFNFFVTSPANMSDIIPRSDVEKAISMSRSRINDAFRLDDN TLEFLGIQPTLGPPDEPPVTVWLIIFGVVMGLVVVGIVVLIFTGIRDRRKKKQASSEENP YGSMDLSKGESNSGFQNGDDIQTSF
2.Went to NCBI and copied the ACE-2 sequence for the Ferrat.
>BAE53380.1 angiotensin I converting enzyme 2 [Mustela putorius furo] MLGSSWLLLSLAALTAAQSTTEDLAKTFLEKFNYEAEELSYQNSLASWNYNTNITDENIQKMNIAGAKWS AFYEEESQHAKTYPLEEIQDPIIKRQLRALQQSGSSVLSADKRERLNTILNAMSTIYSTGKACNPNNPQE CLLLEPGLDDIMENSKDYNERLWAWEGWRSEVGKQLRPLYEEYVALKNEMARANNYEDYGDYWRGDYEEE WADGYSYSRNQLIEDVEHTFTQIKPLYEHLHAYVRAKLMDAYPSRISPTGCLPAHLLGDMWGRFWTNLYP LMVPFRQKPNIDVTDAMVNQSWDARRIFEEAETFFVSVGLPNMTEGFWQNSMLTEPGDNRKVVCHPTAWD LGKRDFRIKMCTKVTMDDFLTAHHEMGHIQYDMAYAEQPFLLRNGANEGFHEAVGEIMSLSAATPNHLKN IGLLPPDFSEDSETDINFLLKQALTIVGTLPFTYMLEKWRWMVFKGEIPKEQWMQKWWEMKRDIVGVVEP LPHDETYCDPAALFHVANDYSFIRYYTRTIYQFQFQEALCQIAKHEGPLYKCDISNSSEAGQKLHEMLSL GRSKPWTFALERVVGAKTMDVRPLLNYFEPLFTWLKEQNRNSFVGWNTDWSPYADQSIKVRISLKSALGE KAYEWNDNEMYFFQSSIAYAMREYFSKVKNQTIPFVGKDVRVSDLKPRISFNFIVTSPENMSDIIPRADV EEAIRKSRGRINDAFRLDDNSLEFLGIQPTLEPPYQPPVTIWLIVFGVVMGVVVVGIFLLIFSGIRNRRK NNQARSEENPYASVDLSKGENNPGFQNVDDVQTSF
3.Went to UniProt and copied the ACE-2 sequence for the Chimpanzee (unreviewed).
>tr|A0A2J8KU96|A0A2J8KU96_PANTR Angiotensin-converting enzyme OS=Pan troglodytes OX=9598 GN=ACE2 PE=3 SV=1 MSGSSWLLLSLVAVTAAQSTIEEQAKTFLDKFNHEAEDLFYQSSLASWNYNTNITEENVQ NMNNAGDKWSAFLKEQSTLAQMYPLQEIQNLTVKLQLQALQQNGSSVLSEDKSKRLNTIL NTMSAIYSTGKVCNPNNPQECLLLEPGLNEIMANSLDYNERLWAWESWRSEVGKQLRPLY EEYVVLKNEMARANHYEDYGDYWRGDYEVNGVDGYDYSRGQLIEDVEHTFEEIKPLYEHL HAYVRAKLMNAYPSYISPIGCLPAHLLGDMWGRFWTNLYSLTVPFGQKPNIDVTDAMVDQ AWDAQRIFKEAEKFFVSVGLPNMTQGFWENSMLTDPGNVQKAVCHPTAWDLGKGDFRILM CTKVTMDDFLTAHHEMGHIQYDMAYAAQPFLLRNGANEGFHEAVGEIMSLSAATPKHLKS IGLLSPDFQEDNETEINFLLKQALTIVGTLPFTYMLEKWRWMVFKGEIPEDQWMKKWWEM KREIVGVVEPVPHDETYCDPASLFHVSNDYSFIRYYTRTLYQFQFQEALCQAAKHEGPLH KCDISNSTEAGQKLFNMLRLGKSEPWTLALENVVGAKNMNVRPLLNYFEPLFTWLKDQNK NSFVGWSTDWSPYADQSIKVRISLKSALGDKAYEWNDNEMYLFRSSVAYAMRQYFLKVKN QMILFGEEDVRVANLKPRISFNFFVTAPKNVSDIIPRTEVEKAIRKSRSRINDAFRLNDN SLEFLGIQPTLGPPNQPPVSIWLIVFGVVMGVIVVGIVILIFTGIRDRKKKNKARSEENP YASVDTSKGENNPGFQNTDDVQTSF
4. Went to UniProt and copied the ACE-2 sequence for the Homosapien (human).
>sp|Q9BYF1|ACE2_HUMAN Angiotensin-converting enzyme 2 OS=Homo sapiens OX=9606 GN=ACE2 PE=1 SV=2 MSSSSWLLLSLVAVTAAQSTIEEQAKTFLDKFNHEAEDLFYQSSLASWNYNTNITEENVQ NMNNAGDKWSAFLKEQSTLAQMYPLQEIQNLTVKLQLQALQQNGSSVLSEDKSKRLNTIL NTMSTIYSTGKVCNPDNPQECLLLEPGLNEIMANSLDYNERLWAWESWRSEVGKQLRPLY EEYVVLKNEMARANHYEDYGDYWRGDYEVNGVDGYDYSRGQLIEDVEHTFEEIKPLYEHL HAYVRAKLMNAYPSYISPIGCLPAHLLGDMWGRFWTNLYSLTVPFGQKPNIDVTDAMVDQ AWDAQRIFKEAEKFFVSVGLPNMTQGFWENSMLTDPGNVQKAVCHPTAWDLGKGDFRILM CTKVTMDDFLTAHHEMGHIQYDMAYAAQPFLLRNGANEGFHEAVGEIMSLSAATPKHLKS IGLLSPDFQEDNETEINFLLKQALTIVGTLPFTYMLEKWRWMVFKGEIPKDQWMKKWWEM KREIVGVVEPVPHDETYCDPASLFHVSNDYSFIRYYTRTLYQFQFQEALCQAAKHEGPLH KCDISNSTEAGQKLFNMLRLGKSEPWTLALENVVGAKNMNVRPLLNYFEPLFTWLKDQNK NSFVGWSTDWSPYADQSIKVRISLKSALGDKAYEWNDNEMYLFRSSVAYAMRQYFLKVKN QMILFGEEDVRVANLKPRISFNFFVTAPKNVSDIIPRTEVEKAIRMSRSRINDAFRLNDN SLEFLGIQPTLGPPNQPPVSIWLIVFGVVMGVIVVGIVILIFTGIRDRKKKNKARSGENP YASIDISKGENNPGFQNTDDVQTSF
5.Went to UniProt and copied the ACE-2 sequence for the Masked Palm civet.
>sp|Q56NL1|ACE2_PAGLA Angiotensin-converting enzyme 2 OS=Paguma larvata OX=9675 GN=ACE2 PE=1 SV=1 MSGSFWLLLSFAALTAAQSTTEELAKTFLETFNYEAQELSYQSSVASWNYNTNITDENAK NMNEAGAKWSAYYEEQSKLAQTYPLAEIQDAKIKRQLQALQQSGSSVLSADKSQRLNTIL NAMSTIYSTGKACNPNNPQECLLLEPGLDNIMENSKDYNERLWAWEGWRAEVGKQLRPLY EEYVALKNEMARANNYEDYGDYWRGDYEEEWTGGYNYSRNQLIQDVEDTFEQIKPLYQHL HAYVRAKLMDTYPSRISRTGCLPAHLLGDMWGRFWTNLYPLTVPFGQKPNIDVTDAMVNQ NWDARRIFKEAEKFFVSVGLPNMTQGFWENSMLTEPGDGRKVVCHPTAWDLGKGDFRIKM CTKVTMDDFLTAHHEMGHIQYDMAYAAQPFLLRNGANEGFHEAVGEIMSLSAATPNHLKT IGLLSPAFSEDNETEINFLLKQALTIVGTLPFTYMLEKWRWMVFKGAIPKEQWMQKWWEM KRNIVGVVEPVPHDETYCDPASLFHVANDYSFIRYYTRTIYQFQFQEALCQIAKHEGPLH KCDISNSTEAGKKLLEMLSLGRSEPWTLALERVVGAKNMNVTPLLNYFEPLFTWLKEQNR NSFVGWDTDWRPYSDQSIKVRISLKSALGEKAYEWNDNEMYLFRSSIAYAMREYFSKVKN QTIPFVEDNVWVSDLKPRISFNFFVTFSNNVSDVIPRSEVEDAIRMSRSRINDAFRLDDN SLEFLGIEPTLSPPYRPPVTIWLIVFGVVMGAIVVGIVLLIVSGIRNRRKNDQAGSEENP YASVDLNKGENNPGFQHADDVQTSF
6.Went to UniProt and copied the ACE-2 sequence for Mus musculus.
>sp|Q8R0I0|ACE2_MOUSE Angiotensin-converting enzyme 2 OS=Mus musculus OX=10090 GN=Ace2 PE=1 SV=1 MSSSSWLLLSLVAVTTAQSLTEENAKTFLNNFNQEAEDLSYQSSLASWNYNTNITEENAQ KMSEAAAKWSAFYEEQSKTAQSFSLQEIQTPIIKRQLQALQQSGSSALSADKNKQLNTIL NTMSTIYSTGKVCNPKNPQECLLLEPGLDEIMATSTDYNSRLWAWEGWRAEVGKQLRPLY EEYVVLKNEMARANNYNDYGDYWRGDYEAEGADGYNYNRNQLIEDVERTFAEIKPLYEHL HAYVRRKLMDTYPSYISPTGCLPAHLLGDMWGRFWTNLYPLTVPFAQKPNIDVTDAMMNQ GWDAERIFQEAEKFFVSVGLPHMTQGFWANSMLTEPADGRKVVCHPTAWDLGHGDFRIKM CTKVTMDNFLTAHHEMGHIQYDMAYARQPFLLRNGANEGFHEAVGEIMSLSAATPKHLKS IGLLPSDFQEDSETEINFLLKQALTIVGTLPFTYMLEKWRWMVFRGEIPKEQWMKKWWEM KREIVGVVEPLPHDETYCDPASLFHVSNDYSFIRYYTRTIYQFQFQEALCQAAKYNGSLH KCDISNSTEAGQKLLKMLSLGNSEPWTKALENVVGARNMDVKPLLNYFQPLFDWLKEQNR NSFVGWNTEWSPYADQSIKVRISLKSALGANAYEWTNNEMFLFRSSVAYAMRKYFSIIKN QTVPFLEEDVRVSDLKPRVSFYFFVTSPQNVSDVIPRSEVEDAIRMSRGRINDVFGLNDN SLEFLGIHPTLEPPYQPPVTIWLIIFGVVMALVVVGIIILIVTGIKGRKKKNETKREENP YDSMDIGKGESNAGFQNSDDAQTSF
7. Went to UniProt and copied the ACE-2 sequence for Rattus norvegicus.
>sp|Q5EGZ1|ACE2_RAT Angiotensin-converting enzyme 2 OS=Rattus norvegicus OX=10116 GN=Ace2 PE=1 SV=1 MSSSCWLLLSLVAVATAQSLIEEKAESFLNKFNQEAEDLSYQSSLASWNYNTNITEENAQ KMNEAAAKWSAFYEEQSKIAQNFSLQEIQNATIKRQLKALQQSGSSALSPDKNKQLNTIL NTMSTIYSTGKVCNSMNPQECFLLEPGLDEIMATSTDYNRRLWAWEGWRAEVGKQLRPLY EEYVVLKNEMARANNYEDYGDYWRGDYEAEGVEGYNYNRNQLIEDVENTFKEIKPLYEQL HAYVRTKLMEVYPSYISPTGCLPAHLLGDMWGRFWTNLYPLTTPFLQKPNIDVTDAMVNQ SWDAERIFKEAEKFFVSVGLPQMTPGFWTNSMLTEPGDDRKVVCHPTAWDLGHGDFRIKM CTKVTMDNFLTAHHEMGHIQYDMAYAKQPFLLRNGANEGFHEAVGEIMSLSAATPKHLKS IGLLPSNFQEDNETEINFLLKQALTIVGTLPFTYMLEKWRWMVFQDKIPREQWTKKWWEM KREIVGVVEPLPHDETYCDPASLFHVSNDYSFIRYYTRTIYQFQFQEALCQAAKHDGPLH KCDISNSTEAGQKLLNMLSLGNSGPWTLALENVVGSRNMDVKPLLNYFQPLFVWLKEQNR NSTVGWSTDWSPYADQSIKVRISLKSALGKNAYEWTDNEMYLFRSSVAYAMREYFSREKN QTVPFGEADVWVSDLKPRVSFNFFVTSPKNVSDIIPRSEVEEAIRMSRGRINDIFGLNDN SLEFLGIYPTLKPPYEPPVTIWLIIFGVVMGTVVVGIVILIVTGIKGRKKKNETKREENP YDSMDIGKGESNAGFQNSDDAQTSF
8.Went to UniProt and copied the ACE-2 sequence for the Felis Cat.
>sp|Q56H28|ACE2_FELCA Angiotensin-converting enzyme 2 OS=Felis catus OX=9685 GN=ACE2 PE=2 SV=1 MSGSFWLLLSFAALTAAQSTTEELAKTFLEKFNHEAEELSYQSSLASWNYNTNITDENVQ KMNEAGAKWSAFYEEQSKLAKTYPLAEIHNTTVKRQLQALQQSGSSVLSADKSQRLNTIL NAMSTIYSTGKACNPNNPQECLLLEPGLDDIMENSKDYNERLWAWEGWRAEVGKQLRPLY EEYVALKNEMARANNYEDYGDYWRGDYEEEWTDGYNYSRSQLIKDVEHTFTQIKPLYQHL HAYVRAKLMDTYPSRISPTGCLPAHLLGDMWGRFWTNLYPLTVPFGQKPNIDVTDAMVNQ SWDARRIFKEAEKFFVSVGLPNMTQGFWENSMLTEPGDSRKVVCHPTAWDLGKGDFRIKM CTKVTMDDFLTAHHEMGHIQYDMAYAVQPFLLRNGANEGFHEAVGEIMSLSAATPNHLKT IGLLSPGFSEDSETEINFLLKQALTIVGTLPFTYMLEKWRWMVFKGEIPKEQWMQKWWEM KREIVGVVEPVPHDETYCDPASLFHVANDYSFIRYYTRTIYQFQFQEALCRIAKHEGPLH KCDISNSSEAGKKLLQMLTLGKSKPWTLALEHVVGEKKMNVTPLLKYFEPLFTWLKEQNR NSFVGWNTDWRPYADQSIKVRISLKSALGDEAYEWNDNEMYLFRSSVAYAMREYFSKVKN QTIPFVEDNVWVSNLKPRISFNFFVTASKNVSDVIPRSEVEEAIRMSRSRINDAFRLDDN SLEFLGIQPTLSPPYQPPVTIWLIVFGVVMGVVVVGIVLLIVSGIRNRRKNNQARSEENP YASVDLSKGENNPGFQHADDVQTSF
9.Went to UniProt and copied the ACE-2 sequence for the Horseshoe Bat (unreviewed).
>tr|E2DHI2|E2DHI2_RHIFE Angiotensin-converting enzyme OS=Rhinolophus ferrumequinum OX=59479 GN=ACE2 PE=2 SV=1 MSGSSWFLLSLVAVTAAQSTTEDLAKKFLDDFNSEAENLSHQSSLASWEYNTNISDENVQ KMDEAGAKWSDFYEKQSKLAKNFSLEEIHNDTVKLQLQILQQSGSPVLSEDKSKRLNSIL NAMSTIYSTGKVCKPNNPQECLLLEPGLDNIMGTSKDYNERLWAWEGWRAEVGKQLRPLY EEYVVLKNEMARGYHYEDYGDYWRRDYETEGSPDLEYSRDQLIKDVERIFAEIKPLYEQL HAYVRTKLMDTYPFHISPTGCLPAHLLGDMWGRFWTNLYPLTVPFGQKPNIDVTDAMLNQ NWDAKRIFKEAEKFLVSIGLPNMTEGFWNNSMLTDPGDGRKVVCHPTAWDLGKGDFRIKM CTKVTMEDFLTAHHEMGHIQYDMAYASQPYLLRNGANEGFHEAVGEVMSLSVATPEHLKT MGLLSSDFLEDNETEINFLFKQALNIVGTLPLTYMLEKWRWMVFKGEIPKEEWMKKWWEM KRKIVGVVEPVPHDETYCDPASLFHVANDYSFIRYYTRTIFEFQFHEALCRIAKHDGPLH KCGISNSTDAGEKLHQMLSVGKSQPWTSVLKDFVGSKNMDVGPLLRYFEPLYTWLTEQNR KSFVGWNTDWSPYADQSIKVWISLKSALGEKAYEWNNNEMYLFRSSVAYAMREYFLKTKN QTILFGEEDVWVSNLKPRISFNFYVTSPRNLSDIIPRPEVEGAIRMSRSRINDAFRLDDN SLEFLGIQPTLGPPYQPPVTIWLIVFGVVMAVVVVGIVVLIITGIRDRRKKDQARSEENP YSSVDLSKGENNPGFQNGNDVQTSF
10.Went to NCBI and copied the ACE-2 sequence for the Rabbit.
>QHX39726.1 angiotensin I converting enzyme 2 [Oryctolagus cuniculus] MSGSSWLLLSLVAVTAAQSTIEELAKTFLEKFNQEAEDLSYQSALASWDYNTNITEENVQKMNDAEAKWS AFYEEQSKLAKTYPSQEVQNLTVKRQLQALQQSGSSALSADKSKQLNTILSTMSTIYSTGKVCNQSNPQE CFLLEPGLDEIMAKSTDYNERLWAWEGWRSVVGKQLRPLYEEYVVLKNEMARANNYEDYGDYWRADYEAE GADGYDYSRSQLIDDVERTFSEIKPLYEQLHAYVRTKLMDAYPSRISPTGCLPAHLLGDMWGRFWTNLYS LTVPFGQKPNIDVTDTMVNQGWDAERIFKEAEKFFVSVGLPSMTHGFWENSMLPESGDGRKVVCHPTAWD LGKRDFRIKMCTKVTMDNFLTAHHEMGHIQYDMAYATQPFLLRNGANEGFHEAVGEIMSLSAATPEHLKS IGLLPYDFHEDNETEINFLLKQALTIVGTLPFTYMLEKWRWMVFKGEIPKEQWMQKWWEMKREIVGVVEP MPHDETYCDPAALFHVANDYSFIRYYTRTIYQFQFQEALCQAAQHEGPLHKCDISNSTEAGQKLLNMLRL GRSEPWTLALENVVGAKNMDVRPLLNYFEPLFTWLKEQNRNSFVGWSTEWTPYADQSIKVRISLKTALGD QAYEWNDSEMYLFRSSVAYAMRKYFSEVKNQTILFGEEDVRVSDLKPRISFNFFVTAPNNVNDIIPRNEV EEAISMSRSRINDIFRLDDNSLEFVGIQPTLEPPYESPVPIWLVVFGVVMGMIVIGIVVLIFTGIKDRRK QKQAKREENPYGFVDMSKGENNSGFQNSDDIQTSF
11. Went to UniProt and copied the ACE-2 sequence for the opossum.
>tr|F6WXR7|F6WXR7_MONDO Angiotensin-converting enzyme OS=Monodelphis domestica OX=13616 GN=ACE2 PE=3 SV=2 MLDPLWLFFSLLAVTAAQNSIEEDAKTFLDDYNAKAEELSHQSALASWEYNTNITNENVE KMNEAAARWSSFYENQSSISRTYPLNEITNATVKLQLKSLQKKEGAVLSTEQSVRLNTIL NTMSTLYSTGSVCNSETPQQCFLLEPGLDKIMDESTDYDERLWAWEGWRSKVGKEMRPLY EEYVELKNELAKGNNYEDYGDYWRGDYEVEEPSEYVYSRPQLKKDVENTFKQIKSLYEHL HAYVRRKMRNTYGSLISETGGLPAHLLGDMWGRFWTNLYSLTMPYREKPNIDVTSAMKKQ NWSARRIFQEAEMFFASVGLPNMTEGFWKNSMLTEPNDGRKVVCHPTAWDLGKNDFRIKM CTKVTMDDFLTAHHEMGHIQYDMAYAKQPFTLRNGANEGFHEAVGEIMSLSAATPKHLQA LGLLPPTFQEDNETEINFLFKQALTIIGTMPFTYMLENWRWMVFEGKIPKEEWMKKWWEM KREIVGVVEPLPHDETYCDPAALFHVANDYSFIRYYTRTIYQFQFHKALCKIAQPSAALH KCDITNSTEAGTKLQNMLKMGKSEPWTKALESIVGNKMMDAGPLLEYFEPLFTWLKEQNK DAYVGWNTDWSPYNAYKIKVRISLKTLGENAYTWNENEMYLFQSSIVFAMRQYFLIKKKQ SIPFSNENVKMFDLKPRISFYFFVTFPPNGTSFVPREEVEAAISMSRDRINDAFRLNDNS LEFVGISPTLAPPYEPPVTVWMIVFGVVMGIVVIGIVYLIYTGVRDRKKRAKTSSSNDEN PYVDVDVAGGQHNPAFQSSEDAQTSF
2.Went to a phylogeny tree builder and sequence aligner.
- Copied the ACE-2 sequence and pasted them in the one-click.
- Copied the generated sequence alignment and pasted the results below.
Sequence alignment
CLUSTAL FORMAT: MUSCLE (3.8) multiple sequence alignments
tr|E2DHI2| MSGSSWFLLSLVAVTAAQSTTEDLAKKFLDDFNSEAENLSHQSSLASWEYNTNISDENVQ
sp|Q8R0I0| MSSSSWLLLSLVAVTTAQSLTEENAKTFLNNFNQEAEDLSYQSSLASWNYNTNITEENAQ
sp|Q5EGZ1| MSSSCWLLLSLVAVATAQSLIEEKAESFLNKFNQEAEDLSYQSSLASWNYNTNITEENAQ
tr|A0A220Q MSGSFWLLLSLIPVTAAQSTTEELAKTFLEKFNLEAEDLAYQSSLASWNYNTNITDENIQ
QHX39726.1 MSGSSWLLLSLVAVTAAQSTIEELAKTFLEKFNQEAEDLSYQSALASWDYNTNITEENVQ
tr|A0A2J8K MSGSSWLLLSLVAVTAAQSTIEEQAKTFLDKFNHEAEDLFYQSSLASWNYNTNITEENVQ
sp|Q9BYF1| MSSSSWLLLSLVAVTAAQSTIEEQAKTFLDKFNHEAEDLFYQSSLASWNYNTNITEENVQ
BAE53380.1 MLGSSWLLLSLAALTAAQSTTEDLAKTFLEKFNYEAEELSYQNSLASWNYNTNITDENIQ
sp|Q56NL1| MSGSFWLLLSFAALTAAQSTTEELAKTFLETFNYEAQELSYQSSVASWNYNTNITDENAK
sp|Q56H28| MSGSFWLLLSFAALTAAQSTTEELAKTFLEKFNHEAEELSYQSSLASWNYNTNITDENVQ
* .* *:***: .:::*** *: *:.**: ** **::* :*.::***:*****::** :
tr|E2DHI2| KMDEAGAKWSDFYEKQSKLAKNFSLEEIHNDTVKLQLQILQQSGSPVLSEDKSKRLNSIL
sp|Q8R0I0| KMSEAAAKWSAFYEEQSKTAQSFSLQEIQTPIIKRQLQALQQSGSSALSADKNKQLNTIL
sp|Q5EGZ1| KMNEAAAKWSAFYEEQSKIAQNFSLQEIQNATIKRQLKALQQSGSSALSPDKNKQLNTIL
tr|A0A220Q KMNDARAKWSAFYEEQSRIAKTYPLDEIQTLILKRQLQALQQSGTSGLSADKSKRLNTIL
QHX39726.1 KMNDAEAKWSAFYEEQSKLAKTYPSQEVQNLTVKRQLQALQQSGSSALSADKSKQLNTIL
tr|A0A2J8K NMNNAGDKWSAFLKEQSTLAQMYPLQEIQNLTVKLQLQALQQNGSSVLSEDKSKRLNTIL
sp|Q9BYF1| NMNNAGDKWSAFLKEQSTLAQMYPLQEIQNLTVKLQLQALQQNGSSVLSEDKSKRLNTIL
BAE53380.1 KMNIAGAKWSAFYEEESQHAKTYPLEEIQDPIIKRQLRALQQSGSSVLSADKRERLNTIL
sp|Q56NL1| NMNEAGAKWSAYYEEQSKLAQTYPLAEIQDAKIKRQLQALQQSGSSVLSADKSQRLNTIL
sp|Q56H28| KMNEAGAKWSAFYEEQSKLAKTYPLAEIHNTTVKRQLQALQQSGSSVLSADKSQRLNTIL
:*. * *** : :::* *: :. *:: :* **. ***.*:. ** ** :.**:**
tr|E2DHI2| NAMSTIYSTGKVCKPNNPQECLLLEPGLDNIMGTSKDYNERLWAWEGWRAEVGKQLRPLY
sp|Q8R0I0| NTMSTIYSTGKVCNPKNPQECLLLEPGLDEIMATSTDYNSRLWAWEGWRAEVGKQLRPLY
sp|Q5EGZ1| NTMSTIYSTGKVCNSMNPQECFLLEPGLDEIMATSTDYNRRLWAWEGWRAEVGKQLRPLY
tr|A0A220Q NTMSTIYSSGKVLDPNNPQECLVLEPGLDEIMENSKDYSRRLWAWESWRAEVGKQLRPLY
QHX39726.1 STMSTIYSTGKVCNQSNPQECFLLEPGLDEIMAKSTDYNERLWAWEGWRSVVGKQLRPLY
tr|A0A2J8K NTMSAIYSTGKVCNPNNPQECLLLEPGLNEIMANSLDYNERLWAWESWRSEVGKQLRPLY
sp|Q9BYF1| NTMSTIYSTGKVCNPDNPQECLLLEPGLNEIMANSLDYNERLWAWESWRSEVGKQLRPLY
BAE53380.1 NAMSTIYSTGKACNPNNPQECLLLEPGLDDIMENSKDYNERLWAWEGWRSEVGKQLRPLY
sp|Q56NL1| NAMSTIYSTGKACNPNNPQECLLLEPGLDNIMENSKDYNERLWAWEGWRAEVGKQLRPLY
sp|Q56H28| NAMSTIYSTGKACNPNNPQECLLLEPGLDDIMENSKDYNERLWAWEGWRAEVGKQLRPLY
.:**:***:**. . *****::*****::** .* **. ******.**: *********
tr|E2DHI2| EEYVVLKNEMARGYHYEDYGDYWRRDYETEGSPDLEYSRDQLIKDVERIFAEIKPLYEQL
sp|Q8R0I0| EEYVVLKNEMARANNYNDYGDYWRGDYEAEGADGYNYNRNQLIEDVERTFAEIKPLYEHL
sp|Q5EGZ1| EEYVVLKNEMARANNYEDYGDYWRGDYEAEGVEGYNYNRNQLIEDVENTFKEIKPLYEQL
tr|A0A220Q EEYVVLENEMARANNYEDYGDYWRGDYEVTGTGDYDYSRNQLMEDVERTFAEIKPLYEHL
QHX39726.1 EEYVVLKNEMARANNYEDYGDYWRADYEAEGADGYDYSRSQLIDDVERTFSEIKPLYEQL
tr|A0A2J8K EEYVVLKNEMARANHYEDYGDYWRGDYEVNGVDGYDYSRGQLIEDVEHTFEEIKPLYEHL
sp|Q9BYF1| EEYVVLKNEMARANHYEDYGDYWRGDYEVNGVDGYDYSRGQLIEDVEHTFEEIKPLYEHL
BAE53380.1 EEYVALKNEMARANNYEDYGDYWRGDYEEEWADGYSYSRNQLIEDVEHTFTQIKPLYEHL
sp|Q56NL1| EEYVALKNEMARANNYEDYGDYWRGDYEEEWTGGYNYSRNQLIQDVEDTFEQIKPLYQHL
sp|Q56H28| EEYVALKNEMARANNYEDYGDYWRGDYEEEWTDGYNYSRSQLIKDVEHTFTQIKPLYQHL
****.*:*****. :*:******* *** . .*.*.**:.*** * :*****::*
tr|E2DHI2| HAYVRTKLMDTYPFHISPTGCLPAHLLGDMWGRFWTNLYPLTVPFGQKPNIDVTDAMLNQ
sp|Q8R0I0| HAYVRRKLMDTYPSYISPTGCLPAHLLGDMWGRFWTNLYPLTVPFAQKPNIDVTDAMMNQ
sp|Q5EGZ1| HAYVRTKLMEVYPSYISPTGCLPAHLLGDMWGRFWTNLYPLTTPFLQKPNIDVTDAMVNQ
tr|A0A220Q HAYVRAKLMDAYPSRISPTGCLPAHLLGDMWGRFWTNLYPLTVPFGEKPSIDVTEAMVNQ
QHX39726.1 HAYVRTKLMDAYPSRISPTGCLPAHLLGDMWGRFWTNLYSLTVPFGQKPNIDVTDTMVNQ
tr|A0A2J8K HAYVRAKLMNAYPSYISPIGCLPAHLLGDMWGRFWTNLYSLTVPFGQKPNIDVTDAMVDQ
sp|Q9BYF1| HAYVRAKLMNAYPSYISPIGCLPAHLLGDMWGRFWTNLYSLTVPFGQKPNIDVTDAMVDQ
BAE53380.1 HAYVRAKLMDAYPSRISPTGCLPAHLLGDMWGRFWTNLYPLMVPFRQKPNIDVTDAMVNQ
sp|Q56NL1| HAYVRAKLMDTYPSRISRTGCLPAHLLGDMWGRFWTNLYPLTVPFGQKPNIDVTDAMVNQ
sp|Q56H28| HAYVRAKLMDTYPSRISPTGCLPAHLLGDMWGRFWTNLYPLTVPFGQKPNIDVTDAMVNQ
***** ***:.** ** ********************.* .** :**.****::*::*
tr|E2DHI2| NWDAKRIFKEAEKFLVSIGLPNMTEGFWNNSMLTDPGDGRKVVCHPTAWDLGKGDFRIKM
sp|Q8R0I0| GWDAERIFQEAEKFFVSVGLPHMTQGFWANSMLTEPADGRKVVCHPTAWDLGHGDFRIKM
sp|Q5EGZ1| SWDAERIFKEAEKFFVSVGLPQMTPGFWTNSMLTEPGDDRKVVCHPTAWDLGHGDFRIKM
tr|A0A220Q SWDAIRIFEEAEKFFVSIGLPNMTQGFWNNSMLTEPGDGRKVVCHPTAWDLGKGDFRIKM
QHX39726.1 GWDAERIFKEAEKFFVSVGLPSMTHGFWENSMLPESGDGRKVVCHPTAWDLGKRDFRIKM
tr|A0A2J8K AWDAQRIFKEAEKFFVSVGLPNMTQGFWENSMLTDPGNVQKAVCHPTAWDLGKGDFRILM
sp|Q9BYF1| AWDAQRIFKEAEKFFVSVGLPNMTQGFWENSMLTDPGNVQKAVCHPTAWDLGKGDFRILM
BAE53380.1 SWDARRIFEEAETFFVSVGLPNMTEGFWQNSMLTEPGDNRKVVCHPTAWDLGKRDFRIKM
sp|Q56NL1| NWDARRIFKEAEKFFVSVGLPNMTQGFWENSMLTEPGDGRKVVCHPTAWDLGKGDFRIKM
sp|Q56H28| SWDARRIFKEAEKFFVSVGLPNMTQGFWENSMLTEPGDSRKVVCHPTAWDLGKGDFRIKM
*** ***:***.*:**:*** ** *** ****.:..: .*.**********: **** *
tr|E2DHI2| CTKVTMEDFLTAHHEMGHIQYDMAYASQPYLLRNGANEGFHEAVGEVMSLSVATPEHLKT
sp|Q8R0I0| CTKVTMDNFLTAHHEMGHIQYDMAYARQPFLLRNGANEGFHEAVGEIMSLSAATPKHLKS
sp|Q5EGZ1| CTKVTMDNFLTAHHEMGHIQYDMAYAKQPFLLRNGANEGFHEAVGEIMSLSAATPKHLKS
tr|A0A220Q CTKVTMDDFLTAHHEMGHIQYDMAYAIQPYLLRNGANEGFHEAVGEIMSLSAATPHYLKA
QHX39726.1 CTKVTMDNFLTAHHEMGHIQYDMAYATQPFLLRNGANEGFHEAVGEIMSLSAATPEHLKS
tr|A0A2J8K CTKVTMDDFLTAHHEMGHIQYDMAYAAQPFLLRNGANEGFHEAVGEIMSLSAATPKHLKS
sp|Q9BYF1| CTKVTMDDFLTAHHEMGHIQYDMAYAAQPFLLRNGANEGFHEAVGEIMSLSAATPKHLKS
BAE53380.1 CTKVTMDDFLTAHHEMGHIQYDMAYAEQPFLLRNGANEGFHEAVGEIMSLSAATPNHLKN
sp|Q56NL1| CTKVTMDDFLTAHHEMGHIQYDMAYAAQPFLLRNGANEGFHEAVGEIMSLSAATPNHLKT
sp|Q56H28| CTKVTMDDFLTAHHEMGHIQYDMAYAVQPFLLRNGANEGFHEAVGEIMSLSAATPNHLKT
******::****************** **:****************:****.*** :**
tr|E2DHI2| MGLLSSDFLEDNETEINFLFKQALNIVGTLPLTYMLEKWRWMVFKGEIPKEEWMKKWWEM
sp|Q8R0I0| IGLLPSDFQEDSETEINFLLKQALTIVGTLPFTYMLEKWRWMVFRGEIPKEQWMKKWWEM
sp|Q5EGZ1| IGLLPSNFQEDNETEINFLLKQALTIVGTLPFTYMLEKWRWMVFQDKIPREQWTKKWWEM
tr|A0A220Q LGLLPPDFYEDSETEINFLLKQALTIVGTLPFTYMLEKWRWMVFKGEIPKEQWMQKWWEM
QHX39726.1 IGLLPYDFHEDNETEINFLLKQALTIVGTLPFTYMLEKWRWMVFKGEIPKEQWMQKWWEM
tr|A0A2J8K IGLLSPDFQEDNETEINFLLKQALTIVGTLPFTYMLEKWRWMVFKGEIPEDQWMKKWWEM
sp|Q9BYF1| IGLLSPDFQEDNETEINFLLKQALTIVGTLPFTYMLEKWRWMVFKGEIPKDQWMKKWWEM
BAE53380.1 IGLLPPDFSEDSETDINFLLKQALTIVGTLPFTYMLEKWRWMVFKGEIPKEQWMQKWWEM
sp|Q56NL1| IGLLSPAFSEDNETEINFLLKQALTIVGTLPFTYMLEKWRWMVFKGAIPKEQWMQKWWEM
sp|Q56H28| IGLLSPGFSEDSETEINFLLKQALTIVGTLPFTYMLEKWRWMVFKGEIPKEQWMQKWWEM
:***. * **.**:****:****.******:************.. ** ::* :*****
tr|E2DHI2| KRKIVGVVEPVPHDETYCDPASLFHVANDYSFIRYYTRTIFEFQFHEALCRIAKHDGPLH
sp|Q8R0I0| KREIVGVVEPLPHDETYCDPASLFHVSNDYSFIRYYTRTIYQFQFQEALCQAAKYNGSLH
sp|Q5EGZ1| KREIVGVVEPLPHDETYCDPASLFHVSNDYSFIRYYTRTIYQFQFQEALCQAAKHDGPLH
tr|A0A220Q KREIVGVVEPLPHDETYCDPACLFHVAEDYSFIRYYTRTIYQFQFHEALCRTAKHEGPLY
QHX39726.1 KREIVGVVEPMPHDETYCDPAALFHVANDYSFIRYYTRTIYQFQFQEALCQAAQHEGPLH
tr|A0A2J8K KREIVGVVEPVPHDETYCDPASLFHVSNDYSFIRYYTRTLYQFQFQEALCQAAKHEGPLH
sp|Q9BYF1| KREIVGVVEPVPHDETYCDPASLFHVSNDYSFIRYYTRTLYQFQFQEALCQAAKHEGPLH
BAE53380.1 KRDIVGVVEPLPHDETYCDPAALFHVANDYSFIRYYTRTIYQFQFQEALCQIAKHEGPLY
sp|Q56NL1| KRNIVGVVEPVPHDETYCDPASLFHVANDYSFIRYYTRTIYQFQFQEALCQIAKHEGPLH
sp|Q56H28| KREIVGVVEPVPHDETYCDPASLFHVANDYSFIRYYTRTIYQFQFQEALCRIAKHEGPLH
**.*******:**********.****::***********:::***:****. *:::*.*:
tr|E2DHI2| KCGISNSTDAGEKLHQMLSVGKSQPWTSVLKDFVGSKNMDVGPLLRYFEPLYTWLTEQNR
sp|Q8R0I0| KCDISNSTEAGQKLLKMLSLGNSEPWTKALENVVGARNMDVKPLLNYFQPLFDWLKEQNR
sp|Q5EGZ1| KCDISNSTEAGQKLLNMLSLGNSGPWTLALENVVGSRNMDVKPLLNYFQPLFVWLKEQNR
tr|A0A220Q KCDISNSTEAGQKLLQMLSLGKSEPWTLALENIVGVKTMDVKPLLSYFEPLLTWLKAQNG
QHX39726.1 KCDISNSTEAGQKLLNMLRLGRSEPWTLALENVVGAKNMDVRPLLNYFEPLFTWLKEQNR
tr|A0A2J8K KCDISNSTEAGQKLFNMLRLGKSEPWTLALENVVGAKNMNVRPLLNYFEPLFTWLKDQNK
sp|Q9BYF1| KCDISNSTEAGQKLFNMLRLGKSEPWTLALENVVGAKNMNVRPLLNYFEPLFTWLKDQNK
BAE53380.1 KCDISNSSEAGQKLHEMLSLGRSKPWTFALERVVGAKTMDVRPLLNYFEPLFTWLKEQNR
sp|Q56NL1| KCDISNSTEAGKKLLEMLSLGRSEPWTLALERVVGAKNMNVTPLLNYFEPLFTWLKEQNR
sp|Q56H28| KCDISNSSEAGKKLLQMLTLGKSKPWTLALEHVVGEKKMNVTPLLKYFEPLFTWLKEQNR
**.****::**:** :** :*.* *** .*: .** ..*:* *** **:** **. **
tr|E2DHI2| KSFVGWNTDWSPYADQSIKVWISLKSALGEKAYEWNNNEMYLFRSSVAYAMREYFLKTKN
sp|Q8R0I0| NSFVGWNTEWSPYADQSIKVRISLKSALGANAYEWTNNEMFLFRSSVAYAMRKYFSIIKN
sp|Q5EGZ1| NSTVGWSTDWSPYADQSIKVRISLKSALGKNAYEWTDNEMYLFRSSVAYAMREYFSREKN
tr|A0A220Q NSSVGWNTDWTPYADQSIKVRISLKSALGKEAYEWNDNEMYLFRSSIAYAMRNYFSSAKN
QHX39726.1 NSFVGWSTEWTPYADQSIKVRISLKTALGDQAYEWNDSEMYLFRSSVAYAMRKYFSEVKN
tr|A0A2J8K NSFVGWSTDWSPYADQSIKVRISLKSALGDKAYEWNDNEMYLFRSSVAYAMRQYFLKVKN
sp|Q9BYF1| NSFVGWSTDWSPYADQSIKVRISLKSALGDKAYEWNDNEMYLFRSSVAYAMRQYFLKVKN
BAE53380.1 NSFVGWNTDWSPYADQSIKVRISLKSALGEKAYEWNDNEMYFFQSSIAYAMREYFSKVKN
sp|Q56NL1| NSFVGWDTDWRPYSDQSIKVRISLKSALGEKAYEWNDNEMYLFRSSIAYAMREYFSKVKN
sp|Q56H28| NSFVGWNTDWRPYADQSIKVRISLKSALGDEAYEWNDNEMYLFRSSVAYAMREYFSKVKN
:* ***.*:* **:******.****:*** :****.:.**::*.**:*****:** **
tr|E2DHI2| QTILFGEEDVWVSNLKPRISFNFYVTSPRNLSDIIPRPEVEGAIRMSRSRINDAFRLDDN
sp|Q8R0I0| QTVPFLEEDVRVSDLKPRVSFYFFVTSPQNVSDVIPRSEVEDAIRMSRGRINDVFGLNDN
sp|Q5EGZ1| QTVPFGEADVWVSDLKPRVSFNFFVTSPKNVSDIIPRSEVEEAIRMSRGRINDIFGLNDN
tr|A0A220Q ETIPFGAEDVWVSDLKPRISFNFFVTSPANMSDIIPRSDVEKAISMSRSRINDAFRLDDN
QHX39726.1 QTILFGEEDVRVSDLKPRISFNFFVTAPNNVNDIIPRNEVEEAISMSRSRINDIFRLDDN
tr|A0A2J8K QMILFGEEDVRVANLKPRISFNFFVTAPKNVSDIIPRTEVEKAIRKSRSRINDAFRLNDN
sp|Q9BYF1| QMILFGEEDVRVANLKPRISFNFFVTAPKNVSDIIPRTEVEKAIRMSRSRINDAFRLNDN
BAE53380.1 QTIPFVGKDVRVSDLKPRISFNFIVTSPENMSDIIPRADVEEAIRKSRGRINDAFRLDDN
sp|Q56NL1| QTIPFVEDNVWVSDLKPRISFNFFVTFSNNVSDVIPRSEVEDAIRMSRSRINDAFRLDDN
sp|Q56H28| QTIPFVEDNVWVSNLKPRISFNFFVTASKNVSDVIPRSEVEEAIRMSRSRINDAFRLDDN
: : * :*.*::****:** * ** . *:.*:*** :** ** **.**** * *:**
tr|E2DHI2| SLEFLGIQPTLGPPYQPPVTIWLIVFGVVMAVVVVGIVVLIITGIRDRRKKDQARSEENP
sp|Q8R0I0| SLEFLGIHPTLEPPYQPPVTIWLIIFGVVMALVVVGIIILIVTGIKGRKKKNETKREENP
sp|Q5EGZ1| SLEFLGIYPTLKPPYEPPVTIWLIIFGVVMGTVVVGIVILIVTGIKGRKKKNETKREENP
tr|A0A220Q TLEFLGIQPTLGPPDEPPVTVWLIIFGVVMGLVVVGIVVLIFTGIRDRRKKKQASSEENP
QHX39726.1 SLEFVGIQPTLEPPYESPVPIWLVVFGVVMGMIVIGIVVLIFTGIKDRRKQKQAKREENP
tr|A0A2J8K SLEFLGIQPTLGPPNQPPVSIWLIVFGVVMGVIVVGIVILIFTGIRDRKKKNKARSEENP
sp|Q9BYF1| SLEFLGIQPTLGPPNQPPVSIWLIVFGVVMGVIVVGIVILIFTGIRDRKKKNKARSGENP
BAE53380.1 SLEFLGIQPTLEPPYQPPVTIWLIVFGVVMGVVVVGIFLLIFSGIRNRRKNNQARSEENP
sp|Q56NL1| SLEFLGIEPTLSPPYRPPVTIWLIVFGVVMGAIVVGIVLLIVSGIRNRRKNDQAGSEENP
sp|Q56H28| SLEFLGIQPTLSPPYQPPVTIWLIVFGVVMGVVVVGIVLLIVSGIRNRRKNNQARSEENP
:***:** *** ** .**.:**::*****. :*:**.:**.:**..*.*:.:: ***
tr|E2DHI2| YSSVDLSKGENNPGFQNGNDVQTSF
sp|Q8R0I0| YDSMDIGKGESNAGFQNSDDAQTSF
sp|Q5EGZ1| YDSMDIGKGESNAGFQNSDDAQTSF
tr|A0A220Q YGSMDLSKGESNSGFQNGDDIQTSF
QHX39726.1 YGFVDMSKGENNSGFQNSDDIQTSF
tr|A0A2J8K YASVDTSKGENNPGFQNTDDVQTSF
sp|Q9BYF1| YASIDISKGENNPGFQNTDDVQTSF
BAE53380.1 YASVDLSKGENNPGFQNVDDVQTSF
sp|Q56NL1| YASVDLNKGENNPGFQHADDVQTSF
sp|Q56H28| YASVDLSKGENNPGFQHADDVQTSF
* :* .***.*.***: :* ****
3.Went to a phylogeny tree builder and sequence aligner.
- Copied the ACE-2 sequence and pasted them in the one-click.
- Copied the generated Phylogenetic tree and pasted the results below.
Figure 1: Phylogenetic tree results for the copied ACE-2 Sequences.
4. Went to the article titled Investigation of the genetic variation in ACE2 on the structural recognition by the novel coronavirus (SARS-CoV-2) and copied the 8 necessary human amino acids and pasted them below.
- HIS378
- SER19
- GLY211
- ASP 206
- ARG 219
- LYS 341
- LLE 468
- SER 547
5. Located the 8 necessary human ACE-2 amino acids for structure and function in the sequence alignment and highlighted them across sequences. Created a table making the sequence alignments clearer. Posted sequences and interpretations below.
![]()
![]()
![]()
Figure 2: Sequence alignment for 8 necessary human amino acids in all the sequences.
- Fully conserved residues: His378, Asp 206, Arg 219, Lys 341, LLE 468, and Ser 547.
- Strongly Conserved residue: Ser 19
- No conservation: Gly 211
6. Went to NCBI 3d-protein shapes iCn3D and created two structures.
- The first is highlighting the necessary human ACE2 amino residues.
- The second is the 5 human amino acid residues that the 2019-nCoV binds to.
- Pasted images below:
![]()
![]()
Figure 3: Protein structure showing 8 necessary amino acids for function and structure in human ACE2s (left). Protein structure showing 5 ACE2 amino acid residues that the novel 2019-nCoV binds to.
- None of the necessary amino acid residues for the human ACE2 structure and function is a residue where the 2019-nCoV spike protein binds to.
Powerpoint presentation
- Link to the PowerPoint presentation that will be presented in class.
Scientific Conclusion
- The goal of this lab is to compare the ACE2 sequences of different species, note fully conserved amino acids necessary for structure and function, and compare them to the 5 amino acid sequences that the novel 2019-nCoV spike protein binds to. This work finds through a phylogenetic tree and sequence alignment that the ACE-2 amino acid residues that are more compatible with the novel 2019-nCoV do not have to be more closely related in the tree. The 8 amino acid residues that are necessary for human ACE2 structure and function are not the same as the 5 residues that the 2019-nCoV spike protein binds to. The sequence alignment shows that across the 11 species tested only 6 of the 8 necessary amino acids were fully conserved, 1 were strongly conserved, and 1 was not conserved. Mutations, therefore, to the 5 amino acid sequences will not influence the ACE-2 structure or function, in humans. Future experiments will focus on mutations to the 5 key amino acid sequences for the 2019-nCoV spike protein attempt to lower the binding efficiency for the spike protein.
Acknowledgments
- Referenced and copied OpenWebWare syntax from the BIOL368/F20 week 1 page.
- Referenced and copied methods from the BIOL368/F20 week 6 page.
- Referenced MediaWiki for image formatting syntax.
- Worked with my partner Macie Duran on the presentation and research.
- Worked with Dr. Dahlquist, in class, on our research and protein databases.
- Referenced the article "Receptor Recognition by the Novel Coronavirus from Wuhan: an Analysis Based on Decade-Long Structural Studies of SARS Coronavirus" for information regarding 3-dimensional proteins.
- Referenced the article Investigation of the genetic variation in ACE2 on the structural recognition by the novel coronavirus (SARS-CoV-2) for the 8 necessary human ACE2 amino acid residues.
- Used UniProtto find ACE-2 protein sequences.
- Used NCBI to find ACE-2 Protein sequences and study protein structures.
- Used Phylogeny.fr to make a phylogenetic tree and sequence alignment.
References
- OpenWetWare. (2020). BIOL368/F20:Week 1. Retrieved September 22, 2020, from https://openwetware.org/wiki/BIOL368/F20:Week_1
- OpenWetWare. (2020). BIOL368/F20:Week 6. Retrieved October 14, 2020, from https://openwetware.org/wiki/BIOL368/F20:Week_6
- Wan, Y., et al. (2020). Receptor Recognition by the Novel Coronavirus from Wuhan: an Analysis Based on Decade-Long Structural Studies of SARS Coronavirus. Journal of Virology, 54 (7), retrieved from https://doi.org/10.1128/JVI.00127-20.
- Phylogeny.fr (2020). Methodes et Algorithmes pour la Bio-informatique LIRM, retrieved from http://www.phylogeny.fr/simple_phylogeny.cgi?workflow_id=f6c1c87f6ac1d379903543866e4da087&tab_index=3%7C.
- NCBI Structure (2020). PDB ID 1R42: Native Human Angiotensin-Converting Enzyme-2, Retrieved from https://www.ncbi.nlm.nih.gov/Structure/icn3d/full.html?&mmdbid=26160&bu=1&showanno=1&source=full-feature.
- NCBI Database (2020). Angiotensin I converting enzyme 2 [Mustela putorius furo], https://www.ncbi.nlm.nih.gov/protein/BAE53380.1?report=fasta%7C.
- NCBI Database (2020). Angiotensin-converting enzyme 2 [Oryctolagus cuniculus], https://www.ncbi.nlm.nih.gov/protein/XP_002719891.1.
- UniProt (2020). UniProtKB - K7GLM4 (K7GLM4_PIG), retrieved from https://www.uniprot.org/uniprot/K7GLM4.
- UniProt (2020). UniProtKB - Q9BYF1 (ACE2_HUMAN), retrieved from https://www.uniprot.org/uniprot/Q56NL1.
- UniProt (2020). UniProtKB - Q5EGZ1 (ACE2_RAT), retrieved from https://www.uniprot.org/uniprot/Q5EGZ1.
- UniProt (2020). UniProtKB - Q56NL1 (ACE2_PAGLA), retrieved from https://www.uniprot.org/uniprot/Q56NL1.
- UniProt (2020). UniProtKB - A0A2J8KU96 (A0A2J8KU96_PANTR), retrieved fromhttps://www.uniprot.org/uniprot/A0A2J8KU96.
- UniProt (2020). UniProtKB - B6ZGN7 (B6ZGN7_RHIFE), retrieved from https://www.uniprot.org/uniprot/B6ZGN7.
- UniProt (2020). UniProtKB - Q8R0I0 (ACE2_MOUSE), retrieved from hhttps://www.uniprot.org/uniprot/Q8R0I0.
- UniProt (2020). UniProtKB - Q56H28 (ACE2_FELCA), retrieved from https://www.uniprot.org/uniprot/Q56H28.
- UniProt (2020). UniProtKB - F6WXR7 (F6WXR7_MONDO), retrieved from https://www.uniprot.org/uniprot/F6WXR7.
- Guo, X., Chen, Z., Xia, Y. et al. Investigation of the genetic variation in ACE2 on the structural recognition by the novel coronavirus (SARS-CoV-2). J Transl Med 18, 321 (2020). https://doi.org/10.1186/s12967-020-02486-7
"Except for what is noted above, this individual journal entry was completed by me and not copied from another source" Nathan R. Beshai (talk) 20:36, 14 October 2020 (PDT)