[Home] [Server] [Queue] [About] [Remove] [Statistics]

I-TASSER results for job id S34200

  Submitted Sequence

>your_protein
MSSVLTDFEPKFLSSDDLLNLYAEYCNGHYCDFKYKQGRLSDGNIQSHLYFKKDHFIHDF
NGIETFKRFIAVKAYDVDNITSLALSNLLCEKFNLDILLTVEAMDKEKALFFIRERKKRS
KNISYQNIEDLEQMVSTDRAQIQKVSLSIMVFASSKKELDEKSIIVYNTLKKEGFSAVLE
SINMRPIFFSFFPERNFLNSRLRPQTSQNIASLIMFEKYQEGFKENSWGDCPVSVFKNQN
GSAHFFNFQAKQGKDKNDNVVGHTMIIGSTGSGKSTFISFLIANLLTKYHMSVVALDRMN
GLEIMTDFFEGQYNTANTDGGFYINPFSLKDTEENRQFLANWIKFMLNIDSDNQQDNKAS
QSIDKVIRDTYNYMGEQKNQINLLEIAKNLGSSEQDFNEILKSQGEKIYFKNFQDCLDFS
KSPLSVINMDAFANDKKLMGLIAMYLFHKLFFEAKEHNKPFFLFIDETKDYIMHPIMFAY
IANALAQARKINGTLCMAFQKISQVKELGIDKAKSLIGNLSQVIIYPTKDTDELIECGVP
LSDSEINFLHNTDMRVRQVLVKNIVTNASAFIEIDLKKICKNYFIFLIAMLVIEKSSMIL
KKQTKKLIRKSI

  Predicted Secondary Structure

Sequence                  20                  40                  60                  80                 100                 120                 140                 160                 180                 200                 220                 240                 260                 280                 300                 320                 340                 360                 380                 400                 420                 440                 460                 480                 500                 520                 540                 560                 580                 600
                   |                   |                   |                   |                   |                   |                   |                   |                   |                   |                   |                   |                   |                   |                   |                   |                   |                   |                   |                   |                   |                   |                   |                   |                   |                   |                   |                   |                   |                   |             
MSSVLTDFEPKFLSSDDLLNLYAEYCNGHYCDFKYKQGRLSDGNIQSHLYFKKDHFIHDFNGIETFKRFIAVKAYDVDNITSLALSNLLCEKFNLDILLTVEAMDKEKALFFIRERKKRSKNISYQNIEDLEQMVSTDRAQIQKVSLSIMVFASSKKELDEKSIIVYNTLKKEGFSAVLESINMRPIFFSFFPERNFLNSRLRPQTSQNIASLIMFEKYQEGFKENSWGDCPVSVFKNQNGSAHFFNFQAKQGKDKNDNVVGHTMIIGSTGSGKSTFISFLIANLLTKYHMSVVALDRMNGLEIMTDFFEGQYNTANTDGGFYINPFSLKDTEENRQFLANWIKFMLNIDSDNQQDNKASQSIDKVIRDTYNYMGEQKNQINLLEIAKNLGSSEQDFNEILKSQGEKIYFKNFQDCLDFSKSPLSVINMDAFANDKKLMGLIAMYLFHKLFFEAKEHNKPFFLFIDETKDYIMHPIMFAYIANALAQARKINGTLCMAFQKISQVKELGIDKAKSLIGNLSQVIIYPTKDTDELIECGVPLSDSEINFLHNTDMRVRQVLVKNIVTNASAFIEIDLKKICKNYFIFLIAMLVIEKSSMILKKQTKKLIRKSI
PredictionCCCHHCCCCCCCCCHHHHHHHHHHHHCCCCCCCCCCCCCHHHHCCCCCSSSCCCSSSSSCCCCCSSSSSSSSCCCCCCCCCCHHHHHHHCCCCCSSSSSSSSSCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCSSSSSSSSSSSSCCCHHHHHHHHHHHHHHHHHCCCSSSSCCCHHHHHHHHHCCCCHHHHHHHCCCCHHHHHHHCCCCCCCCCCCCCCCCCCCCSSSCCCCSSSSSSCCCCCCCCCCCCCCCCSSSSSCCCCCCHHHHHHHHHHHHHHHCCCSSSSSSCCCCHHHHHHHCCCSSSSSCCCCCCSSCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHCHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCSSSSSCHHHHCCHHHHHHHHHHHHHHHHHHHHCCCCCSSSSSHHHHHHHCCHHHHHHHHHHHHHHHHCCCSSSSSSCCHHHHHHCCCHHHHHHHHHCCSSSSCCCCCCHHHHHHHCCCCHHHHHHHHCCCCCCCSSSSSSCCCCCSSSSSSSCCCCCHHHHHHHCCCCHHHHHHHHHHHCCCCCCCCCC
Conf.Score902123258787887999999999957988886898675467507770797587799962898789999986326744678078998846997689999999868999999999999999988999999999997279903899999999986999999999999999999789889975201789999878893331133402447778886763324567888988986532620562412454320476433455552528998899998999999999999997299499995997579999734877999538997333667477965788899999999851278876788999999999999985334433533289998740243799999997387652236887756666683799980453158778999999999999998644799689998228888539999999999999998759789999587999987676899999983987898359611388997459899999998448977708999956999469999520004517788818963899999999873987000479

  Top 5 Models predicted by I-TASSER

Download Model 1 Download Model 2 Download Model 3 Download Model 4 Download Model 5
C-score=-1.27
C-score=-1.56
C-score=-2.27
C-score=-3.53
C-score=-3.62
 
Estimated accuracy of Model1: 0.56±0.15 (TM-score)    10.8±4.6Å (RMSD)    (Read more about C-score of generated models)

  Top 10 templates used by I-TASSER

RankPDB
Hit
Iden1Iden2Cov.Norm.
Z-score
Download
Align.
                   20                  40                  60                  80                 100                 120                 140                 160                 180                 200                 220                 240                 260                 280                 300                 320                 340                 360                 380                 400                 420                 440                 460                 480                 500                 520                 540                 560                 580                 600
                   |                   |                   |                   |                   |                   |                   |                   |                   |                   |                   |                   |                   |                   |                   |                   |                   |                   |                   |                   |                   |                   |                   |                   |                   |                   |                   |                   |                   |                   |            
Sec.Str
Seq
CCCHHCCCCCCCCCHHHHHHHHHHHHCCCCCCCCCCCCCHHHHCCCCCSSSCCCSSSSSCCCCCSSSSSSSSCCCCCCCCCCHHHHHHHCCCCCSSSSSSSSSCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCSSSSSSSSSSSSCCCHHHHHHHHHHHHHHHHHCCCSSSSCCCHHHHHHHHHCCCCHHHHHHHCCCCHHHHHHHCCCCCCCCCCCCCCCCCCCCSSSCCCCSSSSSSCCCCCCCCCCCCCCCCSSSSSCCCCCCHHHHHHHHHHHHHHHCCCSSSSSSCCCCHHHHHHHCCCSSSSSCCCCCCSSCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHCHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCSSSSSCHHHHCCHHHHHHHHHHHHHHHHHHHHCCCCCSSSSSHHHHHHHCCHHHHHHHHHHHHHHHHCCCSSSSSSCCHHHHHHCCCHHHHHHHHHCCSSSSCCCCCCHHHHHHHCCCCHHHHHHHHCCCCCCCSSSSSSCCCCCSSSSSSSCCCCCHHHHHHHCCCCHHHHHHHHHHHCCCCCCCCCC
MSSVLTDFEPKFLSSDDLLNLYAEYCNGHYCDFKYKQGRLSDGNIQSHLYFKKDHFIHDFNGIETFKRFIAVKAYDVDNITSLALSNLLCEKFNLDILLTVEAMDKEKALFFIRERKKRSKNISYQNIEDLEQMVSTDRAQIQKVSLSIMVFASSKKELDEKSIIVYNTLKKEGFSAVLESINMRPIFFSFFPERNFLNSRLRPQTSQNIASLIMFEKYQEGFKENSWGDCPVSVFKNQNGSAHFFNFQAKQGKDKNDNVVGHTMIIGSTGSGKSTFISFLIANLLTKYHMSVVALDRMNGLEIMTDFFEGQYNTANTDGGFYINPFSLKDTEENRQFLANWIKFMLNIDSDNQQDNKASQSIDKVIRDTYNYMGEQKNQINLLEIAKNLGSSEQDFNEILKSQGEKIYFKNFQDCLDFSKSPLSVINMDAFANDKKLMGLIAMYLFHKLFFEAKEHNKPFFLFIDETKDYIMHPIMFAYIANALAQARKINGTLCMAFQKISQVKELGIDKAKSLIGNLSQVIIYPTKDTDELIECGVPLSDSEINFLHNTDMRVRQVLVKNIVTNASAFIEIDLKKICKNYFIFLIAMLVIEKSSMILKKQTKKLIRKSI
12gjkA 0.11 0.18 0.80 1.43Download ------PLGSRYEDAYQYQNIFGPLVSQTQDNITVRWDLGLN--KKRIAYFTLPKTDSDMRLMQGDEICLRYKGDLAPLWKGIGHVIKVPDNYGDEIAIELR---------------------------------SSVGAPVEVTHNFQVDFVWKSTSFDRMQSALKTFAVDETSV-----------------------------SGYIYHKLLGHEVEDVIIKCQLPKRFTA-----------------QGLPDLNHLQRPLSLIQGPPGTGKTVTSATIVYHLARQGNGPVLVCAPSNIVDQLTEKIHQTGLKVVRLCAKSREAIDSPVSFLALHNQIRLQKLQQLKDESSADEKRYRALKRTAERELLMKAAKAGLSQSLFERLVVLGIRPPALSAFPSVTAADRVKKGFDFQWPQPDKPMFFYVTQGQSGTSYLNRTEAANVEKITTKLLKAGAKPDQIGI-----ITPYEGQRSYLVQYMQFSGSLH---------TKLYQEVEIASVDAFQGREKDFIILSCVRANEHQGIGFLNDPRRLNVALT---RARYGVI--------IVGN---------PKALSKQPLWNHLLNYYKEQKVGPLNNLRE
21e9rA 0.13 0.13 0.59 3.48Download ----------------------------------------------------------------------------------------------------------------------------------------------------------------------------VG-----------QGEFGGAPF-KRFLRGTRIVSGGKLKRMTR-------EK-----AKQVTV----AGVPMPRD-----------AEPRHLLVNGATGTGKSVLLRELAYTGL-LRGDRMVIVDPNGDMLSKFGRDKDIILNPYDQRTKGWSFFNEIRNDYDWQRYALS---VVPRGKTEEWASYGRLLLRETAKKLALI--GTPSMRELFHWTTIATFDLTSARFVLSDKLPEHVTMPD-GDFSDPNGGNLFITWRE--DMGPALRPLISAWVDVVCTSILEPKRRLWLFIDELASLEKLAS----LADALTKGRKAGLRVVAGLQSTSQLDDYGVKEAQTLRASFRSLVVLGGSDPKTDMSLSLGVMP---AEIANLPDLTA--YVGFAGNRPIAKVPLEIKQFANRQPAF---------------------VEGT-
31ih7A 0.09 0.17 0.97 1.41Download MKEHCPESQATKYNMRDASQWIKRMEDIGLEALGMDDFKLAYLSDTYNYEIKYDDAITHYDSIDDRFYVFDLLNSPYGNVEEWSIEIAAKLQEQIDKIIYMPFDNEKELLMEYLNFWQQYNRIKNIFGESTAKRLSPHRKTRVKVREIITLFGGPISKLREDVYRVLQIDAKRQF----INLSLDMGYYAKIQIQSVFSPIKTWDAIIFNSLKEQIPQGRSHPVQPYPGAPETIAGTFKVAPLHDYINAVAERPSDVYSCSPNGMMYYKDRDGVVPTEITKVFNQRKEHKGYMLAAQ--RNGEIIKEALHNPN---LSVDEPLDVDYRFDFSDEIKEKAKSLNEMLFRAQRTEVAGMTAQINRKLLINSLYGALGNVWDLRNATAITTFGQMALQWIERKVNEYLNEVCGTEGEAFVLYGDTDSIYVSADKIIDKVGEMEPAIDRGFREMCEYMNNKQHLMFMDREAIAGPPLGKAVQKALKECIRRMLQ---------EGEESLQEYFKEFEKEFYISIASVSSANNIAKYDVGGFPGPKCPFHIRGILQVVEGEKVYVLGNPFGDKCIAWPSGTEITDLIKDDVLHWMDYIKPLEGFTSAAKLDYEKKAF
41e9rA 0.13 0.13 0.60 1.56Download ----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------VGQGEFGGAPFKRFLRGTRIVSGGKLQVTVAGVPMPRDAEPRHLLVNGATGTGKSVLLRELAYTGLL-RGDRMVIVDPNGDMLSKFGRDKDIILNPYDQRTKGWSFFNEIRNDYDWQRYALSVV----PRGKTDEAEEWASYGRLLLRETAKKLALIGTPSMRELFHWTTIATFDDLRGFLEGTLAESLFTMPDGDFSIDPNGGNLFREDMGPALRPLISAWVDVVCTSILSLPEEPKRRLWLFIDELASLEKLAS----LADALTKGRKAGLRVVAGLQSTSQLDDVGVKEAQTLRASFRSLVVLGGSRTDPKTNEDM---SLSL----------GEHEVERDRALERVRERVEIANLPDLTAYVGFAGNRPIAKVPLEIKQFANRQPAFV
51iexA 0.09 0.17 0.89 0.74Download VADLLGR-----MTLAEKIGQMT-QIERLV----ATPDVLRDNFIGSLLSGGGSVPRKGATAKEWQDMVDGFQKACMSTRLGIPMIYGIDAVHGQNNVYGATIFPHNVGLGATRD-----PYLVKRIGEATALEVRATGIQYAFAPCIAVCRDPRWGRCYESYSEDRRIVQSM--------TELIPGLQGDVPKDFT----SGMPFVAGKNKVAACAKHFVGDGGTVDGINENNTIINREG---LMNIHMPAYKNAMDKGVSTVMISYSSWNGVMHANQDLVTGYLDTLKFKGFVISDWEGIDRITTPAGSDYASILAGLDMIMVP-------NKYQQFISILTGHVN--GGVIPMSRIDDAVTRILRVKFTMGLFENPYADPAMAEQLGKQEHRDLAREAARKSLVLLKNGKTPLLPLPKKAPKILVAGSHADN------LGYQCGGWTIEWQGDTGRTT-----------VGTTILEAVKAAV---DPSTVVVFAENPDAEFVKSGGF-------SYAIVAVGEHPYTETKGDNLNLTIPEPGLSTVQAVCGGVRCATVLISGRASDALVAAWLPGSEGVTDALFGDFGTGRLPRTWFKSVDQNVGDAHY
61e9rA 0.14 0.13 0.60 1.84Download VGQ-------------------------------------------------------GEFGGAPFKRFLRGTRI--------------------------------------------------VSGGKLKRMTREKAKQVTVAGVPM-----PRDAEPR-----------------------------------------------------------------------------------------------------HLLVNGATGTGKSVLLRELAYTGLL-RGDRMVIVDPNGDMLSKFGRDKDIILNPYDQRTKGWSFFNEIRNDYDWQRYALSVVPRGKTDEAEEWASYGRLLLRETAKKLALAESLFAGSNEASKALTSARFVLSDKLPEHVTMPDGDFSIRSWLEDPNGGNLFITWREDMGPALRPLISAWVDVVCTSILSLPEEPKRRLWLFIDELASLEKLA----SLADALTKGRKAGLRVVAGLQSTSQLDDVGVKEAQTLRASFRSLVVLGGSRTDPKTNEDMVVMPAEIA-----NLPDLTAYVGFAGNRPIAKVPLEIKQFAN----------------------RQPAFVEGT
71e9rA 0.15 0.13 0.58 0.95Download ----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------VGQGEFGGAPFKRFLRGTRIVSGGKLKRMTREKAKQRDAEPRHLLVNGATGTGKSVLLRELAYTGLLR-GDRMVIVDPNGDMLSKFGRDKDIILNPYDQRTKGWSFFNEIRNDYDWQRYALSVV----PRGKTDEAEEWASYGRLLLRETAKKLALIGTPSMRELFHWTTIATFDDLRGFLEGTLAESLFAGSMPDGDFSGNLFITWREDMGPALRPLISAWVDVVCTSILSLPEEPKRRLWLFIDE----LASLEKLASLADALTKGRKAGLRVVAGLQSTSQLDDVGVKEAQTLRASFRSLVVLGGSRTDPKTNEDMVVMPAEIANL-----PDLTAYVGFAGNRPIAKVPLEIKQFANRQPAFVEGT----------------------
81e9rA 0.14 0.13 0.58 3.93Download --------------------------------------------------------------------------------------------------------------------------------------VGQ-------------------------------------------------GEFGGAPFKRFL--RGTIVSGGKLKRMT------REK------AKQVTV----AGVPM------PR-----DAEPRHLLVNGATGTGKSVLLRELAYTGLLR-GDRMVIVDPNGDMLSKFGRDKDIILNPYDQRTKGWSFFNEIRNDYDWQRYALS---VVPRGKTD-EAEEWRLLLRETAKKLALI--GTPSMRELFHWTTIATF--DDLRGFLEGTLAESLFAGSLPEHV--TMPDGDFSIRSWLEDPPALRPLISAVCTSILSLPEEPKRRLWLFIDELASLEKLASL----ADALTKGRKAGLRVVAGLQSTSQLDDVKE--AQTLRASFRSLVVLGSDPKTNEDMLSLGEHEVEPAEIANL-P-DLTAYVGFAGNRPIAKVPLEIKQFANR-------------------------QPAFV
92iutA 0.10 0.13 0.65 1.90Download ----------------------------------------------------------------------------------------------------------LPPLSLLDPAEVKQKSYSPESLEAMSRLLEIKLKEFGVEVSVDSVHPGTRFEIQPAAGVKVSRISNLAKDLARSLAVISVRVVEVIPGKTTVGIEIEDRQMVRFSEVLSSPEYDEHKST-----VPLALGHDIGGRPIITDLAK----------MPHLLVAGTTGSGKSVGVNAMLLSILTPSEARLIMIDPK--MLELSIYEGIPHLLCPVVTDMKEAANALRWSVAEMERRYRLMAAMVRNLAGFNRKVKDAEEAGTPLTDPLFRRESPDDEPPQLSTLPTIVVVVDEFADMMMIVGKKVEELIARIAQKARAAGIHLILATQRPSVDVITGLIKANIPTRIAFQVSSKIDSRTILDQGGAEQLLGHGDMLYLPPGTGLPIRVHGAFVSDDEVHRVVEAWKLRGAPDYIEDILA------------------------------------------------------------------------------------------
101e9rA 0.15 0.13 0.44 2.90Download ----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------HLLVNGATGTGKSVLLRELAYTGLLR-GDRMVIVDPNGDMLSKFGRDKDIILNPYDQRTKGWSFFNEIRNDYDWQRYALSVVPRGKTDEAEEWASYGRLLLRETAKKLALIGT--PSMRELFHWTTIA--TFDDLRGFLEGTLAESLFAGSNELPEHVTMPDGDFSIRSWLEDPNLISAWVDVVCTSILSLPEEPKRRLWLFIDELASLEKLASL----ADALTKGRKAGLRVVAGLQSTSQLDDVKE--AQTLRASFRSLVVLRTDPKTNEDMLSLG------------------------------------------------------------------------
(a)All the residues are colored in black; however, those residues in template which are identical to the residue in the query sequence are highlighted in color. Coloring scheme is based on the property of amino acids, where polar are brightly coloured while non-polar residues are colored in dark shade. (more about the colors used)
(b)Rank of templates represents the top ten threading templates used by I-TASSER.
(c)Ident1 is the percentage sequence identity of the templates in the threading aligned region with the query sequence.
(d)Ident2 is the percentage sequence identity of the whole template chains with query sequence.
(e)Cov. represents the coverage of the threading alignment and is equal to the number of aligned residues divided by the length of query protein.
(f)Norm. Z-score is the normalized Z-score of the threading alignments. Alignment with a Normalized Z-score >1 mean a good alignment and vice versa.
(g)Download Align. provides the 3D structure of the aligned regions of the threading templates.
(h)The top 10 alignments reported above (in order of their ranking) are from the following threading programs:
       1: PROSPECT2   2: HHSEARCH   3: PROSPECT2   4: SP3   5: MUSTER   6: PROSPECT2   7: PPA-I   8: HHSEARCH I   9: FUGUE   10: HHSEARCH II   

  10 proteins in PDB which are structurally closest to the first I-TASSER model (identified by TM-align)

RankTM-scoreRMSDaIDENaCov.PDB
Hit
10.8665 1.89 0.10 0.901iexA
Model1
 
20.7434 3.24 0.10 0.823bmxA
Model1
 
30.4481 3.10 0.07 0.491tr9A
Model1
 
40.4317 7.17 0.06 0.641htyA
Model1
 
50.4194 6.57 0.06 0.591tg7A
Model1
 
60.4018 7.17 0.04 0.603ecqA
Model1
 
70.3854 7.18 0.03 0.581jqoA
Model1
 
80.3829 5.91 0.07 0.503bg5A
Model1
 
90.3821 5.92 0.04 0.503bg3A
Model1
 
100.3820 4.83 0.05 0.471ituA
Model1
 
Structural alignment using TM-align
 
DYVLYKDATKPVEDRVADLLG-RMTLAEKIGQMT-QIERL----VATPDVLRDNFIGSLLSGGGSVPRKGATAKEWQDMVDGFQKACMSTRLGIPMIYGIDAVHGQNNVYGATIFPHNVGLGATR-----DPYLVKRIGEATALEVRATGIQYAFAPCIAVCRDPRWGRCYESYSEDRRIVQSMT--------ELIPGLQGDVPKDF--T--SGMPFVAGKNKVAACAKHFVGDGGTVDGINENNTIINREG---LMNIHMPAYKNAMDKGVSTVMISYSSWNGVKMHANQDLVTGYLKDTLKFKGFVISDWEGIDRITTPAGSDYSYSVKASILAGLDMIMVP----N--KYQQFISILTGHVNG--GVIPMSRIDDAVTRILRVKFTMGLFENPYADPAMAEQLGKQEHRDLAREAARKSLVLLKNGKTSTDAPLLPLPKKAPKILVAGSHADN------LGYQCGGWTIEWQGDTGRTT-----------VGTTILEAVKAAV---DPSTVVVFAENPDAEFVKSGG-------FSYAIVAVGEHPYTETKGDNLNLTIPEPGLSTVQAVCGGVRCATVLISGRPVVVQPLLAASDALVAAWLPGSEGQGVTDALFGDFGFTG-RLPRTWFKSVDQLPMNVGDAHYDPLFRLGYGLTTNATK-- 
-----------MSSVLTDFEPKFLSSDDLLNLYAEYCNGHYCDFKYKQGRLSDGNIQSHLYFKKDHFIHDFNGIETFKRFIAVKAYDVDNITSLALSNLLCEKFNLDILLTVEAMDKEKALFFIRERKKRSKNISYQNIEDLEQMVSTDRAQIQKVSLSIMVFASSKKELDEKSIIVYNTLKKEGFSAVLESINMRPIFFSFFPERNFLNSRLRPQTSQNIASLIMFEKYQEGFKENSWGDCPVSVFKNQNGSAHFFNFQAKQGKDKNDNVVGHTMIIGSTGSGKST-FISFLIANLL-TKYHMSVVALDRMNGLEIMTDFFEGQY-N-T--ANTDGGFYINPFSLKDTEENRQFLANWIKFMLNIDSDNQQDNKASQSIDKVIRDTYNYMGEQKNQINLLEIAKNLGSSEQDFNEILKSQGEKIYFKNF---Q-DCLDFSKSPLSVINMDAFANDKKLMGLIAMYLFHKLFFEAKEHNKPFFLFIDETKDYIMHPIMFAYIANALAQARKINGTLCMAFQKISQVKELGIDKAKSLIGNLSQVIIYPTKDTDELIECGVPLSDSEINFLHNTDMRVRQVLVKNIVTNA-S---A-----FIEIDLKKICKN-YF-IFLI-AMLVIEKSSMILKKQTKKLIR-----K-----------------SI 
 
ASASKRAIDANQIV-N-RMSLDEKLGQML-MPDFRNWQKEGESSPQALTKMNDEVASLVKKYQFGGIILF-AENV---KTTKQTVQLTDDYQKAS--P--KIPLMLSIDQEGGIVTRLGEGTNFPGNMALGAAR-----SRINAYQTGSIIGKELSALGINTDFSPVVDINNNPDNPVIGVRSFSSNRELTSRLG-------LYTMKGLQR--------------------QDIASALKHFPGHGDT-DVDSHYGLPLVSHGQE---RLREVELYPFQKAIDAGADMVMTAHVQFPAFDDTTYKSKLDGSDILVPATLSKKVMTGLLRQEMGFNGVIVTDALNMKAIADHFGQEEAVVMAVKAGVDIALMPASVTSLKEEQKFARVIQALKEAVK--NGDIPEQQINNSVERIISLKIKRGMYPARNSDSTKEKIAKAKKIVGSKQHLKAEKKLAEKAVTVLKNE-QHTLPFKPKKGSRILIVAP--YEEQTA-----------------------------------SIEQTIHDLIKRKKIKPVSLSKMNFASQVFKTEHEKQVK-E-------AD--YIITGSYV------VKNDPVVNDGVIDDTISDSSKWATVFPRAVMKAALQHNKPFVLMSLRNPYDAANFEEAKALIAVYGFKGYANGRYLQPNIPAGVMAIFGQAKPKG-TLPVDIPSVTKPGNTLYPLGYGLNIKTGRPL---- 
------MSSVLTDFEPKFLSSDDLLNLYAEYCNGHYCD---------F---K-YKQGRLSDGNIQSHLYFKKDHFIHDFNGIETFKRFIAVKAYDVDNITSLALSNLLCEKFN-LDILLTVEAMDKEKALFFIRERKKRSKNISYQNIEDLEQMVSTDRAQIQKVSLSIMVFASSKK-ELDEKSIIVYNTLKKEGFSAVLESINMRPIFFSFFPERNFLNSRLRPQTSQNIASLIMFEKYQEGFKENSWGDC---PVSVFKNQNGSAHFFNFQAKQGKDKNDNVVGHTMIIGSTGS--GKS------------T-F--ISFLIANL-LTKYHMSVVALDRMNGLEIMTDFFEGQY--NT-ANTDGGFYINPFSLKD--TEENRQFLANWIKFMLNIDSDNQQDNKASQSIDKVIRDTYNYMGEQKNQ---I--NL-LEIAKNLGSSEQDFNEILKSQGEKIYFKNFQDCLDFSKSPLSVINMDAFANDKKLMGLIAMYLFHKLFFEAKEHNKPFFLFIDETKDYIMHPIMFAYIANALAQARKINGTLCMAF--------QKISQVKELGIDKAKSLIGNLSQVIIYPTKDTDELIECGV--------------P-LSDSEINFLHNTDMR-VRQVLVKNIVTNAS--A-----FIEIDLK---------KIC-KN-YFIF-LIAMLVIEKSSMILKKQ-TKKLI----------------RKSI 
 
----------------SL--GPL-WLDVAGYELSAEDREILQHPTVGGVILF-GRNY-H-DN-QQLLALNKAIRQAAK----R-PILIGVDQEGGRVQRFREGFSRIPPAQYYARAE----NGVELAEQGGWL-AAELIAHDVDLSFAPVLDGF-AC-KAIGNRAFGEDVQTVLKHS-------SAFLRG-KA--------------------VG-ATTGKHFPGHGAVIADSHLETPYDERE------TIAQD-AIFRAQIEAGVLDA--PAHVVYPHYDAQPASGSSYWLKQVLREELGFKGIVFSDDL--SEGAAVGGPVERSHQALVAGCDILI-C-NK-R-EAAVEVLDNLP--I------------E--VPQAEALL--KK---Q--QF-SYSELKRLERWQQASANQR-LIE--Q-----------------------------------------------------------------------------------------------------------------------------------------------------------------------F--S----------------------------------- 
MSSVLTDFEPKFLSSDDLLNLYAEYCNGHYCDFKY-KQGRLSDGNIQSHLYFKKDHFIHDFNGIETFKRFIAVKAYDVDNITSLALSNLLCEKFN--LDILLTVEAMDKEKALFFIRERKKRSKNISYQNIEDLEQMVSTDRAQIQKVSLSIMVFASSKKELDEKSIIVYNTLKKEGFSAVLESINMRPIFFSFFPERNFLNSRLRPQTSQNIASLIMFEKYQEGFKENSWGD--CPVSVFKNQNGSAHFFNFQAKQGKDKN-DNVVGHTMIIGSTGS--GKSTF--ISFLIANLLT-KYHMSVVALDRMNGLEIMTDFFEG-QY-NTANTDGGFYINPFSLKDTEENRQFLANWIKFMLNIDSDNQQDNKASQSIDKVIRDTYNYMGEQKNQINLLEIAKNL---GSSEQDFNEILKSQGEKIYFKNFQDCLDFSKSPLSVINMDAFANDKKLMGLIAMYLFHKLFFEAKEHNKPFFLFIDETKDYIMHPIMFAYIANALAQARKINGTLCMAFQKISQVKELGIDKAKSLIGNLSQVIIYPTKDTDELIECGVPLSDSEINFLHNTDMRVRQVLVKNIVTNASAFIEIDLKKICKNYFIFLIAMLVIEKSSMILKKQTKKLIRKSI 
 
CQDVVQDVPNVDVQLELYDRSFKDIDGGVWKQGWNIKYDPLKYNAHHKL-----------------KV-FVVPHSHNDPGWIQTFEEYYQHDTKHILSNALRHLHDNPEKFIWAE--ISY-----FARFYHDLGENKKLQKSIV-KNGQLEFVTGG--WVP---------DEANS------------HW-RNVLLQLTEGQTWLKQFNVTPTASWAIDP--------F--GHS---------PT-------PYILQK-SG---------------FK-------NMLIQR--THYS--------VKK-EL----AQQ-RQLEFLWRQIWDNKGDTALFTH-PFYSYDI---PHTCGPDPKVC----------------CQ-FDFKRGSFGLSCPWKVPPRTISDQNVAARSDLLVDQWKKKAELYRTNVLLIPLGDDFRFKQNTEWDVQRVNYERLFEHINSQAHFNVQAQFGTLQEYFDAV-HQAERAGQAEFPTLSGDFFTYADRSDNYWSGYYTSRPYHKRD-----------------RVLHYVRAAELSAWHSWDGMARIEERLEQARR------------------------------------------ELSL-FQHHDGITGTAKTHVVVDYEQRQE-ALKACQVQQSVYRLLTKPSIYSPDFSFSYFTLD-DSRWPGSGVEDSRTTIILGEDILPSKHVVHNTLPHWREQLVDFYVSSPFVSVTDLANNPVEAQVSPVWSWHHDTLTKTIHPQGSTTKYRIIFKARVPPGLATYVLTISDSKPEHTSYASNLLLRKNPTSLPLGQYPEDVKFGDPREISLRVGNGPTLAFSE--QGLLKSIQLTQDSPHVPVHFKFLKYGVRSHGDR---SGAYLFLPNGPAS--PVELGQPVVLVTKGKLESSVSVGLPSVVHQTIRGGAPEIRNLVDIGSLDN-TEIVRLETHIDSGDIFYTDLNGLQFIKRRRLDKLPLQANYYP----IPSGFIEDANTRLTLLTGQPLGGSSLASGELEIQDRRLASDDERGLGQGVLDNKPVLHIYRLVLEKVNNCVRPSKLHPAGYLTSAAHKASQSLLDPLDKFIFAENEWIGAQGQFGGDHPSAREDLDVSVRRLTKSSAKTQRVGYVLHRTNLQCGTPEEHTQKLDVCHLLPNVARCERTTLTFLQNLEHLDGVAPEVCPETAAYVSSHS------- 
-------------------------------------------------MSSVLTDFEPKFLSSDDLLNLYAEYCNG----------HYCD--FK--YK-QGRLSDGNIQSHLYFKKDHFIHDFNGIETFKRFIA-VKAYDVDNITSLALSNLLCEKFNLDILLTVEAMDKEKALFFIRERKKRSKNISYQNIEDLEQMVSTDRA----QI-QKVSLSIMVFASSKKELDEKSIIVYNTLKKEGFSAVLESINMRPIFFSFFPERNFLNSRLRPQTSQNIASLIMFEKYQEGFKENSWGDCPVSVFKNQNGSAHFFNFQAKQGKDKN-----D-NVVGHTMIIGST--GSGKSTFISF----LIANLLTKYHMSVVALD-RMNGLE----IMTDFFEGQY------------NTANTD--G---------GFYINPFSLKDTEENRQFLANWIKFMLNID-SDNQQDNKASQSIDKVIRDT-YN-----------------------------------------YMGEQKNQINLLEIAKNLGSSEQDFN-----------------------EILKSQGEKIYFKNFQDCLDFSKSPLSVINMDAFANDKKLMGLIA-MYLFHKLFFEAKEH--NKPFFLFIDETKDYIM---------------------------HPIMFAYI---ANALA-------------------------QARKI--------------------NGTLCMAFQKISQVKEL---------------------------------------G-----IDKAK-----------------------------SLIGNL---------------S-QVIIYP------TKDTDELIECGVPLS-DSEINFLHN---------------TDM-RVRQ---------VLVKNI-----VTNASAFI---------EIDLKK--------------------ICKNYFIFLI----AMLVIEK----SSMILKK-QTK--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------KLIRKSI 
 
LLQKYVTWDEHS----IFVNGERL-MIFSGEVHPYRLPVASLYIDIFEKVKALGFNCVSFYVDWALLE-GNPGHYSAEGIFDLQPFFDAAKEA--G---IYLLARPGPYINAEVSGG-GFPGWL--QRVDGIL-R-TSDEAYLKATDNYASNIAATIAKAQITNGGPIILYQPENEYSG------------ACCGYNGFPDG--------SY-MQYIEDH--------------ARDAGIVVPFISNDAWA--------------------------AGHNAPGTGAGAVDIYGHDSYPLGFDCANPSTWPSGNLPTYFHTSHEQQSP--S-TPYSLVEFQGGAFDPWGGVGFAKCAALLNHEFERVFYKNDFSFGVAFLNLYMIFGGTNWGNLGHPGGYTSYDYGSAISESRNITREKYSELKLLGNFAKVSPGYLVANPGDLSTSTYTNTADLTVTPLLGSNSSASSFFVIRHSDYSSQASVEYKLTVPTSAGNLTIPQLGGSLTLSGRDSKIHVTDYDVAGTNILYSTAEVFTWKKFNNEKVLVLYGGPGEHHEFAVSGASSSSVVEGSSSGISSKKVGKALVVAWDVSTARRIVQVGSLKVFLLDRNSAYNYWVPQVPTKGTAPGYSNQETTASSIIVKAGYLVRSAYLDGNDLHIQADFNATTPIEVVGAPSGAKNLVINGKKTQTKVDKNGIWSASVAYTAPKVQLPSLKSLKWKSVDTLPEAKNTYDDSAWTSAD---------------------------------------HA-YTNNSAHSLQ-----TPTS--------L---FASD---YGYHT--------------GA----------------LLFR-GHFTANGKEKTFFVQ-------------TKGGTAYGHSIWINETYVGSWAGTSINDNNNATYTLPTLQSGKN-YVI---TVVIDNMGLDEDWTIGSEDMK--------NP--RG--IIQYSLSGQEASAISWKLTGNLG-------------------GENYRDTVRGPLNEGGLYAERQGFHQPQPPTQKWDSSSPFTGLTKPGIRFYSTSFDLDLPSGYDIPLYFNFGNSTSTPAAYRVQLYVNGYQYGKYVNNIGPQTSFPVPEGILNYHGTNWLALSLWAQEDNGAKLDSFELINTTPVLTSLGEVKSVNQPKYQARKGAY------------------------------- 
M-SSVLTD----FEPKFLSSDDLLNLYAEYCNGH--Y-C-DFKYK-QGRLSDGNIQSHLYFKKDHFI-HD-----FNGIE-TFKRFIAVKAYDVDNITSLALSNLLCEKFNLDILL-TVEAMDKEKALFFIRERKKRSKNISYQNIEDLEQMVSTDR--A-------Q--IQKVSLSIMVFASSKKELDEKSIIVYNTLKKEGFSAVLESINMRPIFFSFFPERNFLNSRLRPQTSQNIASLIMFEKYQEGFKENSWGDCPVSVFKNQNGSAHFFNFQAKQGKDKNDNVVGHTMIIGSTG------------SGKSTFISFLIANLLTKYHMSVVALDRMNGLE--------------IMTDFFEGQ-YN-TANTDGGFYIN-PFS--------------------LKD-----T----EENRQFLANWIKFMLNIDSD-------------------NQQDN---KA----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------SQSIDKVIRDTYNYMGEQKNQINLLEIAKNLGSSEQDFNE-ILKSQGEKIYFKNFQDCLDFSKSPLSVINMDAFANDKKLMGLIAMYLFHKLFFEAKEHNKPFFLFIDETKDYIMHPIMFAYIA-------------NALAQARKINGTLC-------MAFQ-----K---------------ISQVKELGIDKAKSLIGNLSQVIIYPTKDTD--------ELIECGVPL-SDSEINFLHNTDMRVRQV-----------LVKNIVTNASAFIEIDLKKICK--------------------------------------------------------------------------------------------------------------------------------------------------------------------NYFIFLIAMLVIEKSSMILKKQTKKLIRKSI 
 
KTVTPEWQTVEKKEQQGTVTIREEKGVRYNQLSSTAQNDNAGKPALFEKKGLTVDANGNATVDLTFKDDSEKGKSRFGVFLKFKDTKNNVFVGYDKDGWFWEYKSPTTSTWYRGSRVAAPETGSTNRLSITLKSDGQLNASNNDVNLFDTVTLPAAVNDHLKNEKKILLKAGSYDDERTVVSVKTDNQEETGPEVDDSKVTYDTIQSKVLKAVIDQAFPRVKEYSLNGHTLPGQVQQFNQVFINNHRITPEVTYKKINETTAEYLKLRDDAHLINAETVRLQV-VDN-QLHFDVTKIVNHNQVTPGQKIDDERKLLSSISFLGNALVSVSSDQTGAKFDGATSNNTHVSGDDHIDVTNPKDLAKGYYGFVSTDKLAAGVWSNSQNSYGGGSNDWTRLTAYKETVGNANYVGIHSSEWQWEKAYKGIVFPEYTKELPSAKVVITEDANADKKVDWQDGAIAYRSINNPQG-WKKVKDITAYRIANFGSQAQNPFLTLD---------------------GIKKINLHT------D--GL-----------------GQGVLLK-----------GYGSEGHDSGHLNYADIGKRIGGVEDFKTLIEKAK-KYG-AHLGIHVNASETYPESKYFNEKILRKNPDGSYSYGWN---------WLDQGINID-AAYDLAHGRL--------ARWEDLKKKLG--------------------DGLDFIYVDVWG----------------NG-----QSGDNGAWATHVLAKEINKQGWRFAIEWGHGGEYDSTFHHWAADLTYGGYTNKGINSA--ITRFIRN--------HQKDAW--VGDYRSYGGAANYPLL---G-GYSKDFEGWQGRSDYNGYVTNLFAHDVTKYFQHFTVSKWENGTPVTTDNGSTYKW-TPER-VEL--VDADNNKVVVTRKSNDVNSPQYRERTVTLNGRVIQDGSAYLTPWNW---------------DA-NGKKLSTDKEKYYFNTQAGATTWTLPSDWAKSKVYLYKLTDQGKTEEQELTVKDGKITLDLLANQPYVLYRSKQTN----PESWSEG--HIYDQGFNSGT----------------------------------------LKHWTISGDASKAEIV---------KSQGANDLRIQGNKEKV------SLTQKLTGLKPNTKYAVYVGVDNRSNAKASITV-NTG--E-KEVTTYTNKSL-------------------ALNYVKAYAHNTRRNNATVD---DTSYFQN-YAFFTTGSDVSNVTLTLSREAGDEATYF---------------------------DEIRTFENNSSYGDKHDTGKGTFKQDFENVAQGIFPFVVGGVEGVEDNRTHLSEKHDPYTQRGWNGKKVDDVIEGNWSLKTNGLVSRRNLVYQTIPQNFRFEAGKTYRVTFEYEAGSDNTYAFVVGKGEFQSGRRGTQASNLEHELPNTWTDSKKAKKATFLVTGAETGDTWVGIYST-GNASNTRGDSGGNANFRGYNDFDNLQIEEITLTGKLTENALKNYLPTVAKEAVFNLSQADDDISVEEARAEIAKI 
-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------MSS-VL-------------------------------------------------------------------------------------------------------------------------------------------------------TDFEPKF--------L-SSDDL-------LNLYAEYCNG------------------HYCDFKYKQGRLSDGNIQSHLYFKKDHFIHDFNGIETFKRFIAVKAYDVDNITSLALSNLLCEKFNLDILLTVEAMDKEKALFFIRERKKRSKNI---SYQNIEDLEQMVSTDRAQIQKVSLSIMVF--------------------------ASSKKELDEKSIIVYNT-LK---------KEGFSAVLESINMRPIFFSFFPERNFLNSRLRPQTSQNIASLIMFEKYQEGFKENSWGDCPVSVFKNQNGSAHFFNFQAK--QG-KDKNDNV---VGHT------------MII---GSTGSGKSTFISF--LIAN-----LLTKYHMSVVALDRMNGLEIMTDFFEGQYNTANTDGGFYIN--------------PFSLKDTEENRQFLANWIKFMLNID------------SDNQ-QD--NKA---------------------------------------------SQSIDKVIRDTYNYMG-EQKNQI-----------------------------------------------------------------------NLLEIAKNLGSSEQD---------FNEILKSQGEKIYFKNFQDCLDFSKSPLSVINMDAFANDKKLMGL-----------IAMYLFHKLFFEAKEHNK--------PFFLFIDETKDYIMH-----PIMFAY----------IANALAQARKINGTLCMAFQKISQVKELGIDKAKSLIGNLSQVIIY--PTKDTDELIE---CGVPLSDSEINF---------------------------LHNTDMRVRQVLVKNIVTNASAFIEIDLKKICKNYFIF----L---------IAMLV---------------------IEKS---------------SM-----ILKKQTKKLIRK-----------------------------------------------------------------------------------------SI-------------------------------------------------------------------------- 
 
IEYDALLVDRFLNILQDLHGPSLREFVQECYEVSADYEGKGDTTKLGELGAKLTGLAPADAILVASSILHMLNLANLAE----------------------------------------------------------EVQ-IAHRRRNSDIEETLKRLVSEVGKSPEEVFEALKNQTVDLVFTAHPTQSARR----SLLQKNARIRNCLTQLNAKDITDDDKQELDEALQREIQAAFRTDEI---RRAQPTP-Q-AEMRYGMSYIHETVWKGVPKFLRRVDTALKNIGINERLPYNVSLIRFSSWMGGDR------DG--NPRVTPEVTRDVC------LLARMMAANLYIDQIEELMFELSMWRCNDELRVRAEELHSSSGSKVTKYYIEFWKQIPPNEPYRVILGHVRDKLYNTRERARHLLASGVSEISAESSFTSIEEFLEPLELCYKSLCDCGDKAIADGSLLDLLRQVFTFGLSLVKLDIR--QESER----HTDVIDAITTHLGIGSYREWPEDKRQEWLLSELRGKRPLLPPDLPQTDEIADVIGAFHVLAELPP-DSFGPYIISMATAPSDVLAVELLQRECG-VRQP--LPVVPLFERLADLQSAPASVERLFSVDWYMDRIKGKQQVMVGYSDSGKDAGRLSAAWQLYRAQEEMAQVAKRYGVKLTLFHGRGGTVGRGGGPTHLAILSQPPDTIN---------------------GSIRVTVQGEVIEFC--------------FG-EEHLCFQTLQRFTAATLEHGMHPPVSPKPEWRKLMDEMAVVATEEYRSVVVKEARFVEYFRS-------------------------------------ATPETEYGR-------MNIGSRPITTLRAI-PWIFSWTQTRFHLPVWLGVGAAFKFAIDKDVRNFQVLKEMYNEWPFFRVTLDLLEMVFAKGDPGIAGLYDELLVAEELKPFGKQLRDKYVETQQLLLQIAGHKDILEGDPFLKQGLVLRNP-YITTL--NV-----------FQAYTLKRIRDPNFKVTP-QPPLSK-EAGLVKLNPA-------------------------SE-YP--PGLEDTLI--------L-TMKGIAAGMQNTG------------------------------- 
-------------------------------------------------------------------------------MSSVLTDFEPKFLSSDDLLNLYAEYCNGHYCDFKYKQGRLSDGNIQSHLYFKKDHFIHD--FNG--I--ETFKRFI-AVK----AYDVDNITS-LALSNLLCEKFNLDI----LLTVEAMD------------------------------------------KEKALFFIRERKKRSKNISYQNIEDLEQMVS--TD--R------------------AQIQKVSLSIMVFASSKKELDEKSIIVYNTLKKEGFSAVLESINMRPIFFSFF---------------------------------------------------------P--ER-NF----------------LN--SRLRPQT----------------------------SQNIASL--IMFEKYQEGFKENSWGDCPVSV-FKN-------------------------------Q-N-G-SA--HF-FNFQAKQGKDKNDNVVGHTMIIG-STGSGKSTFISFLIANLLTKYHMSVVALDRMNG---LEIMTDFFEGQYNTA-NT--DGG-FYINPFS----------------LKD-TEENRQFLANWIKF-------------------------------MLNIDSDNQQDNKASQSIDKVI--------------RDTYNYMGEQKNQIN-LLEIAKNLG------------------------------------------------------SSEQDFNEILKSQGEKIYFKNFQDCLDFSKSPLSVINMDAFAND--KKLMGLIAMYL-F-----HKLFFEAKEHNK-P------------------------------------------F--FL-FID---E-------------------------------------------------TKDYIM-HPIMFAYIA-NALAQARKINGTLCMAFQK--ISQVKELGIDKAKS-LIG-NLSQV--IIYPTKDTDELIECGVPLSDSEINFLHNTDMRVRQVLVKNIVTNASAFIEIDLKKICK-----NYFIFLIAMLVIEKSSMILKKQTKKLIRKSI 
 
QIKKLLVANRGEIAIRIFRAAAELDISTVAIYSNEDKSSLHRYKADESYLVGSDLGPAESYLNIERIIDVAKQANVDAIHPGYGFLSENEQFARRCAEEGIKFIGPHLEHLDMFGDKVKARTTAIKADLPVIPGTDGPIKSYELAKEFAEEAGFPLMIKATSRIVREESELEDAFHRAKSEAEKSFGNSEVYIERYIDNPKHIEVQVIGDEHGNIVHLFERDCSVQRRHQKVVEVAPSVGLSPTLRQRICDAAIQLMENIKYVNAGTVEFLVSGDEFFFIEVNPRVQVEHTITEMVTGIDIVKTQILVAAGADLFGEEINMPQQKDITTLGYAIQCRITTEDPLNDFMPDTGTIIAYRSSGGFGVRLDAGDGFQGAEISPYYDSLLVKLSTHAISFKQAEEKMVRSLREMRIRGVKTNIPFLINVMKNKKFTSGDYTTKFIEETPELFDIQPSLDRGTKTLEYIGNVTINGFPNVEKRPKPDYELASIPTVSSSKIASFSGTKQLL-DEVGPKGVAE----WVKK---QDDVLLTDTTFRDAHQSLLATRVRTKDMINIASKTADVFKDGFSLEMWGGATFDVAYNFLKENPWERLERLRKAIP-----NVLFQMLLRASNAV-G--YKNY---------------------PD-NVIHKFVQESAKAGIDVFRIFD------------SLNW---------VDQ--------MK-VANEA--------------VQEAG-KISEGTIC---YT-G-----DILNPERS---NIYTLEYYVKLAKELEREGFHILAIKDMAGLLKPKAAYELIGELKSAV---DL--PIHLHTHD-TS--GNGLLTYKQAIDAGVDIIDTAVASMSGLTSQPSANSLYYAL-NG-----FPRH--LRTDIEGMESLSHYWSTVRTYYSDFESDIKSPNTEIYQHE---------------------------------------------------------------------------------------MPG-GQYSNLSQQAKSLGLGERFDEVKDMYRRVNF-------------------------------------------------------------------LFGDI-VKVTPSSKVVGDMALYMVQNDLDEQSVITDGYKLDFP---ESVVSFFKGEIGQPVNGFNKDLQAVIL--------KGQEA--LTARPGEYLEPVDFEKVRELLEEEQQGPVTEQDIISYVLYPKVYEQYIQTRNQYGNLSLLDTPTFFFGMRNGETVEIEIDKGKRLIIKLETISEPDENGNRTIYYAMNGQARRIYIKDENVHTNANVKPKADKSNPSHIGAQMPGSVTEVKVSVGETVKANQPLLITEAMKMETTIQAPFDGVIKQVTVNNGDTIATGDLLIEIE------------------- 
-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------MSS-VL-TDF--------EPKFL---SSDDLLNLYAEYCNG--H-----YCDF--KY--KQGRLSDG-N-I-QSHLYFKKDH-FIHDFNGIETFKRFIAVKAYDVDNITSLALSNLLCEKFNLDILLTVEAMDKEKALFFIRERKKRSKNISYQNIEDLEQMVSTD--R-AQIQKVSLSIMVFASSKKELDEKSIIVYNTLKKEGFSAVLESINMRPIFFSFFPERNFLNSRLRPQTSQNIASLIMFEKYQEGFKENSWGDCPVSVFKN-QNGSAHFFNFQAKQGKDKNDNVVGHTMIIG---STGSGKSTFISFLIANLLTKYHMSVVALDRMNGLEIMTDFFEGQYN-TANTDGGFYINPF--SLKD-TEENRQFLANWIKFMLNIDSDNQQDNKAS-------------------------------------QSIDKVIRDTYNYMGEQKNQINLLEIAKNLGSSEQDFNEILKSQGEKIYFKNFQDCLDFSKSPLSVINMDAFANDKKLMGLIAMYLFHKLFFEAKEH-------------------------NKPFFLFIDETKDYIMHPIMFAYIANALAQARKINGTLCMAFQKISQVKELGIDKAKSLIGNLSQVIIY---PTKD--TDELIECGVPLS-DSEI---NFLHNTDMR---VRQVLVKNIV------T---NA-SAFIEIDLKKICKNYFIFLIAMLVI----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------EKSSMILKKQTKKLIRKSI 
 
QNRAQKLLHYLGHVMVNGPTTPIPVKASPSPTDPVVPAVPIGPPPAGFR-DILLREGP---------E-GFARAV-R-------------------N-H-----------------------------P-------GLLLMDTTFRDAHQ-----SLLAT-RV------------RTHDLKKIAPYVAHNFSKLFSMENWGG-----------ATFD----VA-MR--FLYECPWRRLQE-LREL--------------IPN---IPFQMLLRGANAVG--Y-----TNYPDN------V-VFKFCEVAKENGMDVFRVFDSLN-YLPNML-LGME-AAGSA-G-GVVEAAISYTGDVADPSRTKYSLQYYMGLAEELVRAGTHILCIDMAGLLKPTACTMLVSSLRDRFPDLPLHIHTHDTSGAGVAAMLACAQAGA-------------------DVVDVAADSMSGMTSQPSMGALVACTRGTPL----------DTEVPMERVFDYSE-YWEGARGLYAA-FD-------------------------CTAT--M-KSGNSDVYENE---IPGG--QYTNLHFQA----------------HSM------------------GLGSK----------------------FKEVKKAYVEANQMLGDLIKVTPSSKIVGDLAQFMVQNGLSRAEAEAQAEELSFPRSVVEFLQGYIGVPHGGFPEPFRSKVLKDLPRVEGRPGASLPPLDLQALEKELVDRHGEEVTPEDVLSAAMYPDVFAHFKDFTATFGPLDSLNT---------------------------------RLFLQGP-KIAEEFEVELERGKTLHIKALAVSDLNRAGQRQVFFELNGQLRSILVKDTQAMKEKALKDVKGQIGAPMPGKVIDIKVVAGAKVAKGQPLCVLSAMKMETVVTSPMEGTVRKVHVTKDMTLEGDDLILEIE------------------ 
-----------------------------------------------M-SSVL-----TDFEPKFLSSDDLLNLYAEYCNGHYCDFKYKQGRLSDGNIQSHLYFKKDHFIHDFNGIETFKRFIAVKAYDVDNITSLALSNLLCEKFNLDILLTVEAMDKEKALFFIRERKKRSKNISYQNIEDLEQMVSTDRAQIQKVSLSIMVFASSKKELDEKSIIVYNTLKKEGFSAVLESINMRPIFFSFFPERNFLNSRLRPQTSQNIASLIMFEKYQEGFKENSWGDCPVSVFKNQNGSAHFFNFQAKQGKDKNDNVVGHTMIIGSTGSGKSTFISFLIANLLTKYHMSVVALDRMNGL--E-------IMTDFFEGQYNT-ANTDGGFYINP---FSL-KDTEENRQFLANWIKFM-------------------------LNIDSDNQQDNKASQSIDKVI-----------------------------RDTYNYMGEQKNQINLLEIAKNLGSSEQDFNEILKSQGEKIYFKNFQDCLDFSKSPLSVINMDAFANDKKLMGLIA-------MYLFHKLFFEAKEHNK--PFFLFIDETKDYIMHPIMFAYIANALAQARKINGTLCMAFQ-KISQVKELGIDKAKSLIGNLSQVIIYPT-K-----------DT--DEL-IECGVPLSDSEI--------------------------------------------------------------------------------------------------------------NFLHNTDMRVRQVLVKNIVTNASAFIEIDLKKICKNYFI-FL------------------------------I-A------------MLVIE--------------------------------------------------------------------------------KSSMILKKQTKKLIRKSI 
 
DFFRD----------EAERIMRDSPVIDGHNDLPWQLLDMFNNRLQDERANLTTLAGTHTNIPKLRAGFVGGQFWSVYTPCDTQNKDAV-RRTLEQMDVVHRMCRMYPETFLYVTSSAGIRQAFREGKVASLIGVEGGHSI-------DS----------------------SL-GVLRALYQLGMRYLTLTHSCNTPWADNWLVDTGDSE--PQSQ-G--L-SPFG------QRVVKELNRL---------------G-----VLIDLAHVSV-------------------------ATMKATLQLS-RAPVIFSHSSAYSVCASRR-NVPDDVLRLVK-Q-TDSLVMVN-FYNNYISCTNKANLSQVADHLDHIKEVAGARAVGFGGDFDGVPRVPEGLEDVSK-YPDLIAELL---RRNWTEAEVKGALADNLLRVFEAVEQASNLTQAP------------------E--EE-PIPLDQL-------------------------------------------------------------------------------------------------------------------------------------------------------------------------GGSCRTHYGYSS------------------------- 
-----MSSVLTDFEPKFLSSDDLLNLYAEYCNG--HY---C-------DFKY--------KQGRLSD-GNIQSHLYFKKD-------H-FIH-DFNGIETFKRFIAVKAYDV---------DNITS--LALSNLLCEKFNLDILLTVEAMDKEKALFFIRERKKRSKNISYQNIEDLEQMVSTDRAQIQKVS---L---SIMVFASSKKELDEKSIIVYNTLKKEGFSAVLESINMRPIFFSFFPERNFLNSRLRPQTSQNIASLIMFEKYQEGFKENSWGDCPVSVFKNQNGSAHFFNFQAKQGKDKNDNVVGHTMIIGSTGSGKS--TFISFLIANLLTKYHMSVVALDRMNGLEIMTDFFEGQY-NT---ANTD--------GGFYINPFSLKDT---EENRQFLANWIKFMLNIDSDNQQD-NKASQS-IDKVIRDTYNYMG--------EQKNQINLLEIAKNLGSSEQDFNEILKSQGEKIYFKNFQDCLDFSKSPLSVINMDAFANDKKLMGLIAMYLFHKLFFEAKEHNKPFFLFIDETKDYIMHPIMFAYIANALAQARKINGTLCMAFQKISQVKELGIDKAKSLIGNLSQVIIYPTKDTDELIECGVPLSDSEINFLHNTDMRVRQVLVKNIVTNASAFIEIDLKKICKNYFIFLIAMLVIEKSSMILKKQTKKLIRKSI 
 
(a)Residue pairs that are structurally aligned by TM-align program are highlighted in color. Coloring scheme is based on the property of amino acids, where polar are brightly coloured while non-polar residues are colored in dark shade.(more about the colors used)
(b)Ranking of proteins is based on TM-score of the structural alignment between Model1 and the PDB structures in our template library.
(c)RMSDa is the RMSD between residues that are structurally aligned by TM-align.
(d)IDENa is the percentage sequence identity in the structurally aligned region.
(e)Cov. represents the coverage of the alignment by TM-align and is equal to the number of structurally aligned residues divided by length of the model.

  Function Prediction

  Predicted EC Numbers

RankTM-scoreRMSDaIDENaCov.EC-ScorePDB
Hit
EC No.
10.4651 3.10 0.07 0.510.41821y65A 3.2.1.52
 
20.4414 6.56 0.06 0.630.38513ddgA 3.2.1.114
 
30.3974 7.11 0.06 0.590.35621o95A 1.5.8.2
 
40.3990 6.69 0.06 0.570.35097reqC 5.4.99.2
 
50.3349 5.05 0.10 0.420.34851fhlA 3.2.1.89
 
(a)Ranking is based on EC-score, which is a confidence score for the Enzyme Classification (EC) Number prediction.
(b)RMSDa is the RMSD between models and the PDB structure in the structurally aligned regions by TM-align.
(c)IDENa is percentage sequence identity in the structurally aligned region.
(d)Cov. represents the coverage of the alignment and is equal to the number of structurally aligned residues divided by length of model.
(e)EC-Score is defined based on the C-score of the structure prediction and similarity of the model with known enzyme structures, as identified using both global and local structural alignment programs. The global similarity score uses TM-score, IDENa,RMSDa and Cov. of the structural alignment by TM-align, while the local match compares the structural and chemical similarity of local spatial motifs in the model with known catalytic site of enzymes. A prediction with a EC-score >1.1 signifies a prediction with high confidence (upto 3 digit numbers of EC) and vice versa (For details, see Ambrish, Kucukural and Zhang,Large scale benchmarking of protein function prediction using predicted protein structures, 2009, submitted).

  Predicted GO terms

RankTMscoreRMSDaIDENaCov.PDB
Hit
Fh-ScoreAssociated GO Terms
10.8648 1.95 0.10 0.901x39A 0.88 GO:0004553 GO:0005975
20.6584 2.04 0.09 0.681x39A 0.66 GO:0004553 GO:0005975
30.4674 3.05 0.06 0.512oxnA 0.41 GO:0004563 GO:0007049 GO:0009058 GO:0009653 GO:0016043 GO:0044085 GO:0044238 GO:0048856 GO:0048869 GO:0050793 GO:0050794 GO:0051301 GO:0065008 GO:0005622 GO:0005623 GO:0005737
40.4414 6.56 0.06 0.633ddgA 0.39 GO:0004559 GO:0004572 GO:0008270 GO:0030246 GO:0043169 GO:0006013 GO:0006066 GO:0009987 GO:0000139 GO:0005622 GO:0005623 GO:0005737 GO:0005783 GO:0005794 GO:0005795 GO:0012505 GO:0016020 GO:0016021 GO:0043229 GO:0044444
50.3418 4.63 0.11 0.421jneA 0.37 GO:0004568 GO:0008084 GO:0043169 GO:0006032 GO:0006044 GO:0006046 GO:0007444 GO:0009056 GO:0043283 GO:0048856 GO:0005576
60.4159 6.53 0.06 0.591xc6A 0.37 GO:0004565 GO:0043169 GO:0005975 GO:0009341
70.4183 6.61 0.06 0.591xc6A 0.36 GO:0004565 GO:0043169 GO:0005975 GO:0009341
80.3957 7.05 0.06 0.582tmdB 0.35 GO:0005506 GO:0010181 GO:0043169 GO:0050470 GO:0050662 GO:0051539 GO:0055114
90.3990 6.69 0.06 0.577reqC 0.35 GO:0004494 GO:0031419 GO:0043169 GO:0050897 GO:0008152
100.3349 4.99 0.10 0.421fobA 0.35 GO:0015926 GO:0031218 GO:0043169 GO:0005975

Consensus Prediction of Gene Ontology terms 
Molecular Function  Biological Process Cellular Location
GO termGO-Score  GO termGO-Score GO termGO-Score
GO:00038240.449  GO:00081520.449  GO:00057370.212
GO:00167870.378  GO:00442380.378  GO:00444240.212
GO:00045530.378  GO:00059750.337  GO:00056230.212
GO:00167980.378  GO:00488560.119  GO:00444640.212
GO:00054880.253  GO:00099870.080  GO:00056220.212
GO:00431690.253  GO:00325020.078  GO:00329910.106
GO:00431670.253  GO:00507940.041  GO:00093410.106
GO:00469140.109  GO:00440850.041  GO:00432340.106
GO:00468720.109  GO:00160430.041  GO:00432270.105
GO:00159250.073  GO:00650080.041  GO:00432290.105

(a)Ranking in the first table is based on a function prediction score (Fh-score), which is calculated based on the C-score of the structure prediction and the TM-score, IDENa, RMSDa and Cov. of the structural alignment by TM-align between the predicted model and the PDB structures (For details, see Ambrish, Kucukural and Zhang,Large scale benchmarking of protein function prediction using predicted protein structures, 2009, submitted).
(b)RMSDa is the RMSD between models and the PDB structure in the structurally aligned regions by TM-align.
(c)IDENa is the percentage sequence identity in the structurally aligned region.
(d)Cov. represents the coverage of the alignment and is equal to the number of structurally aligned residues divided by length of model.
(e)A consensus prediction of GO terms is derived from the structural analogs that have an Fh-score of >=1.0. The GO-Score associated with each prediction is defined as the average weight of the GO term, where the weights are assigned based on the Fh-score of the template from which the GO term is derived. A prediction with a GO-score >0.5 signifies a prediction with high confidence and vice versa.


  Predicted Binding Site


Binding site residues in the model:
LEU:147  ILE:149  GLU:158  GLU:161  LYS:218  TYR:219  PHE:223  
ASN:226  ASP:230  CYS:231  PRO:232  VAL:233  SER:234  ILE:266  
ILE:267  GLY:268  SER:269  THR:270  ARG:298  THR:531  
Identified analogs with similar binding site:

RankPDB
Hit
TM-scoreRMSDaIDENaCov.BS-scoreDownload
Complex
Binding site residues on the predicted model
11fa2A0.3765 5.94 0.05 0.50 1.00Download147,149,158,161,218,219,223,226,230,231,232,233,234,266,267,268,269,270,298,531
21wdqA0.3774 5.97 0.04 0.50 0.54Download147,149,156,157,158,161,219,221,222,223,224,226,229,230,231,232,233,234,267,269,270
31iexA0.8665 1.89 0.10 0.89 0.42Download50,51,52,144,218,265,298,299,324,329,449,453
41didA0.3508 5.24 0.07 0.45 0.36Download25,49,88,90,144,216,218,265,296,297,298,299,323,324
51ituA0.3820 4.83 0.05 0.47 0.28Download51,52,298,299,300,328,329,453
62epnA0.3614 5.42 0.07 0.46 0.22Download147,218,219,223,265,266,267,268,298,299,449

(a)Ranking of the analogs in the table is based on their BS-score. BS-score is calculated by first finding the structural analogs of the predicted model based on their TM-score, IDENa and Cov. of the structural alignment. The ligand(s) in the analog structure are then transferred onto the model and the fitness of the ligand-model complex (BS-score) is calculated by comparing the local structure and sequence similarity in the binding site region. (For detail, see Kucukural, Szilagyi, Ambrish, and Zhang, Template based ligand binding site prediction on modeled protein structure, 2009, in preparation).
(b) A BS-score of >0.5 signifies a binding site prediction with high confidence and vice-versa.
(c)RMSDa the RMSD between residues that are structurally aligned by TM-align.
(d)IDENa is the percentage sequence identity in the structurally aligned region.
(e)Cov. represents the coverage of the alignment by TM-align and is equal to the number of structurally aligned residues divided by length of the model.
(f)The image shows the ligand-protein complex with the best BS-score. The ligand is depicted in magenta colored ball & stick, the predicted binding site residues interacting with the ligand are shown as transparent green spheres, while the N & C terminus in the model are marked by blue and red spheres respectively.




 Please cite following articles when you use the I-TASSER server:
1.Yang Zhang. I-TASSER server for protein 3D structure prediction. BMC Bioinformatics, 9:40 (2008).
2.Yang Zhang. Template-based modeling and free modeling by I-TASSER in CASP7. Proteins, 8: 108-117 (2007).
3.Sitao Wu, Jeffrey Skolnick, Yang Zhang. Ab initio modeling of small proteins by iterative TASSER simulations. BMC Biology, 5:17 (2007).