Alignment of the MULE/TnpA group of transposases variable helical insert RNAse H Core <---*-------------------------------------------------------------------------------*-----------------------------> | * Strand-1 Strand-2 Str-3 Hel-1 Str-4 Hel-2 Str-5 v catalytic helix Secondary Structure prediction -EEEEEEEEEEE--- ----EEEEEEEEE-----EEEEEEEE----HHHHHHHHHHHHH---------------EEEEEE-----HHHHHHHH----------EEEEEE--HHHHHHH--- -EEE---HHHHHHHHHHHHHH--- -HHHHHHHHHHHHH-- tnp_Paer_45360229 170 IVYLDCIHSKVREGA -VRVKAVYLALGINLAGEKEILGLWIAQNEGAKFWLQVVTELR--------NRGVQ-DIFVACVDGLKGFPEAIEAVFPRT-----SVQLCIVHMVRHSLNYVSW 63 -VIYTTNAIESVNMSLRKITKNRG 19 SKKWTMPIRDWKAALT 395\TnpA/prokaryotic transposases SMa1356_Smel_14523863 161 LVFFDAIRVKIRDEG FVRNKAVYVALAVLADGSKEILGLWIEQTEGAKFWLRVMNELK--------NRGCQ-DILIAVVDGLKGFPEAITAVFPQT-----IVQTCIVHLIRHSLEFVSY 64 -IIYTTNAIEALNSKLRRAVRSRG 19 AEQWKRAPREWVEAKT 388| Rmet_2268_Rmet_94311206 174 VVFFDALRVKIREDA VVRNKAVYLALGVLPDGTREILGLWIENTEGAKFWMKVFNDLK--------TRGVN-DILIAVTDGLKGMPEALSAVFPAT-----TLQTCIVHLIRNSLNYASW 64 -VIYTTNAIENINSQLRKIIKTRG 19 TSDWGRAAKDWKDAMN 401| PSPPH_2312_Psyr_71558630 177 VIFFDALRVKIRDEG LVCNKAIYLALGVLPDGTRDILGIWIESTEGAKFWMKVFNDLK--------TRGVE-DVLIAVTDGLKGIPEALGAVFPAT-----TLQTCIVHLIRNSLDYAAW 64 -VVYTTNAIESINAQLRKIIKTRG 19 TANWGHAAHDWKVAMN 404| EcolE1_01003404_Ecol_75234766 162 IVYLDCIVLKVRQDS RVINKSVFLALGINIEGQKELLGMWLAENEGAKFWLNVLTELK--------NRGLN-DILIACVDGLKGFPDAINTVYPEA-----RIQLCIVHMVRNSLRFVSW 63 -VIYTTNAIESLNSVIRPAIKKRK 19 SQKWTMPLRDWRMAMS 388| CtheDRAFT_2840_Cthe_67874546 171 IVFIDAIHFSVKNDG IVGKKAVYIVLAIDIEGQKDVIGIYVGENESSKFWLSVLNDLK--------NRGVK-DILILCADALSGIKDAINAAFPNT-----EYQRCIVHQIRNTLKYVSD 63 -IMYTTNTIESLNSSYRRINKSRT 19 TSKWTMRYKNWGLILG 397| MYPE9620_Mpen_26554410 166 IIFVDGIRFKVRDDS QFLEKSVYIILGVNLEGIKEILGFWIGETESAKQWLSIFNDLK--------SRGVK-KVFISCSDNLTGISDAFKSAFPET-----YIQKCIVHQVRNSTKFVRY 63 -LIYTTNLIENVNRNIRKITKAKG 19 SENWKSRVANWGLILQ 392| EF_2800_Efae_29344737 159 AIFMDATYIPLKRQT -VSKEAIYIAIGIREDGTKEVLSYAIAPTESTYVWNELLQDIN--------SRGVQ-EVLLFITDGLKGMKDTIHQIYPKA-----KYQHCCIHVSRNIAHKVRV 62 -TIYSTNLIESFNKQIKRYSRRKE 19 NQKFLNRSHKGFQQVT 383| tnp_Saur_14246183 159 AIYMDATYIPLKRKT -VAKEAIHIAVGIRPDGSKEVLSYAIAPTESITIWEEILLDLQ--------ERGLK-NVLLFITDGLKGMVGAISRFYPKA-----RFQHCCVHVSRNISHKVRV 62 -SIYSTNLIESFNKQIKKYSHRKE 19 NQKFLGRSHKGFQQAE 383| BH8065401_Bhal_47076577 151 VLYLDGTYLKLRRED -VANEVVYLVVGVTEEGYREILGFYVGGQESANGWRNTLLDLY--------SRGLE-EVLLGVFDGLAGLEEAMKAVYPKA-----DVQRCVVHKVRNALHAVRK 63 -MIYTTNIIERTMKEIKKRTKTMN 19 NERWATRRLRGFGEAY 376| TTHB232_Tthe_55773588 156 FVYLDGLSLKVFREG 1 GIVRESVYVALGIAPNGERRVLGFWLLPTESALGWEGVLGELW--------QRGLR-RVLLFVTDGLPGLPEAIRRVYPQA-----EWQRCVVHGVRWSLSQVRS 63 -YLRSTNLMERFIRELRRGTKVRD 20 EGRWAERKLKGFSEVK 384| FaciDRAFT_1713_Faci_68141314 148 AIFLDGLFFYLRRGN -VDKEPVIFALGIKETGEYEVLGFYLTVKESHNSYKEVLEDLY--------SRGLK-EPLLIVADGIKNLDEEVMEIYPRS-----EFQLCTIHYTRGLKSNVRE 63 -SLKSTNAIERLNGEVRRRVKTIS 19 NSKHAFRKMNGYYKCK 373| SCP1.259_Scoe_13620745 173 YVWADGIHLNVRLEE --AKACVLVLVGVRADGSKELVALKDGYRESAEAWADLMRDCA--------RRGMR-APVLAVGDGALGFWKALAEVFPAT-----REQRCWVHKTANVLDAMPK 63 -HLRTTNPIESTFATVRLRTKVTR 19 QQRWRAVNAPHLVALV 397| Mmcs_1475_Msp._108798446 179 YLWVDGIHLKVRLDQ --EKLCLLVMLGVRADGRKELVAITDGYRESTESWADLLRDCK--------RRGMT-APVLAVGDGALGFWKAVREVFPAT-----KEQRCWFHKQANVLAALPK 63 -HLRTTNPIESTFATVRLRTKVTK 19 AARWRAVNAPHLVALV 403| NB311A_20481_Nsp._85714597 192 YIWADGIHLQARLED --EKQCILVLIGATPEGRKELVGFTDGARESAQDWRDLLLDLK--------RRGLDVPPQLAIADGALGFWKAAGEVWPKT-----REQRCWVHKTANVLAKLPK 63 -HLRTTNPIESTFATVRHRTIRSK 19 QKSWRRLDGHNQLSKL 417| Magn03007664_Mmag_23013207 176 YVWADGVYLQARMED --QAECMLVLIGATPEGRKELVGFQVGVRESAQSWRELLVDVK--------RRGLSIAPEIAVGDGALGFWQALDEVFPGT-----AHQRCWVHKAANVLDKVPK 63 -HLRSSNPIESVFATVRHRTVRTK 19 SKTWRRLKGENQLPKV 401| SSO1980_Ssol_13815256 168 ALYIDVKIVKVRISE SIMERAIYIAIGVDLEGNKFVLDYEVRDREDLDGWKSFLSGLV--------SRGVS-RVDVIVSDDFSGLDRVVSTLFPSS-----QHQLCITHMVRNLMRVLPD 63 -YLYTNNTSESFNLTLARFEEELG 20 NSRWKFRPMSVIRHYS 395| DSM3645_27508_Bmar_87310248 184 YVMLDGIWLKRSWGG EVKNIAVLVAVGVRSDGHREILSVAEGTKEDSESWRTFLRHLK--------ERGLQ-GVRLITSDKCLGLVEALGEFFPEA-----AWQRCIVHFYRNVLKDVPR 64 -RLRSNNMLERIMKEIRRRTRVVG 20 GTHWGTRRYLDMDRLH 412| Tn951_orf1_Bcep_208687 162 YLFLDARYEKVRLEG RIVDCAVLIAVGIEASGKRRVLGCEVATSEAEINWRRFLESLL--------ARGLK-GVTLIIADDHAGLKAARRAVLPSV-----PWQRCQFHLQQNAGALTTR 64 -RLRTTNGLERINRELRRRTRVAS 19 DDEWMTGKVYLNFNP- 388| AcryDRAFT_2294_Acry_88940363 157 YLWLDATYLKQREGG RIVSVAAIIAVAANTEGKREIVGLHIGPSEAETFWAAFLKSLV--------RRGLR-QVKLVISDAHEGLKSAIRRVM-GA-----SWQRCRVHWMRNALAYVGK 63 -KIHSTNPIERLNKEVKRRADVVG 19 NDEWQTQNRYMQTEPM 382| CtheDRAFT_2466_Cthe_67876288 161 YLWLDATFPKVREGG RVCSMALVIAVGVNQQGEREILGFDVGMSEDGAFWEEFLRRLV--------ARGLK-GVRLVISDAHEGLKAAIKKILTGS-----AWQRCRVHFMRNVLSQVPK 63 -QIHSTNPLERLNREIRRRTDVVC 19 NDEWKVGRRYFSLESM 387| WH5701_02179_Syn_87301444 162 YVYLDATYLKGRLGK 2 QVCSRAVVVAMGVNVDGRRELLGLKVGDSETEGFWSEFLASLIAAGFCEAVERGLT-GVKLVISDAHVGLTKAIRRQLQGC-----VWQRCRVHFARNLLQRVPI 71 -KVWSTNLLERVNEEIKRRTRVVG 19 HEHWQLEGRRMFSAES 406| Mbur_1067_Mbur_91773056 154 YLYVDATYLKVRDRV RYVNKAVFIVAGVKNDGYREILGVKIADSEEAMFWEEMFTDLK--------ERGLR-GVELVISDGHKGIQRAVERQFLGA-----SWQMCIVHLERLILKKLPR 58 -RIKTTNMIERLNKEVKRRSKVVG 19 NENWITGNRYLTMED- 374| MM2530_Mmaz_21228632 172 YLFIDATYLKVRDGL HYENKALFIVSGVRDDGFREILGARLADSEDSLFWQDLFEDLK--------ERGLR-GVKLIVSDGHKGIQKAVRESFIGS-----SWQMCHVHLIRQALKKVQK 58 -KIRTTNMMERTNKEIKRRTKVVG 19 NEDWITGNRYIVMEQ- 392| tnpAb_Dhaf_66271271 148 VLWVDALYEKIRDDR RVKNMAVLIVTGIDLEGKRDILAVEPMPEESTATYTSLFEKLK--------SRGLE-KVWLVVSDAHKGLVKAVQESFIGC-----SWQRCKVHFMRNILAHISG 63 -KIASTNLLERLNREVRRRTRVVG 19 SEDWCSGRSYINPKII 374/ F52C6.14_Cele_2746816 274 IIILDDTHNVTMYG- LKLTTITVVDNNDRGEPAGFLLSSSTTSAEVAVFFQKVKELY-----------PEF-RPAFFMSDEANCFWNGFSAVFDSTHT---KKVLCRWHLLRSWCK---- 36 -LSLLTELDESGNAKAKQFSDYFM 6 IGQWSTTSRANIACHT 454\Celegans expansion Y73F8A.33_Cele_17544214 317 GICIDDTHNPSKYN- LKLTTMLVVNGHGRGIPVAYMISSTVTQEDVKQLFECIVKEI-----------PDF-HPQYFMSDEAHAFWNGYNCVLKNHKT---QRLWCIWHVQRALFENADK 27 --NLLATLLEHLENDCENGKQYAD 6 DKLWAGCFRRGAPFQT 491/ ECU05_0180_Ecun_19170760 194 VPIVEIRIYKRTEG- ----FVILGVLRDPAW---FPVVYSCVVSDLADRDDAFGYFVDS-----------M-PSLFFLVDFDVHLIRVLKEKDR-------EFFVKTRDICKFYYNRGKS 37 DMLGLQNTSECDVEFIGLGIFNLP 11 ISDNLKQRKKINLGEG 374\HxC group K2N11.4_Atha_10177931 427 VVVVDGTFLQGKYL- ----GTLLTATTQDGNFQIYPIAFAVVDTENDASWEWFFRQLSS-------VIPDD-ESLAIISDRHQSIKRAIMTVYPKS-----SRGICTYHLYKNILVRFKG 54 YNLLTSNIAESMNKVMSPARSLPI 10 MTRWFSDRRNDALNLS 632| At2g12720_Atha_4850412 449 VVVVDGTFLHGSYK- ----GTLLTALAQDGNFQIFPLAFGVVDTENDDSWRWLFTQLKV-------VIPDA-TDLAIISDRHKSIGKAIGEVYPLA-----ARGICTYHLYKNILVKFKR 54 YNLMTTNIAESMNRALSQAKNLPI 10 MTRWFAERRDDASKQH 654| At2g29230_Atha_3980409 542 VIVIDGAHLKGKYG- ----GCLLTASGQDANFQVFPIAFGVVDSENDDAWEWFFRVLST-------AIPDG-DNLTFVSDRHSSIYTGLRRVYPKA-----KHGACIVHLQRNIATSYKK 54 YNVMTSNVAESLNAVLKEARELPI 10 LISWFAMRREAARTEA 747| At2g07230_Atha_3805769 251 VIVIDGTHLRGRYG- ----GCLIAASAQDANFQVFPIAFGIVNSENDDAWTWFMERLTD-------AIPND-PDLVFVSDRHSSIYASMRKVYPMS-----SHAACVVHLKRNIVSIFKS 54 YFFMTSNIAESLNNVLTMARDYPV 10 LVTWFALRQETAQHEG 456| F7N22.13_Atha_3047071 506 VVVVDGTQLVRPYK- ----GCLLIACAQDGNFQIFPIAFGVVDGETDASWAWFFEKLSE-------IVPDS-DDLMIVSDRHSSIYKGLSVVYPRA-----HHGACAVHLERNLSTYYGK 54 YNIMSSNNSESMNHVLTKAKTYPI 10 LMRWFASRRKKVARCK 711| T12C22.11_Atha_8655994 831 ILVVDGTHLKGKYK- ----GVLLTSSGQDANFQVYPLGFAVVDSENDESWTWFFTKLER-------IIADS-KTLTILSDRHSSILVAVKRVFPQA-----NHGACIIHLCRNIQTKYKN 54 FNLMTSNIAETLNKALNKGRSSHI 10 LTRWFNARRKKSLKHK 1036| At2g15150_Atha_4585914 242 VISIDGAHLTSKFK- ----GTLLGASAQDGNFNLYPIAFAIVDSENDASWDWFLKCLLN-------IIPDE-NDLVFVSERAASIASGLSGNYPLA-----HHGLCTFHLQKNLETHFRG 54 YNIMTSNLAESVNALLKQNREYPI 10 MTRWFNERREESSQHP 447| MQD17.7_Atha_11994228 420 VIIVDATFLKTIYK- ----GVLIFATAQDPNHHHYPLAFAVADGEKDVTWKWFFETLKT-------VIPDS-TELVFMSDRNSSLIKAVAEVYPSS-----HHGNCVYHLSQNVRTKVAY 57 YNIDTSNAVESMNGVFRDVRGYAL 10 FAEWSCNNRKEALSGS 628| OSJNBa0012L23.50_Osat_37535470 512 YLSIDSTALNGKWN- ----GQLASATSIDGHNWMFPVAFGFFQSETTDNWTWFMQQLNK-------AVGNL-PTLAISSDACKGLENAVKNVFQRA-----EHRECFWHLMQNFIKKFQG 54 CDFITNNLAESWNKWIKDMKHLPI 10 TMNLLARRRKIGEKLD 717| OSJNBb0038A07.22_Osat_37536362 423 YLGIDSTVLTGKWR- ----GQLASAIGVDGHNWMFPVAYGVFESESTENWAWFMDKLHM-------AIGSP-VGLVLSTDAGKGIDTTVTRVFNNGV----EHRECMRHLVKNFQKRFSG 55 CDYVTNNIAETFNSWIRHEKSLPV 10 IMERISIRKRLAEKLT 630| OSJNBb0041J20.4_Osat_50921073 499 YIAMDSTHLTGKHR- ----GQLAAAVAIDGNNWLFPVAFGVIEAETTESWTWFVQNLKN-------AIGNP-PGLAISTDAGKGLERAVSDVYPTA-----EHRECMRHLWKNFKKQYHG 55 VDYINNNLSESFNNWVMKIKELHI 10 IIDKFHLRSQLASKME 705| OSJNBb0007E22.22_Osat_50901476 476 IICLDGCHIKTKFG- ----GQLLTAVGIDPNDCIFPIAMAVVEVESFSTWSWFLQTLKDDV-----GIVNT-YPWTIMTDKQKGLIPAVQQLFPDS-----EHRFCVRHLYQNFQQSFKG 55 CDILLNNSCEVFNKYILEAREMPI 10 LMTRFFNKQKEAQKWQ 684| MtrDRAFT_AC124956g12v2_Mtru_92883123 562 LIGLDGCFLKGYYG- ----GQLLSAVGQDGNNHIYVIAYAIVDVENKDNWKWFLELLHKDL-----GDYER-NGWNFISDMQKGLIPAMQEVMPGV-----PHRYCAMHLWRNFTKQWKD 55 ADNIVNNACEVFNAKILNYRGKPI 10 VMRKMSHNKMKLDGRA 770| MtrDRAFT_AC150981g6v1_Mtru_92883944 457 FIGVDGCHLKTKYG- ----GIILVAVGRDPNDQYYPLAFGVCETETKESWKWFLTLLLE-------DIGQD-KRWVFISDQQKGLMAVFDELFESV-----EHRLCLRHLYANFKKKFGG 56 CDVLMNNISEAFNSTILVARDKPI 10 LMNRNSTLRQKVDRWQ 664| MtrDRAFT_AC146590g49v2_Mtru_92891293 485 IIGLDGCFLKGYYG- ----GQILAAVGRDPNDQMLPIVVAVVESETKDSWDWFLKLLVDDLG----GPEAC-NSYTFISDQQKGLLPAMEALLPDV-----EQRFCVRHLYNNFRKKHPG 55 CDVLVNNMSETFNSVIIGPRGKPI 10 FMERWATNRTKIQAYD 694| MQK4.25_Atha_20259876 219 LLELDRAHLKGKYL- ----GAILCAAAVDADDGLFPLAIAIVDNESDENWSWFLSELRKLLGM---NTDSM-PKLTILSERQSAVVEAVETHFPTA-----FHGFCLRYVSENFRDTFKN 54 YGHFGLGITEVLYNWALECHELPI 10 ISSWFDNRRELSMGWN 428| MtrDRAFT_AC159872g18v1_Mtru_92882072 200 IVALGGIQLKSKYL- ----STFLSATSFDADGGLFPLAFAVVDVENDESWTWFLSELHNALEV---NTECM-PQIIFLSDGQKGIVDAIRRKFPRS-----SHAFCMRHLSENIGKEFKN 54 YGHLSSNI-EEFNKWILEAQELPI 10 LKTEFDDRRLKSSSWC 408| OSJNBa0071K19.12_Osat_37530384 562 LIFLDKVPLKATNE- ----YKLLVAAGVDADDGVFPVAFNVVEDENYESWVWFLMQLRYALQN---HNYPY-NAMTFLSSGQKGLDAAVPQVFEES-----HHAFCLHHIMEEFKGELRK 62 YDHFSSNIVDAFNNWIPTKKEGSI 10 IMEVIEARRESCKSWS 779| F10A16.15_Atha_6714398 397 LVFLDSMQLKSKYQ- ----GTLLAATSVDGDDEVFPLAFAVVDAETDDNWEWFLLQLRS-------LLSTP-CYITFVADRQKNLQESIPKVFEKS-----FHAYCLRYLTDELIKDLK- 63 YNHMTSHSGEPFFSWASDANDLPI 10 IMGLIHVRRISANEAN 610| MtrDRAFT_AC137065g7v1_Mtru_92872825 328 VVHVDGTWLYGKYK- ----GTLLLAVAQDGNNKTIPIAFALVEGETKEGWSFFLKNLRR-------HVTKG-ISVCMVSDRHESIKSAFNDPRNGWQATGSAHVYCIRHIKQNFMRTIKD 54 WGHMTSNIVESWNSVFKGTRNLPV 10 LACLFADRAQKAFARV 538/ T3F12.8_Atha_2565007 412 VVIVDATFLKTVYG- ----DMLVFATAQDPNHHNYIIASAVIDRENDASWSWFFNKLKT-------VIPDE-LGLVFVSDRHQSIIKSIMHVFPNA-----RHGHCVWHLSQNVKVRVKT 58 -NIDTSNVCESLNSTFESARKYFL 10 ISEWFNKHRKKSVKYS 620\Insert containing group/FAR1-like FAR1_Atha_5764395 275 VVSFDTTYVKFNDK- ----LPLALFIGVNHHSQPMLLGCALVADESMETFVWLIKTWLR-------AMGGR-APKVILTDQDKFLMSAVSELLPNT-----RHCFALWHVLEKIPEYFSH 64 AGMSTSQRSESVNSFFDKYIHKKI 36 PSPWEKQMATTYTHTI 516| P0466H10.7_Osat_50904427 580 FVSFDTTYKTNRYN- ----MPFAPIVGVTGHGNICIFACAFLGDETTETFKWVFETFLT-------AMGGK-HPETIITDQDLAMRAAIRQVFPNS-----KHRNCLFHILKKCRERSGN 67 -FIHSTALSEGTNARFKRGVGPTH 37 NYMIEEQAADLYNHGI 824| MtrDRAFT_AC150889g5v1_Mtru_87241168 571 VITFDTTYLTNKYD- ----MPFAPFVGVNHHGQSVLLGCALLSNEDTKTFSWLFKTWLE-------CMHGR-APNAIITDQDRAMKKAIEDVFPKA-----RHRWCLWHLMKKVPEKLGR 35 ----ELHDNEWLKGLFDERYRWVP 4 DTFWAGMSTTQRSESM 747| OSJNBa0018M09.21_Osat_50906959 495 LIMFDAAYSTDMYN- ----MPFVPIIGINSHATPFLLGCALLKDEKVETFEWMLRTFLQ-------VMGGK-MPRAVITNQDTSMEKAFAELMPHV-----RLRFCKRHVMSKAQEKLGD 65 -FIDSVGSNEGINSLFKGNMLPKD 35 MQPIEQHAAHIYTREI 735| P0041A24.3_Osat_50924538 265 AVVFNTTHRLPALD- ----MLLGIWVGLNNHGMPCFFGCAFLREESLQSYAWALKVFLN-------FMNRK-APLTILTDENMYLKEAIEKELPGT-----KQALCIWLIAARFPSWFDA 65 ---GLLASPETSKSISVFIQRFSS 39 ATPMERHAAAVLTPYA 507| OJ1123F12.8_Osat_34902578 243 VVVFDSTYRVNKYN- ----LPFIPFVGVNHHGSTVIFGCAVVSDERVGTYEWVLKQFLS-------CMCQK-HPKSVITDGDNAMRRAILLVFPNS-----DHRLCTWHIEQNMARNLSP 60 -GMKSNQRSESLNSKLHRLLDRKM 37 STLIEKDAARVFTPKI 480| LOC_Os11g05340_Osat_62701690 804 VVVFDSTYRVNKYN- ----LPFIPFVGVNHHGSTVIFACAVVSDERVETYEWVLRQFLT-------CMCQK-HPKSVITDGDNAMRRAILHVFPNS-----DHRLCTWHIEQNMARNLSP 60 -SMKSNQRSESLNSKLHRLLDRKM 37 EAIPSFCIARRWTMQA 1041| MtrDRAFT_AC149494g11v1_Mtru_92881542 289 VVGLDTTCRTNKAF- ----RPFVQFLGVNHHKQVLIFAAAFLYDETIESFNWLFRTFIG-------AMSGK-KPKAIITEQDAAIIEAINAVLPET-----NRYTCVWQMYENTLKHLSH 65 -DVKGFHLGEILSHKLRSYLNPDL 36 NVVVLKHASVAYTPRA 530| MtrDRAFT_AC138056g24v1_Mtru_92871971 65 VLAFDATYRKIKYN- ----TPLVIFSDVNHHNQSVIFGSVIVGDETEETYVWLLRQFLE-------AMDGK-APVSVITDGDLSMRNAIRRVFPNA-----HHRLCVWHLARNATSNIKN 60 -GFRTTSRCEDLHSEFGKYVTVLS 37 HKSLEKSASKIYTRSV 302| MtrDRAFT_AC146789g14v2_Mtru_92890825 267 VLDFDATYKKNKYS- ----CPFVVFSGVNHHNQTIIFATFVVSNEVEGTYVWLLEQLLV-------ATKGK-TPVSIIKDGDIAMKNAIKKVFPKS-----YHRLCAWHLIRNAMTNIGN 60 -KIRTTSRWEAFHSHMCQFIHSKM 37 LRSLERFASNQFTKEI 504| LOC_Os03g11630_Osat_108706843 449 YIAMDSTHLTRRSRG -----QLASAIAIDGHNRLFPVAYGVIETESKESWTWFVQNVKK-------AIGTP-KGLVISTDACKGIESVVDDVYPGV-----EHRECMRHLWKNMKKKF-- 57 VDYINNNLSECFNSWVSKTKDRRI 17 RNNFAGKMEGRIIPAI 662| LOC651186_Hsap_89061892 154 SQHLDRLSFQSSKMT 16 NTQGHILYAFLVENKERESRVVHFAVLKAETATSVAKMLSIFTEF------NSDWP-KVKVVFVDPSFHYRAILQEIFPAA-----RILLSIYHTTRLLEKKLHR 84 LFREQQSLLDCILCFVDYIDFFNT 62 QVGMLDTLHQSGSELA 463| LOC428161_Ggal_50759053 206 DQGLERLNFQTSKMK 16 SERGHVLYVLLVESKERVGKIVHWSALKADTGESISKMLTVFKEF------NPEWQ-KVKVVFVDVSFLHKAVLQELFPLA-----QVLLSVYHTVRLLEENVNA 83 ALFGQHSSLEVPTERPEPPRHVVP 26 SSEWEVVQMSTQLISA 478| CHGG_10120_Cglo_88176248 134 LLLLDCTYKTNKHG- ----MPLLDMIGVDATQRSFCVAFAFLSGEAEEDYAWALEQLRSLYE----QCGIT-PPSVILTDRCLAAMNAASNLFPSA-----AILLCLWHANKAVLARCQP 71 FGNTATSRVEGIHALLKSYLRRST 37 GALYGAVRG------- 379|\Chaetomium expansion CHGG_02172_Cglo_88182769 203 LLLLDCTYKTNKHG- ----MPLLDMIGVDATQRSFCVAFAFLSGEAEEDYAWALEQLRSLYE----QCGIT-PPSVILTDRCLAAMNAASNLFPSA-----AILLCLWHANKAVLARCQP 71 FGNTATSRVEGIHALLKSYLRRST 37 GALYGAVRG------- 448| CHGG_05063_Cglo_88180976 232 TLLLDCTYKTNNYG- ----MPLLDMIGVDACQRSFCIAFAFLHGETEEDYCWALDQLRSLYE----VCNAR-TPSVVLTDRCIACMNAVSTCFPSA-----ASLLCLWHANKAILRHCQP 75 FGNVVTSRVEGIHALLKGYLQRST 37 GSLYSGVLLGQNSVLR 488| CHGG_06856_Cglo_88178135 189 LLLLDSTYKTNHHN- ----MPLFNACGVTSGNKTFNWAVTFMSGEKEGDYSCALAALIRILQ----NEGIK-VPGLIVTDRELALLNALNNSAWVSI----PHLLCRWHVNMNVLAKARR 80 MGHTTTQAVESSHAAIKKYLVSSR 42 QQHVGDDDSRNPNVEV 456| CHGG_10731_Cglo_88175445 263 VLGLDNTYKTNRFH- ----MYLFEVIGITDQKSVANFAFGLINTEKEDGFLWLCQQLEDLRQ----DLHVP-APTVVITDKETALKNALTATFPGA-----QQQLCVYHINAKVRARIRS 118 FGQRVNSPVETAHKDVKSFLITGT 38 QQGWLGNIPMTVSYVA 563| CHGG_06961_Cglo_88178240 222 VLGLDNTYKTNRFH- ----MYLFEVIGITDQKSVANFAFGLINTEKEDGFLWLCQQLEDLRQ----DLHVP-APTVVITDKETALKNALTATFPGA-----QQQLCVYHINAKVRARIRS 118 FGQRVNSPVETAHKDVKSFLITGT 38 QQGWLGNIPMTVSYVA 522| CHGG_09514_Cglo_88178032 165 VIGFDNTYKTNRFK- ----MPLFQVTGTADTGSLYNCAFGLASTERREGYDFLLKSPESLRA----EIHVE-RPKVAITDFEDALRSSITGIWPDT-----QLQLCIFHINQNVSLNAKR 114 FGVRTNSPTETAHKDLKSYIVTGN 36 HCDWLGAVSKEVGTKA 459| CHGG_01450_Cglo_88185747 46 LQLYDNTYKTNNKK- ----LAFFQVAGINAMGKIYSCAFGFINNERQEGFDWLMDQVNACRE----SIDAN-PPAVTITDYDKAMKSAISRVYPDA-----DQQLCIFHVNKNVVLNIKR 101 FGHRTTSPVESMNRYLKSFVVNGN 37 GKAWLGEAPYNVASRA 328| CHGG_05516_Cglo_88181429 197 VLSFDNTYNTNRFK- ----LPLFQATGQTCLGTVFNAAFGLVDNERLEGFQFLADGIRQFAI----QHNIR-LPDTILTDFGDQMKQALNEQFPES-----QQQICIHHIISNVLLEAKQ 111 FGCRVTSGTEASNNNVKPYLLNGM 39 GAEYLGELPQVISQKA 491| CHGG_05260_Cglo_88181173 359 VMQIDLTYNTNCFG- ----YPLYQVAGLTGANTIYNSIFGFIDNERKESFDWLCRGTHELRA----EFSVE-PPIVILTDHCKELKAALLEVFPDS-----QQQICIYHVIKNVLLNAKK 96 YGARVTSPTESSNLNIKSYLLDGR 37 SRRYLGSLPNKVTYKA 636| CHGG_01452_Cglo_88185749 331 VMQIDLTYNTNCFG- ----YPLYQVAGLTGANTIYNSIFGFIDNERKESFDWLCRGTHELRA----EFSVE-PPIVILTDHCKELKAALLEVFPDS-----QQQICIYHVIKNVLLNAKK 96 YGARVTSPTESSNLNIKSYLLDGR 37 SRRYLGSLPNKVTYKA 608| CHGG_00175_Cglo_88184472 275 CMLIDLTYNTNYMG- ----MPLYQVNCLTSVGKTLSTMFGLVSDETTQTFRWLMKATKKLRD----KFNIP-EPAVIVTDHCKELKQAISEVFPDS-----QQQTCIFHVIKNVMLNTKR 90 FGVRTTSPTESNNMSIKSYLINGR 37 SRPWLGALPLRVSYKA 546| CHGG_01239_Cglo_88185536 275 CMLIDLTYNTNYMG- ----MPLYQVNCLTSVGKTLSTMFGLVSDETTQTFRWLMKATKKLRD----KFNIP-EPAVIVTDHCKELKQAISEVFPDS-----QQQTCIFHVIKNVMLNTKR 90 FGVRTTSPTESNNMSIKSYLINGR 37 SRPWLGALPLRVSYKA 546| CHGG_02061_Cglo_88186358 434 LGFIDMTYNTNVQG- ----LPLYHFACITATGQAVNTIFGVIDNEKKDSFVFLLQATKELLAAA--DPPIR-QPLVILTDHCKEMKAALDEVFPDV-----QQQICVFHILKNVRLNAAK 80 FGYKATSPVESTNLTTKSYLLNSR 37 SRTWLGPLTTAISYKG 697| CHGG_09383_Cglo_88177901 254 CLSFDNTHSTNALG- ----FPLFVITTQTNINSTANVAFGLINNERREGFDFLAQGVKELQV----QLEAR-SPAVTITDKDERMRDALKETFPDA-----QQQLCRFHINKNFTTEE-- 77 FSQSVTSQTESSNFNIKSYLVTGK 37 HQDYLGDLPQAVSLKA 510| CHGG_09478_Cglo_88177996 254 CLSFDNTHSTNALG- ----FPLFVITTQTNINSTANVAFGLINNERREGFDFLAQGVKELQV----QLEAR-SPAVTITDKDERMRDALKETFPDA-----QQQLCRFHINKNFTTEE-- 77 FSQSVTSQTESSNFNIKSYLVTGK 37 HQDYLGDLPQAVSLKA 510| CHGG_05343_Cglo_88181256 254 CLSFDNTHSTNALG- ----FPLFVITTQTNINSTANVAFGLINNERREGFDFLAQGVKELQV----QLEAR-SPAVTILNKDERMRDALKETFPDA-----QQQLCRFHINKN---EE-- 77 FSQSVTSQTESSNFNIKSYLVTGK 37 HQDYLGDLPQAVSLKA 507| CHGG_02237_Cglo_88182834 1039 CLSFDNTHSTNALG- ----FPLFVITTQTNINSTANVAFGLINNERREGFDFLAQGVKELQV----QLEAR-SPAVTITDKDERMRDALKETFPDA-----QQQLCRFHINKNFTTEE-- 77 FSQSVTSQTESSNFNIKSYLVTGK 37 PQDYLGSEEVPVCDDS 1295/ consensus/90% hl.hD.......... ......hh.h.s.p.......hsh.h...E....a..hh..h...................hhsD....h..sh...h..s.........C..H........... .....ss..E.....h........ ...b............ consensus/80% hl.hDs...p.p... ......lh.h.u.s.p.p...huh.h..sEp...a..hhp.h...........s....s.hhlsD...sh.pulpp.aP.s........bC.hHh.pph...h.. ....sss..Es.p..hpp...... ...h............ Species abbreviations: Acry : Acidiphilium cryptum; Atha : Arabidopsis thaliana; Bcep : Burkholderia cepacia; Bhal : Bacillus halodurans; Bmar : Blastopirellula marina; Cele : Caenorhabditis elegans; Cglo : Chaetomium globosum; Cthe : Clostridium thermocellum; Dhaf : Desulfitobacterium hafniense; Ecol : Escherichia coli; Ecun : Encephalitozoon cuniculi; Efae : Enterococcus faecalis; Faci : Ferroplasma acidarmanus; Ggal : Gallus gallus; Hsap : Homo sapiens; Mbur : Methanococcoides burtonii; Mmag : Magnetospirillum magnetotacticum; Mmaz : Methanosarcina mazei; Mpen : Mycoplasma penetrans; Msp. : Mycobacterium sp.; Mtru : Medicago truncatula; Nsp. : Nitrobacter sp.; Osat : Oryza sativa; Paer : Pseudomonas aeruginosa; Psyr : Pseudomonas syringae; Rmet : Ralstonia metallidurans; Saur : Staphylococcus aureus; Scoe : Streptomyces coelicolor; Smel : Sinorhizobium meliloti; Ssol : Sulfolobus solfataricus; Syn : Synechococcus sp.; Tthe : Thermus thermophilus