SickKids wordmark

TM-Finder Training Sets

The following proteins were used to optimize the TM-Finder. For each protein the TM segments (found experimentally) are listed below the sequence. Click on the "submit this sequence to TM-Finder" button below each sequence to find TM-finder's prediction of TM-segments on this sequence

NAME:
Bacteriorhodopsin - 1AP9
1ap9_ mol:protein length:248 X-Ray Bacteriorhodopsin From Microcrystals

SEQUENCE:
QAQITGRPEWIWLALGTALMGLGTLYFLVKGMGVSDPDAKKFYAITTLVPAIAFTMYLSMLLGYGLTMVPFGGEQNP IYWARYADWLFTTPLLLLDLALLVDADQGTILALVGADGIMIGTGLVGALTKVYSYRFVWWAISTAAMLYILYVLFF GFTSKAESMRPEVASTFKVLRNVTVVLWSAYPVVWLIGSEGAGIVPLNIETLLFMVLDVSAKVGFGLILLRSRAIFG EAEAPEPSAGDGAAATS

EXPERIMENTALLY DETERMINED TM-SEGMENTS:

startend
1030
3962
77101
105127
134156
169191
202224



NAME:
Photoreaction Center (L chain) 1AIG
281 AA; 31325 MW; B40F1B52 CRC32

SEQUENCE:
ALLSFERKYR VPGGTLVGGN LFDFWVGPFY VGFFGVATFF FAALGIILIA WSAVLQGTWNPQLISVYPPA LE YGLGGAPL AKGGLWQIIT ICATGAFVSW ALREVEICRK LGIGYHIPFAFAFAILAYLT LVLFRPVMMG AWG YAFPYGI WTHLDWVSNT GYTYGNFHYN PAHMIAISFFFTNALALALH GALVLSAANP EKGKEMRTPD HEDT FFRDLV GYSIGTLGIH RLGLLLSLSAVFFSALCMII TGTIWFDQWV DWWQWWVKLPWWANIPGGING

EXPERIMENTALLY DETERMINED TM-SEGMENTS:

startend
3356
84111
116132
171198
228249



NAME:
Photoreaction Center (M chain)
307 AA; 34378 MW; ED6FFEC9 CRC32; identity: P02953

SEQUENCE:
AEYQNIFSQV QVRGPADLGM TEDVNLANRS GVGPFSTLLG WFGNAQLGPI YLGSLGVLSLFSGLMWFFTI GI WFWYQAGW NPAVFLRDLF FFSLEPPAPE YGLSFAAPLK EGGLWLIASFFMFVAVWSWW GRTYLRAQAL GMG KHTAWAF LSAIWLWMVL GFIRPILMGS WSEAVPYGIFSHLDWTNNFS LVHGNLFYNP FHGLSIAFLY GSAL LFAMHG ATILAVSRFG GERELEQIADRGTAAERAAL FWRWTMGFNA TMEGIHRWAI WMAVLVTLTG GIGIL LSGTV VDNWYVWGQNHGMAPLN

EXPERIMENTALLY DETERMINED TM-SEGMENTS:

startend
5478
113140
145161
200229
264286



NAME:
Photoreaction Center (H chain)
260 AA; 28035 MW; 12B9CD39 CRC32; identity P11846

SEQUENCE:
MVGVTAFGNF DLASLAIYSF WIFLAGLIYY LQTENMREGY PLENEDGTPA ANQGPFPLPKPKTFILPHGR GT LTVPGPES EDRPIALART AVSEGFPHAP TGDPMKDGVG PASWVARRDLPELDGHGHNK IKPMKAAAGF HVS AGKNPIG LPVRGCDLEI AGKVVDIWVD IPEQMARFLEVELKDGSTRL LPMQMVKVQS NRVHVNALSS DLFA GIPTIK SPTEVTLLEE DKICGYVAGGLMYAAPKRKS VVAAMLAEYA

EXPERIMENTALLY DETERMINED TM-SEGMENTS:

startend
1236



NAME:
Light harvesting complexes (A,D,G,J) 1LGH
56 AA; 5940 MW; D38352F7 CRC32; identity: P97253 REALLY: P26789

SEQUENCE:
SNPKDDYKIW LVINPSTWLP VIWIVATVVA IAVHAAVLAA PGFNWIALGA AKSAAK

EXPERIMENTALLY DETERMINED TM-SEGMENTS:

startend
1539



NAME:
Light harvesting complexes (B,E,H,K)
45 AA; 5116 MW; 2F049C00 CRC32;identity: P95673 REALLY: P26790

SEQUENCE:
AERSLSGLTE EEAIAVHDQF KTTFSAFIIL AAVAHVLVWV WKPWF

EXPERIMENTALLY DETERMINED TM-SEGMENTS:

startend
1041



NAME:
Photosystem I(PsaA) P25896
>gi|131139|sp|P25896|PSAA_SYNEN PHOTOSYSTEM I P700 CHLOROPHYLL A APOPROTEIN A1

SEQUENCE:
MTISPPEREPKVRVVVDNDPVPTSFEKWAKPGHFDRTLARGPQTTTWIWNLHALAHDFDTHTSDLEDISRKIFSAHF GHLAVVFIWLSGMYFHGAKFSNYEAWLADPTGIKPSAQVVWPIVGQGILNGDVGGGFHGIQITSGLFQLWRASGITN EFQLYCTAIGGLVMAGLMLFAGWFHYHKRAPKLEWFQNVESMLNHHLAGLLGLGSLAWAGHQIHVSLPINKLLDAGV AAKDIPLPHEFILNPSLMAELYPKVDWGFFSGVIPFFTFNWAAYSDFLTFNGGLNPVTGGLWLSDTAHHHLAIAVLF IIAGHMYRTNWGIGHSLKEILEAHKGPFTGAGHKGLYEVLTTSWHAQLAINLAMMGSLSIIVAQHMYAMPPYPYLAT DYPTQLSLFTHHMWIGGFLVVGGAAHGAIFMVRDYDPAMNQNNVLDRVLRHRDAIISHLNWVCIFLGFHSFGLYVHN DTMRAFGRPQDMFSDTGIQLQPVFAQWVQNLHTLAPGGTAPNAAATASVAFGGDVVAVGGKVAMMPIVLGTADFMVH HIHAFTIHVTVLILLKGVLFARSSRLIPDKANLGFRFPCDGPGRGGTCQVSGWDHVFLGLFWMYNCISVVIFHFSWK MQSDVWGTVAPDGTVSHITGGNFAQSAITINGWLRDFLWAQASQVIGSYGSALSAYGLLFLGAHFIWAFSLMFLFSG RGYWQELIESIVWAHNKLKVAPAIQPRALSIIQGRAVGVAHYLLGGIATTWAFFLARIISVG

EXPERIMENTALLY DETERMINED TM-SEGMENTS:

startend
7295
161181
197221
297315
352375
391417
440461
537554
594615
670690
729749



NAME:
Photosystem I (PsaB), P25897
>gi|131151|sp|P25897|PSAB_SYNEN PHOTOSYSTEM I P700 CHLOROPHYLL A APOPROTEIN A2

SEQUENCE:
MATKFPKFSQDLAQDPTTRRIWYAIAMAHDFESHDGMTEENLYQKIFASHFGHLAIIFLWVSGSLFHVAWQGNFEQW VQDPVNTRPIAHAIWDPQFGKAAVDAFTQAGASNPVDIAYSGVYHWWYTIGMRTNGDLYQGAIFLLILASLALFAGW LHLQPKFRPSLSWFKNAESRLNHHLAGLFGVSSLAWAGHLIHVAIPESRGQHVGWDNFLSTMPHPAGLAPFFTGNWG VYAQNPDTASHVFGTAQGAGTAILTFLGGFHPQTESLWLTDMAHHHLAIAVLFIVAGHMYRTQFGIGHSIKEMMDAK DFFGTKVEGPFNMPHQGIYETYNNSLHFQLGWHLACLGVITSLVAQHMYSLPPYAFIAQDHTTMAALYTHHQYIAGF LMVGAFAHGAIFLVRDYDPAQNKGNVLDRVLQHKEAIISHLSWVSLFLGFHTLGLYVHNDVVVAFGTPEKQILIEPV FAQFIQAAHGKLLYGFDTLLSNPDSIASTAWPNYGNVWLPGWLDAINSGTNSLFLTIGPGDFLVHHAIALGLHTTTL ILVKGALDARGSKLMPDKKDFGYAFPCDGPGRGGTCDISAWDAFYLAMFWMLNTIGWVTFYWHWKHLGVWEGNVAQF NESSTYLMGWLRDYLWLNSSQLINGYNPFGTNNLSVWAWMFLFGHLVWATGFMFLISWRGYWQELIETLVWAHERTP LANLVRWKDKPVALSIVQARLVGLAHFSVGYILTYAAFLIASTAAKFG

EXPERIMENTALLY DETERMINED TM-SEGMENTS:

startend
4565
134154
174198
272290
333353
372398
421442
524541
581602
650670
713733



NAME:
Photosystem I (PsaI), P25900
>gi|131208|sp|P25900|PSAI_SYNEN PHOTOSYSTEM I REACTION CENTRE SUBUNIT VIII

SEQUENCE:
MMGSYAASFLPWIFIPVVCWLMPTVVMGLLFLYIEGEA

EXPERIMENTALLY DETERMINED TM-SEGMENTS:

startend
1034



NAME:
Photosystem I (PsaJ), P25901
>gi|131216|sp|P25901|PSAJ_SYNEN PHOTOSYSTEM I REACTION CENTRE SUBUNIT IX

SEQUENCE:
MKHFLTYLSTAPVLAAIWMTITAGILIEFNRFYPDLLFHPL

EXPERIMENTALLY DETERMINED TM-SEGMENTS:

startend
721



NAME:
Photosystem I (PsaK), P20453
>gi|131223|sp|P20453|PSAK_SYNEN PHOTOSYSTEM I REACTION CENTRE SUBUNIT X PRECURSOR (LIGHT-HARVESTING 8.0 KD POLYPEPTIDE)

SEQUENCE:
MVLATLPDTTWTPSVGLVVILCNLFAIALGRYAIQSRGKGPGLPIALPALFEGFGLPELLATTSFGHLLAAGVVSGLQYAGAL

EXPERIMENTALLY DETERMINED TM-SEGMENTS:

startend
1230
6883



NAME:
Photosystem I (PsaL), P25902
>gi|131226|sp|P25902|PSAL_SYNEN PHOTOSYSTEM I REACTION CENTRE SUBUNIT XI

SEQUENCE:
MAEELVKPYNGDPFVGHLSTPISDSGLVKTFIGNLPAYRQGLSPILPGLEVGMAHGYFLIGPWVKLGPLRDSDVANL GGLISGIALILVATACLAAYGLVSFQKGGSSSDPLKTSEGWSQFTAGFFVGAMGSAFVAFFLLENFLLSMAS

EXPERIMENTALLY DETERMINED TM-SEGMENTS:

startend
7999
124144



NAME:
Cytochrine C Oxidase (A chain, I) 1OCC, 2.8A, p00396
>gi|116969|sp|P00396|COX1_BOVIN CYTOCHROME C OXIDASE POLYPEPTIDE I

SEQUENCE:
MFINRWLFSTNHKDIGTLYLLFGAWAGMVGTALSLLIRAELGQPGTLLGDDQIYNVVVTAHAFVMIFFMVMPIMIGGF GNWLVPLMIGAPDMAFPRMNNMSFWLLPPSFLLLLASSMVEAGAGTGWTVYPPLAGNLAHAGASVDLTIFSLHLAGV SSILGAINFITTIINMKPPAMSQYQTPLFVWSVMITAVLLLLSLPVLAAGITMLLTDRNLNTTFFDPAGGGDPILYQ HLFWFFGHPEVYILILPGFGMISHIVTYYSGKKEPFGYMGMVWAMMSIGFLGFIVWAHHMFTVGMDVDTRAYFTSAT MIIAIPTGVKVFSWLATLHGGNIKWSPAMMWALGFIFLFTVGGLTGIVLANSSLDIVLHDTYYVVAHFHYVLSMGAV FAIMGGFVHWFPLFSGYTLNDTWAKIHFAIMFVGVNMTFFPQHFLGLSGMPRRYSDYPDAYTMWNTISSMGSFISLT AVMLMVFIIWEAFASKREVLTVDLTTTNLEWLNGCPPPYHTFEEPTYVNLK

EXPERIMENTALLY DETERMINED TM-SEGMENTS:

startend
1241
5187
95116
141170
183212
228262
270285
299327
336359
371401
407433
445478



NAME:
Cytochrome C Oxidase (B chain, II) P00404
>gi|117010|sp|P00404|COX2_BOVIN CYTOCHROME C OXIDASE POLYPEPTIDE II

SEQUENCE:
MAYPMQLGFQDATSPIMEELLHFHDHTLMIVFLISSLVLYIISLMLTTKLTHTSTMDAQEVETIWTILPAIILILIA LPSLRILYMMDEINNPSLTVKTMGHQWYWSYEYTDYEDLSFDSYMIPTSELKPGELRLLEVDNRVVLPMEMTIRMLV SSEDVLHSWAVPSLGLKTDAIPGRLNQTTLMSSRPGLYYGQCSEICGSNHSFMPIVLELVPLKYFEKWSASML

EXPERIMENTALLY DETERMINED TM-SEGMENTS:

startend
1545
6087



NAME:
Cytochrome C Oxidase (C chain, III) P00415
>gi|117055|sp|P00415|COX3_BOVIN CYTOCHROME C OXIDASE POLYPEPTIDE III

SEQUENCE:
MTHQTHAYHMVNPSPWPLTGALSALLMTSGLTMWFHFNSMTLLMIGLTTNMLTMYQWWRDVIRESTFQGHHTPAVQK GLRYGMILFIISEVLFFTGFFWAFYHSSLAPTPELGGCWPPTGIHPLNPLEVPLLNTSVLLASGVSITWAHHSLMEG DRKHMLQALFITITLGVYFTLLQASEYYEAPFTISDGVYGSTFFVATGFHGLHVIIGSTFLIVCFFRQLKFHFTSNH HFGFEAGAWYWHFVDVVWLFLYVSIYWWGS

EXPERIMENTALLY DETERMINED TM-SEGMENTS:

startend
1637
4166
73106
129153
156183
191223
233255



NAME:
Cytochrome C Oxidase (D chain, IV),P00423
>gi|117085|sp|P00423|COX4_BOVIN CYTOCHROME C OXIDASE POLYPEPTIDE IV PRECURSOR

SEQUENCE:
MLATRVFSLIGRRAISTSVCVRAHGSVVKSEDYALPSYVDRRDYPLPDVAHVKNLSASQKALKEKEKASWSSLSIDE KVELYRLKFKESFAEMNRSTNEWKTVVGAAMFFIGFTALLLIWEKHYVYGPIPHTFEEEWVAKQTKRMLDMKVAPIQ GFSAKWDYDKNEWKK

EXPERIMENTALLY DETERMINED TM-SEGMENTS:

startend
77102



NAME:
Cytochrome C Oxidase (G chain, VI-A),P07471
>gi|117108|sp|P07471|COXD_BOVIN CYTOCHROME C OXIDASE POLYPEPTIDE VIA-HEART PRECURSOR (COXVIAH) (VIB)

SEQUENCE:
MALPLKSLSRGLASAAKGDHGGTGARTWRFLTFGLALPSVALCTLNSWLHSGHRERPAFIPYHHLRIRTKPFSWGDG NHTFFHNPRVNPLPTGYEKP

EXPERIMENTALLY DETERMINED TM-SEGMENTS:

startend
1337



NAME:
Cytochrome C Oxidase (J chain, VII-A), P07470
>gi|117122|sp|P07470|COXK_BOVIN CYTOCHROME C OXIDASE POLYPEPTIDE VIIA-HEART PRECURSOR (COX VIIA-M) (VIIIC)

SEQUENCE:
STALAKPQMRGLLARRLRFHIVGAFMVSLGFATFYKFAVAEKRKKAYADFYRNYDSMKDFEEMRKAGIFQSAK

EXPERIMENTALLY DETERMINED TM-SEGMENTS:

startend
2654



NAME:

SEQUENCE:
MRALRVSQALVRSFSSTARNRFENRVAEKQKLFQEDNGLPVHLKGGATDNILYRVTMTLCLGGTLYSLYCLGWASFPHKK

EXPERIMENTALLY DETERMINED TM-SEGMENTS:



NAME:
Cytochrome C Oxidase (K chain, VII-B), P13183
>gi|117124|sp|P13183|COXM_BOVIN CYTOCHROME C OXIDASE POLYPEPTIDE VIIB PRECURSOR (IHQ)

SEQUENCE:
MFNLRMFPLAKNALSRLRVQSIQQAVQAVARQIHQKRAPDFHDKYGNAVLASGATFCVAVWVYMATQIGIEWNPSPV GRVTPKEWREQ

EXPERIMENTALLY DETERMINED TM-SEGMENTS:

startend
935



NAME:
Cytochrome C Oxidase (L chain, VII-C), P00430
>gi|117126|sp|P00430|COXO_BOVIN CYTOCHROME C OXIDASE POLYPEPTIDE VIIC PRECURSOR (VIIIA)

SEQUENCE:
MLGQSIRRFTTSVVRRSHYEEGPGKNIPFSVENKWRLLAMMTLFFGSGFAAPFFIVRHQLLKK

EXPERIMENTALLY DETERMINED TM-SEGMENTS:

startend
1844



NAME:
Cytochrome C Oxidase (M chain, VIII), P10175
>gi|1169070|sp|P10175|COXQ_BOVIN CYTOCHROME C OXIDASE POLYPEPTIDE VIII-HEART PRECURSOR (VIIIB) (IX)

SEQUENCE:
MLRLAPTVRLLQAPLRGWAVPKAHITAKPAKTPTSPKEQAIGLSVTFLSFLLPAGWVLYHLDNYKKSSAA

EXPERIMENTALLY DETERMINED TM-SEGMENTS:

startend
1235



Click here to go to TM-Finder help

Click here to go to TM-Finder introductory page


Experiencing problems with this web tool? Contact