TM-Finder Training Sets
The following proteins were used to optimize the TM-Finder. For each protein the TM segments (found experimentally) are listed below the sequence. Click on the "submit this sequence to TM-Finder" button below each sequence to find TM-finder's prediction of TM-segments on this sequence
NAME:
Bacteriorhodopsin - 1AP9
1ap9_ mol:protein length:248 X-Ray Bacteriorhodopsin From Microcrystals
SEQUENCE:
QAQITGRPEWIWLALGTALMGLGTLYFLVKGMGVSDPDAKKFYAITTLVPAIAFTMYLSMLLGYGLTMVPFGGEQNP
IYWARYADWLFTTPLLLLDLALLVDADQGTILALVGADGIMIGTGLVGALTKVYSYRFVWWAISTAAMLYILYVLFF
GFTSKAESMRPEVASTFKVLRNVTVVLWSAYPVVWLIGSEGAGIVPLNIETLLFMVLDVSAKVGFGLILLRSRAIFG
EAEAPEPSAGDGAAATS
EXPERIMENTALLY DETERMINED TM-SEGMENTS:
start | end |
10 | 30 |
39 | 62 |
77 | 101 |
105 | 127 |
134 | 156 |
169 | 191 |
202 | 224 |
NAME:
Photoreaction Center (L chain) 1AIG
281 AA; 31325 MW; B40F1B52 CRC32
SEQUENCE:
ALLSFERKYR VPGGTLVGGN LFDFWVGPFY VGFFGVATFF FAALGIILIA WSAVLQGTWNPQLISVYPPA LE
YGLGGAPL AKGGLWQIIT ICATGAFVSW ALREVEICRK LGIGYHIPFAFAFAILAYLT LVLFRPVMMG AWG
YAFPYGI WTHLDWVSNT GYTYGNFHYN PAHMIAISFFFTNALALALH GALVLSAANP EKGKEMRTPD HEDT
FFRDLV GYSIGTLGIH RLGLLLSLSAVFFSALCMII TGTIWFDQWV DWWQWWVKLPWWANIPGGING
EXPERIMENTALLY DETERMINED TM-SEGMENTS:
start | end |
33 | 56 |
84 | 111 |
116 | 132 |
171 | 198 |
228 | 249 |
NAME:
Photoreaction Center (M chain)
307 AA; 34378 MW; ED6FFEC9 CRC32; identity: P02953
SEQUENCE:
AEYQNIFSQV QVRGPADLGM TEDVNLANRS GVGPFSTLLG WFGNAQLGPI YLGSLGVLSLFSGLMWFFTI GI
WFWYQAGW NPAVFLRDLF FFSLEPPAPE YGLSFAAPLK EGGLWLIASFFMFVAVWSWW GRTYLRAQAL GMG
KHTAWAF LSAIWLWMVL GFIRPILMGS WSEAVPYGIFSHLDWTNNFS LVHGNLFYNP FHGLSIAFLY GSAL
LFAMHG ATILAVSRFG GERELEQIADRGTAAERAAL FWRWTMGFNA TMEGIHRWAI WMAVLVTLTG GIGIL
LSGTV VDNWYVWGQNHGMAPLN
EXPERIMENTALLY DETERMINED TM-SEGMENTS:
start | end |
54 | 78 |
113 | 140 |
145 | 161 |
200 | 229 |
264 | 286 |
NAME:
Photoreaction Center (H chain)
260 AA; 28035 MW; 12B9CD39 CRC32; identity P11846
SEQUENCE:
MVGVTAFGNF DLASLAIYSF WIFLAGLIYY LQTENMREGY PLENEDGTPA ANQGPFPLPKPKTFILPHGR GT
LTVPGPES EDRPIALART AVSEGFPHAP TGDPMKDGVG PASWVARRDLPELDGHGHNK IKPMKAAAGF HVS
AGKNPIG LPVRGCDLEI AGKVVDIWVD IPEQMARFLEVELKDGSTRL LPMQMVKVQS NRVHVNALSS DLFA
GIPTIK SPTEVTLLEE DKICGYVAGGLMYAAPKRKS VVAAMLAEYA
EXPERIMENTALLY DETERMINED TM-SEGMENTS:
start | end |
12 | 36 |
NAME:
Light harvesting complexes (A,D,G,J) 1LGH
56 AA; 5940 MW; D38352F7 CRC32; identity: P97253 REALLY: P26789
SEQUENCE:
SNPKDDYKIW LVINPSTWLP VIWIVATVVA IAVHAAVLAA PGFNWIALGA AKSAAK
EXPERIMENTALLY DETERMINED TM-SEGMENTS:
start | end |
15 | 39 |
NAME:
Light harvesting complexes (B,E,H,K)
45 AA; 5116 MW; 2F049C00 CRC32;identity: P95673 REALLY: P26790
SEQUENCE:
AERSLSGLTE EEAIAVHDQF KTTFSAFIIL AAVAHVLVWV WKPWF
EXPERIMENTALLY DETERMINED TM-SEGMENTS:
start | end |
10 | 41 |
NAME:
Photosystem I(PsaA) P25896
>gi|131139|sp|P25896|PSAA_SYNEN PHOTOSYSTEM I P700 CHLOROPHYLL A APOPROTEIN A1
SEQUENCE:
MTISPPEREPKVRVVVDNDPVPTSFEKWAKPGHFDRTLARGPQTTTWIWNLHALAHDFDTHTSDLEDISRKIFSAHF
GHLAVVFIWLSGMYFHGAKFSNYEAWLADPTGIKPSAQVVWPIVGQGILNGDVGGGFHGIQITSGLFQLWRASGITN
EFQLYCTAIGGLVMAGLMLFAGWFHYHKRAPKLEWFQNVESMLNHHLAGLLGLGSLAWAGHQIHVSLPINKLLDAGV
AAKDIPLPHEFILNPSLMAELYPKVDWGFFSGVIPFFTFNWAAYSDFLTFNGGLNPVTGGLWLSDTAHHHLAIAVLF
IIAGHMYRTNWGIGHSLKEILEAHKGPFTGAGHKGLYEVLTTSWHAQLAINLAMMGSLSIIVAQHMYAMPPYPYLAT
DYPTQLSLFTHHMWIGGFLVVGGAAHGAIFMVRDYDPAMNQNNVLDRVLRHRDAIISHLNWVCIFLGFHSFGLYVHN
DTMRAFGRPQDMFSDTGIQLQPVFAQWVQNLHTLAPGGTAPNAAATASVAFGGDVVAVGGKVAMMPIVLGTADFMVH
HIHAFTIHVTVLILLKGVLFARSSRLIPDKANLGFRFPCDGPGRGGTCQVSGWDHVFLGLFWMYNCISVVIFHFSWK
MQSDVWGTVAPDGTVSHITGGNFAQSAITINGWLRDFLWAQASQVIGSYGSALSAYGLLFLGAHFIWAFSLMFLFSG
RGYWQELIESIVWAHNKLKVAPAIQPRALSIIQGRAVGVAHYLLGGIATTWAFFLARIISVG
EXPERIMENTALLY DETERMINED TM-SEGMENTS:
start | end |
72 | 95 |
161 | 181 |
197 | 221 |
297 | 315 |
352 | 375 |
391 | 417 |
440 | 461 |
537 | 554 |
594 | 615 |
670 | 690 |
729 | 749 |
NAME:
Photosystem I (PsaB), P25897
>gi|131151|sp|P25897|PSAB_SYNEN PHOTOSYSTEM I P700 CHLOROPHYLL A APOPROTEIN A2
SEQUENCE:
MATKFPKFSQDLAQDPTTRRIWYAIAMAHDFESHDGMTEENLYQKIFASHFGHLAIIFLWVSGSLFHVAWQGNFEQW
VQDPVNTRPIAHAIWDPQFGKAAVDAFTQAGASNPVDIAYSGVYHWWYTIGMRTNGDLYQGAIFLLILASLALFAGW
LHLQPKFRPSLSWFKNAESRLNHHLAGLFGVSSLAWAGHLIHVAIPESRGQHVGWDNFLSTMPHPAGLAPFFTGNWG
VYAQNPDTASHVFGTAQGAGTAILTFLGGFHPQTESLWLTDMAHHHLAIAVLFIVAGHMYRTQFGIGHSIKEMMDAK
DFFGTKVEGPFNMPHQGIYETYNNSLHFQLGWHLACLGVITSLVAQHMYSLPPYAFIAQDHTTMAALYTHHQYIAGF
LMVGAFAHGAIFLVRDYDPAQNKGNVLDRVLQHKEAIISHLSWVSLFLGFHTLGLYVHNDVVVAFGTPEKQILIEPV
FAQFIQAAHGKLLYGFDTLLSNPDSIASTAWPNYGNVWLPGWLDAINSGTNSLFLTIGPGDFLVHHAIALGLHTTTL
ILVKGALDARGSKLMPDKKDFGYAFPCDGPGRGGTCDISAWDAFYLAMFWMLNTIGWVTFYWHWKHLGVWEGNVAQF
NESSTYLMGWLRDYLWLNSSQLINGYNPFGTNNLSVWAWMFLFGHLVWATGFMFLISWRGYWQELIETLVWAHERTP
LANLVRWKDKPVALSIVQARLVGLAHFSVGYILTYAAFLIASTAAKFG
EXPERIMENTALLY DETERMINED TM-SEGMENTS:
start | end |
45 | 65 |
134 | 154 |
174 | 198 |
272 | 290 |
333 | 353 |
372 | 398 |
421 | 442 |
524 | 541 |
581 | 602 |
650 | 670 |
713 | 733 |
NAME:
Photosystem I (PsaI), P25900
>gi|131208|sp|P25900|PSAI_SYNEN PHOTOSYSTEM I REACTION CENTRE SUBUNIT VIII
SEQUENCE:
MMGSYAASFLPWIFIPVVCWLMPTVVMGLLFLYIEGEA
EXPERIMENTALLY DETERMINED TM-SEGMENTS:
start | end |
10 | 34 |
NAME:
Photosystem I (PsaJ), P25901
>gi|131216|sp|P25901|PSAJ_SYNEN PHOTOSYSTEM I REACTION CENTRE SUBUNIT IX
SEQUENCE:
MKHFLTYLSTAPVLAAIWMTITAGILIEFNRFYPDLLFHPL
EXPERIMENTALLY DETERMINED TM-SEGMENTS:
start | end |
7 | 21 |
NAME:
Photosystem I (PsaK), P20453
>gi|131223|sp|P20453|PSAK_SYNEN PHOTOSYSTEM I REACTION CENTRE SUBUNIT X PRECURSOR (LIGHT-HARVESTING 8.0 KD POLYPEPTIDE)
SEQUENCE:
MVLATLPDTTWTPSVGLVVILCNLFAIALGRYAIQSRGKGPGLPIALPALFEGFGLPELLATTSFGHLLAAGVVSGLQYAGAL
EXPERIMENTALLY DETERMINED TM-SEGMENTS:
start | end |
12 | 30 |
68 | 83 |
NAME:
Photosystem I (PsaL), P25902
>gi|131226|sp|P25902|PSAL_SYNEN PHOTOSYSTEM I REACTION CENTRE SUBUNIT XI
SEQUENCE:
MAEELVKPYNGDPFVGHLSTPISDSGLVKTFIGNLPAYRQGLSPILPGLEVGMAHGYFLIGPWVKLGPLRDSDVANL
GGLISGIALILVATACLAAYGLVSFQKGGSSSDPLKTSEGWSQFTAGFFVGAMGSAFVAFFLLENFLLSMAS
EXPERIMENTALLY DETERMINED TM-SEGMENTS:
start | end |
79 | 99 |
124 | 144 |
NAME:
Cytochrine C Oxidase (A chain, I) 1OCC, 2.8A, p00396
>gi|116969|sp|P00396|COX1_BOVIN CYTOCHROME C OXIDASE POLYPEPTIDE I
SEQUENCE:
MFINRWLFSTNHKDIGTLYLLFGAWAGMVGTALSLLIRAELGQPGTLLGDDQIYNVVVTAHAFVMIFFMVMPIMIGGF
GNWLVPLMIGAPDMAFPRMNNMSFWLLPPSFLLLLASSMVEAGAGTGWTVYPPLAGNLAHAGASVDLTIFSLHLAGV
SSILGAINFITTIINMKPPAMSQYQTPLFVWSVMITAVLLLLSLPVLAAGITMLLTDRNLNTTFFDPAGGGDPILYQ
HLFWFFGHPEVYILILPGFGMISHIVTYYSGKKEPFGYMGMVWAMMSIGFLGFIVWAHHMFTVGMDVDTRAYFTSAT
MIIAIPTGVKVFSWLATLHGGNIKWSPAMMWALGFIFLFTVGGLTGIVLANSSLDIVLHDTYYVVAHFHYVLSMGAV
FAIMGGFVHWFPLFSGYTLNDTWAKIHFAIMFVGVNMTFFPQHFLGLSGMPRRYSDYPDAYTMWNTISSMGSFISLT
AVMLMVFIIWEAFASKREVLTVDLTTTNLEWLNGCPPPYHTFEEPTYVNLK
EXPERIMENTALLY DETERMINED TM-SEGMENTS:
start | end |
12 | 41 |
51 | 87 |
95 | 116 |
141 | 170 |
183 | 212 |
228 | 262 |
270 | 285 |
299 | 327 |
336 | 359 |
371 | 401 |
407 | 433 |
445 | 478 |
NAME:
Cytochrome C Oxidase (B chain, II) P00404
>gi|117010|sp|P00404|COX2_BOVIN CYTOCHROME C OXIDASE POLYPEPTIDE II
SEQUENCE:
MAYPMQLGFQDATSPIMEELLHFHDHTLMIVFLISSLVLYIISLMLTTKLTHTSTMDAQEVETIWTILPAIILILIA
LPSLRILYMMDEINNPSLTVKTMGHQWYWSYEYTDYEDLSFDSYMIPTSELKPGELRLLEVDNRVVLPMEMTIRMLV
SSEDVLHSWAVPSLGLKTDAIPGRLNQTTLMSSRPGLYYGQCSEICGSNHSFMPIVLELVPLKYFEKWSASML
EXPERIMENTALLY DETERMINED TM-SEGMENTS:
start | end |
15 | 45 |
60 | 87 |
NAME:
Cytochrome C Oxidase (C chain, III) P00415
>gi|117055|sp|P00415|COX3_BOVIN CYTOCHROME C OXIDASE POLYPEPTIDE III
SEQUENCE:
MTHQTHAYHMVNPSPWPLTGALSALLMTSGLTMWFHFNSMTLLMIGLTTNMLTMYQWWRDVIRESTFQGHHTPAVQK
GLRYGMILFIISEVLFFTGFFWAFYHSSLAPTPELGGCWPPTGIHPLNPLEVPLLNTSVLLASGVSITWAHHSLMEG
DRKHMLQALFITITLGVYFTLLQASEYYEAPFTISDGVYGSTFFVATGFHGLHVIIGSTFLIVCFFRQLKFHFTSNH
HFGFEAGAWYWHFVDVVWLFLYVSIYWWGS
EXPERIMENTALLY DETERMINED TM-SEGMENTS:
start | end |
16 | 37 |
41 | 66 |
73 | 106 |
129 | 153 |
156 | 183 |
191 | 223 |
233 | 255 |
NAME:
Cytochrome C Oxidase (D chain, IV),P00423
>gi|117085|sp|P00423|COX4_BOVIN CYTOCHROME C OXIDASE POLYPEPTIDE IV PRECURSOR
SEQUENCE:
MLATRVFSLIGRRAISTSVCVRAHGSVVKSEDYALPSYVDRRDYPLPDVAHVKNLSASQKALKEKEKASWSSLSIDE
KVELYRLKFKESFAEMNRSTNEWKTVVGAAMFFIGFTALLLIWEKHYVYGPIPHTFEEEWVAKQTKRMLDMKVAPIQ
GFSAKWDYDKNEWKK
EXPERIMENTALLY DETERMINED TM-SEGMENTS:
start | end |
77 | 102 |
NAME:
Cytochrome C Oxidase (G chain, VI-A),P07471
>gi|117108|sp|P07471|COXD_BOVIN CYTOCHROME C OXIDASE POLYPEPTIDE VIA-HEART PRECURSOR (COXVIAH) (VIB)
SEQUENCE:
MALPLKSLSRGLASAAKGDHGGTGARTWRFLTFGLALPSVALCTLNSWLHSGHRERPAFIPYHHLRIRTKPFSWGDG
NHTFFHNPRVNPLPTGYEKP
EXPERIMENTALLY DETERMINED TM-SEGMENTS:
start | end |
13 | 37 |
NAME:
Cytochrome C Oxidase (J chain, VII-A), P07470
>gi|117122|sp|P07470|COXK_BOVIN CYTOCHROME C OXIDASE POLYPEPTIDE VIIA-HEART PRECURSOR (COX VIIA-M) (VIIIC)
SEQUENCE:
STALAKPQMRGLLARRLRFHIVGAFMVSLGFATFYKFAVAEKRKKAYADFYRNYDSMKDFEEMRKAGIFQSAK
EXPERIMENTALLY DETERMINED TM-SEGMENTS:
start | end |
26 | 54 |
NAME:
SEQUENCE:
MRALRVSQALVRSFSSTARNRFENRVAEKQKLFQEDNGLPVHLKGGATDNILYRVTMTLCLGGTLYSLYCLGWASFPHKK
EXPERIMENTALLY DETERMINED TM-SEGMENTS:
NAME:
Cytochrome C Oxidase (K chain, VII-B), P13183
>gi|117124|sp|P13183|COXM_BOVIN CYTOCHROME C OXIDASE POLYPEPTIDE VIIB PRECURSOR (IHQ)
SEQUENCE:
MFNLRMFPLAKNALSRLRVQSIQQAVQAVARQIHQKRAPDFHDKYGNAVLASGATFCVAVWVYMATQIGIEWNPSPV
GRVTPKEWREQ
EXPERIMENTALLY DETERMINED TM-SEGMENTS:
start | end |
9 | 35 |
NAME:
Cytochrome C Oxidase (L chain, VII-C), P00430
>gi|117126|sp|P00430|COXO_BOVIN CYTOCHROME C OXIDASE POLYPEPTIDE VIIC PRECURSOR (VIIIA)
SEQUENCE:
MLGQSIRRFTTSVVRRSHYEEGPGKNIPFSVENKWRLLAMMTLFFGSGFAAPFFIVRHQLLKK
EXPERIMENTALLY DETERMINED TM-SEGMENTS:
start | end |
18 | 44 |
NAME:
Cytochrome C Oxidase (M chain, VIII), P10175
>gi|1169070|sp|P10175|COXQ_BOVIN CYTOCHROME C OXIDASE POLYPEPTIDE VIII-HEART PRECURSOR (VIIIB) (IX)
SEQUENCE:
MLRLAPTVRLLQAPLRGWAVPKAHITAKPAKTPTSPKEQAIGLSVTFLSFLLPAGWVLYHLDNYKKSSAA
EXPERIMENTALLY DETERMINED TM-SEGMENTS:
start | end |
12 | 35 |
Click here to go to TM-Finder help
Click here to go to TM-Finder introductory page
Experiencing problems with this web tool? Contact