HFE
(Assignment 3 - Sequence manipulation)

mRNA Protein
EXON Coding EXON
coords
length
coords
length
1 - 305
305 bp
222 - 305
84 bp
3600 - 3885
286 bp
3600 - 3885
286 bp
4095 - 4372
278 bp
4095 - 4372
278 bp
5466 - 5743
278 bp
5466 - 5477
12 bp
5900 - 6015
116 bp
6990 - 8022
1033 bp
9177 - 9610
434 bp
Total:
2730 bp
Total:
220 aa


mRNA join(1..305,3600..3885,4095..4372,5466..5743,5900..6015,6990..8022,9177..9610)
    /product="hemochromatosis protein, transcript variant 12"
/note="unclassified transcription discrepancy"
/transcript_id="NM_77777.1"
/db_xref="GI:210403387"
/db_xref="GeneID:3077"
/db_xref="LocusID:3077"
/db_xref="MIM:235200"
CDS join(222..305,3600..3885,4095..4372,5466..5477)
    /codon_start=1
/product="hemochromatosis protein isoform 2 precursor"
/protein_id="NP_77777.2"
/db_xref="GI:210403397"
/db_xref="GeneID:3077"
/db_xref="LocusID:3077"
/db_xref="MIM:235200"


Sequence of:
mRNA 1
51
101
151
201
251
301
351
401
451
501
551
601
651
701
751
801
851
901
951
1001
1051
1101
1151
1201
1251
1301
1351
1401
1451
1501
1551
1601
1651
1701
1751
1801
1851
1901
1951
2001
2051
2101
2151
2201
2251
2301
2351
2401
2451
2501
2551
2601
2651
2701
GGGGACACTGGATCACCTAGTGTTTCACAAGCAGGTACCTTCTGCTGTAG
GAGAGAGAGAACTAAAGTTCTGAAAGACCTGTTGCTTTTCACCAGGAAGT
TTTACTGGGCATCTCCTGAGCCTAGGCAATAGCTGTAGGGTGACTTCTGG
AGCCATCCCCGTTTCCCCGCCCCCCAAAAGAAGCGGAGATTTAACGGGGA
CGTGCGGCCAGAGCTGGGGAAATGGGCCCGCGAGCCAGGCCGGCGCTTCT
CCTCCTGATGCTTTTGCAGACCGCGGTCCTGCAGGGGCGCTTGCTGCGTG
AGTCCAGGCCTGTTGCTCTGTCTCCAGGTTCACACTCTCTGCACTACCTC
TTCATGGGTGCCTCAGAGCAGGACCTTGGTCTTTCCTTGTTTGAAGCTTT
GGGCTACGTGGATGACCAGCTGTTCGTGTTCTATGATCATGAGAGTCGCC
GTGTGGAGCCCCGAACTCCATGGGTTTCCAGTAGAATTTCAAGCCAGATG
TGGCTGCAGCTGAGTCAGAGTCTGAAAGGGTGGGATCACATGTTCACTGT
TGACTTCTGGACTATTATGGAAAATCACAACCACAGCAAGGAGTCCCACA
CCCTGCAGGTCATCCTGGGCTGTGAAATGCAAGAAGACAACAGTACCGAG
GGCTACTGGAAGTACGGGTATGATGGGCAGGACCACCTTGAATTCTGCCC
TGACACACTGGATTGGAGAGCAGCAGAACCCAGGGCCTGGCCCACCAAGC
TGGAGTGGGAAAGGCACAAGATTCGGGCCAGGCAGAACAGGGCCTACCTG
GAGAGGGACTGCCCTGCACAGCTGCAGCAGTTGCTGGAGCTGGGGAGAGG
TGTTTTGGACCAACAAGGTTGCCTCCTTTGGTGAAGGTGACACATCATGT
GACCTCTTCAGTGACCACTCTACGGTGTCGGGCCTTGAACTACTACCCCC
AGAACATCACCATGAAGTGGCTGAAGGATAAGCAGCCAATGGATGCCAAG
GAGTTCGAACCTAAAGACGTATTGCCCAATGGGGATGGGACCTACCAGGG
CTGGATAACCTTGGCTGTACCCCCTGGGGAAGAGCAGAGATATACGTGCC
AGGTGGAGCACCCAGGCCTGGATCAGCCCCTCATTGTGATCTGGGGTAGC
CCTCACCGTCTGGCACCCTAGTCATTGGAGTCATCAGTGGAATTGCTGTT
TTTGTCGTCATCTTGTTCATTGGAATTTTGTTCATAATATTAAGGAAGAG
GCAGGGTTCAAGTTTAGCTGAACGTGAGTGACACGCAGCCTGCAGACTCA
CTGTGGGAAGGAGACAAAACTAGAGACTCAAAGAGGGAGTGCATTTATGA
GCTCTTCATGTTTCAGGAGAGAGTTGAACCTAAACATAGAAATTGCCTGA
CGAACTCCTTGATTTTAGCCTTCTCTGTTCATTTCCTCAAAAAGATTTCC
CCATTTAGGTTTCTGAGTTCCTGCATGCCGGTGATCCCTAGCTGTGACCT
CTCCCCTGGAACTGTCTCTCATGAACCTCAAGCTGCATCTAGAGGCTTCC
TTCATTTCCTCCGTCACCTCAGAGACATACACCTATGTCATTTCATTTCC
TATTTTTGGAAGAGGACTCCTTAAATTTGGGGGACTTACATGATTCATTT
TAACATCTGAGAAAAGCTTTGAACCCTGGGACGTGGCTAGTCATAACCTT
ACCAGATTTTTACACATGTATCTATGCATTTTCTGGACCCGTTCAACTTT
TCCTTTGAATCCTCTCTCTGTGTTACCCAGTAACTCATCTGTCACCAAGC
CTTGGGGATTCTTCCATCTGATTGTGATGTGAGTTGCACAGCTATGAAGG
CTGTACACTGCACGAATGGAAGAGGCACCTGTCCCAGAAAAAGCATCATG
GCTATCTGTGGGTAGTATGATGGGTGTTTTTAGCAGGTAGGAGGCAAATA
TCTTGAAAGGGGTTGTGAAGAGGTGTTTTTTCTAATTGGCATGAAGGTGT
CATACAGATTTGCAAAGTTTAATGGTGCCTTCATTTGGGATGCTACTCTA
GTATTCCAGACCTGAAGAATCACAATAATTTTCTACCTGGTCTCTCCTTG
TTCTGATAATGAAAATTATGATAAGGATGATAAAAGCACTTACTTCGTGT
CCGACTCTTCTGAGCACCTACTTACATGCATTACTGCATGCACTTCTTAC
AATAATTCTATGAGATAGGTACTATTATCCCCATTTCTTTTTTAAATGAA
GAAAGTGAAGTAGGCCGGGCACGGTGGCTCACGCCTGTAATCCCAGAGTG
CTGAGATTACAGGTGTGAGCCACCCTGCCCAGCCGTCAAAAGAGTCTTAA
TATATATATCCAGATGGCATGTGTTTACTTTATGTTACTACATGCACTTG
GCTGCATAAATGTGGTACAAGCATTCTGTCTTGAAGGGCAGGTGCTTCAG
GATACCATATACAGCTCAGAAGTTTCTTCTTTAGGCATTAAATTTTAGCA
AAGATATCTCATCTCTTCTTTTAAACCATTTTCTTTTTTTGTGGTTAGAA
AAGTTATGTAGAAAAAAGTAAATGTGATTTACGCTCATTGTAGAAAAGCT
ATAAAATGAATACAATTAAAGCTGTTATTTAATTAGCCAGTGAAAAACTA
TTAACAACTTGTCTATTACCTGTTAGTATTATTGTTGCATTAAAAATGCA
TATACTTTAATAAATGTATATTGTATTGTA
Protein 1
51
101
151
201

MGPRARPALL LLMLLQTAVL QGRLLRESRP VALSPGSHSL HYLFMGASEQ
DLGLSLFEAL GYVDDQLFVF YDHESRRVEP RTPWVSSRIS SQMWLQLSQS
LKGWDHMFTV DFWTIMENHN HSKESHTLQV ILGCEMQEDN STEGYWKYGY
DGQDHLEFCP DTLDWRAAEP RAWPTKLEWE RHKIRARQNR AYLERDCPAQ
LQQLLELGRG VLDQQGCLLW

 


Thank you for visiting my website!

Copyright © 2002 E.T.
All rights reserved. Terms and conditions apply