HFE
(Assignment 3 - Sequence manipulation)
| mRNA: EXON coords | length | Protein: Coding EXON coords | length |
| 1 - 305 | 305 bp | 222 - 305 | 84 bp |
| 3600 - 3885 | 286 bp | 3600 - 3885 | 286 bp |
| 4095 - 4372 | 278 bp | 4095 - 4372 | 278 bp |
| 5466 - 5743 | 278 bp | 5466 - 5477 | 12 bp |
| 5900 - 6015 | 116 bp | | |
| 6990 - 8022 | 1033 bp | | |
| 9177 - 9610 | 434 bp | | |
| Total: | 2730 bp | Total: | 220 aa |
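The lengths and totals in the exon table can be re-derived from the coordinates alone. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (as in GenBank locations); the names `mrna_exons`, `coding_exons`, and `span_length` are illustrative and not part of the assignment.

```python
# Exon coordinates copied from the exon table (assumed 1-based, inclusive).
mrna_exons = [(1, 305), (3600, 3885), (4095, 4372), (5466, 5743),
              (5900, 6015), (6990, 8022), (9177, 9610)]
coding_exons = [(222, 305), (3600, 3885), (4095, 4372), (5466, 5477)]


def span_length(start, end):
    """Length of an inclusive coordinate range."""
    return end - start + 1


mrna_lengths = [span_length(s, e) for s, e in mrna_exons]
coding_lengths = [span_length(s, e) for s, e in coding_exons]

mrna_total = sum(mrna_lengths)      # spliced mRNA length in bp
coding_total = sum(coding_lengths)  # coding sequence length in bp
codons = coding_total // 3          # number of complete codons

print(mrna_lengths, mrna_total)            # [305, 286, 278, 278, 116, 1033, 434] 2730
print(coding_lengths, coding_total, codons)  # [84, 286, 278, 12] 660 220
```

The 660 coding bases divide evenly into 220 codons, which agrees with the 220 aa total in the table.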
mRNA    join(1..305,3600..3885,4095..4372,5466..5743,5900..6015,6990..8022,9177..9610)
        /product="hemochromatosis protein, transcript variant 12"
        /note="unclassified transcription discrepancy"
        /transcript_id="NM_77777.1"
        /db_xref="GI:210403387"
        /db_xref="GeneID:3077"
        /db_xref="LocusID:3077"
        /db_xref="MIM:235200"
CDS     join(222..305,3600..3885,4095..4372,5466..5477)
        /codon_start=1
        /product="hemochromatosis protein isoform 2 precursor"
        /protein_id="NP_77777.2"
        /db_xref="GI:210403397"
        /db_xref="GeneID:3077"
        /db_xref="LocusID:3077"
        /db_xref="MIM:235200"
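The `join(...)` locations are given in genomic coordinates, while the sequence listing is numbered along the spliced mRNA. A small sketch of the conversion follows, assuming 1-based inclusive coordinates; `parse_join` and `genomic_to_mrna` are hypothetical helper names introduced here for illustration, and the regular expression only handles simple `start..end` segments.

```python
import re

MRNA_LOCATION = "join(1..305,3600..3885,4095..4372,5466..5743,5900..6015,6990..8022,9177..9610)"
CDS_LOCATION = "join(222..305,3600..3885,4095..4372,5466..5477)"


def parse_join(location):
    """Turn 'join(a..b,c..d)' into [(a, b), (c, d)] with integer coordinates."""
    return [(int(a), int(b)) for a, b in re.findall(r"(\d+)\.\.(\d+)", location)]


def genomic_to_mrna(pos, exons):
    """Map a genomic position to its position in the spliced mRNA."""
    offset = 0  # number of mRNA bases contributed by earlier exons
    for start, end in exons:
        if start <= pos <= end:
            return offset + pos - start + 1
        offset += end - start + 1
    raise ValueError(f"position {pos} is not inside any exon")


exons = parse_join(MRNA_LOCATION)
cds = parse_join(CDS_LOCATION)

cds_start = genomic_to_mrna(cds[0][0], exons)   # first coding base
cds_end = genomic_to_mrna(cds[-1][1], exons)    # last coding base

print(cds_start, cds_end, cds_end - cds_start + 1)  # 222 881 660
```

Under these assumptions the coding region spans mRNA positions 222 to 881, so the bases before 222 and after 881 in the mRNA listing are untranslated.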
Sequence of:

mRNA
1 GGGGACACTGGATCACCTAGTGTTTCACAAGCAGGTACCTTCTGCTGTAG
51 GAGAGAGAGAACTAAAGTTCTGAAAGACCTGTTGCTTTTCACCAGGAAGT
101 TTTACTGGGCATCTCCTGAGCCTAGGCAATAGCTGTAGGGTGACTTCTGG
151 AGCCATCCCCGTTTCCCCGCCCCCCAAAAGAAGCGGAGATTTAACGGGGA
201 CGTGCGGCCAGAGCTGGGGAAATGGGCCCGCGAGCCAGGCCGGCGCTTCT
251 CCTCCTGATGCTTTTGCAGACCGCGGTCCTGCAGGGGCGCTTGCTGCGTG
301 AGTCCAGGCCTGTTGCTCTGTCTCCAGGTTCACACTCTCTGCACTACCTC
351 TTCATGGGTGCCTCAGAGCAGGACCTTGGTCTTTCCTTGTTTGAAGCTTT
401 GGGCTACGTGGATGACCAGCTGTTCGTGTTCTATGATCATGAGAGTCGCC
451 GTGTGGAGCCCCGAACTCCATGGGTTTCCAGTAGAATTTCAAGCCAGATG
501 TGGCTGCAGCTGAGTCAGAGTCTGAAAGGGTGGGATCACATGTTCACTGT
551 TGACTTCTGGACTATTATGGAAAATCACAACCACAGCAAGGAGTCCCACA
601 CCCTGCAGGTCATCCTGGGCTGTGAAATGCAAGAAGACAACAGTACCGAG
651 GGCTACTGGAAGTACGGGTATGATGGGCAGGACCACCTTGAATTCTGCCC
701 TGACACACTGGATTGGAGAGCAGCAGAACCCAGGGCCTGGCCCACCAAGC
751 TGGAGTGGGAAAGGCACAAGATTCGGGCCAGGCAGAACAGGGCCTACCTG
801 GAGAGGGACTGCCCTGCACAGCTGCAGCAGTTGCTGGAGCTGGGGAGAGG
851 TGTTTTGGACCAACAAGGTTGCCTCCTTTGGTGAAGGTGACACATCATGT
901 GACCTCTTCAGTGACCACTCTACGGTGTCGGGCCTTGAACTACTACCCCC
951 AGAACATCACCATGAAGTGGCTGAAGGATAAGCAGCCAATGGATGCCAAG
1001 GAGTTCGAACCTAAAGACGTATTGCCCAATGGGGATGGGACCTACCAGGG
1051 CTGGATAACCTTGGCTGTACCCCCTGGGGAAGAGCAGAGATATACGTGCC
1101 AGGTGGAGCACCCAGGCCTGGATCAGCCCCTCATTGTGATCTGGGGTAGC
1151 CCTCACCGTCTGGCACCCTAGTCATTGGAGTCATCAGTGGAATTGCTGTT
1201 TTTGTCGTCATCTTGTTCATTGGAATTTTGTTCATAATATTAAGGAAGAG
1251 GCAGGGTTCAAGTTTAGCTGAACGTGAGTGACACGCAGCCTGCAGACTCA
1301 CTGTGGGAAGGAGACAAAACTAGAGACTCAAAGAGGGAGTGCATTTATGA
1351 GCTCTTCATGTTTCAGGAGAGAGTTGAACCTAAACATAGAAATTGCCTGA
1401 CGAACTCCTTGATTTTAGCCTTCTCTGTTCATTTCCTCAAAAAGATTTCC
1451 CCATTTAGGTTTCTGAGTTCCTGCATGCCGGTGATCCCTAGCTGTGACCT
1501 CTCCCCTGGAACTGTCTCTCATGAACCTCAAGCTGCATCTAGAGGCTTCC
1551 TTCATTTCCTCCGTCACCTCAGAGACATACACCTATGTCATTTCATTTCC
1601 TATTTTTGGAAGAGGACTCCTTAAATTTGGGGGACTTACATGATTCATTT
1651 TAACATCTGAGAAAAGCTTTGAACCCTGGGACGTGGCTAGTCATAACCTT
1701 ACCAGATTTTTACACATGTATCTATGCATTTTCTGGACCCGTTCAACTTT
1751 TCCTTTGAATCCTCTCTCTGTGTTACCCAGTAACTCATCTGTCACCAAGC
1801 CTTGGGGATTCTTCCATCTGATTGTGATGTGAGTTGCACAGCTATGAAGG
1851 CTGTACACTGCACGAATGGAAGAGGCACCTGTCCCAGAAAAAGCATCATG
1901 GCTATCTGTGGGTAGTATGATGGGTGTTTTTAGCAGGTAGGAGGCAAATA
1951 TCTTGAAAGGGGTTGTGAAGAGGTGTTTTTTCTAATTGGCATGAAGGTGT
2001 CATACAGATTTGCAAAGTTTAATGGTGCCTTCATTTGGGATGCTACTCTA
2051 GTATTCCAGACCTGAAGAATCACAATAATTTTCTACCTGGTCTCTCCTTG
2101 TTCTGATAATGAAAATTATGATAAGGATGATAAAAGCACTTACTTCGTGT
2151 CCGACTCTTCTGAGCACCTACTTACATGCATTACTGCATGCACTTCTTAC
2201 AATAATTCTATGAGATAGGTACTATTATCCCCATTTCTTTTTTAAATGAA
2251 GAAAGTGAAGTAGGCCGGGCACGGTGGCTCACGCCTGTAATCCCAGAGTG
2301 CTGAGATTACAGGTGTGAGCCACCCTGCCCAGCCGTCAAAAGAGTCTTAA
2351 TATATATATCCAGATGGCATGTGTTTACTTTATGTTACTACATGCACTTG
2401 GCTGCATAAATGTGGTACAAGCATTCTGTCTTGAAGGGCAGGTGCTTCAG
2451 GATACCATATACAGCTCAGAAGTTTCTTCTTTAGGCATTAAATTTTAGCA
2501 AAGATATCTCATCTCTTCTTTTAAACCATTTTCTTTTTTTGTGGTTAGAA
2551 AAGTTATGTAGAAAAAAGTAAATGTGATTTACGCTCATTGTAGAAAAGCT
2601 ATAAAATGAATACAATTAAAGCTGTTATTTAATTAGCCAGTGAAAAACTA
2651 TTAACAACTTGTCTATTACCTGTTAGTATTATTGTTGCATTAAAAATGCA
2701 TATACTTTAATAAATGTATATTGTATTGTA
Protein
1 MGPRARPALL LLMLLQTAVL QGRLLRESRP VALSPGSHSL HYLFMGASEQ
51 DLGLSLFEAL GYVDDQLFVF YDHESRRVEP RTPWVSSRIS SQMWLQLSQS
101 LKGWDHMFTV DFWTIMENHN HSKESHTLQV ILGCEMQEDN STEGYWKYGY
151 DGQDHLEFCP DTLDWRAAEP RAWPTKLEWE RHKIRARQNR AYLERDCPAQ
201 LQQLLELGRG VLDQQGCLLW
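The protein listing can be spot-checked against the mRNA by translating short stretches of the coding region. The sketch below assumes the standard genetic code; `translate`, `cds_head`, and `cds_tail` are illustrative names, and the two snippets are copied from the mRNA listing at positions 222 to 251 and 867 to 881.

```python
from itertools import product

# Standard genetic code, with codons ordered by base T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate DNA in frame 1; unknown codons become 'X', stops become '*'."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(
        CODON_TABLE.get(dna[i:i + 3], "X") for i in range(0, usable, 3)
    )


cds_head = "ATGGGCCCGCGAGCCAGGCCGGCGCTTCTC"  # mRNA positions 222..251
cds_tail = "GGTTGCCTCCTTTGG"                  # mRNA positions 867..881

print(translate(cds_head))  # MGPRARPALL
print(translate(cds_tail))  # GCLLW
```

Both results match the first ten and last five residues of the protein listing, which suggests the reading frame given by `/codon_start=1` is consistent with the sequence shown.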