Construct: ORF ccsbBroadEn_03562
Construct Description:
- Construct Type:
- ORF
- Other Identifiers:
- ORF006981.1_s300c1, BRDN0000393894
- DNA Barcode:
- None
- Epitope Tag:
- None
- Notes:
- No stop codon in insert
Originally Annotated References:
- Gene:
- YY1AP1 (55249)
Vector Information:
- Vector Backbone:
- pDONR223
- Pol II Cassette 1:
- n/a
- Pol II Cassette 2:
- n/a
- Selection Marker:
- n/a
- Visible Reporter:
- n/a
- Epitope Tag:
- n/a
Current transcripts matched by this ORF:
Taxon | Gene | Symbol | Description | Transcript | Nuc. Match %[?]A simple nucleotide-based global alignment percentage, calculated as follows: total nt. matches ---------------------------------- aligned length (incl. gaps) |
Prot. Match %[?]A simple amino acid-based global alignment percentage, calculated as follows: total aa. matches ---------------------------------- aligned length (incl. gaps) |
Match Diffs[?]This field may contain sequence annotations in HGVS format. For more information about HGVS annotations, please refer to the HGVS Quick Reference Guide. | |
---|---|---|---|---|---|---|---|---|
1 | human | 55249 | YY1AP1 | YY1 associated protein 1 | NM_001198901.1 | 100% | 100% | |
2 | human | 55249 | YY1AP1 | YY1 associated protein 1 | NM_001198902.1 | 100% | 100% | |
3 | human | 55249 | YY1AP1 | YY1 associated protein 1 | NM_139119.2 | 100% | 100% | |
4 | human | 55249 | YY1AP1 | YY1 associated protein 1 | NM_001198899.1 | 98.5% | 98.5% | 0_1ins33 |
5 | human | 55249 | YY1AP1 | YY1 associated protein 1 | NM_001198900.1 | 98.5% | 98.5% | 0_1ins33 |
6 | human | 55249 | YY1AP1 | YY1 associated protein 1 | NM_018253.3 | 98.5% | 98.5% | 0_1ins33 |
7 | human | 55249 | YY1AP1 | YY1 associated protein 1 | NM_001198905.1 | 97.3% | 97.3% | 728_729ins60 |
8 | human | 55249 | YY1AP1 | YY1 associated protein 1 | NM_139121.2 | 91.2% | 91.2% | 0_1ins198 |
9 | human | 55249 | YY1AP1 | YY1 associated protein 1 | NM_139118.2 | 89.4% | 89.4% | 1_198del;926_927ins60 |
10 | human | 55249 | YY1AP1 | YY1 associated protein 1 | NM_001198903.1 | 84.4% | 84.4% | 1_414del |
11 | human | 55249 | YY1AP1 | YY1 associated protein 1 | NM_001198904.1 | 82.2% | 82.2% | 1_414del;1142_1143ins60 |
12 | human | 54856 | GON4L | gon-4 like | NM_001282861.2 | 47.2% | 45.6% | (many diffs) |
13 | human | 54856 | GON4L | gon-4 like | NM_032292.6 | 47.2% | 45.6% | (many diffs) |
14 | human | 55249 | YY1AP1 | YY1 associated protein 1 | NM_001198906.2 | 42.7% | 40.3% | 1_198del;1193_1212del;1272_1273ins1196 |
15 | human | 54856 | GON4L | gon-4 like | XM_005245284.3 | 34.9% | 33.7% | (many diffs) |
16 | human | 54856 | GON4L | gon-4 like | XM_011509659.2 | 34.9% | 33.7% | (many diffs) |
17 | human | 54856 | GON4L | gon-4 like | NM_001282858.2 | 32.2% | 31.1% | (many diffs) |
18 | human | 54856 | GON4L | gon-4 like | XM_006711393.3 | 32.2% | 31.1% | (many diffs) |
19 | human | 54856 | GON4L | gon-4 like | XM_006711394.4 | 32.2% | 31.1% | (many diffs) |
20 | human | 54856 | GON4L | gon-4 like | NM_001282856.1 | 32.2% | 31.1% | (many diffs) |
21 | human | 54856 | GON4L | gon-4 like | NM_001282860.1 | 32.2% | 31.1% | (many diffs) |
22 | human | 54856 | GON4L | gon-4 like | XM_005245286.3 | 31.3% | 30.2% | (many diffs) |
23 | human | 54856 | GON4L | gon-4 like | XM_011509658.2 | 31% | 30% | (many diffs) |
Sequence Information
Note: uppercase bases indicate empirically verified sequence.
- ORF start:
- 66
- ORF end:
- 2316
- ORF length:
- 2250
- Sequence:
-
1 gttcgttgca acaaattgat gagcaatgct tttttataat gccaaCTTTG TACAAAAAAG 61 TTGGCATGGA AGATCTGTTT GAAACTTTCC AAGATGAGAT GGGATTCTCC AACATGGAAG 121 ATGATGGCCC AGAAGAGGAG GAGCGTGTGG CTGAGCCTCA AGCTAACTTT AACACCCCTC 181 AAGCTCTACG GTTTGAGGAA CTACTGGCCA ACCTACTAAA TGAACAACAT CAGATAGCGA 241 AGGAACTATT TGAACAGCTG AAGATGAAGA AACCTTCAGC CAAACAGCAG AAGGAGGTAG 301 AGAAGGTTAA ACCCCAGTGT AAGGAAGTTC ATCAGACCCT GATTCTGGAC CCAGCACAAA 361 GGAAGAGACT CCAGCAGCAG ATGCAGCAGC ATGTTCAGCT CTTGACACAA ATCCACCTTC 421 TTGCCACCTG CAACCCCAAT CTCAATCCGG AGGCCAGTAG CACCAGGATA TGTCTTAAAG 481 AGCTGGGAAC CTTTGCTCAA AGCTCCATCG CCCTTCACCA TCAGTACAAC CCCAAGTTTC 541 AGACCCTGTT CCAACCCTGT AACTTGATGG GAGCTATGCA GCTGATTGAA GACTTCAGCA 601 CACATGTCAG CATTGACTGC AGCCCTCATA AAACTGTCAA GAAGACTGCC AATGAATTTC 661 CCTGTTTGCC AAAGCAAGTG GCTTGGATCC TGGCCACAAG CAAGGTTTTC ATGTATCCAG 721 AGTTACTTCC AGTGTGTTCC CTGAAGGCAA AGAATCCCCA GGATAAGATC CTCTTCACCA 781 AGGCTGAGGA CAATTTGTTA GCTTTAGGAC TGAAGCATTT TGAAGGGACT GAGTTTCTTA 841 ACCCTCTAAT CAGCAAGTAC CTTCTAACCT GCAAGACTGC CCGCCAACTG ACAGTGAGAA 901 TCAAGAACCT CAACATGAAC AGAGCTCCTG ACAACATCAT TAAATTTTAT AAGAAGACCA 961 AACAGCTGCC AGTCCTAGGA AAATGCTGTG AAGAGATCCA GCCACATCAG TGGAAGCCAC 1021 CTATAGAGAG AGAAGAACAC CGGCTCCCAT TCTGGTTAAA GGCCAGTCTG CCATCCATCC 1081 AGGAAGAACT GCGGCACATG GCTGATGGTG CTAGAGAGGT AGGAAATATG ACTGGAACCA 1141 CTGAGATCAA CTCAGATCAA GGCCTAGAAA AAGACAACTC AGAGTTGGGG AGTGAAACTC 1201 GGTACCCACT GCTATTGCCT AAGGGTGTAG TCCTGAAACT GAAGCCAGTT GCCGACCGTT 1261 TCCCCAAGAA GGCTTGGAGA CAGAAGCGTT CATCAGTCCT GAAACCCCTC CTTATCCAAC 1321 CCAGCCCCTC TCTCCAGCCC AGCTTCAACC CTGGGAAAAC ACCAGCCCAA TCAACTCATT 1381 CAGAAGCCCC TCCGAGCAAA ATGGTGCTCC GGATTCCTCA CCCAATACAG CCAGCCACTG 1441 TTTTACAGAC AGTTCCAGGT GTCCCTCCAC TGGGGGTCAG TGGAGGTGAG AGTTTTGAGT 1501 CTCCTGCAGC ACTGCCTGCT ATGCCCCCTG AGGCCAGGAC AAGCTTCCCT CTGTCTGAGT 1561 CCCAGACTTT GCTCTCTTCT GCCCCTGTGC CCAAGGTAAT GATGCCCTCC CCTGCCTCTT 1621 CCATGTTTCG AAAGCCATAT GTGAGACGGA GACCCTCAAA AAGAAGGGGA GCCAGGGCCT 1681 TTCGCTGTAT CAAACCTGCC CCTGTTATCC ACCCTGCATC TGTTATCTTC ACTGTTCCTG 1741 CTACCACTGT GAAGATTGTG AGCCTTGGCG GTGGCTGTAA CATGATCCAG CCTGTCAATG 1801 CGGCTGTGGC CCAGAGTCCC CAGACTATTC CCATCGCCAC CCTCTTGGTT AACCCTACTT 1861 CCTTCCCCTG TCCATTGAAC CAGCCCCTTG TGGCCTCCTC TGTCTCACCC TTAATTGTTT 1921 CTGGCAATTC TGTGAATCTT CCTATACCAT CCACCCCTGA AGATAAGGCC CACATGAATG 1981 TGGACATTGC TTGTGCTGTG GCTGATGGGG AAAATGCCTT TCAGGGCCTA GAACCCAAAT 2041 TAGAGCCCCA GGAACTATCT CCTCTCTCTG CTACTGTTTT CCCCAAAGTG GAACATAGCC 2101 CAGGGCCTCC ACCAGTCGAT AAACAGTGCC AAGAAGGATT GTCAGAGAAC AGTGCCTATC 2161 GCTGGACCGT TGTGAAAACA GAGGAGGGAA GGCAAGCTCT GGAGCCGCTC CCTCAGGGCA 2221 TCCAGGAGTC TCTAAACAAC TCTTCCCCTG GGGATTTAGA GGAAGTTGTC AAGATGGAAC 2281 CTGAAGATGC TACAGAGGAA ATCAGTGGAT TTCTTTGCCC AACTTTCTTG TACAAAGTtg 2341 gcattataag aaagcattgc ttatcaattt gttgcaacga ac