LOCUS       V021414                11199 bp    DNA     circular SYN 01-JAN-1980
DEFINITION  synthetic circular DNA.
ACCESSION   V021414
VERSION     V021414
KEYWORDS    .
SOURCE      .
  ORGANISM  .
            .
FEATURES             Location/Qualifiers
     LTR             1..634
                     /note="3' long terminal repeat (LTR) from HIV-1"
                     /label="3' LTR"
     misc_feature    681..806
                     /note="packaging signal of human immunodeficiency virus
                     type 1"
                     /label="HIV-1 Ψ"
     misc_feature    1303..1536
                     /note="The Rev response element (RRE) of HIV-1 allows for
                     Rev-dependent mRNA export from the nucleus to the
                     cytoplasm."
                     /label="RRE"
     CDS             1721..1765
                     /codon_start=1
                     /note="recognized by the 2H10 single-chain llama nanobody"
                     /product="antigenic peptide corresponding to amino acids
                     655 to 669 of the HIV envelope protein gp41 (Lutje Hulsik
                     et al., 2013)"
                     /transl_table=1
                     /translation="KNEQELLELDKWASL"
                     /label="gp41 peptide"
     misc_feature    2028..2143
                     /note="central polypurine tract and central termination
                     sequence of HIV-1 (lacking the first T)"
                     /label="cPPT/CTS"
     enhancer        2201..2504
                     /note="human cytomegalovirus immediate early enhancer"
                     /label="CMV enhancer"
     promoter        2505..2708
                     /note="human cytomegalovirus (CMV) immediate early
                     promoter"
                     /label="CMV promoter"
     CDS             2815..5850
                     /label="UBA7(NM_003335)"
                     /note="UBA7(NM_003335)"
                     /gene="UBA7"
     misc_feature    5870..6442
                     /note="internal ribosome entry site (IRES) of the
                     encephalomyocarditis virus (EMCV)"
                     /label="IRES"
     CDS             6444..7154
                     /codon_start=1
                     /note="mammalian codon-optimized"
                     /product="monomeric derivative of DsRed fluorescent protein
                     (Shaner et al., 2004)"
                     /transl_table=1
                     /translation="MVSKGEEDNMAIIKEFMRFKVHMEGSVNGHEFEIEGEGEGRPYEG
                     TQTAKLKVTKGGPLPFAWDILSPQFMYGSKAYVKHPADIPDYLKLSFPEGFKWERVMNF
                     EDGGVVTVTQDSSLQDGEFIYKVKLRGTNFPSDGPVMQKKTMGWEASSERMYPEDGALK
                     GEIKQRLKLKDGGHYDAEVKTTYKAKKPVQLPGAYNVNIKLDITSHNEDYTIVEQYERA
                     EGRHSTGGMDELYK*"
                     /label="mCherry"
     misc_feature    7168..7756
                     /note="woodchuck hepatitis virus posttranscriptional
                     regulatory element"
                     /label="WPRE"
     LTR             7963..8596
                     /note="3' long terminal repeat (LTR) from HIV-1"
                     /label="3' LTR"
     primer_bind     complement(8724..8740)
                     /note="common sequencing primer, one of multiple similar
                     variants"
                     /label="M13 rev"
     protein_bind    8748..8764
                     /bound_moiety="lac repressor encoded by lacI"
                     /note="The lac repressor binds to the lac operator to
                     inhibit transcription in E. coli. This inhibition can be
                     relieved by adding lactose or
                     isopropyl-β-D-thiogalactopyranoside (IPTG)."
                     /label="lac operator"
     promoter        complement(8772..8802)
                     /note="promoter for the E. coli lac operon"
                     /label="lac promoter"
     protein_bind    8817..8838
                     /bound_moiety="E. coli catabolite activator protein"
                     /note="CAP binding activates transcription in the presence
                     of cAMP."
                     /label="CAP binding site"
     rep_origin      complement(9126..9714)
                     /direction=LEFT
                     /note="high-copy-number ColE1/pMB1/pBR322/pUC origin of
                     replication"
                     /label="ori"
     CDS             complement(9885..10745)
                     /codon_start=1
                     /gene="bla"
                     /note="confers resistance to ampicillin, carbenicillin, and
                     related antibiotics"
                     /product="β-lactamase"
                     /transl_table=1
                     /translation="MSIQHFRVALIPFFAAFCLPVFA,HPETLVKVKDAEDQLGARVGY
                     IELDLNSGKILESFRPEERFPMMSTFKVLLCGAVLSRIDAGQEQLGRRIHYSQNDLVEY
                     SPVTEKHLTDGMTVRELCSAAITMSDNTAANLLLTTIGGPKELTAFLHNMGDHVTRLDR
                     WEPELNEAIPNDERDTTMPVAMATTLRKLLTGELLTLASRQQLIDWMEADKVAGPLLRS
                     ALPAGWFIADKSGAGERGSRGIIAALGPDGKPSRIVVIYTTGSQATMDERNRQIAEIGA
                     SLIKHW*"
                     /label="AmpR"
     promoter        complement(10746..10850)
                     /gene="bla"
                     /label="AmpR promoter"
     polyA_signal    10898..11032
                     /note="SV40 polyadenylation signal"
                     /label="SV40 poly(A) signal"
ORIGIN
        1 tggaagggct aattcactcc caaagaagac aagatatcct tgatctgtgg atctaccaca
       61 cacaaggcta cttccctgat tagcagaact acacaccagg gccaggggtc agatatccac
      121 tgacctttgg atggtgctac aagctagtac cagttgagcc agataaggta gaagaggcca
      181 ataaaggaga gaacaccagc ttgttacacc ctgtgagcct gcatgggatg gatgacccgg
      241 agagagaagt gttagagtgg aggtttgaca gccgcctagc atttcatcac gtggcccgag
      301 agctgcatcc ggagtacttc aagaactgct gatatcgagc ttgctacaag ggactttccg
      361 ctggggactt tccagggagg cgtggcctgg gcgggactgg ggagtggcga gccctcagat
      421 cctgcatata agcagctgct ttttgcctgt actgggtctc tctggttaga ccagatctga
      481 gcctgggagc tctctggcta actagggaac ccactgctta agcctcaata aagcttgcct
      541 tgagtgcttc aagtagtgtg tgcccgtctg ttgtgtgact ctggtaacta gagatccctc
      601 agaccctttt agtcagtgtg gaaaatctct agcagtggcg cccgaacagg gacttgaaag
      661 cgaaagggaa accagaggag ctctctcgac gcaggactcg gcttgctgaa gcgcgcacgg
      721 caagaggcga ggggcggcga ctggtgagta cgccaaaaat tttgactagc ggaggctaga
      781 aggagagaga tgggtgcgag agcgtcagta ttaagcgggg gagaattaga tcgcgatggg
      841 aaaaaattcg gttaaggcca gggggaaaga aaaaatataa attaaaacat atagtatggg
      901 caagcaggga gctagaacga ttcgcagtta atcctggcct gttagaaaca tcagaaggct
      961 gtagacaaat actgggacag ctacaaccat cccttcagac aggatcagaa gaacttagat
     1021 cattatataa tacagtagca accctctatt gtgtgcatca aaggatagag ataaaagaca
     1081 ccaaggaagc tttagacaag atagaggaag agcaaaacaa aagtaagacc accgcacagc
     1141 aagcggccgg ccgctgatct tcagacctgg aggaggagat atgagggaca attggagaag
     1201 tgaattatat aaatataaag tagtaaaaat tgaaccatta ggagtagcac ccaccaaggc
     1261 aaagagaaga gtggtgcaga gagaaaaaag agcagtggga ataggagctt tgttccttgg
     1321 gttcttggga gcagcaggaa gcactatggg cgcagcgtca atgacgctga cggtacaggc
     1381 cagacaatta ttgtctggta tagtgcagca gcagaacaat ttgctgaggg ctattgaggc
     1441 gcaacagcat ctgttgcaac tcacagtctg gggcatcaag cagctccagg caagaatcct
     1501 ggctgtggaa agatacctaa aggatcaaca gctcctgggg atttggggtt gctctggaaa
     1561 actcatttgc accactgctg tgccttggaa tgctagttgg agtaataaat ctctggaaca
     1621 gatttggaat cacacgacct ggatggagtg ggacagagaa attaacaatt acacaagctt
     1681 aatacactcc ttaattgaag aatcgcaaaa ccagcaagaa aagaatgaac aagaattatt
     1741 ggaattagat aaatgggcaa gtttgtggaa ttggtttaac ataacaaatt ggctgtggta
     1801 tataaaatta ttcataatga tagtaggagg cttggtaggt ttaagaatag tttttgctgt
     1861 actttctata gtgaatagag ttaggcaggg atattcacca ttatcgtttc agacccacct
     1921 cccaaccccg aggggacccg acaggcccga aggaatagaa gaagaaggtg gagagagaga
     1981 cagagacaga tccattcgat tagtgaacgg atctcgacgg tatcgccttt aaaagaaaag
     2041 gggggattgg ggggtacagt gcaggggaaa gaatagtaga cataatagca acagacatac
     2101 aaactaaaga attacaaaaa caaattacaa aaattcaaaa ttttcgggtt tattacaggg
     2161 acagcagaga tccagtttat cgataagctt gggagttccg cgttacataa cttacggtaa
     2221 atggcccgcc tggctgaccg cccaacgacc cccgcccatt gacgtcaata atgacgtatg
     2281 ttcccatagt aacgccaata gggactttcc attgacgtca atgggtggag tatttacggt
     2341 aaactgccca cttggcagta catcaagtgt atcatatgcc aagtacgccc cctattgacg
     2401 tcaatgacgg taaatggccc gcctggcatt atgcccagta catgacctta tgggactttc
     2461 ctacttggca gtacatctac gtattagtca tcgctattac catggtgatg cggttttggc
     2521 agtacatcaa tgggcgtgga tagcggtttg actcacgggg atttccaagt ctccacccca
     2581 ttgacgtcaa tgggagtttg ttttggcacc aaaatcaacg ggactttcca aaatgtcgta
     2641 acaactccgc cccattgacg caaatgggcg gtaggcgtgt acggtgggag gtctatataa
     2701 gcagagctcg tttagtgaac cgtcagatcg cctggagacg ccatccacgc tgttttgacc
     2761 tccatagaag acaccgactc tactagagga tctatttccg gtgaattcgc caccatggat
     2821 gccctggacg cttcgaagct actggatgag gagctgtatt caagacagct gtatgtgctg
     2881 ggctcacctg ccatgcagag gattcaggga gccagggtcc tggtgtcagg cctgcagggc
     2941 ctgggggccg aggtggccaa gaacttggtt ctgatgggtg tgggcagcct cactctgcat
     3001 gatccccacc ccacctgctg gtccgacctg gctgcccagt ttctcctctc agagcaggac
     3061 ttggaaagga gcagagccga ggcctctcaa gagctcttgg ctcagctcaa cagagctgtc
     3121 caggtcgtcg tgcacacggg tgacatcact gaggacctgc tgttggactt ccaggtggtg
     3181 gtgctgactg ctgcaaagct ggaggagcag ctgaaggtgg gcaccttgtg tcataagcat
     3241 ggagtttgct ttctggcggc tgacacccgg ggcctcgtgg ggcagttgtt ctgtgacttt
     3301 ggtgaggact tcactgtgca ggaccccaca gaggcagaac ccctgacagc tgccatccag
     3361 cacatctccc agggctcccc tggcattctc actctgagga aaggggccaa tacccactac
     3421 ttccgtgatg gagacttggt gactttctcg ggaattgagg gaatggttga gctcaacgac
     3481 tgtgatcccc ggtctatcca cgtgcgggag gatgggtccc tggagattgg agacacaaca
     3541 actttctctc ggtacttgcg tggtggggct atcactgaag tcaagagacc caagactgtg
     3601 agacataagt ccctggacac agccctgctc cagccccatg tggtggccca gagctcccag
     3661 gaagttcacc atgcccactg cctgcatcag gccttctgtg cactgcacaa gttccagcac
     3721 ctccatggcc ggccacccca gccctgggat cctgttgatg cagagactgt ggtgggcctg
     3781 gcccgggacc tggaaccact gaagcggaca gaggaagagc cactggaaga gccactggat
     3841 gaggccctag tgcggacagt cgccctaagc agtgcaggtg tcttgagccc tatggtggcc
     3901 atgctgggtg cagtagctgc ccaggaagtg ctgaaggcaa tctccaggaa gttcatgcct
     3961 ctggaccagt ggctttactt tgatgccctc gattgtcttc cggaagatgg ggagctcctt
     4021 cccagtcctg aggactgtgc cctgagaggc agccgctatg atgggcaaat tgcagtgttt
     4081 ggggctggtt ttcaggagaa actgagacgc cagcactacc tcctggtggg cgctggtgcc
     4141 attggttgtg agctgctcaa agtctttgcc ctagtgggac tgggggccgg gaacagcggg
     4201 ggcttgactg ttgttgacat ggaccacata gagcgctcca atctcagccg tcagttcctc
     4261 ttcaggtccc aggacgttgg tagacccaag gcagaggtgg ctgcagcagc tgcccggggc
     4321 ctgaacccag acttacaggt gatcccgctc acctacccac tggatcccac cacagagcac
     4381 atctatgggg ataacttttt ctcccgtgtg gatggtgtgg ctgctgccct ggacagtttc
     4441 caggcccggc gctatgtggc tgctcgttgc acccactatc tgaagccact gctggaggca
     4501 ggcacatcgg gcacctgggg cagtgctaca gtattcatgc cacatgtgac tgaggcctac
     4561 agagcccctg cctcagctgc agcttctgag gatgccccct accctgtctg taccgtgcgg
     4621 tacttcccta gcacagccga gcacaccctg cagtgggccc ggcatgagtt tgaagaactc
     4681 ttccgactgt ctgcagagac catcaaccac caccaacagg cacacacttc cctggcagac
     4741 atggatgagc cacagacact caccttactg aagccagtgc ttggggtcct gagagtgcgt
     4801 ccacagaact ggcaagactg tgtggcgtgg gctcttggcc actggaaact ctgctttcat
     4861 tatggcatca aacagctgct gaggcacttc ccacctaata aagtgcttga ggatggaact
     4921 cccttctggt caggtcccaa acagtgtccc cagcccttgg agtttgacac caaccaagac
     4981 acacacctcc tctacgtact ggcagctgcc aacctgtatg cccagatgca tgggctgcct
     5041 ggctcacagg actggactgc actcagggag ctgctgaagc tgctgccaca gcctgacccc
     5101 caacagatgg cccccatctt tgctagtaat ctagagctgg cttcggcttc tgctgagttt
     5161 ggccctgagc agcagaagga actgaacaaa gccctggaag tctggagtgt gggccctccc
     5221 ctgaagcctc tgatgtttga gaaggatgat gacagcaact tccatgtgga ctttgtggta
     5281 gcggcagcta gcctgagatg tcagaactac gggattccac cggtcaaccg tgcccagagc
     5341 aagcgaattg tgggccagat tatcccagcc attgccacca ctacagcagc tgtggcaggc
     5401 ctgttgggcc tggagctgta taaggtggtg agtgggccac ggcctcgtag tgcctttcgc
     5461 cacagctacc tacatctggc tgaaaactac ctcatccgct atatgccttt tgccccagcc
     5521 atccagacgt tccatcacct gaagtggacc tcttgggacc gtctgaaggt accagctggg
     5581 cagcctgaga ggaccctgga gtcgctgctg gctcatcttc aggagcagca cgggttgagg
     5641 gtgaggatcc tgctgcacgg ctcagccctg ctctatgcgg ccggatggtc acctgaaaag
     5701 caggcccagc acctgcccct cagggtgaca gaactggttc agcagctgac aggccaggca
     5761 cctgctcctg ggcagcgggt gttggtgcta gagctgagct gtgagggtga cgacgaggac
     5821 actgccttcc cacctctgca ctatgagctg tgagcggccg cggatcccgc ccctctccct
     5881 cccccccccc taacgttact ggccgaagcc gcttggaata aggccggtgt gcgtttgtct
     5941 atatgttatt ttccaccata ttgccgtctt ttggcaatgt gagggcccgg aaacctggcc
     6001 ctgtcttctt gacgagcatt cctaggggtc tttcccctct cgccaaagga atgcaaggtc
     6061 tgttgaatgt cgtgaaggaa gcagttcctc tggaagcttc ttgaagacaa acaacgtctg
     6121 tagcgaccct ttgcaggcag cggaaccccc cacctggcga caggtgcctc tgcggccaaa
     6181 agccacgtgt ataagataca cctgcaaagg cggcacaacc ccagtgccac gttgtgagtt
     6241 ggatagttgt ggaaagagtc aaatggctca cctcaagcgt attcaacaag gggctgaagg
     6301 atgcccagaa ggtaccccat tgtatgggat ctgatctggg gcctcggtgc acatgcttta
     6361 catgtgttta gtcgaggtta aaaaacgtct aggccccccg aaccacgggg acgtggtttt
     6421 cctttgaaaa acacgatgat aatatggtga gcaagggcga ggaggataac atggccatca
     6481 tcaaggagtt catgcgcttc aaggtgcaca tggagggctc cgtgaacggc cacgagttcg
     6541 agatcgaggg cgagggcgag ggccgcccct acgagggcac ccagaccgcc aagctgaagg
     6601 tgaccaaggg tggccccctg cccttcgcct gggacatcct gtcccctcag ttcatgtacg
     6661 gctccaaggc ctacgtgaag caccccgccg acatccccga ctacttgaag ctgtccttcc
     6721 ccgagggctt caagtgggag cgcgtgatga acttcgagga cggcggcgtg gtgaccgtga
     6781 cccaggactc ctccctgcag gacggcgagt tcatctacaa ggtgaagctg cgcggcacca
     6841 acttcccctc cgacggcccc gtaatgcaga agaagaccat gggctgggag gcctcctccg
     6901 agcggatgta ccccgaggac ggcgccctga agggcgagat caagcagagg ctgaagctga
     6961 aggacggcgg ccactacgac gctgaggtca agaccaccta caaggccaag aagcccgtgc
     7021 agctgcccgg cgcctacaac gtcaacatca agttggacat cacctcccac aacgaggact
     7081 acaccatcgt ggaacagtac gaacgcgccg agggccgcca ctccaccggc ggcatggacg
     7141 agctgtacaa gtgaacgcgt ctggaacaat caacctctgg attacaaaat ttgtgaaaga
     7201 ttgactggta ttcttaacta tgttgctcct tttacgctat gtggatacgc tgctttaatg
     7261 cctttgtatc atgctattgc ttcccgtatg gctttcattt tctcctcctt gtataaatcc
     7321 tggttgctgt ctctttatga ggagttgtgg cccgttgtca ggcaacgtgg cgtggtgtgc
     7381 actgtgtttg ctgacgcaac ccccactggt tggggcattg ccaccacctg tcagctcctt
     7441 tccgggactt tcgctttccc cctccctatt gccacggcgg aactcatcgc cgcctgcctt
     7501 gcccgctgct ggacaggggc tcggctgttg ggcactgaca attccgtggt gttgtcgggg
     7561 aagctgacgt cctttccatg gctgctcgcc tgtgttgcca cctggattct gcgcgggacg
     7621 tccttctgct acgtcccttc ggccctcaat ccagcggacc ttccttcccg cggcctgctg
     7681 ccggctctgc ggcctcttcc gcgtcttcgc cttcgccctc agacgagtcg gatctccctt
     7741 tgggccgcct ccccgcctgg aattaattct gcagtcgaga cctagaaaaa catggagcaa
     7801 tcacaagtag caatacagca gctaccaatg ctgattgtgc ctggctagaa gcacaagagg
     7861 aggaggaggt gggttttcca gtcacacctc aggtaccttt aagaccaatg acttacaagg
     7921 cagctgtaga tcttagccac tttttaaaag aaaagagggg actggaaggg ctaattcact
     7981 cccaacgaag acaagatatc cttgatctgt ggatctacca cacacaaggc tacttccctg
     8041 attagcagaa ctacacacca gggccagggg tcagatatcc actgaccttt ggatggtgct
     8101 acaagctagt accagttgag ccagataagg tagaagaggc caataaagga gagaacacca
     8161 gcttgttaca ccctgtgagc ctgcatggga tggatgaccc ggagagagaa gtgttagagt
     8221 ggaggtttga cagccgccta gcatttcatc acgtggcccg agagctgcat ccggagtact
     8281 tcaagaactg ctgatatcga gcttgctaca agggactttc cgctggggac tttccaggga
     8341 ggcgtggcct gggcgggact ggggagtggc gagccctcag atcctgcata taagcagctg
     8401 ctttttgcct gtactgggtc tctctggtta gaccagatct gagcctggga gctctctggc
     8461 taactaggga acccactgct taagcctcaa taaagcttgc cttgagtgct tcaagtagtg
     8521 tgtgcccgtc tgttgtgtga ctctggtaac tagagatccc tcagaccctt ttagtcagtg
     8581 tggaaaatct ctagcagtag tagttcatgt catcttatta ttcagtattt ataacttgca
     8641 aagaaatgaa tatcagagag tgagaggcct tgacattgct agcgtttacc gtcgacctct
     8701 agctagagct tggcgtaatc atggtcatag ctgtttcctg tgtgaaattg ttatccgctc
     8761 acaattccac acaacatacg agccggaagc ataaagtgta aagcctgggg tgcctaatga
     8821 gtgagctaac tcacattaat tgcgttgcgc tcactgcccg ctttccagtc gggaaacctg
     8881 tcgtgccagc tgcattaatg aatcggccaa cgcgcgggga gaggcggttt gcgtattggg
     8941 cgctcttccg cttcctcgct cactgactcg ctgcgctcgg tcgttcggct gcggcgagcg
     9001 gtatcagctc actcaaaggc ggtaatacgg ttatccacag aatcagggga taacgcagga
     9061 aagaacatgt gagcaaaagg ccagcaaaag gccaggaacc gtaaaaaggc cgcgttgctg
     9121 gcgtttttcc ataggctccg cccccctgac gagcatcaca aaaatcgacg ctcaagtcag
     9181 aggtggcgaa acccgacagg actataaaga taccaggcgt ttccccctgg aagctccctc
     9241 gtgcgctctc ctgttccgac cctgccgctt accggatacc tgtccgcctt tctcccttcg
     9301 ggaagcgtgg cgctttctca tagctcacgc tgtaggtatc tcagttcggt gtaggtcgtt
     9361 cgctccaagc tgggctgtgt gcacgaaccc cccgttcagc ccgaccgctg cgccttatcc
     9421 ggtaactatc gtcttgagtc caacccggta agacacgact tatcgccact ggcagcagcc
     9481 actggtaaca ggattagcag agcgaggtat gtaggcggtg ctacagagtt cttgaagtgg
     9541 tggcctaact acggctacac tagaagaaca gtatttggta tctgcgctct gctgaagcca
     9601 gttaccttcg gaaaaagagt tggtagctct tgatccggca aacaaaccac cgctggtagc
     9661 ggtggttttt ttgtttgcaa gcagcagatt acgcgcagaa aaaaaggatc tcaagaagat
     9721 cctttgatct tttctacggg gtctgacgct cagtggaacg aaaactcacg ttaagggatt
     9781 ttggtcatga gattatcaaa aaggatcttc acctagatcc ttttaaatta aaaatgaagt
     9841 tttaaatcaa tctaaagtat atatgagtaa acttggtctg acagttacca atgcttaatc
     9901 agtgaggcac ctatctcagc gatctgtcta tttcgttcat ccatagttgc ctgactcccc
     9961 gtcgtgtaga taactacgat acgggagggc ttaccatctg gccccagtgc tgcaatgata
    10021 ccgcgagacc cacgctcacc ggctccagat ttatcagcaa taaaccagcc agccggaagg
    10081 gccgagcgca gaagtggtcc tgcaacttta tccgcctcca tccagtctat taattgttgc
    10141 cgggaagcta gagtaagtag ttcgccagtt aatagtttgc gcaacgttgt tgccattgct
    10201 acaggcatcg tggtgtcacg ctcgtcgttt ggtatggctt cattcagctc cggttcccaa
    10261 cgatcaaggc gagttacatg atcccccatg ttgtgcaaaa aagcggttag ctccttcggt
    10321 cctccgatcg ttgtcagaag taagttggcc gcagtgttat cactcatggt tatggcagca
    10381 ctgcataatt ctcttactgt catgccatcc gtaagatgct tttctgtgac tggtgagtac
    10441 tcaaccaagt cattctgaga atagtgtatg cggcgaccga gttgctcttg cccggcgtca
    10501 atacgggata ataccgcgcc acatagcaga actttaaaag tgctcatcat tggaaaacgt
    10561 tcttcggggc gaaaactctc aaggatctta ccgctgttga gatccagttc gatgtaaccc
    10621 actcgtgcac ccaactgatc ttcagcatct tttactttca ccagcgtttc tgggtgagca
    10681 aaaacaggaa ggcaaaatgc cgcaaaaaag ggaataaggg cgacacggaa atgttgaata
    10741 ctcatactct tcctttttca atattattga agcatttatc agggttattg tctcatgagc
    10801 ggatacatat ttgaatgtat ttagaaaaat aaacaaatag gggttccgcg cacatttccc
    10861 cgaaaagtgc cacctgacgt cgacggatcg ggagatcaac ttgtttattg cagcttataa
    10921 tggttacaaa taaagcaata gcatcacaaa tttcacaaat aaagcatttt tttcactgca
    10981 ttctagttgt ggtttgtcca aactcatcaa tgtatcttat catgtctgga tcaactggat
    11041 aactcaagct aaccaaaatc atcccaaact tcccacccca taccctatta ccactgccaa
    11101 ttacctgtgg tttcatttac tctaaacctg tgattcctct gaattatttt cattttaaag
    11161 aaattgtatt tgttaaatat gtactacaaa cttagtagt
//