BLASTX 1.4.13-Paracel [2002-12-12]
Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= At1g01880.f
(1713 letters)
Database: BlastDB/NCBI/blast/db/2003_02_27_02_00_4/nr
1,347,713 sequences; 431,508,457 total letters
Searching...................................................done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_171691.1| hypothetical protein; protein id: At1g0188... 1117 0.0
gb|AAF76467.1|AC020622_1 Contains similarity to excision re... 796 0.0
dbj|BAC27207.1| unnamed protein product [Mus musculus] 195 2e-48
ref|XP_147642.2| similar to GM10765p [Drosophila melanogast... 195 2e-48
dbj|BAC27242.1| unnamed protein product [Mus musculus] 194 3e-48
>ref|NP_171691.1| hypothetical protein; protein id: At1g01880.1 [Arabidopsis thaliana]
Length = 570
Score = 1117 bits (2890), Expect = 0.0
Identities = 550/570 (96%), Positives = 550/570 (96%)
Frame = +1
Query: 1 MGVGGNFWDLLRPYAQQQGFDFLRNKRVAVDLSFWIVQHETAVKGFVLKPHLRLTFFRTI 180
MGVGGNFWDLLRPYAQQQGFDFLRNKRVAVDLSFWIVQHETAVKGFVLKPHLRLTFFRTI
Sbjct: 1 MGVGGNFWDLLRPYAQQQGFDFLRNKRVAVDLSFWIVQHETAVKGFVLKPHLRLTFFRTI 60
Query: 181 NLFSKFGAYPVFVVDGTPSPLKSQARISRFFRSSGIDTCNLPVIKDGVSVERNKLFSEWV 360
NLFSKFGAYPVFVVDGTPSPLKSQARISRFFRSSGIDTCNLPVIKDGVSVERNKLFSEWV
Sbjct: 61 NLFSKFGAYPVFVVDGTPSPLKSQARISRFFRSSGIDTCNLPVIKDGVSVERNKLFSEWV 120
Query: 361 RECVELLELLGIPVLKANGEAEALCAQLNSQGFVDACITPDSDAFLFGAMCVIKDIKPNS 540
RECVELLELLGIPVLKANGEAEALCAQLNSQGFVDACITPDSDAFLFGAMCVIKDIKPNS
Sbjct: 121 RECVELLELLGIPVLKANGEAEALCAQLNSQGFVDACITPDSDAFLFGAMCVIKDIKPNS 180
Query: 541 REPFECYHMSHIESGLGLKRKHLIAISLLVGNDYDSGGVLGIGVDKALRIVREFSEDQVL 720
REPFECYHMSHIESGLGLKRKHLIAISLLVGNDYDSGGVLGIGVDKALRIVREFSEDQVL
Sbjct: 181 REPFECYHMSHIESGLGLKRKHLIAISLLVGNDYDSGGVLGIGVDKALRIVREFSEDQVL 240
Query: 721 ERLQDIGNGLQPAVPGGIKSGDDGEEFRSEMKKRSPHCSRCGHLGSKRTHFKSSCEHCGC 900
ERLQDIGNGLQPAVPGGIKSGDDGEEFRSEMKKRSPHCSRCGHLGSKRTHFKSSCEHCGC
Sbjct: 241 ERLQDIGNGLQPAVPGGIKSGDDGEEFRSEMKKRSPHCSRCGHLGSKRTHFKSSCEHCGC 300
Query: 901 DSGCIKKPLGFRCECSFCSKDRDLREQKKTNDWWIKVCDKIALAPEFPNRKIIELYLSDG 1080
DSGCIKKPLGFRCECSFCSKDRDLREQKKTNDWWIKVCDKIALAPEFPNRKIIELYLSDG
Sbjct: 301 DSGCIKKPLGFRCECSFCSKDRDLREQKKTNDWWIKVCDKIALAPEFPNRKIIELYLSDG 360
Query: 1081 LMTGDGSSMSWGTPDTGMLVDLMVFKLHWDPSYVRKMLLPMLSTIYLREKARNNTGYALL 1260
LMTGDGSSMSWGTPDTGMLVDLMVFKLHWDPSYVRKMLLPMLSTIYLREKARNNTGYALL
Sbjct: 361 LMTGDGSSMSWGTPDTGMLVDLMVFKLHWDPSYVRKMLLPMLSTIYLREKARNNTGYALL 420
Query: 1261 CDQYEFHSINXXXXXXXXXXXXXXXXXXXXGLNEPQVQNDNGDCFLLTDECIGLVQSAFP 1440
CDQYEFHSIN GLNEPQVQNDNGDCFLLTDECIGLVQSAFP
Sbjct: 421 CDQYEFHSINEPEESIVVLEEEEESVDPLDGLNEPQVQNDNGDCFLLTDECIGLVQSAFP 480
Query: 1441 DETEHFLHEKKLRESKKKNVSEEETATPRATTMGVQRSITDFYRSAKKAAAGQSIETGGS 1620
DETEHFLHEKKLRESKKKNVSEEETATPRATTMGVQRSITDFYRSAKKAAAGQSIETGGS
Sbjct: 481 DETEHFLHEKKLRESKKKNVSEEETATPRATTMGVQRSITDFYRSAKKAAAGQSIETGGS 540
Query: 1621 SKASAEKKRQATSTSSSNLTKSVRRRLLFG 1710
SKASAEKKRQATSTSSSNLTKSVRRRLLFG
Sbjct: 541 SKASAEKKRQATSTSSSNLTKSVRRRLLFG 570
>gb|AAF76467.1|AC020622_1 Contains similarity to excision repair protein ERCC5 from Homo
sapiens gi|1082359 and contains XPG N-terminal PF|00752
and XPG I-region PF|00867 domains. [Arabidopsis
thaliana]|gi|25513625|pir||F86150 F22M8.2 protein -
Arabidopsis thaliana
Length = 497
Score = 796 bits (2055), Expect = 0.0
Identities = 384/384 (100%), Positives = 384/384 (100%)
Frame = +1
Query: 1 MGVGGNFWDLLRPYAQQQGFDFLRNKRVAVDLSFWIVQHETAVKGFVLKPHLRLTFFRTI 180
MGVGGNFWDLLRPYAQQQGFDFLRNKRVAVDLSFWIVQHETAVKGFVLKPHLRLTFFRTI
Sbjct: 1 MGVGGNFWDLLRPYAQQQGFDFLRNKRVAVDLSFWIVQHETAVKGFVLKPHLRLTFFRTI 60
Query: 181 NLFSKFGAYPVFVVDGTPSPLKSQARISRFFRSSGIDTCNLPVIKDGVSVERNKLFSEWV 360
NLFSKFGAYPVFVVDGTPSPLKSQARISRFFRSSGIDTCNLPVIKDGVSVERNKLFSEWV
Sbjct: 61 NLFSKFGAYPVFVVDGTPSPLKSQARISRFFRSSGIDTCNLPVIKDGVSVERNKLFSEWV 120
Query: 361 RECVELLELLGIPVLKANGEAEALCAQLNSQGFVDACITPDSDAFLFGAMCVIKDIKPNS 540
RECVELLELLGIPVLKANGEAEALCAQLNSQGFVDACITPDSDAFLFGAMCVIKDIKPNS
Sbjct: 121 RECVELLELLGIPVLKANGEAEALCAQLNSQGFVDACITPDSDAFLFGAMCVIKDIKPNS 180
Query: 541 REPFECYHMSHIESGLGLKRKHLIAISLLVGNDYDSGGVLGIGVDKALRIVREFSEDQVL 720
REPFECYHMSHIESGLGLKRKHLIAISLLVGNDYDSGGVLGIGVDKALRIVREFSEDQVL
Sbjct: 181 REPFECYHMSHIESGLGLKRKHLIAISLLVGNDYDSGGVLGIGVDKALRIVREFSEDQVL 240
Query: 721 ERLQDIGNGLQPAVPGGIKSGDDGEEFRSEMKKRSPHCSRCGHLGSKRTHFKSSCEHCGC 900
ERLQDIGNGLQPAVPGGIKSGDDGEEFRSEMKKRSPHCSRCGHLGSKRTHFKSSCEHCGC
Sbjct: 241 ERLQDIGNGLQPAVPGGIKSGDDGEEFRSEMKKRSPHCSRCGHLGSKRTHFKSSCEHCGC 300
Query: 901 DSGCIKKPLGFRCECSFCSKDRDLREQKKTNDWWIKVCDKIALAPEFPNRKIIELYLSDG 1080
DSGCIKKPLGFRCECSFCSKDRDLREQKKTNDWWIKVCDKIALAPEFPNRKIIELYLSDG
Sbjct: 301 DSGCIKKPLGFRCECSFCSKDRDLREQKKTNDWWIKVCDKIALAPEFPNRKIIELYLSDG 360
Query: 1081 LMTGDGSSMSWGTPDTGMLVDLMV 1152
LMTGDGSSMSWGTPDTGMLVDLMV
Sbjct: 361 LMTGDGSSMSWGTPDTGMLVDLMV 384
Score = 224 bits (571), Expect = 3e-57
Identities = 114/114 (100%), Positives = 114/114 (100%)
Frame = +1
Query: 1369 VQNDNGDCFLLTDECIGLVQSAFPDETEHFLHEKKLRESKKKNVSEEETATPRATTMGVQ 1548
VQNDNGDCFLLTDECIGLVQSAFPDETEHFLHEKKLRESKKKNVSEEETATPRATTMGVQ
Sbjct: 384 VQNDNGDCFLLTDECIGLVQSAFPDETEHFLHEKKLRESKKKNVSEEETATPRATTMGVQ 443
Query: 1549 RSITDFYRSAKKAAAGQSIETGGSSKASAEKKRQATSTSSSNLTKSVRRRLLFG 1710
RSITDFYRSAKKAAAGQSIETGGSSKASAEKKRQATSTSSSNLTKSVRRRLLFG
Sbjct: 444 RSITDFYRSAKKAAAGQSIETGGSSKASAEKKRQATSTSSSNLTKSVRRRLLFG 497
>dbj|BAC27207.1| unnamed protein product [Mus musculus]
Length = 908
Score = 195 bits (495), Expect = 2e-48
Identities = 134/414 (32%), Positives = 201/414 (48%), Gaps = 4/414 (0%)
Frame = +1
Query: 1 MGVGGNFWDLLRPYAQQQGFDFLRNKRVAVDLSFWIVQHETAVK--GFVLKPHLRLTFFR 174
MGV + W +L P Q L K +AVDLS W+ + +T K G V KPHLR FFR
Sbjct: 1 MGVN-DLWQILEPVKQHIHLQDLSGKTIAVDLSLWVCEAQTVKKMIGTVKKPHLRNLFFR 59
Query: 175 TINLFSKFGAYPVFVVDGTPSPLKSQARISRFFRSSGIDTCNLPVIKDGVSVERNKLFSE 354
I+ ++ VFV++G P LK+ R G P K F
Sbjct: 60 -ISYLTQMNVKLVFVMEGEPPMLKADVISKRTQTRYG------PSGKSRSQKTGRSHFKS 112
Query: 355 WVRECVELLELLGIPVLKANGEAEALCAQLNSQGFVDACITPDSDAFLFGAMCVIKDIKP 534
+REC+E+LE LG+P ++A GEAEA+CA LN+ G VD C+T D DAFL+GA V ++
Sbjct: 113 VLRECLEMLECLGMPWVQAAGEAEAMCAYLNASGHVDGCLTNDGDAFLYGAQTVYRNFTM 172
Query: 535 NSREP-FECYHMSHIESGLGLKRKHLIAISLLVGNDYDSGGVLGIGVDKALRIVREFSED 711
N+++P +CY +S I+S LGL R L+ +++L+G DY GV G+G ++AL++++ F
Sbjct: 173 NTKDPHVDCYTISSIKSKLGLDRDALVGLAVLLGCDYLPKGVPGVGKEQALKLLQIFKGQ 232
Query: 712 QVLERLQDIGNGLQPAVPGGIKSGDDGEEFRSEMKKRSPHCSRCGHLGSKRTHFKSSCEH 891
+L+R +VP K+ HCS C H GS + H ++ C
Sbjct: 233 SLLQRFNQWIEDPCYSVP-------------QSAPKKVVHCSVCSHPGSPKDHERNGCIL 279
Query: 892 CGCDSGCIKKPLGFRCECSFCSKDRDLREQKKTNDWWIKVCDKIALAPEFPNRKII-ELY 1068
C D C + C C + D + + N+ K C FP ++I E
Sbjct: 280 CKSDKYCEPHDYDYLCPCEWHQTDHNRHLSEIENNIKKKACS----CEGFPFHEVIQEFL 335
Query: 1069 LSDGLMTGDGSSMSWGTPDTGMLVDLMVFKLHWDPSYVRKMLLPMLSTIYLREK 1230
L+ M +++ PD + V K+ W Y + LL +L+ + E+
Sbjct: 336 LNKNKML---KPITYQRPDLLLFQRFTVQKMEWPSHYACEKLLVLLTRYDMIER 386
>ref|XP_147642.2| similar to GM10765p [Drosophila melanogaster] [Mus musculus]
Length = 908
Score = 195 bits (495), Expect = 2e-48
Identities = 134/414 (32%), Positives = 201/414 (48%), Gaps = 4/414 (0%)
Frame = +1
Query: 1 MGVGGNFWDLLRPYAQQQGFDFLRNKRVAVDLSFWIVQHETAVK--GFVLKPHLRLTFFR 174
MGV + W +L P Q L K +AVDLS W+ + +T K G V KPHLR FFR
Sbjct: 1 MGVN-DLWQILEPVKQHIHLQDLSGKTIAVDLSLWVCEAQTVKKMIGTVKKPHLRNLFFR 59
Query: 175 TINLFSKFGAYPVFVVDGTPSPLKSQARISRFFRSSGIDTCNLPVIKDGVSVERNKLFSE 354
I+ ++ VFV++G P LK+ R G P K F
Sbjct: 60 -ISYLTQMNVKLVFVMEGEPPMLKADVISKRTQTRYG------PSGKSRSQKTGRSHFKS 112
Query: 355 WVRECVELLELLGIPVLKANGEAEALCAQLNSQGFVDACITPDSDAFLFGAMCVIKDIKP 534
+REC+E+LE LG+P ++A GEAEA+CA LN+ G VD C+T D DAFL+GA V ++
Sbjct: 113 VLRECLEMLECLGMPWVQAAGEAEAMCAYLNASGHVDGCLTNDGDAFLYGAQTVYRNFTM 172
Query: 535 NSREP-FECYHMSHIESGLGLKRKHLIAISLLVGNDYDSGGVLGIGVDKALRIVREFSED 711
N+++P +CY +S I+S LGL R L+ +++L+G DY GV G+G ++AL++++ F
Sbjct: 173 NTKDPHVDCYTISSIKSKLGLDRDALVGLAVLLGCDYLPKGVPGVGKEQALKLLQIFKGQ 232
Query: 712 QVLERLQDIGNGLQPAVPGGIKSGDDGEEFRSEMKKRSPHCSRCGHLGSKRTHFKSSCEH 891
+L+R +VP K+ HCS C H GS + H ++ C
Sbjct: 233 SLLQRFNQWIEDPCYSVP-------------QSAPKKVVHCSVCSHPGSPKDHERNGCIL 279
Query: 892 CGCDSGCIKKPLGFRCECSFCSKDRDLREQKKTNDWWIKVCDKIALAPEFPNRKII-ELY 1068
C D C + C C + D + + N+ K C FP ++I E
Sbjct: 280 CKSDKYCEPHDYDYLCPCEWHQTDHNRHLSEIENNIKKKACS----CEGFPFHEVIQEFL 335
Query: 1069 LSDGLMTGDGSSMSWGTPDTGMLVDLMVFKLHWDPSYVRKMLLPMLSTIYLREK 1230
L+ M +++ PD + V K+ W Y + LL +L+ + E+
Sbjct: 336 LNKNKML---KPITYQRPDLLLFQRFTVQKMEWPSHYACEKLLVLLTRYDMIER 386
>dbj|BAC27242.1| unnamed protein product [Mus musculus]
Length = 908
Score = 194 bits (494), Expect = 3e-48
Identities = 133/414 (32%), Positives = 201/414 (48%), Gaps = 4/414 (0%)
Frame = +1
Query: 1 MGVGGNFWDLLRPYAQQQGFDFLRNKRVAVDLSFWIVQHETAVK--GFVLKPHLRLTFFR 174
MGV + W +L P Q L K +AVDLS W+ + +T K G + KPHLR FFR
Sbjct: 1 MGVN-DLWQILEPVKQHIHLQDLSGKTIAVDLSLWVCEAQTVKKMIGTIKKPHLRNLFFR 59
Query: 175 TINLFSKFGAYPVFVVDGTPSPLKSQARISRFFRSSGIDTCNLPVIKDGVSVERNKLFSE 354
I+ ++ VFV++G P LK+ R G P K F
Sbjct: 60 -ISYLTQMNVKLVFVMEGEPPMLKADVISKRTQTRYG------PSGKSRSQKTGRSHFKS 112
Query: 355 WVRECVELLELLGIPVLKANGEAEALCAQLNSQGFVDACITPDSDAFLFGAMCVIKDIKP 534
+REC+E+LE LG+P ++A GEAEA+CA LN+ G VD C+T D DAFL+GA V ++
Sbjct: 113 VLRECLEMLECLGMPWVQAAGEAEAMCAYLNASGHVDGCLTNDGDAFLYGAQTVYRNFTM 172
Query: 535 NSREP-FECYHMSHIESGLGLKRKHLIAISLLVGNDYDSGGVLGIGVDKALRIVREFSED 711
N+++P +CY +S I+S LGL R L+ +++L+G DY GV G+G ++AL++++ F
Sbjct: 173 NTKDPHVDCYTISSIKSKLGLDRDALVGLAVLLGCDYLPKGVPGVGKEQALKLLQIFKGQ 232
Query: 712 QVLERLQDIGNGLQPAVPGGIKSGDDGEEFRSEMKKRSPHCSRCGHLGSKRTHFKSSCEH 891
+L+R +VP K+ HCS C H GS + H ++ C
Sbjct: 233 SLLQRFNQWIEDPCYSVP-------------QSAPKKVVHCSVCSHPGSPKDHERNGCIL 279
Query: 892 CGCDSGCIKKPLGFRCECSFCSKDRDLREQKKTNDWWIKVCDKIALAPEFPNRKII-ELY 1068
C D C + C C + D + + N+ K C FP ++I E
Sbjct: 280 CKSDKYCEPHDYDYLCPCEWHQTDHNRHLSEIENNIKKKACS----CEGFPFHEVIQEFL 335
Query: 1069 LSDGLMTGDGSSMSWGTPDTGMLVDLMVFKLHWDPSYVRKMLLPMLSTIYLREK 1230
L+ M +++ PD + V K+ W Y + LL +L+ + E+
Sbjct: 336 LNKNKML---KPITYQRPDLLLFQRFTVQKMEWPSHYACEKLLVLLTRYDMIER 386
Database: BlastDB/NCBI/blast/db/2003_02_27_02_00_4/nr
Posted date: Feb 27, 2003 4:02 PM
Number of letters in database: 431,508,457
Number of sequences in database: 1,347,713
Lambda K H
0.318 0.135 0.401
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
length of database: 431,508,457
effective HSP length: 128
effective length of database: 259,001,193
effective search space used: 114478527306
frameshift window, decay const: 50, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)