/examples/data/ensembl_pax_sub.fasta
http://github.com/sbotond/phylosim
- >ENSOGAP00000013678
- ------------------------------------------------------------
- ------------------------------------------------------------
- ------------------------------------------------------------
- ------------------------------------------------------------
- ------------------------------------------------------------
- ------------------------------------------------------------
- ------------------------------------------------------------
- ------------------------------------------------------------
- -GHGGVNQLGGVFVNGRPL--------------PDVVRQRIVELAHQ-GVRPCDISRQLR
- ------------------VSHGCVSKILG--------RYYETGSIKPGVI------GGSK
- PK-VATPKVVDKIAEYKRQNPTMFAWEI----------------------RDRLLAEGIC
- ------------------------------------------------------------
- ------------------------------------------------------------
- DNDTVPS------------------------VSSINRIIRTKV-----------------
- ----------------------QQPFH---------------------------------
- ------------------------------------------------------------
- ---------------------------------------------PTPDG-AGTGVTAPG
- --HTIVPSTASPPVS----------------SAS-------------NDPVGS---YSIN
- GILGI------PRSNGE-------------KRKRDEVEVYTDPAHIRGGGGLHLVWTLRX
- X-XXXXX----------------------------XXXXXXXXXX---------------
- -------------------------X------XXXXXXXXXX-XXXXX------------
- --------------XXXXXX-XXXXXXXXXXXXXX-------XXXXXX--GNE-YS-L--
- PALTP-G-------LDEVKSSLSAS-TNPELG----------------------------
- -------------------------------------------------------NNVS-
- -GTQTYP--------------VVT------------------------------------
- ---------------------------------------------------GRDMASTT-
- --LPG--------------------------------------------------YPPHV
- PP----------------------------------------------------------
- --------------------------------------------------------TGQG
- S----YPTST-----L-----------AGMVP----------------------------
- ------------------GX---------------------------------XXXXNPY
- SHP------QYTAYNE-AWRFSNP------------ALL-------------------MP
- PPGAPPL-----------------------------PLLPLP----MTATS--YRGDH--
- ----IKLQADSFGLHIVPV---------------
- >ENSPTRP00000005013
- ----------------------MDMHCKADP---FSAM----HP----------------
- ------------------------------------------------------------
- ------------------------------------------------------------
- ------------------------------------------------------------
- ------------------------------------------------------------
- ------------------------------------------------------------
- ------------------------------------------------------------
- ------------------------------------------------------------
- -GHGGVNQLGGVFVNGRPL--------------PDVVRQRIVELAHQ-GVRPCDISRQLR
- ------------------VSHGCVSKILG--------RYYETGSIKPGVI------GGSK
- PK-VATPKVVDKIAEYKRQNPTMFAWEI----------------------RDRLLAEGIC
- ------------------------------------------------------------
- ------------------------------------------------------------
- DNDTVPS------------------------VSSINRIIRTKV-----------------
- ----------------------QQPFH---------------------------------
- ------------------------------------------------------------
- ---------------------------------------------PTPDG-AGTGVTAPG
- --HTIVPSTASPPVS----------------SAS-------------NDPVGS---YSIN
- GILGI------PRSNGE-------------KRKRDEVEVYTDPAHIRGGGGLHLVWTLRD
- V-SEGSV----------------------------PNGDSQSGVD---------------
- -------------------------S------LRKHLRADTF-TQQQL------------
- --------------EALDRV-FERPSYPDVFQASE-------HIKSEQ--GNE-YS-L--
- PALTP-G-------LDEVKSSLSAS-TNPELG----------------------------
- -------------------------------------------------------SNVS-
- -GTQTYP--------------VVT------------------------------------
- ---------------------------------------------------GRDMASTT-
- --LPG--------------------------------------------------YPPHV
- PP----------------------------------------------------------
- --------------------------------------------------------TGQG
- S----YPTST-----L-----------AGMVP----------------------------
- ------------------EA---------------------------------AVGPSSS
- --------------------------------------L-------------------MS
- KPGRKLA-----------------------------EVPPCV----QPTGA--SSPATRT
- ATPSTRPTTRLGDSATPPY---------------
- >ENSGGOP00000014550
- ----------------------MAGPCCVWGVVFFSCL----SP--A-------------
- ------------------------------------------------------------
- ------------------------------------------------------------
- ------------------------------------------------------------
- ------------------------------------------------------------
- ------------------------------------------------------------
- ------------------------------------------------------------
- ------------------------------------------------------------
- -GHGGVNQLGGVFVNGRPL--------------PDVVRQRIVELAHQ-GVRPCDISRQLR
- ------------------VSHGCVSKILG--------RYYETGSIKPGVI------GGSK
- PK-VATPKVVDKIAEYKRQNPTMFAWEI----------------------RDRLLAEGIC
- ------------------------------------------------------------
- ------------------------------------------------------------
- DNDTVPS------------------------VSSINRIIRTKV-----------------
- ----------------------QQPFH---------------------------------
- ------------------------------------------------------------
- ---------------------------------------------PTPDG-AGTGVTAPG
- --HTIVPSTASPPVS----------------SAS-------------NDPVGS---YSIN
- GILGI------PRSNGE-------------KRKRDEVEVYTDPAHIRGGGGLHLVWTLRD
- V-SEGSV----------------------------PNGDSQSGVD---------------
- -------------------------S------LRKHLRADTF-TQQQL------------
- --------------EALDRV-FERPSYPDVFQASE-------HIKSEQ--GNE-YS-L--
- PALTP-G-------LDEVKSSLSAS-TNPELG----------------------------
- -------------------------------------------------------SNVS-
- -GTQTYP--------------VVT------------------------------------
- ---------------------------------------------------GRDMASTT-
- --LPG--------------------------------------------------YPPHV
- PP----------------------------------------------------------
- --------------------------------------------------------TGQG
- S----YPTST-----L-----------AGMVP----------------------------
- ------------------EA---------------------------------AVGPSSS
- --------------------------------------L-------------------MS
- KPGRKLA-----------------------------EVPPCV----QPTVC--HGPSTAP
- THPSLCP---------------------------
- >ENSMMUP00000012017
- ----------------------MDMHCKADP---FSAM----HP----------------
- ------------------------------------------------------------
- ------------------------------------------------------------
- ------------------------------------------------------------
- ------------------------------------------------------------
- ------------------------------------------------------------
- ------------------------------------------------------------
- ------------------------------------------------------------
- -GHGGVNQLGGVFVNGRPL--------------PDVVRQRIVELAHQ-GVRPCDISRQLR
- ------------------VSHGCVSKILG--------RYYETGSIKPGVI------GGSK
- PK-VATPKVVDKIAEYKRQNPTMFAWEI----------------------RDRLLAEGIC
- ------------------------------------------------------------
- ------------------------------------------------------------
- DNDTVPS------------------------VSSINRIIRTKV-----------------
- ----------------------QQPFH---------------------------------
- ------------------------------------------------------------
- ---------------------------------------------PTPDG-AGTGVTAPG
- --HTIVPSTASPPVS----------------SAS-------------NDPVGS---YSIN
- GILGI------PRSNGE-------------KRKRDEVEVYTDPAHIRGGGGLHLVWTLRD
- V-SEGSV----------------------------PNGDSQSGVD---------------
- -------------------------S------LRKHLRADTF-TQQQL------------
- --------------EALDRV-FERPSYPDVFQASE-------HIKSEQ--GNE-YS-L--
- PALTP-G-------LDEVKSSLSAS-TNPELG----------------------------
- -------------------------------------------------------SNVS-
- -GTQTYP--------------VVT------------------------------------
- ---------------------------------------------------GRDMASTT-
- --LPG--------------------------------------------------YPPHV
- PP----------------------------------------------------------
- --------------------------------------------------------TGQG
- S----YPTST-----L-----------AGMVP----------------------------
- ------------------GS---------------------------------EFSGNPY
- SHP------QYTAYNE-AWRFSNP------------ALL-------------------MP
- PPGAPPL-----------------------------PLLPL------------PMTATSY
- RGDHIKLQADSFGLHIVPV---------------
- >ENSMICP00000001240
- ----------------------MDMHCKADP---FSAM----HP----------------
- ------------------------------------------------------------
- ------------------------------------------------------------
- ------------------------------------------------------------
- ------------------------------------------------------------
- ------------------------------------------------------------
- ------------------------------------------------------------
- ------------------------------------------------------------
- -GHGGVNQLGGVFVNGRPL--------------PDVVRQRIVELAHQ-GVRPCDISRQLR
- ------------------VSHGCVSKILG--------RYYETGSIKPGVI------GGSK
- PK-VATPKVVDKIAEYKRQNPTMFAWEI----------------------RDRLLAEGIC
- ------------------------------------------------------------
- ------------------------------------------------------------
- DNDTVPS------------------------VSSINRIIRTKV-----------------
- ----------------------QQPFH---------------------------------
- ------------------------------------------------------------
- ---------------------------------------------PTPDG-AGTGVTAPG
- --HTIVPSTASPPVS----------------SAS-------------NDPVGS---YSIN
- GILGI------PRSNGE-------------KRKRDEGEVYTDPVHIRGGGGLHLVWTLRX
- X-XXXXV----------------------------PNGDSQSGVD---------------
- -------------------------S------LRKHLRADTF-TQQQL------------
- --------------EALDRV-FERPSYPDVFQASE-------HIKSEQ--GNE-YS-L--
- PALTP-G-------LDEVKSSLSAS-TNPELG----------------------------
- -------------------------------------------------------SNVS-
- -GTQTYP--------------VVT------------------------------------
- ---------------------------------------------------GRDMASTT-
- --LPG--------------------------------------------------YPPHV
- PP----------------------------------------------------------
- --------------------------------------------------------TGQG
- S----YPTST-----L-----------AGMVP----------------------------
- ------------------GS---------------------------------EFSGNPY
- SHP------QYTAYNE-AWRFSNP------------ALL---------------------
- ------------------------------------------------------------
- ----------------------------------
- >ENSSTOP00000010388
- ------------------------------------------------------------
- ------------------------------------------------------------
- ------------------------------------------------------------
- ------------------------------------------------------------
- ------------------------------------------------------------
- ------------------------------------------------------------
- ------------------------------------------------------------
- ------------------------------------------------------------
- -GHGGV-QLGGVXXXXXXX--------------XXXXXXXXXXXXXX-XXXXXXXXXXXX
- ------------------XXXGCVSKILG--------RYYETGSIKPGVI------GGSK
- PK-VATPKVVDKIAEYKRQNPTMFAWEI----------------------RDRLLAEGIC
- ------------------------------------------------------------
- ------------------------------------------------------------
- DNDTVPS------------------------VSSINXXXXXXX-----------------
- ----------------------XXXXX---------------------------------
- ------------------------------------------------------------
- ---------------------------------------------XXXXX-XXXXXXXXX
- --XXXXXXXXXXXXX----------------XXX-------------XXXXXX---XXXX
- XXXXX------XXXXXX-------------XXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
- X-XXXXX----------------------------XXXXXXXXXX---------------
- -------------------------X------XXXXXXXXXX-XXXXX------------
- --------------XXXXXX-XXXXXXXXXXXXXX-------XXXXXX--GNE-YS-L--
- PALTP-G-------LDEVKSSLSAS-TNPELG----------------------------
- -------------------------------------------------------SNVS-
- -GTQTYP--------------VVT------------------------------------
- ---------------------------------------------------GRDMASTT-
- --LPG--------------------------------------------------YPPHV
- PP----------------------------------------------------------
- --------------------------------------------------------TGQG
- S----YPTST-----L-----------AGMVP----------------------------
- ------------------GS---------------------------------EFSGNPY
- SHP------QYTAYNE-AWRFSNP------------ALL---------------------
- ------------------------------------------------------------
- ----------------------------------
- >ENSCJAP00000032077
- ----------------------MDMHCKADP---FSAM----HP----------------
- ------------------------------------------------------------
- ------------------------------------------------------------
- ------------------------------------------------------------
- ------------------------------------------------------------
- ------------------------------------------------------------
- ------------------------------------------------------------
- ------------------------------------------------------------
- -GHGGVNQLGGVFVNGRPL--------------PDVVRQRIVELAHQ-GVRPCDISRQLR
- ------------------VSHGCVSKILG--------RYYETGSIKPGVI------GGSK
- PK-VATPKVVDKIAEYKRQNPTMFAWEI----------------------RDRLLAEGIC
- ------------------------------------------------------------
- ------------------------------------------------------------
- DNDTVPS------------------------VSSINRIIRTKV-----------------
- ----------------------QQPFH---------------------------------
- ------------------------------------------------------------
- ---------------------------------------------PTPDG-AGTGVTAPG
- --HTIVPSTASPPVS----------------SAS-------------NDPVGS---YSIN
- GILGI------PRSNGE-------------KRKRDEVEVYTDPAHIRGGGGLHLVWTLRD
- V-SEGSV----------------------------PNGDSQSGVD---------------
- -------------------------S------LRKHLRADTF-TQQQL------------
- --------------EALDRV-FERPSYPDVFQASE-------HIKSEQ--GNE-YS-L--
- PALTP-G-------LDEVKSSLSAS-TNPELG----------------------------
- -------------------------------------------------------SNVS-
- -GTQTYP--------------VVT------------------------------------
- ---------------------------------------------------GRDMASTT-
- --LPG--------------------------------------------------YPPHV
- PP----------------------------------------------------------
- --------------------------------------------------------TGQG
- S----YPTST-----L-----------AGMVP----------------------------
- ------------------GS---------------------------------EFSGNPY
- SHP------QYTAYNE-AWRFSNP------------ALL-------------------SS
- PYYYSAA-----------------------------PRGSAP----AAAAA--AYDRH--
- ----------------------------------
- >ENSDORP00000005799
- ----------------------MDMHCKADP---FSAM----HP----------------
- ------------------------------------------------------------
- ------------------------------------------------------------
- ------------------------------------------------------------
- ------------------------------------------------------------
- ------------------------------------------------------------
- ------------------------------------------------------------
- ------------------------------------------------------------
- -EHGGVNQLGG-FVNGRP---------------PDVV-QRIVELAQQ-GVRPCDISRQLR
- ------------------VSHGCVSKILG--------RYYETGSIKPGVI------GGSK
- PK-VATPKVVDKIAEYKRQNPTMFAWEI----------------------RDRLLAEGIC
- ------------------------------------------------------------
- ------------------------------------------------------------
- DNDTVPS------------------------VSSINRIIRTKV-----------------
- ----------------------QQPFH---------------------------------
- ------------------------------------------------------------
- ---------------------------------------------PTPDG-AGTGVTAPG
- --HTIVPSNASPPVS----------------SAS-------------NDPEGS---YSIN
- GXXXX------XXXXXX-------------XXXXXXXEVYTDPAHIRGGRGLQLVWTLRD
- V-SEGSV----------------------------PNGDSQSGVD---------------
- -------------------------S------LRKHLRADTF-TQQQL------------
- --------------EALDRV-FERPSYPDVFQASE-------HIKSEQ--GNE-YS-L--
- PALTP-G-------LDEVKSSLSAS-TNPELG----------------------------
- -------------------------------------------------------SNVS-
- -GTQTYP--------------VVT------------------------------------
- ---------------------------------------------------GRDMASTT-
- --LPG--------------------------------------------------YPPHV
- PP----------------------------------------------------------
- --------------------------------------------------------TGQG
- S----YPTST-----L-----------AGMVP----------------------------
- -----------------------------------------------------GAAVGPS
- SS-----------------HMSNP------------GFT-------------------E-
- ----------------------------------------VR----MTXXX--XXXXXXX
- XXXXXXXXXXXXXXXXXHY---------------
- >ENSP00000359319
- ----------------------MDMHCKADP---FSAM----HP----------------
- ------------------------------------------------------------
- ------------------------------------------------------------
- ------------------------------------------------------------
- ------------------------------------------------------------
- ------------------------------------------------------------
- ------------------------------------------------------------
- ------------------------------------------------------------
- -GHGGVNQLGGVFVNGRPL--------------PDVVRQRIVELAHQ-GVRPCDISRQLR
- ------------------VSHGCVSKILG--------RYYETGSIKPGVI------GGSK
- PK-VATPKVVDKIAEYKRQNPTMFAWEI----------------------RDRLLAEGIC
- ------------------------------------------------------------
- ------------------------------------------------------------
- DNDTVPS------------------------VSSINRIIRTKV-----------------
- ----------------------QQPFH---------------------------------
- ------------------------------------------------------------
- ---------------------------------------------PTPDG-AGTGVTAPG
- --HTIVPSTASPPVS----------------SAS-------------NDPVGS---YSIN
- GILGI------PRSNGE-------------KRKRDEVEVYTDPAHIRGGGGLHLVWTLRD
- V-SEGSV----------------------------PNGDSQSGVD---------------
- -------------------------S------LRKHLRADTF-TQQQL------------
- --------------EALDRV-FERPSYPDVFQASE-------HIKSEQ--GNE-YS-L--
- PALTP-G-------LDEVKSSLSAS-TNPELG----------------------------
- -------------------------------------------------------SNVS-
- -GTQTYP--------------VVT------------------------------------
- ---------------------------------------------------GRDMASTT-
- --LPG--------------------------------------------------YPPHV
- PP----------------------------------------------------------
- --------------------------------------------------------TGQG
- S----YPTST-----L-----------AGMVP----------------------------
- ------------------EA---------------------------------AVGPSSS
- --------------------------------------L-------------------MS
- KPGRKLA-----------------------------EVPPCV----QPTGA--SSPATRT
- ATPSTRPTTRLGDSATPPY---------------
- >ENSCPOP00000000844
- ----------------------MDMHCKADP---FSAM----HP----------------
- ------------------------------------------------------------
- ------------------------------------------------------------
- ------------------------------------------------------------
- ------------------------------------------------------------
- ------------------------------------------------------------
- ------------------------------------------------------------
- ------------------------------------------------------------
- -GHGGVNQLGGVFVNGRPL--------------PDVVRQRIVELAHQ-GVRPCDISRQLR
- ------------------VSHGCVSKILG--------RYYETGSIKPGVI------GGSK
- PK-VATPKVVDKIAEYKRQNPTMFAWEI----------------------RDRLLAEGIC
- ------------------------------------------------------------
- ------------------------------------------------------------
- DNDTVPS------------------------VSSINRIIRTKV-----------------
- ----------------------QQPFH---------------------------------
- ------------------------------------------------------------
- ---------------------------------------------PTPDG-TGTGVSAPG
- --HTIVPSTASPPVS----------------SAS-------------NDPVGS---YSIN
- GILGI------PRSNGE-------------KRKRDE-----------------------D
- V-SEGSV----------------------------PNGDSQSGVD---------------
- -------------------------S------LRKHLRADTF-TQQQL------------
- --------------EALDRV-FERPSYPDVFQASE-------HIKSEQ--GNE-YS-L--
- PTLTP-G-------LDEVKSGLSAS-TNPELG----------------------------
- -------------------------------------------------------SNVS-
- -GTQTYP--------------VVT------------------------------------
- ---------------------------------------------------GRDMASTT-
- --LPG--------------------------------------------------YPPHV
- PP----------------------------------------------------------
- --------------------------------------------------------TGQG
- S----YPTST-----L-----------AGMVP----------------------------
- ------------------VP---------------------------------RGCNGP-
- ---------------------SSS------------LMN-------------------NS
- DRKLAEV-----------------------------PFTLHR----GPSPA--PTPQEYW
- PPPVTPPTTRPGNSATPAL---------------
- >ENSPPYP00000002986
- ----------------------MDMHCKADP---FSAM----HP----------------
- ------------------------------------------------------------
- ------------------------------------------------------------
- ------------------------------------------------------------
- ------------------------------------------------------------
- ------------------------------------------------------------
- ------------------------------------------------------------
- ------------------------------------------------------------
- -GHGGVNQLGGVFVNGRPL--------------PDVVRQRIVELAHQ-GVRPCDISRQLR
- ------------------VSHGCVSKILG--------RYYETGSIKPGVI------GGSK
- PK-VATPKVVDKIAEYKRQNPTMFAWEI----------------------RDRLLAEGIC
- ------------------------------------------------------------
- ------------------------------------------------------------
- DNDTVPS------------------------VSSINRIIRTKV-----------------
- ----------------------QQPFH---------------------------------
- ------------------------------------------------------------
- ---------------------------------------------PTPDG-AGTGVTAPG
- --HTIVPSTASPPVS----------------SAS-------------NDPVGS---YSIN
- GILGI------PRSNGE-------------KRKRDEVEVYTDPAHIRGGGGLHLVWTLRD
- V-SEGSV----------------------------PNGDSQSGVD---------------
- -------------------------S------LRKHLRADTF-TQQQL------------
- --------------EALDRV-FERPSYPDVFQASE-------HIKSEQ--GNE-YS-L--
- PALTP-G-------LDEVKSSLSAS-TNPELG----------------------------
- -------------------------------------------------------SNVS-
- -GTQTYP--------------VVT------------------------------------
- ---------------------------------------------------GRDMASTT-
- --LPG--------------------------------------------------YPPHV
- PP----------------------------------------------------------
- --------------------------------------------------------TGQG
- S----YPTST-----L-----------AGMVP----------------------------
- ------------------EA---------------------------------AVGPSSS
- --------------------------------------L-------------------MS
- KPGRKLA-----------------------------EVPPCV----QPT-----------
- ----------------------------------