Premium
FASTC : a file format for multi‐character sequence data
Author(s) -
Wheeler Ward C.,
Washburn Alexander J.
Publication year - 2019
Publication title -
cladistics
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 2.323
H-Index - 92
eISSN - 1096-0031
pISSN - 0748-3007
DOI - 10.1111/cla.12370
Subject(s) - file format , sequence (biology) , character (mathematics) , synteny , computer science , variety (cybernetics) , alphabet , biology , programming language , artificial intelligence , gene , linguistics , genetics , mathematics , philosophy , geometry , genome
Here, we define a sequence file format that allows for multi‐character elements ( FASTC ). The format is derived from the FASTA format and the custom alphabet format of POY 4/5. The format is more general than either of these formats and can represent a broad variety of sequence‐type data. This format should be useful for analyses involving datasets encoded as linear streams such as gene synteny, comparative linguistics, temporal gene expression and development, complex animal behaviours, and general biological time‐series data.