Statistical analysis of the DNA sequence of human chromosome 22
Author(s) -
Dirk Holste,
Ivo Große,
Hanspeter Herzel
Publication year - 2001
Publication title -
physical review. e, statistical physics, plasmas, fluids, and related interdisciplinary topics
Language(s) - English
Resource type - Journals
eISSN - 1095-3787
pISSN - 1063-651X
DOI - 10.1103/physreve.64.041917
Subject(s) - sublinear function , tandem repeat , sequence (biology) , dna , chromosome , oligonucleotide , physics , y chromosome , biology , genetics , combinatorics , mathematics , gene , genome
We study statistical patterns in the DNA sequence of human chromosome 22, the first completely sequenced human chromosome. We find that (i). the 33.4 x 10(6) nucleotide long human chromosome exhibits long-range power-law correlations over more than four orders of magnitude, (ii). the entropies H(n) of the frequency distribution of oligonucleotides of length n (n-mers) grow sublinearly with increasing n, indicating the presence of higher-order correlations for all of the studied lengths 1
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom