Python: module TAMO.seq.Human

TAMO.seq.Human

index
/home/David_Gordon/docs/TAMO/seq/Human.py

Fast access to human sequence data in fasta format. The human genome doesn't fit in the memory of many computers in text format. This interface provides a quick way to extract arbitrary pieces of sequence from disk. Usage: from TAMO.Seq.Human import get_seq txt = get_seq('chr1',5678910,5778910) # -or - txt = get_seq(1,5678910,5778910) txt = get_seq('chrX',5678910,5778910) # -or - txt = get_seq('X',5678910,5778910) #-or- txt = get_seq('chrX:5679010-567890') Copyright (2005) Whitehead Institute for Biomedical Research (except as noted below) All Rights Reserved Author: David Benjamin Gordon

Modules

TAMO

Classes



ChromoFasta

class ChromoFasta

    Rapid access into large Human Chromosome Files.  Index positions should be taken directly from BLAST/BLAT output (no +/-1, or pythonish adding 1 to get the last letter.

Methods defined here:

__del__(self)

__init__(self, chromosome)

compute_offset(self, idx)
        compute_offset(idx)         Example:  If pos = 246, and linelen = 50,         then linenum = int(246 / 50)  = 4         endpos       = 246 - ( 4* 50) = 46         but each line actually contains a " " at the end,         and the count starts at self.offset, so the seek         position is:                  self.offset + linenum * (self.linelen + 1) + endpos - 1  #Counting from 1

get_range(self, start, end)

sniff_info(self)

Functions

get_seq(chr, start=None, end=None)
txt = get_seq('chr1',5678910,5778910) # -or - txt = get_seq(1,5678910,5778910) txt = get_seq('chrX',5678910,5778910) # -or - txt = get_seq('X',5678910,5778910) #-or- txt = get_seq('chrX:5679010-567890')

Data

CHROMOROOT = '/etc/TAMOdata/HumanSeq/'
ChrD = {}

Functions
		get_seq(chr, start=None, end=None) `txt = get_seq('chr1',5678910,5778910) # -or - txt = get_seq(1,5678910,5778910) txt = get_seq('chrX',5678910,5778910) # -or - txt = get_seq('X',5678910,5778910) #-or- txt = get_seq('chrX:5679010-567890')`

Data
		CHROMOROOT = '/etc/TAMOdata/HumanSeq/' ChrD = {}