This is a limited proof of concept to search for research data, not a production system.


Title: pyfaidx: efficient pythonic random access to fasta subsequences

Type: Software

Shirley, Matthew (2014): pyfaidx: efficient pythonic random access to fasta subsequences. Zenodo. Software. https://zenodo.org/record/8548

Author: Shirley, Matthew (Johns Hopkins School of Medicine)

Summary

Samtools provides a function "faidx" (FAsta InDeX) that creates a small flat index file (".fai"). The index allows fast random access to any subsequence of the indexed FASTA file while loading only a minimal amount of the file into memory.
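To illustrate why such an index makes random access cheap, here is a minimal sketch using only the Python standard library. It is not the samtools or pyfaidx implementation. It assumes the commonly documented per-record ".fai" fields (sequence length, byte offset of the first base, bases per line, bytes per line) and assumes every line of a record except the last has the same width. The function names build_index and fetch, and the demo sequences, are hypothetical.

```python
import os
import tempfile


def build_index(fasta_path):
    """Scan a FASTA file once and return
    {name: (length, offset, linebases, linewidth)} for each record."""
    index = {}
    with open(fasta_path, "rb") as fh:
        name = None
        length = offset = linebases = linewidth = 0
        while True:
            line = fh.readline()
            if not line or line.startswith(b">"):
                # A new header or end of file closes the current record.
                if name is not None:
                    index[name] = (length, offset, linebases, linewidth)
                if not line:
                    break
                name = line[1:].split()[0].decode()
                length, offset = 0, fh.tell()  # first base follows the header
                linebases = linewidth = 0
            else:
                bases = len(line.rstrip(b"\r\n"))
                if linebases == 0:
                    # The first sequence line defines the line geometry.
                    linebases, linewidth = bases, len(line)
                length += bases
    return index


def fetch(fasta_path, index, name, start, end):
    """Return bases [start, end) (0-based, end-exclusive) of one record,
    reading only the bytes that cover the requested range."""
    length, offset, linebases, linewidth = index[name]
    start, end = max(start, 0), min(end, length)
    if start >= end:
        return ""

    def byte_pos(i):
        # Whole lines before base i, plus the position inside its line.
        return offset + (i // linebases) * linewidth + (i % linebases)

    with open(fasta_path, "rb") as fh:
        fh.seek(byte_pos(start))
        raw = fh.read(byte_pos(end - 1) + 1 - byte_pos(start))
    # Drop the line breaks that fall inside the requested range.
    return raw.replace(b"\n", b"").replace(b"\r", b"").decode()


# Small demonstration on a temporary file (made-up sequences).
demo = tempfile.NamedTemporaryFile(delete=False, suffix=".fa")
demo.write(b">chr1 demo\nACGTACGTAC\nGTACGTACGT\nACGT\n>chr2\nTTTTGGGG\n")
demo.close()

index = build_index(demo.name)
print(index["chr1"])                         # (24, 11, 10, 11)
print(fetch(demo.name, index, "chr1", 8, 12))  # ACGT (spans a line break)
os.remove(demo.name)
```

Because the byte position of any base follows from simple arithmetic on the index entry, a lookup needs one seek and one short read, regardless of how large the file is.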

Pyfaidx provides a "pythonic" interface for creating and using this index, giving fast random access to DNA subsequences in huge FASTA files. Indexing speed is comparable to samtools, and in some cases sequence retrieval is much faster.

https://github.com/mdshw5/pyfaidx

More information

  • DOI: 10.5281/zenodo.8548

Subjects

  • fasta, sequence retrieval

Dates

  • Publication date: 2014
  • Issued: March 24, 2014


Format

electronic resource

Related items

  • Relationship: IsPartOf
  • URI: https://zenodo.org/communities/zenodo