CPBS 7711 readings

Introduction

Molecular Biology Databases

Database Search and Sequence Alignment

PyCogent
  • Please read Knight 2007
  • Please additionally review the PyCogent project page and in particular look at the Application Controller Framework documentation.
  • I also encourage you to download and install the PyCogent toolkit from the developer site (so you have access to the latest code) before the lecture, and begin experimenting.
  • Use case 1: tol_example.py This illustrates how to apply pycogent to evaluate the idea that life on Earth clusters into three related domains, which are detectable by distances between their rRNA sequences. Using sequences collections derived from the Silva database (filtered with cd-hit-est so the max pairwise identity between any two sequences is 97%), I randomly select sequences, build a tree, and then visualize the tree. Note that you'll need muscle, fasttree, and matplotlib installed to run this example.
  • Use case 2: applying_an_existing_appc.txt This illustrates three different ways to apply the RDP classifier application controller: via the RDPController object, via the assign_taxonomy convenience function, and via the command line interface to the python module.
  • Use case 3: defining_a_new_appc.txt, minimal_formatdb.py This discusses how to define a new application controller class to wrap the formatdb program packages with NCBI's blast package. (I had Blast-2.2.20 install when we ran through this in class.) As I mentioned, I frequently use formatdb to create temporary blast databases which I to clean up after using. So, I designed this new app controller for class and will be added an extended version, including some convenience functions, to PyCogent soon. The code I wrote in class is attached as minimal_formatdb.py

Multiple Sequence Alignment

Research plans and reviews of same

Hidden Markov Models

Reviewing Research Proposals

Sequence Assembly

Computational Phylogeny

Protein Protein Interactions and networks

Reporting on your research

Protein Structure Prediction

Reviewing Research Manuscripts

Mechanics, Dynamics & Docking

Genetic Analysis

Presenting research

Biomedical Language Processing

   
         
Course home page | Computational Bioscience Program home page | Professor Hunter's home page