BIOL200 2013: Difference between revisions

From QiuLab
Jump to navigation Jump to search
imported>Cmartin
imported>Cmartin
Line 60: Line 60:
{| class="collapsible collapsed wikitable"
{| class="collapsible collapsed wikitable"
|- style="background-color:lightsteelblue;"
|- style="background-color:lightsteelblue;"
! Assignment #2
! PROCEDURE
|- style="background-color:powderblue;"
|- style="background-color:powderblue;"
| '''Before you begin...'''<br />
Do this ONLY ONCE: <pre>echo "source /data/yoda/b/student.accounts/bio425_2011/bio425.profile" >> ~/.bash_profile</pre>
Alternatively, you can open ~/.bash_profile in a text editor (ask me if don't know how) and paste the line: <pre>source /data/yoda/b/student.accounts/bio425_2011/bio425.profile</pre> at the end.
|-style="background-color:powderblue;"
| '''Beginning Perl'''<br />
For the homework, read up to page 221 in Appendix 1. For February 26, read all of Appendix 1.


There are '''two choices''' for the homework. The first is recommended for novices. The second is for those who are either comfortable with Perl, or feel the need for a challenge this early. Only complete ONE of these assignments, as I will only accept one. Please follow the guidelines listed [[#Programming Assignment Expectations|above]].
#Examine Table I, select representative species from Bergey’s Manual. Select 2 prokaryotic species from each group, giving 14 prokaryotic species total. Also select the Eukaryotic representative, Saccharomyces cerevisiae.
 
#Access the NCBI website: http://www.ncbi.nlm.nih.gov/
# Copy the code from page 221 in a new file. (Remember to put the code from the slides in the beginning of the file and to declare all variables on first use!) You must alter the code so that the resulting program accomplishes the following four tasks:
#Under the “Search” category, select “Nucleotide”
##Instead of taking the average of 10 numbers, ask the user how many numbers to average and use that number instead. (Hint: see how the code asks for each number). This must be stored in a new variable.
#Under the “for” category, type the accession number for your first organism, and hit the “Go” button. This takes you to the access for the 16S rRNA for your organism.
##If the number the user gave was 0 or negative, print a message telling the user so, and exit immediately. You can exit using <pre>exit;</pre>
#Download the 16S rRNA sequence for your first organisms by choosing “FASTA” under the “Display” category.
##The code always prints 'Enter another number:'. Change it so that on the '''first time only''' it instead prints 'Enter a number:'.
#Copy and paste the entire output into a Microsoft Word file.
##Just before printing the average, print a message saying 'The numbers to average are: '. Then print out out all the numbers the user entered.
#Edit the sequence id to match the format of “Genus_Species_Genbank#” (eg. > Escherichia_coli_174375).
#More advanced programmers can try this assignment (you may wish to read all of Appendix 1 now): create a script which can take as input one or more DNA sequences from a file and translate directly to the correct amino acid sequence (single-letter format). You may implement this program in Perl however you wish, with as much complexity as you wish, as long as it meets the guidelines above and satisfies the following four criteria:
#Repeat process for all of your organisms, pasting the sequences into the same Microsoft Word file. (note: be sure to place a blank line between each sequence entry)
##The format of the input file it reads must be: one DNA sequence per line, so that each DNA sequence is separated by a new line character. '''Also assume you are given the coding strand.'''
#Access the EMBL CLUSTALW alignment website: http://www.ebi.ac.uk/Tools/clustalw/, and copy and paste your entire Microsoft Word file into the area which asks you to “Enter or paste a set of sequences in any supported format”. Click “Run”. This program will make an alignment of all of your sequences.
##The name of the input file cannot be hard coded. You may either ask the user for the file location/name or take it as a command line argument.
#Click “Show as Phylogram Tree” to create a tree showing the relatedness of your organisms based on their 16S rRNA sequences.
##It must tolerate all upper-case, lower-case or mixed-case sequences in the input
#To print your phylogram tree:
##For every input DNA sequence, output the DNA sequence, the equivalent RNA, and the peptide sequence. The output '''must''' be informative, ie:
##:hit the “Print Screen” button on your keyboard
##:Input: atgcgtcgataa
##:open the Paint program from your “accessories” menu on your computer
##:Output: augcgucgauaa
##:hit paste to paste your screen
##:Peptide: MRR*
##:“select” your phylogram tree
#:Additionally, the program cannot use any outside dependencies/modules such as BioPerl (supposing you know how to use it.) Also note that STOP codons are denoted by a '<nowiki>*</nowiki>'
##:copy and paste it into a new paint file
##:print your tree and email it to yourself
|-style="background-color:powderblue;"
|-style="background-color:powderblue;"
| '''Problems'''<br />
| '''Problems'''<br />

Revision as of 16:18, 4 March 2013

EXPERIMENT # 4

BIOL 200 Cell Biology II LAB, Spring 2013

Hunter College of the City University of New York

Course information

Instructors: TBD

Class Hours: Room TBD HN; TBD

Office Hours: Room 830 HN; Thursdays 2-4pm or by appointment

Contact information:

  • Dr. Weigang Qiu: weigang@genectr.hunter.cuny.edu, 1-212-772-5296


Experiment #4

The Tree of Life and Molecular Identification of Microorganisms

Objective

To classify microorganisms and determine their relatedness using molecular sequences.

LAB REPORT GRADING GUIDE

CELL BIO II Experiment #4:

  • Introduction 1 point :
 Statement of objectives or aims of the experiment in the student’s own words.
 (not to be copied from the Lab Manual)
  • MATERIALS AND METHODS 0 points :
 This should be a brief synopsis and must include any changes or deviations 
 from the procedures outlined in the Lab Manual. Specify which organisms were 
 used to create the phylogram.
  • RESULTS 4 points :
 A print out of the phylogram will suffice.
  • DISCUSSION 4 points :
 Responses to discussion questions.
  • SUMMARY |CONCLUSION 1 point :
 Two sentence summary of your findings.
  • REFERENCES 1 point :
 Credit is given for pertinent references obtained from sources other than the Lab Manual.
 This point is in addition to the 10 for the lab report..

INTRODUCTION

MATERIALS

  • Required hardware: Computer
  1. Examine Table I, select representative species from Bergey’s Manual. Select 2 prokaryotic species from each group, giving 14 prokaryotic species total. Also select the Eukaryotic representative, Saccharomyces cerevisiae.
  2. Access the NCBI website: http://www.ncbi.nlm.nih.gov/
  3. Under the “Search” category, select “Nucleotide”
  4. Under the “for” category, type the accession number for your first organism, and hit the “Go” button. This takes you to the access for the 16S rRNA for your organism.
  5. Download the 16S rRNA sequence for your first organisms by choosing “FASTA” under the “Display” category.
  6. Copy and paste the entire output into a Microsoft Word file.
  7. Edit the sequence id to match the format of “Genus_Species_Genbank#” (eg. > Escherichia_coli_174375).
  8. Repeat process for all of your organisms, pasting the sequences into the same Microsoft Word file. (note: be sure to place a blank line between each sequence entry)
  9. Access the EMBL CLUSTALW alignment website: http://www.ebi.ac.uk/Tools/clustalw/, and copy and paste your entire Microsoft Word file into the area which asks you to “Enter or paste a set of sequences in any supported format”. Click “Run”. This program will make an alignment of all of your sequences.
  10. Click “Show as Phylogram Tree” to create a tree showing the relatedness of your organisms based on their 16S rRNA sequences.
  11. To print your phylogram tree:
    1. hit the “Print Screen” button on your keyboard
      open the Paint program from your “accessories” menu on your computer
      hit paste to paste your screen
      “select” your phylogram tree
      copy and paste it into a new paint file
      print your tree and email it to yourself

February 12

NO CLASS

(Read Chapter 6 for next class)

February 19

Yozen will not be lecturing

  • Chapter 6. Gene and Genome Structures [Lecture Slides Lecture Slides Ch.6-Che
  • Tutorial: ORF Prediction using GLIMMER
  • Homework: This homework will be graded.

February 26

March 5

March 12

March 19

  • REVIEW Session for MID-TERM EXAMS

March 26

  • MID-TERM

April 2

April 9

April 16

  • Topic: Relational Database and SQL
  • Tutorial: the Borrelia Genome Database
  • Homework: SQL-embedded PERL

April 23

NO CLASSES (Spring recess)

April 30

May 7

  • Chapter 6 (Gene Expression) & Chapter 8 (Proteomics)
  • Tutorial: Array Data Visualization and Analysis ( Micro-Array Analysis Slides)
  • Homework:Data Analysis using R

May 14

  • Chapter 7. Protein Structure Prediction

May 21

  • Final Project Due (TBA)

Useful Links

Unix Tutorials

Perl Help

  • Professor Stewart Weiss has taught CSCI132, a UNIX and Perl class. His slides go into much greater detail and are an invaluable resource. They can be found on his course page here.
  • Perl documentation at perldoc.perl.org. Besides that, running the perldoc command before either a function (with the -f option ie, perldoc -f substr) or a perl module (ie, perldoc Bio::Seq) can get you similar results without having to leave the terminal.

Bioperl

SQL

R Project

  • Install location and instructions for Windows
  • Install location and instructions for Mac OS X
  • For users of Ubuntu/Debian:
sudo apt-get install r-base-core
  • For users of Fedora/Red Hat:
su -
yum install R

Utilities

Other Resources


© Weigang Qiu, Hunter College, Last Update Jan 2013