BINF G4006 Translational Bioinformatics
Course Description: Methods in translational bioinformatics (i.e. biomedical data sciences) for graduate students as well as juniors and seniors. Students study the statistical and computational algorithms to evaluate large biomedical data, with a special focus on the integration of molecular and clinical data for the advancement of medicine. Methods may include including sequence analysis, supervised and unsupervised machine learning, graph theoretic models and network analysis, information theory, deep neural networks, density estimation, and others. Students will study how to practically apply these methods to biomedical domains in non-human and human genetics, pharmacology, and public health. Successful completion of the course readies the student for graduate level research in translational bioinformatics.
Nicholas Tatonetti, PhD
Classes are held Mondays and Wednesdays from 2:40 pm - 3:55 pm in MUDD 644
This course will be a hybrid of didactic lectures, flipped-classrooms, and peer-instruction. Learning materials will center around a practical and hands on approach to biomedical informatics applications in the translational sciences (e.g. population health, pharmaceutical sciences, molecular disease etiology, and human genetics).
The first third of the course will be lectures on data science application to molecular and cellular biology, drug discovery and safety, protein function and pathway analysis, and human genetic variation. The remaining two-thirds of course will be dedicated to a deep dive on an advanced class of computational/statistical methods applied to a particular biological domain. At this point students will take control of the direction of the course by choosing the methods that will be studied, designing lectures, and delivering teaching materials to their peers.
Students will be graded on their engagement with the course materials, their participation in the peer-instruction, their performance on assigned tasks, and an end-of-term research study.
The textbooks for this course are both freely available.
• PLOS Computational Biology: Translational Bioinformatics edited by Maricel Kann, Guest Editor, and Fran Lewitter.
• The Elements of Statistical Learning: Data Mining, Inference, and Prediction (Second Edition) by Trevor Hastie, Robert Tibshirani and Jerome Friedman (2009) http://web.stanford.edu/~hastie/pub.htm
|1||Sept. 9||Introduction to Translational Bioinformatics||||HW1(A)||Tatonetti|
|2||Sept. 11||Data-driven Disease Biology||[2,3]||HW1(D); HW2(A)||Tatonetti|
|3||Sept. 16||Data-driven Disease Biology||[4,5]||HW2(D)||Tatonetti|
|4||Sept. 18||Lab: Data-driven Disease Biology||[5,6]||HW3(A)||TA|
|5||Sept. 23||Application Spotlight: Drug Safety and Drug-Drug Interactions||[22,23,24,25]||Tatonetti|
|6||Sept. 25||Unsupervised machine learning methods||[7,8]||Tatonetti|
|7||Sept. 30||Lab: Drug safety and drug-drug interactions||||HW3(D)|
|8||Oct. 2||Student Lecture 1 and Discussion; How to write an abstract||Student; Tatonetti|
|9||Oct. 7||Student Lecture 2 and Discussion||HW3.R(D); HW4(A)||Student|
|10||Oct. 9||Student Lecture 3 and Discussion||Student|
|11||Oct. 14||Student Lecture 4 and Discussion||Student|
|12||Oct. 16||Student Lecture 5 and Discussion||Student|
|13||Oct. 21||Systems Pharmacology I||[9,10]||Tatonetti|
|14||Oct. 23||Systems Pharmacology II||[11-15]||Tatonetti|
|15||Oct. 28||How to prepare a scientific presentation||Tatonetti|
|16||Oct. 30||Chemical Informatics I||[16,17]||Tatonetti|
|Nov. 4||Election Day|
|17||Nov. 6||Student Lecture 7 and Discussion||Student|
|18||Nov. 11||Chemical Informatics II||[18,19]||Tatonetti|
|19||Nov. 13||Network Biology and Graph Theory||[20,21]||TA|
|Nov. 18||Cancel due to AMIA|
|20||Nov. 20||Student Lecture 8 and Discussion||HW5(D)||Student|
|Nov. 25||Thanksgiving Week|
|Nov. 27||Thanksgiving Week|
|21||Dec. 2||Application Spotlight: Deep neural networks in medicine||TBA||Sassan Ostvar|
|22||Dec. 7||Final Presentations I|
|23||Dec. 9||Final Presentations II|
Other Possible Lecture Topics
These are additional lectures that can be slotted in depending on the number of days that the student lectures consume.
- Biomedical Blockchains: The Don’ts and Don’ts
- Zero knowledge proofs and why their cool
- Differential privacy and its application to healthcare
- The genetic architecture of disease, heritability and recurrence
- Application Spotlight: Celiac disease, its etiology, and estimation of risk (Guest Lecturer: Ben Lebwohl)
All homework must be completed independently. Homework will be graded on uniqueness and creativity in content. “Google search results” will receive low grades.
HW1. Review syllabus of Computational Methods course
HW2. Comprehensive outline of a chosen methods topic (2-5 pages)
HW3. Build 35min lecture with references on assigned topic
HW3.R. Revise lecture according to instructor feedback
HW4. Abstract proposal for short research project (400-600 words; 1 figure; 1 table)
HW5. 10 minute presentation of research findings (10 minutes; 10 slides)
All assigned readings should be completed before the lecture, unless otherwise noted.
 “Introduction to Translational Bioinformatics Collection” in PLOS Computational Biology: Translational Bioinformatics (http://tatonettilab.org/courses/BINFG4006/materials/TranslationalBioinformatics.pdf)
 Chapter 4 “Protein interactions and disease” of PLOS Computational Biology: Translational Bioinformatics
 Chapter 9 “Analysis using disease ontologies” of PLOS Computational Biology: Translational Bioinformatics
 Chapter 2 “Data-Driven View of Disease Biology” of PLOS Computational Biology: Translational Bioinformatics
 Chapter 14 sections 14.1 and 14.3. of Elements of Statistical Learning
 Chapter 3 “Small molecules and disease” of PLOS Computational Biology: Translational Bioinformatics
 Chapter 6 “Structural variation and medical genomics” of PLOS Computational Biology: Translational Bioinformatics
 Chapter 7 “Pharmacogenomics” of PLOS Computational Biology: Translational Bioinformatics
 Chapter 5 “Network biology approach to complex disease” of PLOS Computational Biology: Translational Bioinformatics
Midterm and finals:
This class is a flipped peer-instructed and project-based class. Each student needs to come up with a research question and then find appropriate datasets, use statistical methods to answer the question. The midterm assignment (aka HW4) of this class will be a 1-page research proposal that described the research methods and materials you’ll be using for your project (template will be uploaded on coursework). Then each student will present the proposal on the class.
For the finals, each student will work on their proposed research plan and present the results in a 10 minute scientific presentation.
• 15% class participation
• 40% homework assignments 1-4
• 25% midterm proposal (i.e. HW5)
• 20% final project presentation
The course will employ a “no excuse necessary” absentee policy. After the first two absences, your grade will be reduced by one third of a letter grade for each additional absence. For example, if you earned an A in the class and missed three of the classes in total, your final grade would be an A-. If you earned an A and missed four classes total, then your grade would be a B+, and so on. There is no need to notify the TA or Instructor when you will be absent. Attendance will be taken at the start of each class.
Academic Integrity Statement
(taken from http://gsas.columbia.edu/content/sample-statement-academic-integrity)
Columbia’s intellectual community relies on academic integrity and responsibility as the cornerstone of its work. Graduate students are expected to exhibit the highest level of personal and academic honesty as they engage in scholarly discourse and research. In practical terms, you must be responsible for the full and accurate attribution of the ideas of others in all of your research papers and projects; you must be honest when taking your examinations; you must always submit your own work and not that of another student, scholar, or internet source. Graduate students are responsible for knowing and correctly utilizing referencing and bibliographical guidelines. When in doubt, consult your professor. Citation and plagiarism- prevention resources can be found at the GSAS page on Academic Integrity and Responsible Conduct of Research (http://gsas.columbia.edu/academic-integrity).
Failure to observe these rules of conduct will have serious academic consequences, up to and including dismissal from the university. If a faculty member suspects a breach of academic honesty, appropriate investigative and disciplinary action will be taken following Dean’s Discipline procedures (http://gsas.columbia.edu/content/disciplinary-procedures).