Supporting Information Identification of Amino Acids with Sensitive Nanoporous MoS 2 : Towards Machine Learning-Based Prediction

Size: px
Start display at page:

Download "Supporting Information Identification of Amino Acids with Sensitive Nanoporous MoS 2 : Towards Machine Learning-Based Prediction"

Transcription

1 Supporting Information Identification of Amino Acids with Sensitive Nanoporous MoS 2 : Towards Machine Learning-Based Prediction Amir Barati Farimani, Mohammad Heiranian, Narayana R. Aluru 1 Department of Mechanical Science and Engineering Beckman Institute for Advanced Science and Technology University of Illinois at Urbana-Champaign, Urbana, Illinois Department of Chemistry, Stanford University, Stanford, California Table of contents 1. Translocation, forces and simulation sets 2. Conformations and blockade in the pore 3. Anion binding to thiol groups 4. Effect of mass of amino acids 5. Machine learning models 1 Corresponding Author, aluru@illinois.edu, web: 1

2 1. Translocation, forces and simulation sets The translocation of Alanine, Aspartic acid and Tryptophan are plotted as a function of time for different pulling forces in Figure S.1. As shown in the figure, for the small amino acid case of Alanine, the residues begin to pass through the pore under a force of pn which is also strong enough to push the larger amino acids of Tryptophan and Aspartic acid. a b Alanine translocation (# of residues) pn Aspartic Acid translocation (# of residues) pn Translocation time (ns) Translocation time (ns) c 10 Tryptophan translocation (# residues) pn Translocation time (ns) Figure S.1. Translocation of a Alanine, b Aspartic acid and c Tryptophan chains as a function of time for different values of applied forces. 2

3 4,293 simulations have been performed to obtain the residence times, ionic currents, I-V curve and study the pore sensitivity of the MoS 2 nanopore. The detailed information about all the simulations is tabulated below. Table 1: Details of all the simulations performed. Simulation # of Purpose Comment set simulations Residence times and ionic Force per residue of pn currents Ionic currents Different forces per residue in the range of pn pn 3 80 I-V curve No amino acids Pore sensitivity Force per residue of pn None Discarded (no translocation events) 6 10 Studying real-life amino acids Force per residue of pn 1-6 (total)

4 2. Conformations and blockade in the pore Tryptophan amino acid has a large volume that occupies most of the pore (d=1.85 nm). The most dominant conformations of the amino acid in the pore are shown in Figure S.2.1. The amino acid blocks the pore and consequently the ionic current during its translocation is small. This leads to the lowest current and the longest residence time among all the amino acids. In Figure S.2.2, the dominant conformations of Alanine are shown. This small molecule sticks to the edge of the pore during its translation leaving most of the pore volume accessible to ions to pass through. Figure S.2.1. Dominant conformations and occupancy of Tryptophan in the pore with d=1.85 nm and an applied force of pn. The yellow and blue colors represent S and Mo atoms in MoS 2 structure and blue, red and cyan represent N, O, and C atoms in amino acids. The three last snapshots of amino acids are shown in vdw radii to demonstrate the volume of pore blockade. Hydrogens are not shown. 4

5 Figure S.2.2. Dominant conformations and occupancy of Alanine in the pore with d=1.85 nm and an applied force of pn. The yellow and blue colors represent S and Mo atoms in the MoS 2 structure and blue, red and cyan represent N, O, and C atoms in amino acids. The three last snapshots of amino acids are shown in vdw radii to demonstrate the volume of pore blockade. 5

6 3. Anion binding to thiol groups As mentioned in the manuscript, Methionine translocation results in a negative ionic current under a positive electric field. This is because anions are bound to the thiol groups of the amino acids (Figure S.3) and are dragged down with the chain resulting in a net negative ionic current. a b Cl-Thiol H distance (Å) binding time Ion translocation through the pore time (ps) Figure S.3. a Different snapshots of Methionine translocation through the pore while anions (orange spheres), which are bound to the amino acids, are dragged down across the membrane. b Distance between a tagged Cl - and the hydrogen of the thiol group as a function of time. 6

7 4. Effect of mass of amino acids A 3D scattered plot of the mass of each amino acid as a function of its residence time and ionic current are shown in Figure S.4.1. As the mass increases, the current decreases since the amino acids with higher masses have larger volumes blocking the nanopore. Also, the residence time significantly increases with increasing mass and follows a power-law relation as shown in Figure S.4.2. Figure S D plot of mass (Da) of each amino acid versus its average residence time (ps) and ionic current (pa). Mass-current data is shown in red color, mass-residence time data is shown in green color and current-residence time is shown in blue color. 7

8 Residence time (ps) Model Allometric1 Equation y = a*x^b Reduced Chi-Sqr Adj. R-Square Value Standard Error C a E E-8 b T R 4.66(10 ) M Mass (daltons) Figure S.4.2. The relationship between the residence time (T R ) and mass (M ) of amino acids. The circles represent the data from the simulations. The curve is the fitted power-law relation between T R and M. 5. Machine learning models There are different methods to identify the class of an unseen data point based on known data. Here, we use k-nearest neighbor (KNN), logistic regression and random forest algorithms 2. In KNN, to classify a given point, a, the k nearest data points (neighbors) of a given point are first identified. In the nanopore data classification problem, each point of our data is a pair of ionic current and residence time (a (ionic current, residence time)). Then the class of a is predicted by the majority of voting among the nearest neighbors. There are different metrics to calculate the distance between two points, a and b. Here, we use the Euclidian distance d which is given by d( a, b) a b n 2 i i S.5.1 i 1 where i denotes the type of the feature (here ionic current and residence time) and n is the total number of features (here, 2). In KNN, k is identified as the number of closest (neighboring) data points to the data point of interest. Appropriate choice of k is needed for high accuracy prediction. Small values of k (smallest limit, k=1) are sensitive to local structure and noise in the data. Ideally, larger values of k lead to better classification and finer boundaries between class regions. However, because of the finite number of data points, the accuracy recedes as the value of k approaches the number of sample data points. To find the 8

9 optimal value of k, the accuracy of prediction is calculated for different values of k in Figure S.5.1. k=3, 7 and 8 result in the highest accuracy. k=3 is chosen since it is computationally less expensive. Figure S.5.1. The accuracy of the prediction based on KNN as a function of k. In logistic regression classification, any point x in the multi-feature vector space is mapped into a scalar value (p) between 0 and 1 using the logistic function as follows p 1 1 ( a ) e bx S.5.2 where a and b are the optimized parameters based on the traning data points. The training data for the nanopore sequencing in this paper is (ionic current, residence time, amino acid label). The training data is available as the excel supplementary file (Aminoacid_IR.xlsx). The class of any point in space is defined by the value of p and the threshold in p indicate boundaries between class regions. In the random forest method, a collection of randomized trees (estimators) are constructed based on the input data. Each tree can be thought of as a regression fit to the given data. Finally, the forest selects the classification based on the majority of voting from the decisions of all the trees. To find the optimal number of estimators (N), the accuracy of prediction is investigated for different values of N in Figure S.5.2. N=9, which is used in our calculations, leads to the most accurate prediction. 9

10 Figure S.5.2. The accuracy of the prediction based on Random Forest as a function of the number of estimators. We tried to classify the data without any prior training to see how good the data can be classified into clusters and what the accuracy of classification is compared to the labeled data. In the classification of the raw data, all the data points are unlabeled (only ionic current and residence time are known, and we do not know their respective amino acid labels) and the only given information is the number of clusters. All the clusters are predicted using k-mean clustering method with an accuracy of 72.6% when the Tryptophan cluster is excluded (1900 data points clustered into 19 clusters) as shown in Figure S.5.3. With the Tryptophan data included (2000 data points and with 20 clusters), the accuracy is 54%. This accuracy is defined by the number of correctly clustered data points compared to the total number of data points. A correctly identified cluster means that the cluster membership of each data point is the same as the labeled residence time and ionic current. 10

11 Figure S.5.3. The unlabeled data points (blue) are clustered using k-mean clustering. The mean values of the clusters based on the learning and the actual data are presented in black stars and red spheres, respectively. 11

SDS-Assisted Protein Transport Through Solid-State Nanopores

SDS-Assisted Protein Transport Through Solid-State Nanopores Supplementary Information for: SDS-Assisted Protein Transport Through Solid-State Nanopores Laura Restrepo-Pérez 1, Shalini John 2, Aleksei Aksimentiev 2 *, Chirlmin Joo 1 *, Cees Dekker 1 * 1 Department

More information

SUPPLEMENTARY INFORMATION. Computational Assay of H7N9 Influenza Neuraminidase Reveals R292K Mutation Reduces Drug Binding Affinity

SUPPLEMENTARY INFORMATION. Computational Assay of H7N9 Influenza Neuraminidase Reveals R292K Mutation Reduces Drug Binding Affinity SUPPLEMENTARY INFORMATION Computational Assay of H7N9 Influenza Neuraminidase Reveals R292K Mutation Reduces Drug Binding Affinity Christopher Woods 1, Maturos Malaisree 1, Ben Long 2, Simon McIntosh-Smith

More information

Transient β-hairpin Formation in α-synuclein Monomer Revealed by Coarse-grained Molecular Dynamics Simulation

Transient β-hairpin Formation in α-synuclein Monomer Revealed by Coarse-grained Molecular Dynamics Simulation Transient β-hairpin Formation in α-synuclein Monomer Revealed by Coarse-grained Molecular Dynamics Simulation Hang Yu, 1, 2, a) Wei Han, 1, 3, b) Wen Ma, 1, 2 1, 2, 3, c) and Klaus Schulten 1) Beckman

More information

Supplementary Figure 1 (previous page). EM analysis of full-length GCGR. (a) Exemplary tilt pair images of the GCGR mab23 complex acquired for Random

Supplementary Figure 1 (previous page). EM analysis of full-length GCGR. (a) Exemplary tilt pair images of the GCGR mab23 complex acquired for Random S1 Supplementary Figure 1 (previous page). EM analysis of full-length GCGR. (a) Exemplary tilt pair images of the GCGR mab23 complex acquired for Random Conical Tilt (RCT) reconstruction (left: -50,right:

More information

1.4 - Linear Regression and MS Excel

1.4 - Linear Regression and MS Excel 1.4 - Linear Regression and MS Excel Regression is an analytic technique for determining the relationship between a dependent variable and an independent variable. When the two variables have a linear

More information

Interactions of Polyethylenimines with Zwitterionic and. Anionic Lipid Membranes

Interactions of Polyethylenimines with Zwitterionic and. Anionic Lipid Membranes Interactions of Polyethylenimines with Zwitterionic and Anionic Lipid Membranes Urszula Kwolek, Dorota Jamróz, Małgorzata Janiczek, Maria Nowakowska, Paweł Wydro, Mariusz Kepczynski Faculty of Chemistry,

More information

Detergent solubilised 5 TMD binds pregnanolone at the Q245 neurosteroid potentiation site.

Detergent solubilised 5 TMD binds pregnanolone at the Q245 neurosteroid potentiation site. Supplementary Figure 1 Detergent solubilised 5 TMD binds pregnanolone at the Q245 neurosteroid potentiation site. (a) Gel filtration profiles of purified 5 TMD samples at 100 nm, heated beforehand for

More information

Study on Different types of Structure based Properties in Human Membrane Proteins

Study on Different types of Structure based Properties in Human Membrane Proteins 2017 IJSRST Volume 3 Issue 7 Print ISSN: 2395-6011 Online ISSN: 2395-602X Themed Section: Science and Technology Study on Different types of Structure based Properties in Human Membrane Proteins S. C.

More information

Supplementary Materials for

Supplementary Materials for advances.sciencemag.org/cgi/content/full/4/3/eaaq0762/dc1 Supplementary Materials for Structures of monomeric and oligomeric forms of the Toxoplasma gondii perforin-like protein 1 Tao Ni, Sophie I. Williams,

More information

Table S1: Kinetic parameters of drug and substrate binding to wild type and HIV-1 protease variants. Data adapted from Ref. 6 in main text.

Table S1: Kinetic parameters of drug and substrate binding to wild type and HIV-1 protease variants. Data adapted from Ref. 6 in main text. Dynamical Network of HIV-1 Protease Mutants Reveals the Mechanism of Drug Resistance and Unhindered Activity Rajeswari Appadurai and Sanjib Senapati* BJM School of Biosciences and Department of Biotechnology,

More information

Supervised Learner for the Prediction of Hi-C Interaction Counts and Determination of Influential Features. Tyler Yue Lab

Supervised Learner for the Prediction of Hi-C Interaction Counts and Determination of Influential Features. Tyler Yue Lab Supervised Learner for the Prediction of Hi-C Interaction Counts and Determination of Influential Features Tyler Derr @ Yue Lab tsd5037@psu.edu Background Hi-C is a chromosome conformation capture (3C)

More information

Supplementary Figures

Supplementary Figures Supplementary Figures Supplementary Figure 1: Predicted structure of the most stable {110} antiphase boundary defect in magnetite (model APB-I). a) The same structure as that shown in Fig. 1b (main text)

More information

Supplementary Figures

Supplementary Figures Supplementary Figures Supplementary Figure 1 Minicircle topoisomer generation. a, Generation of supercoiled topoisomers. Minicircles were nicked with the sequence-specific nicking endonuclease, Nb.BbvCI.

More information

NeuroMem. RBF Decision Space Mapping

NeuroMem. RBF Decision Space Mapping NeuroMem RBF Decision Space Mapping Version 4.2 Revised 01/04/2017 This manual is copyrighted and published by General Vision Inc. All rights reserved. Contents 1 Introduction to a decision space... 3

More information

List of Figures. List of Tables

List of Figures. List of Tables Supporting Information for: Signaling Domain of Sonic Hedgehog as Cannibalistic Calcium-Regulated Zinc-Peptidase Rocio Rebollido-Rios 1, Shyam Bandari 3, Christoph Wilms 1, Stanislav Jakuschev 1, Andrea

More information

SUPPLEMENTARY INFORMATION. Table 1 Patient characteristics Preoperative. language testing

SUPPLEMENTARY INFORMATION. Table 1 Patient characteristics Preoperative. language testing Categorical Speech Representation in the Human Superior Temporal Gyrus Edward F. Chang, Jochem W. Rieger, Keith D. Johnson, Mitchel S. Berger, Nicholas M. Barbaro, Robert T. Knight SUPPLEMENTARY INFORMATION

More information

Introduction to Machine Learning. Katherine Heller Deep Learning Summer School 2018

Introduction to Machine Learning. Katherine Heller Deep Learning Summer School 2018 Introduction to Machine Learning Katherine Heller Deep Learning Summer School 2018 Outline Kinds of machine learning Linear regression Regularization Bayesian methods Logistic Regression Why we do this

More information

CRITERIA FOR USE. A GRAPHICAL EXPLANATION OF BI-VARIATE (2 VARIABLE) REGRESSION ANALYSISSys

CRITERIA FOR USE. A GRAPHICAL EXPLANATION OF BI-VARIATE (2 VARIABLE) REGRESSION ANALYSISSys Multiple Regression Analysis 1 CRITERIA FOR USE Multiple regression analysis is used to test the effects of n independent (predictor) variables on a single dependent (criterion) variable. Regression tests

More information

Supplementary Information A Hydrophobic Barrier Deep Within the Inner Pore of the TWIK-1 K2P Potassium Channel Aryal et al.

Supplementary Information A Hydrophobic Barrier Deep Within the Inner Pore of the TWIK-1 K2P Potassium Channel Aryal et al. Supplementary Information A Hydrophobic Barrier Deep Within the Inner Pore of the TWIK-1 K2P Potassium Channel Aryal et al. Supplementary Figure 1 TWIK-1 stability during MD simulations in a phospholipid

More information

Arginine side chain interactions and the role of arginine as a mobile charge carrier in voltage sensitive ion channels. Supplementary Information

Arginine side chain interactions and the role of arginine as a mobile charge carrier in voltage sensitive ion channels. Supplementary Information Arginine side chain interactions and the role of arginine as a mobile charge carrier in voltage sensitive ion channels Craig T. Armstrong, Philip E. Mason, J. L. Ross Anderson and Christopher E. Dempsey

More information

Statistics 202: Data Mining. c Jonathan Taylor. Final review Based in part on slides from textbook, slides of Susan Holmes.

Statistics 202: Data Mining. c Jonathan Taylor. Final review Based in part on slides from textbook, slides of Susan Holmes. Final review Based in part on slides from textbook, slides of Susan Holmes December 5, 2012 1 / 1 Final review Overview Before Midterm General goals of data mining. Datatypes. Preprocessing & dimension

More information

From Biostatistics Using JMP: A Practical Guide. Full book available for purchase here. Chapter 1: Introduction... 1

From Biostatistics Using JMP: A Practical Guide. Full book available for purchase here. Chapter 1: Introduction... 1 From Biostatistics Using JMP: A Practical Guide. Full book available for purchase here. Contents Dedication... iii Acknowledgments... xi About This Book... xiii About the Author... xvii Chapter 1: Introduction...

More information

INTRODUCTION TO MACHINE LEARNING. Decision tree learning

INTRODUCTION TO MACHINE LEARNING. Decision tree learning INTRODUCTION TO MACHINE LEARNING Decision tree learning Task of classification Automatically assign class to observations with features Observation: vector of features, with a class Automatically assign

More information

Biology 2E- Zimmer Protein structure- amino acid kit

Biology 2E- Zimmer Protein structure- amino acid kit Biology 2E- Zimmer Protein structure- amino acid kit Name: This activity will use a physical model to investigate protein shape and develop key concepts that govern how proteins fold into their final three-dimensional

More information

Supplementary Material

Supplementary Material Supplementary Material Materials and methods Enzyme assay The enzymatic activity of -glucosidase toward salicin was measured with the Miller method (Miller, 1959) using glucose as the standard. A total

More information

Supplementary Materials for

Supplementary Materials for www.sciencemag.org/cgi/content/full/science.aal4326/dc1 Supplementary Materials for Structure of a eukaryotic voltage-gated sodium channel at near-atomic resolution Huaizong Shen, Qiang Zhou, Xiaojing

More information

Prediction of heart disease using k-nearest neighbor and particle swarm optimization.

Prediction of heart disease using k-nearest neighbor and particle swarm optimization. Biomedical Research 2017; 28 (9): 4154-4158 ISSN 0970-938X www.biomedres.info Prediction of heart disease using k-nearest neighbor and particle swarm optimization. Jabbar MA * Vardhaman College of Engineering,

More information

Supplementary Information. Conformational states of Lck regulate clustering in early T cell signaling

Supplementary Information. Conformational states of Lck regulate clustering in early T cell signaling Supplementary Information Conformational states of Lck regulate clustering in early T cell signaling Jérémie Rossy, Dylan M. Owen, David J. Williamson, Zhengmin Yang and Katharina Gaus Centre for Vascular

More information

Gene Selection for Tumor Classification Using Microarray Gene Expression Data

Gene Selection for Tumor Classification Using Microarray Gene Expression Data Gene Selection for Tumor Classification Using Microarray Gene Expression Data K. Yendrapalli, R. Basnet, S. Mukkamala, A. H. Sung Department of Computer Science New Mexico Institute of Mining and Technology

More information

ATP-independent reversal of a membrane protein aggregate by a chloroplast SRP

ATP-independent reversal of a membrane protein aggregate by a chloroplast SRP ATP-independent reversal of a membrane protein aggregate by a chloroplast SRP Peera Jaru-Ampornpan 1, Kuang Shen 1,3, Vinh Q. Lam 1,3, Mona Ali 2, Sebastian Doniach 2, Tony Z. Jia 1, Shu-ou Shan 1 Supplementary

More information

Supplementary Materials Extracting a Cellular Hierarchy from High-dimensional Cytometry Data with SPADE

Supplementary Materials Extracting a Cellular Hierarchy from High-dimensional Cytometry Data with SPADE Supplementary Materials Extracting a Cellular Hierarchy from High-dimensional Cytometry Data with SPADE Peng Qiu1,4, Erin F. Simonds2, Sean C. Bendall2, Kenneth D. Gibbs Jr.2, Robert V. Bruggner2, Michael

More information

Probability-Based Protein Identification for Post-Translational Modifications and Amino Acid Variants Using Peptide Mass Fingerprint Data

Probability-Based Protein Identification for Post-Translational Modifications and Amino Acid Variants Using Peptide Mass Fingerprint Data Probability-Based Protein Identification for Post-Translational Modifications and Amino Acid Variants Using Peptide Mass Fingerprint Data Tong WW, McComb ME, Perlman DH, Huang H, O Connor PB, Costello

More information

Identification of Tissue Independent Cancer Driver Genes

Identification of Tissue Independent Cancer Driver Genes Identification of Tissue Independent Cancer Driver Genes Alexandros Manolakos, Idoia Ochoa, Kartik Venkat Supervisor: Olivier Gevaert Abstract Identification of genomic patterns in tumors is an important

More information

Hemoglobin & Sickle Cell Anemia Exercise

Hemoglobin & Sickle Cell Anemia Exercise Name StarBiochem Hemoglobin & Sickle Cell Anemia Exercise Background Hemoglobin is the protein in red blood cells responsible for carrying oxygen from the lungs to the rest of the body and for returning

More information

Practice Problems 3. a. What is the name of the bond formed between two amino acids? Are these bonds free to rotate?

Practice Problems 3. a. What is the name of the bond formed between two amino acids? Are these bonds free to rotate? Life Sciences 1a Practice Problems 3 1. Draw the oligopeptide for Ala-Phe-Gly-Thr-Asp. You do not need to indicate the stereochemistry of the sidechains. Denote with arrows the bonds formed between the

More information

Supplementary Figure-1. SDS PAGE analysis of purified designed carbonic anhydrase enzymes. M1-M4 shown in lanes 1-4, respectively, with molecular

Supplementary Figure-1. SDS PAGE analysis of purified designed carbonic anhydrase enzymes. M1-M4 shown in lanes 1-4, respectively, with molecular Supplementary Figure-1. SDS PAGE analysis of purified designed carbonic anhydrase enzymes. M1-M4 shown in lanes 1-4, respectively, with molecular weight markers (M). Supplementary Figure-2. Overlay of

More information

Understandable Statistics

Understandable Statistics Understandable Statistics correlated to the Advanced Placement Program Course Description for Statistics Prepared for Alabama CC2 6/2003 2003 Understandable Statistics 2003 correlated to the Advanced Placement

More information

Applications. DSC 410/510 Multivariate Statistical Methods. Discriminating Two Groups. What is Discriminant Analysis

Applications. DSC 410/510 Multivariate Statistical Methods. Discriminating Two Groups. What is Discriminant Analysis DSC 4/5 Multivariate Statistical Methods Applications DSC 4/5 Multivariate Statistical Methods Discriminant Analysis Identify the group to which an object or case (e.g. person, firm, product) belongs:

More information

Supplementary Table 1. Data collection and refinement statistics (molecular replacement).

Supplementary Table 1. Data collection and refinement statistics (molecular replacement). Supplementary Table 1. Data collection and refinement statistics (molecular replacement). Data set statistics HLA A*0201- ALWGPDPAAA PPI TCR PPI TCR/A2- ALWGPDPAAA PPI TCR/A2- ALWGPDPAAA Space Group P2

More information

Extraction and Identification of Tumor Regions from MRI using Zernike Moments and SVM

Extraction and Identification of Tumor Regions from MRI using Zernike Moments and SVM I J C T A, 8(5), 2015, pp. 2327-2334 International Science Press Extraction and Identification of Tumor Regions from MRI using Zernike Moments and SVM Sreeja Mole S.S.*, Sree sankar J.** and Ashwin V.H.***

More information

Research Methods in Forest Sciences: Learning Diary. Yoko Lu December Research process

Research Methods in Forest Sciences: Learning Diary. Yoko Lu December Research process Research Methods in Forest Sciences: Learning Diary Yoko Lu 285122 9 December 2016 1. Research process It is important to pursue and apply knowledge and understand the world under both natural and social

More information

Lecture 10 More about proteins

Lecture 10 More about proteins Lecture 10 More about proteins Today we're going to extend our discussion of protein structure. This may seem far-removed from gene cloning, but it is the path to understanding the genes that we are cloning.

More information

STATISTICAL METHODS FOR DIAGNOSTIC TESTING: AN ILLUSTRATION USING A NEW METHOD FOR CANCER DETECTION XIN SUN. PhD, Kansas State University, 2012

STATISTICAL METHODS FOR DIAGNOSTIC TESTING: AN ILLUSTRATION USING A NEW METHOD FOR CANCER DETECTION XIN SUN. PhD, Kansas State University, 2012 STATISTICAL METHODS FOR DIAGNOSTIC TESTING: AN ILLUSTRATION USING A NEW METHOD FOR CANCER DETECTION by XIN SUN PhD, Kansas State University, 2012 A THESIS Submitted in partial fulfillment of the requirements

More information

Efficacy of the Extended Principal Orthogonal Decomposition Method on DNA Microarray Data in Cancer Detection

Efficacy of the Extended Principal Orthogonal Decomposition Method on DNA Microarray Data in Cancer Detection 202 4th International onference on Bioinformatics and Biomedical Technology IPBEE vol.29 (202) (202) IASIT Press, Singapore Efficacy of the Extended Principal Orthogonal Decomposition on DA Microarray

More information

Supplementary Figure 1. Recording sites.

Supplementary Figure 1. Recording sites. Supplementary Figure 1 Recording sites. (a, b) Schematic of recording locations for mice used in the variable-reward task (a, n = 5) and the variable-expectation task (b, n = 5). RN, red nucleus. SNc,

More information

Nature Neuroscience: doi: /nn Supplementary Figure 1. Behavioral training.

Nature Neuroscience: doi: /nn Supplementary Figure 1. Behavioral training. Supplementary Figure 1 Behavioral training. a, Mazes used for behavioral training. Asterisks indicate reward location. Only some example mazes are shown (for example, right choice and not left choice maze

More information

Project Manual Bio3055. Cholesterol Homeostasis: HMG-CoA Reductase

Project Manual Bio3055. Cholesterol Homeostasis: HMG-CoA Reductase Project Manual Bio3055 Cholesterol Homeostasis: HMG-CoA Reductase Bednarski 2003 Funded by HHMI Cholesterol Homeostasis: HMG-CoA Reductase Introduction: HMG-CoA Reductase is an enzyme in the cholesterol

More information

Coarse grained simulations of Lipid Bilayer Membranes

Coarse grained simulations of Lipid Bilayer Membranes Coarse grained simulations of Lipid Bilayer Membranes P. B. Sunil Kumar Department of Physics IIT Madras, Chennai 600036 sunil@iitm.ac.in Atomistic MD: time scales ~ 10 ns length scales ~100 nm 2 To study

More information

(B D) Three views of the final refined 2Fo-Fc electron density map of the Vpr (red)-ung2 (green) interacting region, contoured at 1.4σ.

(B D) Three views of the final refined 2Fo-Fc electron density map of the Vpr (red)-ung2 (green) interacting region, contoured at 1.4σ. Supplementary Figure 1 Overall structure of the DDB1 DCAF1 Vpr UNG2 complex. (A) The final refined 2Fo-Fc electron density map, contoured at 1.4σ of Vpr, illustrating well-defined side chains. (B D) Three

More information

Brain Tumour Detection of MR Image Using Naïve Beyer classifier and Support Vector Machine

Brain Tumour Detection of MR Image Using Naïve Beyer classifier and Support Vector Machine International Journal of Scientific Research in Computer Science, Engineering and Information Technology 2018 IJSRCSEIT Volume 3 Issue 3 ISSN : 2456-3307 Brain Tumour Detection of MR Image Using Naïve

More information

Sum of Neurally Distinct Stimulus- and Task-Related Components.

Sum of Neurally Distinct Stimulus- and Task-Related Components. SUPPLEMENTARY MATERIAL for Cardoso et al. 22 The Neuroimaging Signal is a Linear Sum of Neurally Distinct Stimulus- and Task-Related Components. : Appendix: Homogeneous Linear ( Null ) and Modified Linear

More information

A: All atom molecular simulation systems

A: All atom molecular simulation systems Cholesterol level affects surface charge of lipid membranes in physiological environment Aniket Magarkar a, Vivek Dhawan b, Paraskevi Kallinteri a, Tapani Viitala c, Mohammed Elmowafy c, Tomasz Róg d,

More information

Supplementary Figure 1 Preparation, crystallization and structure determination of EpEX. (a), Purified EpEX and EpEX analyzed on homogenous 12.

Supplementary Figure 1 Preparation, crystallization and structure determination of EpEX. (a), Purified EpEX and EpEX analyzed on homogenous 12. Supplementary Figure 1 Preparation, crystallization and structure determination of EpEX. (a), Purified EpEX and EpEX analyzed on homogenous 12.5 % SDS-PAGE gel under reducing and non-reducing conditions.

More information

THE FUTURE OF OR. Dimitris Bertsimas MIT

THE FUTURE OF OR. Dimitris Bertsimas MIT THE FUTURE OF OR Dimitris Bertsimas MIT The future of OR Optimization Power Improvements in Software and hardware have accelerated Mixed Integer Optimization 2.2 Trillion times! This forces to rethink

More information

Bilayer Deformation, Pores & Micellation Induced by Oxidized Lipids

Bilayer Deformation, Pores & Micellation Induced by Oxidized Lipids Supporting Information Bilayer Deformation, Pores & Micellation Induced by Oxidized Lipids Phansiri Boonnoy 1, Viwan Jarerattanachat 1,2, Mikko Karttunen 3*, and Jirasak Wongekkabut 1* 1 Department of

More information

Interaction Between Amyloid-b (1 42) Peptide and Phospholipid Bilayers: A Molecular Dynamics Study

Interaction Between Amyloid-b (1 42) Peptide and Phospholipid Bilayers: A Molecular Dynamics Study Biophysical Journal Volume 96 February 2009 785 797 785 Interaction Between Amyloid-b (1 42) Peptide and Phospholipid Bilayers: A Molecular Dynamics Study Charles H. Davis and Max L. Berkowitz * Department

More information

Supplementary Material

Supplementary Material 10.1071/CH15728_AC The Authors 2016 Australian Journal of Chemistry 2016, 69(8), 846-855 Supplementary Material Structural Diversity and Properties of Six Zn II /Cd II Coordination Polymers Based on a

More information

Lane: 1. Spectra BR protein ladder 2. PFD 3. TERM 4. 3-way connector 5. 2-way connector

Lane: 1. Spectra BR protein ladder 2. PFD 3. TERM 4. 3-way connector 5. 2-way connector kda 1 2 3 4 5 26 14 1 7 Lane: 1. Spectra BR protein ladder 2. PFD 3. TERM 4. 3-way connector 5. 2-way connector 5 4 35 25 15 1 Supplementary Figure 1. SDS-PAGE of acterially expressed and purified proteins.

More information

Predicting Sleep Using Consumer Wearable Sensing Devices

Predicting Sleep Using Consumer Wearable Sensing Devices Predicting Sleep Using Consumer Wearable Sensing Devices Miguel A. Garcia Department of Computer Science Stanford University Palo Alto, California miguel16@stanford.edu 1 Introduction In contrast to the

More information

Structural analysis of fungus-derived FAD glucose dehydrogenase

Structural analysis of fungus-derived FAD glucose dehydrogenase Structural analysis of fungus-derived FAD glucose dehydrogenase Hiromi Yoshida 1, Genki Sakai 2, Kazushige Mori 3, Katsuhiro Kojima 3, Shigehiro Kamitori 1, and Koji Sode 2,3,* 1 Life Science Research

More information

Mammogram Analysis: Tumor Classification

Mammogram Analysis: Tumor Classification Mammogram Analysis: Tumor Classification Term Project Report Geethapriya Raghavan geeragh@mail.utexas.edu EE 381K - Multidimensional Digital Signal Processing Spring 2005 Abstract Breast cancer is the

More information

Hemoglobin & Sickle Cell Anemia Exercise

Hemoglobin & Sickle Cell Anemia Exercise Name StarBiochem Hemoglobin & Sickle Cell Anemia Exercise Learning Objectives In this exercise, you will use StarBiochem, a protein 3D viewer, to explore the structure of the normal hemoglobin protein

More information

Engineering of Ministry of Education, Institute of Molecular Science, Shanxi University, Taiyuan , China.

Engineering of Ministry of Education, Institute of Molecular Science, Shanxi University, Taiyuan , China. Indicator approach to develop a chemosensor for the colorimetric sensing of thiol-containing in water and its application for the thiol detection in plasma Fang-Jun Huo, a Yu-Tao Yang, b Jing Su, b Yuan-Qiang

More information

Introduction to Protein Structure Collection

Introduction to Protein Structure Collection Introduction to Protein Structure Collection Teaching Points This collection is designed to introduce students to the concepts of protein structure and biochemistry. Different activities guide students

More information

Guided Inquiry Skills Lab. Additional Lab 1 Making Models of Macromolecules. Problem. Introduction. Skills Focus. Materials.

Guided Inquiry Skills Lab. Additional Lab 1 Making Models of Macromolecules. Problem. Introduction. Skills Focus. Materials. Additional Lab 1 Making Models of Macromolecules Guided Inquiry Skills Lab Problem How do monomers join together to form polymers? Introduction A small number of elements make up most of the mass of your

More information

Evaluating Classifiers for Disease Gene Discovery

Evaluating Classifiers for Disease Gene Discovery Evaluating Classifiers for Disease Gene Discovery Kino Coursey Lon Turnbull khc0021@unt.edu lt0013@unt.edu Abstract Identification of genes involved in human hereditary disease is an important bioinfomatics

More information

The MOLECULES of LIFE

The MOLECULES of LIFE The MOLECULES of LIFE Physical and Chemical Principles Solutions Manual Prepared by James Fraser and Samuel Leachman Chapter 16 Principles of Enzyme Catalysis Problems True/False and Multiple Choice 1.

More information

Simulation of Self-Assembly of Ampiphiles Using Molecular Dynamics

Simulation of Self-Assembly of Ampiphiles Using Molecular Dynamics Simulation of Self-Assembly of Ampiphiles Using Molecular Dynamics Haneesh Kesari, Reza Banki, and Misty Davies Project Final Paper ME346 Stanford University December 15, 2005 1 Introduction Understanding

More information

Detection of Glaucoma and Diabetic Retinopathy from Fundus Images by Bloodvessel Segmentation

Detection of Glaucoma and Diabetic Retinopathy from Fundus Images by Bloodvessel Segmentation International Journal of Engineering and Advanced Technology (IJEAT) ISSN: 2249 8958, Volume-5, Issue-5, June 2016 Detection of Glaucoma and Diabetic Retinopathy from Fundus Images by Bloodvessel Segmentation

More information

File name: Supplementary Information Description: Supplementary Figures, Supplementary Table and Supplementary References

File name: Supplementary Information Description: Supplementary Figures, Supplementary Table and Supplementary References File name: Supplementary Information Description: Supplementary Figures, Supplementary Table and Supplementary References File name: Supplementary Data 1 Description: Summary datasheets showing the spatial

More information

1 Pattern Recognition 2 1

1 Pattern Recognition 2 1 1 Pattern Recognition 2 1 3 Perceptrons by M.L. Minsky and S.A. Papert (1969) Books: 4 Pattern Recognition, fourth Edition (Hardcover) by Sergios Theodoridis, Konstantinos Koutroumbas Publisher: Academic

More information

Protein Investigator. Protein Investigator - 3

Protein Investigator. Protein Investigator - 3 Protein Investigator Objectives To learn more about the interactions that govern protein structure. To test hypotheses regarding protein structure and function. To design proteins with specific shapes.

More information

The role of Ca² + ions in the complex assembling of protein Z and Z-dependent protease inhibitor: A structure and dynamics investigation

The role of Ca² + ions in the complex assembling of protein Z and Z-dependent protease inhibitor: A structure and dynamics investigation www.bioinformation.net Hypothesis Volume 8(9) The role of Ca² + ions in the complex assembling of protein Z and Z-dependent protease inhibitor: A structure and dynamics investigation Zahra Karimi 1 *,

More information

Gold nanocrystals at DPPC bilayer. Bo Song, Huajun Yuan, Cynthia J. Jameson, Sohail Murad

Gold nanocrystals at DPPC bilayer. Bo Song, Huajun Yuan, Cynthia J. Jameson, Sohail Murad Gold nanocrystals at DPPC bilayer Bo Song, Huajun Yuan, Cynthia J. Jameson, Sohail Murad Coarse-grained mapping strategy of a DPPC lipid molecule. The structure of gold nanocrystals (bare gold nanoparticles)

More information

Trilateral Project WM4

Trilateral Project WM4 ANNEX 2: Comments of the JPO Trilateral Project WM4 Comparative studies in new technologies Theme: Comparative study on protein 3-dimensional (3-D) structure related claims 1. Introduction As more 3-D

More information

Predicting Breast Cancer Survival Using Treatment and Patient Factors

Predicting Breast Cancer Survival Using Treatment and Patient Factors Predicting Breast Cancer Survival Using Treatment and Patient Factors William Chen wchen808@stanford.edu Henry Wang hwang9@stanford.edu 1. Introduction Breast cancer is the leading type of cancer in women

More information

Chemical Nature of the Amino Acids. Table of a-amino Acids Found in Proteins

Chemical Nature of the Amino Acids. Table of a-amino Acids Found in Proteins Chemical Nature of the Amino Acids All peptides and polypeptides are polymers of alpha-amino acids. There are 20 a- amino acids that are relevant to the make-up of mammalian proteins (see below). Several

More information

Classification of ECG Data for Predictive Analysis to Assist in Medical Decisions.

Classification of ECG Data for Predictive Analysis to Assist in Medical Decisions. 48 IJCSNS International Journal of Computer Science and Network Security, VOL.15 No.10, October 2015 Classification of ECG Data for Predictive Analysis to Assist in Medical Decisions. A. R. Chitupe S.

More information

Feature selection methods for early predictive biomarker discovery using untargeted metabolomic data

Feature selection methods for early predictive biomarker discovery using untargeted metabolomic data Feature selection methods for early predictive biomarker discovery using untargeted metabolomic data Dhouha Grissa, Mélanie Pétéra, Marion Brandolini, Amedeo Napoli, Blandine Comte and Estelle Pujos-Guillot

More information

Nature Structural & Molecular Biology: doi: /nsmb Supplementary Figure 1

Nature Structural & Molecular Biology: doi: /nsmb Supplementary Figure 1 Supplementary Figure 1 Design of isolated protein and RNC constructs, and homogeneity of purified RNCs. (a) Schematic depicting the design and nomenclature used for all the isolated proteins and RNCs used

More information

EECS 433 Statistical Pattern Recognition

EECS 433 Statistical Pattern Recognition EECS 433 Statistical Pattern Recognition Ying Wu Electrical Engineering and Computer Science Northwestern University Evanston, IL 60208 http://www.eecs.northwestern.edu/~yingwu 1 / 19 Outline What is Pattern

More information

Improved Intelligent Classification Technique Based On Support Vector Machines

Improved Intelligent Classification Technique Based On Support Vector Machines Improved Intelligent Classification Technique Based On Support Vector Machines V.Vani Asst.Professor,Department of Computer Science,JJ College of Arts and Science,Pudukkottai. Abstract:An abnormal growth

More information

Unsupervised MRI Brain Tumor Detection Techniques with Morphological Operations

Unsupervised MRI Brain Tumor Detection Techniques with Morphological Operations Unsupervised MRI Brain Tumor Detection Techniques with Morphological Operations Ritu Verma, Sujeet Tiwari, Naazish Rahim Abstract Tumor is a deformity in human body cells which, if not detected and treated,

More information

Discovering Meaningful Cut-points to Predict High HbA1c Variation

Discovering Meaningful Cut-points to Predict High HbA1c Variation Proceedings of the 7th INFORMS Workshop on Data Mining and Health Informatics (DM-HI 202) H. Yang, D. Zeng, O. E. Kundakcioglu, eds. Discovering Meaningful Cut-points to Predict High HbAc Variation Si-Chi

More information

Supplementary Information: A Critical. Comparison of Biomembrane Force Fields: Structure and Dynamics of Model DMPC, POPC, and POPE Bilayers

Supplementary Information: A Critical. Comparison of Biomembrane Force Fields: Structure and Dynamics of Model DMPC, POPC, and POPE Bilayers Supplementary Information: A Critical Comparison of Biomembrane Force Fields: Structure and Dynamics of Model DMPC, POPC, and POPE Bilayers Kristyna Pluhackova,, Sonja A. Kirsch, Jing Han, Liping Sun,

More information

Nature Structural & Molecular Biology: doi: /nsmb.2419

Nature Structural & Molecular Biology: doi: /nsmb.2419 Supplementary Figure 1 Mapped sequence reads and nucleosome occupancies. (a) Distribution of sequencing reads on the mouse reference genome for chromosome 14 as an example. The number of reads in a 1 Mb

More information

Amino acids. Dr. Mamoun Ahram Summer semester,

Amino acids. Dr. Mamoun Ahram Summer semester, Amino acids Dr. Mamoun Ahram Summer semester, 2017-2018 Resources This lecture Campbell and Farrell s Biochemistry, Chapters 3 (pp.66-76) General structure (Chiral carbon) The amino acids that occur in

More information

Augmented Medical Decisions

Augmented Medical Decisions Machine Learning Applied to Biomedical Challenges 2016 Rulex, Inc. Intelligible Rules for Reliable Diagnostics Rulex is a predictive analytics platform able to manage and to analyze big amounts of heterogeneous

More information

Analysis of Diabetic Dataset and Developing Prediction Model by using Hive and R

Analysis of Diabetic Dataset and Developing Prediction Model by using Hive and R Indian Journal of Science and Technology, Vol 9(47), DOI: 10.17485/ijst/2016/v9i47/106496, December 2016 ISSN (Print) : 0974-6846 ISSN (Online) : 0974-5645 Analysis of Diabetic Dataset and Developing Prediction

More information

Classification of EEG signals in an Object Recognition task

Classification of EEG signals in an Object Recognition task Classification of EEG signals in an Object Recognition task Iacob D. Rus, Paul Marc, Mihaela Dinsoreanu, Rodica Potolea Technical University of Cluj-Napoca Cluj-Napoca, Romania 1 rus_iacob23@yahoo.com,

More information

CS612 - Algorithms in Bioinformatics

CS612 - Algorithms in Bioinformatics Spring 2016 Protein Structure February 7, 2016 Introduction to Protein Structure A protein is a linear chain of organic molecular building blocks called amino acids. Introduction to Protein Structure Amine

More information

Automated Medical Diagnosis using K-Nearest Neighbor Classification

Automated Medical Diagnosis using K-Nearest Neighbor Classification (IMPACT FACTOR 5.96) Automated Medical Diagnosis using K-Nearest Neighbor Classification Zaheerabbas Punjani 1, B.E Student, TCET Mumbai, Maharashtra, India Ankush Deora 2, B.E Student, TCET Mumbai, Maharashtra,

More information

7.1 Grading Diabetic Retinopathy

7.1 Grading Diabetic Retinopathy Chapter 7 DIABETIC RETINOPATHYGRADING -------------------------------------------------------------------------------------------------------------------------------------- A consistent approach to the

More information

Supporting Information. Electrophoretic Deformation of Individual Transfer. RNA Molecules Reveals Their Identity

Supporting Information. Electrophoretic Deformation of Individual Transfer. RNA Molecules Reveals Their Identity Supporting Information Electrophoretic Deformation of Individual Transfer RNA Molecules Reveals Their Identity Robert Y. Henley a, Brian Alan Ashcroft f, Ian Farrell c, Barry S. Cooperman d, Stuart M.

More information

Model-free machine learning methods for personalized breast cancer risk prediction -SWISS PROMPT

Model-free machine learning methods for personalized breast cancer risk prediction -SWISS PROMPT Model-free machine learning methods for personalized breast cancer risk prediction -SWISS PROMPT Chang Ming, 22.11.2017 University of Basel Swiss Public Health Conference 2017 Breast Cancer & personalized

More information

FIRST MIDTERM EXAMINATION

FIRST MIDTERM EXAMINATION FIRST MIDTERM EXAMINATION 1. True or false: because enzymes are produced by living organisms and because they allow chemical reactions to occur that would not otherwise occur, enzymes represent an exception

More information

Nature Immunology: doi: /ni Supplementary Figure 1

Nature Immunology: doi: /ni Supplementary Figure 1 Supplementary Figure 1 A β-strand positions consistently places the residues at CDR3β P6 and P7 within human and mouse TCR-peptide-MHC interfaces. (a) E8 TCR containing V β 13*06 carrying with an 11mer

More information

Measuring Focused Attention Using Fixation Inner-Density

Measuring Focused Attention Using Fixation Inner-Density Measuring Focused Attention Using Fixation Inner-Density Wen Liu, Mina Shojaeizadeh, Soussan Djamasbi, Andrew C. Trapp User Experience & Decision Making Research Laboratory, Worcester Polytechnic Institute

More information

Fitting discrete-data regression models in social science

Fitting discrete-data regression models in social science Fitting discrete-data regression models in social science Andrew Gelman Dept. of Statistics and Dept. of Political Science Columbia University For Greg Wawro sclass, 7 Oct 2010 Today s class Example: wells

More information

Mutations and Disease Mutations in the Myosin Gene

Mutations and Disease Mutations in the Myosin Gene Biological Sciences Initiative HHMI Mutations and Disease Mutations in the Myosin Gene Goals Explore how mutations can lead to disease using the myosin gene as a model system. Explore how changes in the

More information