Package PELVIS. R topics documented: December 18, Type Package

Similar documents
Package memapp. R topics documented: June 2, Version 2.10 Date

Package appnn. R topics documented: July 12, 2015

Package clust.bin.pair

Package inctools. January 3, 2018

Package nopaco. R topics documented: April 13, Type Package Title Non-Parametric Concordance Coefficient Version 1.0.

Package citccmst. February 19, 2015

Package MethPed. September 1, 2018

Package CompareTests

Package speff2trial. February 20, 2015

Package AbsFilterGSEA

Package pollen. October 7, 2018

Package GANPAdata. February 19, 2015

Package ega. March 21, 2017

Package ordibreadth. R topics documented: December 4, Type Package Title Ordinated Diet Breadth Version 1.0 Date Author James Fordyce

Package HAP.ROR. R topics documented: February 19, 2015

Package ORIClust. February 19, 2015

User Guide. Association analysis. Input

ivlewbel: Uses heteroscedasticity to estimate mismeasured and endogenous regressor models

Logistic Regression and Bayesian Approaches in Modeling Acceptance of Male Circumcision in Pune, India

Package p3state.msm. R topics documented: February 20, 2015

Package ncar. July 20, 2018

Package StepReg. November 3, 2017

Package wally. May 25, Type Package

Package hei. August 17, 2017

Package inote. June 8, 2017

Package propoverlap. R topics documented: February 20, Type Package

Intro to R. Professor Clayton Nall h/t Thomas Leeper, Ph.D. (University of Aarhus) and Teppei Yamamoto, Ph.D. (MIT) June 26, 2014

Package BCRA. April 25, 2018

Package xseq. R topics documented: September 11, 2015

Package rvtdt. February 20, rvtdt... 1 rvtdt.example... 3 rvtdts... 3 TrendWeights Index 6. population control weighted rare-varaints TDT

Package frailtyhl. February 19, 2015

Package sbpiper. April 20, 2018

Package longroc. November 20, 2017

Module 3: Pathway and Drug Development

CNV PCA Search Tutorial

Package ICCbin. November 14, 2017

Instructions for the ECN201 Project on Least-Cost Nutritionally-Adequate Diets

WFS. User Guide. thinkwhere Glendevon House Castle Business Park Stirling FK9 4TZ Tel +44 (0) Fax +44 (0)

Package preference. May 8, 2018

Package MSstatsTMT. February 26, Title Protein Significance Analysis in shotgun mass spectrometry-based

ECDC HIV Modelling Tool User Manual

Package bionetdata. R topics documented: February 19, Type Package Title Biological and chemical data networks Version 1.0.

Package QualInt. February 19, 2015

Data mining with Ensembl Biomart. Stéphanie Le Gras

Lesson 3 Profex Graphical User Interface for BGMN and Fullprof

Ward Headstrom Institutional Research Humboldt State University CAIR

Package prognosticroc

Package singlecase. April 19, 2009

Package tumgr. February 4, 2016

Diabetes Management Software V1.3 USER S MANUAL

Chapter 1: Managing workbooks

Bangor University Laboratory Exercise 1, June 2008

Pyxis MedStation System. Guide for Managing Patient-Specific Medication

EHS QUICKSTART GUIDE RTLAB / CPU SECTION EFPGASIM TOOLBOX.

CSDplotter user guide Klas H. Pettersen

Part 8 Logistic Regression

OneTouch Reveal Web Application. User Manual for Healthcare Professionals Instructions for Use

38 Int'l Conf. Bioinformatics and Computational Biology BIOCOMP'16

Clay Tablet Connector for hybris. User Guide. Version 1.5.0

Package ClusteredMutations

A SAS Format Catalog for ICD-9/ICD-10 Diagnoses and Procedures: Data Research Example and Custom Reporting Example

Package medicalrisk. August 29, 2016

4Stat Wk 10: Regression

The FunCluster Package

TMWSuite. DAT Interactive interface

IBRIDGE 1.0 USER MANUAL

SPADE: Statistical Program to Assess habitual Dietary Exposure User's manual

Lesson: A Ten Minute Course in Epidemiology

Package Actigraphy. R topics documented: January 15, Type Package Title Actigraphy Data Analysis Version 1.3.

Anticoagulation Manager - Getting Started

EBCC Data Analysis Tool (EBCC DAT) Introduction

A Quick-Start Guide for rseqdiff

Design and Study of Online Fuzzy Risk Score Analyzer for Diabetes Mellitus

Package cancer. July 10, 2018

Package seqdesign. April 27, Version 1.1 Date

BORN Ontario: NSO DERF Training Guide FEBRUARY 2012

OpenCount 100 Call Data Recording in the OpenCom 100 Communications System

Appendix B. Nodulus Observer XT Instructional Guide. 1. Setting up your project p. 2. a. Observation p. 2. b. Subjects, behaviors and coding p.

Today: Binomial response variable with an explanatory variable on an ordinal (rank) scale.

Agile Product Lifecycle Management for Process

Web Feature Services Tutorial

Package TEQR. R topics documented: February 5, Version Date Title Target Equivalence Range Design Author M.

Iso-inertial dynamometer

Photic Output control using BrainMaster Atlantis

Hand Gesture Recognition and Speech Conversion for Deaf and Dumb using Feature Extraction

DPV. Ramona Ranz, Andreas Hungele, Prof. Reinhard Holl

Package RNAsmc. December 11, 2018

Unit 1: Introduction to the Operating System, Computer Systems, and Networks 1.1 Define terminology Prepare a list of terms with definitions

THE AFIX PRODUCT TRAINING MANUAL

Migraine Dataset. Exercise 1

Package AIMS. June 29, 2018

Package labstats. December 5, 2016

BMMC (UG SDE) IV SEMESTER

Package mem. R topics documented: October 23, Type Package Title The Moving Epidemic Method Version 2.11 Date Depends R (>= 3.4.

GridMAT-MD: A Grid-based Membrane Analysis Tool for use with Molecular Dynamics

Exercises: Differential Methylation

Package ExactCIdiff. February 15, 2013

University of California- Los Angeles. Affiliations: University of California- Los Angeles

NYSIIS. Immunization Evaluator and Manage Schedule Manual. October 16, Release 1.0

Transcription:

Type Package Package PELVIS December 18, 2018 Title Probabilistic Sex Estimate using Logistic Regression, Based on VISual Traits of the Human Os Coxae Version 1.0.5 Date 2018-12-18 Depends R (>= 3.4.0), shiny Imports MASS, DT An R-Shiny application implementing two methods for sexing the human os coxae based on eleven trichotomic traits, following Bruzek (2002) <doi:10.1002/ajpa.10012>. License GPL (>= 2) Encoding UTF-8 NeedsCompilation no Author Frédéric Santos [aut, cre] (<https://orcid.org/0000-0003-1445-3871>) Maintainer Frédéric Santos <frederic.santos@u-bordeaux.fr> Repository CRAN Date/Publication 2018-12-18 10:10:03 UTC R topics documented: PELVIS-package...................................... 2 addmetavars......................................... 2 bruzek02.......................................... 3 indivsexing......................................... 4 metavar........................................... 6 refdata........................................... 7 sexingfromfile....................................... 8 StartPELVIS......................................... 9 tenfoldcv.glm....................................... 10 Index 12 1

2 addmetavars PELVIS-package Probabilistic Sex Estimate using Logistic Regression, Based on VISual Traits of the Human Os Coxae Details An R-Shiny application implementing two methods for sexing the human os coxae, based on eleven trichotomic traits. Package: PELVIS Type: Package Version: 1.0.5 Date: 2018-18-12 License: GPL >=2 Bruzek, J. (2002) A method for visual determination of sex, using the human hip bone. American Journal of Physical Anthropology 117, 157 168. doi: 10.1002/ajpa.10012 Santos, F., Guyomarc h, P., Rmoutilova, R. and Bruzek, J. (Submitted to American Journal of Physical Anthropology) Probabilistic approach for sex classification using visual traits of the human os coxae following Bruzek (2002) Examples if(interactive()){ StartPELVIS() } addmetavars Internal function From a given dataset including the 11 visual traits exposed by Bruzek (2002), this function adds three corresponding main characters (PrSu, GrSN and InfP) based on the majority rule exposed in the original article.

bruzek02 3 addmetavars(dat) Arguments dat A dataframe including the 11 visual traits described by Bruzek. A dataframe including also the main characters derived from those visual traits. This is an internal function for the R-Shiny application implemented in PELVIS, documented here for testing purposes only. Bruzek, J. (2002) A method for visual determination of sex, using the human hip bone. American Journal of Physical Anthropology 117, 157 168. doi: 10.1002/ajpa.10012 bruzek02 Internal function for sexing the human os coxae using Bruzek s method (2002) Produces a single (and non-probabilistic) sex estimate from five characters observed on the human os coxae, following Bruzek (2002) bruzek02(x) Arguments x A character vector of length 5, having three possible values: F, 0 or M. One unique character value, F, I or M, according to the majority rule exposed by Bruzek.

4 indivsexing This is an internal function for the R-Shiny application implemented in PELVIS, documented here for testing purposes only. Bruzek, J. (2002) A method for visual determination of sex, using the human hip bone. American Journal of Physical Anthropology 117, 157 168. doi: 10.1002/ajpa.10012 Examples # Here we create manually an individual: individual <- c(prsu="m", GrSN="F", CArc="F", InfP="0", IsPu="F") individual # Determination produced by Bruzek (2002): female individual. bruzek02(individual) indivsexing Internal function for sexing one single human os coxae using revised Bruzek s method (2018) Produces a statistical sex estimate from eleven characters observed on the human os coxae, following Bruzek (2018), and using logistic regression models. indivsexing(ref, newind) Arguments ref newind A learning dataset for logistic regression models, basically the dataset refdata included in PELVIS (or any other dataset with the same variables). A new os coxae to be determined, with eleven observed traits (possibly with missing values).

indivsexing 5 A list with the following components: PredictedSex PostProb BestModel VariablesUsed cvrate cvindet One unique character value, F, I or M : final sex estimate for the studied os coxae. Posterior probability for the individual to be a male. Best logistic regression model for the studied os coxae according to the BIC criterion. Names of the variables (including part or all of the nonmissing traits for the studied os coaxe) used in this best model. Success rate in cross-validation. Cf. Santos et al. (2018) for more details about cross-validation here. Rate of individuals remaining indeterminate using the best logistic regression model. This is an internal function for the R-Shiny application implemented in PELVIS, documented here for testing purposes only. Santos, F., Guyomarc h, P., Rmoutilova, R. and Bruzek, J. (Submitted to American Journal of Physical Anthropology) Probabilistic approach for sex classification using visual traits of the human os coxae following Bruzek (2002) Examples data(refdata) # Pick the first individual of the reference dataset with its 11 traits, as an example: individual <- refdata[1, -c(1:6)] individual # Produce a sex estimate for this individual: indivsexing(ref=refdata, newind=individual)

6 metavar metavar Internal function used to get the five main characters described by Bruzek (2002) from the eleven visual traits observed on the human os coaxe. Converts a set of three visual traits to its correspoding main character (e.g. PrSu1, PrSu2 and PrSu3 are converted to PrSu). metavar(x, seuil=2) Arguments x seuil A vector of characters of length 3, having three possible values: f, i or m. For internal purposes only. A unique character value among three possible values: F, 0 or M. This is an internal function for the R-Shiny application implemented in PELVIS, documented here for testing purposes only. Bruzek, J. (2002) A method for visual determination of sex, using the human hip bone. American Journal of Physical Anthropology 117, 157 168. doi: 10.1002/ajpa.10012 Examples # Here we create a set of three values for the traits observed on the preauricular surface: prsu <- c(prsu1="f", PrSu2="f", PrSu3="m") prsu # Final determination for the main character PrSu: metavar(prsu)

refdata 7 refdata Learning dataset for logistic regression models Format This dataset includes 1110 ossa coxa from five population samples. The eleven trichotomic traits are given for each os coaxe (possibly with missing values for incomplete bones), along with the geographical origin and known sex of the individual. When possible, the age and stature of the individual are also given. data(refdata) A data frame with 1110 observations on the following 17 variables: Id a factor with 592 levels (unique ID of the individual to whom the bone belongs) Orig a factor with 5 levels (geographical origin) Sex a factor with levels F, M (known sex) Age a numeric vector (age of the associated individual in years) Side a factor with levels L, R (left or right side) Stature a numeric vector (in cm) PrSu1 a factor with levels f, i, m PrSu2 a factor with levels f, i, m PrSu3 a factor with levels f, i, m GrSN1 a factor with levels f, i, m GrSN2 a factor with levels f, i, m GrSN3 a factor with levels f, i, m CArc a factor with levels F, 0, M IsPu a factor with levels F, 0, M InfP1 a factor with levels f, i, m InfP2 a factor with levels f, i, m InfP3 a factor with levels f, i, m Santos, F., Guyomarc h, P., Rmoutilova, R. and Bruzek, J. (Submitted to American Journal of Physical Anthropology) A probabilistic approach for sex classification using visual traits of the human os coxae following Bruzek (2002)

8 sexingfromfile sexingfromfile Internal function for sexing several single human ossa coxarum using both original and revised Bruzek s methods (2002, 2018) Produces sex estimates from each of the ossa coxarum submitted by the user through the graphical user interface of the R-Shiny application. sexingfromfile(dat, ref, updateprogressbar=null) Arguments dat ref A test dataset submitted by the user throught the graphical user interface. The predictive factors (i.e. the eleven trichotomic traits) should have the same headers and levels as in the reference dataset refdata included in PELVIS. An example of valid data file can be downloaded here: http://www.pacea.u-bordeaux.fr/img/csv/data_test_pelv (its field separator is the semicolon ";"). A learning dataset for logistic regression models, basically the dataset refdata included in PELVIS (or any other dataset with the same variables). updateprogressbar Internal option for the R-Shiny application. A complete dataframe of results displayed through the R-Shiny application. This is an internal function for the R-Shiny application implemented in PELVIS. Santos, F., Guyomarc h, P., Rmoutilova, R. and Bruzek, J. (Submitted to American Journal of Physical Anthropology) Probabilistic approach for sex classification using visual traits of the human os coxae following Bruzek (2002)

StartPELVIS 9 StartPELVIS An R-Shiny application for the sex estimation of the human os coxae Launches a graphical user interface (GUI) allowing to use Bruzek s methods (2002, 2018) for sexing the human os coxae, based on eleven visual traits. StartPELVIS() Details The R-Shiny application proposes two tabs: Data input: manual editing can be used for both data entry and sex classification. The eleven trichotomic traits are manually edited for each os coxae through the GUI, and the corresponding sex estimates are then produced. Data input: from text file is the classical way to get the sex estimates for a whole sample of ossa coxa correctly described in a file. PELVIS accepts.csv or.txt data files, but does not support.ods or.xls(x) files. The predictive factors (i.e. the eleven trichotomic traits) should have the same headers and levels as in the reference dataset refdata included in PELVIS. An example of valid data file can be downloaded here: http://www.pacea.ubordeaux.fr/img/csv/data_test_pelvis.csv (its field separator is the semicolon ";"). In both tabs, two sex estimates are given: the visual sex estimate from Bruzek (2002), and the probabilistic sex estimate from Santos, Guyomarc h, Rmoutilova and Bruzek (2018, submitted). Depending on the traits possibly missing on the ossa coxa submitted to the program, the logistic regression models can use various subsets of best predictors (selected by BIC). The final subset of predictors used for each os coxae is given in the table of results. The function returns no value by itself, but the results can be downloaded through the graphical interface. The table of results includes the following columns: Sex estimate (Bruzek 2002) : the visual sex estimate based on Bruzek s method (2002). Statistical sex estimate (2018) : a sex estimation based on a logistic regression model, following the method described in Santos, Guyomarc h, Rmoutilova and Bruzek (2018, submitted). Prob(M) is the probability (obtained with the logistic regression model) that the individual is a man. According to tradition in biological anthropology, we have the following decsion rule: if Prob(M)>0.95 then the sex estimate is M ; if Prob(M)<0.05 then the sex estimate is F ; else the individual remains indeterminate ( I ). Prob(F), defined as 1-Prob(M), is the probability that the individual is a woman.

10 tenfoldcv.glm Selected predictors in LR model : for a given individual, the sex estimation proceeds as follows. First, a complete model is built using all available (i.e., nonmissing) traits for this individual. Then, a classical stepwise model selection by BIC is performed, and the subset of the most useful traits is used to produce the final sex estimate. This column gives the traits used for each individual. 10-fold CV accuracy (%) : the rate of correct classification for the corresponding logistic regression model is estimated using a ten-fold cross-validation on the learning sample. Indet. rate in CV (%) : the rate of individuals remaining indeterminate in cross-validation for the corresponding logistic regression model. The R console is not available when the GUI is active. To exit the GUI, type Echap (on MS Windows systems) or Ctrl+C (on Linux systems) in the R console. Regardless of the size and resolution of your screen, for convenience, it is advisable to decrease the zoom level of your web browser and/or to turn on fullscreen mode. Bruzek, J. (2002) A method for visual determination of sex, using the human hip bone. American Journal of Physical Anthropology 117, 157 168. doi: 10.1002/ajpa.10012 Santos, F., Guyomarc h, P., Rmoutilova, R. and Bruzek, J. (Submitted to American Journal of Physical Anthropology) A probabilistic approach for sex classification using visual traits of the human os coxae following Bruzek (2002) Examples if(interactive()){startpelvis()} tenfoldcv.glm Internal function for evaluating the performance of logistic regression models used in Bruzek s revised method (2018) Error rate, and rate of individuals remaining indeterminate, using 10-fold cross validation for a given logistic regression model in the context of Bruzek s revised method. tenfoldcv.glm(dat.glm, mod, seuil=0.95)

tenfoldcv.glm 11 Arguments dat.glm mod seuil A data frame to evaluate the GLM. GLM model to be evaluated. Threshold for sex determination, classically 0.95 following Bruzek s recommendations. A complete dataframe of results displayed through the R-Shiny application. This is an internal function for the R-Shiny application implemented in PELVIS. Please do not use this function for classical validation of GLMs outside the context of this package. Santos, F., Guyomarc h, P., Rmoutilova, R. and Bruzek, J. (Submitted to American Journal of Physical Anthropology) Probabilistic approach for sex classification using visual traits of the human os coxae following Bruzek (2002)

Index Topic bruzek, hip bone, logistic regression, morphoscopy, os coxae, sex estimate, sex estimation PELVIS-package, 2 Topic hip bone, os coxae, ossa coxa, sex estimation refdata, 7 addmetavars, 2 bruzek02, 3 indivsexing, 4 metavar, 6 PELVIS (PELVIS-package), 2 PELVIS-package, 2 refdata, 7 sexingfromfile, 8 StartPELVIS, 9 tenfoldcv.glm, 10 12