Statistics
601. Due on December 14, 2019 at 5 pm central time.
Do two of the following three problems. Turn in one pdf and RMD file for each problem
clearly labeling which problems you have chosen to do.
1: The data shown in schizophrenia.csv were collected in a follow-up study
of women patients with schizophrenia and summarized in Davis (2002), Statistical Methods
for the Analysis of Repeated Measurements, Springer, New York. The binary
response recorded at 0, 2, 6, 8 and 10 months after hospitalization was thought
disorder (absent or present). The single covariate is the factor indicating
whether a patient had suffered early or late onset of her condition (age of
onset less than 20 years or age of onset 20 years or above). The
question of interest is whether the course of the illness differs between
patients with early and late onset? Investigate the question of
interest.
i)
Provide
a two to three-page write-up (including graphs) explaining your analysis of the
experiment and the conclusions you can draw from it.
ii)
As
a secondary component provide annotated code that replicates your analysis.
Make sure to
discuss any concerns about the modeling assumptions used in your analysis.
The .csv file
has the following variables.
subject
o the patient ID, a
factor with levels 1 to 44.
onset
o the time of onset
of the disease, a factor with levels < 20 yrs and > 20 yrs.
disorder
o whether thought
disorder was absent or present, the response variable.
month
o
month after hospitalization.
Please
note that you may have
already explored this dataset in the class. Even so, please do a complete and
extended analysis answering the questions, with the focus of writing and
explaining the what you have found in your analysis.
2: (Vole Data)- Consider the “microtus" dataset in the “Flury" library in R.
Background from Airoldi_Flury_Salvioni_JTheorBiol_1995:
Discrimination Between Two Species of Microtus using both Classified and
Unclassified Observations.
“1. Introduction
Microtus subterraneus and M. multiplex are now considered to be two
distinct species (Niethammer, 1982; Krapp, 1982), contrary to the older view of
Ellerman & Morrison-Scott (1951). The two species differ in the number of
chromosomes: 2n=52 or 54 for M. subterraneus, and 2n=46 or 48 for M. multiplex.
Hybrids from the laboratory have reduced fertility (Meylan, 1972), and hybrids
from the field, whose karyotypes would be clearly recognizable, have never been
found (Krapp, 1982).
The geographic ranges of distribution of M. subterraneus and M. multiplex
overlap to some extent in the Alps of southern Switzerland and northern Italy
(Niethammer, 1982; Krapp, 1982). M. subterraneus is smaller than M. multiplex
in most measurements, and occurs at elevations from 1000 m to over 2000 m,
except in the western part of its range (for example, Belgium and Brittany),
where it is found in lower elevations. M. multiplex is found at similar
elevations, but also at altitudes from 200–300 m south of the Alps (Ticino, Toscana).
The two chromosomal types of M. subterraneus can be crossed in the
laboratory (Meylan, 1970, 1972), but no hybrids have so far been found in the
field. In M. multiplex, the two chromosomal types show a distinct distribution
range, but they are morphologically indistinguishable, and a hybrid has been
found in the field (Storch & Winking, 1977).
No reliable criteria based on cranial morphology have been found to
distinguish the two species. Saint Girons (1971) pointed out a difference in
the sutures of the posterior parts of the premaxillary and nasal bones compared
to the frontal one, but this criterion does not work well in many cases. For
both paleontological and biogeographical research it would be useful to have a
good rule for discriminating between the two species, because much of the data
available are in form of skull remains, either fossilized or from owl pellets.
The present study was initiated by a data collection consisting of
eight morphometric variables measured by one of the authors (Salvioni) using a
Nikon measure-scope (accuracy 1/1000 mm) and dial calipers (accuracy 1/100 mm).
The sample consists of 288 specimens collected mostly in Central Europe (Alps
and Jura mountains) and in Toscana. One peculiar aspect of this data set is
that the chromosomes of 89 specimens were analyzed to identify the species.
Only the morphometric characteristics are available for the remaining 199
specimens…”
Get Free Quote!
417 Experts Online